43 40 67

Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

New activity 2 days ago

DDUF/stable-diffusion-3-medium-diffusers-DDUF:Upload folder using huggingface_hub

updated a model 2 days ago

DDUF/stable-diffusion-3-medium-diffusers-DDUF

New activity 2 days ago

DDUF/stable-diffusion-3-medium-diffusers-DDUF:Upload folder using huggingface_hub

View all activity

Articles

Organizations

marcsun13's activity

upvoted an article about 2 months ago

Article

Fixing Gradient Accumulation

Oct 16

• 43

upvoted 2 articles 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 204

Article

Accelerate 1.0.0

Sep 13

• 50

upvoted an article 3 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 278

upvoted an article 4 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 80

upvoted an article 6 months ago

Article

Benchmarking Text Generation Inference

May 29

• 27

upvoted a paper 6 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28 • 12

upvoted an article 6 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 118

upvoted a paper 7 months ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96

upvoted an article 8 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 279

upvoted a collection 8 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

upvoted 9 articles 8 months ago

Article

Vision Language Models Explained

Apr 11

• 217

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 168

Article

Overview of natively supported quantization schemes in 🤗 Transformers

Sep 12, 2023

• 11

Article

Making LLMs lighter with AutoGPTQ and transformers

Aug 23, 2023

• 34

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 63

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 96

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22

• 66

Article

quanto: a pytorch quantization toolkit

Mar 18

• 31

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20

• 25

Marc Sun

AI & ML interests

Recent Activity

Articles

Introducing SynthID Text

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Accelerate 1.0.0

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

quanto: a pytorch quantization toolkit

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Organizations

marcsun13's activity

Fixing Gradient Accumulation

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Accelerate 1.0.0

SmolLM - blazingly fast and remarkably powerful

XetHub is joining Hugging Face!

Benchmarking Text Generation Inference

License to Call: Introducing Transformers Agents 2.0

Welcome Llama 3 - Meta's new open LLM

Vision Language Models Explained

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

quanto: a pytorch quantization toolkit

GaLore: Advancing Large Model Training on Consumer-grade Hardware