Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Internal Testing Organization's profile picture HuggingFaceM4's profile picture Hugging Face OSS Metrics's profile picture accelerate's profile picture Hugging Face TB Research's profile picture Quanto library's profile picture LocalLLaMA's profile picture MLX Community's profile picture Hugging Face 1Bit LLMs's profile picture Paris AI Running Club's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture Hugging Face Party @ PyTorch Conference's profile picture qrias's profile picture DDUF's profile picture

marcsun13's activity

upvoted an article about 2 months ago
view article
Article

Fixing Gradient Accumulation

• 43
upvoted 2 articles 2 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

• 204
upvoted an article 3 months ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

• 278
upvoted an article 4 months ago
view article
Article

XetHub is joining Hugging Face!

• 80
upvoted an article 6 months ago
view article
Article

Benchmarking Text Generation Inference

• 27
upvoted an article 6 months ago
view article
Article

License to Call: Introducing Transformers Agents 2.0

• 118
upvoted an article 8 months ago
view article
Article

Welcome Llama 3 - Meta's new open LLM

• 279
upvoted 9 articles 8 months ago
view article
Article

Vision Language Models Explained

• 217
view article
Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

• 168
view article
Article

Overview of natively supported quantization schemes in 🤗 Transformers

• 11
view article
Article

Making LLMs lighter with AutoGPTQ and transformers

• 34
view article
Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

• 63
view article
Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

• 96
view article
Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

• 66
view article
Article

quanto: a pytorch quantization toolkit

• 31
view article
Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

• 25