zsw1129's picture

9 1

zsw1129

zsw1129

·

AI & ML interests

None yet

Organizations

None yet

zsw1129's activity

upvoted a collection 4 days ago

Llama 3.2 All Versions

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 20 items • Updated 4 days ago • 29

upvoted a paper 6 days ago

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling

Paper • 2409.16160 • Published 7 days ago • 28

upvoted an article 6 days ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

13 days ago

• 135

upvoted 5 papers 2 months ago

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published Jul 10 • 8

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published Jul 17 • 75

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 35

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22 • 38

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23 • 20

upvoted a paper 4 months ago

LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

Paper • 2405.18377 • Published May 28 • 18