YangWang92's picture

YangWang92

yangwang92

·

AI & ML interests

None yet

Organizations

yangwang92's activity

New activity in nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 22 days ago

Will quantised version be available?

#9 opened 26 days ago by

How to inference it on a 40 GB A100 and 80 GB Ram of Colab PRO?

#17 opened 23 days ago by

commented 2 papers about 1 month ago

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25 • 27 •

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25 • 27 •

New activity in huggingface/HuggingDiscussions about 2 months ago

[FEEDBACK] Daily Papers

#32 opened 5 months ago by