Haihao Shen's picture

Haihao Shen

Haihao

·

https://github.com/intel/auto-round

AI & ML interests

LLM quantization, sparsity, and acceleration

Recent Activity

New activity 13 days ago

Intel/neural-chat-7b-v3:Adding `safetensors` variant of this model

New activity 17 days ago

Intel/neural-chat-7b-v3-3:Adding `safetensors` variant of this model

authored a paper about 2 months ago

Efficient LLM Inference on CPUs

View all activity

Articles

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Organizations

Papers 9

arxiv:2311.16133

arxiv:2311.00502

arxiv:2310.10944

arxiv:2309.14592

models

None public yet

datasets

None public yet