Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
9
35
Haihao Shen
Haihao
Follow
Nocode3000's profile picture
Agnuxo's profile picture
Pent's profile picture
9 followers
·
1 following
https://github.com/intel/auto-round
HaihaoShen
hshen14
AI & ML interests
LLM quantization, sparsity, and acceleration
Recent Activity
New activity
13 days ago
Intel/neural-chat-7b-v3:
Adding `safetensors` variant of this model
New activity
17 days ago
Intel/neural-chat-7b-v3-3:
Adding `safetensors` variant of this model
authored
a paper
about 2 months ago
Efficient LLM Inference on CPUs
View all activity
Articles
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon
May 9
•
11
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding
Jan 30
•
4
Organizations
Papers
9
arxiv:
2311.16133
arxiv:
2311.00502
arxiv:
2310.10944
arxiv:
2309.14592
Expand 9 papers
models
None public yet
datasets
None public yet