Shangzhi Zhang

Snorlax
ยท

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago
Snorlax/ppo-Pyramids
updated a model 1 day ago
Snorlax/ppo-SnowballTarget
updated a model 1 day ago
Snorlax/Reinforce-PixelCopter
View all activity

Organizations

Diffusers Pipelines Library for Stable Diffusion's profile picture

Snorlax's activity

liked a Space 7 days ago
liked a Space 4 months ago
upvoted an article 5 months ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

โ€ข 279
reacted to georgewritescode's post with ๐Ÿ‘ 7 months ago
view post
Post
1006
Visualization of GPT-4o breaking away from the quality & speed trade-off curve the LLMs have followed thus far โœ‚๏ธ

Key GPT-4o takeaways
โ€ฃ GPT-4o not only offers the highest quality, it also sits amongst the fastest LLMs
โ€ฃ For those with speed/latency-sensitive use cases, where previously Claude 3 Haiku or Mixtral 8x7b were leaders, GPT-4o is now a compelling option (though significantly more expensive)
โ€ฃ Previously Groq was the only provider to break from the curve using its own LPU chips. OpenAI has done it on Nvidia hardware (one can imagine the potential for GPT-4o on Groq)

๐Ÿ‘‰ How did they do it? Will follow up with more analysis on this but potential approaches include a very large but sparse MoE model (similar to Snowflake's Arctic) and improvements in data quality (likely to have driven much of Llama 3's impressive quality relative to parameter count)

Notes: Throughput represents the median across providers over the last 14 days of measurements (8x per day)

Data is present on our HF leaderboard: ArtificialAnalysis/LLM-Performance-Leaderboard and graphs present on our website
  • 1 reply
ยท
updated a collection 9 months ago
updated a collection 10 months ago