vansin

vansin

AI & ML interests

None yet

Recent Activity

Organizations

OpenMMLab's profile picture InternLM's profile picture Blog-explorers's profile picture OpenCompass's profile picture SmartFlowAI's profile picture

vansin's activity

Reacted to loubnabnl's post with 🔥 4 days ago
view post
Post
1288
Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit 🛠️ https://github.com/huggingface/smollm/

- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents

Apache 2.0 licensed. V2 pre-training data mix coming soon!

Which other tools should we add next?
posted an update 4 days ago
view post
Post
1176
Amazing !!!! test Post
New activity in meta-llama/Meta-Llama-3-8B 19 days ago

llama3 answer repeat to many times

1
#220 opened 2 months ago by hktk07