Raven

raveninrhythm

AI & ML interests

None yet

Recent Activity

Organizations

None yet

raveninrhythm's activity

Reacted to fdaudens's post with šŸ¤šŸ”„ 4 months ago
view post
Post
3288
Small models, BIG impact: SmolLM is here! šŸš€šŸ”¬

We're launching a series of small but mighty language models:
šŸŽļø Super fast - runs on laptops, phones, you name it!
šŸ“ 3 sizes: 130M, 350M, and 1.5B parameters
šŸ„‡ Outperforms same size models from Meta, Microsoft, and Qwen
šŸ”“ Fully open-source: datasets, training code, models

šŠšžš² šŸšžššš­š®š«šžš¬
- Trained on FineWeb-Edu and Cosmopedia v2 (largest synthetic pre-training dataset)
- No cloud needed - run locally for privacy and energy efficiency
- Everything is public, from data curation to training steps

ššØš­šžš§š­š¢ššš„ š®š¬šž šœššš¬šžš¬
- On-device autocomplete
- Local request parsing
- Custom fine-tuning for specific needs without the need for expensive GPUs

š†šØ ššžšžš©šžš«
šŸ‘‰ Check it out: https://huggingface.co/collections/HuggingFaceTB/smollm-models-6695016cad7167254ce15966
šŸ‘‰ Run the 360M model in your browser, 100 % private: HuggingFaceTB/SmolLM-360M-Instruct-WebGPU
šŸ‘‰ Read the blog explaining everything in detail: huggingface.co/blog/smollm

Kudos to the stellar team who worked on this project: @loubnabnl @anton-l @eliebak @lvwerra