view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 17 days ago • 96
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published Oct 28 • 75
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 1 day ago • 183
Pyramidal Flow Matching for Efficient Video Generative Modeling Paper • 2410.05954 • Published Oct 8 • 38
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 121
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 59
view article Article Orchestration of Experts: The First-Principle Multi-Model System By alirezamsh • May 30 • 15
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware Paper • 2304.13705 • Published Apr 23, 2023 • 3