ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 9 days ago • 98
Qwen2.5 Collection Qwen2.5 language models, both pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated about 1 month ago • 445
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated about 1 month ago • 289
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published 17 days ago • 41
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21 • 58