view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • Apr 24 • 58
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 5 days ago • 159
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 3 days ago • 88
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published 19 days ago • 42
Recent highlights Collection Some recent models worth checking out • 18 items • Updated 8 days ago • 38
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… 19 days ago • 57
view article Article Advanced Flux Dreambooth LoRA Training with 🧨 diffusers By linoyts • 19 days ago • 27
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 5 days ago • 86
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities Paper • 2410.11190 • Published 26 days ago • 20
Quantization Spaces on the Hub ⚡ Collection A collection of spaces that allow you to quantize on the Hub • 4 items • Updated 5 days ago • 4