ShowUI: One Vision-Language-Action Model for GUI Visual Agent • Paper • 2411.17465 • Published 1 day ago • 49
Star Attention: Efficient LLM Inference over Long Sequences • Paper • 2411.17116 • Published 2 days ago • 32
SmolVLM • Collection • State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 1 day ago • 18
Hymba: A Hybrid-head Architecture for Small Language Models • Paper • 2411.13676 • Published 7 days ago • 37
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization • Paper • 2411.10442 • Published 12 days ago • 60
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions • Paper • 2411.14405 • Published 6 days ago • 51
RedPajama: an Open Dataset for Training Large Language Models • Paper • 2411.12372 • Published 9 days ago • 47
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration • Paper • 2411.10958 • Published 11 days ago • 47
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices • Paper • 2411.10640 • Published 12 days ago • 41
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI • Paper • 2411.04872 • Published 20 days ago • 4
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems • Paper • 2411.02959 • Published 23 days ago • 64
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent • Paper • 2411.02265 • Published 23 days ago • 24
MobileLLM • Collection • Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated about 23 hours ago • 97