Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper ā¢ 2408.15518 ā¢ Published Aug 28 ā¢ 42
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper ā¢ 2408.15237 ā¢ Published Aug 27 ā¢ 37 ā¢ 4