MoH: Multi-Head Attention as Mixture-of-Head Attention • Paper • 2410.11842 • Published 23 days ago • 20
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models • Paper • 2410.13085 • Published 22 days ago • 20
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • Paper • 2402.17764 • Published Feb 27 • 602