-
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 84 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published • 17 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published • 24 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published • 26
Collections
Discover the best community collections!
Collections including paper arxiv:2404.05829
-
SambaLingo: Teaching Large Language Models New Languages
Paper • 2404.05829 • Published • 12 -
sambanovasystems/SambaLingo-Arabic-Chat
Text Generation • Updated • 3k • 60 -
sambanovasystems/SambaLingo-Arabic-Base
Text Generation • Updated • 2.95k • 37 -
sambanovasystems/SambaLingo-Arabic-Base-70B
Text Generation • Updated • 2.85k • 1
-
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
Paper • 2410.07563 • Published • 2 -
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Paper • 2407.03963 • Published • 15 -
Tagengo: A Multilingual Chat Dataset
Paper • 2405.12612 • Published • 3 -
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities
Paper • 2404.17790 • Published • 5