Collections including paper arxiv:2404.05829

Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published Apr 11 • 84

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published Apr 16 • 17

Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published Feb 20 • 24

DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published Feb 14 • 26

multi-lingual llms

SambaLingo: Teaching Large Language Models New Languages
Paper • 2404.05829 • Published Apr 8 • 12

Expert models that adapt Llama2 to a diverse set of languages from around the world.

SambaLingo: Teaching Large Language Models New Languages
Paper • 2404.05829 • Published Apr 8 • 12

sambanovasystems/SambaLingo-Arabic-Chat
Text Generation • Updated Apr 16 • 3k • 60

sambanovasystems/SambaLingo-Arabic-Base
Text Generation • Updated May 14 • 2.95k • 37

sambanovasystems/SambaLingo-Arabic-Base-70B
Text Generation • Updated May 14 • 2.85k • 1

Japanese LLMs (papers)

PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
Paper • 2410.07563 • Published Oct 10 • 2

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Paper • 2407.03963 • Published Jul 4 • 15

Tagengo: A Multilingual Chat Dataset
Paper • 2405.12612 • Published May 21 • 3

Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities
Paper • 2404.17790 • Published Apr 27 • 5
