Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 67
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 28
Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub Aug 2, 2023 • 1
On Limitations of LLM as Annotator for Low Resource Languages Paper • 2411.17637 • Published 4 days ago • 2
Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML • Oct 2 • 34
Article Let’s make a generation of amazing image generation models By burtenshaw • 4 days ago • 32
Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14 • 56
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages Paper • 2411.14343 • Published 9 days ago • 7
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 9 days ago • 38
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 3 days ago • 48
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 3 days ago • 24
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Paper • 2411.12814 • Published 11 days ago • 20
Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • 9 days ago • 32
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 9 days ago • 26
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 11 days ago • 47
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 15 days ago • 106
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language Paper • 2410.23956 • Published about 1 month ago • 1
AstroMLab 3: Achieving GPT-4o Level Performance in Astronomy with a Specialized 8B-Parameter Large Language Model Paper • 2411.09012 • Published 17 days ago • 1
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? Paper • 2309.07462 • Published Sep 14, 2023 • 4