Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: By Omartificial-Intelligence-Space • 2 days ago • 5
Article To what extent are we responsible for our content and how to create safer Spaces? By davidberenstein1957 • Aug 30 • 3
Article Let’s make a generation of amazing image generation models By burtenshaw • 6 days ago • 33
Article Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models By mikelabs • 11 days ago • 2
Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • 12 days ago • 34
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 19 days ago • 97
Marqo-Ecommerce-Embeddings Collection State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated 18 days ago • 17
Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face By not-lain • 21 days ago • 12
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published 25 days ago • 49
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5 • 169
Article Recipe: Preparing Multilingual Speech Datasets for TTS Training By PHBJT • 28 days ago • 14
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31 • 17
VidToMe: Video Token Merging for Zero-Shot Video Editing Paper • 2312.10656 • Published Dec 17, 2023 • 10
Article Hugging Face welcomes the Aya Expanse family of multilingual models By ariG23498 • Oct 24 • 10