724 58 257

Younes Belkada

ybelkada

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Recent Activity

New activity 7 days ago

ybelkada/t5-11b-sharded:Adding `safetensors` variant of this model

New activity 11 days ago

ybelkada/mpt-7b-bf16-sharded:Adding `safetensors` variant of this model

New activity 13 days ago

mlx-community/falcon-mamba-7b-bf16:Upload folder using huggingface_hub

View all activity

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 94

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 20

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 63

Organizations

Posts 4

Post

2614

Falcon Mamba now available now in llama.cpp !
Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a

Post

3430

FalconMamba 7B - a new model from TII (Technology Innovation Institute) is out !

- Blogpost: https://huggingface.co/blog/falconmamba
- Link to collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Link to playground: tiiuae/falcon-mamba-playground

View all posts

Collections 1

Papers 8

spaces 25

Running

🦙

GGUF My Repo

No application file

👀

Test Zero

Sleeping

🐠

Dlai Test 2

No application file

🚀

Blip Imagecaptioning Dlai

Sleeping

⚡

Open Source List Models

Running on Zero

🌖

Llava 1.5 Dlai

models 143

ybelkada/t5-11b-sharded

Translation • Updated 7 days ago • 49 • 1

ybelkada/mpt-7b-bf16-sharded

Text Generation • Updated 11 days ago • 56

ybelkada/gpt-j-6b-sharded-bf16

Text Generation • Updated 18 days ago • 3.53k • 2

ybelkada/t5-3b-sharded

Text2Text Generation • Updated Oct 26 • 48 • 1

ybelkada/test-gguf-trainer-Q8_0-GGUF

Updated May 28 • 5

ybelkada/test-gguf-trainer

Text Generation • Updated May 28 • 10 • 1

ybelkada/tiny-random-llama-Q6_K-GGUF

Updated May 28 • 6

ybelkada/test-gguf-trainer-Q4_K_M-GGUF

Updated May 27 • 8

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22 • 3

ybelkada/tiny-random-llama

Text Generation • Updated May 22 • 15

datasets 12

ybelkada/model_cards_correct_tag

Viewer • Updated Mar 19 • 54 • 40

ybelkada/model-info-library-name

Updated Jan 23 • 3

ybelkada/test-model-info-library-name

Viewer • Updated Jan 23 • 1 • 47

ybelkada/documentation-images

Viewer • Updated Jan 19 • 2 • 40.5k

ybelkada/oasst1-tiny-subset

Viewer • Updated May 11, 2023 • 44.1k • 44 • 2

ybelkada/oasst1

Viewer • Updated May 11, 2023 • 44.1k • 47 • 1

ybelkada/food101-tiny

Viewer • Updated May 5, 2023 • 100 • 39

ybelkada/test-onepiece-dataset

Viewer • Updated May 5, 2023 • 10 • 47

ybelkada/common_voice_mr_11_0_copy

Viewer • Updated Apr 4, 2023 • 10.8k • 237

ybelkada/english_quotes_copy

Viewer • Updated Apr 4, 2023 • 2.51k • 4.52k

Younes Belkada

AI & ML interests

Recent Activity

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem