Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
20
36
43
Elie Bakouch
eliebak
Follow
ntnq's profile picture
clementdesroches's profile picture
clem's profile picture
58 followers
·
60 following
eliebakouch
eliebak
eliebak
eliebak.bsky.social
AI & ML interests
Training LLM's @ 🤗
Recent Activity
liked
a Space
1 day ago
HuggingFaceFW/discussion
liked
a dataset
5 days ago
HuggingFaceTB/smoltalk
Reacted to
cfahlgren1
's
post
with ❤️
7 days ago
You can clean and format datasets entirely in the browser with a few lines of SQL. In this post, I replicate the process @mlabonne used to clean the new https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1 dataset. The cleaning process consists of: - Joining the separate splits together / add split column - Converting string messages into list of structs - Removing empty system prompts https://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset Here's his new cleaned dataset: https://huggingface.co/datasets/mlabonne/orca-agentinstruct-1M-v1-cleaned
View all activity
Articles
SmolVLM - small yet mighty Vision Language Model
2 days ago
•
68
SmolLM - blazingly fast and remarkably powerful
Jul 16
•
272
Organizations
eliebak
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
1 day ago
Running
5
💬
Discussion Forum
liked
a dataset
5 days ago
HuggingFaceTB/smoltalk
Viewer
•
Updated
1 day ago
•
2.2M
•
1.36k
•
156
liked
a model
27 days ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
1 day ago
•
82.3k
•
•
381
liked
a Space
28 days ago
Running
218
😻
Repo duplicator
liked
a Space
about 2 months ago
Running
89
📖
TxT360: Trillion Extracted Text
liked
a model
about 2 months ago
nvidia/Mistral-NeMo-Minitron-8B-Instruct
Text Generation
•
Updated
Oct 9
•
3.5k
•
64
liked
a model
2 months ago
amd/AMD-Llama-135m
Text Generation
•
Updated
Oct 9
•
17.1k
•
109
liked
2 models
3 months ago
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
Oct 1
•
383
G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b
Text Generation
•
Updated
Sep 13
•
406
•
64
liked
4 datasets
3 months ago
bigcode/the-stack-smol
Viewer
•
Updated
May 2, 2023
•
300k
•
291
•
43
togethercomputer/Long-Data-Collections
Viewer
•
Updated
Jul 26, 2023
•
42.2k
•
439
•
130
emozilla/dolma-v1_7-books
Viewer
•
Updated
May 29
•
56k
•
17
•
1
cerebras/SlimPajama-627B
Preview
•
Updated
Jul 7, 2023
•
29.7k
•
429
liked
a model
3 months ago
Aleph-Alpha/Pharia-1-LLM-7B-control
Text Generation
•
Updated
Aug 30
•
61
liked
a dataset
3 months ago
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
Sep 6
•
237M
•
24.7k
•
248
liked
a model
3 months ago
gordicaleksa/SlovenianGPT
Updated
Aug 19
•
121
•
8
liked
a Space
3 months ago
Running
29
🦙
Wllama
Run GGUF directly on your browser!
liked
a model
3 months ago
HuggingFaceTB/SmolLM-360M-Instruct
Text Generation
•
Updated
Aug 18
•
14.9k
•
76
liked
a Space
3 months ago
Running
48
🤏
Instant SmolLM
Run SmolLM-360M-Instruct in realtime with MLC WebLLM
liked
a model
4 months ago
PleIAs/OCRonos-Vintage
Text Generation
•
Updated
Aug 8
•
1.26k
•
74
Load more