Bertrand Chevrier

kramp

AI & ML interests

text 2 speech, ai for music writting

Recent Activity

Articles

Organizations

kramp's activity

Reacted to cfahlgren1's post with ❤️ 7 days ago
view post
Post
2900
You can clean and format datasets entirely in the browser with a few lines of SQL.

In this post, I replicate the process @mlabonne used to clean the new microsoft/orca-agentinstruct-1M-v1 dataset.

The cleaning process consists of:
- Joining the separate splits together / add split column
- Converting string messages into list of structs
- Removing empty system prompts

https://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset

Here's his new cleaned dataset: mlabonne/orca-agentinstruct-1M-v1-cleaned
  • 1 reply
·
New activity in huggingface/HuggingDiscussions about 1 month ago

[FEEDBACK] Notifications

137
#6 opened over 2 years ago by victor
upvoted an article about 1 month ago
view article
Article

Hugging Face welcomes the Aya Expanse family of multilingual models

By ariG23498
10
Reacted to clem's post with 🔥 about 1 month ago
view post
Post
4405
This is no Woodstock AI but will be fun nonetheless haha. I’ll be hosting a live workshop with team members next week about the Enterprise Hugging Face hub.

1,000 spots available first-come first serve with some surprises during the stream!

You can register and add to your calendar here: https://streamyard.com/watch/JS2jHsUP3NDM
·
liked a Space about 1 month ago
updated a Space about 2 months ago
liked a Space about 2 months ago