Thomas Wolf PRO

thomwolf

AI & ML interests

NLP and open-source :-)

thomwolf's activity

Reacted to davanstrien's post with ❤️ 1 day ago
First dataset for the new Hugging Face Bluesky community organisation: bluesky-community/one-million-bluesky-posts 🦋

📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗

Excited to see people build more open tools for a more open social media platform!
Reacted to ZennyKenny's post with 👍 3 days ago
I've joined the Bluesky community. Interested to see what decentralized social media looks like in action: https://bsky.app/profile/kghamilton.bsky.social

Looking forward to following other AI builders, tech enthusiasts, goth doomscrollers, and ironic meme creators.
Reacted to as-cle-bert's post with 🔥 3 days ago
Hi HuggingFacers! 🤗
I'm thrilled to introduce my latest project: 𝗦𝗲𝗻𝗧𝗿𝗘𝘃 (𝗦𝗲𝗻tence 𝗧𝗿ansformers 𝗘𝘃aluator), a Python package that offers simple, customizable evaluation of text retrieval accuracy and time performance for Sentence Transformers-compatible text embedders on PDF data! 📊

Learn more in my LinkedIn post: https://www.linkedin.com/posts/astra-clelia-bertelli-583904297_python-embedders-semanticsearch-activity-7266754133557190656-j1e3

And on the GitHub repo: https://github.com/AstraBert/SenTrEv

Have fun! 🍕
posted an update 3 days ago
liked a Space 6 days ago
replied to nyuuzyou's post 9 days ago
Reacted to nyuuzyou's post with 🔥 9 days ago
🖼️ Introducing Public Domain Pictures Dataset - nyuuzyou/publicdomainpictures

Dataset highlights:
- 644,412 public domain images with comprehensive metadata from publicdomainpictures.net
- English language metadata including titles, descriptions, and keywords
- Each entry contains rich metadata including:
  - Unique image ID and full-size image URLs
  - Detailed titles and descriptions
  - Keyword/tag collections
  - Creator attribution
- Released to the public domain under Creative Commons Zero (CC0) license