Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput.
- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a MacBook!
- SmolVLM can be fine-tuned on a Google Colab, or process millions of documents with a consumer GPU (see the inference sketch below)!
- SmolVLM even outperforms larger models on video benchmarks, despite not even being trained on videos!
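If you want to kick the tires locally, here is a minimal inference sketch using transformers. It assumes the checkpoint name `HuggingFaceTB/SmolVLM-Instruct` and a CUDA GPU with bfloat16 support; adapt the model id and device to your setup.

```python
# Minimal sketch: run SmolVLM locally with transformers.
# Assumptions: checkpoint id "HuggingFaceTB/SmolVLM-Instruct", CUDA GPU, bfloat16.
import torch
from transformers import AutoProcessor, AutoModelForVision2Seq
from transformers.image_utils import load_image

model_id = "HuggingFaceTB/SmolVLM-Instruct"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
).to("cuda")

# Any image works; this URL is just an illustrative placeholder.
image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg")

# Build a chat-style prompt with one image and one text turn.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "Describe this image."},
        ],
    }
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(model.device)

# Generate and decode the answer.
generated_ids = model.generate(**inputs, max_new_tokens=256)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```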
This is no Woodstock AI but will be fun nonetheless haha. I'll be hosting a live workshop with team members next week about the Enterprise Hugging Face hub.
1,000 spots available, first come, first served, with some surprises during the stream!
Maybe, like me, you've always wanted a super easy way to compare llama3.2-1B vs. llama3.2-3B, or the same model at different temperatures?
Trying and comparing warm Inference API models has never been easier! Just go to https://hf.co/playground, set your token, and you're ready to go. We'll keep improving it, feedback welcome!
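If you'd rather script the same comparison, here is a rough sketch using `InferenceClient` from huggingface_hub to query warm Inference API models at different temperatures. The model ids and prompt are illustrative assumptions, not part of the Playground itself.

```python
# Sketch: compare two Inference API models (and temperatures) from code.
# Assumptions: you have an HF token, and the listed model ids are warm on the API.
from huggingface_hub import InferenceClient

client = InferenceClient(token="hf_...")  # replace with your Hugging Face token

prompt = [{"role": "user", "content": "Explain mixture-of-experts in one sentence."}]

for model_id in ("meta-llama/Llama-3.2-1B-Instruct", "meta-llama/Llama-3.2-3B-Instruct"):
    for temperature in (0.2, 0.9):
        out = client.chat_completion(
            model=model_id,
            messages=prompt,
            temperature=temperature,
            max_tokens=80,
        )
        print(f"{model_id} @ T={temperature}:\n{out.choices[0].message.content}\n")
```

This prints the four completions side by side, which is essentially what the Playground does for you in the browser.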