Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

Articles

Organizations

andito's activity

Reacted to merve's post with 🔥 about 7 hours ago
view post
Post
417
The authors of ColPali trained a retrieval model based on SmolVLM 🤠 vidore/colsmolvlm-alpha
TLDR;

- ColSmolVLM performs better than ColPali and DSE-Qwen2 on all English tasks

- ColSmolVLM is more memory efficient than ColQwen2 💗
New activity in HuggingFaceTB/SmolVLM-Instruct about 8 hours ago

Will this work with vLLM?

3
#10 opened 1 day ago by nickandbro
updated a Space about 8 hours ago
posted an update about 11 hours ago
view post
Post
372
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs.

- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a macbook! 🚀
- SmolVLM can be fine-tuned on a Google collab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!

Check out more!
Demo: HuggingFaceTB/SmolVLM
Blog: https://huggingface.co/blog/smolvlm
Model: HuggingFaceTB/SmolVLM-Instruct
Fine-tuning script: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
New activity in HuggingFaceTB/SmolVLM-Instruct 1 day ago

Link to blog

#5 opened 1 day ago by pcuenq
New activity in HuggingFaceTB/SmolVLM-Synthetic 1 day ago

add model card

#2 opened 1 day ago by ariG23498
New activity in HuggingFaceTB/SmolVLM-Base 1 day ago

add model card

#5 opened 1 day ago by ariG23498
liked a Space 1 day ago
New activity in HuggingFaceTB/SmolVLM-Instruct 1 day ago

Revert chat template

#4 opened 1 day ago by merve
New activity in HuggingFaceTB/SmolVLM 1 day ago

Update app.py

#3 opened 1 day ago by andito