Sylvestre Bcht

Sylvestre

Kakulukian

AI & ML interests

None yet

Recent Activity

Reacted to victor's post with 🚀 2 days ago

Perfect example of why https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct is insane? Introducing: AI Video Composer 🔥 https://huggingface.co/spaces/huggingface-projects/ai-video-composer Drag and drop your assets (images, videos, audio) to create any video you want using natural language! It works by asking the model to output a valid FFmpeg command. These commands can be quite complex, but most of the time Qwen2.5-Coder-32B gets it right (that thing is a beast). It's an update of an old project made with GPT-4; it was almost impossible to make it work with open models back then (~1.5 years ago), but not anymore. Let's go open weights 🚀

liked a model 5 days ago

black-forest-labs/FLUX.1-Canny-dev-lora

liked a model 5 days ago

black-forest-labs/FLUX.1-Redux-dev

Articles

Deprecation of Git Authentication using password

Aug 25, 2023

• 19

Introducing DOI: the Digital Object Identifier to Datasets and Models

Oct 7, 2022

• 2

Sylvestre's activity

upvoted a paper about 1 month ago

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Paper • 2410.12628 • Published Oct 16 • 27

upvoted 2 collections about 1 month ago

DocLayout-YOLO

Collection

Dataset and model for DocLayout-YOLO • 9 items • Updated Oct 22 • 12

Granite 3.0 Language Models

Collection

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 23 days ago • 92

upvoted an article 4 months ago

Article

Deprecation of Git Authentication using password

Aug 25, 2023

• 19

upvoted 2 papers 4 months ago

An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion

Paper • 2408.03178 • Published Aug 6 • 36

SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain

Paper • 2407.19584 • Published Jul 28 • 62

upvoted a collection 4 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 627

upvoted 11 papers 4 months ago

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Paper • 2407.15642 • Published Jul 22 • 10

MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation

Paper • 2407.15060 • Published Jul 21 • 9

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22 • 9

Artist: Aesthetically Controllable Text-Driven Stylization without Training

Paper • 2407.15842 • Published Jul 22 • 13

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

Paper • 2407.15187 • Published Jul 21 • 10

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22 • 39

upvoted 2 papers 5 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1 • 39

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29 • 37