Apolinário from multimodal AI art PRO

multimodalart

AI & ML interests

None yet

Recent Activity

liked a Space about 1 hour ago
Yuanshi/OminiControl
liked a Space about 3 hours ago
black-forest-labs/FLUX.1-Redux-dev
liked a Space about 3 hours ago
black-forest-labs/FLUX.1-canny-dev
View all activity

Articles

Organizations

Posts 5

view post
Post
22085
The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is Tencent-Hunyuan/HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model 🖼️✨, trained with multi-lingual CLIP + multi-lingual T5 text-encoders for english 🤝 chinese understanding

Try it out by yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(a bit too slow as the model is chunky and the research code isn't super optimized for inference speed yet)

In the paper they claim to be SOTA open source based on human preference evaluation!