่ฐข้›†

sanaka87
ยท

AI & ML interests

Image Generation

Recent Activity

posted an update about 14 hours ago
๐Ÿš€ Excited to Share Our Latest Work: 3DIS & 3DIS-FLUX for Multi-Instance Layout-to-Image Generation! โค๏ธโค๏ธโค๏ธ ๐ŸŽจ Daily Paper: https://huggingface.co/papers/2501.05131#community ๐Ÿ”“ Code is now open source! ๐ŸŒ Project Website: https://limuloo.github.io/3DIS/ ๐Ÿ  GitHub Repository: https://github.com/limuloo/3DIS ๐Ÿ“„ 3DIS Paper: https://arxiv.org/abs/2410.12669 ๐Ÿ“„ 3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131 ๐Ÿ”ฅ Why 3DIS & 3DIS-FLUX? Current SOTA multi-instance generation methods are typically adapter-based, requiring additional control modules trained on pre-trained models for layout and instance attribute control. However, with the emergence of more powerful models like FLUX and SD3.5, these methods demand constant retraining and extensive resources. โœจ Our Solution: 3DIS We introduce a decoupled approach that only requires training a low-resolution Layout-to-Depth model to convert layouts into coarse-grained scene depth maps. Leveraging community and company pre-trained models like ControlNet + SAM2, we enable training-free controllable image generation on high-resolution models such as SDXL and FLUX. ๐ŸŒŸ Benefits of Our Decoupled Multi-Instance Generation: 1. Enhanced Control: By constructing scenes using depth maps in the first stage, the model focuses on coarse-grained scene layout, improving control over instance placement. 2. Flexibility & Preservation: The second stage employs training-free rendering methods, allowing seamless integration with various models (e.g., fine-tuned weights, LoRA) while maintaining the generative capabilities of pre-trained models. Join us in advancing Layout-to-Image Generation! Follow and star our repository to stay updated! โญ
updated a model about 16 hours ago
sanaka87/3DIS
liked a model about 16 hours ago
sanaka87/3DIS
View all activity

Organizations

None yet

Posts 1

view post
Post
503
๐Ÿš€ Excited to Share Our Latest Work: 3DIS & 3DIS-FLUX for Multi-Instance Layout-to-Image Generation! โค๏ธโค๏ธโค๏ธ

๐ŸŽจ Daily Paper: 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering (2501.05131)
๐Ÿ”“ Code is now open source!
๐ŸŒ Project Website: https://limuloo.github.io/3DIS/
๐Ÿ  GitHub Repository: https://github.com/limuloo/3DIS
๐Ÿ“„ 3DIS Paper: https://arxiv.org/abs/2410.12669
๐Ÿ“„ 3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131

๐Ÿ”ฅ Why 3DIS & 3DIS-FLUX?
Current SOTA multi-instance generation methods are typically adapter-based, requiring additional control modules trained on pre-trained models for layout and instance attribute control. However, with the emergence of more powerful models like FLUX and SD3.5, these methods demand constant retraining and extensive resources.

โœจ Our Solution: 3DIS
We introduce a decoupled approach that only requires training a low-resolution Layout-to-Depth model to convert layouts into coarse-grained scene depth maps. Leveraging community and company pre-trained models like ControlNet + SAM2, we enable training-free controllable image generation on high-resolution models such as SDXL and FLUX.

๐ŸŒŸ Benefits of Our Decoupled Multi-Instance Generation:
1. Enhanced Control: By constructing scenes using depth maps in the first stage, the model focuses on coarse-grained scene layout, improving control over instance placement.
2. Flexibility & Preservation: The second stage employs training-free rendering methods, allowing seamless integration with various models (e.g., fine-tuned weights, LoRA) while maintaining the generative capabilities of pre-trained models.

Join us in advancing Layout-to-Image Generation! Follow and star our repository to stay updated! โญ

datasets

None public yet