Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation • Paper • 2412.09428 • Published Dec 12, 2024 • 7
BrushEdit: All-In-One Image Inpainting and Editing • Paper • 2412.10316 • Published about 1 month ago • 33
FashionComposer: Compositional Fashion Image Generation • Paper • 2412.14168 • Published 26 days ago • 16
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs • Paper • 2412.18925 • Published 19 days ago • 89
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization • Paper • 2412.21037 • Published 14 days ago • 23