Learning Flow Fields in Attention for Controllable Person Image Generation Paper • 2412.08486 • Published 15 days ago • 32
StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements Paper • 2412.08503 • Published 15 days ago • 8
MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation Paper • 2412.07147 • Published 17 days ago • 5
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel Paper • 2412.08467 • Published 15 days ago • 5
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Paper • 2412.06071 • Published 18 days ago • 7
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models Paper • 2412.08629 • Published 15 days ago • 11
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting Paper • 2411.17176 • Published about 1 month ago • 22
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Paper • 2411.18203 • Published 30 days ago • 31