ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published 1 day ago • 46
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published 5 days ago • 32
Style-Friendly SNR Sampler for Style-Driven Generation Paper • 2411.14793 • Published 6 days ago • 34
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published 5 days ago • 38
Attention Prompting on Image for Large Vision-Language Models Paper • 2409.17143 • Published Sep 25 • 7
Heavy Labels Out! Dataset Distillation with Label Space Lightening Paper • 2408.08201 • Published Aug 15 • 18