MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published 6 days ago • 4
In Case You Missed It: ARC 'Challenge' Is Not That Challenging Paper • 2412.17758 • Published 3 days ago • 11
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published 2 days ago • 13
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 3 days ago • 27 • 4
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 3 days ago • 27
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Paper • 2412.18450 • Published 2 days ago • 29
Sequence Matters: Harnessing Video Models in 3D Super-Resolution Paper • 2412.11525 • Published 11 days ago • 10
AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities Paper • 2412.14123 • Published 8 days ago • 11
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Paper • 2412.11258 • Published 11 days ago • 13
Causal Diffusion Transformers for Generative Modeling Paper • 2412.12095 • Published 10 days ago • 23
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published 14 days ago • 19
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing Paper • 2412.07517 • Published 16 days ago • 11
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction Paper • 2412.09573 • Published 14 days ago • 7
PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations Paper • 2412.05994 • Published 18 days ago • 17