Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Paper β’ 2410.10814 β’ Published Oct 14 β’ 48
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper β’ 2410.08261 β’ Published Oct 10 β’ 49
Intriguing Properties of Large Language and Vision Models Paper β’ 2410.04751 β’ Published Oct 7 β’ 16
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Paper β’ 2410.08159 β’ Published Oct 10 β’ 25
Space-Time Video Super-resolution with Neural Operator Paper β’ 2404.06036 β’ Published Apr 9 β’ 1
VideoGigaGAN: Towards Detail-rich Video Super-Resolution Paper β’ 2404.12388 β’ Published Apr 18 β’ 1
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution Paper β’ 2406.13457 β’ Published Jun 19 β’ 16
Improving Generative Adversarial Networks for Video Super-Resolution Paper β’ 2406.16359 β’ Published Jun 24 β’ 1
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors Paper β’ 2407.09919 β’ Published Jul 13 β’ 1