Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper β’ 2405.09818 β’ Published May 16 β’ 126
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Paper β’ 2402.14797 β’ Published Feb 22 β’ 19