UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency Paper • 2412.15216 • Published 8 days ago • 5
BRAVE: Broadening the visual encoding of vision-language models Paper • 2404.07204 • Published Apr 10 • 18
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Paper • 2401.05335 • Published Jan 10 • 27
Text-Conditioned Resampler For Long Form Video Understanding Paper • 2312.11897 • Published Dec 19, 2023 • 5
LIME: Localized Image Editing via Attention Regularization in Diffusion Models Paper • 2312.09256 • Published Dec 14, 2023 • 8