edalvb (Edward Alexis Vásquez Becerra)

upvoted a paper 6 months ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30 • 71

upvoted a paper 7 months ago

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29 • 47

upvoted a paper 8 months ago

RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS

Paper • 2403.13806 • Published Mar 20 • 18

upvoted a paper 9 months ago

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

Paper • 2402.15151 • Published Feb 23 • 7

upvoted a collection 9 months ago

OpenMath

Collection

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Oct 1 • 37

upvoted 2 papers 10 months ago

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Paper • 2401.13795 • Published Jan 24 • 65

Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23 • 86

upvoted 11 papers about 1 year ago

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Paper • 2308.08089 • Published Aug 16, 2023 • 21

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

Paper • 2308.08545 • Published Aug 16, 2023 • 33

upvoted 2 papers over 1 year ago

AutoDecoding Latent 3D Diffusion Models

Paper • 2307.05445 • Published Jul 7, 2023 • 13

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Paper • 2307.04725 • Published Jul 10, 2023 • 64

edalvb's activity

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Gecko: Versatile Text Embeddings Distilled from Large Language Models

RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

OpenMath

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Lumiere: A Space-Time Diffusion Model for Video Generation

Text-to-3D using Gaussian Splatting

ProPainter: Improving Propagation and Transformer for Video Inpainting

Tracking Anything with Decoupled Video Segmentation

DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs

Large-Scale Automatic Audiobook Creation

NExT-GPT: Any-to-Any Multimodal LLM

PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

MagiCapture: High-Resolution Multi-Concept Portrait Customization

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

AutoDecoding Latent 3D Diffusion Models

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning