Yang's picture

14

Yang

XaiverYang

RFKxavieryang

AI & ML interests

None yet

Organizations

None yet

XaiverYang's activity

upvoted a paper 7 days ago

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Paper • 2410.23287 • Published 8 days ago • 17

upvoted a paper 11 days ago

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published 16 days ago • 48

upvoted a paper 17 days ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published 17 days ago • 65

upvoted a paper 20 days ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published 21 days ago • 86

upvoted a paper 22 days ago

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Paper • 2410.12381 • Published 23 days ago • 41

upvoted a paper 24 days ago

Think While You Generate: Discrete Diffusion with Planned Denoising

Paper • 2410.06264 • Published about 1 month ago • 9

upvoted a paper 29 days ago

ControlAR: Controllable Image Generation with Autoregressive Models

Paper • 2410.02705 • Published Oct 3 • 7

upvoted a paper about 1 month ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48

upvoted 2 papers about 2 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17 • 71

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

Paper • 2409.07146 • Published Sep 11 • 19

upvoted 2 papers 3 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Paper • 2407.19985 • Published Jul 29 • 34

upvoted 2 papers 4 months ago

MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Paper • 2407.08083 • Published Jul 10 • 27

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1 • 39