ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Paper • 2410.23287 • Published Oct 30 • 19
Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation Paper • 2305.03907 • Published May 6, 2023 • 1
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning Paper • 2312.03849 • Published Dec 6, 2023 • 5