17 13 21

Zhang Yuanhan

ZhangYuanhan

https://zhangyuanhan-ai.github.io/

zhang_yuanhan
ZhangYuanhan-AI

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

New activity 17 days ago

lmms-lab/LLaVA-Video-178K:Query about how many frames are used to generate each caption?

upvoted a paper 18 days ago

HourVideo: 1-Hour Video-Language Understanding

View all activity

Organizations

Collections 1

Bamboo ViT-B16 Demo

models 2

ZhangYuanhan/llava-1.6-Yi-34b-8k

Updated Mar 21

Zhang Yuanhan

AI & ML interests

Recent Activity

Organizations

Collections 1

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Latent Action Pretraining from Videos

TVBench: Redesigning Video-Language Evaluation

Papers 16

spaces 2

Visual Prompt Retrieval

Bamboo ViT-B16 Demo

models 2

ZhangYuanhan/llava-1.6-Yi-34b-8k

ZhangYuanhan/Bamboo-ViTB_16

datasets 1

ZhangYuanhan/OmniBenchmark

Zhang Yuanhan

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 16

spaces 2 Sort: Recently updated

Visual Prompt Retrieval

Bamboo ViT-B16 Demo

models 2 Sort: Recently updated

datasets 1

spaces 2

models 2