arxiv:2412.09585
Zhengyuan Yang
zyang39
AI & ML interests
None yet
Recent Activity
authored
a paper
10 days ago
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary
Embedding Distillation
upvoted
a
paper
20 days ago
Scaling Inference-Time Search with Vision Value Model for Improved
Visual Comprehension
Organizations
Papers
16
models
None public yet
datasets
None public yet