Kanzhi Cheng's picture

3 5 1

Kanzhi Cheng

cckevinn

·

AI & ML interests

None yet

Recent Activity

Reacted to Symbol-LLM's post with 🚀 7 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

Reacted to Symbol-LLM's post with 🔥 7 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

Reacted to Symbol-LLM's post with 🔥 7 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

View all activity

Organizations

Papers 2

arxiv:2410.23218

arxiv:2403.14734

models 4

cckevinn/SeeClick-miniwob

Text Generation • Updated Mar 28 • 6 • 1

cckevinn/SeeClick-aitw

Text Generation • Updated Mar 23 • 661 • 1

cckevinn/SeeClick-mind2web

Text Generation • Updated Feb 16 • 496

cckevinn/SeeClick

Text Generation • Updated Jan 29 • 709 • 14

datasets 1

cckevinn/SeeClick-WebImgs

Updated Mar 10 • 1 • 2