Kanzhi Cheng's picture

3 5 1

Kanzhi Cheng

cckevinn

·

AI & ML interests

None yet

Recent Activity

Reacted to Symbol-LLM's post with 🚀 9 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

Reacted to Symbol-LLM's post with 🔥 9 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

Reacted to Symbol-LLM's post with 🔥 9 days ago

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning ! 📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection 🔗 Link: https://huggingface.co/papers/2411.00855 😇Takeaways: - We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing. - Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

View all activity

Organizations

cckevinn's activity

Reacted to Symbol-LLM's post with 🚀🔥🔥 9 days ago

Post

899

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning !

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇Takeaways:

- We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing.

- Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !

upvoted a collection 10 days ago

Symbol-LLM

4 items • Updated 20 days ago • 5

upvoted a paper 10 days ago

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30 • 4

upvoted a paper 26 days ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30 • 46

authored a paper 26 days ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30 • 46

upvoted a paper about 1 month ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24 • 30

New activity in cckevinn/SeeClick 8 months ago

Clarification towards the different models

#1 opened 8 months ago by

liked a model 8 months ago

cckevinn/SeeClick

Text Generation • Updated Jan 29 • 582 • 14

updated a model 8 months ago

cckevinn/SeeClick-miniwob

Text Generation • Updated Mar 28 • 6 • 1

authored a paper 8 months ago

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

Paper • 2403.14734 • Published Mar 21 • 22

upvoted a paper 8 months ago

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

Paper • 2403.14734 • Published Mar 21 • 22

updated a model 8 months ago

cckevinn/SeeClick-aitw

Text Generation • Updated Mar 23 • 679 • 1

updated a dataset 9 months ago

cckevinn/SeeClick-WebImgs

Updated Mar 10 • 1 • 2

updated 2 models 10 months ago

cckevinn/SeeClick-mind2web

Text Generation • Updated Feb 16 • 516

cckevinn/SeeClick

Text Generation • Updated Jan 29 • 582 • 14

New activity in adept/fuyu-8b about 1 year ago

Released capabilities

#42 opened about 1 year ago by

New activity in OFA-Sys/ofa-base-caption-fairseq-version almost 2 years ago

Farseq -> Transformers conversion

#1 opened about 2 years ago by

mys

Farseq -> Transformers conversion

#1 opened about 2 years ago by

mys