🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning!
Title: Vision-Language Models Can Self-Improve Reasoning via Reflection
Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)
Takeaways:
- We found that VLMs can self-improve their reasoning performance through a reflection mechanism and, importantly, that this approach scales with test-time compute.
- Evaluations on comprehensive and diverse vision-language reasoning tasks are included!