Pretraining in Deep Reinforcement Learning: A Survey Paper • 2211.03959 • Published Nov 8, 2022 • 1
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment Paper • 2410.09421 • Published Oct 12
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Paper • 2411.17451 • Published 6 days ago • 10
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Paper • 2411.17451 • Published 6 days ago • 10
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Paper • 2409.17115 • Published Sep 25 • 59