Xie

Zhihui

https://zhxie.site/

zhxieml

Zhihui's activity

liked a dataset 4 days ago

MMInstruction/VL-RewardBench

Viewer • Updated about 15 hours ago • 1.25k • 36 • 2

authored 3 papers 5 days ago

Pretraining in Deep Reinforcement Learning: A Survey

Paper • 2211.03959 • Published Nov 8, 2022 • 1

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Paper • 2410.09421 • Published Oct 12

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published 6 days ago • 10

liked a Space 5 days ago

🥇 VL RewardBench • Running

upvoted a paper 6 days ago

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published 6 days ago • 10

liked a dataset 6 days ago

allenai/olmo-mix-1124

Viewer • Updated about 9 hours ago • 99.1M • 3.48k • 18

updated a Space 7 days ago

🥇 VL RewardBench • Running

liked a dataset 17 days ago

codeparrot/apps

Viewer • Updated Oct 20, 2022 • 20k • 4.17k • 128

updated a dataset 27 days ago

MMInstruction/VL-RewardBench

Viewer • Updated about 15 hours ago • 1.25k • 36 • 2

liked 2 models about 1 month ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated 15 days ago • 165k • 343

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25 • 6.41M • 3.16k

liked a Space about 1 month ago

🚀 Qwen2.5 • Running • 554

updated a dataset about 2 months ago

MMInstruction/VLFeedback

Viewer • Updated Oct 17 • 80.3k • 432 • 43

New activity in MMInstruction/VLFeedback about 2 months ago

Upload parquet version of the dataset

#2 opened about 2 months ago by

davanstrien

liked a dataset about 2 months ago

deepmind/code_contests

Viewer • Updated Jun 11, 2023 • 4.04k • 5.47k • 118

upvoted a paper 2 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

liked 2 models 2 months ago

allenai/Molmo-72B-0924

Image-Text-to-Text • Updated Oct 10 • 6.11k • 266

Qwen/Qwen2.5-Coder-1.5B-Instruct

Text Generation • Updated 15 days ago • 20.9k • 45

updated a dataset 3 months ago

Zhihui/VLFeedback

Viewer • Updated Sep 3 • 382k • 138 • 7