Shayekh Bin Islam's picture

Shayekh Bin Islam

shayekh

·

https://shayekhbinislam.github.io/

AI & ML interests

Natural Language Processing, Reinforcement Learning

Recent Activity

liked a dataset 13 days ago

Inevitablevalor/EmbodiedAgentInterface

authored a paper about 1 month ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

updated a collection about 1 month ago

M-RᴇᴡᴀʀᴅBᴇɴᴄʜ

View all activity

Organizations

shayekh's activity

upvoted a collection about 1 month ago

Multilingual RewardBench

Multilingual Reward Model Evaluation Dataset and Results • 2 items • Updated Oct 26 • 4

upvoted 4 papers about 1 month ago

How to Evaluate Reward Models for RLHF

Paper • 2410.14872 • Published Oct 18 • 1

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17 • 1

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23 • 1

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 10