Asaf-Yehudai (Asaf Yehudai)

upvoted a paper 10 days ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published 20 days ago • 10

upvoted a paper 26 days ago

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published 27 days ago • 25

upvoted a paper about 1 month ago

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3 • 16

upvoted a paper about 2 months ago

What's In My Big Data?

Paper • 2310.20707 • Published Oct 31, 2023 • 10

upvoted a paper 2 months ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15 • 20

upvoted an article 2 months ago

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 75

upvoted a paper 3 months ago

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Paper • 2407.13696 • Published Jul 18 • 5

upvoted a paper 4 months ago

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29 • 22

upvoted an article 5 months ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

By

•

Jul 9

• 36

upvoted 3 papers 5 months ago

upvoted a paper 6 months ago

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18 • 33

upvoted an article 6 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 65

upvoted a paper 6 months ago

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Paper • 2405.04324 • Published May 7 • 22

upvoted an article 7 months ago

Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

By

•

Mar 20

• 17

upvoted 3 papers 9 months ago

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15 • 29

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9 • 54

Genie: Achieving Human Parity in Content-Grounded Datasets Generation

Paper • 2401.14367 • Published Jan 25 • 6

Asaf Yehudai

AI & ML interests

Organizations

Asaf-Yehudai's activity

M-RewardBench: Evaluating Reward Models in Multilingual Settings

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Contextual Document Embeddings

What's In My Big Data?

The Future of Open Human Feedback

The Rise of Agentic Data Generation

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Mixture-of-Agents Enhances Large Language Model Capabilities

Large Language Model Confidence Estimation via Black-Box Access

ChatQA: Building GPT-4 Level Conversational QA Models

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Releasing Common Corpus: the largest public domain dataset for training LLMs

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Genie: Achieving Human Parity in Content-Grounded Datasets Generation

Asaf Yehudai

AI & ML interests

Organizations

Asaf-Yehudai's activity

The Rise of Agentic Data Generation

BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Releasing Common Corpus: the largest public domain dataset for training LLMs

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡