3 5 2

Alexandre Rame

alexrame

https://alexrame.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

LearnLM: Improving Gemini for Learning

View all activity

Organizations

None yet

alexrame's activity

upvoted a paper 3 days ago

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 6 days ago • 17

authored a paper 3 months ago

Diversity-Rewarded CFG Distillation

Paper • 2410.06084 • Published Oct 8 • 10

upvoted a paper 3 months ago

Diversity-Rewarded CFG Distillation

Paper • 2410.06084 • Published Oct 8 • 10

commented a paper 3 months ago

Diversity-Rewarded CFG Distillation

Paper • 2410.06084 • Published Oct 8 • 10 •

authored 3 papers 5 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 75

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22 • 9

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19 • 18

authored a paper 6 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24 • 22

upvoted a paper 6 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24 • 22

commented a paper 6 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24 • 22 •

upvoted a paper 10 months ago

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12 • 39

commented 2 papers 11 months ago

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22 • 18 •

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22 • 18 •

authored a paper 11 months ago

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7 • 29

commented a paper 11 months ago

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22 • 18 •

authored a paper 11 months ago

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22 • 18

upvoted a paper 11 months ago

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22 • 18

liked a model about 1 year ago

mistralai/Mistral-7B-Instruct-v0.1

Text Generation • Updated Aug 22 • 3.56M • 1.54k

updated a model over 1 year ago

alexrame/llama-7b-hf-ppo-summary-dnews-grs-g0-kl05-p-08-02-1691005682

Updated Aug 3, 2023

authored a paper over 1 year ago

Unified Model for Image, Video, Audio and Language Tasks

Paper • 2307.16184 • Published Jul 30, 2023 • 14