- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 144
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 48
- OLMo: Accelerating the Science of Language Models
  Paper • 2402.00838 • Published • 81
- OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
  Paper • 2402.01739 • Published • 26
Valentin Perret
perretv
AI & ML interests
None yet
Recent Activity
Liked a model 21 days ago: meta-llama/Llama-3.1-8B-Instruct
Organizations
Collections
1
models
None public yet
datasets
None public yet