SimPO - a princeton-nlp Collection

princeton-nlp 's Collections

SimPO

ProLong

SimCSE

SimPO

updated 23 days ago

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • Updated Aug 2 • 17.5k • 126
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • Updated Jul 18 • 2.62k • 5
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • Updated Jun 17 • 2.61k
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • Updated Jun 17 • 5.49k
princeton-nlp/Llama-3-Base-8B-SFT-KTO

Text Generation • Updated Jun 17 • 5.07k
princeton-nlp/Llama-3-Base-8B-SFT-ORPO

Text Generation • Updated Jun 17 • 5.06k
princeton-nlp/Llama-3-Base-8B-SFT-RDPO

Text Generation • Updated Jun 17 • 5.35k
princeton-nlp/Llama-3-Base-8B-SFT-SimPO

Text Generation • Updated May 24 • 2.97k
princeton-nlp/Llama-3-Base-8B-SFT

Text Generation • Updated Jun 17 • 10.1k • 1
princeton-nlp/Llama-3-Instruct-8B-SimPO

Text Generation • Updated Jun 17 • 13.7k • 55
princeton-nlp/Llama-3-Instruct-8B-IPO

Text Generation • Updated Jun 17 • 2.58k
princeton-nlp/Llama-3-Instruct-8B-KTO

Text Generation • Updated Jun 17 • 5.06k
princeton-nlp/Llama-3-Instruct-8B-ORPO

Text Generation • Updated Jun 17 • 5.06k
princeton-nlp/Llama-3-Instruct-8B-RDPO

Text Generation • Updated Jun 17 • 5.05k
princeton-nlp/Llama-3-Instruct-8B-DPO

Text Generation • Updated Jun 17 • 5.08k
princeton-nlp/Mistral-7B-Instruct-RDPO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Instruct-DPO

Text Generation • Updated Jun 17 • 3.04k
princeton-nlp/Mistral-7B-Instruct-IPO

Text Generation • Updated Jun 17 • 3.04k
princeton-nlp/Mistral-7B-Instruct-KTO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Instruct-SimPO

Text Generation • Updated Jun 17 • 3.05k • 1
princeton-nlp/Mistral-7B-Instruct-ORPO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Base-SFT-IPO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Base-SFT-KTO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Base-SFT-DPO

Text Generation • Updated Jun 17 • 2.6k
princeton-nlp/Mistral-7B-Base-SFT-RDPO

Text Generation • Updated Jun 17 • 3.05k
princeton-nlp/Mistral-7B-Base-SFT-SimPO

Text Generation • Updated Jun 17 • 5.07k
princeton-nlp/llama3-ultrafeedback

Viewer • Updated Jul 18 • 61.8k • 1.24k • 15
princeton-nlp/Mistral-7B-Base-SFT-CPO

Text Generation • Updated Sep 30 • 3k • 1
princeton-nlp/Mistral-7B-Base-SFT-RRHF

Text Generation • Updated Sep 30 • 2.99k
princeton-nlp/Mistral-7B-Base-SFT-SLiC-HF

Text Generation • Updated Jul 7 • 2.99k
princeton-nlp/Mistral-7B-Instruct-CPO

Text Generation • Updated Jul 7 • 2.98k
princeton-nlp/Mistral-7B-Instruct-RRHF

Text Generation • Updated Jul 7 • 2.99k
princeton-nlp/Mistral-7B-Instruct-SLiC-HF

Text Generation • Updated Jul 7 • 2.98k
princeton-nlp/Llama-3-Base-8B-SFT-CPO

Text Generation • Updated Jul 7 • 5.06k
princeton-nlp/Llama-3-Base-8B-SFT-RRHF

Text Generation • Updated Jul 7 • 2.6k
princeton-nlp/Llama-3-Base-8B-SFT-SLiC-HF

Text Generation • Updated Jul 7 • 2.61k
princeton-nlp/Llama-3-Instruct-8B-CPO

Text Generation • Updated Jul 7 • 5.06k
princeton-nlp/Llama-3-Instruct-8B-RRHF

Text Generation • Updated Jul 7 • 2.59k
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF

Text Generation • Updated Jul 7 • 2.59k
princeton-nlp/Llama-3-Instruct-8B-RRHF-v0.2

Text Generation • Updated Jul 7 • 2.6k
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF-v0.2

Text Generation • Updated Jul 7 • 2.61k
princeton-nlp/Llama-3-Instruct-8B-DPO-v0.2

Text Generation • Updated Jul 7 • 5.12k
princeton-nlp/Llama-3-Instruct-8B-IPO-v0.2

Text Generation • Updated Jul 7 • 2.62k
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2

Text Generation • Updated Jul 7 • 5.09k
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2

Text Generation • Updated Jul 7 • 5.22k
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2

Text Generation • Updated Jul 7 • 6.38k • 1
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2

Text Generation • Updated Jul 7 • 2.6k • 1
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2

Text Generation • Updated Jul 7 • 3.06k • 5
princeton-nlp/llama3-ultrafeedback-armorm

Viewer • Updated Jul 18 • 61.8k • 821 • 15