QuantFactory
/

Llama-3-Instruct-8B-RDPO-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

munish0838 commited on May 29

Commit

d5f5353

•

1 Parent(s): e0ed7dd

Create README.md

Files changed (1) hide show

README.md +11 -0

README.md ADDED Viewed

	@@ -0,0 +1,11 @@

+---
+library_name: transformers
+pipeline_tag: text-generation
+base_model: princeton-nlp/Llama-3-Instruct-8B-RDPO
+---
+# QuantFactory/Llama-3-Instruct-8B-RDPO-GGUF
+This is quantized version of [princeton-nlp/Llama-3-Instruct-8B-RDPO](https://huggingface.co/princeton-nlp/Llama-3-Instruct-8B-RDPO) created using llama.cpp
+# Model Description
+This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)*  Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.