RDson
/

Orca-Llama-3-8B-Instruct-DPO

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

RDson commited on Apr 20

Commit

42a8f74

•

1 Parent(s): 1ef7615

Update README.md

Files changed (1) hide show

README.md +29 -1

README.md CHANGED Viewed

@@ -1,8 +1,36 @@
 ---
 library_name: transformers
-tags: []
 ---
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->

 ---
 library_name: transformers
+tags:
+- llama 3
+datasets:
+- Intel/orca_dpo_pairs
+pipeline_tag: text-generation
 ---
+Finetuned on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
+ORPOConfig:
+```
+    learning_rate=1e-6,
+    lr_scheduler_type="linear",
+    max_length=1024,
+    max_prompt_length=512,
+    overwrite_output_dir=True,
+    beta=0.1,
+    per_device_train_batch_size=2,
+    per_device_eval_batch_size=2,
+    gradient_accumulation_steps=4,
+    optim="paged_adamw_8bit",
+    num_train_epochs=3,
+    evaluation_strategy="steps",
+    eval_steps=0.2,
+    logging_steps=1,
+    warmup_steps=10,
+    report_to="wandb",
+    output_dir="./results/",
+    fp16=True
+```
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->