Update README.md
Browse files
README.md
CHANGED
@@ -6,11 +6,13 @@ datasets:
|
|
6 |
- Intel/orca_dpo_pairs
|
7 |
pipeline_tag: text-generation
|
8 |
---
|
|
|
9 |
|
10 |
Finetuned [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
|
11 |
|
12 |
GGUF can be found here [RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF](https://huggingface.co/RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF)
|
13 |
|
|
|
14 |
ORPOConfig:
|
15 |
|
16 |
```
|
@@ -24,19 +26,20 @@ ORPOConfig:
|
|
24 |
per_device_eval_batch_size=2,
|
25 |
gradient_accumulation_steps=4,
|
26 |
optim="paged_adamw_8bit",
|
27 |
-
num_train_epochs=
|
28 |
evaluation_strategy="steps",
|
29 |
eval_steps=0.2,
|
30 |
logging_steps=1,
|
31 |
-
warmup_steps=
|
32 |
report_to="wandb",
|
33 |
output_dir="./results/",
|
34 |
-
fp16=True
|
|
|
35 |
```
|
36 |
|
37 |
<div style="text-align: center;">
|
38 |
-
<img src="https://i.imgur.com/
|
39 |
-
<img src="https://i.imgur.com/
|
40 |
</div>
|
41 |
|
42 |
# Model Card for Model ID
|
|
|
6 |
- Intel/orca_dpo_pairs
|
7 |
pipeline_tag: text-generation
|
8 |
---
|
9 |
+
# Orca-Llama-3-8B-Instruct-DPO
|
10 |
|
11 |
Finetuned [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
|
12 |
|
13 |
GGUF can be found here [RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF](https://huggingface.co/RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF)
|
14 |
|
15 |
+
|
16 |
ORPOConfig:
|
17 |
|
18 |
```
|
|
|
26 |
per_device_eval_batch_size=2,
|
27 |
gradient_accumulation_steps=4,
|
28 |
optim="paged_adamw_8bit",
|
29 |
+
num_train_epochs=1,
|
30 |
evaluation_strategy="steps",
|
31 |
eval_steps=0.2,
|
32 |
logging_steps=1,
|
33 |
+
warmup_steps=35,
|
34 |
report_to="wandb",
|
35 |
output_dir="./results/",
|
36 |
+
fp16=True,
|
37 |
+
save_steps=50
|
38 |
```
|
39 |
|
40 |
<div style="text-align: center;">
|
41 |
+
<img src="https://i.imgur.com/vQ4RzSl.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
|
42 |
+
<img src="https://i.imgur.com/9H75ijW.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
|
43 |
</div>
|
44 |
|
45 |
# Model Card for Model ID
|