RDson commited on
Commit
b0dc11f
1 Parent(s): 645626b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -5
README.md CHANGED
@@ -6,11 +6,13 @@ datasets:
6
  - Intel/orca_dpo_pairs
7
  pipeline_tag: text-generation
8
  ---
 
9
 
10
  Finetuned [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
11
 
12
  GGUF can be found here [RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF](https://huggingface.co/RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF)
13
 
 
14
  ORPOConfig:
15
 
16
  ```
@@ -24,19 +26,20 @@ ORPOConfig:
24
  per_device_eval_batch_size=2,
25
  gradient_accumulation_steps=4,
26
  optim="paged_adamw_8bit",
27
- num_train_epochs=3,
28
  evaluation_strategy="steps",
29
  eval_steps=0.2,
30
  logging_steps=1,
31
- warmup_steps=10,
32
  report_to="wandb",
33
  output_dir="./results/",
34
- fp16=True
 
35
  ```
36
 
37
  <div style="text-align: center;">
38
- <img src="https://i.imgur.com/3ObAZpO.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
39
- <img src="https://i.imgur.com/9dInJrS.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
40
  </div>
41
 
42
  # Model Card for Model ID
 
6
  - Intel/orca_dpo_pairs
7
  pipeline_tag: text-generation
8
  ---
9
+ # Orca-Llama-3-8B-Instruct-DPO
10
 
11
  Finetuned [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
12
 
13
  GGUF can be found here [RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF](https://huggingface.co/RDson/Orca-Llama-3-8B-Instruct-DPO-GGUF)
14
 
15
+
16
  ORPOConfig:
17
 
18
  ```
 
26
  per_device_eval_batch_size=2,
27
  gradient_accumulation_steps=4,
28
  optim="paged_adamw_8bit",
29
+ num_train_epochs=1,
30
  evaluation_strategy="steps",
31
  eval_steps=0.2,
32
  logging_steps=1,
33
+ warmup_steps=35,
34
  report_to="wandb",
35
  output_dir="./results/",
36
+ fp16=True,
37
+ save_steps=50
38
  ```
39
 
40
  <div style="text-align: center;">
41
+ <img src="https://i.imgur.com/vQ4RzSl.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
42
+ <img src="https://i.imgur.com/9H75ijW.png" style="width: 100%; margin: 0 auto; display: inline-block;"/>
43
  </div>
44
 
45
  # Model Card for Model ID