RDson commited on
Commit
42a8f74
1 Parent(s): 1ef7615

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -1
README.md CHANGED
@@ -1,8 +1,36 @@
1
  ---
2
  library_name: transformers
3
- tags: []
 
 
 
 
4
  ---
5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  # Model Card for Model ID
7
 
8
  <!-- Provide a quick summary of what the model is/does. -->
 
1
  ---
2
  library_name: transformers
3
+ tags:
4
+ - llama 3
5
+ datasets:
6
+ - Intel/orca_dpo_pairs
7
+ pipeline_tag: text-generation
8
  ---
9
 
10
+ Finetuned on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) using a single 3090 24GB.
11
+ ORPOConfig:
12
+
13
+ ```
14
+ learning_rate=1e-6,
15
+ lr_scheduler_type="linear",
16
+ max_length=1024,
17
+ max_prompt_length=512,
18
+ overwrite_output_dir=True,
19
+ beta=0.1,
20
+ per_device_train_batch_size=2,
21
+ per_device_eval_batch_size=2,
22
+ gradient_accumulation_steps=4,
23
+ optim="paged_adamw_8bit",
24
+ num_train_epochs=3,
25
+ evaluation_strategy="steps",
26
+ eval_steps=0.2,
27
+ logging_steps=1,
28
+ warmup_steps=10,
29
+ report_to="wandb",
30
+ output_dir="./results/",
31
+ fp16=True
32
+ ```
33
+
34
  # Model Card for Model ID
35
 
36
  <!-- Provide a quick summary of what the model is/does. -->