ruslandev/llama-3-8b-samantha


ruslandev committed on Apr 29

Commit a803efa • 1 Parent(s): 22a845c

Update README.md

Files changed (1)

README.md +25 -1


@@ -33,7 +33,31 @@ Prompt format is Alpaca. I used the same system prompt as the original Samantha.
 ### Response:
 """
 ```
-[Training code is here](https://github.com/RuslanPeresy/gptchain)
+# Training
+The [gptchain](https://github.com/RuslanPeresy/gptchain) framework was used for training.
+## Training hyperparameters
+- learning_rate: 2e-4
+- seed: 3407
+- gradient_accumulation_steps: 4
+- per_device_train_batch_size: 2
+- optimizer: adamw_8bit
+- lr_scheduler_type: linear
+- warmup_steps: 5
+- num_epochs: 2
+- weight_decay: 0.01
+## Training results
+|Training Loss | Epoch | Step |
+|--------------|-------|------|
+|2.0778        |0.0    |1     |
+|0.6255        |0.18   |120   |
+|0.6208        |0.94   |620   |
+|0.6244        |2.0    |1306  |
 Two-epoch fine-tuning from llama-3-8b took 1 hour on a single A100 with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
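
The hyperparameters and the results table added in this commit can be cross-checked with a little arithmetic. The sketch below is not taken from gptchain; it only restates the listed values in plain Python and derives the effective batch size and the steps per epoch. It assumes a single GPU (the README mentions a single A100), and the helper names are hypothetical. The dataset-size figure is an estimate inferred from the step count, not a number stated in the README.

```python
# Hyperparameters copied from the "Training hyperparameters" list in the diff.
hparams = {
    "learning_rate": 2e-4,
    "seed": 3407,
    "gradient_accumulation_steps": 4,
    "per_device_train_batch_size": 2,
    "optimizer": "adamw_8bit",
    "lr_scheduler_type": "linear",
    "warmup_steps": 5,
    "num_epochs": 2,
    "weight_decay": 0.01,
}

NUM_GPUS = 1        # assumption: the README mentions a single A100
TOTAL_STEPS = 1306  # last row of the "Training results" table


def effective_batch_size(h, num_gpus):
    """Examples consumed per optimizer step."""
    return (
        h["per_device_train_batch_size"]
        * h["gradient_accumulation_steps"]
        * num_gpus
    )


def steps_per_epoch(total_steps, num_epochs):
    """Optimizer steps in one epoch, assuming epochs are equally long."""
    return total_steps / num_epochs


def epoch_at_step(step, total_steps, num_epochs):
    """Fractional epoch reached after `step` optimizer steps."""
    return step / steps_per_epoch(total_steps, num_epochs)


batch = effective_batch_size(hparams, NUM_GPUS)
per_epoch = steps_per_epoch(TOTAL_STEPS, hparams["num_epochs"])

print("effective batch size:", batch)
print("steps per epoch:", per_epoch)
# Rough estimate only: the last batch of an epoch may be partial.
print("approx. examples per epoch:", int(per_epoch * batch))
print("epoch at step 120:", round(epoch_at_step(120, TOTAL_STEPS, hparams["num_epochs"]), 2))
```

Under these assumptions, the derived epoch at step 120 matches the 0.18 logged in the table, which suggests the listed batch settings and step counts are mutually consistent. If the run is reproduced with `transformers.TrainingArguments`, note that the optimizer and epoch count are passed as `optim` and `num_train_epochs` there, rather than under the names used in the list.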