Training completed!

Files changed (4)

README.md CHANGED

@@ -11,10 +11,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/yspkm/PrunePath-LoRA/runs/21funio3)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/yspkm/PrunePath-LoRA/runs/shy1ed51)
 # Meta-Llama-3-8B-Instruct-mixalphalora-math
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3783
 ## Model description
@@ -42,10 +44,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 100
-- num_epochs: 0.1
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 0.4349        | 0.5133 | 200  | 0.4295          |
+| 0.3872        | 1.0266 | 400  | 0.4000          |
+| 0.3662        | 1.5399 | 600  | 0.3870          |
+| 0.3252        | 2.0533 | 800  | 0.3815          |
+| 0.3227        | 2.5666 | 1000 | 0.3783          |
 ### Framework versions
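
The training results table added in the README diff stops at step 1000 (epoch 2.5666) although `num_epochs` is 3. A rough sketch of the arithmetic behind that, assuming evaluation ran every 200 steps (inferred from the spacing of the Step column, not stated in the diff) and that the logged epoch is simply the step divided by the steps per epoch:

```python
# (epoch, step) pairs copied from the training results table in the README diff.
logged = [(0.5133, 200), (1.0266, 400), (1.5399, 600), (2.0533, 800), (2.5666, 1000)]

NUM_EPOCHS = 3    # from "- num_epochs: 3" in the diff
EVAL_EVERY = 200  # assumption: inferred from the spacing of the Step column


def estimate_steps_per_epoch(rows):
    """Average step/epoch over all logged rows to smooth out rounding."""
    return sum(step / epoch for epoch, step in rows) / len(rows)


steps_per_epoch = estimate_steps_per_epoch(logged)
total_steps = steps_per_epoch * NUM_EPOCHS
# Last multiple of EVAL_EVERY that still fits inside the run.
last_eval_step = int(total_steps // EVAL_EVERY) * EVAL_EVERY

print(f"steps per epoch ~ {steps_per_epoch:.1f}")
print(f"total steps     ~ {total_steps:.0f}")
print(f"last eval step  = {last_eval_step}")
```

Under these assumptions the run ends before a sixth evaluation at step 1200 could happen, which would be consistent with the reported evaluation loss of 0.3783 matching the step-1000 row.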

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2fc5f84f7829bb1c8336f805e2f991a2c0eee903d8c05837c997e5d873bb398d
+oid sha256:276f68a5a378d57a428d291dd2f846cbeece5c2ebc0d8a0cfe7b8035610f66e1
 size 335778826

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0685438eeab2e5e0391c509439b6a6853bb79f7cb9bab463d236a58b7f182501
+oid sha256:f3fa19998048ccc5ee46d746199ccff624377d6451ce15ff141620c112d49f9f
 size 335633928

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7fd54e4073405fd7b95dff56e1607329251dc729a99d7111d197477a60f30741
+oid sha256:ab4591facce0a202667a4295994ea9ccc1d4def50d985f912acc8f99758bfedc
 size 5240
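
The three binary files above are stored as Git LFS pointers, so the diff only shows a changed `oid` while `size` stays the same. A minimal sketch of how a downloaded file could be checked against such a pointer follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library, and the example pointer is the new `training_args.bin` entry copied from the diff.

```python
import hashlib
from pathlib import Path

# New pointer for training_args.bin, copied from the diff above.
EXAMPLE_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ab4591facce0a202667a4295994ea9ccc1d4def50d985f912acc8f99758bfedc
size 5240
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest the pointer records."""
    data_path = Path(path)
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(EXAMPLE_POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `matches_pointer("training_args.bin", pointer)` would report whether it corresponds to the new revision. The size is compared first because it is cheap and rules out most mismatches, although in this commit the old and new sizes are identical, so only the digest distinguishes the revisions.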