DeepDream2045
/

fc33bed3-bd72-4566-bde6-420747492880

Generated from Trainer

Model card Files Files and versions Community

DeepDream2045 commited on 22 days ago

Commit

728fed4

•

1 Parent(s): 4465eb7

End of training

Files changed (2) hide show

README.md +3 -3
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -102,7 +102,7 @@ xformers_attention: true
 This model is a fine-tuned version of [scb10x/llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1079
 ## Model description
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 3.0767        | 0.0140 | 1    | 3.6313          |
-| 2.1327        | 0.3490 | 25   | 2.1874          |
-| 1.9024        | 0.6981 | 50   | 2.1079          |
 ### Framework versions

 This model is a fine-tuned version of [scb10x/llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1094
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 3.0767        | 0.0140 | 1    | 3.6313          |
+| 2.1337        | 0.3490 | 25   | 2.1876          |
+| 1.9006        | 0.6981 | 50   | 2.1094          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c1e3abab8b44d544abfa151541363e855d2c96c6311215d06fb69708bb2861bf
 size 335706186

 version https://git-lfs.github.com/spec/v1
+oid sha256:886f69db42cc0bee4d742de2a0554e326b7f39f0d2d7a92dbaa50ac416533feb
 size 335706186