DeepDream2045 committed on
Commit
728fed4
1 Parent(s): 4465eb7

End of training

Files changed (2)
  1. README.md +3 -3
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -102,7 +102,7 @@ xformers_attention: true
 
 This model is a fine-tuned version of [scb10x/llama-3-typhoon-v1.5-8b-instruct](https://huggingface.co/scb10x/llama-3-typhoon-v1.5-8b-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1079
+- Loss: 2.1094
 
 ## Model description
 
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 3.0767 | 0.0140 | 1 | 3.6313 |
-| 2.1327 | 0.3490 | 25 | 2.1874 |
-| 1.9024 | 0.6981 | 50 | 2.1079 |
+| 2.1337 | 0.3490 | 25 | 2.1876 |
+| 1.9006 | 0.6981 | 50 | 2.1094 |
 
 
 ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c1e3abab8b44d544abfa151541363e855d2c96c6311215d06fb69708bb2861bf
+oid sha256:886f69db42cc0bee4d742de2a0554e326b7f39f0d2d7a92dbaa50ac416533feb
 size 335706186
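
The `adapter_model.bin` change touches only a Git LFS pointer: the `oid` is the SHA-256 digest of the real file and `size` is its length in bytes. The size is identical before and after, so only the digest shows that the weights changed. A minimal sketch of checking a downloaded file against the new pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` and the local path `adapter_model.bin` are assumptions for illustration, not part of this repository.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:886f69db42cc0bee4d742de2a0554e326b7f39f0d2d7a92dbaa50ac416533feb
size 335706186
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch rules the file out without hashing.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so a ~335 MB file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER)
    local_file = Path("adapter_model.bin")  # assumed download location
    if local_file.exists():
        print("matches pointer:", matches_pointer(local_file, pointer))
    else:
        print("file not found:", local_file)
```

Run against a copy fetched at the parent commit `4465eb7`, the same check should fail on the digest while passing the size comparison.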