kajamo commited on
Commit
30acec3
1 Parent(s): b662471

Model save

Browse files
Files changed (3) hide show
  1. README.md +5 -5
  2. adapter_model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -34,11 +34,11 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
- - train_batch_size: 8
38
- - eval_batch_size: 8
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
- - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
  - lr_scheduler_warmup_steps: 500
@@ -50,6 +50,6 @@ The following hyperparameters were used during training:
50
 
51
  - PEFT 0.12.0
52
  - Transformers 4.44.2
53
- - Pytorch 2.4.0+cu121
54
- - Datasets 3.0.0
55
  - Tokenizers 0.19.1
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
+ - train_batch_size: 16
38
+ - eval_batch_size: 16
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
+ - total_train_batch_size: 64
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
  - lr_scheduler_warmup_steps: 500
 
50
 
51
  - PEFT 0.12.0
52
  - Transformers 4.44.2
53
+ - Pytorch 2.4.0
54
+ - Datasets 2.21.0
55
  - Tokenizers 0.19.1
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b0d7c6b8e58e03c88383a04b22184e74ffec3b88abe3365e938174eafafd52e
3
  size 1189536
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5485cfb21f10a47d953a5cd40668a6dfdd74917ae00a4c899f301a996fe965af
3
  size 1189536
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6f2ed6e088e8be448b90960d9a882f5c2235278c12b441f1b2487780ad68dbfc
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:96b4bc93efcd32a060311a05cc3858940dfa4ee271d847172641adce4606b187
3
  size 5304