Migara Amarasinghe commited on
Commit
0a9912c
1 Parent(s): e17624f

Model save

Browse files
Files changed (2) hide show
  1. README.md +4 -6
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -5,8 +5,6 @@ tags:
5
  - trl
6
  - sft
7
  - generated_from_trainer
8
- datasets:
9
- - generator
10
  base_model: google/gemma-2b
11
  model-index:
12
  - name: Gemma2B-LORAfied
@@ -18,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # Gemma2B-LORAfied
20
 
21
- This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on the generator dataset.
22
 
23
  ## Model description
24
 
@@ -41,12 +39,12 @@ The following hyperparameters were used during training:
41
  - train_batch_size: 2
42
  - eval_batch_size: 8
43
  - seed: 42
44
- - gradient_accumulation_steps: 8
45
- - total_train_batch_size: 16
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_ratio: 0.05
49
- - training_steps: 1480
50
 
51
  ### Framework versions
52
 
 
5
  - trl
6
  - sft
7
  - generated_from_trainer
 
 
8
  base_model: google/gemma-2b
9
  model-index:
10
  - name: Gemma2B-LORAfied
 
16
 
17
  # Gemma2B-LORAfied
18
 
19
+ This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
20
 
21
  ## Model description
22
 
 
39
  - train_batch_size: 2
40
  - eval_batch_size: 8
41
  - seed: 42
42
+ - gradient_accumulation_steps: 4
43
+ - total_train_batch_size: 8
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_ratio: 0.05
47
+ - training_steps: 593
48
 
49
  ### Framework versions
50
 
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3a205c6a90e6665fed9f5b1bc1ad88b47b4dbc87c2a3229a0aecfac94937b5a1
3
  size 156926880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18cb5c51d94a91d60c4ba2583f3c82cbfdbaf7ebc396be1aba83968a328f6e0a
3
  size 156926880