dsmonk committed on
Commit
ffb2261
1 Parent(s): 875de25

End of training

README.md CHANGED
@@ -6,6 +6,7 @@ tags:
  model-index:
  - name: falcon-7b-tuned-alpaca
  results: []
+ library_name: peft
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,6 +30,17 @@ More information needed
 
  ## Training procedure
 
+
+ The following `bitsandbytes` quantization config was used during training:
+ - load_in_8bit: False
+ - load_in_4bit: True
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: fp4
+ - bnb_4bit_use_double_quant: False
+ - bnb_4bit_compute_dtype: float32
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
@@ -47,6 +59,7 @@ The following hyperparameters were used during training:
 
  ### Framework versions
 
+ - PEFT 0.5.0.dev0
  - Transformers 4.32.0.dev0
  - Pytorch 2.0.1+cu117
  - Datasets 2.4.0
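
For readers who want to reproduce the setup, the `bitsandbytes` settings that this commit adds to README.md map onto the keyword arguments of `transformers.BitsAndBytesConfig`. The sketch below is one possible reconstruction, not the author's training script: `QUANT_KWARGS`, `build_quant_config`, and `load_adapter` are hypothetical names, and the base model id and adapter repo id are left as parameters because the diff does not state them.

```python
# Values copied from the "bitsandbytes quantization config" list in the model card.
# bnb_4bit_compute_dtype is handled separately because it needs a torch dtype.
QUANT_KWARGS = {
    "load_in_8bit": False,
    "load_in_4bit": True,
    "llm_int8_threshold": 6.0,
    "llm_int8_skip_modules": None,
    "llm_int8_enable_fp32_cpu_offload": False,
    "llm_int8_has_fp16_weight": False,
    "bnb_4bit_quant_type": "fp4",
    "bnb_4bit_use_double_quant": False,
}


def build_quant_config():
    """Build a BitsAndBytesConfig mirroring the card (needs torch + transformers)."""
    import torch
    from transformers import BitsAndBytesConfig

    return BitsAndBytesConfig(bnb_4bit_compute_dtype=torch.float32, **QUANT_KWARGS)


def load_adapter(base_model_id: str, adapter_id: str):
    """Load a 4-bit base model and attach a PEFT adapter on top of it.

    Both ids are assumptions supplied by the caller; the diff names neither.
    Requires a GPU with bitsandbytes installed.
    """
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        quantization_config=build_quant_config(),
        device_map="auto",
    )
    return PeftModel.from_pretrained(base, adapter_id)
```

The heavy imports live inside the functions so the settings dictionary can be inspected on a machine without a GPU or the libraries installed.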
runs/Jul18_22-40-02_nbd8bx3wd8/events.out.tfevents.1689720006.nbd8bx3wd8.69.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8e74a2aa23885fdb828f5e869ea2574e6e649a58b8b3646bb0a25ff233ed325c
- size 6868
+ oid sha256:55df5b9e19b6c89cc0253dcd1116a65b6b86ef123010595502c7928bd8cd29ab
+ size 7222
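
The TensorBoard event file is tracked with Git LFS, so the diff shows only the pointer file: a spec version, a SHA-256 object id, and a byte size. As a rough illustration of how to read such a pointer, the sketch below parses the old and new pointers from this commit and compares their sizes. `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    fields["oid_algo"] = algo
    fields["oid_digest"] = digest
    return fields


OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:8e74a2aa23885fdb828f5e869ea2574e6e649a58b8b3646bb0a25ff233ed325c
size 6868
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:55df5b9e19b6c89cc0253dcd1116a65b6b86ef123010595502c7928bd8cd29ab
size 7222
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Number of bytes the event log grew by in this commit.
size_delta = new["size"] - old["size"]
print(f"event file grew by {size_delta} bytes")
```

The size difference reflects only how much was appended to the event log; the pointer itself says nothing about what the new log entries contain.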