taylorbobaylor committed on
Commit
5398e97
1 Parent(s): 1e9e39c

Upload model

Files changed (1)
  1. README.md +21 -1
README.md CHANGED
@@ -1,8 +1,9 @@
  ---
  license: bigcode-openrail-m
- base_model: bigcode/tiny_starcoder_py
+ library_name: peft
  tags:
  - generated_from_trainer
+ base_model: bigcode/tiny_starcoder_py
  model-index:
  - name: peft-lora-starcoder-chat-asst-A100-40GB-colab
  results: []
@@ -58,3 +59,22 @@ The following hyperparameters were used during training:
  - Pytorch 2.0.0
  - Datasets 2.15.0
  - Tokenizers 0.15.0
+ ## Training procedure
+
+
+ The following `bitsandbytes` quantization config was used during training:
+ - quant_method: bitsandbytes
+ - load_in_8bit: False
+ - load_in_4bit: True
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: nf4
+ - bnb_4bit_use_double_quant: True
+ - bnb_4bit_compute_dtype: bfloat16
+
+ ### Framework versions
+
+
+ - PEFT 0.6.3.dev0
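
For readers who want to reproduce the setup, the quantization settings added to the README in this commit can be expressed as a `transformers` `BitsAndBytesConfig`. The following is a minimal sketch, not the author's training script: the helper names `build_bnb_config` and `load_adapter` and the `adapter_id` parameter are hypothetical, and `quant_method` is omitted because it is recorded by the library and is not a constructor argument. Running the helpers assumes `torch`, `transformers`, `peft`, and `bitsandbytes` are installed and a CUDA GPU is available.

```python
# Values copied from the `bitsandbytes` list in the README diff above.
QUANT_CONFIG = {
    "load_in_8bit": False,
    "load_in_4bit": True,
    "llm_int8_threshold": 6.0,
    "llm_int8_skip_modules": None,
    "llm_int8_enable_fp32_cpu_offload": False,
    "llm_int8_has_fp16_weight": False,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
    "bnb_4bit_compute_dtype": "bfloat16",
}


def build_bnb_config(settings=None):
    """Build a BitsAndBytesConfig from the README settings (hypothetical helper).

    Imports are deferred so this module can be loaded without torch installed.
    """
    import torch
    from transformers import BitsAndBytesConfig

    kwargs = dict(QUANT_CONFIG if settings is None else settings)
    # The README records the dtype as a string; map it to the torch dtype object.
    kwargs["bnb_4bit_compute_dtype"] = getattr(torch, kwargs["bnb_4bit_compute_dtype"])
    return BitsAndBytesConfig(**kwargs)


def load_adapter(adapter_id, base_model_id="bigcode/tiny_starcoder_py"):
    """Load the 4-bit base model and attach a LoRA adapter (hypothetical helper).

    `adapter_id` is a placeholder for the adapter's Hub repo id or a local path;
    the base model id is the `base_model` value from the README metadata.
    """
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        quantization_config=build_bnb_config(),
    )
    return PeftModel.from_pretrained(base, adapter_id)
```

Keeping the settings in a plain dictionary makes it easy to compare them against the README list line by line before any model weights are downloaded.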