gouthamsk committed on
Commit 38e1493
1 Parent(s): 4496292

gouthamsk/mistral_c_coder

README.md CHANGED
@@ -1,40 +1,50 @@
+ ---
+ license: apache-2.0
+ library_name: peft
+ tags:
+ - trl
+ - sft
+ - generated_from_trainer
+ base_model: mistralai/Mistral-7B-Instruct-v0.2
+ datasets:
+ - generator
+ model-index:
+ - name: mistral_embedded_c
+ results: []
+ ---
+
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
- ![This is an alt text.](http://biboxlabs.com/wp-content/uploads/2021/07/bibox-labs-logo.png 'Logo.')

- For more Info vist [Biboxlabs](https://biboxlabs.com/) or [Red Nerd](https://www.therednerds.com/)
  # mistral_embedded_c

  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
- It has been fine-tuned using LoRa (Long Range) technology, enhancing its capabilities for applications requiring long-range communication.

  ## Model description
- The `mistral_embedded_c` model is specifically designed for generating accurate embedded C code targeting microcontrollers such as esp32, pic18f, pic10f, and 8051 MCU. It is a fine-tuned variant of the Mistral-7B-Instruct-v0.2 model, customized to cater to the needs of embedded systems development.

+ More information needed

  ## Intended uses & limitations
- The model can be utilized for prompting to generate precise embedded C code suitable for a variety of microcontrollers including esp32, pic18f, pic10f, and 8051 MCU. However, it should be noted that while the model is trained on a diverse dataset encompassing assembly and C code for these microcontrollers, its performance may vary depending on the complexity and specificity of the task.

+ More information needed

  ## Training and evaluation data

- The model is trained of the mixed dataset which contains assembly and c code for esp32, pic18f, pic10f, and 8051 MCU.
- Training Data is specifically curated for specific purposes to evaluate the embedded c code.
+ More information needed

  ## Training procedure
- The training was done on an A100-40GB accelerator.

  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 0.0002
- - train_batch_size: 4
+ - learning_rate: 0.0003
+ - train_batch_size: 6
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: constant
  - lr_scheduler_warmup_steps: 0.03
- - training_steps: 300
+ - training_steps: 250

  ### Training results
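
To put the hyperparameter change in the README hunk in concrete terms, the sketch below compares how many training examples each run would consume. It assumes a single device and no gradient accumulation, neither of which is stated in this commit; `examples_seen` and the two dictionaries are hypothetical names used only for illustration.

```python
def examples_seen(train_batch_size: int, training_steps: int,
                  grad_accum_steps: int = 1, num_devices: int = 1) -> int:
    """Number of training examples consumed over a fixed number of optimizer steps.

    grad_accum_steps and num_devices default to 1; both are assumptions,
    since the model card does not report them.
    """
    return train_batch_size * training_steps * grad_accum_steps * num_devices


# Values copied from the removed (-) and added (+) lines of the README diff.
old_run = {"learning_rate": 2e-4, "train_batch_size": 4, "training_steps": 300}
new_run = {"learning_rate": 3e-4, "train_batch_size": 6, "training_steps": 250}

old_total = examples_seen(old_run["train_batch_size"], old_run["training_steps"])
new_total = examples_seen(new_run["train_batch_size"], new_run["training_steps"])

print(f"old run: {old_total} examples, new run: {new_total} examples")
```

Under those assumptions the new run sees 1500 examples instead of 1200, in fewer optimizer steps and at a higher learning rate.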
 
adapter_config.json CHANGED
@@ -19,8 +19,8 @@
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
- "v_proj",
- "q_proj"
+ "q_proj",
+ "v_proj"
  ],
  "task_type": "CAUSAL_LM",
  "use_dora": false,
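
The `adapter_config.json` hunk only swaps the order of the two `target_modules` entries. A minimal sketch of checking that, assuming the list is consumed as an unordered collection of module names (an assumption about the adapter loader, not something this diff confirms); `same_target_modules` is a hypothetical helper and the JSON strings are reduced fragments of the real file:

```python
import json

# Reduced fragments holding only the field touched by the diff.
old_fragment = '{"target_modules": ["v_proj", "q_proj"]}'
new_fragment = '{"target_modules": ["q_proj", "v_proj"]}'


def same_target_modules(old_cfg: dict, new_cfg: dict) -> bool:
    """Compare target_modules while ignoring list order."""
    return set(old_cfg["target_modules"]) == set(new_cfg["target_modules"])


old_cfg = json.loads(old_fragment)
new_cfg = json.loads(new_fragment)

ordered_equal = old_cfg["target_modules"] == new_cfg["target_modules"]
unordered_equal = same_target_modules(old_cfg, new_cfg)

print(f"same order: {ordered_equal}, same modules: {unordered_equal}")
```

If that assumption holds, both versions adapt the same two projection layers, and the change in this file is cosmetic.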
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a5b582d51a443321ddf95a24b85586415f9811970e889dc150e572b57f9807ab
+ oid sha256:94c21bffc638995d4db56e042a6a362871c4eb542c0d2b348ec113e64c20a4e8
  size 109069176
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:74bfa0c4f571ce7c6291893b7d4a6e8285deb6607e3e98773d1dafefdc0a0757
+ oid sha256:aa3964f56281ec4e7750b2236aea67547bd036116947cf99c73ba51f320f2718
  size 4920
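
The two binary files appear in the diff only as Git LFS pointer files: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. A minimal sketch of reading such pointers and comparing them, using the `adapter_model.safetensors` values from this commit; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for adapter_model.safetensors, copied from the diff.
old_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:a5b582d51a443321ddf95a24b85586415f9811970e889dc150e572b57f9807ab
size 109069176
""")
new_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:94c21bffc638995d4db56e042a6a362871c4eb542c0d2b348ec113e64c20a4e8
size 109069176
""")

same_size = old_pointer["size"] == new_pointer["size"]
same_content = old_pointer["digest"] == new_pointer["digest"]

print(f"same size: {same_size}, same content: {same_content}")
```

An unchanged size with a different digest, as seen for both `adapter_model.safetensors` and `training_args.bin`, means the stored bytes changed while the file length stayed the same.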