ajibawa-2023 commited on
Commit
5e68bb2
1 Parent(s): 038e57a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +71 -0
README.md CHANGED
@@ -1,3 +1,74 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - ajibawa-2023/Code-290k-ShareGPT
5
+ - m-a-p/Code-Feedback
6
+ - microsoft/orca-math-word-problems-200k
7
+ - teknium/openhermes
8
+ language:
9
+ - en
10
+ tags:
11
+ - code
12
+ - mathematics
13
  ---
14
+
15
+ **Code-Mistral-7B**
16
+
17
+
18
+ This Model is trained on refined version of my dataset [Code-290k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Code-290k-ShareGPT). Besides this it is trained on following datasets:
19
+ [Code-Feedback](https://huggingface.co/datasets/m-a-p/Code-Feedback)
20
+ [orca-math-word-problems-200k](https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k)
21
+ [Openhermes](https://huggingface.co/datasets/teknium/openhermes)
22
+
23
+ The idea was to check how this Model will perform with both Code & Maths datasets. This model is very good with Coding.
24
+ Maths is still hit & miss but you can test out this model.
25
+
26
+ This Model is trained on massive datasets so the results are very good.
27
+ I have used ChatML prompt format.
28
+
29
+ Kindly note this is qLoRA version, a rare exception.
30
+
31
+
32
+ **Training:**
33
+ Entire dataset was trained on 4 x A100 80GB. For 3 epoch, training took almost 33 Hours. Axolotl codebase was used for training purpose.
34
+ Entire data is trained on Mistral.
35
+
36
+ **Example Prompt:**
37
+ This model uses **ChatML** prompt format.
38
+
39
+ ```
40
+ <|im_start|>system
41
+ You are a helpful AI assistant.<|im_end|>
42
+ <|im_start|>user
43
+ {prompt}<|im_end|>
44
+ <|im_start|>assistant
45
+
46
+ ```
47
+ You can modify above Prompt as per your requirement.
48
+
49
+
50
+ I want to say special Thanks to the Open Source community for helping & guiding me to better understand the AI/Model development.
51
+
52
+ Thank you for your love & support.
53
+
54
+
55
+ **Example Output**
56
+
57
+ Example 1
58
+ **C++**
59
+
60
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/jcmEZSRX7s7-B_ZybWwwN.jpeg)
61
+
62
+ **Error Resolving**
63
+
64
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/iy89IxjiZXAY4Id-ieLg7.jpeg)
65
+
66
+ **Matrices**
67
+
68
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/zFfq9lBA63wQzy0tP3_hd.jpeg)
69
+
70
+ **Machine Learning**
71
+
72
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/Nv8dCpNxRtJGkOuulKzmn.jpeg)
73
+
74
+