Update README.md
Browse files
README.md
CHANGED
@@ -26,3 +26,8 @@ This should (hopefully) make it quite capable with Golang coding tasks.
|
|
26 |
|
27 |
I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
|
28 |
The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
|
28 |
The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.
|
29 |
+
|
30 |
+
The dataset I created for this consists of multiple golang/programming focused datasets cleaned and merged and my own synthetically generated dataset based on several open source golang coding guides.
|
31 |
+
|
32 |
+
- https://huggingface.co/datasets/smcleod/golang-coder
|
33 |
+
- https://huggingface.co/datasets/smcleod/golang-programming-style-best-practices
|