smcleod commited on
Commit
1b011b0
1 Parent(s): 56c089e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -26,3 +26,8 @@ This should (hopefully) make it quite capable with Golang coding tasks.
26
 
27
  I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
28
  The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.
 
 
 
 
 
 
26
 
27
  I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
28
  The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.
29
+
30
+ The dataset I created for this consists of multiple golang/programming focused datasets cleaned and merged and my own synthetically generated dataset based on several open source golang coding guides.
31
+
32
+ - https://huggingface.co/datasets/smcleod/golang-coder
33
+ - https://huggingface.co/datasets/smcleod/golang-programming-style-best-practices