smcleod
/

llama-3-1-8b-smcleod-golang-coder-v3

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

smcleod commited on Aug 22

Commit

1b011b0

•

1 Parent(s): 56c089e

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -26,3 +26,8 @@ This should (hopefully) make it quite capable with Golang coding tasks.
 I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
 The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.

 I trained this model (based on Llama 3.1 8b) on a merged dataset I created consisting of 50,627 rows, 13.3M input tokens and 2.2M output tokens.
 The total training consisted of 1,020,719 input tokens and 445,810 output tokens from 45,565 items in the dataset.
+The dataset I created for this consists of multiple golang/programming focused datasets cleaned and merged and my own synthetically generated dataset based on several open source golang coding guides.
+- https://huggingface.co/datasets/smcleod/golang-coder
+- https://huggingface.co/datasets/smcleod/golang-programming-style-best-practices