Update README.md
Browse files
README.md
CHANGED
@@ -54,7 +54,7 @@ The optimizer used is AdaFactor with inverse square root learning rate schedule
|
|
54 |
|
55 |
### Fine-tuning
|
56 |
|
57 |
-
This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing
|
58 |
|
59 |
|
60 |
## Evaluation results
|
|
|
54 |
|
55 |
### Fine-tuning
|
56 |
|
57 |
+
This model was then fine-tuned on a single TPU Pod V2-8 for 500 steps in total, using sequence length 512 (batch size 256), using only the dataset only containing csharp code.
|
58 |
|
59 |
|
60 |
## Evaluation results
|