pszemraj/griffin-llama3t-8L-v0.02-fineweb

Text Generation

recurrent_gemma

Generated from Trainer

Inference Endpoints

pszemraj committed on Apr 28, 2024

Commit 96dac12 • 1 Parent(s): f5e092e

Update README.md

README.md +20 -0

@@ -26,6 +26,26 @@ It achieves the following results on the evaluation set:
 - Accuracy: 0.1881
 - Num Input Tokens Seen: 766509056
+## evals
+tl;dr it's bad/would need more training:
+hf (pretrained=pszemraj/griffin-llama3t-8L-v0.02-fineweb,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 4
+|    Tasks     |Version|Filter|n-shot|  Metric  |   Value   |   |  Stderr  |
+|--------------|------:|------|-----:|----------|----------:|---|---------:|
+|winogrande    |      1|none  |     0|acc       |     0.4964|±  |    0.0141|
+|piqa          |      1|none  |     0|acc       |     0.5332|±  |    0.0116|
+|              |       |none  |     0|acc_norm  |     0.5299|±  |    0.0116|
+|openbookqa    |      1|none  |     0|acc       |     0.1280|±  |    0.0150|
+|              |       |none  |     0|acc_norm  |     0.2320|±  |    0.0189|
+|lambada_openai|      1|none  |     0|perplexity|638060.0702|±  |43608.0044|
+|              |       |none  |     0|acc       |     0.0000|±  |    0.0000|
+|boolq         |      2|none  |     0|acc       |     0.3783|±  |    0.0085|
+|arc_easy      |      1|none  |     0|acc       |     0.2614|±  |    0.0090|
+|              |       |none  |     0|acc_norm  |     0.2744|±  |    0.0092|
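To put the "bad" verdict in numbers, here is a rough sketch that compares the zero-shot `acc` values from the table against uniform-guess baselines. The `acc` values are copied from the table. The baselines are an assumption of this sketch, not something the README states: 0.50 for the two-option tasks (winogrande, piqa, boolq) and 0.25 for the four-option tasks (openbookqa, and arc_easy treated as four-way even though a few of its items have a different number of options). The helper name `margin_over_chance` is made up for illustration.

```python
# Zero-shot `acc` values copied from the evals table.
reported_acc = {
    "winogrande": 0.4964,
    "piqa": 0.5332,
    "openbookqa": 0.1280,
    "boolq": 0.3783,
    "arc_easy": 0.2614,
}

# ASSUMPTION: uniform-guess baselines, i.e. 1 / (number of answer options).
# arc_easy is approximated as 4-way multiple choice.
chance_acc = {
    "winogrande": 0.50,
    "piqa": 0.50,
    "openbookqa": 0.25,
    "boolq": 0.50,
    "arc_easy": 0.25,
}


def margin_over_chance(reported, chance):
    """Return reported accuracy minus the assumed chance accuracy per task."""
    return {task: round(reported[task] - chance[task], 4) for task in reported}


margins = margin_over_chance(reported_acc, chance_acc)

# Print tasks from worst to best margin.
for task, margin in sorted(margins.items(), key=lambda kv: kv[1]):
    print(
        f"{task:<12} acc={reported_acc[task]:.4f} "
        f"chance={chance_acc[task]:.2f} margin={margin:+.4f}"
    )
```

Under these assumed baselines no task is more than a few points above guessing, and openbookqa and boolq come out well below it, which fits the tl;dr that the checkpoint would need more training.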
 ## Training procedure
 ### Training hyperparameters