autoevaluator HF staff committed on
Commit
8281edf
1 Parent(s): 00c25ba

Add evaluation results on the mathemakitten--winobias_antistereotype_dev config and validation split of mathemakitten/winobias_antistereotype_dev

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_dev config and validation split of the [mathemakitten/winobias_antistereotype_dev](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_dev) dataset by @puffy310, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_dev-mathemakitte-398e1c-2536177709).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_dev).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_dev).

Files changed (1)
  1. README.md +22 -0
README.md CHANGED
@@ -1,6 +1,28 @@
 ---
 datasets:
 - Whispering-GPT/yannic-kilcher-transcript
+model-index:
+- name: BirdL/OLM-GPT2-Yannic
+  results:
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_dev
+      type: mathemakitten/winobias_antistereotype_dev
+      config: mathemakitten--winobias_antistereotype_dev
+      split: validation
+    metrics:
+    - type: accuracy
+      value: 0.5
+      name: Accuracy
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNTAzZTVjMjllNDE4N2NhZjVmNWU5NGU5MmNmOGVmZWUwMDVmNTRlNDM2OTc2YzcwZjA1ZjMwMmM0OTVkYTgwNiIsInZlcnNpb24iOjF9.AYptBsoQxvVm8weMxGYfjmXaNOIrSSPkwSqMmCyxuSCpld8KmQksVzmf0fz1tmn16Mjh_rnT6a8pcOp5Otd_Ag
+    - type: loss
+      value: 1.1590285866899648
+      name: Loss
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWE1YmMxZWQ4NGNkZDE5MGI2OTFjYTU2MjhhNzY4MjhhYmVmMmFlZDc2ZGY1ZDJmNTI1ZWE2ZGEwNjk1ODliYyIsInZlcnNpb24iOjF9.0WbjBvdLgaF7NcG0TnSTgQ_-4blOCzdeW15comZNvylnm7sY7bjH_4soR_7LaBBrhiWiMImzU8YHRbXCKMDJDA
 ---
 A LLM finetuned from [OLM's GPT model](https://huggingface.co/Tristan/olm-gpt2-oct-2022) on [Yannic K's Youtube Channel](https://huggingface.co/datasets/Whispering-GPT/yannic-kilcher-transcript)
 Under [BirdL-AirL++](https://github.com/BIRD-Laboratories/BirdL/blob/main/LICENSE.md)
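
For anyone reviewing the added `model-index` block, here is a minimal Python sketch of how its metrics could be read out. The metadata is hand-copied into a dictionary (the `verifyToken` fields are left out), and the helper name `collect_metrics` is hypothetical, not part of the evaluator or the Hub tooling. The chance-level comparison at the end assumes each example offers exactly two candidate classes, which this PR does not state.

```python
# Hand-copied from the model-index block added in this diff
# (verifyToken fields omitted for brevity).
MODEL_INDEX = [
    {
        "name": "BirdL/OLM-GPT2-Yannic",
        "results": [
            {
                "task": {
                    "type": "zero-shot-classification",
                    "name": "Zero-Shot Text Classification",
                },
                "dataset": {
                    "name": "mathemakitten/winobias_antistereotype_dev",
                    "type": "mathemakitten/winobias_antistereotype_dev",
                    "config": "mathemakitten--winobias_antistereotype_dev",
                    "split": "validation",
                },
                "metrics": [
                    {"type": "accuracy", "value": 0.5, "name": "Accuracy", "verified": True},
                    {"type": "loss", "value": 1.1590285866899648, "name": "Loss", "verified": True},
                ],
            }
        ],
    }
]


def collect_metrics(model_index):
    """Hypothetical helper: flatten a model-index list into
    {(dataset type, split, metric type): value}."""
    flat = {}
    for entry in model_index:
        for result in entry["results"]:
            dataset = result["dataset"]
            for metric in result["metrics"]:
                key = (dataset["type"], dataset["split"], metric["type"])
                flat[key] = metric["value"]
    return flat


metrics = collect_metrics(MODEL_INDEX)

# ASSUMPTION: two candidate classes per example; not confirmed by the PR.
NUM_CLASSES = 2
chance_accuracy = 1 / NUM_CLASSES

accuracy = metrics[("mathemakitten/winobias_antistereotype_dev", "validation", "accuracy")]
print(f"accuracy={accuracy}, chance level under the assumption={chance_accuracy}")
```

Under that two-class assumption, the reported accuracy would sit at the chance level, so the comparison is worth checking against the dataset card before reading anything into the number.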