Add evaluation results on the mathemakitten--winobias_antistereotype_dev config and validation split of mathemakitten/winobias_antistereotype_dev

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_dev config and validation split of the [mathemakitten/winobias_antistereotype_dev](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_dev) dataset by

@puffy310

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_dev-mathemakitte-398e1c-2536177709).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_dev).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_dev).

Files changed (1) hide show

README.md +22 -0

README.md CHANGED Viewed

@@ -1,6 +1,28 @@
 ---
 datasets:
 - Whispering-GPT/yannic-kilcher-transcript
 ---
 A LLM finetuned from [OLM's GPT model](https://huggingface.co/Tristan/olm-gpt2-oct-2022) on [Yannic K's Youtube Channel](https://huggingface.co/datasets/Whispering-GPT/yannic-kilcher-transcript)
 Under [BirdL-AirL++](https://github.com/BIRD-Laboratories/BirdL/blob/main/LICENSE.md)

 ---
 datasets:
 - Whispering-GPT/yannic-kilcher-transcript
+model-index:
+- name: BirdL/OLM-GPT2-Yannic
+  results:
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_dev
+      type: mathemakitten/winobias_antistereotype_dev
+      config: mathemakitten--winobias_antistereotype_dev
+      split: validation
+    metrics:
+    - type: accuracy
+      value: 0.5
+      name: Accuracy
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNTAzZTVjMjllNDE4N2NhZjVmNWU5NGU5MmNmOGVmZWUwMDVmNTRlNDM2OTc2YzcwZjA1ZjMwMmM0OTVkYTgwNiIsInZlcnNpb24iOjF9.AYptBsoQxvVm8weMxGYfjmXaNOIrSSPkwSqMmCyxuSCpld8KmQksVzmf0fz1tmn16Mjh_rnT6a8pcOp5Otd_Ag
+    - type: loss
+      value: 1.1590285866899648
+      name: Loss
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWE1YmMxZWQ4NGNkZDE5MGI2OTFjYTU2MjhhNzY4MjhhYmVmMmFlZDc2ZGY1ZDJmNTI1ZWE2ZGEwNjk1ODliYyIsInZlcnNpb24iOjF9.0WbjBvdLgaF7NcG0TnSTgQ_-4blOCzdeW15comZNvylnm7sY7bjH_4soR_7LaBBrhiWiMImzU8YHRbXCKMDJDA
 ---
 A LLM finetuned from [OLM's GPT model](https://huggingface.co/Tristan/olm-gpt2-oct-2022) on [Yannic K's Youtube Channel](https://huggingface.co/datasets/Whispering-GPT/yannic-kilcher-transcript)
 Under [BirdL-AirL++](https://github.com/BIRD-Laboratories/BirdL/blob/main/LICENSE.md)