powered_by_intel_llm_leaderboard/status/leaderboard_status_050124.csv
Inference Tested,Model,Average,Training Hardware,ARC,HellaSwag,MMLU,TruthfulQA,Winogrande,Model Type,Precision,Size,Infrastructure,Affiliation
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-3,71.574,Gaudi,66.89,85.26,63.07,63.01,79.64,fine-tuned,fp16,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-2,70.858,Gaudi,67.49,83.92,63.55,59.68,79.65,fine-tuned,fp16,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-1,70.002,Gaudi,66.21,83.64,62.37,59.65,78.14,fine-tuned,fp16,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-1,69.972,Gaudi,66.3,83.6,62.44,59.54,77.98,fine-tuned,bf16,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3,69.906,Gaudi,67.15,83.29,62.26,58.77,78.06,fine-tuned,fp16,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-1,69.89,Gaudi,65.7,83.54,62.12,59.48,78.61,fine-tuned,int8,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,Intel/neural-chat-7b-v3-1,68.256,Gaudi,64.25,82.49,60.79,56.4,77.35,fine-tuned,int4,7,Intel Developer Cloud,Intel Engineering
<p>&#129000 &#128998 &#128997 &#128992</p>,LaZeAsh/gemma-2b-lahacks,50.906,GPU Max,42.75,64.35,38.68,45.61,63.14,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992</p>,mksethi/khalsaa,49.612,GPU Max,42.67,71.45,35.52,32.73,65.69,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992</p>,migaraa/Gemma2B-LORAfied,49.566,GPU Max,42.56,71.56,35.47,32.57,65.67,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,Aprajita0/Gemma-2b-Lora,49.526,GPU Max,42.24,71.88,34.14,33.23,66.14,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,SSK0908/gemma-2b-dolly-qa,49.452,GPU Max,42.15,71.92,33.98,33.15,66.06,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,apfurman/gemma-dolly-agriculture,49.448,GPU Max,42.49,71.39,35.44,32.65,65.27,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992</p>,gopalakrishnan-d/gemma-2b-dolly-ds-lora,49.378,GPU Max,42.24,71.88,34.11,33.15,65.51,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,FunDialogues/dollygem-2b-LoRA,49.368,GPU Max,41.55,71.56,35.39,32.59,65.75,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992</p>,chchimdi/gemma-Chimdi-LORA-TUNED,49.326,GPU Max,41.89,71.8,34.23,33.2,65.51,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,utkarshsingh99/confused-gemma,49.306,GPU Max,42.06,71.86,34.14,33.2,65.27,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,jhineric/gemma-prompt,49.304,GPU Max,42.01,71.61,34.52,32.79,65.59,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992</p>,ThejasElandassery/dallema,49.134,GPU Max,41.64,72.01,33.67,33.08,65.27,fine-tuned,bf16,2,Intel Developer Cloud,Student Ambassador
<p>&#129000 &#128998 &#128997 &#128992</p>,Maelstrome/gemma-2b-storytelling,48.946,Xeon,41.72,71.32,33.82,33.15,64.72,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 </p>,pikhan/gpt2-medium-biochem-bioasq-pubmedqa-demo,33.482,Xeon,25.85,40.14,23.11,39.19,39.12,fine-tuned,bf16,2,Intel Developer Cloud,No Affiliation
<p>&#129000 &#128998 &#128997 &#128992 &#128994</p>,FunDialogues/llamav2-LoRaco-7b-merged,58.16,Gaudi,44.97,77.16,43.1,45.77,79.8,fine-tuned,fp32,7,Intel Developer Cloud,Intel Engineering