Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Commit History
added 'forbidden models' submission, to allow orgs to request their models to not be submitted in case of contamination
ed1fdef
Clémentine
commited on
simplified header text
6fefae4
Clémentine
commited on
Flagged model per discussion
6e039c4
Clémentine
commited on
removed need for tokens in the leaderboard + removed skull in flagged models
a40c960
Clémentine
commited on
Adding flagging systemi, removing changelog
699e8ff
Clémentine
commited on
Adding link to detailed results and evals (#203)
6254b87
Update src/auto_leaderboard/model_metadata_type.py
1108259
Update src/auto_leaderboard/model_metadata_type.py (#199)
0ba9d60
Small update and reorg of text (#198)
d313dbd
look at model info if not in request file
a33e66d
Clémentine
commited on
typo fix
1b7afb7
Clémentine
commited on
Updated model metadata according to #162
a5023e4
Clémentine
commited on
Re-labelled model types (#162)
16a8ecc
clarified how to get MMLU average
6095833
Clémentine
commited on
rm lighteval sha from id
2bb5ded
Clémentine
commited on
added precision
6eaad72
Clémentine
commited on
fix rounding
d350941
Clémentine
commited on
corrected display of symbols
35763fc
Clémentine
commited on
Merge branch 'main' into link_requests_and_results
a79408c
Nathan Habib
commited on
get model type info from request file
80f4eeb
Nathan Habib
commited on
added more nuance in ft models
95f85ed
Clémentine
commited on
Update src/auto_leaderboard/model_metadata_type.py (#159)
eedc0f1
corrected finetuned mentions
0f4a719
Clémentine
commited on
Update newcomers (#153)
d7daa68
look for model type in request file
d295afa
Nathan Habib
commited on
Update new model types (#150)
9977ce1
Fix Typo (#144)
6ddee3c
add two more (#142)
5d5681a
More metadata type updates (#141)
3602349
Wording: number of parameters? (#138)
edd3aac
Add type to more models (#140)
6d0d93c
model type filters (#139)
601f2e9
updated design to select columns to display
ecef2dc
Clémentine
commited on
Short emoji legend on first page (#133)
a66fcca
only display the scores for the latest result file
d6b3d82
Nathan Habib
commited on
Update model types (#126)
9d5015b
Added icons for types + fixed pending queue
b323764
Clémentine
commited on
wip adding symbols to model types
217b585
Clémentine
commited on
fix new config name
4aff44e
Nathan Habib
commited on
FT: precision and adapter models
12cea14
Clémentine
commited on
Update src/assets/text_content.py
a0b557b
updated model param number reader
1df8383
Clémentine
commited on
updated version
788108a
Clémentine
commited on
added precision for truthfulqa 6 shot
18916e3
Clémentine
commited on
Small fix - we do not want to display models where the MMLU is old with models where the MMLU is new - however, since version is displayed in the results, we keep the files
97b27da
Clémentine
commited on
Add details on the datasets for reproducibility (#107)
256c5d3
Using the new backend
d16cee2
Linker1907
commited on
small fix link Ilyas leaderboard
e868f35
Clémentine
commited on
added harness command
d2e8eca
Clémentine
commited on