Oscar Sainz
OSainz
AI & ML interests
Artificial Intelligence, Natural Language Processing, Information Extraction, Zero- and Few-Shot Learning.
OSainz's activity
missing tokenizer?
1
#1 opened 5 months ago by mradermacher
GPT-3.5 HumanEval_R CodeForces2305 contamination based on https://arxiv.org/abs/2402.15938
1
#28 opened 6 months ago by suryanshs16103
Add reports from Benchmarking paper "Benchmark Leakage in Large Language Models"
1
#27 opened 7 months ago by SinclairWang
Update contamination_report.csv
1
#26 opened 7 months ago by suryanshs16103
Update contamination.csv
1
#25 opened 7 months ago by suryanshs16103
Add data from "An Open-Source Data Contamination Report for Large Language Models"
6
#5 opened 8 months ago by vishaal27
Add Reports Based on "Llemma: An Open Language Model For Mathematics"
1
#23 opened 7 months ago by wlchen
add flores contamination in xP3
5
#20 opened 7 months ago by davidstap
Add Aquila model series which have gsm8k test set contamination
1
#21 opened 7 months ago by bpHigh
GPT-3.5 Spider contamination based on https://arxiv.org/pdf/2402.08100
3
#18 opened 7 months ago by bpHigh
Should indirect data leakages be included in the Data Contamination Database?
2
#19 opened 7 months ago by bpHigh
File fixes and cleaning
#17 opened 7 months ago by OSainz
Superglue/RealNews Contamination based on "Noise-Robust De-Duplication at Scale"
1
#15 opened 7 months ago by emilys
Mistral 7B Arc Easy Contamination based on "Proving Test Set Contamination in Black Box Language Models"
1
#14 opened 7 months ago by AmeyaPrabhu
Added Contamination Evidence from GPT4 Tech Report using String matching on GPT-4
9
#11 opened 7 months ago by AmeyaPrabhu
GPT-3.5Turbo HumanEval Contamination based on "Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models"
1
#16 opened 7 months ago by jupyter31
Added Contamination Evidence on MMLU of ChatGPT/GPT4 from "Investigating data contamination in modern benchmarks for large language models"
7
#10 opened 7 months ago by AmeyaPrabhu
Added Contamination Info on Old Models: GPT3, FLAN, GLaM, PaLM, PaLM 2
3
#13 opened 7 months ago by AmeyaPrabhu
Contamination results based on "Data Contamination Quiz"
5
#9 opened 8 months ago by shahriargolchin
Code contamination in HumanEval and MBPP
1
#12 opened 7 months ago by AmeyaPrabhu