gemma_knowledg_tree - a ping0rr Collection

ping0rr 's Collections

gemma_knowledg_tree

gemma_knowledg_tree

updated Feb 26

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 45
Measuring Massive Multitask Language Understanding

Paper • 2009.03300 • Published Sep 7, 2020 • 3
HellaSwag: Can a Machine Really Finish Your Sentence?

Paper • 1905.07830 • Published May 19, 2019 • 4
PIQA: Reasoning about Physical Commonsense in Natural Language

Paper • 1911.11641 • Published Nov 26, 2019 • 2
SocialIQA: Commonsense Reasoning about Social Interactions

Paper • 1904.09728 • Published Apr 22, 2019 • 2
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

Paper • 1905.10044 • Published May 24, 2019 • 1
On the Measure of Intelligence

Paper • 1911.01547 • Published Nov 5, 2019
Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 7
Program Synthesis with Large Language Models

Paper • 2108.07732 • Published Aug 16, 2021 • 4
Training Verifiers to Solve Math Word Problems

Paper • 2110.14168 • Published Oct 27, 2021 • 4
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

Paper • 2304.06364 • Published Apr 13, 2023 • 2
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Paper • 2206.04615 • Published Jun 9, 2022 • 5
BBQ: A Hand-Built Bias Benchmark for Question Answering

Paper • 2110.08193 • Published Oct 15, 2021 • 1
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models

Paper • 2009.11462 • Published Sep 24, 2020
TruthfulQA: Measuring How Models Mimic Human Falsehoods

Paper • 2109.07958 • Published Sep 8, 2021 • 1
ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection

Paper • 2203.09509 • Published Mar 17, 2022 • 2