Neel Nanda's picture

3 2 7

Neel Nanda

NeelNanda

·

https://neelnanda.io

AI & ML interests

Mechanistic Interpretability

Recent Activity

authored a paper 8 days ago

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

updated a model about 1 month ago

NeelNanda/crosscoders-gpt2-small

View all activity

Organizations

None yet

Papers 10

arxiv:2411.14257

arxiv:2408.05147

arxiv:2406.16254

arxiv:2405.08366

models 65

NeelNanda/crosscoders-gpt2-small

Updated Oct 27 • 5

NeelNanda/GELU_1L512W_C4_Code

Updated Apr 23 • 5.91k • 2

NeelNanda/gpt-neox-tokenizer-digits

Updated Nov 28, 2023 • 2

NeelNanda/sparse_autoencoder

Updated Oct 28, 2023 • 3

NeelNanda/redwood-attn-only-2l

Updated Feb 25, 2023 • 5

NeelNanda/Othello-GPT-Transformer-Lens

Updated Feb 13, 2023

NeelNanda/full_pred_log_probs

Updated Nov 28, 2022

NeelNanda/SoLU_1L256W_C4_Width_Scan

Updated Nov 1, 2022 • 4

NeelNanda/SoLU_1L128W_C4_Width_Scan

Updated Nov 1, 2022 • 7

NeelNanda/SoLU_1L64W_C4_Width_Scan

Updated Nov 1, 2022 • 4

datasets 15

NeelNanda/pile-small-tokenized-2b

Viewer • Updated Feb 12, 2023 • 10.8M • 2.03k

NeelNanda/pile-tokenized-10b

Viewer • Updated Jan 24, 2023 • 10.8M • 504

NeelNanda/openwebtext-tokenized-9b

Viewer • Updated Jan 19, 2023 • 8.83M • 1.7k

NeelNanda/code-10k

Viewer • Updated Dec 27, 2022 • 10k • 44 • 1

NeelNanda/wiki-10k

Viewer • Updated Dec 27, 2022 • 10k • 63

NeelNanda/c4-code-20k

Viewer • Updated Dec 26, 2022 • 20k • 180 • 4

NeelNanda/c4-10k

Viewer • Updated Dec 26, 2022 • 10k • 163

NeelNanda/c4-tokenized-2b

Viewer • Updated Nov 14, 2022 • 1.36M • 512

NeelNanda/code-tokenized

Viewer • Updated Nov 14, 2022 • 297k • 96

NeelNanda/c4-code-tokenized-2b

Viewer • Updated Nov 13, 2022 • 1.66M • 105 • 1