arxiv:2411.14257
Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
8 days ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
updated
a model
about 1 month ago
NeelNanda/crosscoders-gpt2-small
Organizations
None yet
Papers
10
models
65
NeelNanda/crosscoders-gpt2-small
Updated
•
5
NeelNanda/GELU_1L512W_C4_Code
Updated
•
5.91k
•
2
NeelNanda/gpt-neox-tokenizer-digits
Updated
•
2
NeelNanda/sparse_autoencoder
Updated
•
3
NeelNanda/redwood-attn-only-2l
Updated
•
5
NeelNanda/Othello-GPT-Transformer-Lens
Updated
NeelNanda/full_pred_log_probs
Updated
NeelNanda/SoLU_1L256W_C4_Width_Scan
Updated
•
4
NeelNanda/SoLU_1L128W_C4_Width_Scan
Updated
•
7
NeelNanda/SoLU_1L64W_C4_Width_Scan
Updated
•
4
datasets
15
NeelNanda/pile-small-tokenized-2b
Viewer
•
Updated
•
10.8M
•
2.03k
NeelNanda/pile-tokenized-10b
Viewer
•
Updated
•
10.8M
•
504
NeelNanda/openwebtext-tokenized-9b
Viewer
•
Updated
•
8.83M
•
1.7k
NeelNanda/code-10k
Viewer
•
Updated
•
10k
•
44
•
1
NeelNanda/wiki-10k
Viewer
•
Updated
•
10k
•
63
NeelNanda/c4-code-20k
Viewer
•
Updated
•
20k
•
180
•
4
NeelNanda/c4-10k
Viewer
•
Updated
•
10k
•
163
NeelNanda/c4-tokenized-2b
Viewer
•
Updated
•
1.36M
•
512
NeelNanda/code-tokenized
Viewer
•
Updated
•
297k
•
96
NeelNanda/c4-code-tokenized-2b
Viewer
•
Updated
•
1.66M
•
105
•
1