Update README.md
Browse files
README.md
CHANGED
```diff
@@ -5,7 +5,7 @@ Linear probe checkpoints for https://footprints.baulab.info
 
 To load a Llama-2-7b checkpoint at layer 0 and target index -3:
 
-```
+```python
 import torch
 import torch.nn as nn
 from huggingface_hub import hf_hub_download
@@ -18,10 +18,15 @@ class LinearModel(nn.Module):
         output = self.fc(x)
         return output
 
+# example: llama-2-7b probe at layer 0, predicting 3 tokens ago
+# predicting the next token would be `layer0_tgtidx1.ckpt`
 checkpoint_path = hf_hub_download(
     repo_id="sfeucht/footprints",
     filename="llama-2-7b/layer0_tgtidx-3.ckpt"
 )
+
+# model_size is 4096 for both models.
+# vocab_size is 32000 for Llama-2-7b and 128256 for Llama-3-8b
 probe = LinearModel(4096, 32000)
 probe.load_state_dict(torch.load(checkpoint_path, map_location=torch.device('cpu')))
 ```
```