--- license: apache-2.0 datasets: - enjalot/fineweb-edu-sample-10BT-chunked-500-nomic-text-v1.5 language: - en --- # Latent SAE A series of SAEs trained on embeddings from [nomic-embed-text-v1.5](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5) The SAEs were trained using the [10BT sample of Fineweb-Edu](https://huggingface.co/datasets/enjalot/fineweb-edu-sample-10BT-chunked-500). Run the models or train your own with [Latent SAE](https://github.com/enjalot/latent-sae)