license: mit | |
datasets: | |
- HuggingFaceFW/fineweb | |
library_name: Transformers | |
This is a Llama 2 architecture model series trained on the FineWeb dataset upto 1 Billion parameters and uses tiktoken cl100k_base model as tokenizer |
license: mit | |
datasets: | |
- HuggingFaceFW/fineweb | |
library_name: Transformers | |
This is a Llama 2 architecture model series trained on the FineWeb dataset upto 1 Billion parameters and uses tiktoken cl100k_base model as tokenizer |