expanded 49556 tokens
Following The Optimal Vocabulary Size Predictor, I recommend using this tokenizer with 13B model.
-