The latest o1 model from OpenAI is still unable to answer 9.11 > 9.9 correctly π€
A possible explanation? Tokenization - and our latest work investigates how it affects a model's ability to do math!
In this blog post, we discuss: π’ The different ways numbers are tokenized in modern LLMs π§ͺ Our detailed approach in comparing these various methods π₯ͺ How we got a free boost in arithmetic performance by adding a few lines of code to the base Llama 3 tokenizer π and a definitive, best tokenization method for math in LLMs!