Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems Paper • 2408.16293 • Published Aug 29 • 25
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process Paper • 2407.20311 • Published Jul 29 • 4
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper • 2404.05405 • Published Apr 8 • 9
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 6
Physics of Language Models: Part 1, Context-Free Grammar Paper • 2305.13673 • Published May 23, 2023 • 7