The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper ā¢ 2406.17557 ā¢ Published Jun 25 ā¢ 86 ā¢ 5
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper ā¢ 2404.14219 ā¢ Published Apr 22 ā¢ 254 ā¢ 42
StarCoder 2 and The Stack v2: The Next Generation Paper ā¢ 2402.19173 ā¢ Published Feb 29 ā¢ 136 ā¢ 4
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper ā¢ 2402.03300 ā¢ Published Feb 5 ā¢ 71 ā¢ 6
GAIA: a benchmark for General AI Assistants Paper ā¢ 2311.12983 ā¢ Published Nov 21, 2023 ā¢ 184 ā¢ 23