stoshniwal commited on
Commit
b26e5d0
1 Parent(s): 8b747ca

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -2
README.md CHANGED
@@ -15,8 +15,29 @@ tags:
15
 
16
  OpenMath2-Llama3.1-8B is obtained by finetuning [Llama3.1-8B-Base](https://huggingface.co/meta-llama/Llama-3.1-8B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
17
 
18
- The model outperforms [Llama3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) on popular math benchmarks, especially on [MATH](https://github.com/hendrycks/math) by 15.9%.
19
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
 
21
  | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
22
  |:---|:---:|:---:|:---:|:---:|:---:|
 
15
 
16
  OpenMath2-Llama3.1-8B is obtained by finetuning [Llama3.1-8B-Base](https://huggingface.co/meta-llama/Llama-3.1-8B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
17
 
18
+ The model outperforms [Llama3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) on all the popular math benchmarks we evaluate on, especially on [MATH](https://github.com/hendrycks/math) by 15.9%.
19
+
20
+ <!-- <p align="center">
21
+ <img src="scaling_plot.jpg" width="350"><img src="math_level_comp.jpg" width="350">
22
+ </p> -->
23
+
24
+ <style>
25
+ .image-container {
26
+ display: flex;
27
+ justify-content: center;
28
+ align-items: center;
29
+ gap: 20px;
30
+ }
31
+ .image-container img {
32
+ width: 350px;
33
+ height: auto;
34
+ }
35
+ </style>
36
+
37
+ <div class="image-container">
38
+ <img src="scaling_plot.jpg" title="Performance of Llama-3.1-8B-Instruct as it is trained on increasing proportions of OpenMathInstruct-2">
39
+ <img src="math_level_comp.jpg" title="Comparison of OpenMath2-Llama3.1-8B vs. Llama-3.1-8B-Instruct across MATH levels">
40
+ </div>
41
 
42
  | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
43
  |:---|:---:|:---:|:---:|:---:|:---:|