Text Generation
GGUF
English
math
Inference Endpoints
conversational
aashish1904 commited on
Commit
9a658e4
1 Parent(s): d218234

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +64 -0
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ license: apache-2.0
5
+ datasets:
6
+ - MathGenie/MathCode-Pile
7
+ language:
8
+ - en
9
+ metrics:
10
+ - accuracy
11
+ base_model:
12
+ - meta-llama/Meta-Llama-3-8B
13
+ pipeline_tag: text-generation
14
+ tags:
15
+ - math
16
+
17
+ ---
18
+
19
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
20
+
21
+
22
+ # QuantFactory/MathCoder2-Llama-3-8B-GGUF
23
+ This is quantized version of [MathGenie/MathCoder2-Llama-3-8B](https://huggingface.co/MathGenie/MathCoder2-Llama-3-8B) created using llama.cpp
24
+
25
+ # Original Model Card
26
+
27
+
28
+ # MathCoder2
29
+
30
+ ### Introduction
31
+
32
+ The MathCoder2 models are created by conducting continued pretraining on [MathCode-Pile](https://huggingface.co/datasets/MathGenie/MathCode-Pile). They are introduced in the paper [MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code](https://arxiv.org/abs/2410.08196).
33
+
34
+ The mathematical pretraining dataset includes mathematical code accompanied with natural language reasoning steps, making it a superior resource for models aimed at performing advanced mathematical reasoning tasks.
35
+
36
+ ### Evaluation
37
+
38
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65dd9e7b4a4fce1ec96dc6b7/BEZoDZLjp-fPFlt7oFXBa.png)
39
+
40
+ ### Citation
41
+
42
+ If you find this repository helpful, please consider citing our papers:
43
+
44
+ ```
45
+ @misc{lu2024mathcoder2bettermathreasoning,
46
+ title={MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code},
47
+ author={Zimu Lu and Aojun Zhou and Ke Wang and Houxing Ren and Weikang Shi and Junting Pan and Mingjie Zhan and Hongsheng Li},
48
+ year={2024},
49
+ eprint={2410.08196},
50
+ archivePrefix={arXiv},
51
+ primaryClass={cs.CL},
52
+ url={https://arxiv.org/abs/2410.08196},
53
+ }
54
+ ```
55
+ ```
56
+ @inproceedings{
57
+ wang2024mathcoder,
58
+ title={MathCoder: Seamless Code Integration in {LLM}s for Enhanced Mathematical Reasoning},
59
+ author={Zimu Lu and Aojun Zhou and Zimu Lu and Sichun Luo and Weikang Shi and Renrui Zhang and Linqi Song and Mingjie Zhan and Hongsheng Li},
60
+ booktitle={The Twelfth International Conference on Learning Representations},
61
+ year={2024},
62
+ url={https://openreview.net/forum?id=z8TW0ttBPp}
63
+ }
64
+ ```