jmeadows17 committed on
Commit
da27179
1 Parent(s): 71a9a2f

Update README.md

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -1,3 +1,14 @@
  ---
  license: openrail
  ---
+
+ MathT5-large is a version of FLAN-T5-large fine-tuned on (LaTeX) synthetic mathematical derivations, generated using a symbolic solver.
+
+ It outperforms GPT-4 and ChatGPT (paper link soon) on a derivation generation task (in a number of metrics) and can somewhat generalise to out-of-distribution examples.
+
+ It struggles with out-of-vocabulary symbols, but was trained on 155 physics symbols.
+
+
+ An example prompt:
+
+ Given \cos{(q)} = \theta{(q)}, then derive - \sin{(q)} = \frac{d}{d q} \theta{(q)}, then obtain (- \sin{(q)})^{q} (\frac{d}{d q} \cos{(q)})^{q} = (- \sin{(q)})^{2 q}
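
The README's example prompt follows a "Given ..., then derive ..., then obtain ..." pattern. The following is a minimal sketch of how such a prompt might be assembled and passed to the model with the Hugging Face `transformers` library. Several things here are assumptions rather than anything stated in the commit: the repository id `jmeadows17/MathT5-large`, the helper names `build_prompt` and `generate_derivation`, the `max_new_tokens` value, and the idea that the checkpoint loads through `AutoModelForSeq2SeqLM` (plausible for a FLAN-T5 fine-tune, but not confirmed by the README).

```python
from typing import Sequence, Tuple


def build_prompt(premise: str, steps: Sequence[Tuple[str, str]]) -> str:
    """Assemble a prompt in the style of the README example (hypothetical helper).

    `steps` is a sequence of (verb, equation) pairs, e.g. ("derive", "...").
    """
    parts = [f"Given {premise}"]
    parts += [f"then {verb} {equation}" for verb, equation in steps]
    return ", ".join(parts)


def generate_derivation(
    prompt: str,
    model_id: str = "jmeadows17/MathT5-large",  # assumed repository id
    max_new_tokens: int = 256,                  # assumed generation budget
) -> str:
    """Generate a derivation for `prompt` (hypothetical helper).

    transformers (and PyTorch) are imported lazily so that build_prompt
    can be used without them installed.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Rebuild the example prompt from the README.
prompt = build_prompt(
    r"\cos{(q)} = \theta{(q)}",
    [
        ("derive", r"- \sin{(q)} = \frac{d}{d q} \theta{(q)}"),
        ("obtain", r"(- \sin{(q)})^{q} (\frac{d}{d q} \cos{(q)})^{q} = (- \sin{(q)})^{2 q}"),
    ],
)

# Uncomment to run the model; the first call downloads the weights.
# print(generate_derivation(prompt))
```

Since the README notes that the model struggles with out-of-vocabulary symbols, prompts built this way are probably best kept to the kind of LaTeX notation shown in the example.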