High quality pretraining and instruction datasets for law, mathematics, and science.
Casey
casey-martin
AI & ML interests
Biomedical Tool Usage
Graph Learning
Ecophysiology
Recent Activity
liked
a dataset
about 18 hours ago
allenai/peS2o
liked
a dataset
2 days ago
neo4j/text2cypher-2024v1
liked
a dataset
13 days ago
microsoft/orca-agentinstruct-1M-v1
Organizations
Collections
1
models
None public yet
datasets
8
casey-martin/math_notebooks
Viewer
•
Updated
•
18.1k
•
35
casey-martin/CommonLit-Ease-of-Readability
Viewer
•
Updated
•
4.72k
•
15
•
1
casey-martin/multilingual-mathematical-autoformalization
Viewer
•
Updated
•
666k
•
132
•
1
casey-martin/MedInstruct
Preview
•
Updated
•
51
•
6
casey-martin/qald_9_plus
Viewer
•
Updated
•
15.8k
•
156
casey-martin/vquanda
Viewer
•
Updated
•
5k
•
35
•
3
casey-martin/protocols_io
Updated
•
35
casey-martin/oa_cpp_annotate_gen
Viewer
•
Updated
•
104k
•
51
•
2