microsoft/rho-math-1b-interpreter-v0.1

	---
	license: mit
	tags:
	- nlp
	- math
	language:
	- en
	pipeline_tag: text-generation
	---

	## Rho-1: Not All Tokens Are What You Need


	The Rho-1 series consists of pretrained language models trained with a Selective Language Modeling (SLM) objective.
	In math reasoning pretraining, SLM improves average few-shot accuracy on GSM8k and MATH by over 16% and reaches baseline performance 5-10x faster.
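This card does not spell out how SLM chooses tokens. As a rough illustration, the sketch below follows one reading of the linked paper: each token is scored by its excess loss (the training model's loss minus a reference model's loss), and only the highest-scoring fraction of tokens contributes to the training loss. The function name `selective_lm_loss`, the `keep_ratio` parameter, and its default value are assumptions made for this example, not part of the released code.

```python
def selective_lm_loss(train_losses, ref_losses, keep_ratio=0.6):
    """Average the training loss over the tokens with the largest excess loss.

    train_losses: per-token losses from the model being trained.
    ref_losses:   per-token losses from a reference model (assumed given).
    keep_ratio:   fraction of tokens kept (hypothetical default).
    Returns (loss, mask), where mask[i] is True if token i was selected.
    """
    if len(train_losses) != len(ref_losses):
        raise ValueError("loss lists must have the same length")
    if not train_losses:
        raise ValueError("need at least one token")

    # Excess loss: how much worse the training model is than the reference.
    excess = [t - r for t, r in zip(train_losses, ref_losses)]

    # Keep at least one token so the average is always defined.
    k = max(1, int(len(excess) * keep_ratio))

    # Indices of the k tokens with the largest excess loss.
    top = sorted(range(len(excess)), key=lambda i: excess[i], reverse=True)[:k]
    selected = set(top)
    mask = [i in selected for i in range(len(excess))]

    # Only the selected tokens contribute to the loss.
    loss = sum(train_losses[i] for i in top) / k
    return loss, mask


if __name__ == "__main__":
    train = [2.0, 0.5, 3.0, 1.0]
    ref = [0.5, 0.4, 2.9, 0.2]
    loss, mask = selective_lm_loss(train, ref, keep_ratio=0.5)
    print(loss, mask)  # 1.5 [True, False, False, True]
```

In a real training loop the per-token losses would come from the cross-entropy of the two models on the same batch, and the mask would be applied before backpropagation; the list-based version here only shows the selection rule.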


	For more details, see our [GitHub repository](https://github.com/microsoft/rho) and [paper](https://arxiv.org/abs/2404.07965).
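The card gives no usage snippet. The following is a minimal sketch, assuming the checkpoint loads through the standard Hugging Face `transformers` causal-LM classes, as the text-generation pipeline tag suggests. The helper name `generate`, the `max_new_tokens` default, and the example prompt are hypothetical. The prompt format this interpreter variant expects is not stated here, so check the repository before relying on the output.

```python
MODEL_ID = "microsoft/rho-math-1b-interpreter-v0.1"


def generate(prompt, max_new_tokens=256):
    """Generate a completion for `prompt` (hypothetical helper)."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Tokenize the prompt and generate greedily with default settings.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Illustrative prompt only; the recommended format may differ.
    print(generate("Question: What is 12 * 7?\nAnswer:"))
```

Running the script downloads the weights from the Hub on first use and needs `transformers` and PyTorch installed.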