It works well with long system prompts.
It isn't generic in a sense that it shouldn't be used for story telling, for example, but only for reasoning and text comprehension.
This model is trained on a private dataset. The high GSM8K score is NOT because of the MetaMath dataset.
Prompt Format:
### System:
{add here the system prompt}
### Instruction:
{add here the instruction}
### Response:
{make sure you have a new line here}
- Downloads last month
- 6
Model tree for LoneStriker/Metis-0.1-3.0bpw-h6-exl2-2
Base model
mistralai/Mistral-7B-v0.1
Finetuned
HuggingFaceH4/mistral-7b-sft-beta