This is SFT built on a mix of public datasets. Setting up for DPO with custom data.
This is a finetune of Mistrial. It should exhibit a broad base of instuction tuning and some other fun roleplaying capablities.
Its being trained this is about 50% done.
- Downloads last month
- 1,318
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.