maldv/electric-mist-7b



	---
	language:
	- en
	license: cc-by-nc-4.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- mistral
	base_model: alpindale/Mistral-7B-v0.2-hf
	datasets:
	- maldv/cyberpunk
	- microsoft/orca-math-word-problems-200k
	- Weyaxi/sci-datasets
	- grimulkan/theory-of-mind
	- ResplendentAI/Synthetic_Soul_1k
	- GraphWiz/GraphInstruct-RFT-72K
	---

	# Electric Mist 7B

	- Developed by: maldv
	- License: cc-by-nc-4.0
	- Finetuned from model: alpindale/Mistral-7B-v0.2-hf
	- Methodology: Simple newline-delimited text, built from rolling-window book data and stripped conversation data.
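
The rolling-window idea in the methodology line can be sketched roughly as below. This is only an illustration, not the actual preprocessing script: the function name `rolling_windows` is hypothetical, and the `stride` value is an assumption, since the card states the 8192 context length but not how far the window advances.

```python
from typing import Iterator, List, Sequence


def rolling_windows(
    tokens: Sequence[int], window: int = 8192, stride: int = 4096
) -> Iterator[List[int]]:
    """Yield overlapping chunks of at most `window` tokens.

    `stride` is an assumed value; the card does not state one.
    """
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    start = 0
    while True:
        chunk = list(tokens[start:start + window])
        if not chunk:
            break  # nothing left (or the input was empty)
        yield chunk
        if start + window >= len(tokens):
            break  # this chunk already reached the end of the text
        start += stride


# Toy example with a tiny window so the overlap is easy to see.
toy_chunks = list(rolling_windows(list(range(10)), window=4, stride=2))
```

With a stride smaller than the window, consecutive samples overlap, so the model sees each passage with more than one amount of preceding context.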

	## Have you learned anything?

	Yes. I learned that if you try to train models that aren't the base model, the results are trash. I have heard rumors that merging the LoRAs works,
	which is why the companion LoRA for this is published as well.
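
For anyone who wants to test the merging rumor, a minimal sketch using `transformers` and `peft` might look like this. The function name `merge_lora` and all three arguments are placeholders: the card does not give the companion LoRA's repo id or say which model it should be merged into, so none of them are filled in here.

```python
def merge_lora(target_model_id: str, adapter_id: str, out_dir: str) -> None:
    """Apply a LoRA adapter to a model and save the merged weights.

    All three arguments are placeholders supplied by the caller; the
    card does not name the adapter repo or a merge target.
    """
    # Imported lazily so the module can be loaded without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    target = AutoModelForCausalLM.from_pretrained(target_model_id)
    model = PeftModel.from_pretrained(target, adapter_id)
    merged = model.merge_and_unload()  # folds the LoRA deltas into the weights
    merged.save_pretrained(out_dir)
```

Whether the merged model behaves well is exactly the open question in the paragraph above, so treat the output as an experiment.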

	## Will It Write

	It's good. It goes page after page. It needs an author's note to stay on track, though.
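
One way to supply such a note is to splice it into the running text a couple of paragraphs before the end, so it stays close to where the model continues writing. The sketch below is an assumption throughout: the card does not say how the note should be formatted or placed, so the bracketed `[Author's note: ...]` form, the `depth` parameter, and the function name `with_authors_note` are all illustrative.

```python
def with_authors_note(story: str, note: str, depth: int = 2) -> str:
    """Insert a note `depth` paragraphs before the end of `story`.

    The bracketed note format and the default depth are assumptions,
    not something the model card specifies.
    """
    paragraphs = [p for p in story.split("\n\n") if p.strip()]
    cut = max(len(paragraphs) - depth, 0)
    noted = paragraphs[:cut] + [f"[Author's note: {note}]"] + paragraphs[cut:]
    return "\n\n".join(noted)


example = with_authors_note("A\n\nB\n\nC\n\nD", "keep the heist on schedule")
```

Re-inserting the note at the same depth each time the context is rebuilt keeps it near the end of the prompt as the story grows.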

	## Data

	90% sci-fi fiction text data (with much of the pulpiest material removed), then 10% of the other datasets mixed together; around 6,000 samples at 8192-token context, LoRA r=64, lr 5e-5, 2 epochs.
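
As a rough sanity check on those numbers, the arithmetic works out as follows. It assumes the 90/10 split is by sample count and that every sample fills the full 8192-token window, neither of which the card confirms, so the token totals are upper bounds.

```python
SAMPLES = 6000        # "around 6000" per the card, so approximate
CONTEXT = 8192        # tokens per sample
EPOCHS = 2
FICTION_PERCENT = 90  # assumed to be a share of samples, not of tokens

fiction_samples = SAMPLES * FICTION_PERCENT // 100
other_samples = SAMPLES - fiction_samples

# Upper bounds: assumes every sample is a full 8192-token window.
tokens_per_epoch = SAMPLES * CONTEXT
total_tokens = tokens_per_epoch * EPOCHS

print(f"fiction samples:  {fiction_samples}")
print(f"other samples:    {other_samples}")
print(f"tokens per epoch: {tokens_per_epoch:,}")
print(f"tokens trained:   {total_tokens:,}")
```

So the run sees at most about 49 million tokens per epoch, or roughly 98 million over both epochs.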

	## Chat Template

	It was trained to follow no prompt at all, just to start going. There is explicitly no chat in the training data. Everything is simply double-newline delimited (even the orca, math, and other datasets).
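
In practice this means feeding the model plain text and letting it continue, without `apply_chat_template` or role markers. The sketch below assumes the repo id `maldv/electric-mist-7b` from the model page and the standard `transformers` loading path; the helper names and the sampling settings are illustrative choices, not recommendations from the card.

```python
from typing import Sequence

MODEL_ID = "maldv/electric-mist-7b"  # repo id as shown on the model page


def build_prompt(passages: Sequence[str]) -> str:
    """Join passages with blank lines, mirroring the double-newline training format."""
    return "\n\n".join(p.strip() for p in passages if p.strip())


def continue_text(prompt: str, max_new_tokens: int = 256) -> str:
    """Continue `prompt` as raw text. Sampling settings are assumptions."""
    # Imported lazily so build_prompt works without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # device_map="auto" requires the accelerate package.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")
    # Plain tokenization: no chat template and no system or user turns.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=True, temperature=0.8
    )
    return tokenizer.decode(output[0], skip_special_tokens=True)


prompt = build_prompt(["The rain had not stopped in nine days.", "  Mara checked the relay again. "])
```

Calling `continue_text(prompt)` downloads the weights and returns the prompt followed by the model's continuation.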


	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="50"/>](https://github.com/unslothai/unsloth)