language:
  - en
license: cc-by-nc-4.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
base_model: alpindale/Mistral-7B-v0.2-hf
datasets:
  - maldv/cyberpunk
  - microsoft/orca-math-word-problems-200k
  - Weyaxi/sci-datasets
  - grimulkan/theory-of-mind
  - ResplendentAI/Synthetic_Soul_1k
  - GraphWiz/GraphInstruct-RFT-72K

Electric Mist 7B

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: alpindale/Mistral-7B-v0.2-hf
  • Methodology: Simple newline-delimited text, built from rolling-window book excerpts and stripped conversation data.
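
The rolling-window idea in the methodology line can be sketched roughly as below. This is only an illustration, not the actual preprocessing used for this model: the function name `rolling_windows`, the stride value, and the whitespace "tokens" are all assumptions made for the example. The card itself only says that book text was cut with a rolling window.

```python
from typing import Iterator, List


def rolling_windows(tokens: List[str], window: int, stride: int) -> Iterator[List[str]]:
    """Yield overlapping windows of at most `window` tokens, advancing by `stride`."""
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    start = 0
    while start < len(tokens):
        yield tokens[start:start + window]
        if start + window >= len(tokens):
            break  # this window already reached the end of the text
        start += stride


# Toy example: whitespace-split words stand in for real tokenizer output,
# and window=4 stands in for the 8192-token context mentioned in this card.
book = "one two three four five six seven eight nine ten".split()
samples = [" ".join(w) for w in rolling_windows(book, window=4, stride=3)]
```

With a stride smaller than the window, consecutive samples overlap, so text near a window boundary is seen with context on both sides.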

Have you learned anything?

Yes. I learned that if you try to train models that aren't the base model, the results are trash. I have heard rumors that merging the LoRAs works, which is why the companion LoRA for this model is published as well.
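
For anyone who wants to try the merge route, here is a minimal sketch using the `transformers` and `peft` libraries. It is an assumption about how one might do it, not a tested recipe for this model: the function name `merge_lora` is made up for the example, and the adapter repository id is left as an argument because this card does not name it.

```python
def merge_lora(base_model_id: str, adapter_id: str, output_dir: str) -> None:
    """Load a model, apply a LoRA adapter, fold it into the weights, and save."""
    # Imports are local so that defining this function does not require
    # transformers or peft to be installed.
    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    base = AutoModelForCausalLM.from_pretrained(base_model_id)
    model = PeftModel.from_pretrained(base, adapter_id)
    # merge_and_unload() returns the underlying model with the LoRA deltas
    # added into its weights, so the adapter is no longer needed at inference.
    merged = model.merge_and_unload()
    merged.save_pretrained(output_dir)
```

Whether the adapter behaves well on a model other than the one it was trained from is exactly the open question above, so treat any result as an experiment.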

Will It Write?

It's good. It goes page after page. It needs an author's note to stay on track, though.

Data

90% sci-fi fiction text (with a lot of the pulpiest removed), then 10% of the other datasets mixed together; around 6,000 samples at 8,192-token context. Training used LoRA r 64, lr 0.00005, 2 epochs.
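
As a rough back-of-the-envelope check on those numbers, the sketch below works out the mix and the token budget. It assumes the 90/10 split is by sample count and that every sample fills the whole context, neither of which the card states, so read the results as upper bounds. The dictionary keys are illustrative names, not the arguments of any particular trainer.

```python
# Figures from the Data paragraph; "around 6000" is treated as exactly 6000 here.
NUM_SAMPLES = 6000
CONTEXT_TOKENS = 8192
FICTION_SHARE = 0.90

# Illustrative names only; map them onto whatever trainer you use.
hyperparams = {"lora_r": 64, "learning_rate": 5e-5, "num_epochs": 2}

fiction_samples = round(NUM_SAMPLES * FICTION_SHARE)   # assumes split by sample count
other_samples = NUM_SAMPLES - fiction_samples
tokens_per_epoch = NUM_SAMPLES * CONTEXT_TOKENS        # assumes every sample is full
total_tokens_seen = tokens_per_epoch * hyperparams["num_epochs"]

print(fiction_samples, other_samples)       # 5400 600
print(tokens_per_epoch, total_tokens_seen)  # 49152000 98304000
```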

Chat Template

It was trained to follow no prompt at all, just to start going. There is explicitly no chat in the training data. Everything is simply double-newline delimited (even the orca, math, etc.).
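
Since there is no chat template, a prompt is just plain text. The sketch below shows one way to assemble it with double newlines. The helper name `build_prompt`, the bracketed note format, and the placement of the note just before the latest paragraph are all assumptions for illustration; the card only says the model needs an author's note, not how to format it.

```python
from typing import List, Optional


def build_prompt(paragraphs: List[str], authors_note: Optional[str] = None) -> str:
    """Join paragraphs with double newlines; no roles and no special chat tokens."""
    parts = [p.strip() for p in paragraphs if p.strip()]
    if authors_note:
        # Assumed format and position: a bracketed note before the last paragraph.
        note = f"[Author's note: {authors_note.strip()}]"
        parts.insert(max(len(parts) - 1, 0), note)
    return "\n\n".join(parts)


prompt = build_prompt(
    ["The rain had not stopped in three days.", "Mara checked the feed again."],
    authors_note="keep the tone noir",
)
```

The resulting string can be passed to the model as raw text for plain continuation.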