PEFT
Safetensors
Transformers
English
text-generation-inference
unsloth
mistral
trl
Not-For-All-Audiences
mpasila commited on
Commit
6d2866e
1 Parent(s): 7223ef8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -9,7 +9,15 @@ tags:
9
  - mistral
10
  - trl
11
  base_model: unsloth/mistral-7b-v0.2-bnb-4bit
 
 
 
12
  ---
 
 
 
 
 
13
 
14
  # Uploaded model
15
 
@@ -19,4 +27,4 @@ base_model: unsloth/mistral-7b-v0.2-bnb-4bit
19
 
20
  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
9
  - mistral
10
  - trl
11
  base_model: unsloth/mistral-7b-v0.2-bnb-4bit
12
+ datasets:
13
+ - mpasila/PIPPA-ShareGPT-formatted-named
14
+ - KaraKaraWitch/PIPPA-ShareGPT-formatted
15
  ---
16
+ LoRA trained with Unsloth in 4-bit using [alpindale/Mistral-7B-v0.2-hf](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf/) as the base model for 1 epoch. Dataset used with the LoRA is [pinzhenchen/alpaca-cleaned-fi](https://huggingface.co/datasets/pinzhenchen/alpaca-cleaned-fi/).
17
+
18
+ Dataset used is [a modified](https://huggingface.co/datasets/mpasila/PIPPA-ShareGPT-formatted-named) version of [KaraKaraWitch/PIPPA-ShareGPT-formatted](https://huggingface.co/datasets/KaraKaraWitch/PIPPA-ShareGPT-formatted).
19
+
20
+ ### Prompt format: ChatML
21
 
22
  # Uploaded model
23
 
 
27
 
28
  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
29
 
30
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)