Update README.md
README.md CHANGED
@@ -9,7 +9,27 @@ tags:
 - mistral
 - trl
 base_model: augmxnt/shisa-base-7b-v1
+datasets:
+- NilanE/ParallelFiction-Ja_En-100k
+- mpasila/ParallelFiction-Ja_En-100k-alpaca
 ---
+Experimental model; it may not perform that well. The dataset used is [a modified](https://huggingface.co/datasets/mpasila/ParallelFiction-Ja_En-100k-alpaca) version of [NilanE/ParallelFiction-Ja_En-100k](https://huggingface.co/datasets/NilanE/ParallelFiction-Ja_En-100k).
+
+After training with an 8k context length, it didn't appear to improve performance much at all. Not sure if I should keep training it (which is costly), fix some issues with the dataset (like entries starting with "Ch" or "Chapter"), or go back to finetuning Finnish models.
+
+### Prompt format: Alpaca
+```
+Below is a translation task, paired with an input that provides further context. Write a response that appropriately completes the request.
+
+### Instruction:
+{}
+
+### Input:
+{}
+
+### Response:
+{}
+```
 
 # Uploaded model
 
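For reference, the Alpaca template added above can be filled in like this at inference time. This is only a minimal sketch using `transformers`; the model repo id, example input, and generation settings are placeholders, not values taken from this repository.

```python
# Sketch: applying the Alpaca-style translation prompt above with transformers.
# MODEL_ID is a placeholder -- substitute the actual model repository id.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "your-username/your-finetuned-model"  # placeholder

# Same template as in the README, with named slots for clarity.
ALPACA_TEMPLATE = (
    "Below is a translation task, paired with an input that provides further context. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

prompt = ALPACA_TEMPLATE.format(
    instruction="Translate this passage from Japanese to English.",
    input="吾輩は猫である。名前はまだ無い。",  # example input, not from the dataset
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)

# Decode only the newly generated tokens (the translation after "### Response:").
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```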