Model Card for Bellman

This version of bellman is finetuned from llama-3-instruct-8b. It is arguable whether it's better at Swedish, because llama-3 is really good. It's however finetuned for prompt question answering, based on a dataset created from Swedish wikipedia, with a lot of Sweden-centric questions. New from previous versions is questions from a translated code-feedback dataset, as well as a number of stories. It's not great at generating stories, but better than previosly.

Please note, the HuggingFace inference api is probably trying to load the adapter (lora) which isn't going to work.

240609: I've uploaded a 4-bit GPTQ quant, but it's completely untested.

Model Details

Training run on 240606:

Step Training Loss Validation Loss
25 1.506400 1.164538
50 1.128500 1.059316
75 1.095100 1.040511
100 1.068700 1.031033
125 1.061300 1.024377
150 1.035700 1.017490
175 1.061200 1.012095
200 1.031600 1.007867
225 1.031900 1.002652
250 0.958300 1.003817
275 0.967900 1.000483
300 0.950000 0.998807
325 0.974300 0.996894
350 0.960700 0.994098
375 0.956000 0.991491
400 0.940500 0.988697
425 0.949100 0.987253
450 0.940600 0.986425 <-- Picked checkpoint
475 0.888300 0.994204
500 0.881700 0.994897

Model Description

Developed by: Me
Funded by: Me
Model type: Instruct
Language(s) (NLP): Swedish
License: llama-3
Finetuned from model: Llama3 Instruct 8b

Model Card Contact

rickard@mindemia.com

neph1
/

llama-3-instruct-bellman-8b-swedish

Model Card for Bellman

Model Details

Model Description

Model Card Contact

Collection including neph1/llama-3-instruct-bellman-8b-swedish

bellman