Alfitaria
/

Q25-1.5B-VeoLu

Model card Files Files and versions Community

inflatebot commited on Nov 4

Commit

478a093

•

1 Parent(s): 30f3b67

Update README.md

Files changed (1) hide show

README.md +24 -13

README.md CHANGED Viewed

@@ -1,28 +1,39 @@
 ---
 base_model:
 - Qwen/Qwen2.5-1.5B-Instruct
-library_name: transformers
 tags:
 - mergekit
 - merge
 ---
 # Q25-1.5-VeoLu-R2
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as a base.
-### Models Merged
 The following models were included in the merge:
-* /home/asriel/AI/text/models/scribe
-* /home/asriel/AI/text/models/alchemist
-* /home/asriel/AI/text/models/bard
-* /home/asriel/AI/text/models/cartographer
 ### Configuration
@@ -54,4 +65,4 @@ slices:
       weight: 1.0
   - layer_range: [0, 28]
     model: Qwen/Qwen2.5-1.5B-Instruct
-```

 ---
 base_model:
 - Qwen/Qwen2.5-1.5B-Instruct
+library_name: peft
 tags:
 - mergekit
 - merge
+- llama-factory
+- lora
+datasets:
+- allura-org/fujin-cleaned-stage-1
+- Dampfinchen/Creative_Writing_Multiturn
+- ToastyPigeon/SpringDragon
+- allura-org/medquad_sharegpt
+- allura-org/scienceqa_sharegpt
+- Alignment-Lab-AI/orcamath-sharegpt
 ---
 # Q25-1.5-VeoLu-R2
+Q25-1.5B-Veo Lu is a tiny General-Purpose Creative model, made up of a merge of bespoke finetunes on Qwen 2.5-1.5B-Instruct.
+Inspired by the success of [MN-12B-Mag Mell](https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1) and [MS-Meadowlark-22B](https://huggingface.co/allura-org/MS-Meadowlark-22B), Veo Lu was trained on a healthy, balanced diet of of Internet fiction, roleplaying, adventuring, and reasoning/general knowledge.
+The components of Veo Lu are:
 The following models were included in the merge:
+* Bard (pretrain, writing): [Fujin (Cleaned/extended Rosier)](https://huggingface.co/allura-org/fujin-cleaned-stage-1)
+* Scribe (pretrain, roleplay): [Creative Writing Multiturn](https://huggingface.co/Dampfinchen/Creative_Writing_Multiturn)
+* Cartographer (pretrain, adventuring): [SpringDragon](https://huggingface.co/ToastyPigeon/SpringDragon)
+* Alchemist (SFT, science/reasoning): [ScienceQA,](https://huggingface.co/allura-org/scienceqa_sharegpt) [MedquadQA,](https://huggingface.co/allura-org/medquad_sharegpt) [Orca Math Word Problems](https://huggingface.co/Alignment-Lab-AI/orcamath-sharegpt)
+This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
+Made by inflatebot.
+Special thanks to our friends at Allura, and especially to Auri, who basically held my hand through the whole process. Her effort and enthusiasm carried this project forward.
 ### Configuration
       weight: 1.0
   - layer_range: [0, 28]
     model: Qwen/Qwen2.5-1.5B-Instruct
+```