Alfitaria
/

Q25-1.5B-VeoLu

Model card Files Files and versions Community

inflatebot commited on Nov 4

Commit

9e42382

•

1 Parent(s): ae7b272

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -31,6 +31,11 @@ The components of Veo Lu are:
 This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
 Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
 Made by inflatebot.

 This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
 Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
+GGUFs coming soon, but honestly, the full-precision model is 3.5GB in size. You might wanna have a go at running this unquantized with vLLM.
+```
+pip install vllm
+vllm serve Alfitaria/Q25-1.5B-VeoLu --max-model-len 16384 --max-num-seqs 1
+```
 Made by inflatebot.