inflatebot commited on
Commit
9e42382
1 Parent(s): ae7b272

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -31,6 +31,11 @@ The components of Veo Lu are:
31
  This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
32
  Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
33
 
 
 
 
 
 
34
 
35
  Made by inflatebot.
36
 
 
31
  This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
32
  Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
33
 
34
+ GGUFs coming soon, but honestly, the full-precision model is 3.5GB in size. You might wanna have a go at running this unquantized with vLLM.
35
+ ```
36
+ pip install vllm
37
+ vllm serve Alfitaria/Q25-1.5B-VeoLu --max-model-len 16384 --max-num-seqs 1
38
+ ```
39
 
40
  Made by inflatebot.
41