inflatebot
commited on
Commit
•
9e42382
1
Parent(s):
ae7b272
Update README.md
Browse files
README.md
CHANGED
@@ -31,6 +31,11 @@ The components of Veo Lu are:
|
|
31 |
This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
|
32 |
Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
|
33 |
|
|
|
|
|
|
|
|
|
|
|
34 |
|
35 |
Made by inflatebot.
|
36 |
|
|
|
31 |
This model is capable of carrying on a scene without going completely off the rails. That being said, it only has 1.5B parameters. So please, for the love of God, *manage your expectations.*
|
32 |
Since it's Qwen, use ChatML formatting. Turn the temperature down to ~0.7-0.8 and try a dash of rep-pen.
|
33 |
|
34 |
+
GGUFs coming soon, but honestly, the full-precision model is 3.5GB in size. You might wanna have a go at running this unquantized with vLLM.
|
35 |
+
```
|
36 |
+
pip install vllm
|
37 |
+
vllm serve Alfitaria/Q25-1.5B-VeoLu --max-model-len 16384 --max-num-seqs 1
|
38 |
+
```
|
39 |
|
40 |
Made by inflatebot.
|
41 |
|