Mozilla
/

llm-compiler-13b-ftd-llamafile

Model card Files Files and versions Community

jartine commited on Jun 29

Commit

5aa7125

•

1 Parent(s): 7cbae06

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -34,6 +34,11 @@ chmod +x llm-compiler-13b-ftd.F16.llamafile
 ./llm-compiler-13b-ftd.F16.llamafile --help
 ```
 On GPUs with sufficient RAM, the `-ngl 999` flag may be passed to use
 the system's NVIDIA or AMD GPU(s). On Windows, only the graphics card
 driver needs to be installed. If the prebuilt DSOs should fail, the CUDA

 ./llm-compiler-13b-ftd.F16.llamafile --help
 ```
+This model has a max context window size of 16k tokens. The `.args` file
+inside these llamafiles have been configured to specify `-c 0 --temp 0`
+so that the max context size is used by default, and randomness is
+disabled by default too (since it's unhelpful for this model).
 On GPUs with sufficient RAM, the `-ngl 999` flag may be passed to use
 the system's NVIDIA or AMD GPU(s). On Windows, only the graphics card
 driver needs to be installed. If the prebuilt DSOs should fail, the CUDA