Add a note to the README: LlamaTokenizerFast is not included in this build, so the Inference API will not work. For now, load LlamaTokenizerFast from Doctor-Shotgun/TinyLlama-1.1B-32k-Instruct to use this model.
6f804a0
matlok
committed on
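
The workaround in the commit message could look roughly like the sketch below. It assumes the `transformers` library is installed and that the tokenizer files in Doctor-Shotgun/TinyLlama-1.1B-32k-Instruct are compatible with this model, as the note implies. The helper names `load_tokenizer` and `load_model` are hypothetical, and the model's own repo id is not stated in this commit, so it is left as a parameter.

```python
# Minimal sketch: borrow the fast tokenizer from another Hub repo,
# because this build does not ship LlamaTokenizerFast files.
# Assumption: `transformers` is installed and the Hub is reachable.

TOKENIZER_REPO = "Doctor-Shotgun/TinyLlama-1.1B-32k-Instruct"


def load_tokenizer(repo_id: str = TOKENIZER_REPO):
    """Download and return LlamaTokenizerFast from the given Hub repo."""
    # Imported lazily so this module can be imported without transformers.
    from transformers import LlamaTokenizerFast

    return LlamaTokenizerFast.from_pretrained(repo_id)


def load_model(model_repo_id: str):
    """Load the causal LM weights from the model's own repo (hypothetical helper)."""
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(model_repo_id)


# Example usage (requires network access; pass this model's repo id yourself):
#   tokenizer = load_tokenizer()
#   model = load_model(model_repo_id)
#   inputs = tokenizer("Hello", return_tensors="pt")
#   output_ids = model.generate(**inputs, max_new_tokens=20)
#   print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Loading the tokenizer and the weights from separate repos only affects local use; it does not make the hosted Inference API work, which is the limitation the README note describes.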