Max tokens = 8192 or 512?
#1
by
robmeadows
- opened
The models says it supports max tokens of 8192, but I get an error when using it through Hugging Face Inference API:statusCode: 413, response: {"error":"Input validation error:
inputs must have less than 512 tokens. Given: 645","error_type":"Validation"}