tomer-nv's picture
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
775f652 verified