Nougat ONNX
Collection
Faster Nougat in ONNX format (optimum onnxruntime)
•
6 items
•
Updated
•
1
This was quantized from pszemraj/nougat-small-onnx
using the --avx512_vnni
flag. You need to have a processor with avx512_vnni instructions for this to work properly.
per_channel
is set to True for better accuracylscpu | grep avx512_vnni