Add quantized ONNX weights

by Xenova HF staff - opened Jan 26

base: refs/heads/main

←

from: refs/pr/3

Discussion Files changed

-0

Xenova

Jan 26

No description provided.

Add quantized ONNX weights905d43e4

Move "model.onnx" to onnx subfolder8018aae9

numb3r3

Jina AI org Jan 29

•

edited Jan 29

~~Thanks for your interests. Do you have some specific preferences about the quantization? And may I ask you to give us more description about your use case?~~

Never mind, I misread your message.

numb3r3

Jina AI org Jan 29

It looks great. BTW, could I know more about the quantization methodology used in this PR? Is there some hackies we need to be aware?

numb3r3 changed pull request status to merged Jan 29

Xenova

Jan 29

Hey! I applied the same quantization settings as all the other bert-based transformers.js models on the HF hub, like https://huggingface.co/Supabase/gte-small (see here for the full list). You can find a detailed list of settings applied here.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment