ONNX weights for transformers.js
Hey, awesome work!
Could you add ONNX weights, like in https://huggingface.co/minishlab/M2V_base_output/tree/main?
Hey @do-me, thanks! I can try to create some ONNX weights (Xenova created them last time). I think a few changes might be required to make it work, since inference is now fully NumPy-based. Do you perhaps have some example code where you use the ONNX model from M2V_base_output, so I can test whether everything works?
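For context, a Model2Vec forward pass is just a static embedding lookup followed by mean pooling over each text's tokens; that is the computation any ONNX export has to reproduce. A minimal illustrative sketch (`embeddingTable` is a stand-in for the model's embedding matrix, not a real API):

```js
// Illustrative only: embed one text by averaging its static token embeddings.
function embed(tokenIds, embeddingTable) {
    const dim = embeddingTable[0].length;
    const out = new Float32Array(dim);
    for (const id of tokenIds) {
        for (let d = 0; d < dim; d++) out[d] += embeddingTable[id][d];
    }
    for (let d = 0; d < dim; d++) out[d] /= tokenIds.length; // mean pooling
    return out;
}
```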
Sure, this was the conversion code: https://github.com/MinishLab/model2vec/issues/75#issuecomment-2408746794
Unfortunately that code no longer works, since it dates from when we were still using Torch for inference. Do the old ONNX models from M2V_base_output still work for you (and if so, how are you using them)? If you have a code example, I can see if I can make it work with the new NumPy inference.
Yes, they work perfectly with the latest transformers.js version. Xenova posted the example code in the GitHub issue. I'll push an app later this evening that you can use for reference!
@Pringled here you go: https://jsfiddle.net/wohd0gsj/1. Just have a look at the console, where the embeddings are logged.
If you're curious, I built an app based on Model2Vec: https://do-me.github.io/semantic-similarity-table/ (but I'll only announce it next week). I'd love to add the option to switch between models for higher quality (POTION) or multilingual support.
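For anyone following along: once the embeddings are available, ranking entries in such a similarity table boils down to cosine similarity between vectors. A minimal sketch (the helper name is mine):

```js
// Cosine similarity between two embedding vectors
// (e.g. two rows of `embeddings.tolist()`).
function cosineSimilarity(a, b) {
    let dot = 0, normA = 0, normB = 0;
    for (let i = 0; i < a.length; i++) {
        dot += a[i] * b[i];
        normA += a[i] * a[i];
        normB += b[i] * b[i];
    }
    return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}
```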
Unfortunately it doesn't work yet, but that might be down to how I'm calling the new model:
```js
import { AutoModel, AutoTokenizer, Tensor } from 'https://cdn.jsdelivr.net/npm/@huggingface/transformers@3.0.1';

// Load the Model2Vec ONNX model and its tokenizer.
const model = await AutoModel.from_pretrained('minishlab/potion-base-8M', {
    config: { model_type: 'model2vec' },
    dtype: 'fp32'
});
const tokenizer = await AutoTokenizer.from_pretrained('minishlab/potion-base-8M');

// Tokenize without special tokens; return plain arrays instead of tensors.
const texts = ['hello', 'hello world'];
const { input_ids } = await tokenizer(texts, { add_special_tokens: false, return_tensor: false });

// The model expects all token IDs flattened into one array, plus an `offsets`
// array marking where each text's tokens start (EmbeddingBag-style layout).
const cumsum = arr => arr.reduce((acc, num, i) => [...acc, num + (acc[i - 1] || 0)], []);
const offsets = [0, ...cumsum(input_ids.slice(0, -1).map(x => x.length))];
const flattened_input_ids = input_ids.flat();

const model_inputs = {
    input_ids: new Tensor('int64', flattened_input_ids, [flattened_input_ids.length]),
    offsets: new Tensor('int64', offsets, [offsets.length]),
};

const { embeddings } = await model(model_inputs);
console.log(embeddings.tolist()); // should match the Python output
```
Seems like it's the fault of the missing (?) tokenizer files.
Ah, I see, those are needed for Transformers compatibility. I can add that as well; I'll ping you once they're added.
Awesome, great to hear! I'll add these files to the other POTION models as well as the multilingual model, so that you can use those too :).
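Once those files land, switching models from transformers.js should just be a matter of swapping the model ID, assuming the repos share the same layout. A sketch (model IDs other than potion-base-8M are assumptions; check the MinishLab org on the Hub for the exact names):

```js
// Hypothetical model registry; the multilingual ID below is an assumption.
const MODELS = {
    base: 'minishlab/potion-base-8M',
    multilingual: 'minishlab/potion-multilingual-128M', // assumed ID
};

const modelId = MODELS.multilingual;
const model = await AutoModel.from_pretrained(modelId, {
    config: { model_type: 'model2vec' },
    dtype: 'fp32'
});
const tokenizer = await AutoTokenizer.from_pretrained(modelId);
```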