Is ModernBERT already fine-tuned for IR tasks?

#13

by belerico - opened 2 days ago

Discussion

belerico

2 days ago

Hi there, I was wondering if ModernBERT has already been fine-tuned on IR tasks

tomaarsen

Answer.AI org 2 days ago

Hello!

Yes, about a dozen times. See here a filter with Sentence Transformer models & ModernBERT architecture: https://huggingface.co/models?library=sentence-transformers&other=modernbert
I'm not sure which model is the best (it also depends on your use case), perhaps https://huggingface.co/NohTow/ModernBERT-base-DPR-fullneg-gte-0.0002 ?
But I suspect that none so far have reached the same performance as some of the models reported on MTEB. We'll have to wait a bit longer for that.

Tom Aarsen

belerico

2 days ago

⁠ Thank you for the information and the links!

I was also curious about the base versions, “answerdotai/ModernBERT-base” and “answerdotai/ModernBERT-large”—have these models already been fine-tuned for IR tasks, such as using contrastive loss? ⁠

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment