florian-hoenicke's picture
feat: push custom model
d43edd8 verified
metadata
license: apache-2.0
datasets:
  - fine-tuned/jinaai_jina-embeddings-v2-base-en-15072024-5xy1-webapp
  - allenai/c4
language:
  - en
pipeline_tag: feature-extraction
tags:
  - sentence-transformers
  - feature-extraction
  - sentence-similarity
  - mteb
  - Academic
  - Research
  - Papers
  - Information
  - System

This model is a fine-tuned version of jinaai/jina-embeddings-v2-base-en designed for the following use case:

information retrieval system for academic research papers

How to Use

This model can be easily integrated into your NLP pipeline for tasks such as text classification, sentiment analysis, entity recognition, and more. Here's a simple example to get you started:

from sentence_transformers import SentenceTransformer
from sentence_transformers.util import cos_sim

model = SentenceTransformer(
    'fine-tuned/jinaai_jina-embeddings-v2-base-en-15072024-5xy1-webapp',
    trust_remote_code=True
)

embeddings = model.encode([
    'first text to embed',
    'second text to embed'
])
print(cos_sim(embeddings[0], embeddings[1]))