-
sentence-transformers/gooaq
Viewer • Updated • 3.01M • 586 • 10 -
sentence-transformers/yahoo-answers
Viewer • Updated • 3.14M • 363 • 3 -
sentence-transformers/msmarco-msmarco-distilbert-base-tas-b
Viewer • Updated • 86.3M • 1.27k • 4 -
sentence-transformers/msmarco-msmarco-distilbert-base-v3
Viewer • Updated • 88.9M • 810 • 2
Sentence Transformers
university
AI & ML interests
In the following you find models tuned to be used for sentence / text embedding generation. They can be used with the sentence-transformers package.
Organization Card
SentenceTransformers 🤗 is a Python framework for state-of-the-art sentence, text and image embeddings.
Install the Sentence Transformers library.
pip install -U sentence-transformers
The usage is as simple as:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer('paraphrase-MiniLM-L6-v2')
# Sentences we want to encode. Example:
sentence = ['This framework generates embeddings for each input sentence']
# Sentences are encoded by calling model.encode()
embedding = model.encode(sentence)
Hugging Face makes it easy to collaboratively build and showcase your Sentence Transformers models! You can collaborate with your organization, upload and showcase your own models in your profile ❤️
Documentation
Push your Sentence Transformers models to the Hub ❤️
Find all Sentence Transformers models on the 🤗 Hub
To upload your Sentence Transformers models to the Hugging Face Hub, log in with huggingface-cli login
and use the save_to_hub
method within the Sentence Transformers library.
from sentence_transformers import SentenceTransformer
# Load or train a model
model = SentenceTransformer(...)
# Push to Hub
model.push_to_hub("my_new_model")
Collections
3
A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers
These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual.
-
sentence-transformers/parallel-sentences-wikititles
Viewer • Updated • 14.7M • 93 -
sentence-transformers/parallel-sentences-tatoeba
Viewer • Updated • 8.35M • 1.25k -
sentence-transformers/parallel-sentences-talks
Viewer • Updated • 19.6M • 3.76k • 8 -
sentence-transformers/parallel-sentences-europarl
Viewer • Updated • 49.7M • 1.03k
models
124
sentence-transformers/xlm-r-base-en-ko-nli-ststb
Sentence Similarity
•
Updated
•
363
sentence-transformers/bert-base-wikipedia-sections-mean-tokens
Sentence Similarity
•
Updated
•
121
sentence-transformers/bert-base-nli-cls-token
Sentence Similarity
•
Updated
•
2.62k
•
2
sentence-transformers/all-MiniLM-L12-v1
Sentence Similarity
•
Updated
•
8.84k
•
8
sentence-transformers/all-MiniLM-L6-v1
Sentence Similarity
•
Updated
•
10k
•
13
sentence-transformers/all-mpnet-base-v1
Sentence Similarity
•
Updated
•
16.6k
•
7
sentence-transformers/facebook-dpr-ctx_encoder-multiset-base
Sentence Similarity
•
Updated
•
2.75k
•
3
sentence-transformers/facebook-dpr-ctx_encoder-single-nq-base
Sentence Similarity
•
Updated
•
2.08k
sentence-transformers/facebook-dpr-question_encoder-multiset-base
Sentence Similarity
•
Updated
•
425
•
1
sentence-transformers/facebook-dpr-question_encoder-single-nq-base
Sentence Similarity
•
Updated
•
423
•
2
datasets
76
sentence-transformers/parallel-sentences
Preview
•
Updated
•
1.2k
•
13
sentence-transformers/embedding-training-data
Updated
•
610
•
107
sentence-transformers/parallel-sentences-opus-100
Viewer
•
Updated
•
55M
•
5.64k
•
1
sentence-transformers/trivia-qa-triplet
Viewer
•
Updated
•
52.9M
•
1.07k
•
5
sentence-transformers/t2ranking
Viewer
•
Updated
•
5.53M
•
299
sentence-transformers/mr-tydi
Viewer
•
Updated
•
5.01M
•
2.05k
sentence-transformers/miracl
Viewer
•
Updated
•
8.95M
•
2.37k
•
2
sentence-transformers/mldr
Viewer
•
Updated
•
912k
•
1.55k
•
3
sentence-transformers/pubmedqa
Viewer
•
Updated
•
35.4k
•
179
sentence-transformers/lecard-v2
Viewer
•
Updated
•
13k
•
79