input embeddings, output similarity. Can this be done through an inference endpoint?

#38
by click37 - opened

Hi there

I am computing the embeddings of sentences using the model. I store them so that I don't run this computation in real time. I then get a new sentence in real-time, compute the embedding and want to run a sentence similarity between this new embedding and all others I did prior.

Can this be done here?

You can use any open source vector db(chroma, weaviate) to retrieve best matches of semantic similar sentences. I hope that's what you're looking for.

I'm also looking to do something similar with the inference API. However, the API seems to only support getting sentence similarity and not the actual vectors.

Supported

  • Inputting a source_sentence.
  • Inputting several other_sentences.
  • Getting the similarity of other_sentences with source_sentence.

Not Supported (Or maybe not well documented)

  • Inputting sentences or sentence.
  • Getting the 384 dimensional vectors.

Sign up or log in to comment