Jared Van Bortel committed
Commit f414603
Parent(s): 613d71b
update README
README.md
CHANGED
@@ -15,7 +15,7 @@ tags:
 ---
 
 ***
-**
+**Note**: For compatibility with current llama.cpp, please download the files published on 2/15/2024. The files originally published here do not work after llama.cpp [PR 5500](https://github.com/ggerganov/llama.cpp/pull/5500).
 ***
 
 <br/>

@@ -31,7 +31,7 @@ This repo contains llama.cpp-compatible files for [nomic-embed-text-v1.5](https:
 
 llama.cpp will default to 2048 tokens of context with these files. To use the full 8192 tokens that Nomic Embed is benchmarked on, you will have to choose a context extension method. The original model uses Dynamic NTK-Aware RoPE scaling, but that is not currently available in llama.cpp. A combination of YaRN and linear scaling is an acceptable substitute.
 
-These files were converted and quantized with llama.cpp commit [
+These files were converted and quantized with llama.cpp [PR 5500](https://github.com/ggerganov/llama.cpp/pull/5500), commit [34aa045de](https://github.com/ggerganov/llama.cpp/pull/5500/commits/34aa045de44271ff7ad42858c75739303b8dc6eb).
 
 ## Example `llama.cpp` Command
 
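The context-extension advice in the diff above (YaRN plus linear RoPE scaling as a substitute for Dynamic NTK-Aware scaling) could be exercised with a command along these lines. This is a sketch only: the GGUF filename is a placeholder, and the `--rope-freq-scale` value is an assumption, not a setting documented in this repo; `--rope-scaling` and `--rope-freq-scale` are llama.cpp's standard RoPE flags.

```shell
# Sketch: run Nomic Embed with 8192 tokens of context using llama.cpp's
# embedding example, combining YaRN with a linear frequency scale.
# The model filename is a placeholder; 0.75 is an assumed scale factor.
./embedding \
  -m nomic-embed-text-v1.5.f16.gguf \
  -c 8192 -b 8192 \
  --rope-scaling yarn \
  --rope-freq-scale 0.75 \
  -p "search_query: What is TSNE?"
```

Nomic Embed expects a task prefix such as `search_query:` on the prompt, so one is included in the sketch.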