---
title: README
emoji: 👀
colorFrom: purple
colorTo: green
sdk: static
pinned: false
---
## FinMTEB: Finance Massive Text Embedding Benchmark
The Finance Massive Text Embedding Benchmark (FinMTEB) is an embedding benchmark consisting of **64 financial domain-specific text datasets** in **English and Chinese**, spanning **seven different tasks**. All datasets in FinMTEB are finance-specific; each was either used in prior financial NLP research or newly developed by the authors.

---
* Paper: [Do We Need Domain-Specific Embedding Models? An Empirical Investigation](https://arxiv.org/pdf/2409.18511v1)
* GitHub: [FinMTEB](https://github.com/yixuantt/FinMTEB/blob/main/README.md)