---
title: README
emoji: 👀
colorFrom: purple
colorTo: green
sdk: static
pinned: false
---
<div align="center">
    <img src="https://github.com/yixuantt/picx-images-hosting/raw/master/bar.231u8j8ajg.webp" alt="Logo" width="100%" />
</div>

## FinMTEB: Finance Massive Text Embedding Benchmark
The Finance Massive Text Embedding Benchmark (FinMTEB) is an embedding benchmark consisting of **64 financial domain-specific text datasets** in **English and Chinese**, spanning **seven different tasks**. All datasets in FinMTEB are finance-domain specific, either previously used in financial NLP research or newly developed by the authors.

---
* Paper: [Do We Need Domain-Specific Embedding Models? An Empirical Investigation](https://arxiv.org/pdf/2409.18511v1)
* GitHub: [FinMTEB](https://github.com/yixuantt/FinMTEB/blob/main/README.md)