Update README.md
README.md
CHANGED
@@ -1,3 +1,69 @@
---
license: cc-by-4.0
language:
- he
inference: false
---

# DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew

A state-of-the-art language model for Hebrew, introduced in [this paper](https://arxiv.org/abs/2308.16687).

This is the DictaBERT model fine-tuned for the question-answering task.

Sample usage:

```python
from transformers import pipeline

# Load the fine-tuned Hebrew question-answering model from the Hugging Face Hub
oracle = pipeline('question-answering', model='dicta-il/dictabert-qa')

# Hebrew passage about user profiling and legislation restricting cookies
context = 'בניית פרופילים של משתמשים נחשבת על ידי רבים כאיום פוטנציאלי על הפרטיות. מסיבה זו הגבילו חלק מהמדינות באמצעות חקיקה את המידע שניתן להשיג באמצעות עוגיות ואת אופן השימוש בעוגיות. ארצות הברית, למשל, קבעה חוקים נוקשים בכל הנוגע ליצירת עוגיות חדשות. חוקים אלו, אשר נקבעו בשנת 2000, נקבעו לאחר שנחשף כי המשרד ליישום המדיניות של הממשל האמריקאני נגד השימוש בסמים (ONDCP) בבית הלבן השתמש בעוגיות כדי לעקוב אחרי משתמשים שצפו בפרסומות נגד השימוש בסמים במטרה לבדוק האם משתמשים אלו נכנסו לאתרים התומכים בשימוש בסמים. דניאל בראנט, פעיל הדוגל בפרטיות המשתמשים באינטרנט, חשף כי ה-CIA שלח עוגיות קבועות למחשבי אזרחים במשך עשר שנים. ב-25 בדצמבר 2005 גילה בראנט כי הסוכנות לביטחון לאומי (ה-NSA) השאירה שתי עוגיות קבועות במחשבי מבקרים בגלל שדרוג תוכנה. לאחר שהנושא פורסם, הם ביטלו מיד את השימוש בהן.'
# English gloss: "How was the information obtainable through cookies restricted?"
question = 'כיצד הוגבל המידע שניתן להשיג באמצעות העוגיות?'

# Returns a dict with the answer text, its character offsets, and a confidence score
oracle(question=question, context=context)
```

Output:
```json
{
  "score": 0.998887836933136,
  "start": 101,
  "end": 114,
  "answer": "באמצעות חקיקה"
}
```
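
The `start` and `end` fields are character offsets into the context string, so the answer can be recovered by slicing. The sketch below illustrates this without loading the model: `extract_answer` is a hypothetical helper (not part of `transformers`), `context_start` is only the opening of the sample context, cut off right after the answer span, and `result` is copied from the sample output.

```python
def extract_answer(context: str, result: dict) -> str:
    """Slice the answer out of the context using the reported character offsets."""
    return context[result["start"]:result["end"]]


# Opening of the sample context, truncated right after the answer span (assumption:
# the full context from the sample usage would give the same slice).
context_start = (
    "בניית פרופילים של משתמשים נחשבת על ידי רבים כאיום פוטנציאלי על הפרטיות. "
    "מסיבה זו הגבילו חלק מהמדינות באמצעות חקיקה"
)

# Values copied from the sample output.
result = {
    "score": 0.998887836933136,
    "start": 101,
    "end": 114,
    "answer": "באמצעות חקיקה",
}

span = extract_answer(context_start, result)
print(span == result["answer"])
```

The answer, "באמצעות חקיקה", translates roughly to "through legislation". Slicing by offsets is useful when the answer needs to be highlighted in the original passage rather than just displayed.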

## Citation

If you use DictaBERT in your research, please cite *DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew*.

**BibTeX:**

```bibtex
@misc{shmidman2023dictabert,
      title={DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew},
      author={Shaltiel Shmidman and Avi Shmidman and Moshe Koppel},
      year={2023},
      eprint={2308.16687},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```

## License

Shield: [![CC BY 4.0][cc-by-shield]][cc-by]

This work is licensed under a
[Creative Commons Attribution 4.0 International License][cc-by].

[![CC BY 4.0][cc-by-image]][cc-by]

[cc-by]: http://creativecommons.org/licenses/by/4.0/
[cc-by-image]: https://i.creativecommons.org/l/by/4.0/88x31.png
[cc-by-shield]: https://img.shields.io/badge/License-CC%20BY%204.0-lightgrey.svg