create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,40 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: en
|
3 |
+
license: mit
|
4 |
+
---
|
5 |
+
|
6 |
+
🦔 HEDGEhog 🦔: BERT-based multi-class uncertainty cues recognition
|
7 |
+
====================================================================
|
8 |
+
|
9 |
+
# Description
|
10 |
+
TBD
|
11 |
+
|
12 |
+
# Training Data
|
13 |
+
HEDGEhog is trained and evaluated on the [Szeged Uncertainty Corpus](https://rgai.inf.u-szeged.hu/node/160) (Szarvas et al. 2012<sup>1</sup>). The original sentence-level XML version of this dataset is available [here](https://rgai.inf.u-szeged.hu/node/160).
|
14 |
+
|
15 |
+
The token-level version that was used for the training can be downloaded from [here](https://1drv.ms/u/s!AvPkt_QxBozXk7BiazucDqZkVxLo6g?e=IisuM6) in a form of pickled pandas DataFrame's. You can download either the split sets (```train.pkl``` 137MB, ```test.pkl``` 17MB, ```dev.pkl``` 17MB) or the full dataset (```szeged_fixed.pkl``` 172MB).
|
16 |
+
|
17 |
+
Each row in the df contains a token, its features (these are not relevant for HEDGEhog; they were used to train the baseline CRF model, see [here](https://github.com/vanboefer/uncertainty_crf)), its sentence ID, and its label. The labels refer to different types of semantic uncertainty (Szarvas et al. 2012) -
|
18 |
+
|
19 |
+
- **E**pistemic: the proposition is possible, but its truth-value cannot be decided at the moment. Example: *She **may** be already asleep.*
|
20 |
+
- **I**nvestigation: the proposition is in the process of having its truth-value determined. Example: *She **examined** the role of NF-kappaB in protein activation.*
|
21 |
+
- **D**oxatic: the proposition expresses beliefs and hypotheses, which may be known as true or false by others. Example: *She **believes** that the Earth is flat*
|
22 |
+
- Co**N**dition: the proposition is true or false based on the truth-value of another proposition. Example: ***If** she gets the job, she will move to Utrecht.*
|
23 |
+
- **C**ertain: the token is not an uncertainty cue.
|
24 |
+
|
25 |
+
|
26 |
+
# Training Procedure
|
27 |
+
TBD
|
28 |
+
|
29 |
+
# Evaluation Results
|
30 |
+
class | precision | recall | F1-score | support
|
31 |
+
---|---|---|---|---
|
32 |
+
Epistemic | 0.90 | 0.85 | 0.88 | 624
|
33 |
+
Doxatic | 0.88 | 0.92 | 0.90 | 142
|
34 |
+
Investigation | 0.83 | 0.86 | 0.84 | 111
|
35 |
+
Condition | 0.85 | 0.87 | 0.86 | 86
|
36 |
+
Certain | 1.00 | 1.00 | 1.00 | 104,751
|
37 |
+
**macro average** | **0.89** | **0.90** | **0.89** | 105,714
|
38 |
+
|
39 |
+
# References
|
40 |
+
<sup>1</sup> Szarvas, G., Vincze, V., Farkas, R., Móra, G., & Gurevych, I. (2012). Cross-genre and cross-domain detection of semantic uncertainty. *Computational Linguistics, 38*(2), 335-367.
|