stormsidali2001
commited on
Commit
•
36365ca
1
Parent(s):
1b01a8e
Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ widget:
|
|
8 |
score: 0.9
|
9 |
- label: MOVE_1
|
10 |
score: 0.05
|
11 |
-
- label:
|
12 |
score: 0.05
|
13 |
license: mit
|
14 |
datasets:
|
@@ -18,29 +18,53 @@ language:
|
|
18 |
metrics:
|
19 |
- f1
|
20 |
- accuracy
|
21 |
-
base_model: google
|
22 |
---
|
23 |
|
24 |
-
##
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
25 |
|
26 |
-
|
27 |
|
28 |
-
|
29 |
|
30 |
-
|
31 |
|
32 |
-
|
33 |
|
34 |
-
|
|
|
|
|
35 |
|
36 |
-
##
|
37 |
|
|
|
38 |
|
39 |
-
|
|
|
40 |
|
41 |
-
|
42 |
-
<summary>View Model Plot</summary>
|
43 |
|
44 |
-
|
|
|
45 |
|
46 |
-
|
|
|
8 |
score: 0.9
|
9 |
- label: MOVE_1
|
10 |
score: 0.05
|
11 |
+
- label: MOVE_2
|
12 |
score: 0.05
|
13 |
license: mit
|
14 |
datasets:
|
|
|
18 |
metrics:
|
19 |
- f1
|
20 |
- accuracy
|
21 |
+
base_model: google/bert-base-cased
|
22 |
---
|
23 |
|
24 |
+
## IMRaD Introduction Move Classifier
|
25 |
+
|
26 |
+
This model is a fine-tuned BERT model designed to classify sentences from the introductions of scientific research papers into one of three IMRaD moves:
|
27 |
+
|
28 |
+
* **MOVE_0 (Establishing a Research Territory):** Setting the context and background information for the research.
|
29 |
+
* **MOVE_1 (Establishing a Niche):** Identifying a gap or problem in existing research.
|
30 |
+
* **MOVE_2 (Occupying the Niche):** Proposing a solution or approach to address the identified gap.
|
31 |
+
|
32 |
+
## Intended Uses & Limitations
|
33 |
+
|
34 |
+
**Intended Uses:**
|
35 |
+
|
36 |
+
* **Scientific Writing Assistance:** Help researchers and students analyze and improve the structure of their introductions by identifying the IMRaD moves present in each sentence.
|
37 |
+
* **Literature Review Analysis:** Assist in quickly understanding the rhetorical structure of introductions in a set of research papers.
|
38 |
+
* **Educational Tool:** Illustrate IMRaD concepts and their practical application in scientific writing.
|
39 |
+
|
40 |
+
**Limitations:**
|
41 |
+
|
42 |
+
* **Domain Specificity:** The model was trained on a dataset of scientific research papers and might not perform as well on other types of text.
|
43 |
+
* **Accuracy:** While the model achieves good accuracy, it's not perfect. Predictions should be reviewed carefully, especially in complex or ambiguous sentences.
|
44 |
+
* **Sentence-Level Classification:** The model classifies individual sentences. It does not provide an overall analysis of the entire introduction.
|
45 |
|
46 |
+
## Training and Evaluation Data
|
47 |
|
48 |
+
The model was trained and evaluated on the "IMRAD Introduction Sentences Moves & Sub-moves Dataset" available on Hugging Face: [https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset](https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset)
|
49 |
|
50 |
+
The dataset consists of sentences extracted from scientific research paper introductions, manually labeled with their corresponding IMRaD moves.
|
51 |
|
52 |
+
**Training Details:**
|
53 |
|
54 |
+
* The `bert-base-cased` model from Google was used as the base model.
|
55 |
+
* Fine-tuning was performed using a TensorFlow/Keras implementation.
|
56 |
+
* Evaluation metrics include F1 score and accuracy.
|
57 |
|
58 |
+
## How to Use
|
59 |
|
60 |
+
You can use this model with the `pipeline` function from the `transformers` library:
|
61 |
|
62 |
+
```python
|
63 |
+
from transformers import pipeline
|
64 |
|
65 |
+
classifier = pipeline("text-classification", model="your-username/your-model-name")
|
|
|
66 |
|
67 |
+
sentence = "Electronic cigarettes were introduced into the US market in 2007."
|
68 |
+
result = classifier(sentence)
|
69 |
|
70 |
+
print(result)
|