Text Classification
TF-Keras
English
File size: 4,919 Bytes
84c791a
3e200d5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
---
library_name: tf-keras
pipeline_tag: text-classification
widget:
- text: "However, existing research has primarily focused on urban environments, neglecting the unique challenges faced in rural areas."
  output:
  - label: Claim something is wrong with the previous research
    score: 0.1
  - label: Highlight a gap in the field
    score: 0.85
  - label: Raise a question where research in the field is unclear
    score: 0.03
  - label: Extend prior research to add more information on the topic
    score: 0.02
- text: "Previous studies have failed to adequately address the long-term effects of this intervention."
  output:
  - label: Claim something is wrong with the previous research
    score: 0.9
  - label: Highlight a gap in the field
    score: 0.05
  - label: Raise a question where research in the field is unclear
    score: 0.03
  - label: Extend prior research to add more information on the topic
    score: 0.02
- text: "It is therefore crucial to investigate the effectiveness of [your approach] in a rural context."
  output:
  - label: Claim something is wrong with the previous research
    score: 0.05
  - label: Highlight a gap in the field
    score: 0.15
  - label: Raise a question where research in the field is unclear
    score: 0.1
  - label: Extend prior research to add more information on the topic
    score: 0.7
license: mit
datasets:
- stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset
language:
- en
metrics:
- f1
- accuracy
base_model: google/bert-base-cased
---

## IMRaD Introduction Move 1 Sub-move Classifier

This model is a fine-tuned BERT model that classifies sentences from the "Establishing a Niche" (Move 1) section of scientific research paper introductions into their corresponding sub-moves:

* **Claim something is wrong with the previous research:**  Pointing out limitations, flaws, or areas where past research falls short.
* **Highlight a gap in the field:** Identifying areas where knowledge or research is lacking.
* **Raise a question where research in the field is unclear:** Presenting an unanswered question or ambiguity in existing research. 
* **Extend prior research to add more information on the topic:** Suggesting a new direction or contribution building on previous work.

**Parent Classifier:**

This model works in tandem with the main IMRaD Introduction Move Classifier: [https://huggingface.co/stormsidali2001/IMRAD_introduction_moves_classifier](https://huggingface.co/stormsidali2001/IMRAD_introduction_moves_classifier)

First, use the parent classifier to identify sentences belonging to "Establishing a Niche" (Move 1). Then, utilize this sub-move classifier to analyze the specific role each Move 1 sentence plays in establishing the research niche.

## Intended Uses & Limitations

**Intended Uses:**

* **Scientific Writing Assistance:** Help researchers and students analyze and strengthen their "Establishing a Niche" section by precisely categorizing each sentence's sub-move. 
* **Literature Review Analysis:**  Gain a deeper understanding of how authors establish the need for their research by identifying the specific sub-moves used in Move 1.
* **Educational Tool:**  Demonstrate the various sub-moves employed to establish a research niche in scientific writing.

**Limitations:**

* **Domain Specificity:**  Trained on scientific research papers; accuracy may vary on other text types.
* **Sentence-Level Classification:**  Focuses on individual sentences; does not provide a holistic analysis of the entire Move 1 section.
* **Prediction Accuracy:**  While generally accurate, the model might misclassify complex or ambiguous sentences. Review predictions critically.

## Training and Evaluation Data

Trained and evaluated on a subset of the "IMRAD Introduction Sentences Moves & Sub-moves Dataset": [https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset](https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset)

Specifically, the model uses sentences labeled as Move 1, further classified into the four sub-moves. 

**Training Details:**

* **Base Model:**  `google/bert-base-cased`
* **Implementation:** TensorFlow/Keras
* **Evaluation Metrics:** F1 score and accuracy 

## How to Use

```python
from transformers import pipeline

# Parent classifier
move_classifier = pipeline("text-classification", model="stormsidali2001/IMRAD_introduction_moves_classifier")

# Move 1 sub-move classifier
submove_classifier_1 = pipeline("text-classification", model="stormsidali2001/IMRAD-introduction-move-one-sub-moves-classifier")

sentence = "This gap in research highlights the need for further investigation into [topic]." 

move_result = move_classifier(sentence)
move = move_result[0]['label']

if move == "Establishing a Niche":
    submove_result = submove_classifier_1(sentence)
    print(submove_result)