Text Classification
TF-Keras
English
stormsidali2001 commited on
Commit
36365ca
1 Parent(s): 1b01a8e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -14
README.md CHANGED
@@ -8,7 +8,7 @@ widget:
8
  score: 0.9
9
  - label: MOVE_1
10
  score: 0.05
11
- - label: MOVE_1
12
  score: 0.05
13
  license: mit
14
  datasets:
@@ -18,29 +18,53 @@ language:
18
  metrics:
19
  - f1
20
  - accuracy
21
- base_model: google-bert/bert-base-cased
22
  ---
23
 
24
- ## Model description
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
 
26
- More information needed
27
 
28
- ## Intended uses & limitations
29
 
30
- More information needed
31
 
32
- ## Training and evaluation data
33
 
34
- More information needed
 
 
35
 
36
- ## Metadata
37
 
 
38
 
39
- ## Model Plot
 
40
 
41
- <details>
42
- <summary>View Model Plot</summary>
43
 
44
- ![Model Image](./model.png)
 
45
 
46
- </details>
 
8
  score: 0.9
9
  - label: MOVE_1
10
  score: 0.05
11
+ - label: MOVE_2
12
  score: 0.05
13
  license: mit
14
  datasets:
 
18
  metrics:
19
  - f1
20
  - accuracy
21
+ base_model: google/bert-base-cased
22
  ---
23
 
24
+ ## IMRaD Introduction Move Classifier
25
+
26
+ This model is a fine-tuned BERT model designed to classify sentences from the introductions of scientific research papers into one of three IMRaD moves:
27
+
28
+ * **MOVE_0 (Establishing a Research Territory):** Setting the context and background information for the research.
29
+ * **MOVE_1 (Establishing a Niche):** Identifying a gap or problem in existing research.
30
+ * **MOVE_2 (Occupying the Niche):** Proposing a solution or approach to address the identified gap.
31
+
32
+ ## Intended Uses & Limitations
33
+
34
+ **Intended Uses:**
35
+
36
+ * **Scientific Writing Assistance:** Help researchers and students analyze and improve the structure of their introductions by identifying the IMRaD moves present in each sentence.
37
+ * **Literature Review Analysis:** Assist in quickly understanding the rhetorical structure of introductions in a set of research papers.
38
+ * **Educational Tool:** Illustrate IMRaD concepts and their practical application in scientific writing.
39
+
40
+ **Limitations:**
41
+
42
+ * **Domain Specificity:** The model was trained on a dataset of scientific research papers and might not perform as well on other types of text.
43
+ * **Accuracy:** While the model achieves good accuracy, it's not perfect. Predictions should be reviewed carefully, especially in complex or ambiguous sentences.
44
+ * **Sentence-Level Classification:** The model classifies individual sentences. It does not provide an overall analysis of the entire introduction.
45
 
46
+ ## Training and Evaluation Data
47
 
48
+ The model was trained and evaluated on the "IMRAD Introduction Sentences Moves & Sub-moves Dataset" available on Hugging Face: [https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset](https://huggingface.co/datasets/stormsidali2001/IMRAD-introduction-sentences-moves-sub-moves-dataset)
49
 
50
+ The dataset consists of sentences extracted from scientific research paper introductions, manually labeled with their corresponding IMRaD moves.
51
 
52
+ **Training Details:**
53
 
54
+ * The `bert-base-cased` model from Google was used as the base model.
55
+ * Fine-tuning was performed using a TensorFlow/Keras implementation.
56
+ * Evaluation metrics include F1 score and accuracy.
57
 
58
+ ## How to Use
59
 
60
+ You can use this model with the `pipeline` function from the `transformers` library:
61
 
62
+ ```python
63
+ from transformers import pipeline
64
 
65
+ classifier = pipeline("text-classification", model="your-username/your-model-name")
 
66
 
67
+ sentence = "Electronic cigarettes were introduced into the US market in 2007."
68
+ result = classifier(sentence)
69
 
70
+ print(result)