Adriane Boyd commited on
Commit
25fc841
1 Parent(s): 1d03e65

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -64,11 +64,11 @@ Princeton University and LICENSEE agrees to preserve same.```
64
 
65
 
66
 
67
- # GloVe Common Crawl
68
 
69
- * Author: Jeffrey Pennington, Richard Socher, and Christopher D. Manning
70
- * URL: https://nlp.stanford.edu/projects/glove/
71
- * License: Public Domain Dedication and License v1.0
72
 
73
  ```
74
  The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").
 
64
 
65
 
66
 
67
+ # Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)
68
 
69
+ * Author: Explosion
70
+ * URL: https://github.com/explosion/spacy-vectors-builder
71
+ * License: CC0
72
 
73
  ```
74
  The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").
README.md CHANGED
@@ -14,41 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8602117695
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8462540064
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8531758053
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9738145328
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
- value: 0.9188508811
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
- value: 0.9008477499
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
- value: 0.9033533215
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_lg
54
 
@@ -57,12 +57,12 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_lg` |
60
- | **Version** | `3.3.0` |
61
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
- | **Vectors** | 684830 keys, 342918 unique vectors (300 dimensions) |
65
- | **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University)<br />[GloVe Common Crawl](https://nlp.stanford.edu/projects/glove/) (Jeffrey Pennington, Richard Socher, and Christopher D. Manning) |
66
  | **License** | `MIT` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
@@ -70,11 +70,11 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
70
 
71
  <details>
72
 
73
- <summary>View label scheme (112 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
- | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
@@ -88,12 +88,12 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
- | `TAG_ACC` | 97.38 |
92
- | `SENTS_P` | 91.77 |
93
- | `SENTS_R` | 88.94 |
94
- | `SENTS_F` | 90.34 |
95
- | `DEP_UAS` | 91.89 |
96
- | `DEP_LAS` | 90.08 |
97
- | `ENTS_P` | 86.02 |
98
- | `ENTS_R` | 84.63 |
99
- | `ENTS_F` | 85.32 |
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8636641533
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8489583333
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8562481059
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9734404547
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
+ value: 0.9204363007
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
+ value: 0.9023174614
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
+ value: 0.90444794
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_lg
54
 
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_lg` |
60
+ | **Version** | `3.4.0` |
61
+ | **spaCy** | `>=3.4.0,<3.5.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
+ | **Vectors** | 514157 keys, 514157 unique vectors (300 dimensions) |
65
+ | **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University)<br />[Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)](https://github.com/explosion/spacy-vectors-builder) (Explosion) |
66
  | **License** | `MIT` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (113 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
+ | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
 
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
+ | `TAG_ACC` | 97.34 |
92
+ | `SENTS_P` | 91.79 |
93
+ | `SENTS_R` | 89.14 |
94
+ | `SENTS_F` | 90.44 |
95
+ | `DEP_UAS` | 92.04 |
96
+ | `DEP_LAS` | 90.23 |
97
+ | `ENTS_P` | 86.37 |
98
+ | `ENTS_R` | 84.90 |
99
+ | `ENTS_F` | 85.62 |
accuracy.json CHANGED
@@ -3,328 +3,328 @@
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
- "tag_acc": 0.9738145328,
7
- "sents_p": 0.9177103185,
8
- "sents_r": 0.8894386173,
9
- "sents_f": 0.9033533215,
10
- "dep_uas": 0.9188508811,
11
- "dep_las": 0.9008477499,
12
  "dep_las_per_type": {
13
  "prep": {
14
- "p": 0.8537864878,
15
- "r": 0.8645418327,
16
- "f": 0.8591305004
17
  },
18
  "det": {
19
- "p": 0.9790682522,
20
- "r": 0.9802658403,
21
- "f": 0.9796666802
22
  },
23
  "pobj": {
24
- "p": 0.9633579437,
25
- "r": 0.9684272531,
26
- "f": 0.965885947
27
  },
28
  "nsubj": {
29
- "p": 0.9564757243,
30
- "r": 0.9502738226,
31
- "f": 0.9533646873
32
  },
33
  "aux": {
34
- "p": 0.9809760868,
35
- "r": 0.9823733642,
36
- "f": 0.9816742283
37
  },
38
  "advmod": {
39
- "p": 0.8550492715,
40
- "r": 0.8541140838,
41
- "f": 0.8545814218
42
  },
43
  "relcl": {
44
- "p": 0.7709000356,
45
- "r": 0.7862844702,
46
- "f": 0.7785162565
47
  },
48
  "root": {
49
- "p": 0.9183576195,
50
- "r": 0.889702487,
51
- "f": 0.9038029821
52
  },
53
  "xcomp": {
54
- "p": 0.882620883,
55
- "r": 0.9041636755,
56
- "f": 0.8932624113
57
  },
58
  "amod": {
59
- "p": 0.9195970101,
60
- "r": 0.9166180758,
61
- "f": 0.9181051265
62
  },
63
  "compound": {
64
- "p": 0.9193539526,
65
- "r": 0.9320004455,
66
- "f": 0.9256340054
67
  },
68
  "poss": {
69
- "p": 0.9711422846,
70
- "r": 0.9754428341,
71
- "f": 0.9732878088
72
  },
73
  "ccomp": {
74
- "p": 0.7727868239,
75
- "r": 0.8409368635,
76
- "f": 0.8054228031
77
  },
78
  "attr": {
79
- "p": 0.8955042527,
80
- "r": 0.9297729184,
81
- "f": 0.912316897
82
  },
83
  "case": {
84
- "p": 0.9758144126,
85
- "r": 0.9894894895,
86
- "f": 0.9826043738
87
  },
88
  "mark": {
89
- "p": 0.9062829989,
90
- "r": 0.9096449391,
91
- "f": 0.9079608569
92
  },
93
  "intj": {
94
- "p": 0.6653322658,
95
- "r": 0.6087912088,
96
- "f": 0.635807192
97
  },
98
  "advcl": {
99
- "p": 0.6779661017,
100
- "r": 0.6648199446,
101
- "f": 0.6713286713
102
  },
103
  "cc": {
104
- "p": 0.8292624233,
105
- "r": 0.824303313,
106
- "f": 0.8267754319
107
  },
108
  "neg": {
109
- "p": 0.9393336648,
110
- "r": 0.9478173608,
111
- "f": 0.9435564436
112
  },
113
  "conj": {
114
- "p": 0.763665795,
115
- "r": 0.7720292044,
116
- "f": 0.7678247261
117
  },
118
  "nsubjpass": {
119
- "p": 0.9263266358,
120
- "r": 0.9220512821,
121
- "f": 0.9241840144
122
  },
123
  "auxpass": {
124
- "p": 0.9499329459,
125
- "r": 0.9681093394,
126
- "f": 0.9589350181
127
  },
128
  "dobj": {
129
- "p": 0.926432648,
130
- "r": 0.9442983505,
131
- "f": 0.9352801894
132
  },
133
  "nummod": {
134
- "p": 0.9362134689,
135
- "r": 0.9303030303,
136
- "f": 0.9332488917
137
  },
138
  "npadvmod": {
139
- "p": 0.7723030982,
140
- "r": 0.734991119,
141
- "f": 0.753185293
142
  },
143
  "prt": {
144
- "p": 0.8160066007,
145
- "r": 0.8862007168,
146
- "f": 0.8496563574
147
  },
148
  "pcomp": {
149
- "p": 0.8800841515,
150
- "r": 0.8788515406,
151
- "f": 0.8794674142
152
  },
153
  "expl": {
154
- "p": 0.9809322034,
155
- "r": 0.9914346895,
156
- "f": 0.9861554846
157
  },
158
  "acl": {
159
- "p": 0.7556456283,
160
- "r": 0.7119476268,
161
- "f": 0.7331460674
162
  },
163
  "agent": {
164
- "p": 0.8991452991,
165
- "r": 0.9426523297,
166
- "f": 0.9203849519
167
  },
168
  "dative": {
169
- "p": 0.810298103,
170
- "r": 0.6857798165,
171
- "f": 0.7428571429
172
  },
173
  "acomp": {
174
- "p": 0.9111721612,
175
- "r": 0.9024943311,
176
- "f": 0.9068124858
177
  },
178
  "dep": {
179
- "p": 0.3930131004,
180
- "r": 0.1461038961,
181
- "f": 0.2130177515
182
  },
183
  "csubj": {
184
- "p": 0.7068965517,
185
- "r": 0.7278106509,
186
- "f": 0.7172011662
187
  },
188
  "quantmod": {
189
- "p": 0.8746594005,
190
- "r": 0.7822908205,
191
- "f": 0.8259005146
192
  },
193
  "nmod": {
194
- "p": 0.7651217596,
195
- "r": 0.5935405241,
196
- "f": 0.6684969115
197
  },
198
  "appos": {
199
- "p": 0.6994459834,
200
- "r": 0.6572668113,
201
- "f": 0.6777007381
202
  },
203
  "predet": {
204
- "p": 0.8380566802,
205
- "r": 0.8884120172,
206
- "f": 0.8625
207
  },
208
  "preconj": {
209
- "p": 0.537037037,
210
- "r": 0.6744186047,
211
- "f": 0.5979381443
212
  },
213
  "oprd": {
214
- "p": 0.8477508651,
215
- "r": 0.7313432836,
216
- "f": 0.7852564103
217
  },
218
  "parataxis": {
219
- "p": 0.6187845304,
220
- "r": 0.4859002169,
221
- "f": 0.5443499392
222
  },
223
  "meta": {
224
- "p": 1.0,
225
- "r": 0.3269230769,
226
- "f": 0.4927536232
227
  },
228
  "csubjpass": {
229
- "p": 0.5555555556,
230
- "r": 0.8333333333,
231
- "f": 0.6666666667
232
  }
233
  },
234
- "ents_p": 0.8602117695,
235
- "ents_r": 0.8462540064,
236
- "ents_f": 0.8531758053,
237
  "ents_per_type": {
238
  "DATE": {
239
- "p": 0.872593068,
240
- "r": 0.8631746032,
241
- "f": 0.8678582828
242
  },
243
  "GPE": {
244
- "p": 0.9257256688,
245
- "r": 0.9073919107,
246
- "f": 0.916467108
247
  },
248
  "ORDINAL": {
249
  "p": 0.787965616,
250
  "r": 0.8540372671,
251
  "f": 0.8196721311
252
  },
 
 
 
 
 
253
  "ORG": {
254
- "p": 0.8203309693,
255
- "r": 0.8279427359,
256
- "f": 0.8241192769
 
 
 
 
 
 
 
 
 
 
257
  },
258
  "CARDINAL": {
259
- "p": 0.8304398148,
260
- "r": 0.8531510107,
261
- "f": 0.8416422287
262
  },
263
  "PERSON": {
264
- "p": 0.8953229399,
265
- "r": 0.9184073107,
266
- "f": 0.9067182214
267
  },
268
  "NORP": {
269
- "p": 0.8794048551,
270
- "r": 0.8984,
271
- "f": 0.8888009497
272
- },
273
- "LOC": {
274
- "p": 0.7147766323,
275
- "r": 0.6624203822,
276
- "f": 0.6876033058
277
- },
278
- "FAC": {
279
- "p": 0.3949579832,
280
- "r": 0.3615384615,
281
- "f": 0.3775100402
282
  },
283
  "TIME": {
284
- "p": 0.71875,
285
- "r": 0.6725146199,
286
- "f": 0.6948640483
287
  },
288
- "QUANTITY": {
289
- "p": 0.8014184397,
290
- "r": 0.6208791209,
291
- "f": 0.6996904025
292
  },
293
  "EVENT": {
294
- "p": 0.6354166667,
295
- "r": 0.3505747126,
296
- "f": 0.4518518519
297
  },
298
- "WORK_OF_ART": {
299
- "p": 0.5,
300
- "r": 0.3092783505,
301
- "f": 0.3821656051
302
  },
303
  "MONEY": {
304
- "p": 0.9039145907,
305
- "r": 0.8996458087,
306
- "f": 0.9017751479
307
- },
308
- "LAW": {
309
- "p": 0.6428571429,
310
- "r": 0.421875,
311
- "f": 0.5094339623
312
  },
313
  "PERCENT": {
314
- "p": 0.9187898089,
315
- "r": 0.8836140888,
316
- "f": 0.9008587041
317
  },
318
  "LANGUAGE": {
319
- "p": 0.75,
320
- "r": 0.65625,
321
- "f": 0.7
322
  },
323
  "PRODUCT": {
324
- "p": 0.6097560976,
325
- "r": 0.2369668246,
326
- "f": 0.3412969283
327
  }
328
  },
329
- "speed": 7281.6726563626
330
  }
 
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
+ "tag_acc": 0.9734404547,
7
+ "sents_p": 0.9179347826,
8
+ "sents_r": 0.8913516723,
9
+ "sents_f": 0.90444794,
10
+ "dep_uas": 0.9204363007,
11
+ "dep_las": 0.9023174614,
12
  "dep_las_per_type": {
13
  "prep": {
14
+ "p": 0.8597877625,
15
+ "r": 0.8669322709,
16
+ "f": 0.8633452361
17
  },
18
  "det": {
19
+ "p": 0.9797074284,
20
+ "r": 0.9803066134,
21
+ "f": 0.9800069293
22
  },
23
  "pobj": {
24
+ "p": 0.963921354,
25
+ "r": 0.9683879835,
26
+ "f": 0.9661495063
27
  },
28
  "nsubj": {
29
+ "p": 0.9573359244,
30
+ "r": 0.94966046,
31
+ "f": 0.9534827457
32
  },
33
  "aux": {
34
+ "p": 0.981595092,
35
+ "r": 0.9828184813,
36
+ "f": 0.9822064057
37
  },
38
  "advmod": {
39
+ "p": 0.8567202029,
40
+ "r": 0.8526838297,
41
+ "f": 0.8546972508
42
  },
43
  "relcl": {
44
+ "p": 0.7682926829,
45
+ "r": 0.7772133527,
46
+ "f": 0.7727272727
47
  },
48
  "root": {
49
+ "p": 0.9196058444,
50
+ "r": 0.8926710205,
51
+ "f": 0.9059382741
52
  },
53
  "xcomp": {
54
+ "p": 0.8853797019,
55
+ "r": 0.8955491744,
56
+ "f": 0.8904354033
57
  },
58
  "amod": {
59
+ "p": 0.9199114468,
60
+ "r": 0.9153223194,
61
+ "f": 0.9176111454
62
  },
63
  "compound": {
64
+ "p": 0.9198242724,
65
+ "r": 0.9328358209,
66
+ "f": 0.9262843555
67
  },
68
  "poss": {
69
+ "p": 0.9735205617,
70
+ "r": 0.9768518519,
71
+ "f": 0.9751833618
72
  },
73
  "ccomp": {
74
+ "p": 0.7757201646,
75
+ "r": 0.8446028513,
76
+ "f": 0.8086973479
77
  },
78
  "attr": {
79
+ "p": 0.9064542484,
80
+ "r": 0.93313709,
81
+ "f": 0.919602155
82
  },
83
  "case": {
84
+ "p": 0.9797330697,
85
+ "r": 0.991991992,
86
+ "f": 0.9858244218
87
  },
88
  "mark": {
89
+ "p": 0.9015625,
90
+ "r": 0.9173290938,
91
+ "f": 0.9093774626
92
  },
93
  "intj": {
94
+ "p": 0.680533752,
95
+ "r": 0.6351648352,
96
+ "f": 0.6570670709
97
  },
98
  "advcl": {
99
+ "p": 0.6686002522,
100
+ "r": 0.6675900277,
101
+ "f": 0.6680947581
102
  },
103
  "cc": {
104
+ "p": 0.8381204182,
105
+ "r": 0.8341107523,
106
+ "f": 0.8361107781
107
  },
108
  "neg": {
109
+ "p": 0.9451371571,
110
+ "r": 0.9508278976,
111
+ "f": 0.947973987
112
  },
113
  "conj": {
114
+ "p": 0.7760468594,
115
+ "r": 0.7838620342,
116
+ "f": 0.7799348697
117
  },
118
  "nsubjpass": {
119
+ "p": 0.9234693878,
120
+ "r": 0.9282051282,
121
+ "f": 0.925831202
122
  },
123
  "auxpass": {
124
+ "p": 0.9468791501,
125
+ "r": 0.9744874715,
126
+ "f": 0.9604849573
127
  },
128
  "dobj": {
129
+ "p": 0.9278213166,
130
+ "r": 0.9434217866,
131
+ "f": 0.9355565214
132
  },
133
  "nummod": {
134
+ "p": 0.9377224199,
135
+ "r": 0.9315656566,
136
+ "f": 0.9346338992
137
  },
138
  "npadvmod": {
139
+ "p": 0.7837218189,
140
+ "r": 0.7285968028,
141
+ "f": 0.7551546392
142
  },
143
  "prt": {
144
+ "p": 0.8103025348,
145
+ "r": 0.8879928315,
146
+ "f": 0.8473706712
147
  },
148
  "pcomp": {
149
+ "p": 0.8873937677,
150
+ "r": 0.8774509804,
151
+ "f": 0.8823943662
152
  },
153
  "expl": {
154
+ "p": 0.9809725159,
155
+ "r": 0.9935760171,
156
+ "f": 0.9872340426
157
  },
158
  "acl": {
159
+ "p": 0.7534883721,
160
+ "r": 0.7070376432,
161
+ "f": 0.7295243456
162
  },
163
  "agent": {
164
+ "p": 0.9042735043,
165
+ "r": 0.9480286738,
166
+ "f": 0.9256342957
167
  },
168
  "dative": {
169
+ "p": 0.7725,
170
+ "r": 0.7087155963,
171
+ "f": 0.7392344498
172
  },
173
  "acomp": {
174
+ "p": 0.9080091533,
175
+ "r": 0.8997732426,
176
+ "f": 0.9038724374
177
  },
178
  "dep": {
179
+ "p": 0.3263473054,
180
+ "r": 0.1769480519,
181
+ "f": 0.2294736842
182
  },
183
  "csubj": {
184
+ "p": 0.7045454545,
185
+ "r": 0.7337278107,
186
+ "f": 0.7188405797
187
  },
188
  "quantmod": {
189
+ "p": 0.8531468531,
190
+ "r": 0.7928513404,
191
+ "f": 0.8218947368
192
  },
193
  "nmod": {
194
+ "p": 0.7539432177,
195
+ "r": 0.5825716027,
196
+ "f": 0.6572705397
197
  },
198
  "appos": {
199
+ "p": 0.6997270246,
200
+ "r": 0.6672451193,
201
+ "f": 0.6831001555
202
  },
203
  "predet": {
204
+ "p": 0.8524590164,
205
+ "r": 0.8927038627,
206
+ "f": 0.8721174004
207
  },
208
  "preconj": {
209
+ "p": 0.5684210526,
210
+ "r": 0.6279069767,
211
+ "f": 0.5966850829
212
  },
213
  "oprd": {
214
+ "p": 0.8322368421,
215
+ "r": 0.7552238806,
216
+ "f": 0.7918622848
217
  },
218
  "parataxis": {
219
+ "p": 0.6323119777,
220
+ "r": 0.4924078091,
221
+ "f": 0.5536585366
222
  },
223
  "meta": {
224
+ "p": 0.8461538462,
225
+ "r": 0.4230769231,
226
+ "f": 0.5641025641
227
  },
228
  "csubjpass": {
229
+ "p": 0.4285714286,
230
+ "r": 0.5,
231
+ "f": 0.4615384615
232
  }
233
  },
234
+ "ents_p": 0.8636641533,
235
+ "ents_r": 0.8489583333,
236
+ "ents_f": 0.8562481059,
237
  "ents_per_type": {
238
  "DATE": {
239
+ "p": 0.8711209626,
240
+ "r": 0.8733333333,
241
+ "f": 0.8722257451
242
  },
243
  "GPE": {
244
+ "p": 0.9365811473,
245
+ "r": 0.9062761506,
246
+ "f": 0.9211794726
247
  },
248
  "ORDINAL": {
249
  "p": 0.787965616,
250
  "r": 0.8540372671,
251
  "f": 0.8196721311
252
  },
253
+ "FAC": {
254
+ "p": 0.4910714286,
255
+ "r": 0.4230769231,
256
+ "f": 0.4545454545
257
+ },
258
  "ORG": {
259
+ "p": 0.8242392445,
260
+ "r": 0.8329798515,
261
+ "f": 0.8285864979
262
+ },
263
+ "QUANTITY": {
264
+ "p": 0.8231292517,
265
+ "r": 0.6648351648,
266
+ "f": 0.73556231
267
+ },
268
+ "LOC": {
269
+ "p": 0.7222222222,
270
+ "r": 0.6624203822,
271
+ "f": 0.6910299003
272
  },
273
  "CARDINAL": {
274
+ "p": 0.8295912493,
275
+ "r": 0.8567181926,
276
+ "f": 0.8429365311
277
  },
278
  "PERSON": {
279
+ "p": 0.8915049316,
280
+ "r": 0.9144908616,
281
+ "f": 0.9028516191
282
  },
283
  "NORP": {
284
+ "p": 0.9150485437,
285
+ "r": 0.9048,
286
+ "f": 0.9098954143
 
 
 
 
 
 
 
 
 
 
287
  },
288
  "TIME": {
289
+ "p": 0.7133956386,
290
+ "r": 0.6695906433,
291
+ "f": 0.6907993967
292
  },
293
+ "WORK_OF_ART": {
294
+ "p": 0.544,
295
+ "r": 0.3505154639,
296
+ "f": 0.4263322884
297
  },
298
  "EVENT": {
299
+ "p": 0.606741573,
300
+ "r": 0.3103448276,
301
+ "f": 0.4106463878
302
  },
303
+ "LAW": {
304
+ "p": 0.3870967742,
305
+ "r": 0.375,
306
+ "f": 0.380952381
307
  },
308
  "MONEY": {
309
+ "p": 0.9183673469,
310
+ "r": 0.9031877214,
311
+ "f": 0.9107142857
 
 
 
 
 
312
  },
313
  "PERCENT": {
314
+ "p": 0.9079365079,
315
+ "r": 0.875957121,
316
+ "f": 0.8916601715
317
  },
318
  "LANGUAGE": {
319
+ "p": 0.6296296296,
320
+ "r": 0.53125,
321
+ "f": 0.5762711864
322
  },
323
  "PRODUCT": {
324
+ "p": 0.5333333333,
325
+ "r": 0.2274881517,
326
+ "f": 0.3189368771
327
  }
328
  },
329
+ "speed": 7875.967150799
330
  }
en_core_web_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6ce19d37dfe5280400f80a5954d41afca10cbc742b97bfcf4b0e452b6eb24273
3
- size 400651786
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a61a694fbc7533e23115bfc6025ab5100e57013fbbc73b9d82494e53c77cd85
3
+ size 587686926
meta.json CHANGED
@@ -1,18 +1,18 @@
1
  {
2
  "lang":"en",
3
  "name":"core_web_lg",
4
- "version":"3.3.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
- "vectors":342918,
15
- "keys":684830,
16
  "name":"en_vectors"
17
  },
18
  "labels":{
@@ -68,6 +68,7 @@
68
  "WP$",
69
  "WRB",
70
  "XX",
 
71
  "``"
72
  ],
73
  "parser":[
@@ -169,330 +170,330 @@
169
  "token_p":0.9956819193,
170
  "token_r":0.9957659295,
171
  "token_f":0.9957239226,
172
- "tag_acc":0.9738145328,
173
- "sents_p":0.9177103185,
174
- "sents_r":0.8894386173,
175
- "sents_f":0.9033533215,
176
- "dep_uas":0.9188508811,
177
- "dep_las":0.9008477499,
178
  "dep_las_per_type":{
179
  "prep":{
180
- "p":0.8537864878,
181
- "r":0.8645418327,
182
- "f":0.8591305004
183
  },
184
  "det":{
185
- "p":0.9790682522,
186
- "r":0.9802658403,
187
- "f":0.9796666802
188
  },
189
  "pobj":{
190
- "p":0.9633579437,
191
- "r":0.9684272531,
192
- "f":0.965885947
193
  },
194
  "nsubj":{
195
- "p":0.9564757243,
196
- "r":0.9502738226,
197
- "f":0.9533646873
198
  },
199
  "aux":{
200
- "p":0.9809760868,
201
- "r":0.9823733642,
202
- "f":0.9816742283
203
  },
204
  "advmod":{
205
- "p":0.8550492715,
206
- "r":0.8541140838,
207
- "f":0.8545814218
208
  },
209
  "relcl":{
210
- "p":0.7709000356,
211
- "r":0.7862844702,
212
- "f":0.7785162565
213
  },
214
  "root":{
215
- "p":0.9183576195,
216
- "r":0.889702487,
217
- "f":0.9038029821
218
  },
219
  "xcomp":{
220
- "p":0.882620883,
221
- "r":0.9041636755,
222
- "f":0.8932624113
223
  },
224
  "amod":{
225
- "p":0.9195970101,
226
- "r":0.9166180758,
227
- "f":0.9181051265
228
  },
229
  "compound":{
230
- "p":0.9193539526,
231
- "r":0.9320004455,
232
- "f":0.9256340054
233
  },
234
  "poss":{
235
- "p":0.9711422846,
236
- "r":0.9754428341,
237
- "f":0.9732878088
238
  },
239
  "ccomp":{
240
- "p":0.7727868239,
241
- "r":0.8409368635,
242
- "f":0.8054228031
243
  },
244
  "attr":{
245
- "p":0.8955042527,
246
- "r":0.9297729184,
247
- "f":0.912316897
248
  },
249
  "case":{
250
- "p":0.9758144126,
251
- "r":0.9894894895,
252
- "f":0.9826043738
253
  },
254
  "mark":{
255
- "p":0.9062829989,
256
- "r":0.9096449391,
257
- "f":0.9079608569
258
  },
259
  "intj":{
260
- "p":0.6653322658,
261
- "r":0.6087912088,
262
- "f":0.635807192
263
  },
264
  "advcl":{
265
- "p":0.6779661017,
266
- "r":0.6648199446,
267
- "f":0.6713286713
268
  },
269
  "cc":{
270
- "p":0.8292624233,
271
- "r":0.824303313,
272
- "f":0.8267754319
273
  },
274
  "neg":{
275
- "p":0.9393336648,
276
- "r":0.9478173608,
277
- "f":0.9435564436
278
  },
279
  "conj":{
280
- "p":0.763665795,
281
- "r":0.7720292044,
282
- "f":0.7678247261
283
  },
284
  "nsubjpass":{
285
- "p":0.9263266358,
286
- "r":0.9220512821,
287
- "f":0.9241840144
288
  },
289
  "auxpass":{
290
- "p":0.9499329459,
291
- "r":0.9681093394,
292
- "f":0.9589350181
293
  },
294
  "dobj":{
295
- "p":0.926432648,
296
- "r":0.9442983505,
297
- "f":0.9352801894
298
  },
299
  "nummod":{
300
- "p":0.9362134689,
301
- "r":0.9303030303,
302
- "f":0.9332488917
303
  },
304
  "npadvmod":{
305
- "p":0.7723030982,
306
- "r":0.734991119,
307
- "f":0.753185293
308
  },
309
  "prt":{
310
- "p":0.8160066007,
311
- "r":0.8862007168,
312
- "f":0.8496563574
313
  },
314
  "pcomp":{
315
- "p":0.8800841515,
316
- "r":0.8788515406,
317
- "f":0.8794674142
318
  },
319
  "expl":{
320
- "p":0.9809322034,
321
- "r":0.9914346895,
322
- "f":0.9861554846
323
  },
324
  "acl":{
325
- "p":0.7556456283,
326
- "r":0.7119476268,
327
- "f":0.7331460674
328
  },
329
  "agent":{
330
- "p":0.8991452991,
331
- "r":0.9426523297,
332
- "f":0.9203849519
333
  },
334
  "dative":{
335
- "p":0.810298103,
336
- "r":0.6857798165,
337
- "f":0.7428571429
338
  },
339
  "acomp":{
340
- "p":0.9111721612,
341
- "r":0.9024943311,
342
- "f":0.9068124858
343
  },
344
  "dep":{
345
- "p":0.3930131004,
346
- "r":0.1461038961,
347
- "f":0.2130177515
348
  },
349
  "csubj":{
350
- "p":0.7068965517,
351
- "r":0.7278106509,
352
- "f":0.7172011662
353
  },
354
  "quantmod":{
355
- "p":0.8746594005,
356
- "r":0.7822908205,
357
- "f":0.8259005146
358
  },
359
  "nmod":{
360
- "p":0.7651217596,
361
- "r":0.5935405241,
362
- "f":0.6684969115
363
  },
364
  "appos":{
365
- "p":0.6994459834,
366
- "r":0.6572668113,
367
- "f":0.6777007381
368
  },
369
  "predet":{
370
- "p":0.8380566802,
371
- "r":0.8884120172,
372
- "f":0.8625
373
  },
374
  "preconj":{
375
- "p":0.537037037,
376
- "r":0.6744186047,
377
- "f":0.5979381443
378
  },
379
  "oprd":{
380
- "p":0.8477508651,
381
- "r":0.7313432836,
382
- "f":0.7852564103
383
  },
384
  "parataxis":{
385
- "p":0.6187845304,
386
- "r":0.4859002169,
387
- "f":0.5443499392
388
  },
389
  "meta":{
390
- "p":1.0,
391
- "r":0.3269230769,
392
- "f":0.4927536232
393
  },
394
  "csubjpass":{
395
- "p":0.5555555556,
396
- "r":0.8333333333,
397
- "f":0.6666666667
398
  }
399
  },
400
- "ents_p":0.8602117695,
401
- "ents_r":0.8462540064,
402
- "ents_f":0.8531758053,
403
  "ents_per_type":{
404
  "DATE":{
405
- "p":0.872593068,
406
- "r":0.8631746032,
407
- "f":0.8678582828
408
  },
409
  "GPE":{
410
- "p":0.9257256688,
411
- "r":0.9073919107,
412
- "f":0.916467108
413
  },
414
  "ORDINAL":{
415
  "p":0.787965616,
416
  "r":0.8540372671,
417
  "f":0.8196721311
418
  },
 
 
 
 
 
419
  "ORG":{
420
- "p":0.8203309693,
421
- "r":0.8279427359,
422
- "f":0.8241192769
 
 
 
 
 
 
 
 
 
 
423
  },
424
  "CARDINAL":{
425
- "p":0.8304398148,
426
- "r":0.8531510107,
427
- "f":0.8416422287
428
  },
429
  "PERSON":{
430
- "p":0.8953229399,
431
- "r":0.9184073107,
432
- "f":0.9067182214
433
  },
434
  "NORP":{
435
- "p":0.8794048551,
436
- "r":0.8984,
437
- "f":0.8888009497
438
- },
439
- "LOC":{
440
- "p":0.7147766323,
441
- "r":0.6624203822,
442
- "f":0.6876033058
443
- },
444
- "FAC":{
445
- "p":0.3949579832,
446
- "r":0.3615384615,
447
- "f":0.3775100402
448
  },
449
  "TIME":{
450
- "p":0.71875,
451
- "r":0.6725146199,
452
- "f":0.6948640483
453
  },
454
- "QUANTITY":{
455
- "p":0.8014184397,
456
- "r":0.6208791209,
457
- "f":0.6996904025
458
  },
459
  "EVENT":{
460
- "p":0.6354166667,
461
- "r":0.3505747126,
462
- "f":0.4518518519
463
  },
464
- "WORK_OF_ART":{
465
- "p":0.5,
466
- "r":0.3092783505,
467
- "f":0.3821656051
468
  },
469
  "MONEY":{
470
- "p":0.9039145907,
471
- "r":0.8996458087,
472
- "f":0.9017751479
473
- },
474
- "LAW":{
475
- "p":0.6428571429,
476
- "r":0.421875,
477
- "f":0.5094339623
478
  },
479
  "PERCENT":{
480
- "p":0.9187898089,
481
- "r":0.8836140888,
482
- "f":0.9008587041
483
  },
484
  "LANGUAGE":{
485
- "p":0.75,
486
- "r":0.65625,
487
- "f":0.7
488
  },
489
  "PRODUCT":{
490
- "p":0.6097560976,
491
- "r":0.2369668246,
492
- "f":0.3412969283
493
  }
494
  },
495
- "speed":7281.6726563626
496
  },
497
  "sources":[
498
  {
@@ -514,10 +515,10 @@
514
  "license":"WordNet 3.0 License"
515
  },
516
  {
517
- "name":"GloVe Common Crawl",
518
- "url":"https://nlp.stanford.edu/projects/glove/",
519
- "license":"Public Domain Dedication and License v1.0",
520
- "author":"Jeffrey Pennington, Richard Socher, and Christopher D. Manning"
521
  }
522
  ],
523
  "requirements":[
 
1
  {
2
  "lang":"en",
3
  "name":"core_web_lg",
4
+ "version":"3.4.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":300,
14
+ "vectors":514157,
15
+ "keys":514157,
16
  "name":"en_vectors"
17
  },
18
  "labels":{
 
68
  "WP$",
69
  "WRB",
70
  "XX",
71
+ "_SP",
72
  "``"
73
  ],
74
  "parser":[
 
170
  "token_p":0.9956819193,
171
  "token_r":0.9957659295,
172
  "token_f":0.9957239226,
173
+ "tag_acc":0.9734404547,
174
+ "sents_p":0.9179347826,
175
+ "sents_r":0.8913516723,
176
+ "sents_f":0.90444794,
177
+ "dep_uas":0.9204363007,
178
+ "dep_las":0.9023174614,
179
  "dep_las_per_type":{
180
  "prep":{
181
+ "p":0.8597877625,
182
+ "r":0.8669322709,
183
+ "f":0.8633452361
184
  },
185
  "det":{
186
+ "p":0.9797074284,
187
+ "r":0.9803066134,
188
+ "f":0.9800069293
189
  },
190
  "pobj":{
191
+ "p":0.963921354,
192
+ "r":0.9683879835,
193
+ "f":0.9661495063
194
  },
195
  "nsubj":{
196
+ "p":0.9573359244,
197
+ "r":0.94966046,
198
+ "f":0.9534827457
199
  },
200
  "aux":{
201
+ "p":0.981595092,
202
+ "r":0.9828184813,
203
+ "f":0.9822064057
204
  },
205
  "advmod":{
206
+ "p":0.8567202029,
207
+ "r":0.8526838297,
208
+ "f":0.8546972508
209
  },
210
  "relcl":{
211
+ "p":0.7682926829,
212
+ "r":0.7772133527,
213
+ "f":0.7727272727
214
  },
215
  "root":{
216
+ "p":0.9196058444,
217
+ "r":0.8926710205,
218
+ "f":0.9059382741
219
  },
220
  "xcomp":{
221
+ "p":0.8853797019,
222
+ "r":0.8955491744,
223
+ "f":0.8904354033
224
  },
225
  "amod":{
226
+ "p":0.9199114468,
227
+ "r":0.9153223194,
228
+ "f":0.9176111454
229
  },
230
  "compound":{
231
+ "p":0.9198242724,
232
+ "r":0.9328358209,
233
+ "f":0.9262843555
234
  },
235
  "poss":{
236
+ "p":0.9735205617,
237
+ "r":0.9768518519,
238
+ "f":0.9751833618
239
  },
240
  "ccomp":{
241
+ "p":0.7757201646,
242
+ "r":0.8446028513,
243
+ "f":0.8086973479
244
  },
245
  "attr":{
246
+ "p":0.9064542484,
247
+ "r":0.93313709,
248
+ "f":0.919602155
249
  },
250
  "case":{
251
+ "p":0.9797330697,
252
+ "r":0.991991992,
253
+ "f":0.9858244218
254
  },
255
  "mark":{
256
+ "p":0.9015625,
257
+ "r":0.9173290938,
258
+ "f":0.9093774626
259
  },
260
  "intj":{
261
+ "p":0.680533752,
262
+ "r":0.6351648352,
263
+ "f":0.6570670709
264
  },
265
  "advcl":{
266
+ "p":0.6686002522,
267
+ "r":0.6675900277,
268
+ "f":0.6680947581
269
  },
270
  "cc":{
271
+ "p":0.8381204182,
272
+ "r":0.8341107523,
273
+ "f":0.8361107781
274
  },
275
  "neg":{
276
+ "p":0.9451371571,
277
+ "r":0.9508278976,
278
+ "f":0.947973987
279
  },
280
  "conj":{
281
+ "p":0.7760468594,
282
+ "r":0.7838620342,
283
+ "f":0.7799348697
284
  },
285
  "nsubjpass":{
286
+ "p":0.9234693878,
287
+ "r":0.9282051282,
288
+ "f":0.925831202
289
  },
290
  "auxpass":{
291
+ "p":0.9468791501,
292
+ "r":0.9744874715,
293
+ "f":0.9604849573
294
  },
295
  "dobj":{
296
+ "p":0.9278213166,
297
+ "r":0.9434217866,
298
+ "f":0.9355565214
299
  },
300
  "nummod":{
301
+ "p":0.9377224199,
302
+ "r":0.9315656566,
303
+ "f":0.9346338992
304
  },
305
  "npadvmod":{
306
+ "p":0.7837218189,
307
+ "r":0.7285968028,
308
+ "f":0.7551546392
309
  },
310
  "prt":{
311
+ "p":0.8103025348,
312
+ "r":0.8879928315,
313
+ "f":0.8473706712
314
  },
315
  "pcomp":{
316
+ "p":0.8873937677,
317
+ "r":0.8774509804,
318
+ "f":0.8823943662
319
  },
320
  "expl":{
321
+ "p":0.9809725159,
322
+ "r":0.9935760171,
323
+ "f":0.9872340426
324
  },
325
  "acl":{
326
+ "p":0.7534883721,
327
+ "r":0.7070376432,
328
+ "f":0.7295243456
329
  },
330
  "agent":{
331
+ "p":0.9042735043,
332
+ "r":0.9480286738,
333
+ "f":0.9256342957
334
  },
335
  "dative":{
336
+ "p":0.7725,
337
+ "r":0.7087155963,
338
+ "f":0.7392344498
339
  },
340
  "acomp":{
341
+ "p":0.9080091533,
342
+ "r":0.8997732426,
343
+ "f":0.9038724374
344
  },
345
  "dep":{
346
+ "p":0.3263473054,
347
+ "r":0.1769480519,
348
+ "f":0.2294736842
349
  },
350
  "csubj":{
351
+ "p":0.7045454545,
352
+ "r":0.7337278107,
353
+ "f":0.7188405797
354
  },
355
  "quantmod":{
356
+ "p":0.8531468531,
357
+ "r":0.7928513404,
358
+ "f":0.8218947368
359
  },
360
  "nmod":{
361
+ "p":0.7539432177,
362
+ "r":0.5825716027,
363
+ "f":0.6572705397
364
  },
365
  "appos":{
366
+ "p":0.6997270246,
367
+ "r":0.6672451193,
368
+ "f":0.6831001555
369
  },
370
  "predet":{
371
+ "p":0.8524590164,
372
+ "r":0.8927038627,
373
+ "f":0.8721174004
374
  },
375
  "preconj":{
376
+ "p":0.5684210526,
377
+ "r":0.6279069767,
378
+ "f":0.5966850829
379
  },
380
  "oprd":{
381
+ "p":0.8322368421,
382
+ "r":0.7552238806,
383
+ "f":0.7918622848
384
  },
385
  "parataxis":{
386
+ "p":0.6323119777,
387
+ "r":0.4924078091,
388
+ "f":0.5536585366
389
  },
390
  "meta":{
391
+ "p":0.8461538462,
392
+ "r":0.4230769231,
393
+ "f":0.5641025641
394
  },
395
  "csubjpass":{
396
+ "p":0.4285714286,
397
+ "r":0.5,
398
+ "f":0.4615384615
399
  }
400
  },
401
+ "ents_p":0.8636641533,
402
+ "ents_r":0.8489583333,
403
+ "ents_f":0.8562481059,
404
  "ents_per_type":{
405
  "DATE":{
406
+ "p":0.8711209626,
407
+ "r":0.8733333333,
408
+ "f":0.8722257451
409
  },
410
  "GPE":{
411
+ "p":0.9365811473,
412
+ "r":0.9062761506,
413
+ "f":0.9211794726
414
  },
415
  "ORDINAL":{
416
  "p":0.787965616,
417
  "r":0.8540372671,
418
  "f":0.8196721311
419
  },
420
+ "FAC":{
421
+ "p":0.4910714286,
422
+ "r":0.4230769231,
423
+ "f":0.4545454545
424
+ },
425
  "ORG":{
426
+ "p":0.8242392445,
427
+ "r":0.8329798515,
428
+ "f":0.8285864979
429
+ },
430
+ "QUANTITY":{
431
+ "p":0.8231292517,
432
+ "r":0.6648351648,
433
+ "f":0.73556231
434
+ },
435
+ "LOC":{
436
+ "p":0.7222222222,
437
+ "r":0.6624203822,
438
+ "f":0.6910299003
439
  },
440
  "CARDINAL":{
441
+ "p":0.8295912493,
442
+ "r":0.8567181926,
443
+ "f":0.8429365311
444
  },
445
  "PERSON":{
446
+ "p":0.8915049316,
447
+ "r":0.9144908616,
448
+ "f":0.9028516191
449
  },
450
  "NORP":{
451
+ "p":0.9150485437,
452
+ "r":0.9048,
453
+ "f":0.9098954143
 
 
 
 
 
 
 
 
 
 
454
  },
455
  "TIME":{
456
+ "p":0.7133956386,
457
+ "r":0.6695906433,
458
+ "f":0.6907993967
459
  },
460
+ "WORK_OF_ART":{
461
+ "p":0.544,
462
+ "r":0.3505154639,
463
+ "f":0.4263322884
464
  },
465
  "EVENT":{
466
+ "p":0.606741573,
467
+ "r":0.3103448276,
468
+ "f":0.4106463878
469
  },
470
+ "LAW":{
471
+ "p":0.3870967742,
472
+ "r":0.375,
473
+ "f":0.380952381
474
  },
475
  "MONEY":{
476
+ "p":0.9183673469,
477
+ "r":0.9031877214,
478
+ "f":0.9107142857
 
 
 
 
 
479
  },
480
  "PERCENT":{
481
+ "p":0.9079365079,
482
+ "r":0.875957121,
483
+ "f":0.8916601715
484
  },
485
  "LANGUAGE":{
486
+ "p":0.6296296296,
487
+ "r":0.53125,
488
+ "f":0.5762711864
489
  },
490
  "PRODUCT":{
491
+ "p":0.5333333333,
492
+ "r":0.2274881517,
493
+ "f":0.3189368771
494
  }
495
  },
496
+ "speed":7875.967150799
497
  },
498
  "sources":[
499
  {
 
515
  "license":"WordNet 3.0 License"
516
  },
517
  {
518
+ "name":"Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)",
519
+ "url":"https://github.com/explosion/spacy-vectors-builder",
520
+ "license":"CC0",
521
+ "author":"Explosion"
522
  }
523
  ],
524
  "requirements":[
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d9d8a97f17d882960a52360ae2e58d9c960937534c9c010e1d912a3b82767a8f
3
  size 6511153
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f3f9355af1e468feffbc75b068675ad0598179cabaf1b02e189bc61de01064d
3
  size 6511153
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves�{"0":{},"1":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"2":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"3":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"4":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
 
1
+ ��moves�{"0":{},"1":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"2":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"3":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"4":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b8abfdcfaa0d0a822556f61fa2ab7b48d5528e8ab25375e9c657af78d8e2368
3
  size 319909
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3d67c3b8be35db7fdf36b35ff04d575bca9a1d6862ed2003af2bfb0a70d54ae
3
  size 319909
parser/moves CHANGED
@@ -1,2 +1 @@
1
- ��moves�
2
- {"0":{"":994267},"1":{"":990803},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2806,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"dep":1487,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
 
1
+ ��moves� {"0":{"":994332},"1":{"":999432},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2871,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"dep":10116,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
 
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3a1bdccc5dc2d8c842081528c93680c54508411615b525cef695239f30bb0ed8
3
  size 219953
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c62c89d439056fafe7886478e92bebbcb752104eeda3ca61f6830b956edaa070
3
  size 219953
tagger/cfg CHANGED
@@ -48,6 +48,7 @@
48
  "WP$",
49
  "WRB",
50
  "XX",
 
51
  "``"
52
  ],
53
  "neg_prefix":"!",
 
48
  "WP$",
49
  "WRB",
50
  "XX",
51
+ "_SP",
52
  "``"
53
  ],
54
  "neg_prefix":"!",
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4481bf82fdaea8773149ca8b637057e0dfaa4f8fa1cc5e8f19f33250568f6fc0
3
- size 19441
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4570d9aeb04bba9d57fcffb0c230e9a45ef507630daad3e0c83c51c5fd78e12f
3
+ size 19829
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:71724ee469b871ec2287455264d692c9b229b1bf129aa5bc06130a4aeb9b7c0e
3
  size 6365604
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a8256cf8ba7da785abb8a7db0a4a43ef627731048bcfb07f2acac17f6ed5107b
3
  size 6365604
tokenizer CHANGED
The diff for this file is too large to render. See raw diff
 
vocab/key2row CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c8163b927a234a675074bb38ce62c17a57182998dc83fb9275d35500559a582a
3
- size 9311659
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31566ae010da3d399eb1d930ae142757afd2601034a4be3bdb00d18881c8c06a
3
+ size 7066303
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:649ca580aed1f07d3b761fa73308bc96f72b78e8bd4d51140a3a920b3429ba10
3
- size 9694998
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06aeff5e8687bc142b8d3b54846e034ad18b8d9e98650d4a27273e483ed57f45
3
+ size 10369007
vocab/vectors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dd82f972c4fca3d440c505cdd94c88efdded56457cc86851d584b751f7dea673
3
- size 411501728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:234dcf234bfdf01775ae6182715d55eaacfcde8555b189f25440b56d3c39fd5d
3
+ size 616988528