Ibrahemqasim
commited on
Commit
•
8bf81f5
1
Parent(s):
0154aba
Update README.md
Browse files
README.md
CHANGED
@@ -9,12 +9,16 @@ This dataset contains mappings between English Wikipedia categories and their co
|
|
9 |
|
10 |
### langlinks.json
|
11 |
|
12 |
-
This file contains the original mappings as downloaded from
|
13 |
|
14 |
### filtered_data.json
|
15 |
|
16 |
-
This file contains the mappings after filtering out those that do not contain a 4-digit year. It contains
|
17 |
|
18 |
### cats_2000.json
|
19 |
|
20 |
-
This file contains the mappings after replacing all 4-digit years with the year 2000. It contains
|
|
|
|
|
|
|
|
|
|
9 |
|
10 |
### langlinks.json
|
11 |
|
12 |
+
This file contains the original mappings as downloaded from the Hugging Face Hub. It contains 818,354 mappings.
|
13 |
|
14 |
### filtered_data.json
|
15 |
|
16 |
+
This file contains the mappings after filtering out those that do not contain a 4-digit year. It contains 231,349 mappings.
|
17 |
|
18 |
### cats_2000.json
|
19 |
|
20 |
+
This file contains the mappings after replacing all 4-digit years with the year 2000. It contains 20,913 mappings.
|
21 |
+
|
22 |
+
## Usage
|
23 |
+
|
24 |
+
To use this dataset, you can download it from the Hugging Face Hub and load it into your code. For example, you can use the following code to load the filtered data into a Python dictionary:
|