Spaces:
Runtime error
Runtime error
# Key Information Extraction | |
## Overview | |
The structure of the key information extraction dataset directory is organized as follows. | |
```text | |
βββ wildreceipt | |
βββ class_list.txt | |
βββ dict.txt | |
βββ image_files | |
βββ openset_train.txt | |
βββ openset_test.txt | |
βββ test.txt | |
βββ train.txt | |
``` | |
## Preparation Steps | |
### WildReceipt | |
- Just download and extract [wildreceipt.tar](https://download.openmmlab.com/mmocr/data/wildreceipt.tar). | |
### WildReceiptOpenset | |
- Step0: have [WildReceipt](#WildReceipt) prepared. | |
- Step1: Convert annotation files to OpenSet format: | |
```bash | |
# You may find more available arguments by running | |
# python tools/data/kie/closeset_to_openset.py -h | |
python tools/data/kie/closeset_to_openset.py data/wildreceipt/train.txt data/wildreceipt/openset_train.txt | |
python tools/data/kie/closeset_to_openset.py data/wildreceipt/test.txt data/wildreceipt/openset_test.txt | |
``` | |
:::{note} | |
You can learn more about the key differences between CloseSet and OpenSet annotations in our [tutorial](../tutorials/kie_closeset_openset.md). | |
::: | |