- This folder contains all the SQL databases for the different processed data along with their raw data. - The databases are named after the arXiv category and the format of the generated data. Each file in this folder is a database containing 2 tables: - **papers** The papers data from the `raw` folder that was fed to the model. SCHEMA: - paper_id TEXT PRIMARY KEY, - abstract TEXT, - authors TEXT, - primary_category TEXT, - url TEXT, - updated_on TEXT, - sentence_count INTEGER - **predictions** The corresponding model generations stored in the `results` folder. SCHEMA: - id INTEGER PRIMARY KEY AUTOINCREMENT, - paper_id TEXT, - sentence_index INTEGER, - tag_type TEXT, - concept TEXT, - FOREIGN KEY (paper_id) REFERENCES papers(paper_id) To query any database, open SQLite in your terminal and specify the database name.