surveyor-0 / data /README.md
abby101's picture
Upload folder using huggingface_hub
8e3f751 verified

A newer version of the Gradio SDK is available: 5.6.0

Upgrade

This directory contains all the scraped, manually labeled, util and processed data for this project.

It contains of 4 folders:

data
β”œβ”€β”€ manual
β”œβ”€β”€ raw
β”œβ”€β”€ results
└── databases

manual

This folder contains the schema and constituency tests that were manually developed for efficiently labelling data. It also contains manual annotations of a small subset of the raw data using the defined schema and constituency tests.

Check the manual README for further details.

raw

This folder contains the raw data that was scraped from arXiv using the arxiv Python wrapper for different channels of research.

Check the raw README for further details.

results

This folder contains the processed results after running inference through the model. The model generates predictions of tagged concepts from the raw data and stores under folders within this folder.

Check the results README for further details.

databases

This folder contains the SQL daatabases created for each pair of raw papers and the corresponding model predictions of tagged concepts.

Check the databases README for further details.