Directory Overview:

This directory contains all the evaluation modules and results:

################################################################################################################################

## 1. evaluation.py

The `evaluation.py` module provides a framework for evaluating Knowledge-Based Visual Question Answering (KB-VQA) models. It includes methods for syntactic and semantic evaluation, using both exact match and VQA scores. The evaluation results can be saved to an Excel file for further analysis.

Notes:

- Ensure that the necessary configuration settings are correctly set in the "my_model/config/evaluation_config" file.
- The module requires the following dependencies: pandas, fuzzywuzzy, nltk, ast, typing, streamlit, and openai.

################################################################################################################################

## 2. demo.py

The `demo.py` module includes the `ResultDemonstrator` class, which provides a framework for visualizing and demonstrating the results of Knowledge-Based Visual Question Answering (KB-VQA) models. The class uses Streamlit for displaying data and Altair for creating interactive visualizations. This class is the backbone of the application's results demo tool.

Notes:

- Ensure that the necessary configuration settings are correctly set in the "my_model/config/evaluation_config" file.
- The module requires the following dependencies: os, altair, streamlit, PIL, pandas, and random.

################################################################################################################################

## 3. evaluation_results.xlsx

This file contains all the evaluation results based on the evaluation data (~1,000 samples). The results are in the sheet named "Main Data". Running the evaluation module reads the main columns (the model answers for all model configurations) from this sheet, then regenerates and saves the evaluation results.

The "Main Data" sheet is itself the result of running the evaluation module, with the same file name chosen for saving.

################################################################################################################################

## 4. Demo_Images

This folder contains a pool of images used by the demo app to visualize and demonstrate the images together with the model answers.
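
As a rough illustration of the two syntactic metrics named in section 1, the sketch below computes an exact match score and a VQA score for a single prediction. It is not taken from `evaluation.py`: the function names (`normalize`, `exact_match`, `vqa_score`) and the normalization rules are hypothetical, and the VQA score uses the commonly cited simplified form min(matches / 3, 1), which may differ from what the module actually implements.

```python
import re
from typing import List


def normalize(answer: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace (assumed rules)."""
    answer = re.sub(r"[^\w\s]", "", answer.lower())
    return " ".join(answer.split())


def exact_match(prediction: str, references: List[str]) -> int:
    """Return 1 if the normalized prediction equals any normalized reference."""
    pred = normalize(prediction)
    return int(any(pred == normalize(ref) for ref in references))


def vqa_score(prediction: str, references: List[str]) -> float:
    """Simplified VQA accuracy: min(number of matching references / 3, 1)."""
    pred = normalize(prediction)
    matches = sum(pred == normalize(ref) for ref in references)
    return min(matches / 3.0, 1.0)


if __name__ == "__main__":
    # Hypothetical annotator answers for one question.
    refs = ["Dog", "dog", "puppy", "a dog", "dog."]
    print(exact_match("Dog!", refs), vqa_score("Dog!", refs))
    print(exact_match("puppy", refs), vqa_score("puppy", refs))
```

Exact match is binary, while the VQA score gives partial credit when fewer than three references agree with the prediction, so the two can diverge on answers that only a minority of annotators gave.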