es_qwen2_vl_pangea / README.md
apjanco's picture
Update README.md
fabf456 verified
---
library_name: transformers
tags:
- llama-factory
license: mit
datasets:
- neulab/PangeaInstruct
- apjanco/fmb_primera_muestra_redboxes
language:
- es
base_model:
- Qwen/Qwen2-VL-2B-Instruct
---
# Model Card
This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from [neulab/PangeaInstruct](https://huggingface.co/datasets/neulab/PangeaInstruct).
Each image has a red box around an area of text in the image. The model is asked to return the text inside.
For the training data see
- Pangea (task_data_vmultilingual_cc_news_es_curated.tar)
- [apjanco/fmb_primera_muestra_redboxes](https://huggingface.co/datasets/apjanco/fmb_primera_muestra_redboxes)
## Model Details
This is the model card of a 🤗 transformers model that has been pushed on the Hub.
- **Developed by:** Andrew Janco
- **Model type:** Qwen2-VL
- **Language(s) (NLP):** Spanish
- **License:** MIT
- **Finetuned from model [optional]:** Qwen2-VL 2B
## Uses
This model is part of experiments to extract text from historical handwritten documents.