File size: 1,069 Bytes
2a08594 fabf456 2a08594 fabf456 2a08594 fabf456 2a08594 fabf456 2a08594 fabf456 2a08594 fabf456 2a08594 fabf456 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 |
---
library_name: transformers
tags:
- llama-factory
license: mit
datasets:
- neulab/PangeaInstruct
- apjanco/fmb_primera_muestra_redboxes
language:
- es
base_model:
- Qwen/Qwen2-VL-2B-Instruct
---
# Model Card
This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from [neulab/PangeaInstruct](https://huggingface.co/datasets/neulab/PangeaInstruct).
Each image has a red box around an area of text in the image. The model is asked to return the text inside.
For the training data see
- Pangea (task_data_vmultilingual_cc_news_es_curated.tar)
- [apjanco/fmb_primera_muestra_redboxes](https://huggingface.co/datasets/apjanco/fmb_primera_muestra_redboxes)
## Model Details
This is the model card of a 🤗 transformers model that has been pushed on the Hub.
- **Developed by:** Andrew Janco
- **Model type:** Qwen2-VL
- **Language(s) (NLP):** Spanish
- **License:** MIT
- **Finetuned from model [optional]:** Qwen2-VL 2B
## Uses
This model is part of experiments to extract text from historical handwritten documents. |