File size: 1,069 Bytes
2a08594
 
 
 
fabf456
 
 
 
 
 
 
 
2a08594
 
fabf456
2a08594
fabf456
 
2a08594
fabf456
 
 
2a08594
 
 
fabf456
2a08594
fabf456
 
 
 
 
2a08594
 
 
fabf456
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
library_name: transformers
tags:
- llama-factory
license: mit
datasets:
- neulab/PangeaInstruct
- apjanco/fmb_primera_muestra_redboxes
language:
- es
base_model:
- Qwen/Qwen2-VL-2B-Instruct
---

# Model Card

This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from [neulab/PangeaInstruct](https://huggingface.co/datasets/neulab/PangeaInstruct). 
Each image has a red box around an area of text in the image. The model is asked to return the text inside. 

For the training data see 
- Pangea (task_data_vmultilingual_cc_news_es_curated.tar)
- [apjanco/fmb_primera_muestra_redboxes](https://huggingface.co/datasets/apjanco/fmb_primera_muestra_redboxes)

## Model Details

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

- **Developed by:** Andrew Janco
- **Model type:** Qwen2-VL
- **Language(s) (NLP):** Spanish
- **License:** MIT
- **Finetuned from model [optional]:** Qwen2-VL 2B

## Uses

This model is part of experiments to extract text from historical handwritten documents.