apjanco
/

es_qwen2_vl_pangea

Image-Text-to-Text

Inference Endpoints

Model card Files Files and versions Community

es_qwen2_vl_pangea / README.md

apjanco's picture

Update README.md

fabf456 verified 5 days ago

|

history blame contribute delete

1.07 kB

	---
	library_name: transformers
	tags:
	- llama-factory
	license: mit
	datasets:
	- neulab/PangeaInstruct
	- apjanco/fmb_primera_muestra_redboxes
	language:
	- es
	base_model:
	- Qwen/Qwen2-VL-2B-Instruct
	---

	# Model Card

	This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from [neulab/PangeaInstruct](https://huggingface.co/datasets/neulab/PangeaInstruct).
	Each image has a red box around an area of text in the image. The model is asked to return the text inside.

	For the training data see
	- Pangea (task_data_vmultilingual_cc_news_es_curated.tar)
	- [apjanco/fmb_primera_muestra_redboxes](https://huggingface.co/datasets/apjanco/fmb_primera_muestra_redboxes)

	## Model Details

	This is the model card of a 🤗 transformers model that has been pushed on the Hub.

	- Developed by: Andrew Janco
	- Model type: Qwen2-VL
	- Language(s) (NLP): Spanish
	- License: MIT
	- Finetuned from model [optional]: Qwen2-VL 2B

	## Uses

	This model is part of experiments to extract text from historical handwritten documents.