es_qwen2_vl_pangea / README.md
apjanco's picture
Update README.md
fabf456 verified
metadata
library_name: transformers
tags:
  - llama-factory
license: mit
datasets:
  - neulab/PangeaInstruct
  - apjanco/fmb_primera_muestra_redboxes
language:
  - es
base_model:
  - Qwen/Qwen2-VL-2B-Instruct

Model Card

This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from neulab/PangeaInstruct. Each image has a red box around an area of text in the image. The model is asked to return the text inside.

For the training data see

Model Details

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

  • Developed by: Andrew Janco
  • Model type: Qwen2-VL
  • Language(s) (NLP): Spanish
  • License: MIT
  • Finetuned from model [optional]: Qwen2-VL 2B

Uses

This model is part of experiments to extract text from historical handwritten documents.