Edit model card

Levanti (Hebrew -> colloquial Levantine Arabic) translator

Trained on the Levanti dataset by fine-tuning Helsinki-NLP/opus-mt-he-ar for 8 epochs. This model is trained to support dialect conditional generation by utilizing the first token (followed by a space) as an indicator of the desired dialect:

  • P for Palestinian
  • L for Lebanese
  • S for Syrian
  • E for Egyptian

Example usage

from transformers import pipeline
trans = pipeline("translation", "guymorlan/levanti_translate_he_ar")
trans("P אני רוצה ללכת מחר לחנות")
Out[1]: [{'translation_text': 'بدي أروح ع الدكان بكرا'}]

Attribution

Created by Guy Mor-Lan.
Contact: guy.mor AT mail.huji.ac.il

Downloads last month
7
Safetensors
Model size
76.6M params
Tensor type
F32
·
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Dataset used to train guymorlan/levanti_translate_he_ar