boumehdi/wav2vec2-large-xlsr-moroccan-darija

Automatic Speech Recognition

Moroccan Arabic

xlsr-fine-tuning-week

boumehdi committed on Apr 9, 2023

Commit 3128428 • 1 Parent(s): 346ca1e

Update README.md

Files changed (1)

README.md +5 -5

README.md CHANGED

@@ -19,7 +19,7 @@ model-index:
          type: wer
          value: 49.68
 ---
-# Wav2Vec2-Large-XLSR-53-Moroccan-Darija-V1
+# Wav2Vec2-Large-XLSR-53-Moroccan-Darija
 [othrif/wav2vec2-large-xlsr-moroccan](https://huggingface.co/othrif/wav2vec2-large-xlsr-moroccan) fine-tuned on 6 hours of labeled Darija Audios
@@ -35,8 +35,8 @@ import torch
 from transformers import Wav2Vec2CTCTokenizer, Wav2Vec2ForCTC, Wav2Vec2Processor, TrainingArguments, Wav2Vec2FeatureExtractor, Trainer
 tokenizer = Wav2Vec2CTCTokenizer("./vocab.json", unk_token="[UNK]", pad_token="[PAD]", word_delimiter_token="|")
-processor = Wav2Vec2Processor.from_pretrained('boumehdi/wav2vec2-large-xlsr-moroccan-darija-v1', tokenizer=tokenizer)
-model=Wav2Vec2ForCTC.from_pretrained('boumehdi/wav2vec2-large-xlsr-moroccan-darija-v1')
+processor = Wav2Vec2Processor.from_pretrained('boumehdi/wav2vec2-large-xlsr-moroccan-darija', tokenizer=tokenizer)
+model=Wav2Vec2ForCTC.from_pretrained('boumehdi/wav2vec2-large-xlsr-moroccan-darija')
 # load the audio data (use your own wav file here!)
@@ -71,6 +71,6 @@ This high validation loss value is mainly due to the fact that Darija can be wri
 ## Future Work
-Currently working on **wav2vec2-large-xlsr-moroccan-darija-v2** which will be available soon by adding more data (from 6hours to 12hours).
-I am also working on audio data augmentation techniques (pitch shift, reberbation, additive augmentation.. ) to see if it is going to improve the **WER**.
+Currently working on improving this model. The new model will be available soon.
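
Note on the metric: the `type: wer` / `value: 49.68` metadata in the first hunk reports a word error rate, presumably expressed as a percentage. For readers unfamiliar with it, here is a minimal sketch of the conventional definition (word-level edit distance divided by the number of reference words). The function name `word_error_rate` and the sample strings are hypothetical, and this is not necessarily the script that produced the 49.68 figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration only; not taken from the model card.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (b -> x) and one deletion (d) over 4 reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Because the score counts exact word matches, two spellings of the same spoken word are scored as an error, which is consistent with the card's remark that Darija can be written in more than one way.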