boumehdi/wav2vec2-large-xlsr-moroccan-darija

Automatic Speech Recognition

Moroccan Arabic

xlsr-fine-tuning-week


boumehdi committed on Dec 13, 2023

Commit eded877 • 1 Parent(s): b948b7b

Update README.md

Files changed (1)

README.md +17 -2


@@ -17,11 +17,25 @@ model-index:
     metrics:
        - name: Test WER
          type: wer
-         value: 0.09
 ---
 # Wav2Vec2-Large-XLSR-53-Moroccan-Darija
-**wav2vec2-large-xlsr-53** fine-tuned on 120 hours of labeled Darija Audios
 ## Usage
@@ -58,3 +72,4 @@ print(transcription)
 Output: قالت ليا هاد السيد هادا ما كاينش بحالو
 email: souregh@gmail.com

     metrics:
        - name: Test WER
          type: wer
+         value: 0.254919
 ---
 # Wav2Vec2-Large-XLSR-53-Moroccan-Darija
+**wav2vec2-large-xlsr-53** fine-tuned on 27 hours of labeled Darija audio from 27 speakers.
+## Old model vs. new model
+Old model:
+- The training data contained numerous incorrect transcriptions.
+- Transcriptions were produced by multiple transcribers.
+- The audio dataset was not organized (by gender, age, region, etc.).
+- The reported WER was incorrect.
+New model:
+- All transcriptions are now produced by a single person.
+- Each hour of audio is spoken by a single speaker.
+- Fine-tuning runs continuously to improve accuracy, and more data is added every day.
+- The reported WER is correct.
 ## Usage
 Output: قالت ليا هاد السيد هادا ما كاينش بحالو
 email: souregh@gmail.com
+BOUMEHDI Ahmed
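
The `Test WER` value in the metadata above is a word error rate. How the reported figure was computed is not stated in this commit, so the following is only a minimal sketch of the standard definition (word-level edit distance divided by the number of reference words). The function name `word_error_rate` is hypothetical, and whitespace tokenization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are separated by whitespace and compared exactly,
    with no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# The sample output from the Usage section has 8 words; dropping the
# last word from the hypothesis is one deletion.
reference = "قالت ليا هاد السيد هادا ما كاينش بحالو"
hypothesis = "قالت ليا هاد السيد هادا ما كاينش"
print(word_error_rate(reference, hypothesis))  # 0.125
```

On this definition, a value of 0.254919 would correspond to roughly one word error for every four reference words in the test set.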