EzraWilliam
/

wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-large-xlsr-53
 datasets:
 - common_voice_13_0
 metrics:
@@ -11,8 +11,8 @@ model-index:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: id
     metrics:
-    - type: wer
-      value: 0.5021202064896755
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5021
-- Wer: 0.5021
 ## Model description
@@ -65,16 +65,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 4.9125        | 0.9   | 500  | 2.9530          | 1.0    |
-| 2.9145        | 1.8   | 1000 | 2.8763          | 1.0    |
-| 2.7636        | 2.7   | 1500 | 2.1172          | 1.0    |
-| 1.6023        | 3.6   | 2000 | 0.8278          | 0.7302 |
-| 1.0405        | 4.5   | 2500 | 0.6488          | 0.6256 |
-| 0.8858        | 5.4   | 3000 | 0.5819          | 0.5666 |
-| 0.8119        | 6.29  | 3500 | 0.5431          | 0.5321 |
-| 0.7547        | 7.19  | 4000 | 0.5203          | 0.5145 |
-| 0.713         | 8.09  | 4500 | 0.5039          | 0.5037 |
-| 0.712         | 8.99  | 5000 | 0.5021          | 0.5021 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_13_0
 metrics:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
       split: test
       args: id
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.5518989675516224
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5399
+- Wer: 0.5519
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 4.7217        | 0.9   | 500  | 2.9517          | 1.0    |
+| 2.9149        | 1.8   | 1000 | 2.8778          | 1.0    |
+| 2.851         | 2.7   | 1500 | 2.6437          | 1.0    |
+| 2.0653        | 3.6   | 2000 | 1.0367          | 0.8727 |
+| 1.1893        | 4.5   | 2500 | 0.7226          | 0.7006 |
+| 0.9685        | 5.4   | 3000 | 0.6301          | 0.6358 |
+| 0.8742        | 6.29  | 3500 | 0.5778          | 0.5890 |
+| 0.8076        | 7.19  | 4000 | 0.5576          | 0.5696 |
+| 0.7624        | 8.09  | 4500 | 0.5412          | 0.5525 |
+| 0.7604        | 8.99  | 5000 | 0.5399          | 0.5519 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:065b0a137fe9357d8ca8a5e8b645c4c9976b5b7723ac3ed01cc9118bac30da05
 size 1261991980

 version https://git-lfs.github.com/spec/v1
+oid sha256:ad94635ecfdc32bc2013388defcdd3ff975d540cbde21fd986ba77ac0c151787
 size 1261991980