xiaoming-leza
/

wav2vec2-common_voice-tr-demo

@@ -1,11 +1,7 @@
 ---
-language:
-- tr
 license: apache-2.0
 base_model: facebook/wav2vec2-large-xlsr-53
 tags:
-- automatic-speech-recognition
-- common_voice
 - generated_from_trainer
 datasets:
 - common_voice
@@ -18,15 +14,15 @@ model-index:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: COMMON_VOICE - TR
       type: common_voice
       config: tr
       split: test
-      args: 'Config: tr, Training split: train+validation, Eval split: test'
     metrics:
     - name: Wer
       type: wer
-      value: 0.34950464712491064
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,10 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
 # wav2vec2-common_voice-tr-demo
-This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the COMMON_VOICE - TR dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3828
-- Wer: 0.3495
 ## Model description
@@ -71,22 +67,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| No log        | 0.92  | 100  | 3.5826          | 1.0    |
-| No log        | 1.83  | 200  | 3.0218          | 0.9999 |
-| No log        | 2.75  | 300  | 0.8985          | 0.8036 |
-| No log        | 3.67  | 400  | 0.5992          | 0.6197 |
-| 3.1629        | 4.59  | 500  | 0.4968          | 0.5340 |
-| 3.1629        | 5.5   | 600  | 0.4646          | 0.5045 |
-| 3.1629        | 6.42  | 700  | 0.4316          | 0.4425 |
-| 3.1629        | 7.34  | 800  | 0.4500          | 0.4735 |
-| 3.1629        | 8.26  | 900  | 0.4114          | 0.4123 |
-| 0.2226        | 9.17  | 1000 | 0.4162          | 0.4019 |
-| 0.2226        | 10.09 | 1100 | 0.3999          | 0.3824 |
-| 0.2226        | 11.01 | 1200 | 0.4048          | 0.3842 |
-| 0.2226        | 11.93 | 1300 | 0.3789          | 0.3602 |
-| 0.2226        | 12.84 | 1400 | 0.4024          | 0.3536 |
-| 0.1015        | 13.76 | 1500 | 0.3899          | 0.3575 |
-| 0.1015        | 14.68 | 1600 | 0.3802          | 0.3490 |
 ### Framework versions

 ---
 license: apache-2.0
 base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
 datasets:
 - common_voice
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: common_voice
       type: common_voice
       config: tr
       split: test
+      args: tr
     metrics:
     - name: Wer
       type: wer
+      value: 0.3454192625880911
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # wav2vec2-common_voice-tr-demo
+This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3714
+- Wer: 0.3454
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| No log        | 0.92  | 100  | 3.5988          | 1.0    |
+| No log        | 1.83  | 200  | 3.0083          | 0.9999 |
+| No log        | 2.75  | 300  | 0.8642          | 0.7579 |
+| No log        | 3.67  | 400  | 0.5713          | 0.6203 |
+| 3.14          | 4.59  | 500  | 0.4795          | 0.5338 |
+| 3.14          | 5.5   | 600  | 0.4441          | 0.4912 |
+| 3.14          | 6.42  | 700  | 0.4241          | 0.4521 |
+| 3.14          | 7.34  | 800  | 0.4326          | 0.4611 |
+| 3.14          | 8.26  | 900  | 0.3913          | 0.4212 |
+| 0.2183        | 9.17  | 1000 | 0.4036          | 0.3973 |
+| 0.2183        | 10.09 | 1100 | 0.4035          | 0.3959 |
+| 0.2183        | 11.01 | 1200 | 0.3807          | 0.3790 |
+| 0.2183        | 11.93 | 1300 | 0.3750          | 0.3650 |
+| 0.2183        | 12.84 | 1400 | 0.3822          | 0.3573 |
+| 0.1011        | 13.76 | 1500 | 0.3747          | 0.3510 |
+| 0.1011        | 14.68 | 1600 | 0.3714          | 0.3454 |
 ### Framework versions