Sercan commited on
Commit
9e976f5
1 Parent(s): 6d8d3f1

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +88 -0
README.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ datasets:
6
+ - common_voice_11_0
7
+ metrics:
8
+ - wer
9
+ model-index:
10
+ - name: openai/whisper-small
11
+ results:
12
+ - task:
13
+ name: Automatic Speech Recognition
14
+ type: automatic-speech-recognition
15
+ dataset:
16
+ name: common_voice_11_0
17
+ type: common_voice_11_0
18
+ config: tr
19
+ split: test
20
+ args: tr
21
+ metrics:
22
+ - name: Wer
23
+ type: wer
24
+ value: 16.634639647508685
25
+ ---
26
+
27
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
28
+ should probably proofread and complete it, then remove this comment. -->
29
+
30
+ # openai/whisper-small
31
+
32
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_11_0 dataset.
33
+ It achieves the following results on the evaluation set:
34
+ - Loss: 0.2615
35
+ - Wer: 16.6346
36
+ - Cer: 4.2839
37
+
38
+ ## Model description
39
+
40
+ More information needed
41
+
42
+ ## Intended uses & limitations
43
+
44
+ More information needed
45
+
46
+ ## Training and evaluation data
47
+
48
+ More information needed
49
+
50
+ ## Training procedure
51
+
52
+ ### Training hyperparameters
53
+
54
+ The following hyperparameters were used during training:
55
+ - learning_rate: 1e-05
56
+ - train_batch_size: 32
57
+ - eval_batch_size: 16
58
+ - seed: 42
59
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
+ - lr_scheduler_type: linear
61
+ - lr_scheduler_warmup_steps: 500
62
+ - training_steps: 5000
63
+ - mixed_precision_training: Native AMP
64
+
65
+ ### Training results
66
+
67
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
68
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|
69
+ | 0.1412 | 0.08 | 400 | 0.2656 | 19.8335 | 5.2393 |
70
+ | 0.0851 | 1.03 | 800 | 0.2382 | 18.6300 | 4.8916 |
71
+ | 0.0525 | 1.11 | 1200 | 0.2532 | 19.1696 | 5.2238 |
72
+ | 0.0163 | 2.07 | 1600 | 0.2447 | 17.2014 | 4.5840 |
73
+ | 0.0202 | 3.02 | 2000 | 0.2472 | 17.1063 | 4.4935 |
74
+ | 0.0075 | 3.1 | 2400 | 0.2503 | 17.0151 | 4.4318 |
75
+ | 0.0039 | 4.05 | 2800 | 0.2514 | 16.7433 | 4.3655 |
76
+ | 0.0038 | 5.01 | 3200 | 0.2565 | 16.8870 | 4.3582 |
77
+ | 0.0023 | 5.09 | 3600 | 0.2590 | 16.6987 | 4.3337 |
78
+ | 0.0013 | 6.04 | 4000 | 0.2576 | 16.6327 | 4.2853 |
79
+ | 0.0011 | 6.12 | 4400 | 0.2647 | 16.9122 | 4.3556 |
80
+ | 0.001 | 7.07 | 4800 | 0.2615 | 16.6346 | 4.2839 |
81
+
82
+
83
+ ### Framework versions
84
+
85
+ - Transformers 4.26.0.dev0
86
+ - Pytorch 1.13.1+cu117
87
+ - Datasets 2.8.1.dev0
88
+ - Tokenizers 0.13.2