eddiegulay commited on
Commit
6316d1e
1 Parent(s): e281d61

End of training

Browse files
README.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: alamsher/wav2vec2-large-xlsr-53-common-voice-sw
4
+ tags:
5
+ - generated_from_trainer
6
+ datasets:
7
+ - common_voice_13_0
8
+ metrics:
9
+ - wer
10
+ model-index:
11
+ - name: wav2vec2-large-xlsr-mvc-swahili
12
+ results:
13
+ - task:
14
+ name: Automatic Speech Recognition
15
+ type: automatic-speech-recognition
16
+ dataset:
17
+ name: common_voice_13_0
18
+ type: common_voice_13_0
19
+ config: sw
20
+ split: test
21
+ args: sw
22
+ metrics:
23
+ - name: Wer
24
+ type: wer
25
+ value: 0.32237526397075045
26
+ ---
27
+
28
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
+ should probably proofread and complete it, then remove this comment. -->
30
+
31
+ # wav2vec2-large-xlsr-mvc-swahili
32
+
33
+ This model is a fine-tuned version of [alamsher/wav2vec2-large-xlsr-53-common-voice-sw](https://huggingface.co/alamsher/wav2vec2-large-xlsr-53-common-voice-sw) on the common_voice_13_0 dataset.
34
+ It achieves the following results on the evaluation set:
35
+ - Loss: inf
36
+ - Wer: 0.3224
37
+
38
+ ## Model description
39
+
40
+ More information needed
41
+
42
+ ## Intended uses & limitations
43
+
44
+ More information needed
45
+
46
+ ## Training and evaluation data
47
+
48
+ More information needed
49
+
50
+ ## Training procedure
51
+
52
+ ### Training hyperparameters
53
+
54
+ The following hyperparameters were used during training:
55
+ - learning_rate: 0.0003
56
+ - train_batch_size: 16
57
+ - eval_batch_size: 8
58
+ - seed: 42
59
+ - gradient_accumulation_steps: 2
60
+ - total_train_batch_size: 32
61
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
+ - lr_scheduler_type: linear
63
+ - lr_scheduler_warmup_steps: 500
64
+ - num_epochs: 2
65
+
66
+ ### Training results
67
+
68
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
69
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
70
+ | No log | 0.17 | 100 | inf | 1.0 |
71
+ | No log | 0.34 | 200 | inf | 1.0 |
72
+ | No log | 0.5 | 300 | inf | 0.3420 |
73
+ | 3.3446 | 0.67 | 400 | inf | 0.3431 |
74
+ | 3.3446 | 0.84 | 500 | inf | 0.3500 |
75
+ | 3.3446 | 1.01 | 600 | inf | 0.3433 |
76
+ | 3.3446 | 1.17 | 700 | inf | 0.3347 |
77
+ | 0.1975 | 1.34 | 800 | inf | 0.3340 |
78
+ | 0.1975 | 1.51 | 900 | inf | 0.3307 |
79
+ | 0.1975 | 1.68 | 1000 | inf | 0.3233 |
80
+ | 0.1975 | 1.84 | 1100 | inf | 0.3224 |
81
+
82
+
83
+ ### Framework versions
84
+
85
+ - Transformers 4.35.0
86
+ - Pytorch 2.1.0
87
+ - Datasets 2.14.6
88
+ - Tokenizers 0.14.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dd0bb551c94175807c3e9d77e43a4a361e0024dd88d1ad294e452b0c8f65c25f
3
  size 1261950980
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6b17fa9562ad6d1f4d19cefa681f5af9c19a99707c10b2d903abaae6ee0543d
3
  size 1261950980
runs/Nov06_23-24-19_8d905d1e3af6/events.out.tfevents.1699313622.8d905d1e3af6.3490.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d533226dd238fcc2e18a9377b0c1290e387c205159ab23970f73ca8b3eb999f8
3
- size 8777
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dbad0e156e412a6f28a05a7bf7417a577102263b54a6d30d02643b93b42e8fbf
3
+ size 10085