End of training

Browse files

Files changed (4) hide show

README.md +32 -4
model.safetensors +1 -1
runs/Apr22_13-35-54_solaris/events.out.tfevents.1713807355.solaris.82143.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -7,9 +7,24 @@ tags:
 - generated_from_trainer
 datasets:
 - librispeech_asr
 model-index:
 - name: SpeechGPT
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,6 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
 # SpeechGPT
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the librispeech_asr dataset.
 ## Model description
@@ -37,19 +55,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 4
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 1
 - mixed_precision_training: Native AMP
 ### Training results
 ### Framework versions

 - generated_from_trainer
 datasets:
 - librispeech_asr
+metrics:
+- wer
 model-index:
 - name: SpeechGPT
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: librispeech_asr
+      type: librispeech_asr
+      config: clean
+      split: None
+      args: 'config: clean, split: train'
+    metrics:
+    - name: Wer
+      type: wer
+      value: 23.544963481436394
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # SpeechGPT
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the librispeech_asr dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.5354
+- Wer: 23.5450
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 2
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 1.0518        | 0.12  | 1000 | 0.7491          | 31.7103 |
+| 0.8884        | 0.24  | 2000 | 0.6588          | 27.4003 |
+| 0.8061        | 0.36  | 3000 | 0.6177          | 26.4569 |
+| 0.8549        | 0.48  | 4000 | 0.5888          | 25.5002 |
+| 0.7836        | 0.6   | 5000 | 0.5688          | 25.8939 |
+| 0.691         | 0.72  | 6000 | 0.5542          | 24.1574 |
+| 0.7044        | 0.84  | 7000 | 0.5429          | 23.5450 |
+| 0.7309        | 0.97  | 8000 | 0.5354          | 23.5450 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1d7734884874f1a1513ed9aa760a4f8e97aaa02fd6d93a3a85d27b2ae9ca596b
 size 966995080

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0b0bc51c8fcc2553ef1edbd890f973348dccaec5e8acd9a67c3fab7a45bc320
 size 966995080

runs/Apr22_13-35-54_solaris/events.out.tfevents.1713807355.solaris.82143.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5564ca24eb8ec54a6a5f627ccee054eb9512284bee3ea9569543457daffe4812
+size 79051

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:13292bfe1a060c7a942b0ae44bb338403b3472b70fbbe81302e7a7ed63e4da2c
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:9f25c9e6b75d44792062d1a59f1f77044b0778d7c809dcf358c482ca488df5f2
 size 5048