--- license: apache-2.0 datasets: - Junhoee/STT_Korean_Dataset_80000 - Bingsu/zeroth-korean language: - ko metrics: - wer base_model: - openai/whisper-large-v3-turbo pipeline_tag: automatic-speech-recognition library_name: transformers --- This model is a fine-tune of OpenAI's Whisper Large v3 Turbo model (https://huggingface.co/openai/whisper-large-v3-turbo) over the following Korean datasets: https://huggingface.co/datasets/Junhoee/STT_Korean_Dataset_80000 https://huggingface.co/datasets/Bingsu/zeroth-korean Combined they have roughly 102k sentences. This is the last checkpoint which has achieved ~16 WER (down from ~24 WER). Training was 10,000 iterations.