Update README.md
README.md
CHANGED
@@ -13,8 +13,8 @@ tags:
 whisper-th-small-ct2 is the CTranslate2 format of [biodatlab/whisper-th-small-combined](https://huggingface.co/biodatlab/whisper-th-small-combined), comparable with [WhisperX](https://github.com/m-bain/whisperX) and [faster-whisper](https://github.com/SYSTRAN/faster-whisper), which enables:
 
 - 🤏 **Half the size** of original Huggingface format.
-- ⚡️ Batched inference for **70x** real-time transcription
-- 🪶 A faster-whisper backend, requiring **<8GB GPU memory**
+- ⚡️ Batched inference for **70x** real-time transcription.
+- 🪶 A faster-whisper backend, requiring **<8GB GPU memory** with beam_size=5.
 - 🎯 Accurate word-level timestamps using wav2vec2 alignment.
 - 👯♂️ Multispeaker ASR using speaker diarization(includes speaker ID labels).
 - 🗣️ VAD preprocessing, reducing hallucinations and allowing batching with no WER degradation.