$fractalego's picture$

fractalego

Create README.md

3720e66 11 months ago

preview code

raw

history blame contribute delete

343 Bytes

Personal speech to text model

Speech to Text models often do not understand my accent, so I fine tuned this one from "distil-whisper/distil-medium.en" using about 1000 recordings of my voice, comprising of about 2h of recordings. The system goes from ~12% WER to ~8% WER.

Do not download unless you have exactly my accent (North-East Italy).