Wav2Vec 2.0
A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.
Automatic Speech Recognition • Updated • 110k • 133Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled audio data from the LibriSpeech and LibriVox (LV) corpora, and fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant Wav2Vec 2.0 checkpoint from the initial release, obtaining 1.9/3.9% WER on the LibriSpeech test clean/other subsets respectively.
facebook/wav2vec2-large-960h
Automatic Speech Recognition • Updated • 83.3k • 26Note The Wav2Vec 2.0 "large" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-960h
Automatic Speech Recognition • Updated • 1.27M • • 306Note The Wav2Vec 2.0 "base" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-100h
Automatic Speech Recognition • Updated • 2.11k • 6Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data, and fine-tuned on 100 hours of labelled LibriSpeech ASR data.
facebook/wav2vec2-large-lv60
Updated • 13.1k • 6Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled data from the LibriSpeech and LibriVox (LV) corpora.
facebook/wav2vec2-large
Updated • 3.99k • 3Note The Wav2Vec 2.0 "large" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
facebook/wav2vec2-base
Updated • 1.27M • 77Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Paper • 2006.11477 • Published • 4Note The wav2vec 2.0 paper, accepted to NeurIPS 2020.