no-op-ul-se
commited on
Commit
•
8471a16
1
Parent(s):
38f0a65
add models
Browse files
README.md
CHANGED
@@ -2,3 +2,47 @@
|
|
2 |
license: cc-by-nc-4.0
|
3 |
---
|
4 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: cc-by-nc-4.0
|
3 |
---
|
4 |
|
5 |
+
### guitar_iil_b2048_r48000_z16.ts
|
6 |
+
|
7 |
+
Dataset: [IILGuitarTimbre](https://github.com/Intelligent-Instruments-Lab/IILGuitarTimbre).
|
8 |
+
|
9 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
10 |
+
|
11 |
+
### organ_archive_b2048_r48000_z16.ts
|
12 |
+
|
13 |
+
Dataset: public domain organ music from archive.org. Small amounts of voice and other instruments were included, and vinyl record noises are prominent.
|
14 |
+
|
15 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
16 |
+
|
17 |
+
### organ_bach_b2048_sr48000_z16.ts
|
18 |
+
|
19 |
+
Dataset: various recordings of J. S. Bach music for church organ.
|
20 |
+
|
21 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
22 |
+
|
23 |
+
### voice_vocalset_b2048_r48000_z16.ts
|
24 |
+
|
25 |
+
Dataset: [VocalSet](https://zenodo.org/record/1193957) singing voice dataset.
|
26 |
+
|
27 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
28 |
+
|
29 |
+
### voice_hifitts_b2048_r48000_z16.ts
|
30 |
+
|
31 |
+
Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) audiobooks dataset.
|
32 |
+
|
33 |
+
Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
|
34 |
+
|
35 |
+
### voice_jvs_b2048_r44100_z16.ts
|
36 |
+
|
37 |
+
Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) speaker 9017 (John Van Stan).
|
38 |
+
|
39 |
+
Model: RAVE v3, 44.1kHz, block size 2048, 16 latent dimensions.
|
40 |
+
|
41 |
+
### voice_vctk_b2048_r44100_z16.ts
|
42 |
+
|
43 |
+
Dataset: [CSTR VCTK Corpus](https://datashare.ed.ac.uk/handle/10283/3443) multispeaker read speech dataset.
|
44 |
+
|
45 |
+
Model: RAVE v3, 44.1kHz, block size 2048, 22 latent dimensions.
|
46 |
+
|
47 |
+
|
48 |
+
|
guitar_iil_b2048_r48000_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:02458214e23890d6818504319a5b9903eabfe87a524491f6524f453e7f3dbcf0
|
3 |
+
size 163881670
|
organ_archive_b2048_r48000_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7fb80ff896c114e1ed436dfa4059e23694c8b0e36f2b16532b637f9b8854f96d
|
3 |
+
size 163885039
|
organ_bach_b2048_sr48000_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f7c06309e0388e666993226c06ed1438b56adc23b2a5a3b8f9155ed26990423c
|
3 |
+
size 163879431
|
voice_hifitts_b2048_r48000_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:67e888716655c5670d5d9e15d0bc43b5851ddd7a3004512a0c400a2eeb62522a
|
3 |
+
size 163881009
|
voice_jvs_b2048_r44100_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5d41684d151c0a98a51815479d866c1b4f8d8cbe2cdb62652d27f6ff2286ed77
|
3 |
+
size 150059552
|
voice_vctk_b2048_r44100_z22.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9e5578ea2c98856eff6b511089cc1eaba69eaf85527ad343604a6420fe3a751f
|
3 |
+
size 150058264
|
voice_vocalset_b2048_r48000_z16.ts
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ba1b5392c4645c8040aa618e43b8269d840b9752536caac37c91c698334fa9a6
|
3 |
+
size 163882118
|