TensorBoard
no-op-ul-se commited on
Commit
8471a16
1 Parent(s): 38f0a65

add models

Browse files
README.md CHANGED
@@ -2,3 +2,47 @@
2
  license: cc-by-nc-4.0
3
  ---
4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: cc-by-nc-4.0
3
  ---
4
 
5
+ ### guitar_iil_b2048_r48000_z16.ts
6
+
7
+ Dataset: [IILGuitarTimbre](https://github.com/Intelligent-Instruments-Lab/IILGuitarTimbre).
8
+
9
+ Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
10
+
11
+ ### organ_archive_b2048_r48000_z16.ts
12
+
13
+ Dataset: public domain organ music from archive.org. Small amounts of voice and other instruments were included, and vinyl record noises are prominent.
14
+
15
+ Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
16
+
17
+ ### organ_bach_b2048_sr48000_z16.ts
18
+
19
+ Dataset: various recordings of J. S. Bach music for church organ.
20
+
21
+ Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
22
+
23
+ ### voice_vocalset_b2048_r48000_z16.ts
24
+
25
+ Dataset: [VocalSet](https://zenodo.org/record/1193957) singing voice dataset.
26
+
27
+ Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
28
+
29
+ ### voice_hifitts_b2048_r48000_z16.ts
30
+
31
+ Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) audiobooks dataset.
32
+
33
+ Model: modified RAVE v1, 48kHz, block size 2048, 16 latent dimensions.
34
+
35
+ ### voice_jvs_b2048_r44100_z16.ts
36
+
37
+ Dataset: [Hi-Fi TTS](http://arxiv.org/abs/2104.01497) speaker 9017 (John Van Stan).
38
+
39
+ Model: RAVE v3, 44.1kHz, block size 2048, 16 latent dimensions.
40
+
41
+ ### voice_vctk_b2048_r44100_z16.ts
42
+
43
+ Dataset: [CSTR VCTK Corpus](https://datashare.ed.ac.uk/handle/10283/3443) multispeaker read speech dataset.
44
+
45
+ Model: RAVE v3, 44.1kHz, block size 2048, 22 latent dimensions.
46
+
47
+
48
+
guitar_iil_b2048_r48000_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:02458214e23890d6818504319a5b9903eabfe87a524491f6524f453e7f3dbcf0
3
+ size 163881670
organ_archive_b2048_r48000_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7fb80ff896c114e1ed436dfa4059e23694c8b0e36f2b16532b637f9b8854f96d
3
+ size 163885039
organ_bach_b2048_sr48000_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7c06309e0388e666993226c06ed1438b56adc23b2a5a3b8f9155ed26990423c
3
+ size 163879431
voice_hifitts_b2048_r48000_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:67e888716655c5670d5d9e15d0bc43b5851ddd7a3004512a0c400a2eeb62522a
3
+ size 163881009
voice_jvs_b2048_r44100_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5d41684d151c0a98a51815479d866c1b4f8d8cbe2cdb62652d27f6ff2286ed77
3
+ size 150059552
voice_vctk_b2048_r44100_z22.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e5578ea2c98856eff6b511089cc1eaba69eaf85527ad343604a6420fe3a751f
3
+ size 150058264
voice_vocalset_b2048_r48000_z16.ts ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba1b5392c4645c8040aa618e43b8269d840b9752536caac37c91c698334fa9a6
3
+ size 163882118