Update README.md
Browse files
README.md
CHANGED
@@ -9,10 +9,17 @@ pipeline_tag: text-to-speech
|
|
9 |
|
10 |
## The vanilla VALL E train on WenetSpeech4TTS using Amphion tooltik.
|
11 |
|
12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
|
14 |
### usage
|
15 |
-
|
|
|
16 |
|
17 |
|
18 |
```
|
|
|
9 |
|
10 |
## The vanilla VALL E train on WenetSpeech4TTS using Amphion tooltik.
|
11 |
|
12 |
+
The entire training process follows its training code, except that the text-to-phoneme feature step is slightly different.
|
13 |
+
|
14 |
+
### Checkpoints
|
15 |
+
|
16 |
+
- **base_model.bin** : VALL-E trained with the WenetSpeech4TTS Basic subset
|
17 |
+
- **38sft_model.bin** : VALL-E Basic fine-tuning with the WenetSpeech4TTS Standard subset
|
18 |
+
- **4sft_model.bin** : VALL-E Standard fine-tuning with the WenetSpeech4TTS Premium subset
|
19 |
|
20 |
### usage
|
21 |
+
Inference code and more details : [ISCSLP2024_CoVoC_baseline](https://github.com/xkx-hub/ISCSLP2024_CoVoC_baseline).
|
22 |
+
```
|
23 |
|
24 |
|
25 |
```
|