---
library_name: TTS
task: text-to-speech
tags:
- Persian
- TTS
- Farsi
- Coqui
- CoquiTTS
- pytorch
- audio
- text-to-speech
language: fa
datasets:
- persian-tts-dataset
widget:
- text: .زندگی فقط یک بار است؛ از آن به خوبی استفاده کن
  example_title: Hello, this is a test run.
license: openrail
pipeline_tag: text-to-speech
---
# **persian-tts-male-vits**
- A VITS model for Persian text-to-speech.
- Language: Persian (فارسی)
- Single-speaker male voice
- Fine-tuned from the **[persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits)** model on the **[persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset)** dataset
- [GitHub Repo](https://github.com/coqui-ai/TTS)
- <span style="color: #0072ff;font-weight: bold;">[Demo](https://huggingface.co/spaces/Kamtera/Persian-tts-CoquiTTS)</span>
# **Info**
This model was fine-tuned from the <span style="color: #d0212d;font-weight: bold;">[persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits)</span> model on the <span style="color: #d0212d;font-weight: bold;">[persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset)</span> dataset.
# **Uses**
Install dependencies:
```python
# Notebook cells (Colab, Kaggle, Jupyter); drop the leading "!" in a regular shell.
!pip install TTS
!sudo apt-get -y install espeak-ng
```
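Before running the examples, it can help to confirm that both dependencies are visible to Python. The sketch below uses only the standard library. `missing_dependencies` is a hypothetical helper name, and it assumes the package is importable as `TTS` (as in the import statements on this page) and that the phonemizer binary is named `espeak-ng`.

```python
import importlib.util
import shutil


def missing_dependencies(modules=("TTS",), tools=("espeak-ng",)):
    """Return the names of Python modules and command-line tools that cannot be found."""
    missing = []
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    for name in tools:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            missing.append(name)
    return missing


# Example: print(missing_dependencies()) lists whatever still needs installing.
```

An empty list means both the Python package and the `espeak-ng` executable were found.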
Generate audio from text:
##### Using the CLI:
```python
!tts --text "زندگی فقط یک بار است؛ از آن به خوبی استفاده کن" \
--model_path "best_model_91323.pth" \
--config_path "config.json" \
--out_path "speech1.wav"
```
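The `!tts` form above only works inside a notebook cell. Outside a notebook, the same command can be launched from a Python script; the sketch below is one way to do that with `subprocess`. The helper names `build_tts_command` and `run_tts` are hypothetical, and the sketch assumes the `tts` executable installed with the package is on `PATH`. Only the four flags shown above are used.

```python
import subprocess


def build_tts_command(text, model_path="best_model_91323.pth",
                      config_path="config.json", out_path="speech1.wav"):
    """Build the argument list for the `tts` CLI call shown above."""
    return [
        "tts",
        "--text", text,
        "--model_path", model_path,
        "--config_path", config_path,
        "--out_path", out_path,
    ]


def run_tts(text, **kwargs):
    """Run the CLI and raise CalledProcessError if it exits with an error."""
    # Passing a list (no shell) keeps the Persian text intact without extra quoting.
    subprocess.run(build_tts_command(text, **kwargs), check=True)


# Example: run_tts("زندگی فقط یک بار است؛ از آن به خوبی استفاده کن", out_path="speech1.wav")
```

Keeping the command construction separate from the call makes it easy to print or log the exact arguments before running them.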
##### Python API:
```python
from TTS.utils.synthesizer import Synthesizer

# Paths to the model checkpoint (.pth) and its config.json.
# Relative paths are resolved against the working directory; absolute paths also work.
model_path = "best_model_91323.pth"
config_path = "config.json"

text = ".زندگی فقط یک بار است؛ از آن به خوبی استفاده کن"

# Load the model, synthesize the text, and write the waveform to disk.
synthesizer = Synthesizer(model_path, config_path)
wavs = synthesizer.tts(text)
synthesizer.save_wav(wavs, "sp.wav")
```
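To synthesize several sentences, the loaded `Synthesizer` can be reused rather than rebuilt for each call. The following is a minimal sketch under that assumption. `plan_outputs` and `synthesize_all` are hypothetical helpers, and the output naming scheme (`sp_000.wav`, `sp_001.wav`, and so on) is only an illustration. `TTS` is imported inside the function so that the file-planning part can be used without the package installed.

```python
from pathlib import Path


def plan_outputs(texts, out_dir="outputs", prefix="sp"):
    """Pair each non-empty text with a numbered .wav path (hypothetical naming scheme)."""
    cleaned = [t.strip() for t in texts if t.strip()]
    return [
        (text, Path(out_dir) / f"{prefix}_{i:03d}.wav")
        for i, text in enumerate(cleaned)
    ]


def synthesize_all(texts, model_path="best_model_91323.pth",
                   config_path="config.json", out_dir="outputs"):
    """Synthesize every text with one shared Synthesizer and return the written paths."""
    # Imported lazily; requires the TTS package from the install step.
    from TTS.utils.synthesizer import Synthesizer

    synthesizer = Synthesizer(model_path, config_path)  # load the model once
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    written = []
    for text, wav_path in plan_outputs(texts, out_dir):
        wav = synthesizer.tts(text)
        synthesizer.save_wav(wav, str(wav_path))
        written.append(wav_path)
    return written
```

Loading the checkpoint is typically the slow step, so sharing one `Synthesizer` across all sentences avoids repeating it.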
Display audio:
```python
# In a notebook, the last expression in a cell is rendered as an audio player.
from IPython.display import Audio

Audio("sp.wav")
```
- **Hours used:** 10
- **Cloud Provider:** Kaggle