Kamtera
/

persian-tts-male-vits

Model card Files Files and versions Metrics Training metrics Community

persian-tts-male-vits / README.md

Kamtera's picture

Update README.md

d3b3a30 almost 2 years ago

|

2.44 kB

	---
	library_name: TTS
	task: text-to-speech
	tags:
	- Persian
	- TTS
	- Farsi
	- Coqui
	- CoquiTTS
	- pytorch
	- audio
	- text-to-speech
	language: fa
	datasets:
	- persian-tts-dataset
	widget:
	- text: .زندگی فقط یک بار است؛ از آن به خوبی استفاده کن
	example_title: Hello, this is a test run.
	license: openrail
	pipeline_tag: text-to-speech
	---

	# persian-tts-male-vits

	- persian-tts-male vits model for text to speech purposes.
	- Persian فارسی
	- Single-speaker male voice
	- finetuned [persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits) model on [persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset) dataset
	- [GitHub Repo](https://github.com/coqui-ai/TTS)
	- <span style="color: #0072ff;font-weight: bold;">[Demo](https://huggingface.co/spaces/Kamtera/Persian-tts-CoquiTTS)</span>

	# info

	finetuned <span style="color: #d0212d;font-weight: bold;">[persian-tts-female-vits](https://huggingface.co/Kamtera/persian-tts-female-vits)</span>
	model on <span style="color: #d0212d;font-weight: bold;">[persian-tts-dataset-male](https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset)</span>
	dataset

	# Uses

	Install dependencies:
	```python
	!pip install TTS
	!sudo apt-get -y install espeak-ng
	```

	Generate audio from text:

	##### using cli:
	```python
	!tts --text "زندگی فقط یک بار است؛ از آن به خوبی استفاده کن" \
	--model_path "best_model_91323.pth" \
	--config_path "config.json" \
	--out_path "speech1.wav"
	```
	##### python api:

	```python

	from TTS.config import load_config
	from TTS.utils.manage import ModelManager
	from TTS.utils.synthesizer import Synthesizer

	config="config.json"
	model="best_model_91323.pth"

	model_path =model # Absolute path to the model checkpoint.pth
	config_path =config # Absolute path to the model config.json

	text=".زندگی فقط یک بار است؛ از آن به خوبی استفاده کن"

	synthesizer = Synthesizer(
	model_path, config_path
	)
	wavs = synthesizer.tts(text)
	synthesizer.save_wav(wavs, 'sp.wav')
	```
	Display audio:


	```python
	import IPython
	IPython.display.Audio('sp.wav')
	```


	- Hours used: 10
	- Cloud Provider: kaggle



	# How to Get Started with the Model

	Use the code below to get started with the model.

	<details>
	<summary> Click to expand </summary>

	More information needed

	</details>