RVC / docs /README.ko.md
fantasyfish's picture
Initial commit
4de32eb
|
raw
history blame
5.47 kB
<div align="center">
<h1>Retrieval-based-Voice-Conversion-WebUI</h1>
VITS ๊ธฐ๋ฐ˜์˜ ๊ฐ„๋‹จํ•˜๊ณ  ์‚ฌ์šฉํ•˜๊ธฐ ์‰ฌ์šด ์Œ์„ฑ ๋ณ€ํ™˜ ํ”„๋ ˆ์ž„์›Œํฌ.<br><br>
[![madewithlove](https://forthebadge.com/images/badges/built-with-love.svg)](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI)
<img src="https://counter.seku.su/cmoe?name=rvc&theme=r34" /><br>
[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb)
[![Licence](https://img.shields.io/github/license/liujing04/Retrieval-based-Voice-Conversion-WebUI?style=for-the-badge)](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/%E4%BD%BF%E7%94%A8%E9%9C%80%E9%81%B5%E5%AE%88%E7%9A%84%E5%8D%8F%E8%AE%AE-LICENSE.txt)
[![Huggingface](https://img.shields.io/badge/๐Ÿค—%20-Spaces-yellow.svg?style=for-the-badge)](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/)
[![Discord](https://img.shields.io/badge/RVC%20Developers-Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white)](https://discord.gg/HcsmBBGyVk)
</div>
------
[**์—…๋ฐ์ดํŠธ ๋กœ๊ทธ**](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Changelog_CN.md)
[**English**](./README.en.md) | [**ไธญๆ–‡็ฎ€ไฝ“**](../README.md) | [**ๆ—ฅๆœฌ่ชž**](./README.ja.md) | [**ํ•œ๊ตญ์–ด**](./README.ko.md) ([**้Ÿ“ๅœ‹่ชž**](./README.ko.han.md))
> [๋ฐ๋ชจ ์˜์ƒ](https://www.bilibili.com/video/BV1pm4y1z7Gm/)์„ ํ™•์ธํ•ด ๋ณด์„ธ์š”!
> RVC๋ฅผ ํ™œ์šฉํ•œ ์‹ค์‹œ๊ฐ„ ์Œ์„ฑ๋ณ€ํ™˜: [w-okada/voice-changer](https://github.com/w-okada/voice-changer)
> ๊ธฐ๋ณธ ๋ชจ๋ธ์€ 50์‹œ๊ฐ„ ๊ฐ€๋Ÿ‰์˜ ๊ณ ํ€„๋ฆฌํ‹ฐ ์˜คํ”ˆ ์†Œ์Šค VCTK ๋ฐ์ดํ„ฐ์…‹์„ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฏ€๋กœ, ์ €์ž‘๊ถŒ์ƒ์˜ ์—ผ๋ ค๊ฐ€ ์—†์œผ๋‹ˆ ์•ˆ์‹ฌํ•˜๊ณ  ์‚ฌ์šฉํ•˜์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค.
> ์ €์ž‘๊ถŒ ๋ฌธ์ œ๊ฐ€ ์—†๋Š” ๊ณ ํ€„๋ฆฌํ‹ฐ์˜ ๋…ธ๋ž˜๋ฅผ ์ดํ›„์—๋„ ๊ณ„์†ํ•ด์„œ ํ›ˆ๋ จํ•  ์˜ˆ์ •์ž…๋‹ˆ๋‹ค.
## ์†Œ๊ฐœ
๋ณธ Repo๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์€ ํŠน์ง•์„ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค:
+ top1 ๊ฒ€์ƒ‰์„ ์ด์šฉํ•˜์—ฌ ์ž…๋ ฅ ์Œ์ƒ‰ ํŠน์ง•์„ ํ›ˆ๋ จ ์„ธํŠธ ์Œ์ƒ‰ ํŠน์ง•์œผ๋กœ ๋Œ€์ฒดํ•˜์—ฌ ์Œ์ƒ‰์˜ ๋ˆ„์ถœ์„ ๋ฐฉ์ง€;
+ ์ƒ๋Œ€์ ์œผ๋กœ ๋‚ฎ์€ ์„ฑ๋Šฅ์˜ GPU์—์„œ๋„ ๋น ๋ฅธ ํ›ˆ๋ จ ๊ฐ€๋Šฅ;
+ ์ ์€ ์–‘์˜ ๋ฐ์ดํ„ฐ๋กœ ํ›ˆ๋ จํ•ด๋„ ์ข‹์€ ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ์Œ (์ตœ์†Œ 10๋ถ„ ์ด์ƒ์˜ ์ €์žก์Œ ์Œ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์„ ๊ถŒ์žฅ);
+ ๋ชจ๋ธ ์œตํ•ฉ์„ ํ†ตํ•œ ์Œ์ƒ‰์˜ ๋ณ€์กฐ ๊ฐ€๋Šฅ (ckpt ์ฒ˜๋ฆฌ ํƒญ->ckpt ๋ณ‘ํ•ฉ ์„ ํƒ);
+ ์‚ฌ์šฉํ•˜๊ธฐ ์‰ฌ์šด WebUI (์›น ์ธํ„ฐํŽ˜์ด์Šค);
+ UVR5 ๋ชจ๋ธ์„ ์ด์šฉํ•˜์—ฌ ๋ชฉ์†Œ๋ฆฌ์™€ ๋ฐฐ๊ฒฝ์Œ์•…์˜ ๋น ๋ฅธ ๋ถ„๋ฆฌ;
## ํ™˜๊ฒฝ์˜ ์ค€๋น„
poetry๋ฅผ ํ†ตํ•ด dependecies๋ฅผ ์„ค์น˜ํ•˜๋Š” ๊ฒƒ์„ ๊ถŒ์žฅํ•ฉ๋‹ˆ๋‹ค.
๋‹ค์Œ ๋ช…๋ น์€ Python ๋ฒ„์ „ 3.8 ์ด์ƒ์˜ ํ™˜๊ฒฝ์—์„œ ์‹คํ–‰๋˜์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค:
```bash
# PyTorch ๊ด€๋ จ ์ฃผ์š” dependencies ์„ค์น˜, ์ด๋ฏธ ์„ค์น˜๋˜์–ด ์žˆ๋Š” ๊ฒฝ์šฐ ๊ฑด๋„ˆ๋›ฐ๊ธฐ ๊ฐ€๋Šฅ
# ์ฐธ์กฐ: https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio
# Windows + Nvidia Ampere Architecture(RTX30xx)๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์žˆ๋‹ค๋ฉด, https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21 ์—์„œ ๋ช…์‹œ๋œ ๊ฒƒ๊ณผ ๊ฐ™์ด PyTorch์— ๋งž๋Š” CUDA ๋ฒ„์ „์„ ์ง€์ •ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
#pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117
# Poetry ์„ค์น˜, ์ด๋ฏธ ์„ค์น˜๋˜์–ด ์žˆ๋Š” ๊ฒฝ์šฐ ๊ฑด๋„ˆ๋›ฐ๊ธฐ ๊ฐ€๋Šฅ
# Reference: https://python-poetry.org/docs/#installation
curl -sSL https://install.python-poetry.org | python3 -
# Dependecies ์„ค์น˜
poetry install
```
pip๋ฅผ ํ™œ์šฉํ•˜์—ฌ dependencies๋ฅผ ์„ค์น˜ํ•˜์—ฌ๋„ ๋ฌด๋ฐฉํ•ฉ๋‹ˆ๋‹ค.
**๊ณต์ง€**: `MacOS`์—์„œ `faiss 1.7.2`๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด Segmentation Fault: 11 ์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ˆ˜๋™์œผ๋กœ pip๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์„ค์น˜ํ•˜๋Š” ๊ฒฝ์šฐ `pip install faiss-cpu==1.7.0`์„ ์‚ฌ์šฉํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
```bash
pip install -r requirements.txt
```
## ๊ธฐํƒ€ ์‚ฌ์ „ ๋ชจ๋ธ ์ค€๋น„
RVC ๋ชจ๋ธ์€ ์ถ”๋ก ๊ณผ ํ›ˆ๋ จ์„ ์œ„ํ•˜์—ฌ ๋‹ค๋ฅธ ์‚ฌ์ „ ๋ชจ๋ธ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
[Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/)๋ฅผ ํ†ตํ•ด์„œ ๋‹ค์šด๋กœ๋“œ ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค์Œ์€ RVC์— ํ•„์š”ํ•œ ์‚ฌ์ „ ๋ชจ๋ธ ๋ฐ ๊ธฐํƒ€ ํŒŒ์ผ ๋ชฉ๋ก์ž…๋‹ˆ๋‹ค:
```bash
hubert_base.pt
./pretrained
./uvr5_weights
# Windows๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ ์ด ์‚ฌ์ „๋„ ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. FFmpeg๊ฐ€ ์„ค์น˜๋˜์–ด ์žˆ์œผ๋ฉด ๊ฑด๋„ˆ๋›ฐ์–ด๋„ ๋ฉ๋‹ˆ๋‹ค.
ffmpeg.exe
```
๊ทธ ํ›„ ์ดํ•˜์˜ ๋ช…๋ น์„ ์‚ฌ์šฉํ•˜์—ฌ WebUI๋ฅผ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
```bash
python infer-web.py
```
Windows๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ `RVC-beta.7z`๋ฅผ ๋‹ค์šด๋กœ๋“œ ๋ฐ ์••์ถ• ํ•ด์ œํ•˜์—ฌ RVC๋ฅผ ์ง์ ‘ ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜ `go-web.bat`์„ ์‚ฌ์šฉํ•˜์—ฌ WebUi๋ฅผ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
## ์ฐธ๊ณ 
+ [ContentVec](https://github.com/auspicious3000/contentvec/)
+ [VITS](https://github.com/jaywalnut310/vits)
+ [HIFIGAN](https://github.com/jik876/hifi-gan)
+ [Gradio](https://github.com/gradio-app/gradio)
+ [FFmpeg](https://github.com/FFmpeg/FFmpeg)
+ [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui)
+ [audio-slicer](https://github.com/openvpi/audio-slicer)
## ๋ชจ๋“  ๊ธฐ์—ฌ์ž ๋ถ„๋“ค์˜ ๋…ธ๋ ฅ์— ๊ฐ์‚ฌ๋“œ๋ฆฝ๋‹ˆ๋‹ค.
<a href="https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank">
<img src="https://contrib.rocks/image?repo=liujing04/Retrieval-based-Voice-Conversion-WebUI" />
</a>