|
--- |
|
license: mit |
|
--- |
|
|
|
|
|
# easyGUI |
|
|
|
`easyGUI` is a user-friendly voice conversion framework based on VITS, designed to eliminate timbre leakage by replacing input features with those from the training set. It's efficient even on lower-end GPUs, requiring only about 10 minutes of low-noise speech data for good results. The framework features a simple web interface, supports A card and I card acceleration, and uses the advanced RMVPE algorithm for pitch extraction. |
|
|
|
## Installation |
|
|
|
### Prerequisites |
|
- Python 3.8 or higher |
|
|
|
### Installation Steps |
|
1. **Install Pytorch**: |
|
```bash |
|
pip install torch torchvision torchaudio |
|
``` |
|
|
|
|
|
2. **Install Dependencies**: |
|
```bash |
|
pip install -r requirements.txt |
|
``` |
|
|
|
|
|
3 |
|
|
|
### Additional Setup |
|
- **Download Assets**: |
|
Download necessary models and files using the scripts in the `tools` directory. |
|
- **Install FFmpeg**: |
|
```bash |
|
sudo apt install ffmpeg |
|
``` |
|
|
|
## Usage |
|
Start the WebUI: |
|
```bash |
|
python demo.py |
|
``` |
|
|
|
|
|
## Features |
|
- Top1 retrieval to replace input features |
|
- Fast training on less powerful GPUs |
|
- Model merging to change timbre |
|
- Advanced pitch extraction with RMVPE |
|
|
|
|
|
--- |