---
license: mit
---
# easyGUI
`easyGUI` is a user-friendly voice conversion framework based on VITS. It is designed to eliminate timbre leakage by using top-1 retrieval to replace the input features with features from the training set. Training is efficient even on lower-end GPUs, and about 10 minutes of low-noise speech data is enough for good results. The framework provides a simple web interface, supports acceleration on AMD and Intel GPUs ("A card" and "I card"), and uses the RMVPE algorithm for pitch extraction.
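The retrieval idea described above can be illustrated with a minimal sketch. It is not the framework's actual code: the function name `top1_replace`, the plain-list feature vectors, and the choice of Euclidean distance are all assumptions made for illustration.

```python
import math

def top1_replace(input_features, training_features):
    """Replace each input vector with its single nearest training-set vector.

    Hypothetical helper for illustration only; the real framework may use a
    different distance measure and an approximate-nearest-neighbour index.
    """
    replaced = []
    for query in input_features:
        # Pick the training vector with the smallest Euclidean distance (top-1).
        nearest = min(training_features, key=lambda t: math.dist(query, t))
        replaced.append(list(nearest))
    return replaced

# Toy 2-D features: every input frame is swapped for its closest training frame.
training = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
inputs = [[0.9, 1.2], [4.0, 4.5]]
print(top1_replace(inputs, training))  # [[1.0, 1.0], [5.0, 5.0]]
```

Because every output vector comes from the training set, characteristics of the source speaker carried by the original input features are not passed on, which is the intuition behind the timbre-leakage claim.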
## Installation
### Prerequisites
- Python 3.8 or higher
### Installation Steps
1. **Install PyTorch**:
```bash
pip install torch torchvision torchaudio
```
2. **Install Dependencies**:
```bash
pip install -r requirements.txt
```
### Additional Setup
- **Download Assets**:
Download necessary models and files using the scripts in the `tools` directory.
- **Install FFmpeg** (the command below is for Debian/Ubuntu; use your platform's package manager otherwise):
```bash
sudo apt install ffmpeg
```
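To confirm that FFmpeg is reachable before starting the WebUI, a small check such as the following may help. It is a sketch using only the Python standard library; the helper name `find_executable` is hypothetical and not part of this repository.

```python
import shutil

def find_executable(name):
    """Return the full path of an executable on PATH, or None if it is missing."""
    return shutil.which(name)

ffmpeg_path = find_executable("ffmpeg")
if ffmpeg_path is None:
    print("ffmpeg was not found on PATH; install it before starting the WebUI.")
else:
    print(f"ffmpeg found at {ffmpeg_path}")
```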
## Usage
Start the WebUI:
```bash
python demo.py
```
## Features
- Top-1 retrieval to replace input features with training-set features
- Fast training on less powerful GPUs
- Model merging to change timbre
- Advanced pitch extraction with RMVPE
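
As a rough illustration of model merging, the sketch below blends two sets of parameters with a weighted average. This is an assumption about how merging could work, not a description of this project's implementation; the function `merge_models`, the `alpha` parameter, and the dictionary-of-lists weight format are hypothetical.

```python
def merge_models(weights_a, weights_b, alpha=0.5):
    """Blend two models' parameters: alpha * A + (1 - alpha) * B.

    Hypothetical helper; weights are plain dicts mapping a parameter name
    to a flat list of floats, and both models must share the same layout.
    """
    if weights_a.keys() != weights_b.keys():
        raise ValueError("models must have the same parameter names")
    merged = {}
    for name in weights_a:
        a, b = weights_a[name], weights_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted average of the two parameter vectors.
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged

# alpha closer to 1.0 keeps more of model A's timbre, closer to 0.0 more of B's.
voice_a = {"w": [0.0, 2.0]}
voice_b = {"w": [4.0, 6.0]}
print(merge_models(voice_a, voice_b, alpha=0.25))  # {'w': [3.0, 5.0]}
```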
---