---
license: mit
---
# easyGUI
`easyGUI` is a user-friendly voice conversion framework based on VITS. It reduces timbre leakage by replacing input features with the closest matching features from the training set (top-1 retrieval). Training is fast even on lower-end GPUs, and about 10 minutes of low-noise speech data is enough for good results. The framework provides a simple web interface, supports acceleration on AMD and Intel GPUs ("A card" and "I card"), and uses the RMVPE algorithm for pitch extraction.
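
To illustrate the retrieval idea described above, here is a minimal sketch in plain Python. It is not the framework's actual code: the function names (`squared_distance`, `top1_replace`), the toy 2-dimensional feature vectors, and the use of squared Euclidean distance are all assumptions made for illustration. A real system would work on high-dimensional features and would likely use an approximate nearest-neighbour index.

```python
from typing import List, Sequence

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def top1_replace(input_frames: Sequence[Vector],
                 training_frames: Sequence[Vector]) -> List[List[float]]:
    """Replace each input frame with its single nearest training-set frame.

    Hypothetical helper: the distance metric and data layout are assumptions.
    """
    if not training_frames:
        raise ValueError("training_frames must not be empty")
    replaced = []
    for frame in input_frames:
        # Top-1 retrieval: keep only the closest training-set feature.
        nearest = min(training_frames,
                      key=lambda t: squared_distance(frame, t))
        replaced.append(list(nearest))
    return replaced


# Toy data: three training-set features and three input features.
training = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
inputs = [[0.9, 1.2], [4.0, 4.5], [0.1, -0.2]]

converted = top1_replace(inputs, training)
print(converted)  # [[1.0, 1.0], [5.0, 5.0], [0.0, 0.0]]
```

Because every output frame comes from the training set, the source speaker's features never reach the synthesis stage directly, which is the intuition behind the reduced timbre leakage.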
## Installation
### Prerequisites
- Python 3.8 or higher
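
If you are unsure which interpreter is active, a short check like the following can confirm the prerequisite. This is only a convenience sketch; `python_is_supported` and `MIN_PYTHON` are hypothetical names and not part of the project.

```python
import sys

# Minimum version taken from the prerequisite above.
MIN_PYTHON = (3, 8)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON) -> bool:
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


if python_is_supported():
    print("Python version OK:", sys.version.split()[0])
else:
    print("Python 3.8 or higher is required, found:", sys.version.split()[0])
```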
### Installation Steps
1. **Install PyTorch**:
```bash
pip install torch torchvision torchaudio
```
2. **Install Dependencies**:
```bash
pip install -r requirements.txt
```
### Additional Setup
- **Download Assets**:
Download necessary models and files using the scripts in the `tools` directory.
- **Install FFmpeg** (Debian/Ubuntu):
```bash
sudo apt install ffmpeg
```
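
To confirm that FFmpeg is reachable after installation, you can look it up on `PATH` from Python. This is a small sketch using only the standard library; `find_ffmpeg` is a hypothetical helper name, not something the project provides.

```python
import shutil
from typing import Optional


def find_ffmpeg(executable: str = "ffmpeg") -> Optional[str]:
    """Return the full path of the executable if it is on PATH, else None."""
    return shutil.which(executable)


path = find_ffmpeg()
if path is None:
    print("FFmpeg was not found on PATH; install it before starting the WebUI.")
else:
    print("FFmpeg found at:", path)
```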
## Usage
Start the WebUI:
```bash
python demo.py
```
## Features
- Top-1 retrieval to replace input features with training-set features
- Fast training on less powerful GPUs
- Model merging to change timbre
- Advanced pitch extraction with RMVPE
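
As a rough illustration of the model merging feature, the sketch below blends two sets of weights by linear interpolation. This README does not specify how merging is implemented, so the interpolation formula, the `merge_weights` function, the `alpha` parameter, and the toy parameter names are all assumptions. Real checkpoints would hold tensors rather than Python lists.

```python
from typing import Dict, List

Weights = Dict[str, List[float]]


def merge_weights(model_a: Weights, model_b: Weights, alpha: float) -> Weights:
    """Blend two models parameter by parameter: alpha * A + (1 - alpha) * B.

    Hypothetical helper: assumes both models share the same parameter
    names and shapes.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    if model_a.keys() != model_b.keys():
        raise ValueError("models must have the same parameter names")
    merged: Weights = {}
    for name in model_a:
        a, b = model_a[name], model_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [alpha * x + (1.0 - alpha) * y for x, y in zip(a, b)]
    return merged


# Toy "checkpoints" for two voices (made-up values).
voice_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
voice_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

blend = merge_weights(voice_a, voice_b, alpha=0.5)
print(blend)  # {'layer.weight': [2.0, 3.0], 'layer.bias': [0.5]}
```

Under this assumption, `alpha` closer to 1 keeps the timbre nearer to the first voice, and `alpha` closer to 0 moves it toward the second.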
---