---
license: mit
---


# easyGUI

`easyGUI` is a user-friendly voice conversion framework based on VITS. It is designed to eliminate timbre leakage by using top-1 retrieval to replace input features with features from the training set. Training is fast even on lower-end GPUs, and about 10 minutes of low-noise speech data is enough for good results. The framework provides a simple web interface, supports acceleration on AMD and Intel GPUs ("A card" and "I card"), and uses the RMVPE algorithm for pitch extraction.
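The retrieval idea can be illustrated with a toy sketch. This is not easyGUI's actual code: the function names `top1_retrieve` and `replace_features`, the 2-D feature vectors, and the use of Euclidean distance are all assumptions made for illustration. Each input frame is swapped for its single nearest neighbour in a bank of training-set features.

```python
import math

def top1_retrieve(query, bank):
    """Return the bank vector closest to `query` (Euclidean distance).

    Hypothetical helper for illustration; a real system would likely use
    a vector index over high-dimensional features instead of a linear scan.
    """
    return min(bank, key=lambda vec: math.dist(query, vec))

def replace_features(frames, bank):
    """Replace every input frame with its top-1 match from the training bank."""
    return [top1_retrieve(frame, bank) for frame in frames]

# Toy 2-D "features" standing in for real speech features.
training_bank = [(0.0, 0.0), (1.0, 1.0), (4.0, 4.0)]
input_frames = [(0.9, 1.2), (3.5, 4.1), (0.1, -0.2)]

converted = replace_features(input_frames, training_bank)
print(converted)  # [(1.0, 1.0), (4.0, 4.0), (0.0, 0.0)]
```

Because every output frame comes from the training bank, none of the source speaker's own feature values pass through, which is the intuition behind reducing timbre leakage.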

## Installation

### Prerequisites
- Python 3.8 or higher

### Installation Steps
1. **Install PyTorch**:
   ```bash
   pip install torch torchvision torchaudio
   ```
   

2. **Install Dependencies**:
   ```bash
   pip install -r requirements.txt
   ```
   


### Additional Setup
- **Download Assets**:
  Download necessary models and files using the scripts in the `tools` directory.
- **Install FFmpeg** (Debian/Ubuntu):
  ```bash
  sudo apt install ffmpeg
  ```
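Before starting the WebUI, it can help to confirm that the prerequisites are in place. The following is a minimal sketch, not part of easyGUI: the function name `check_environment` is hypothetical, and it only checks that Python is 3.8 or newer, that a `torch` package is importable, and that an `ffmpeg` executable is on `PATH`.

```python
import importlib.util
import shutil
import sys

def check_environment():
    """Return a dict mapping each requirement to True (found) or False."""
    return {
        # Python 3.8 or higher, as listed under Prerequisites.
        "python>=3.8": sys.version_info >= (3, 8),
        # Looks up the package without importing it (importing torch is slow).
        "torch": importlib.util.find_spec("torch") is not None,
        # shutil.which returns None when the executable is not on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
    }

if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

This does not verify the packages in `requirements.txt` or the downloaded assets; those still need to be checked separately.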

## Usage
Start the WebUI:
```bash
python demo.py
```


## Features
- Top-1 retrieval to replace input features with training-set features
- Fast training on less powerful GPUs
- Model merging to change timbre
- Advanced pitch extraction with RMVPE
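How model merging works is not described here. One common approach, shown below purely as an assumption, is linear interpolation between the weights of two models with identical architectures. The function `merge_weights`, the parameter `alpha`, and the toy weight dictionaries are hypothetical and use plain Python lists instead of real tensors.

```python
def merge_weights(weights_a, weights_b, alpha=0.5):
    """Blend two weight dicts: alpha * a + (1 - alpha) * b, per parameter.

    Illustrative only; assumes both models share the same parameter names
    and shapes. `alpha` = 1.0 keeps model A, 0.0 keeps model B.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    if weights_a.keys() != weights_b.keys():
        raise ValueError("models must have the same parameter names")

    merged = {}
    for name in weights_a:
        a, b = weights_a[name], weights_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged

# Toy "models" with two parameters each.
voice_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
voice_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

blend = merge_weights(voice_a, voice_b, alpha=0.25)
print(blend)  # {'layer.weight': [2.5, 3.5], 'layer.bias': [0.75]}
```

Under this assumption, moving `alpha` between 0 and 1 shifts the resulting timbre from one voice toward the other.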


---