|
--- |
|
license: mit |
|
language: |
|
- en |
|
tags: |
|
- RVC |
|
- voice-conversion |
|
- text-to-speech |
|
- voice-cloning |
|
- audio-generation |
|
datasets: |
|
- LJSpeech |
|
- VCTK |
|
metrics: |
|
- MOS (Mean Opinion Score) |
|
- PESQ |
|
- STOI |
|
base_model: |
|
- MangioRVC/Mangio-RVC-Huggingface |
|
pipeline_tag: audio-to-audio |
|
--- |
|
|
|
# 🌙 LUNAR - High-Quality Female Voice RVC Model |
|
|
|
LUNAR is a state-of-the-art **RVC (Retrieval-Based Voice Conversion) model** optimized for **female voice conversion** with studio-grade audio quality at **48kHz sampling rate**. This model delivers natural-sounding voice transformations with minimal artifacts. |
|
|
|
## **Performance & Efficiency Metrics** |
|
Here are the visual benchmarks of Lunar-RVC: |
|
|
|
### **1. Training Loss Curve** |
|
 |
|
|
|
### **2. Validation Loss Curve** |
|
 |
|
|
|
### **3. Training vs Validation Loss** |
|
 |
|
|
|
### **4. Inference Speed Comparison** |
|
 |
|
|
|
### **5. Audio Quality Scores (MOS)** |
|
 |
|
|
|
### **6. GPU Memory Usage** |
|
 |
|
|
|
### **7. Dataset Duration Distribution** |
|
 |
|
|
|
### **8. Spectral Convergence** |
|
 |
|
|
|
### **9. Model Size Comparison** |
|
 |
|
|
|
### **10. Efficiency Radar Chart** |
|
 |
|
|
|
--- |
|
|
|
## Key Features |
|
|
|
- **High-Fidelity Conversion** - Produces natural, expressive female voices |
|
- **Real-Time Ready** - Optimized for low-latency inference (<20ms/frame) |
|
- **Pitch & Timbre Control** - Flexible voice modulation capabilities |
|
- **48kHz Studio Quality** - Professional-grade audio output |
|
- **Easy Integration** - Compatible with popular voice toolkits |
|
|
|
## 📊 Model Specifications |
|
|
|
| Parameter | Value | |
|
|--------------------|---------------------| |
|
| Framework | RVC v2 | |
|
| Sample Rate | 48kHz | |
|
| Bit Depth | 16-bit | |
|
| Model Size | 1.8GB | |
|
| Training Hours | 150 epochs (~10h) | |
|
| VRAM Requirements | 4GB+ (inference) | |
|
| Supported Formats | WAV, MP3, FLAC | |
|
--- |
|
|
|
## **Inference Guide** |
|
To use Lunar-RVC for inference: |
|
|
|
``` bash |
|
# Clone repository |
|
git clone https://huggingface.co/IssacMosesD/Lunar-RVC-Model |
|
cd Lunar-RVC |
|
|
|
# Install dependencies |
|
pip install -r requirements.txt |
|
|
|
# Run inference |
|
python infer.py --input input.wav --output output.wav --model Lunar-RVC.pth |
|
|
|
## Use Cases |
|
|
|
- Voice Cloning – Convert your voice into a professional singing voice. |
|
|
|
- Streaming – Real-time voice conversion for content creators. |
|
|
|
- Dubbing – High-quality voice conversion for movies & animations. |
|
|
|
- Music Production – Transform any vocal track into a new singer’s voice. |
|
|
|
## System Requirements |
|
|
|
- OS: Windows / Linux |
|
|
|
- Python: 3.8+ |
|
|
|
- GPU: NVIDIA (6GB VRAM minimum recommended) |
|
|
|
- CUDA: 11.7+ |
|
|
|
- Torch: 1.13.1+ |
|
|
|
## Contact |
|
|
|
- For support, queries, or collaboration: |
|
|
|
- Name: [Issac Moses D](https://www.linkedin.com/in/issacmosesd/) |
|
|
|
- Email: [[email protected]](mailto:[email protected]) |
|
|
|
- Hugging Face: [IssacMosesD](https://huggingface.co/IssacMosesD) |
|
|
|
- GitHub: [Issac Moses D](https://github.com/Issac-Moses) |
|
|
|
## Collaburator |
|
|
|
- Name: [Dharani Karnan](https://www.linkedin.com/in/dharani-karnan-060040320/) |
|
|
|
- Email: [[email protected]](mailto:[email protected]) |
|
|
|
- Hugging Face: [Dharani K](https://huggingface.co/dharzz188) |
|
|
|
- GitHub: [Dharani K](https://github.com/Issac-Moses) |