harmony-ai commited on
Commit
d47f092
·
verified ·
1 Parent(s): a4e7e04

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +76 -3
README.md CHANGED
@@ -1,3 +1,76 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - audio
5
+ - text-to-speech
6
+ - instant-voice-cloning
7
+ language:
8
+ - en
9
+ inference: true
10
+ ---
11
+
12
+ # Harmony Speech V1
13
+
14
+ ---
15
+
16
+ Version 1 of Harmony Speech is one of the first AI technologies which were created by Project Harmony.AI back in 2023.
17
+ The goal was to achive an AI voice cloning engine, which is capable of maintaining the Speaker Identity of any voice on speech generation,
18
+ and allows for faster-than-realtime voice generation, even if performed in a CPU-only environment.
19
+
20
+ It builds on top of a fork of [CorentinJ's amazing "Real-Time-Voice-Cloning"](https://github.com/CorentinJ/Real-Time-Voice-Cloning) repositoriy,
21
+ however, with some significant changes applied to the codebase from our side. The full training code will be open sourced at a later point,
22
+ when we had the chance to wrap it up properly.
23
+
24
+ <b>As part of Harmony.AI Release Version 0.2.1, we're open sourcing these model weights under the Apache License on Huggingface.
25
+
26
+ The inference code is part of our [Harmony Speech Engine](https://github.com/harmony-ai-solutions/harmony-speech-engine) codebase.</b>
27
+
28
+ Main components of this model:
29
+ - Speaker Encoder, trained on Mozilla CommonVoice 7 and a few other Multilingual Speech corpuses to ensure capturing of voice characteristics across any language
30
+ - Mel Spectogram Synthesizer based on [Forward Tacotron](https://github.com/spring-media/ForwardTacotron), trained in English language
31
+ - Speech Vocoder based on [Multi-band MelGAN](https://arxiv.org/abs/2005.05106), trained on the same dataset as the Synthesizer and Spectograms generated by the Synthesizer
32
+
33
+ Following the recent advancements of Open Source AI Speech Technology and the lack of development capacities on our side currently,
34
+ we came to the conclusion that we can best support our community by creating an inference engine with an unified API for the huge variety of Speech related AI models and toolchains,
35
+ rather than training additional, custom models.
36
+
37
+ This also led to our decision to open source this model.
38
+
39
+ We hope you enjoy this release, and feel free to also visit our discord (link below) to leave feedback or questions.
40
+
41
+ ---
42
+
43
+ ## About Project Harmony.AI
44
+ ### Our goal: Elevating Human <-to-> AI Interaction beyond known boundaries.
45
+ Project Harmony.AI emerged from the idea to allow for a seamless living together between AI-driven characters and humans.
46
+ Since it became obvious that a lot of technologies required for achieving this goal are not existing or still very experimental,
47
+ the long term vision of Project Harmony is to establish the full set of technologies which help minimizing biological and
48
+ technological barriers in Human <-to-> AI Interaction.
49
+
50
+ ### Our principles: Fair use and accessibility
51
+ We want to counter today's tendencies of AI development centralization at the hands of big
52
+ corporations. We're pushing towards maximum transparency in our own development efforts, and aim for our software to be
53
+ accessible and usable in the most democratic ways possible.
54
+
55
+ Therefore, for all our current and future software offerings, we'll perform a constant and well-educated evaluation whether
56
+ we can safely open source them in parts or even completely, as long as this appears to be non-harmful towards achieving
57
+ the project's main goal.
58
+
59
+ Harmony Speech Engine is being distributed under the AGPLv3 License, because A lot of the code in the module `harmonyspeech` has been borrowed from [Aphrodite Engine](https://github.com/PygmalionAI/Aphrodite-engine).
60
+ Everyone can use this software as part of their own projects without any restrictions from our side, except from restrictions derived from the nature of the licensing.
61
+
62
+ ### How to reach out to us
63
+
64
+ [Official Website of Project Harmony.AI](https://project-harmony.ai/)
65
+
66
+ #### If you want to collaborate or support this Project financially:
67
+
68
+ Feel free to join our Discord Server and / or subscribe to our Patreon - Even $1 helps us drive this project forward.
69
+
70
+ [Harmony.AI Discord Server](https://discord.gg/f6RQyhNPX8)
71
+
72
+ [Harmony.AI Patreon](https://patreon.com/harmony_ai)
73
+
74
+ #### If you want to use our software commercially or discuss a business or development partnership:
75
+
76
+ Contact us directly via: [[email protected]](mailto:[email protected])