Update README.md
Browse files
README.md
CHANGED
@@ -9,9 +9,23 @@ license: apache-2.0
|
|
9 |
|
10 |
BreezyVoice is a voice-cloning text-to-speech system specifically adapted for Taiwanese Mandarin, highlighting phonetic control abilities via auxiliary 注音 (bopomofo) inputs. BreezyVoice is partially derived from [CosyVoice](https://github.com/FunAudioLLM/CosyVoice)
|
11 |
|
12 |
-
<img src="https://raw.githubusercontent.com/Splend1d/BreezyVoice/main/flowchart.png" alt="Flowchart" width="750"/>
|
13 |
|
|
|
14 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
## How to Run
|
16 |
|
17 |
**Running from the GitHub instruction automatically downloads the model for you**
|
@@ -19,7 +33,7 @@ BreezyVoice is a voice-cloning text-to-speech system specifically adapted for Ta
|
|
19 |
You can also run the model from a specified local path by cloning the model
|
20 |
```
|
21 |
git lfs install
|
22 |
-
git clone https://huggingface.co/MediaTek-Research/BreezyVoice
|
23 |
```
|
24 |
then, you can use the model as specified in the run_inference.py script, providing the local model path using the model_path parameter.
|
25 |
|
|
|
9 |
|
10 |
BreezyVoice is a voice-cloning text-to-speech system specifically adapted for Taiwanese Mandarin, highlighting phonetic control abilities via auxiliary 注音 (bopomofo) inputs. BreezyVoice is partially derived from [CosyVoice](https://github.com/FunAudioLLM/CosyVoice)
|
11 |
|
12 |
+
<img src="https://raw.githubusercontent.com/Splend1d/BreezyVoice/main/images/flowchart.png" alt="Flowchart" width="750"/>
|
13 |
|
14 |
+
BreezyVoice outperforms competing commercial services in terms of naturalness.
|
15 |
|
16 |
+
|
17 |
+
|
18 |
+
<img src="https://raw.githubusercontent.com/Splend1d/BreezyVoice/main/images/comparisons.png" alt="comparisons" width="350"/>
|
19 |
+
|
20 |
+
BreezyVoice excels at code-switching scenarios.
|
21 |
+
|
22 |
+
| Code-Switching Term Category | **BreezyVoice** | Z | Y | U | M |
|
23 |
+
|-------------|--------------|---|---|---|---|
|
24 |
+
| **General Words** | **8** | 5 | **8** | **8** | 7 |
|
25 |
+
| **Entities**| **9** | 6 | 4 | 7 | 4 |
|
26 |
+
| **Abbreviations** | **9** | 8 | 6 | 6 | 7 |
|
27 |
+
| **Toponyms**| 3 | 3 | **7** | 3 | 4 |
|
28 |
+
| **Full Sentences**| 7 | 7 | **8** | 5 | 3 |
|
29 |
## How to Run
|
30 |
|
31 |
**Running from the GitHub instruction automatically downloads the model for you**
|
|
|
33 |
You can also run the model from a specified local path by cloning the model
|
34 |
```
|
35 |
git lfs install
|
36 |
+
git clone https://huggingface.co/MediaTek-Research/BreezyVoice
|
37 |
```
|
38 |
then, you can use the model as specified in the run_inference.py script, providing the local model path using the model_path parameter.
|
39 |
|