@ -1,4 +1,4 @@
(简体中文|[English](./README.md))
(简体中文|[English](./README.md))
< p align = "center" >
< img src = "./docs/images/PaddleSpeech_logo.png" / >
< / p >
@ -90,7 +90,7 @@ from https://github.com/18F/open-source-guide/blob/18f-pages/pages/making-readme
< table style = "width:100%" >
< thead >
< tr >
< th > < img width = "200" height = "1" > 输入文本 < img width = "200" height = "1" > < / th >
< th > 输入文本< / th >
< th > 合成音频< / th >
< / tr >
< / thead >
@ -109,6 +109,13 @@ from https://github.com/18F/open-source-guide/blob/18f-pages/pages/making-readme
< img align = "center" src = "./docs/images/audio_icon.png" width = "200" style = "max-width: 100%;" > < / a > < br >
< / td >
< / tr >
< tr >
< td > 季姬寂,集鸡,鸡即棘鸡。棘鸡饥叽,季姬及箕稷济鸡。鸡既济,跻姬笈,季姬忌,急咭鸡,鸡急,继圾几,季姬急,即籍箕击鸡,箕疾击几伎,伎即齑,鸡叽集几基,季姬急极屐击鸡,鸡既殛,季姬激,即记《季姬击鸡记》。< / td >
< td align = "center" >
< a href = "https://paddlespeech.bj.bcebos.com/Parakeet/docs/demos/jijiji.wav" rel = "nofollow" >
< img align = "center" src = "./docs/images/audio_icon.png" width = "200" style = "max-width: 100%;" > < / a > < br >
< / td >
< / tr >
< / tbody >
< / table >
@ -167,23 +174,20 @@ paddlespeech st --input input_16k.wav
```shell
paddlespeech tts --input "你好,欢迎使用百度飞桨深度学习框架!" --output output.wav
```
- 语音合成的 web demo 已经集成进了 [Huggingface Spaces ](https://huggingface.co/spaces ). 请参考: [TTS Demo ](https://huggingface.co/spaces/akhaliq/paddlespeech )
更多命令行命令请参考 [demos ](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos )
> Note: 如果需要训练或者微调,请查看[语音识别](./docs/source/asr/quick_start.md), [语音合成 ](./docs/source/tts/quick_start.md )。
## 模型列表
PaddleSpeech 支持很多主流的模型,并提供了预训练模型,详情请见[模型列表](./docs/source/released_model.md)。
< a name = "语音识别模型" > < / a >
PaddleSpeech 的 ** 语音识别** 包含语音识别声学模型、语音识别语言模型和语音翻译, 详情如下:
<!-- -
The current hyperlinks redirect to [Previous Parakeet ](https://github.com/PaddlePaddle/Parakeet/tree/develop/examples ).
-->
PaddleSpeech 的 ** 语音转文本** 包含语音识别声学模型、语音识别语言模型和语音翻译, 详情如下:
< table style = "width:100%" >
< thead >
< tr >
< th > 语音识别模块种类 < / th >
< th > 语音转文本模块类型< / th >
< th > 数据集< / th >
< th > 模型种类< / th >
< th > 链接< / th >
@ -259,8 +263,8 @@ PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声
< tr >
< th > 语音合成模块类型 < / th >
< th > 模型种类 < / th >
< th > < img width = "50" height = "1" > 数据集 < img width = "50" height = "1" > < / th >
< th > < img width = "101" height = "1" > 链接 < img width = "105" height = "1" > < / th >
< th > 数据集 < / th >
< th > 链接 < / th >
< / tr >
< / thead >
< tbody >