diff --git a/README.md b/README.md
index 328508f1..a3a7adf8 100644
--- a/README.md
+++ b/README.md
@@ -141,12 +141,11 @@ Via the easy-to-use, efficient, flexible and scalable implementation, our vision
 - 🤗 2021.12.14: Our PaddleSpeech [ASR](https://huggingface.co/spaces/KPatrick/PaddleSpeechASR) and [TTS](https://huggingface.co/spaces/akhaliq/paddlespeech) Demos on Hugging Face Spaces are available!
 - 👏🏻 2021.12.10: PaddleSpeech CLI is available for Audio Classification, Automatic Speech Recognition, Speech Translation (English to Chinese) and Text-to-Speech.
-### Communication
-If you are in China, we recommend you to join our WeChat group to contact directly with our team members!
+### Community
+- Scan the QR code below with your Wechat, you can access to official technical exchange group. Look forward to your participation.
- - +
 ## Installation
@@ -157,6 +156,7 @@ pip install paddlepaddle paddlespeech
 ```
 Up to now, **Linux** supports CLI for the all our tasks, **Mac OSX and Windows** only supports PaddleSpeech CLI for Audio Classification, Speech-to-Text and Text-to-Speech. Please see [installation](./docs/source/install.md) for other alternatives.
+
 ## Quick Start
 Developers can have a try of our models with [PaddleSpeech Command Line](./demos/README.md). Change `--input` to test your own audio/text.
@@ -177,7 +177,7 @@ paddlespeech st --input input_16k.wav
 ```
 **Text-to-Speech**
 ```shell
-paddlespeech tts --input "你好,欢迎使用百度飞桨深度学习框架!" --output output.wav
+paddlespeech tts --input "你好,欢迎使用飞桨深度学习框架!" --output output.wav
 ```
 - web demo for Text to Speech is integrated to [Huggingface Spaces](https://huggingface.co/spaces) with [Gradio](https://github.com/gradio-app/gradio). See Demo: [TTS Demo](https://huggingface.co/spaces/akhaliq/paddlespeech)
@@ -388,6 +388,8 @@ The current hyperlinks redirect to [Previous Parakeet](https://github.com/Paddle
 Normally, [Speech SoTA](https://paperswithcode.com/area/speech), [Audio SoTA](https://paperswithcode.com/area/audio) and [Music SoTA](https://paperswithcode.com/area/music) give you an overview of the hot academic topics in the related area. To focus on the tasks in PaddleSpeech, you will find the following guidelines are helpful to grasp the core ideas.
 - [Installation](./docs/source/install.md)
+- [Quick Start](#quickstart)
+- [Some Demos](./demos/README.md)
 - Tutorials
   - [Automatic Speech Recognition](./docs/source/asr/quick_start.md)
     - [Introduction](./docs/source/asr/models_introduction.md)
@@ -399,9 +401,12 @@ Normally, [Speech SoTA](https://paperswithcode.com/area/speech), [Audio SoTA](ht
     - [Advanced Usage](./docs/source/tts/advanced_usage.md)
     - [Chinese Rule Based Text Frontend](./docs/source/tts/zh_text_frontend.md)
     - [Test Audio Samples](https://paddlespeech.readthedocs.io/en/latest/tts/demo.html)
-  - Audio Classification
-  - Speech Translation
+  - [Audio Classification](./demos/audio_tagging/README.md)
+  - [Speech Translation](./demos/speech_translation/README.md)
 - [Released Models](./docs/source/released_model.md)
+- [Community](#Community)
+- [Welcome to contribute](#contribution)
+- [License](#License)
 The Text-to-Speech module is originally called [Parakeet](https://github.com/PaddlePaddle/Parakeet), and now merged with this repository. If you are interested in academic research about this task, please see [TTS research overview](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/docs/source/tts#overview). Also, [this document](https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/source/tts/models_introduction.md) is a good guideline for the pipeline components.
@@ -416,7 +421,7 @@ howpublished = {\url{https://github.com/PaddlePaddle/PaddleSpeech}},
 year={2021}
 }
 ```
-
+
 ## Contribute to PaddleSpeech
 You are warmly welcome to submit questions in [discussions](https://github.com/PaddlePaddle/PaddleSpeech/discussions) and bug reports in [issues](https://github.com/PaddlePaddle/PaddleSpeech/issues)! Also, we highly appreciate if you are willing to contribute to this project!
@@ -467,6 +472,7 @@ You are warmly welcome to submit questions in [discussions](https://github.com/P
 Besides, PaddleSpeech depends on a lot of open source repositories.
 See [references](./docs/source/reference.md) for more information.
+
 ## License
 PaddleSpeech is provided under the [Apache-2.0 License](./LICENSE).
diff --git a/README_cn.md b/README_cn.md
index 551c9395..6dc791e4 100644
--- a/README_cn.md
+++ b/README_cn.md
@@ -6,7 +6,7 @@

 快速开始
- | 教程
+ | 教程文档
 | 模型列表
@@ -30,7 +30,7 @@ from https://github.com/18F/open-source-guide/blob/18f-pages/pages/making-readme
 4.What is the goal of this project?
 -->
-**PaddleSpeech** 是基于飞桨 [PaddlePaddle](https://github.com/PaddlePaddle/Paddle) 深度学习开源框架平台上的一个开源模型库,用于语音和音频中的各种关键任务的开发,包含大量前沿和有影响力的模型,一些典型的应用示例如下:
+**PaddleSpeech** 是基于飞桨 [PaddlePaddle](https://github.com/PaddlePaddle/Paddle) 的语音方向的开源模型库,用于语音和音频中的各种关键任务的开发,包含大量基于深度学习前沿和有影响力的模型,一些典型的应用示例如下:
 ##### 语音识别
@@ -135,12 +135,11 @@ from https://github.com/18F/open-source-guide/blob/18f-pages/pages/making-readme
 - 🤗 2021.12.14: 我们在 Hugging Face Spaces 上的 [ASR](https://huggingface.co/spaces/KPatrick/PaddleSpeechASR) 以及 [TTS](https://huggingface.co/spaces/akhaliq/paddlespeech) Demos 上线啦!
 - 👏🏻 2021.12.10: PaddleSpeech CLI 上线!覆盖了声音分类、语音识别、语音翻译(英译中)以及语音合成。
-### 交流
-欢迎加入以下微信群,直接和 PaddleSpeech 团队成员进行交流!
+### 技术交流群
+微信扫描二维码加入官方交流群,获得更高效的问题答疑,与各行各业开发者充分交流,期待您的加入。
- - +
 ## 安装
@@ -177,7 +176,8 @@ paddlespeech tts --input "你好,欢迎使用百度飞桨深度学习框架!
 PaddleSpeech 支持很多主流的模型,并提供了预训练模型,详情请见[模型列表](./docs/source/released_model.md)。
-PaddleSpeech 的 **语音转文本** 包含语音识别声学模型、语音识别语言模型和语音翻译, 详情如下:
+
+PaddleSpeech 的 **语音识别** 包含语音识别声学模型、语音识别语言模型和语音翻译, 详情如下: