Merge pull request #1173 from KPatr1ck/doc

[README]Update README of demo and cli.
3 years ago · 23ac52bcdd
parent f6ba44f1de 9ec2bc8e2e
commit 23ac52bcdd
4 changed files with 70 additions and 3 deletions
--- a/demos/README.md
+++ b/demos/README.md
@@ -1,10 +1,15 @@
 # Speech Application based on PaddleSpeech

+([简体中文](./README_cn.md)|English)
+
 This directory contains many speech applications for multiple scenarios.

-* audio tagging  - tag audio label in vedio  
-* metaverse  - 2D AR with TTS  
-* speech recogintion - vidio understanding  
+* audio tagging - multi-label tagging of an audio file
+* automatic_video_subtitiles - generate subtitles from a video
+* metaverse - 2D AR with TTS  
+* punctuation_restoration - restore punctuation from raw text
+* speech recognition - recognize the text in an audio file
 * speech translation - end to end speech translation  
 * story talker - book reader based on OCR and TTS  
 * style_fs2 - multi style control for FastSpeech2 model  
+* text_to_speech - convert text into speech 
--- a/demos/README_cn.md
+++ b/demos/README_cn.md
@@ -0,0 +1,15 @@
+# PaddleSpeech Speech Application Demos
+
+(简体中文|[English](./README.md))
+
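+This directory contains speech application demos for different scenarios, built on PaddleSpeech:
+
+* Audio tagging - multi-label audio classification over the 527 AudioSet classes.
+* Automatic video subtitles - recognize the speech in a video as text and post-process the text.
+* Metaverse - 2D augmented reality based on speech synthesis.
+* Punctuation restoration - usually a text post-processing step after speech recognition; adds punctuation to plain text that has none.
+* Speech recognition - recognize the spoken text in an audio clip.
+* Speech translation - recognize the speech in audio in real time and translate it into the target language at the same time.
+* Story talker - a talking storybook based on OCR and speech synthesis.
+* Personalized speech synthesis - personalized speech synthesis based on the FastSpeech2 model.
+* Text to speech - generate speech audio from the given text.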
--- a/paddlespeech/cli/README.md
+++ b/paddlespeech/cli/README.md
@@ -1,5 +1,7 @@
 # PaddleSpeech Command Line

+([简体中文](./README_cn.md)|English)
+
 The simplest way to use PaddleSpeech models.

 ## Help
@@ -28,3 +30,9 @@
 paddlespeech tts --input "你好，欢迎使用百度飞桨深度学习框架！" --output output.wav
 ```
 
+ ## Text Post-processing
+
+- Punctuation Restoration
+  ```bash
+  paddlespeech text --task punc --input 今天的天气真不错啊你下午有空吗我想约你一起去吃饭
+  ```
--- a/paddlespeech/cli/README_cn.md
+++ b/paddlespeech/cli/README_cn.md
@@ -0,0 +1,39 @@
+# PaddleSpeech Command Line Tool
+
+(简体中文|[English](./README.md))
+
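+The `paddlespeech.cli` module is PaddleSpeech's command-line tool. It offers the simplest way to call PaddleSpeech's pretrained models for different speech scenarios, so that model inference takes a single command:
+
+ ## Help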
+ ```bash
+ paddlespeech help
+ ```
+
+ ## Audio Classification
+ ```bash
+ paddlespeech cls --input input.wav
+ ```
+
+ ## Speech Recognition
+ ```bash
+ paddlespeech asr --lang zh --input input_16k.wav
+ ```
+ 
+ ## Speech Translation (English to Chinese)
+ 
+ (Not supported on Windows yet)
+ ```bash
+ paddlespeech st --input input_16k.wav
+ ```
+ 
+ ## Text-to-Speech
+ ```bash
+ paddlespeech tts --input "你好，欢迎使用百度飞桨深度学习框架！" --output output.wav
+ ```
+ 
+ ## Text Post-processing
+
+- Punctuation Restoration
+  ```bash
+  paddlespeech text --task punc --input 今天的天气真不错啊你下午有空吗我想约你一起去吃饭
+  ```
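
The punctuation-restoration command added in this change takes one unpunctuated string per call. A minimal sketch of wrapping it for several inputs follows. The `restore_punc` function and the `DRY_RUN` variable are hypothetical names introduced here and are not part of PaddleSpeech; the only `paddlespeech` flags used are the ones shown in the README hunks above. By default the script only prints the commands it would run.

```bash
#!/usr/bin/env bash
# Sketch: call the documented punctuation-restoration command for several sentences.
# Assumption: `paddlespeech` is on PATH when DRY_RUN=0. Nothing is assumed about
# its output format; the command is simply invoked once per sentence.
set -eu

# DRY_RUN=1 (default) prints each command instead of executing it.
DRY_RUN="${DRY_RUN:-1}"

# restore_punc TEXT: hypothetical helper around `paddlespeech text --task punc`.
restore_punc() {
  if [ "$DRY_RUN" = "1" ]; then
    printf 'paddlespeech text --task punc --input %s\n' "$1"
  else
    paddlespeech text --task punc --input "$1"
  fi
}

# Example sentences without punctuation, split from the README sample.
for sentence in "今天的天气真不错啊" "你下午有空吗"; do
  restore_punc "$sentence"
done
```

Running the script with `DRY_RUN=0` executes the real commands, one invocation per sentence.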