Improve readability

pull/1976/head
r 3 years ago
parent f4d234c402
commit be70016edb

@ -2,14 +2,14 @@
([简体中文](./README_cn.md)|English) ([简体中文](./README_cn.md)|English)
The directory containes many speech applications in multi scenarios. This directory contains many speech applications in multiple scenarios.
* audio searching - mass audio similarity retrieval * audio searching - mass audio similarity retrieval
* audio tagging - multi-label tagging of an audio file * audio tagging - multi-label tagging of an audio file
* automatic_video_subtitiles - generate subtitles from a video * automatic_video_subtitles - generate subtitles from a video
* metaverse - 2D AR with TTS * metaverse - 2D AR with TTS
* punctuation_restoration - restore punctuation from raw text * punctuation_restoration - restore punctuation from raw text
* speech recogintion - recognize text of an audio file * speech recognition - recognize text of an audio file
* speech server - Server for Speech Task, e.g. ASR,TTS,CLS * speech server - Server for Speech Task, e.g. ASR,TTS,CLS
* streaming asr server - receive audio stream from websocket, and recognize to transcript. * streaming asr server - receive audio stream from websocket, and recognize to transcript.
* speech translation - end to end speech translation * speech translation - end to end speech translation

@ -44,13 +44,13 @@ More details please see `README.md` under `examples`.
> If using docker please check `--privileged` is set when `docker run`. > If using docker please check `--privileged` is set when `docker run`.
* Fatal error at startup: `a function redirection which is mandatory for this platform-tool combination cannot be set up` * Fatal error at startup: `a function redirection which is mandatory for this platform-tool combination cannot be set up`
``` ```bash
apt-get install libc6-dbg apt-get install libc6-dbg
``` ```
* Install * Install
``` ```bash
pushd tools pushd tools
./setup_valgrind.sh ./setup_valgrind.sh
popd popd
@ -59,4 +59,4 @@ popd
## TODO ## TODO
### Deepspeech2 with linear feature ### Deepspeech2 with linear feature
* DecibelNormalizer: there is a little bit difference between offline and online db norm. The computation of online db norm read feature chunk by chunk, which causes the feature size is different with offline db norm. In normalizer.cc:73, the samples.size() is different, which causes the difference of result. * DecibelNormalizer: there is a small difference between the offline and online db norm. The computation of online db norm reads features chunk by chunk, which causes the feature size to be different with offline db norm. In `normalizer.cc:73`, the `samples.size()` is different, which causes the different result.
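
The chunk-size effect described in the DecibelNormalizer note can be illustrated with a small sketch. This is not the code in `normalizer.cc`. It assumes a simple RMS-based gain normalization toward a target dB level, and every name in it (`rms_db`, `db_normalize`, `offline_norm`, `online_norm`, `target_db`, `chunk_size`) is hypothetical. It only shows why computing the gain over a different number of samples can change the result.

```python
import math


def rms_db(samples):
    """Return the RMS energy of `samples` in decibels."""
    mean_square = sum(s * s for s in samples) / len(samples)
    # Small epsilon avoids log10(0) on silent input.
    return 10.0 * math.log10(mean_square + 1e-20)


def db_normalize(samples, target_db=-20.0):
    """Scale `samples` so that their RMS energy matches `target_db`."""
    gain_db = target_db - rms_db(samples)
    scale = 10.0 ** (gain_db / 20.0)
    return [s * scale for s in samples]


def offline_norm(samples, target_db=-20.0):
    """Offline case: one gain computed over the whole utterance."""
    return db_normalize(samples, target_db)


def online_norm(samples, chunk_size, target_db=-20.0):
    """Online case (assumed): a separate gain computed for each chunk."""
    out = []
    for start in range(0, len(samples), chunk_size):
        chunk = samples[start:start + chunk_size]
        out.extend(db_normalize(chunk, target_db))
    return out


# A quiet half followed by a loud half.
signal = [0.1] * 4 + [0.5] * 4
offline = offline_norm(signal)
online = online_norm(signal, chunk_size=4)

# The two outputs differ because each gain saw a different number of samples.
max_diff = max(abs(a - b) for a, b in zip(offline, online))
print("max difference:", max_diff)
```

In this sketch the offline path derives one gain from all eight samples, while the online path derives a gain per four-sample chunk, so the normalized values do not match even though the input is identical.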

@ -1,27 +1,27 @@
* [python_kaldi_features](https://github.com/ZitengWang/python_kaldi_features) * [python_kaldi_features](https://github.com/ZitengWang/python_kaldi_features)
commit: fc1bd6240c2008412ab64dc25045cd872f5e126c commit: fc1bd6240c2008412ab64dc25045cd872f5e126c
ref: https://zhuanlan.zhihu.com/p/55371926 ref: https://zhuanlan.zhihu.com/p/55371926
licence: MIT license: MIT
* [python-pinyin](https://github.com/mozillazg/python-pinyin.git) * [python-pinyin](https://github.com/mozillazg/python-pinyin.git)
commit: 55e524aa1b7b8eec3d15c5306043c6cdd5938b03 commit: 55e524aa1b7b8eec3d15c5306043c6cdd5938b03
licence: MIT license: MIT
* [zhon](https://github.com/tsroten/zhon) * [zhon](https://github.com/tsroten/zhon)
commit: 09bf543696277f71de502506984661a60d24494c commit: 09bf543696277f71de502506984661a60d24494c
licence: MIT license: MIT
* [pymmseg-cpp](https://github.com/pluskid/pymmseg-cpp.git) * [pymmseg-cpp](https://github.com/pluskid/pymmseg-cpp.git)
commit: b76465045717fbb4f118c4fbdd24ce93bab10a6d commit: b76465045717fbb4f118c4fbdd24ce93bab10a6d
licence: MIT license: MIT
* [chinese_text_normalization](https://github.com/speechio/chinese_text_normalization.git) * [chinese_text_normalization](https://github.com/speechio/chinese_text_normalization.git)
commit: 9e92c7bf2d6b5a7974305406d8e240045beac51c commit: 9e92c7bf2d6b5a7974305406d8e240045beac51c
licence: MIT license: MIT
* [phkit](https://github.com/KuangDD/phkit.git) * [phkit](https://github.com/KuangDD/phkit.git)
commit: b2100293c1e36da531d7f30bd52c9b955a649522 commit: b2100293c1e36da531d7f30bd52c9b955a649522
licence: None license: None
* [nnAudio](https://github.com/KinWaiCheuk/nnAudio.git) * [nnAudio](https://github.com/KinWaiCheuk/nnAudio.git)
licence: MIT license: MIT

@ -5,4 +5,4 @@ score.h and score.cpp is under the LGPL license.
The two files include the header files from KenLM project. The two files include the header files from KenLM project.
For the rest: For the rest:
The default licence of paddlespeech-ctcdecoders is Apache License 2.0. The default license of paddlespeech-ctcdecoders is Apache License 2.0.
