|
|
|
# Reference
|
|
|
|
|
|
|
|
We borrowed a lot of code from these repos to build `model` and `engine`, thanks for these great works and the open-source community!
|
|
|
|
|
|
|
|
* [espnet](https://github.com/espnet/espnet/blob/master/LICENSE)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- python/shell `utils`
|
|
|
|
- kaldi feat preprocessing
|
|
|
|
- data pipe line and `transformer`
|
|
|
|
- some tts models, like `fastspeech2` and GAN-based `vocoder`
|
|
|
|
|
|
|
|
* [wenet](https://github.com/wenet-e2e/wenet/blob/main/LICENSE)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- U2 model
|
|
|
|
- Building TLG based Graph
|
|
|
|
- websocket server & client
|
|
|
|
|
|
|
|
* [kaldi](https://github.com/kaldi-asr/kaldi/blob/master/COPYING)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- shell/perl/python utils.
|
|
|
|
- feature bins.
|
|
|
|
- WFST based decoding for LM integration.
|
|
|
|
|
|
|
|
* [delta](https://github.com/Delta-ML/delta/blob/master/LICENSE)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- `engine` arch
|
|
|
|
|
|
|
|
* [speechbrain](https://github.com/speechbrain/speechbrain/blob/develop/LICENSE)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- ECAPA-TDNN SV model
|
|
|
|
- ASR with CTC and pre-trained wav2vec2 models.
|
|
|
|
|
|
|
|
|
|
|
|
* [chainer](https://github.com/chainer/chainer/blob/master/LICENSE)
|
|
|
|
- MIT License
|
|
|
|
- Updater, Trainer, and some utils.
|
|
|
|
|
|
|
|
* [librosa](https://github.com/librosa/librosa/blob/main/LICENSE.md)
|
|
|
|
- ISC License
|
|
|
|
- Audio feature
|
|
|
|
|
|
|
|
* [ThreadPool](https://github.com/progschj/ThreadPool/blob/master/COPYING)
|
|
|
|
- zlib License
|
|
|
|
- ThreadPool
|
|
|
|
|
|
|
|
* [g2pW](https://github.com/GitYCC/g2pW/blob/master/LICENCE)
|
|
|
|
- Apache-2.0 license
|
|
|
|
|
|
|
|
*[transformers](https://github.com/huggingface/transformers)
|
|
|
|
- Apache-2.0 License
|
|
|
|
- Wav2vec2.0
|