You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
13 lines
773 B
13 lines
773 B
3 years ago
|
# Decoders
|
||
|
|
||
|
## Reference
|
||
|
### CTC Prefix Beam Search
|
||
|
* [Sequence Modeling With CTC](https://distill.pub/2017/ctc/)
|
||
|
* [First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs](https://arxiv.org/pdf/1408.2873.pdf)
|
||
|
|
||
|
### CTC Prefix Score & Join CTC/ATT One-passing Decoding
|
||
|
* [Hybrid CTC/Attention Architecture for End-to-End Speech Recognition](http://www.ifp.illinois.edu/speech/speech_web_lg/slides/2019/watanabe_hybridCTCAttention_2017.pdf)
|
||
|
* [Vectorized Beam Search for CTC-Attention-based Speech Recognition](https://www.isca-speech.org/archive/pdfs/interspeech_2019/seki19b_interspeech.pdf)
|
||
|
|
||
|
### Streaming Join CTC/ATT Beam Search
|
||
|
* [STREAMING TRANSFORMER ASR WITH BLOCKWISE SYNCHRONOUS BEAM SEARCH](https://arxiv.org/abs/2006.14941)
|