TianYuan
994da06287
Update README.md
3 years ago
Hui Zhang
67cb3004a3
fix doc of install
3 years ago
Mingxue-Xu
a3fd33f0d9
correct ASR
3 years ago
Mingxue-Xu
98a1befbe7
Merge branch 'develop' of https://github.com/Mingxue-Xu/DeepSpeech into develop
3 years ago
Mingxue-Xu
36fff54a53
Delete unnecessary images.
3 years ago
TianYuan
97cccf7275
Update README.md
3 years ago
TianYuan
297e18dfd2
Update README.md
3 years ago
TianYuan
4d6f1646d4
Merge remote-tracking branch 'upstream/develop' into develop
3 years ago
Mingxue-Xu
15b6a8ca7c
rename logo
3 years ago
Mingxue-Xu
d166c85e1b
correct hyperlinks
3 years ago
Mingxue-Xu
d75362c079
correct hyperlinks
3 years ago
Mingxue-Xu
df185c37c1
Update README.md
3 years ago
Mingxue-Xu
59197ba1bf
correct hyperlinks
3 years ago
Mingxue-Xu
2654344843
correct hyperlinks
3 years ago
Mingxue-Xu
718c7b3187
Add ASR samples.
3 years ago
Mingxue-Xu
6c6b1d16db
Remove unavailable links.
3 years ago
Mingxue-Xu
0bc0d267c9
Update README.md
Update README.md
Update README.md
Add files via upload
Update README.md
Update README.md
Update install.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Corrected the mistakes mentioned by @zh794390558
Add files via upload
Update README.md
Update README.md
Delete 002.wav
Delete 001.wav
Delete 012.wav
Delete 006.wav
Update README.md
Update README.md
Add files via upload
Update README.md
Update README.md
Add files via upload
Update README.md
Add files via upload
Update README.md
Update README.md
Update README according to PaddleOCR
Update README according to PaddleOCR
Correct some links.
3 years ago
Hui Zhang
b49fbe65d7
more details of reference
3 years ago
TianYuan
15beee6e0c
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
3 years ago
TianYuan
36d60a717e
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into add_mbmelgan
3 years ago
TianYuan
88668513b1
fix mv writer to visualdl in train
3 years ago
Mingxue-Xu
b8818991c0
Update README.md
Correct the mistakes mentioned by @zh794390558.
3 years ago
TianYuan
670a68ad95
fix textfrontend readme, fix imgs link
3 years ago
Mingxue-Xu
1cb1221389
[README(TTS+ASR)] Update the README according to compliant templates and specifications. ( #940 )
* Update README.md
* Update README.md
* Update README.md
* Add files via upload
* Update README.md
* Update README.md
* Update install.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
3 years ago
Hui Zhang
6e9a230f04
update readme
3 years ago
Hui Zhang
b079577e08
merge parakeet repo into deepspeech
3 years ago
huangyuxin
b453b425af
add readthedoc
3 years ago
Hui Zhang
366e34c925
update paddle version to 2.1.2
3 years ago
Hui Zhang
f15e1ff732
fix doc link
3 years ago
Hui Zhang
bad7f91857
fix readme
3 years ago
Hui Zhang
c9578cf940
remove useless link
3 years ago
Hui Zhang
f54dc983b6
using bw rnn in ds2
3 years ago
Hui Zhang
c81743403a
fix
3 years ago
Hui Zhang
c09b0e8940
fix specaug
3 years ago
Hui Zhang
4c0ee8d354
fix conf and readme
3 years ago
Hui Zhang
b3bc451328
remove sequence_mask and change ds2 export audio shape to [B,T,D] ( #639 )
* remove sequence_mask
* format
* fix ds2 export audio shape from B,D,T to B,T,D
4 years ago
Hui Zhang
2ff726a66e
fix doc link ( #627 )
4 years ago
Hui Zhang
37c5324138
fix result; add feature list
4 years ago
Hui Zhang
295f8bdad5
train ds2 model ( #622 )
* default cmvn compute config; more log of grad clip; diff ds2 cmvn compute and conf; ds2 lr step by epoch;
* fix ds2 config
* fix install and egs link
* sox speed perturb shape (T, C), float64, process using int32
* fix libri ds2 scripts; add ngram and spm doc
* aishell ds2 cer7.86
* fix ds2 result
4 years ago
Hui Zhang
538bf271eb
chinese char/word ngram lm ( #613 )
* add ngram lm egs
* add zhon repo
* install kenlm, zhon
* format
* add chinese_text_normalization repo
* add ngram lm egs
4 years ago
Hui Zhang
db022fac6e
fix doc ( #611 )
4 years ago
Hui Zhang
90512c3964
Fix readme install link ( #610 )
4 years ago
Hui Zhang
c6ae9857f2
update doc ( #603 )
* fix doc format
* format doc
4 years ago
Hui Zhang
71e046b0ba
E2E/Streaming Transformer/Conformer ASR ( #578 )
* add cmvn and label smoothing loss layer
* add layer for transformer
* add glu and conformer conv
* add torch compatible hack, mask funcs
* not hack size since it exists
* add test; attention
* add attention, common utils, hack paddle
* add audio utils
* conformer batch padding mask bug fix #223
* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2
* fix ci
* fix ci
* add encoder
* refactor egs
* add decoder
* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils
* refactor docs
* add fix
* fix readme
* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs
* fix docstring
* refactor data feed order
* add u2 model
* refactor cmvn, test
* add utils
* add u2 config
* fix bugs
* fix bugs
* fix autograd maybe has problem when using inplace operation
* refactor data, build vocab; add format data
* fix text featurizer
* refactor build vocab
* add fbank, refactor feature of speech
* refactor audio feat
* refactor data prepare
* refactor data
* model init from config
* add u2 bins
* flake8
* can train
* fix bugs, add coverage, add scripts
* test can run
* fix data
* speed perturb with sox
* add spec aug
* fix for train
* fix train logic
* fix logger
* log valid loss, time dataset process
* using np for speed perturb, remove some debug log of grad clip
* fix logger
* fix build vocab
* fix logger name
* using module logger as default
* fix
* fix install
* reorder imports
* fix board logger
* fix logger
* kaldi fbank and mfcc
* fix cmvn and print params
* fix add_eos_sos and cmvn
* fix cmvn compute
* fix logger and cmvn
* fix subsampling, label smoothing loss, remove useless
* add notebook test
* fix log
* fix tb logger
* multi gpu valid
* fix log
* fix log
* fix config
* fix compute cmvn, need paddle 2.1
* add cmvn notebook
* fix layer tools
* fix compute cmvn
* add rtf
* fix decoding
* fix layer tools
* fix log, add avg script
* more avg and test info
* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir; fix setup.sh
* add vimrc
* refactor tiny script, add transformer and stream conf
* spm demo; librispeech scripts and confs
* fix log
* add librispeech scripts
* refactor data pipe; fix conf; fix u2 default params
* fix bugs
* refactor aishell scripts
* fix test
* fix cmvn
* fix s0 scripts
* fix ds2 scripts and bugs
* fix dev & test dataset filter
* fix dataset filter
* filter dev
* fix ckpt path
* filter test set, since librispeech would otherwise cause OOM; all test wer will be worse, though, because of the train/test mismatch
* add comment
* add syllable doc
* fix ds2 configs
* add doc
* add pypinyin tools
* fix decoder using blank_id=0
* mmseg with pybind11
* format code
4 years ago
Hui Zhang
9ac99f7cc6
discussion for questions, issue only for bug report ( #573 )
4 years ago
Hui Zhang
d4e84f9b9d
fix doc link and enhance install ( #570 )
* fix doc link
* fix install
* fix install doc
* fix typo
* fix lm doc
4 years ago
Zeyu Chen
aaafe1417d
Update README.md
4 years ago
Hui Zhang
19e0f2ac46
Fix doc format ( #546 )
4 years ago
Hui Zhang
57ed5cd2e0
Fix Doc ( #544 )
4 years ago
Hui Zhang
d7e753546a
Support paddle 2.x ( #538 )
* 2.x model
* model test pass
* fix data
* fix soundfile with flac support
* one thread dataloader test pass
* export feature size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist
* add venv; test under venv
* fix unittest; train and valid
* add train and config
* add config and train script
* fix ctc cuda memcopy error
* fix imports
* fix train valid log
* fix dataset batch shuffle shift start from 1
fix rank_zero_only decorator error
close tensorboard when train over
add decoding config and code
* test process can run
* test with decoding
* test and infer with decoding
* fix infer
* fix ctc loss
lr schedule
sortagrad
logger
* aishell egs
* refactor train
add aishell egs
* fix dataset batch shuffle and add batch sampler log
print model parameter
* fix model and ctc
* sequence_mask makes all inputs zero, which causes the grad to be zero; this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook
* ctc loss
remove run prefix
using ord value as text id
* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer
* fix tester
* add lr_decay
refactor code
* fix tools
* fix ci
add tune
fix gru model bugs
add dataset and model test
* fix decoding
* refactor repo
fix decoding
* fix musan and rir dataset
* refactor io, loss, conv, rnn, gradclip, model, utils
* fix ci and import
* refactor model
add export jit model
* add deploy bin and test it
* rm useless egs
* add layer tools
* refactor socket server
new model from pretrain
* remove useless
* fix instability loss and grad nan or inf for librispeech training
* fix sampler
* fix libri train.sh
* fix doc
* add license on cpp
* fix doc
* fix libri script
* fix install
* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
4 years ago