huangyuxin
719d23c07a
revise the install.md, setup.py and makefile, rm the setup.sh
3 years ago
huangyuxin
989aec4413
optimize the setup.py and setup.sh
3 years ago
TianYuan
05a6f7767b
Merge pull request #1052 from yt605155624/fix_docs
...
[TTS]update tts_tutorial
3 years ago
TianYuan
c35457b80e
update tts_tutorial
3 years ago
Hui Zhang
396db4a56a
update librispeech asr1-2 result; add warpctc source link in ctc topic
3 years ago
Hui Zhang
2bbfdbae91
Merge pull request #1015 from yt605155624/fs2_conformer
...
[TTS]fastspeech2 conformer
3 years ago
TianYuan
469329221b
refactor encoder, rm old code
3 years ago
Hui Zhang
7086579ded
Merge pull request #1038 from Jackwaterveg/release_model
...
[released model]updata the ds2 released model
3 years ago
huangyuxin
137238448d
updata the ds2 model
3 years ago
Hui Zhang
712de751cb
Merge pull request #1036 from zh794390558/nproc
...
[asr] nproc to ngpu
3 years ago
TianYuan
c52d7f2bfc
Update reference.md
3 years ago
Hui Zhang
fe83adfbcb
nproc to ngpu
3 years ago
TianYuan
f451d880ff
Update quick_start.md
3 years ago
TianYuan
d0f0a8e78d
update ipynb
3 years ago
TianYuan
f3fbce005e
update ipynb, add eval loss
3 years ago
TianYuan
c9f41bf1b3
Update reference.md
3 years ago
TianYuan
787e823782
Update quick_start.md
3 years ago
TianYuan
14397db5cc
Update quick_start.md
3 years ago
Jackwaterveg
09931d2ccc
Merge pull request #1019 from zh794390558/feat
...
[bugfix] Kaldi Feature using dither in train
3 years ago
Hui Zhang
7a25ee26d9
fix release model egs name
3 years ago
Hui Zhang
7ec0ed4aaf
kaldi feat dither when train
3 years ago
TianYuan
2d808a3c64
fix urls
3 years ago
TianYuan
4370c5cfa6
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
3 years ago
TianYuan
87c1f5bb3a
Merge pull request #998 from Jackwaterveg/develop
...
[Doc] update the released model info
3 years ago
TianYuan
3d5e078c91
add conformer
3 years ago
TianYuan
4c60d3b31f
Update index.rst
3 years ago
TianYuan
5a8938fc28
Update introduction.md
3 years ago
TianYuan
afe4a20b8c
Update released_model.md
3 years ago
Hui Zhang
b8e7dff82a
Merge pull request #997 from Jackwaterveg/conda_install
...
[setup.sh] add conda install pipeline
3 years ago
Zeyu Chen
4a28751df0
Formalize the terms in README
3 years ago
huangyuxin
0556e9d654
update the released model info
3 years ago
huangyuxin
2f92a5d9a7
add conda init, use gcc 8.4.0
3 years ago
TianYuan
8b86bc9f78
Update zh_text_frontend.md
3 years ago
huangyuxin
e84690f6f0
add conda install pipline
3 years ago
Jackwaterveg
dc7daa2a61
Merge pull request #993 from zh794390558/ctc_loss
...
[doc] add ctc loss topic
3 years ago
Hui Zhang
c01322e488
md to jupyter
3 years ago
Hui Zhang
e2e75fa66b
support for latex by texify
3 years ago
TianYuan
9106a90055
update text_frontend_struct.png
3 years ago
Hui Zhang
37e6f9d745
add ctc loss topic
3 years ago
TianYuan
30d09b411d
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
3 years ago
Mingxue-Xu
43fab681d0
Update released_model.md
3 years ago
TianYuan
0bc9450c51
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
3 years ago
TianYuan
0fcc5005a2
add tts tutorial
3 years ago
Mingxue-Xu
43d30109da
Delete outdated QR code.
3 years ago
Hui Zhang
67cb3004a3
fix doc of install
3 years ago
Jackwaterveg
b0b4843475
Update introduction.md
3 years ago
Jackwaterveg
36e09f90b8
Add the Speech-To-Text in introduction.md
3 years ago
TianYuan
cefe327d0b
Merge pull request #969 from Mingxue-Xu/develop
...
Update README.md
3 years ago
Mingxue-Xu
36fff54a53
Delete unnecessary images.
3 years ago
huangyuxin
61ad2c87a7
update the ds2 online conf
3 years ago
TianYuan
4d6f1646d4
Merge remote-tracking branch 'upstream/develop' into develop
3 years ago
Mingxue-Xu
93b12ec797
rename logo
3 years ago
Mingxue-Xu
f7a99bb8bd
Please enter the commit message for your changes. Lines starting
...
g.png
3 years ago
Mingxue-Xu
718c7b3187
Add ASR samples.
3 years ago
Hui Zhang
7ceef6c3f5
format code
3 years ago
Mingxue-Xu
0bc0d267c9
Update README.md
...
Update README.md
Update README.md
Add files via upload
Update README.md
Update README.md
Update install.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Update README.md
Corrected the mistakes mentioned by @zh794390558
Add files via upload
Update README.md
Update README.md
Delete 002.wav
Delete 001.wav
Delete 012.wav
Delete 006.wav
Update README.md
Update README.md
Add files via upload
Update README.md
Update README.md
Add files via upload
Update README.md
Add files via upload
Update README.md
Update README.md
Update README according to PaddleOCR
Update README according to PaddleOCR
Correct some links.
3 years ago
Jackwaterveg
b4d0e72546
Create dependencies.md
...
Add the dependencies.md
3 years ago
Hui Zhang
3f611c75a6
Merge pull request #962 from PaddlePaddle/doc
...
[doc] more reference repo and licence info
3 years ago
KP
98c1131058
Add librosa reference.
3 years ago
Hui Zhang
77bab04403
add tutorial dir
3 years ago
Hui Zhang
b49fbe65d7
more detals of reference
3 years ago
Hui Zhang
58b24aa49f
Merge pull request #960 from PaddlePaddle/paddlespeech
...
[paddlespeech] merge deepspeech, parakeet and text_processing into paddlespeech
3 years ago
huangyuxin
6e8b3d0ffd
fix the quick start
3 years ago
Hui Zhang
e369022f71
remve .travis; fix install doc; more kws in setup.py
3 years ago
Hui Zhang
e2bcaee4f1
merge deepspeech, parakeet and text_processing into paddlespeech
3 years ago
TianYuan
9ccef7fa04
add paddle tts vs espnet tts demos
3 years ago
TianYuan
0c3c218305
fix demos
3 years ago
TianYuan
c13b75c2b0
fix docs install
3 years ago
TianYuan
670a68ad95
fix textfrontend readme, fix imgs link
3 years ago
TianYuan
41526ca1b8
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
3 years ago
TianYuan
3f9e30c9b3
refactor docs
3 years ago
TianYuan
304d71747a
Merge pull request #939 from Jackwaterveg/doc
...
fix the doc
3 years ago
Mingxue-Xu
1cb1221389
[README(TTS+ASR)] Update the README according to compliant templates and specifications. ( #940 )
...
* Update README.md
* Update README.md
* Update README.md
* Add files via upload
* Update README.md
* Update README.md
* Update install.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
3 years ago
huangyuxin
cef36521f9
fix the doc
3 years ago
Hui Zhang
f55267f2b3
fix img link; rsl format;
3 years ago
Hui Zhang
58fe852170
setup.py deps from requirements.txt
3 years ago
Hui Zhang
6e9a230f04
update readme
3 years ago
Hui Zhang
b079577e08
merge parakeet repo into deepspeech
3 years ago
huangyuxin
b453b425af
add readthedoc
3 years ago
Hui Zhang
b7b1bda34f
test refactor collator
3 years ago
Jackwaterveg
d75cf89630
Update released_model.md
3 years ago
huangyuxin
bd72afb02b
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into doc
3 years ago
Jackwaterveg
c55b3c7bff
Update released_model.md
3 years ago
Jackwaterveg
c30200a438
Update released_model.md
3 years ago
Hui Zhang
9a95ceb0b4
add Acknowledgements
3 years ago
Hui Zhang
f15e1ff732
fix doc link
3 years ago
Jackwaterveg
d0c9dc9342
Update deepspeech_architecture.md
3 years ago
Jackwaterveg
0b2c794d88
Emphasis the setup stage in install.sh
3 years ago
Jackwaterveg
6e5d152503
Update deepspeech_architecture.md
3 years ago
Jackwaterveg
c8d62807b3
Update deepspeech_architecture.md
3 years ago
huangyuxin
84020a0471
fix some mistacks in doc
3 years ago
Hui Zhang
256e9c1b9c
more doc for egs
3 years ago
Hui Zhang
a12b16787d
speech text process docs ( #607 )
...
* add more speech doc
* fix doc path and mergify
* format doc
4 years ago
Hui Zhang
7bbe1d66d2
more speech docs ( #606 )
...
* add speech related docs: tts, text front end, ngram lm, corrector
* format doc
* mergify with doc
4 years ago
Hui Zhang
c6ae9857f2
update doc ( #603 )
...
* fix doc format
* format doc
4 years ago
Hui Zhang
71e046b0ba
E2E/Streaming Transformer/Conformer ASR ( #578 )
...
* add cmvn and label smoothing loss layer
* add layer for transformer
* add glu and conformer conv
* add torch compatiable hack, mask funcs
* not hack size since it exists
* add test; attention
* add attention, common utils, hack paddle
* add audio utils
* conformer batch padding mask bug fix #223
* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2
* fix ci
* fix ci
* add encoder
* refactor egs
* add decoder
* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils
* refactor docs
* add fix
* fix readme
* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs
* fix docstring
* refactor data feed order
* add u2 model
* refactor cmvn, test
* add utils
* add u2 config
* fix bugs
* fix bugs
* fix autograd maybe has problem when using inplace operation
* refactor data, build vocab; add format data
* fix text featurizer
* refactor build vocab
* add fbank, refactor feature of speech
* refactor audio feat
* refactor data preprare
* refactor data
* model init from config
* add u2 bins
* flake8
* can train
* fix bugs, add coverage, add scripts
* test can run
* fix data
* speed perturb with sox
* add spec aug
* fix for train
* fix train logitc
* fix logger
* log valid loss, time dataset process
* using np for speed perturb, remove some debug log of grad clip
* fix logger
* fix build vocab
* fix logger name
* using module logger as default
* fix
* fix install
* reorder imports
* fix board logger
* fix logger
* kaldi fbank and mfcc
* fix cmvn and print prarams
* fix add_eos_sos and cmvn
* fix cmvn compute
* fix logger and cmvn
* fix subsampling, label smoothing loss, remove useless
* add notebook test
* fix log
* fix tb logger
* multi gpu valid
* fix log
* fix log
* fix config
* fix compute cmvn, need paddle 2.1
* add cmvn notebook
* fix layer tools
* fix compute cmvn
* add rtf
* fix decoding
* fix layer tools
* fix log, add avg script
* more avg and test info
* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;
* add vimrc
* refactor tiny script, add transformer and stream conf
* spm demo; librisppech scripts and confs
* fix log
* add librispeech scripts
* refactor data pipe; fix conf; fix u2 default params
* fix bugs
* refactor aishell scripts
* fix test
* fix cmvn
* fix s0 scripts
* fix ds2 scripts and bugs
* fix dev & test dataset filter
* fix dataset filter
* filter dev
* fix ckpt path
* filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test
* add comment
* add syllable doc
* fix ds2 configs
* add doc
* add pypinyin tools
* fix decoder using blank_id=0
* mmseg with pybind11
* format code
4 years ago
Hui Zhang
a9d0117cfe
fix install ( #580 )
4 years ago
Hui Zhang
d4e84f9b9d
fix doc link and enhance install ( #570 )
...
* fix doc link
* fix install
* fix install doc
* fix typo
* fix lm doc
4 years ago
Hui Zhang
19e0f2ac46
Fix doc format ( #546 )
4 years ago
Hui Zhang
57ed5cd2e0
Fix Doc ( #544 )
4 years ago
Hui Zhang
d7e753546a
Support paddle 2.x ( #538 )
...
* 2.x model
* model test pass
* fix data
* fix soundfile with flac support
* one thread dataloader test pass
* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist
* add venv; test under venv
* fix unittest; train and valid
* add train and config
* add config and train script
* fix ctc cuda memcopy error
* fix imports
* fix train valid log
* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code
* test process can run
* test with decoding
* test and infer with decoding
* fix infer
* fix ctc loss
lr schedule
sortagrad
logger
* aishell egs
* refactor train
add aishell egs
* fix dataset batch shuffle and add batch sampler log
print model parameter
* fix model and ctc
* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook
* ctc loss
remove run prefix
using ord value as text id
* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer
* fix tester
* add lr_deacy
refactor code
* fix tools
* fix ci
add tune
fix gru model bugs
add dataset and model test
* fix decoding
* refactor repo
fix decoding
* fix musan and rir dataset
* refactor io, loss, conv, rnn, gradclip, model, utils
* fix ci and import
* refactor model
add export jit model
* add deploy bin and test it
* rm uselss egs
* add layer tools
* refactor socket server
new model from pretrain
* remve useless
* fix instability loss and grad nan or inf for librispeech training
* fix sampler
* fix libri train.sh
* fix doc
* add license on cpp
* fix doc
* fix libri script
* fix install
* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
4 years ago
lfchener
d74f4ff3f5
update deepspeech to fluid api
5 years ago
Yibing Liu
27d6cf90d1
add figure for tuning & enrich the tuning section in doc
7 years ago
Xinghai Sun
e8dce3a982
Add README doc section of multi-gpu acceleration.
7 years ago