PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
Jackwaterveg	36e09f90b8	Add the Speech-To-Text in introduction.md	3 years ago
TianYuan	cefe327d0b	Merge pull request #969 from Mingxue-Xu/develop Update README.md	3 years ago
Mingxue-Xu	36fff54a53	Delete unnecessary images.	3 years ago
huangyuxin	61ad2c87a7	update the ds2 online conf	3 years ago
TianYuan	4d6f1646d4	Merge remote-tracking branch 'upstream/develop' into develop	3 years ago
Mingxue-Xu	93b12ec797	rename logo	3 years ago
Mingxue-Xu	f7a99bb8bd	Please enter the commit message for your changes. Lines starting g.png	3 years ago
Mingxue-Xu	718c7b3187	Add ASR samples.	3 years ago
Hui Zhang	7ceef6c3f5	format code	3 years ago
Mingxue-Xu	0bc0d267c9	Update README.md Update README.md Update README.md Add files via upload Update README.md Update README.md Update install.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Corrected the mistakes mentioned by @zh794390558 Add files via upload Update README.md Update README.md Delete 002.wav Delete 001.wav Delete 012.wav Delete 006.wav Update README.md Update README.md Add files via upload Update README.md Update README.md Add files via upload Update README.md Add files via upload Update README.md Update README.md Update README according to PaddleOCR Update README according to PaddleOCR Correct some links.	3 years ago
Jackwaterveg	b4d0e72546	Create dependencies.md Add the dependencies.md	3 years ago
Hui Zhang	3f611c75a6	Merge pull request #962 from PaddlePaddle/doc [doc] more reference repo and licence info	3 years ago
KP	98c1131058	Add librosa reference.	3 years ago
Hui Zhang	77bab04403	add tutorial dir	3 years ago
Hui Zhang	b49fbe65d7	more detals of reference	3 years ago
Hui Zhang	58b24aa49f	Merge pull request #960 from PaddlePaddle/paddlespeech [paddlespeech] merge deepspeech, parakeet and text_processing into paddlespeech	3 years ago
huangyuxin	6e8b3d0ffd	fix the quick start	3 years ago
Hui Zhang	e369022f71	remve .travis; fix install doc; more kws in setup.py	3 years ago
Hui Zhang	e2bcaee4f1	merge deepspeech, parakeet and text_processing into paddlespeech	3 years ago
TianYuan	9ccef7fa04	add paddle tts vs espnet tts demos	3 years ago
TianYuan	0c3c218305	fix demos	3 years ago
TianYuan	c13b75c2b0	fix docs install	3 years ago
TianYuan	670a68ad95	fix textfrontend readme, fix imgs link	3 years ago
TianYuan	41526ca1b8	Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs	3 years ago
TianYuan	3f9e30c9b3	refactor docs	3 years ago
TianYuan	304d71747a	Merge pull request #939 from Jackwaterveg/doc fix the doc	3 years ago
Mingxue-Xu	1cb1221389	[README(TTS+ASR)] Update the README according to compliant templates and specifications. (#940 ) * Update README.md * Update README.md * Update README.md * Add files via upload * Update README.md * Update README.md * Update install.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md	3 years ago
huangyuxin	cef36521f9	fix the doc	3 years ago
Hui Zhang	f55267f2b3	fix img link; rsl format;	3 years ago
Hui Zhang	58fe852170	setup.py deps from requirements.txt	3 years ago
Hui Zhang	6e9a230f04	update readme	3 years ago
Hui Zhang	b079577e08	merge parakeet repo into deepspeech	3 years ago
huangyuxin	b453b425af	add readthedoc	3 years ago
Hui Zhang	b7b1bda34f	test refactor collator	3 years ago
Jackwaterveg	d75cf89630	Update released_model.md	3 years ago
huangyuxin	bd72afb02b	Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into doc	3 years ago
Jackwaterveg	c55b3c7bff	Update released_model.md	3 years ago
Jackwaterveg	c30200a438	Update released_model.md	3 years ago
Hui Zhang	9a95ceb0b4	add Acknowledgements	3 years ago
Hui Zhang	f15e1ff732	fix doc link	3 years ago
Jackwaterveg	d0c9dc9342	Update deepspeech_architecture.md	3 years ago
Jackwaterveg	0b2c794d88	Emphasis the setup stage in install.sh	3 years ago
Jackwaterveg	6e5d152503	Update deepspeech_architecture.md	3 years ago
Jackwaterveg	c8d62807b3	Update deepspeech_architecture.md	3 years ago
huangyuxin	84020a0471	fix some mistacks in doc	3 years ago
Hui Zhang	256e9c1b9c	more doc for egs	3 years ago
Hui Zhang	a12b16787d	speech text process docs (#607 ) * add more speech doc * fix doc path and mergify * format doc	4 years ago
Hui Zhang	7bbe1d66d2	more speech docs (#606 ) * add speech related docs: tts, text front end, ngram lm, corrector * format doc * mergify with doc	4 years ago
Hui Zhang	c6ae9857f2	update doc (#603 ) * fix doc format * format doc	4 years ago
Hui Zhang	71e046b0ba	E2E/Streaming Transformer/Conformer ASR (#578 ) * add cmvn and label smoothing loss layer * add layer for transformer * add glu and conformer conv * add torch compatiable hack, mask funcs * not hack size since it exists * add test; attention * add attention, common utils, hack paddle * add audio utils * conformer batch padding mask bug fix #223 * fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2 * fix ci * fix ci * add encoder * refactor egs * add decoder * refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils * refactor docs * add fix * fix readme * fix bugs, refactor collator, add pad_sequence, fix ckpt bugs * fix docstring * refactor data feed order * add u2 model * refactor cmvn, test * add utils * add u2 config * fix bugs * fix bugs * fix autograd maybe has problem when using inplace operation * refactor data, build vocab; add format data * fix text featurizer * refactor build vocab * add fbank, refactor feature of speech * refactor audio feat * refactor data preprare * refactor data * model init from config * add u2 bins * flake8 * can train * fix bugs, add coverage, add scripts * test can run * fix data * speed perturb with sox * add spec aug * fix for train * fix train logitc * fix logger * log valid loss, time dataset process * using np for speed perturb, remove some debug log of grad clip * fix logger * fix build vocab * fix logger name * using module logger as default * fix * fix install * reorder imports * fix board logger * fix logger * kaldi fbank and mfcc * fix cmvn and print prarams * fix add_eos_sos and cmvn * fix cmvn compute * fix logger and cmvn * fix subsampling, label smoothing loss, remove useless * add notebook test * fix log * fix tb logger * multi gpu valid * fix log * fix log * fix config * fix compute cmvn, need paddle 2.1 * add cmvn notebook * fix layer tools * fix compute cmvn * add rtf * fix decoding * fix layer tools * fix log, add avg script * more avg and test info * fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh; * add vimrc * refactor tiny script, add transformer and stream conf * spm demo; librisppech scripts and confs * fix log * add librispeech scripts * refactor data pipe; fix conf; fix u2 default params * fix bugs * refactor aishell scripts * fix test * fix cmvn * fix s0 scripts * fix ds2 scripts and bugs * fix dev & test dataset filter * fix dataset filter * filter dev * fix ckpt path * filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test * add comment * add syllable doc * fix ds2 configs * add doc * add pypinyin tools * fix decoder using blank_id=0 * mmseg with pybind11 * format code	4 years ago
Hui Zhang	a9d0117cfe	fix install (#580 )	4 years ago
Hui Zhang	d4e84f9b9d	fix doc link and enhance install (#570 ) * fix doc link * fix install * fix install doc * fix typo * fix lm doc	4 years ago
Hui Zhang	19e0f2ac46	Fix doc format (#546 )	4 years ago
Hui Zhang	57ed5cd2e0	Fix Doc (#544 )	4 years ago
Hui Zhang	d7e753546a	Support paddle 2.x (#538 ) * 2.x model * model test pass * fix data * fix soundfile with flac support * one thread dataloader test pass * export feasture size add trainer and utils add setup model and dataloader update travis using Bionic dist * add venv; test under venv * fix unittest; train and valid * add train and config * add config and train script * fix ctc cuda memcopy error * fix imports * fix train valid log * fix dataset batch shuffle shift start from 1 fix rank_zero_only decreator error close tensorboard when train over add decoding config and code * test process can run * test with decoding * test and infer with decoding * fix infer * fix ctc loss lr schedule sortagrad logger * aishell egs * refactor train add aishell egs * fix dataset batch shuffle and add batch sampler log print model parameter * fix model and ctc * sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp add grad clip by global norm add model train test notebook * ctc loss remove run prefix using ord value as text id * using unk when training compute_loss need text ids ord id using in test mode, which compute wer/cer * fix tester * add lr_deacy refactor code * fix tools * fix ci add tune fix gru model bugs add dataset and model test * fix decoding * refactor repo fix decoding * fix musan and rir dataset * refactor io, loss, conv, rnn, gradclip, model, utils * fix ci and import * refactor model add export jit model * add deploy bin and test it * rm uselss egs * add layer tools * refactor socket server new model from pretrain * remve useless * fix instability loss and grad nan or inf for librispeech training * fix sampler * fix libri train.sh * fix doc * add license on cpp * fix doc * fix libri script * fix install * clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49	4 years ago
lfchener	d74f4ff3f5	update deepspeech to fluid api	5 years ago
Yibing Liu	27d6cf90d1	add figure for tuning & enrich the tuning section in doc	7 years ago
Xinghai Sun	e8dce3a982	Add README doc section of multi-gpu acceleration.	7 years ago

... 4 5 6 7 8

358 Commits (2147d3b565c7db99dcdaa2db9dae52bee375d0f7)