PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
Hui Zhang	51d7a07c6d	format and fix pre-commit (#1120 )	3 years ago
Mingxue-Xu	65f684806e	[DOCS] Correct the grammar and spelling mistakes of install.md (#1115 )	3 years ago
TianYuan	9db1710ba7	add conformer demos (#1108 )	3 years ago
Hui Zhang	7acf62d208	fix release model (#1106 )	3 years ago
Mingxue-Xu	eeadee1e7f	[README] Update ST and AC info in README.md	3 years ago
TianYuan	094d05f6b8	Update quick_start.md	3 years ago
TianYuan	b8d8fdccd6	Update quick_start.md	3 years ago
TianYuan	92b6af82e4	Update released_model.md	3 years ago
TianYuan	d4a76e41cd	Update released_model.md	3 years ago
TianYuan	e56afdb18e	Update models_introduction.md	3 years ago
TianYuan	8e685a1a43	Update gan_vocoder.md	3 years ago
TianYuan	5d8446b17c	rm big sources in demos	3 years ago
TianYuan	a6a81e6365	Merge pull request #1075 from yt605155624/fix_docs [TTS]update tts tutorial	3 years ago
TianYuan	9b4c616f90	update tts tutorial	3 years ago
Hui Zhang	b13bea7dd8	fix index	3 years ago
TianYuan	2cf9162345	Update install.md	3 years ago
TianYuan	596a3c0100	Create install.md	3 years ago
TianYuan	cdb2fa9de9	Update install.md	3 years ago
TianYuan	360efddde0	Update install.md	3 years ago
Hui Zhang	72ef492f21	Merge branch 'develop' into install	3 years ago
Hui Zhang	2bbc4db508	fix install	3 years ago
TianYuan	b5527762b5	Update install.md	3 years ago
huangyuxin	829b7758de	revise	3 years ago
huangyuxin	719d23c07a	revise the install.md, setup.py and makefile, rm the setup.sh	3 years ago
huangyuxin	989aec4413	optimize the setup.py and setup.sh	3 years ago
TianYuan	05a6f7767b	Merge pull request #1052 from yt605155624/fix_docs [TTS]update tts_tutorial	3 years ago
TianYuan	c35457b80e	update tts_tutorial	3 years ago
Hui Zhang	396db4a56a	update librispeech asr1-2 result; add warpctc source link in ctc topic	3 years ago
Hui Zhang	2bbfdbae91	Merge pull request #1015 from yt605155624/fs2_conformer [TTS]fastspeech2 conformer	3 years ago
TianYuan	469329221b	refactor encoder, rm old code	3 years ago
Hui Zhang	7086579ded	Merge pull request #1038 from Jackwaterveg/release_model [released model]updata the ds2 released model	3 years ago
huangyuxin	137238448d	updata the ds2 model	3 years ago
Hui Zhang	712de751cb	Merge pull request #1036 from zh794390558/nproc [asr] nproc to ngpu	3 years ago
TianYuan	c52d7f2bfc	Update reference.md	3 years ago
Hui Zhang	fe83adfbcb	nproc to ngpu	3 years ago
TianYuan	f451d880ff	Update quick_start.md	3 years ago
TianYuan	d0f0a8e78d	update ipynb	3 years ago
TianYuan	f3fbce005e	update ipynb, add eval loss	3 years ago
TianYuan	c9f41bf1b3	Update reference.md	3 years ago
TianYuan	787e823782	Update quick_start.md	3 years ago
TianYuan	14397db5cc	Update quick_start.md	3 years ago
Jackwaterveg	09931d2ccc	Merge pull request #1019 from zh794390558/feat [bugfix] Kaldi Feature using dither in train	3 years ago
Hui Zhang	7a25ee26d9	fix release model egs name	3 years ago
Hui Zhang	7ec0ed4aaf	kaldi feat dither when train	3 years ago
TianYuan	2d808a3c64	fix urls	3 years ago
TianYuan	4370c5cfa6	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer	3 years ago
TianYuan	87c1f5bb3a	Merge pull request #998 from Jackwaterveg/develop [Doc] update the released model info	3 years ago
TianYuan	3d5e078c91	add conformer	3 years ago
TianYuan	4c60d3b31f	Update index.rst	3 years ago
TianYuan	5a8938fc28	Update introduction.md	3 years ago
TianYuan	afe4a20b8c	Update released_model.md	3 years ago
Hui Zhang	b8e7dff82a	Merge pull request #997 from Jackwaterveg/conda_install [setup.sh] add conda install pipeline	3 years ago
Zeyu Chen	4a28751df0	Formalize the terms in README	3 years ago
huangyuxin	0556e9d654	update the released model info	3 years ago
huangyuxin	2f92a5d9a7	add conda init, use gcc 8.4.0	3 years ago
TianYuan	8b86bc9f78	Update zh_text_frontend.md	3 years ago
huangyuxin	e84690f6f0	add conda install pipline	3 years ago
Jackwaterveg	dc7daa2a61	Merge pull request #993 from zh794390558/ctc_loss [doc] add ctc loss topic	3 years ago
Hui Zhang	c01322e488	md to jupyter	3 years ago
Hui Zhang	e2e75fa66b	support for latex by texify	3 years ago
TianYuan	9106a90055	update text_frontend_struct.png	3 years ago
Hui Zhang	37e6f9d745	add ctc loss topic	3 years ago
TianYuan	30d09b411d	fix style_syn, replace DeepSpeech with PaddleSpeech in readme	3 years ago
Mingxue-Xu	43fab681d0	Update released_model.md	3 years ago
TianYuan	0bc9450c51	Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs	3 years ago
TianYuan	0fcc5005a2	add tts tutorial	3 years ago
Mingxue-Xu	43d30109da	Delete outdated QR code.	3 years ago
Hui Zhang	67cb3004a3	fix doc of install	3 years ago
Jackwaterveg	b0b4843475	Update introduction.md	3 years ago
Jackwaterveg	36e09f90b8	Add the Speech-To-Text in introduction.md	3 years ago
TianYuan	cefe327d0b	Merge pull request #969 from Mingxue-Xu/develop Update README.md	3 years ago
Mingxue-Xu	36fff54a53	Delete unnecessary images.	3 years ago
huangyuxin	61ad2c87a7	update the ds2 online conf	3 years ago
TianYuan	4d6f1646d4	Merge remote-tracking branch 'upstream/develop' into develop	3 years ago
Mingxue-Xu	93b12ec797	rename logo	3 years ago
Mingxue-Xu	f7a99bb8bd	Please enter the commit message for your changes. Lines starting g.png	3 years ago
Mingxue-Xu	718c7b3187	Add ASR samples.	3 years ago
Hui Zhang	7ceef6c3f5	format code	3 years ago
Mingxue-Xu	0bc0d267c9	Update README.md Update README.md Update README.md Add files via upload Update README.md Update README.md Update install.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Update README.md Corrected the mistakes mentioned by @zh794390558 Add files via upload Update README.md Update README.md Delete 002.wav Delete 001.wav Delete 012.wav Delete 006.wav Update README.md Update README.md Add files via upload Update README.md Update README.md Add files via upload Update README.md Add files via upload Update README.md Update README.md Update README according to PaddleOCR Update README according to PaddleOCR Correct some links.	3 years ago
Jackwaterveg	b4d0e72546	Create dependencies.md Add the dependencies.md	3 years ago
Hui Zhang	3f611c75a6	Merge pull request #962 from PaddlePaddle/doc [doc] more reference repo and licence info	3 years ago
KP	98c1131058	Add librosa reference.	3 years ago
Hui Zhang	77bab04403	add tutorial dir	3 years ago
Hui Zhang	b49fbe65d7	more detals of reference	3 years ago
Hui Zhang	58b24aa49f	Merge pull request #960 from PaddlePaddle/paddlespeech [paddlespeech] merge deepspeech, parakeet and text_processing into paddlespeech	3 years ago
huangyuxin	6e8b3d0ffd	fix the quick start	3 years ago
Hui Zhang	e369022f71	remve .travis; fix install doc; more kws in setup.py	3 years ago
Hui Zhang	e2bcaee4f1	merge deepspeech, parakeet and text_processing into paddlespeech	3 years ago
TianYuan	9ccef7fa04	add paddle tts vs espnet tts demos	3 years ago
TianYuan	0c3c218305	fix demos	3 years ago
TianYuan	c13b75c2b0	fix docs install	3 years ago
TianYuan	670a68ad95	fix textfrontend readme, fix imgs link	3 years ago
TianYuan	41526ca1b8	Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs	3 years ago
TianYuan	3f9e30c9b3	refactor docs	3 years ago
TianYuan	304d71747a	Merge pull request #939 from Jackwaterveg/doc fix the doc	3 years ago
Mingxue-Xu	1cb1221389	[README(TTS+ASR)] Update the README according to compliant templates and specifications. (#940 ) * Update README.md * Update README.md * Update README.md * Add files via upload * Update README.md * Update README.md * Update install.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md	3 years ago
huangyuxin	cef36521f9	fix the doc	3 years ago
Hui Zhang	f55267f2b3	fix img link; rsl format;	3 years ago
Hui Zhang	58fe852170	setup.py deps from requirements.txt	3 years ago
Hui Zhang	6e9a230f04	update readme	3 years ago
Hui Zhang	b079577e08	merge parakeet repo into deepspeech	3 years ago
huangyuxin	b453b425af	add readthedoc	3 years ago
Hui Zhang	b7b1bda34f	test refactor collator	3 years ago
Jackwaterveg	d75cf89630	Update released_model.md	3 years ago
huangyuxin	bd72afb02b	Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into doc	3 years ago
Jackwaterveg	c55b3c7bff	Update released_model.md	3 years ago
Jackwaterveg	c30200a438	Update released_model.md	3 years ago
Hui Zhang	9a95ceb0b4	add Acknowledgements	3 years ago
Hui Zhang	f15e1ff732	fix doc link	3 years ago
Jackwaterveg	d0c9dc9342	Update deepspeech_architecture.md	3 years ago
Jackwaterveg	0b2c794d88	Emphasis the setup stage in install.sh	3 years ago
Jackwaterveg	6e5d152503	Update deepspeech_architecture.md	3 years ago
Jackwaterveg	c8d62807b3	Update deepspeech_architecture.md	3 years ago
huangyuxin	84020a0471	fix some mistacks in doc	3 years ago
Hui Zhang	256e9c1b9c	more doc for egs	3 years ago
Hui Zhang	a12b16787d	speech text process docs (#607 ) * add more speech doc * fix doc path and mergify * format doc	4 years ago
Hui Zhang	7bbe1d66d2	more speech docs (#606 ) * add speech related docs: tts, text front end, ngram lm, corrector * format doc * mergify with doc	4 years ago
Hui Zhang	c6ae9857f2	update doc (#603 ) * fix doc format * format doc	4 years ago
Hui Zhang	71e046b0ba	E2E/Streaming Transformer/Conformer ASR (#578 ) * add cmvn and label smoothing loss layer * add layer for transformer * add glu and conformer conv * add torch compatiable hack, mask funcs * not hack size since it exists * add test; attention * add attention, common utils, hack paddle * add audio utils * conformer batch padding mask bug fix #223 * fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2 * fix ci * fix ci * add encoder * refactor egs * add decoder * refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils * refactor docs * add fix * fix readme * fix bugs, refactor collator, add pad_sequence, fix ckpt bugs * fix docstring * refactor data feed order * add u2 model * refactor cmvn, test * add utils * add u2 config * fix bugs * fix bugs * fix autograd maybe has problem when using inplace operation * refactor data, build vocab; add format data * fix text featurizer * refactor build vocab * add fbank, refactor feature of speech * refactor audio feat * refactor data preprare * refactor data * model init from config * add u2 bins * flake8 * can train * fix bugs, add coverage, add scripts * test can run * fix data * speed perturb with sox * add spec aug * fix for train * fix train logitc * fix logger * log valid loss, time dataset process * using np for speed perturb, remove some debug log of grad clip * fix logger * fix build vocab * fix logger name * using module logger as default * fix * fix install * reorder imports * fix board logger * fix logger * kaldi fbank and mfcc * fix cmvn and print prarams * fix add_eos_sos and cmvn * fix cmvn compute * fix logger and cmvn * fix subsampling, label smoothing loss, remove useless * add notebook test * fix log * fix tb logger * multi gpu valid * fix log * fix log * fix config * fix compute cmvn, need paddle 2.1 * add cmvn notebook * fix layer tools * fix compute cmvn * add rtf * fix decoding * fix layer tools * fix log, add avg script * more avg and test info * fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh; * add vimrc * refactor tiny script, add transformer and stream conf * spm demo; librisppech scripts and confs * fix log * add librispeech scripts * refactor data pipe; fix conf; fix u2 default params * fix bugs * refactor aishell scripts * fix test * fix cmvn * fix s0 scripts * fix ds2 scripts and bugs * fix dev & test dataset filter * fix dataset filter * filter dev * fix ckpt path * filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test * add comment * add syllable doc * fix ds2 configs * add doc * add pypinyin tools * fix decoder using blank_id=0 * mmseg with pybind11 * format code	4 years ago
Hui Zhang	a9d0117cfe	fix install (#580 )	4 years ago
Hui Zhang	d4e84f9b9d	fix doc link and enhance install (#570 ) * fix doc link * fix install * fix install doc * fix typo * fix lm doc	4 years ago
Hui Zhang	19e0f2ac46	Fix doc format (#546 )	4 years ago
Hui Zhang	57ed5cd2e0	Fix Doc (#544 )	4 years ago
Hui Zhang	d7e753546a	Support paddle 2.x (#538 ) * 2.x model * model test pass * fix data * fix soundfile with flac support * one thread dataloader test pass * export feasture size add trainer and utils add setup model and dataloader update travis using Bionic dist * add venv; test under venv * fix unittest; train and valid * add train and config * add config and train script * fix ctc cuda memcopy error * fix imports * fix train valid log * fix dataset batch shuffle shift start from 1 fix rank_zero_only decreator error close tensorboard when train over add decoding config and code * test process can run * test with decoding * test and infer with decoding * fix infer * fix ctc loss lr schedule sortagrad logger * aishell egs * refactor train add aishell egs * fix dataset batch shuffle and add batch sampler log print model parameter * fix model and ctc * sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp add grad clip by global norm add model train test notebook * ctc loss remove run prefix using ord value as text id * using unk when training compute_loss need text ids ord id using in test mode, which compute wer/cer * fix tester * add lr_deacy refactor code * fix tools * fix ci add tune fix gru model bugs add dataset and model test * fix decoding * refactor repo fix decoding * fix musan and rir dataset * refactor io, loss, conv, rnn, gradclip, model, utils * fix ci and import * refactor model add export jit model * add deploy bin and test it * rm uselss egs * add layer tools * refactor socket server new model from pretrain * remve useless * fix instability loss and grad nan or inf for librispeech training * fix sampler * fix libri train.sh * fix doc * add license on cpp * fix doc * fix libri script * fix install * clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49	4 years ago
lfchener	d74f4ff3f5	update deepspeech to fluid api	5 years ago
Yibing Liu	27d6cf90d1	add figure for tuning & enrich the tuning section in doc	7 years ago
Xinghai Sun	e8dce3a982	Add README doc section of multi-gpu acceleration.	7 years ago

1 2 3 4 5

227 Commits (2c5121c53299089e15f87c51e5f6808c4ed7853e)