PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
Hui Zhang	b079577e08	merge parakeet repo into deepspeech	3 years ago
huangyuxin	b453b425af	add readthedoc	3 years ago
Hui Zhang	366e34c925	update paddle version to 2.1.2	3 years ago
Hui Zhang	f15e1ff732	fix doc link	3 years ago
Hui Zhang	bad7f91857	fix readme	3 years ago
Hui Zhang	c9578cf940	remove useless link	3 years ago
Hui Zhang	f54dc983b6	using bw rnn in ds2	3 years ago
Hui Zhang	c81743403a	fix	3 years ago
Hui Zhang	c09b0e8940	fix specaug	3 years ago
Hui Zhang	4c0ee8d354	fix conf and readme	3 years ago
Hui Zhang	b3bc451328	remove sequnce_mask and change ds2 export audio shape to [B,T,D] (#639 ) * remove sequnce_mask * format * fix ds2 export audio shape from B,D,T to B,T,D	4 years ago
Hui Zhang	2ff726a66e	fix doc link (#627 )	4 years ago
Hui Zhang	37c5324138	fix result; add feature list	4 years ago
Hui Zhang	295f8bdad5	train ds2 model (#622 ) * default cmvn compute config; more log of grad clip; diff ds2 cmvn compute and conf; ds2 lr step by epoch; * fix ds2 config * fix install and egs link * sox speed pertrub shape (T, C), float64, process using int32 * fix libri ds2 scripts; add ngram and spm doc * aishell ds2 cer7.86 * fix ds2 result	4 years ago
Hui Zhang	538bf271eb	chinese char/word ngram lm (#613 ) * add ngram lm egs * add zhon repo * install kenlm, zhon * format * add chinese_text_normalization repo * add ngram lm egs	4 years ago
Hui Zhang	db022fac6e	fix doc (#611 )	4 years ago
Hui Zhang	90512c3964	Fix readme install link (#610 )	4 years ago
Hui Zhang	c6ae9857f2	update doc (#603 ) * fix doc format * format doc	4 years ago
Hui Zhang	71e046b0ba	E2E/Streaming Transformer/Conformer ASR (#578 ) * add cmvn and label smoothing loss layer * add layer for transformer * add glu and conformer conv * add torch compatiable hack, mask funcs * not hack size since it exists * add test; attention * add attention, common utils, hack paddle * add audio utils * conformer batch padding mask bug fix #223 * fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2 * fix ci * fix ci * add encoder * refactor egs * add decoder * refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils * refactor docs * add fix * fix readme * fix bugs, refactor collator, add pad_sequence, fix ckpt bugs * fix docstring * refactor data feed order * add u2 model * refactor cmvn, test * add utils * add u2 config * fix bugs * fix bugs * fix autograd maybe has problem when using inplace operation * refactor data, build vocab; add format data * fix text featurizer * refactor build vocab * add fbank, refactor feature of speech * refactor audio feat * refactor data preprare * refactor data * model init from config * add u2 bins * flake8 * can train * fix bugs, add coverage, add scripts * test can run * fix data * speed perturb with sox * add spec aug * fix for train * fix train logitc * fix logger * log valid loss, time dataset process * using np for speed perturb, remove some debug log of grad clip * fix logger * fix build vocab * fix logger name * using module logger as default * fix * fix install * reorder imports * fix board logger * fix logger * kaldi fbank and mfcc * fix cmvn and print prarams * fix add_eos_sos and cmvn * fix cmvn compute * fix logger and cmvn * fix subsampling, label smoothing loss, remove useless * add notebook test * fix log * fix tb logger * multi gpu valid * fix log * fix log * fix config * fix compute cmvn, need paddle 2.1 * add cmvn notebook * fix layer tools * fix compute cmvn * add rtf * fix decoding * fix layer tools * fix log, add avg script * more avg and test info * fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh; * add vimrc * refactor tiny script, add transformer and stream conf * spm demo; librisppech scripts and confs * fix log * add librispeech scripts * refactor data pipe; fix conf; fix u2 default params * fix bugs * refactor aishell scripts * fix test * fix cmvn * fix s0 scripts * fix ds2 scripts and bugs * fix dev & test dataset filter * fix dataset filter * filter dev * fix ckpt path * filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test * add comment * add syllable doc * fix ds2 configs * add doc * add pypinyin tools * fix decoder using blank_id=0 * mmseg with pybind11 * format code	4 years ago
Hui Zhang	9ac99f7cc6	disscusion for questions, issue only for bug report (#573 )	4 years ago
Hui Zhang	d4e84f9b9d	fix doc link and enhance install (#570 ) * fix doc link * fix install * fix install doc * fix typo * fix lm doc	4 years ago
Zeyu Chen	aaafe1417d	Update README.md	4 years ago
Hui Zhang	19e0f2ac46	Fix doc format (#546 )	4 years ago
Hui Zhang	57ed5cd2e0	Fix Doc (#544 )	4 years ago
Hui Zhang	d7e753546a	Support paddle 2.x (#538 ) * 2.x model * model test pass * fix data * fix soundfile with flac support * one thread dataloader test pass * export feasture size add trainer and utils add setup model and dataloader update travis using Bionic dist * add venv; test under venv * fix unittest; train and valid * add train and config * add config and train script * fix ctc cuda memcopy error * fix imports * fix train valid log * fix dataset batch shuffle shift start from 1 fix rank_zero_only decreator error close tensorboard when train over add decoding config and code * test process can run * test with decoding * test and infer with decoding * fix infer * fix ctc loss lr schedule sortagrad logger * aishell egs * refactor train add aishell egs * fix dataset batch shuffle and add batch sampler log print model parameter * fix model and ctc * sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp add grad clip by global norm add model train test notebook * ctc loss remove run prefix using ord value as text id * using unk when training compute_loss need text ids ord id using in test mode, which compute wer/cer * fix tester * add lr_deacy refactor code * fix tools * fix ci add tune fix gru model bugs add dataset and model test * fix decoding * refactor repo fix decoding * fix musan and rir dataset * refactor io, loss, conv, rnn, gradclip, model, utils * fix ci and import * refactor model add export jit model * add deploy bin and test it * rm uselss egs * add layer tools * refactor socket server new model from pretrain * remve useless * fix instability loss and grad nan or inf for librispeech training * fix sampler * fix libri train.sh * fix doc * add license on cpp * fix doc * fix libri script * fix install * clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49	4 years ago
Hui Zhang	a43ee0ff86	update en readme	4 years ago
Hui Zhang	141109b49d	update aishell egs	4 years ago
Hui Zhang	c246d315b5	update en readme	4 years ago
Hui Zhang	f965d72f3f	update readme	4 years ago
Hui Zhang	126677a35c	support py3	4 years ago
lfchener	01019fb92b	update grad_clip to 1.8	5 years ago
lfchener	a70a70a51d	update README.md and README_cn.md	5 years ago
Li Fuchen	a78922d5e1	Merge pull request #393 from AshishKarel/patch-1 Update Readme	5 years ago
AshishKarel	1b4117efdd	Update Readme Python command to normalize the data have some typo errors.	5 years ago
Harisankar H	31e7a95141	Update README.md added missing yum install package Update README.md added missing yum install package `python-devel` which is required by python package `resampy`.	5 years ago
lfchener	d5ee83b074	update README	5 years ago
lfchener	827a8c8ea3	update README	5 years ago
lfchener	261844b3da	update README	5 years ago
lfchener	7da06fb5ee	upadte README.md and README_cn.md	5 years ago
lfchener	d89c3a48a7	unify api to 1.6 version and fix some problems	5 years ago
lfchener	d74f4ff3f5	update deepspeech to fluid api	5 years ago
Yibing Liu	b8a6d3b969	Update README.md	5 years ago
Yibing Liu	13ec49de95	Update README.md	5 years ago
Yibing Liu	f19a58c2bf	Update README.md	5 years ago
Yibing Liu	445d84ee26	Update lm & acoustic models' link	6 years ago
Yibing Liu	9aef6d2b6b	Fix the issues link in README	7 years ago
Yibing Liu	9c5daab08d	Update benchmark result for BaiduEN8K model due to #88	7 years ago
Xinghai Sun	0823cd2ce4	Upload BaiduCN1.2k model.	7 years ago
Yibing Liu	6c2d0e61b5	fix the link to cloud training in doc	7 years ago
Yibing Liu	ccb4332fe3	update benchmark result for English model	7 years ago
Yibing Liu	cd5f558bc7	Add library boost to the dependency	7 years ago
Yibing Liu	61177a10b2	update the rebuilt docker repo's name in doc	7 years ago
Yibing Liu	f862e0c646	Merge branch 'develop' of upstream into fix_docker_doc	7 years ago
Yibing Liu	74e00f4e15	add more info in the setup section	7 years ago
Yibing Liu	5ba0e0a00b	update setup in readme	7 years ago
yangyaming	a200271ba9	Update libri model.	7 years ago
Yang yaming	45760ebf16	Merge pull request #16 from pkuyym/fix-9 Add script for VoxForge data preparation.	7 years ago
yangyaming	adc117312f	Refine doc and fix path for run_data.sh	7 years ago
yangyaming	b5f70d5fcf	Refine doc.	7 years ago
yangyaming	abbfa43b22	Add script for VoxForge data preparation.	7 years ago
yangyaming	35ef4624b0	Update url for Aishell model.	7 years ago
Yibing Liu	0923f3a520	fix doc for Docker	7 years ago
Hu Weiwei	cc5e420331	fix typo	7 years ago
yangyaming	1e3875160c	Add url for BaiduEng8k model.	7 years ago
Yibing Liu	a0d1146be7	Update benchmark results for LibriSpeech model due to #427	7 years ago
Yibing Liu	fcd6149704	Update benchmark results for BaiduEN8K model due to #427	7 years ago
Yibing Liu	f8da5127fe	Update benchmark results for LibriSpeech model	7 years ago
Xinghai Sun	9c897b7256	Merge pull request #421 from xinghai-sun/benchmark Update DS2 benchmark results.	7 years ago
Xinghai Sun	42ef8b3be3	Rename: Eng --> EN, Chi --> CN	7 years ago
Xinghai Sun	84155d1548	Update DS2 benchmark results.	7 years ago
Yang yaming	a6bffa59ec	Merge pull request #364 from pkuyym/fix-361 Add document for Mandarin model.	7 years ago
yangyaming	046f6ca994	Refine doc.	7 years ago
yangyaming	963b60d5ed	Refine doc for Mandarin training.	7 years ago
Xinghai Sun	31d8e7e033	Merge pull request #418 from kuke/docker_doc_dev Add the document about docker running for DS2	7 years ago
Yibing Liu	6f90a33f1f	Update the doc about docker running for DS2	7 years ago
Yibing Liu	3e048a3c9a	Add the doc about docker running for DS2	7 years ago
yangyaming	1f6a18e8e8	Refine doc.	7 years ago
yangyaming	e8a5a17b1d	Refine doc.	7 years ago
yangyaming	d78d4fa6ff	Add url for large Mandarin LM.	7 years ago
yangyaming	9b7fc7e903	Add doc for Chinese LM.	7 years ago
yangyaming	e3bb689c0e	Add document for Mandarin model.	7 years ago
yangyaming	e909396f91	Refine doc.	7 years ago
yangyaming	0057ca1fb5	Add doc for mandarin lm.	7 years ago
yangyaming	a87e3d0f61	Refine doc.	7 years ago
yangyaming	35543fff8b	Add doc for english LM.	7 years ago
Yibing Liu	c66a40d7ac	Merge branch 'develop' of upstream into add_tuning_fig	7 years ago
Yibing Liu	cc3570d406	format some writings	7 years ago
Xinghai Sun	285972e36e	Update experimental results for DS2.	7 years ago
Yibing Liu	27d6cf90d1	add figure for tuning & enrich the tuning section in doc	7 years ago
Xinghai Sun	e8dce3a982	Add README doc section of multi-gpu acceleration.	7 years ago
Xinghai Sun	3bb746c61f	Add last two sections (experiments and model released) to README.md.	7 years ago
Xinghai Sun	351f61e366	Update RAEDME.md and librispeech.py by following Yaming's review.	7 years ago
Xinghai Sun	ac56a2f249	Update READMD.md and other details by following reviewers comments.	7 years ago
Xinghai Sun	35caf5e0b7	Add bash code highlight to README.md for DS2.	7 years ago
Xinghai Sun	4969d297d8	Correct typos for DS2 README.md.	7 years ago
Xinghai Sun	87453365b2	Update REAME.md for DS2.	7 years ago
Xinghai Sun	e11b735de5	Update examples scripts and REAME.md for DS2.	7 years ago
Xinghai Sun	a00a436b52	Rewrite README.md doc (50%) and correct some bugs.	7 years ago
Xinghai Sun	861b946d7a	Re-design README.md doc structure and add table of contents.	7 years ago
Xinghai Sun	5623b09868	Move decoder.py to models and re-arrange unitests.	7 years ago
Xinghai Sun	0bbb9c3ee2	Re-organize folder structure and hierarchy for DS2.	7 years ago
Luo Tao	5e13fd7dad	deep speech2 can directly use warpctc instead by export LD_LIBRARY_PATH	7 years ago
Xinghai Sun	da28015556	Update README for DS2 cloud training.	7 years ago
Yibing Liu	9e08727c95	remove prerequisites part in the readme of DS2	7 years ago
Yibing Liu	b57dc63e1f	update readme in DS2	7 years ago
yangyaming	1325cd9b8e	Create 'tools' to hold tool scripts and add vocabulary dictionary building script.	7 years ago
Xinghai Sun	7e39debcb0	Convert README.md's file mode to 644.	7 years ago
Xinghai Sun	f4375ef125	Update README.md with code reviews for DS2.	7 years ago
Xinghai Sun	c0b3281e58	Remove pynput and pyaudio packages from requriements.txt and add installation tips to README.md.	8 years ago
Xinghai Sun	b57d244363	Add ASR demo usage to README.md for DS2.	8 years ago
Yibing Liu	a48469b9b6	add the requirement for cuDNN version in README	8 years ago
Luo Tao	c7676286ab	install libsndfile from /usr to thirdparty	8 years ago
Yibing Liu	cb0680e8c4	follow comments to modify README.md	8 years ago
Yibing Liu	724ef18596	update several scripts to support mfcc	8 years ago
Yibing Liu	ee5abbe37d	add mfcc feature for DS2	8 years ago
Yibing Liu	8ce9546710	modify README.md	8 years ago
Yibing Liu	d15c48d616	upload the language model	8 years ago
Yibing Liu	aeccd9851b	append README.md	8 years ago
Xinghai Sun	13f708739b	Improve audio featurizer and add shift augmentor. 1. Improve audio featurizer. 2. Add shift augmentor. 3. Update default argument to be the current best seggestion. 4. Add checkpoints with pass id.	8 years ago
yangyaming	a5dcd23bf2	Follow comments.	8 years ago
Xinghai Sun	1cef98f210	Update README.md for DS2.	8 years ago
Xinghai Sun	06e9f71389	Remove manifest's line number check from librispeech.py and update README.md.	8 years ago
Xinghai Sun	d3eeb7fd76	Refine librispeech.py for DeepSpeech2. Summary: 1. Add manifest line check. 2. Avoid re-unpacking if unpacked data already exists. 3. Add full_download (download all 7 sub-datasets of LibriSpeech).	8 years ago
Xinghai Sun	730d5c4dd3	Update DS2 README.md and fix bug in librispeech.py	8 years ago
Xinghai Sun	2a83486500	Refactor decoder interfaces and add ./data directory.	8 years ago
Xinghai Sun	8313895e85	1. Fix incorrect decoder result printing. 2. Fix incorrect batch-norm usage in RNN. 3. Fix overlapping train/dev/test manfests. 4. Update README.md and requirements.txt. 5. Expose more arguments to users in argparser. 6. Update all other details.	8 years ago
Xinghai Sun	70a343a499	Add infererence and add SortaGrad for only first pass.	8 years ago
Xinghai Sun	3fc94427db	Add librispeech dataset, audio data provider and simplfied DeepSpeech2 model configuration. Bug exists when run training.	8 years ago
Xinghai Sun	d59b8ca97e	Add deep_speech_2 folder.	8 years ago

... 2 3 4 5 6

279 Commits (5e63ac1e608d2e65bca08d71c49f13106fe9fa53)