PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
Jerry	0719698118	Merge branch 'develop' into develop	3 years ago
AdamBear	36c9eaa437	Cache the TextFeaturizer instance for infer speed improvement. (#1260 )	3 years ago
huangyuxin	3e2cc898cb	remove default cfg and fix some bugs,test=asr	3 years ago
Jerryuhoo	2dccd5315d	remove useless "other" dataset	3 years ago
Jerryuhoo	f191d0b022	change speaker embedding position Change speaker embedding position into the encoder.	3 years ago
Jerryuhoo	11991b6d35	add multi-speaker support for speedyspeech	3 years ago
huangyuxin	a1d8ab0f99	merge the develop	3 years ago
huangyuxin	c907a8deda	change all recipes	3 years ago
TianYuan	b9a55262f1	Update fastspeech2.py	3 years ago
Hui Zhang	c81a3f0f83	[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242 ) * batchsampler or distributebatchsampler * format	3 years ago
Junkun Chen	420709e5ce	[st] Distributed sampler and new dataloader with MIMO (#1239 ) * update timit result, test=doc_fix * result update * fix bug * add triplet loader * empty preprocess file * sync to u2, updating * sync to u2 config * fix bugs * code refine * update config * customize decoding batch size * update optimizer and lr scheduler * minor * minor * minor * fix bugs of refs * minor * distributed sampler * minor * refine the loader	3 years ago
TianYuan	fbe3c05137	add style_melgan and hifigan in tts cli, test=tts (#1241 )	3 years ago
TianYuan	a232cd8b12	Update fastspeech2.py	3 years ago
huangyuxin	41eeed0450	add librispeech asr1	3 years ago
huangyuxin	fb6d1e2c11	merge the develop	3 years ago
huangyuxin	2c5902d7c5	rename decoding to decode	3 years ago
TianYuan	42c109216d	[tts]add style melgan pretraied model (#1228 ) * add style melgan pretraied model * add style melgan pretraied model, test=tts Co-authored-by: Hui Zhang <zhtclz@foxmail.com>	3 years ago
Hui Zhang	bb2a370b23	[asr] remove useless conf of librispeech (#1227 ) * remve useless conf * format code * update conf * update conf * update conf	3 years ago
huangyuxin	c40b6f4062	refactor the train and test config,test=asr	3 years ago
TianYuan	5692b0ff04	fix log for t2s (#1219 )	3 years ago
TianYuan	b031ee43c4	Merge pull request #1215 from yt605155624/refactor_punc [text]Refactor punc	3 years ago
TianYuan	e1798e1eeb	update	3 years ago
KP	d362d28d35	Remove logging file in cli api.	3 years ago
TianYuan	15b8904fa1	refactor punc	3 years ago
JiehangXie	927c9bbdb6	Fix a bug when sentence inputed contain English words	3 years ago
KP	1632af7706	Update examples/esc50. (#1203 )	3 years ago
Jerryuhoo	3cbfd7bf35	Add speaker embedding and speaker id for style fastspeech2 inference	3 years ago
Hui Zhang	db121226b8	clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191 )	3 years ago
Hui Zhang	d852aee2ff	[asr] logfbank with dither (#1179 ) * fix logfbank dither * format	3 years ago
KP	9ec2bc8e2e	Update README. test=doc_fix	3 years ago
Jackwaterveg	879857332d	[version]add paddlespeech.__version__ (#1166 ) * add paddlespeech.__version__ * version 0.1.0 is ready	3 years ago
TianYuan	19ef7210a0	[TTS]Add hifigan (#1097 ) * add hifigan * add hifigan * integrate synthesize synthesize_e2e, inference for tts, test=tts * add some python files, test=tts * update readme, test=doc_fix	3 years ago
TianYuan	675cff258b	[TTS]fix praatio version, test=tts (#1158 ) * fix praatio version, test=tts * fix praatio version, test=tts	3 years ago
Jackwaterveg	e9748faa71	[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154 ) * optimize the cli/asr,test=asr * test=doc_fix	3 years ago
Jackwaterveg	2bccde3def	update the version of ctcdecoders and feat,test=doc_fix (#1155 )	3 years ago
Jackwaterveg	0151f2463f	fix bug of pad_sequence in u2,test=asr (#1153 )	3 years ago
Jackwaterveg	68164dd39f	[asr]rename test_hub to test_wav (#1132 ) * add the readme, librispeech_asr1 * fix the test_hub * test=asr	3 years ago
KP	16d6ed3842	Add automatic_video_subtitiles demo.	3 years ago
KP	7394a18732	Add default arguments in cls python api.	3 years ago
TianYuan	f9efbf3063	Update generate_lexicon.py	3 years ago
Jackwaterveg	5b446f6321	[Config]clear the u2 decode config for asr (#1107 ) * clear the u2 decode config * rename the vocab_filepath and cmvn_path	3 years ago
KP	074559fe90	[CLI][Demo][Text]Refactor punctuation_restoration. (#1013 ) * Refactor punctuation_restoration. * Add text cli and punc demo.	3 years ago
Hui Zhang	51d7a07c6d	format and fix pre-commit (#1120 )	3 years ago
TianYuan	5f0f76f249	add eval() for inference model (#1114 )	3 years ago
TianYuan	59e4a34071	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli	3 years ago
TianYuan	3de4130dfc	update am name	3 years ago
TianYuan	9db1710ba7	add conformer demos (#1108 )	3 years ago
TianYuan	3fe75f833d	Merge pull request #1109 from yt605155624/tts_cli [cli]update voc name	3 years ago
TianYuan	ca12a83d5a	update voc name	3 years ago
TianYuan	965a57ef0e	Update README.md	3 years ago
Jackwaterveg	9e31a606d1	set default encoding utf8 for win (#1101 ) Co-authored-by: KP <109694228@qq.com>	3 years ago
Hui Zhang	764a5d4271	Merge branch 'develop' into ctc	3 years ago
Hui Zhang	b1c80c45e0	remove ctc grad norm type in config	3 years ago
huangyuxin	1d4002409f	separate the sox and soxbindings with the requirements	3 years ago
TianYuan	df5fe035e5	Update README.md	3 years ago
TianYuan	a6e0a69da8	Merge pull request #1095 from KPatr1ck/demo [Demo]Add tts demo.	3 years ago
TianYuan	963e906f56	Merge pull request #1068 from yt605155624/add_style_melgan [TTS]add style_melgan	3 years ago
KP	1909f2f620	Add tts demo.	3 years ago
KP	3701fba0be	Update download logic and fix README typos.	3 years ago
TianYuan	f701882b66	update add_style_melgan	3 years ago
gongel	dc60aeb8c2	format	3 years ago
gongel	31510d088c	refactor: rm kaldi_io	3 years ago
TianYuan	2189b46004	add tts cli	3 years ago
KP	70a8a75476	Add st demo.	3 years ago
Hui Zhang	6dedb63e8b	Merge pull request #1087 from Jackwaterveg/setup [ctcdecoders] Separate the ctcdecoders	3 years ago
huangyuxin	9fe0beee54	fix the bug: miss import after install	3 years ago
huangyuxin	cea5ffe0e4	refactor the code	3 years ago
gongel	20d88ec673	refactor: update params/input/output/namestyle	3 years ago
KP	6c1e6e7876	Update recommended model to cnn14 and argument name in __call__.	3 years ago
huangyuxin	ed12db61a6	Separate the ctcdecoders	3 years ago
KP	0b7e0d1e2e	Update tags of pretrained_models.	3 years ago
KP	d08b824d72	Update README.	3 years ago
KP	61e39daccc	Optimize model init.	3 years ago
KP	528c70e515	Remove TODO.	3 years ago
KP	b072453ca8	Fix decompressing problem.	3 years ago
KP	29da318379	Add audio classification cli.	3 years ago
gongel	f5c61ced28	feat: add st cli	3 years ago
Hui Zhang	0818c1601d	add __init__.py	3 years ago
TianYuan	7b2ecb6eed	add style_melgan, test=tts	3 years ago
Hui Zhang	03678c08c5	Merge branch 'develop' into fix_cli	3 years ago
huangyuxin	1b57d05d1b	rm the os.chdir in cli asr	3 years ago
TianYuan	aead853b1d	Update zh_frontend.py	3 years ago
huangyuxin	021311c76b	add transformer to cli infer	3 years ago
TianYuan	a070524d37	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts	3 years ago
TianYuan	dd36eafe34	add style_melgan	3 years ago
KP	54cf048b2a	Merge remote-tracking branch 'update_stream/develop' into cli	3 years ago
huangyuxin	a258a34ec0	revise the convert pcm	3 years ago
Jackwaterveg	8ec576f477	Update infer.py	3 years ago
huangyuxin	b0356ae489	revise	3 years ago
huangyuxin	957f2e3a1c	revise	3 years ago
huangyuxin	aee530af27	revise the sample rate	3 years ago
Junkun	4e31a4445d	eval mode	3 years ago
KP	a19e51d7da	Update python api.	3 years ago
KP	e0642ffc77	Update doc strings.	3 years ago
huangyuxin	90d648a601	support using by __call__	3 years ago
huangyuxin	aecb5f567c	Merge branch 'tmp' into 1048	3 years ago
KP	44e9b032d5	Update inputs and outputs of executor.	3 years ago
huangyuxin	3fadcde5e2	revise the asr infer.py	3 years ago
Hui Zhang	4823892169	Merge pull request #1058 from Jackwaterveg/benchmark [benchmark]fix the benchmark	3 years ago
Junkun	3a14b82844	minor	3 years ago
Junkun	f50a2ab4ca	fix bugs	3 years ago
huangyuxin	cb383a39c3	fix the benchmark	3 years ago
huangyuxin	d0bf506fee	fix the load checkpoint	3 years ago
KP	1707244472	Update device usage.	3 years ago
KP	000294132c	Rename s2t to asr.	3 years ago
huangyuxin	43f4d47bfa	add the call in infer.py	3 years ago
Hui Zhang	39228864bb	format code	3 years ago
Hui Zhang	d395c2b8e3	jsonlines reade manifest file	3 years ago
Hui Zhang	7554b6107a	using visualdl; fix read_manifest	3 years ago
huangyuxin	cdc8520969	add the infer	3 years ago
KP	c94ebdc52c	Add python api for executor.	3 years ago
Junkun	d2fab3238b	fix bugs	3 years ago
Junkun	cdd0845127	add translate function	3 years ago
KP	e9798498d6	Update asr inference in paddlespeech.cli.	3 years ago
huangyuxin	895a086fdd	rename the config.feat_size and the config.vocab.size to input_size and output_size	3 years ago
KP	4d39a7746e	Add paddlespeech.cli.	3 years ago
KP	98f0806353	Add paddlespeech.cli.	3 years ago
TianYuan	6e3257ab8a	Create __init__.py	3 years ago
TianYuan	022f1ce8e9	Merge pull request #1040 from yt605155624/fix_frontend [TTS]update text frontend	3 years ago
TianYuan	a861e56e91	rm space for pure Chinese	3 years ago
TianYuan	dad1cbbcd6	update text frontend	3 years ago
KP	6e1ac1cc15	Add paddlespeech.cls and esc50 example.	3 years ago
KP	33f0e7622c	Add paddlespeech.cls and esc50 example.	3 years ago
KP	2c531d78ac	Add paddlespeech.cls and esc50 example.	3 years ago
KP	bdb3ce23ee	Add paddlespeech.cls and esc50 example.	3 years ago
KP	1189117784	Add paddlespeech.cls and esc50 example.	3 years ago
Hui Zhang	2bbfdbae91	Merge pull request #1015 from yt605155624/fs2_conformer [TTS]fastspeech2 conformer	3 years ago
TianYuan	b0a1d8ab60	fix base	3 years ago
TianYuan	469329221b	refactor encoder, rm old code	3 years ago
Hui Zhang	fe83adfbcb	nproc to ngpu	3 years ago
Hui Zhang	789471bfca	test wav for u2	3 years ago
TianYuan	bc0dd51149	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD	3 years ago
Jackwaterveg	09931d2ccc	Merge pull request #1019 from zh794390558/feat [bugfix] Kaldi Feature using dither in train	3 years ago
huangyuxin	8aebfeac81	fix the prc-commit	3 years ago
Hui Zhang	56480e1033	fix format	3 years ago
Hui Zhang	7ec0ed4aaf	kaldi feat dither when train	3 years ago
Hui Zhang	2ba3f00bbd	Merge branch 'develop' into datapipe	3 years ago
Hui Zhang	b944418d6f	new format data support ds2/st	3 years ago
Hui Zhang	0defc658e1	update aishell/librispeech transformer result; wenetspeech pretrain conformer result	3 years ago
Hui Zhang	d2a05df02e	Merge pull request #1014 from Jackwaterveg/auto_log [asr]hidden the auto_log	3 years ago
huangyuxin	fb6974f950	update the auto_log	3 years ago
TianYuan	4370c5cfa6	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer	3 years ago
Hui Zhang	638b96bf07	check if cmvn_file in config for u2	3 years ago
Hui Zhang	c354e9154b	Merge pull request #1003 from yt605155624/fs2_ge2e [TTS]add fastspeech2 voice cloning in aishell3	3 years ago
TianYuan	133ee7db0b	rename num_speakers	3 years ago
TianYuan	3d5e078c91	add conformer	3 years ago
TianYuan	a97c7b5206	rename spembs	3 years ago
huangyuxin	f646d4c3a1	renew the setup.py for paddlespeech feat and ctcdecoders	3 years ago
huangyuxin	ca06b91fc4	renew the setup.py for paddlespeech feat and ctcdecoders	3 years ago
Hui Zhang	3bd87bc379	add wenet lincense	3 years ago
TianYuan	8d025451de	add fastspeech2 voice cloning in aishell3	3 years ago
TianYuan	c5c9f19091	rename to gen_gta_mel.py, remove stats compute when gen fintune data	3 years ago
TianYuan	a6ac497f8e	add multi-band melgan finetune scripts	3 years ago
Hui Zhang	fe29f74a1c	Merge pull request #992 from yt605155624/fix_docs [TTS] add tts tutorial	3 years ago
TianYuan	30d09b411d	fix style_syn, replace DeepSpeech with PaddleSpeech in readme	3 years ago
TianYuan	0bc9450c51	Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs	3 years ago
Hui Zhang	f9b66d0d97	Remove useless folders (#990 )	3 years ago
Hui Zhang	2d76638d62	more speech domain	3 years ago
TianYuan	0fcc5005a2	add tts tutorial	3 years ago
Hui Zhang	1ae1ead80f	more install scripts	3 years ago
Hui Zhang	51a6845564	Merge pull request #985 from Jackwaterveg/benchmark revise the benchmark	3 years ago
huangyuxin	843ea1c12e	revise the benchmark	3 years ago
Hui Zhang	080b0431f4	format code	3 years ago
Junkun	7c8843448c	add word reward into beam search.	3 years ago
Hui Zhang	9a71c091c5	remove debug info and format code	3 years ago
Hui Zhang	8b0e344c69	fix logfbank using PCM16	3 years ago
Hui Zhang	7ceef6c3f5	format code	3 years ago
Hui Zhang	f9221b4b74	fix ctc align	3 years ago
Hui Zhang	fb853167d3	format code	3 years ago
Hui Zhang	18d9abc7a0	add sox speed pertrub	3 years ago
Hui Zhang	56d06f2aaf	Merge pull request #968 from yt605155624/merge_paddlespeech [TTS] change nprocs to ngpu	3 years ago
Hui Zhang	000fac53fe	Merge pull request #966 from Jackwaterveg/dev change the lm dataset dir, add the 'LM_BIN_DIR' in s2 path.sh	3 years ago
Hui Zhang	6a7e0265cd	add josn global cmvn	3 years ago
Hui Zhang	9cdd2643b1	fix bug for batch dataloader using	3 years ago
Hui Zhang	69bccb4f02	fix ctc align	3 years ago
TianYuan	bacdf5756b	Merge remote-tracking branch 'origin/develop' into merge_paddlespeech	3 years ago
Hui Zhang	69055698a2	transformer using batch data loader	3 years ago
TianYuan	35c37ace17	change nprocs to ngpu, add aishell3/voc1	3 years ago
huangyuxin	d647cde870	change the lm dataset dir	3 years ago
TianYuan	6655728b08	add reference	3 years ago
Hui Zhang	38cf56295a	fix reference format	3 years ago
Hui Zhang	c463a00f81	add reference code license	3 years ago
Hui Zhang	2a66c2c13b	format code	3 years ago
Hui Zhang	e2bcaee4f1	merge deepspeech, parakeet and text_processing into paddlespeech	3 years ago

... 13 14 15 16 17 ...

884 Commits (ac385053ba341273faa99741273d5ff53eb378ce)