PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
qingen	84576d6956	[vec][score] add plda model, test=doc fix #1667	3 years ago
lym0302	1a3c811f04	code format, test=doc	3 years ago
TianYuan	0d6f5868ea	Merge pull request #1665 from yt605155624/add_onnx [TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan	3 years ago
Honei	f500fa8bde	Merge pull request #1646 from Honei/develop [vec]add speaker verification score method	3 years ago
TianYuan	0282d45c62	remove fill_constant_batch_size_like in static model of speedyspeech, test=tts	3 years ago
TianYuan	c765fca6b4	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx	3 years ago
TianYuan	124eb6af8f	update notes, test=doc	3 years ago
TianYuan	e0d222e674	update notes, test=doc	3 years ago
Hui Zhang	1843bed458	Merge pull request #1666 from Jackwaterveg/cli [CLI] ASR: Add duration limitation for asr	3 years ago
xiongxinlei	a8244dc5b0	update the note, test=doc	3 years ago
Jackwaterveg	c852776bc6	test=doc	3 years ago
TianYuan	f264b912fc	add warmup for frontend, test=doc	3 years ago
Jackwaterveg	4922e697e1	update cli, test = asr	3 years ago
Jackwaterveg	1c05d03806	test=asr	3 years ago
xiongxinlei	9b5f7f71ac	add part ecapa-tdnn note, test=doc	3 years ago
Hui Zhang	6eed542c08	Merge pull request #1660 from yt605155624/fix_pre [TTS]fix preprocess bug, test=tts	3 years ago
Honei	83310b6379	Merge branch 'develop' into develop	3 years ago
huangyuxin	faf21f033f	add duration limitation for asr	3 years ago
TianYuan	7aecb2c4bb	add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts	3 years ago
xiongxinlei	d064c8196e	update the speaker verification model, test=doc	3 years ago
xiongxinlei	e72912adb9	update the speaker verification model, test=doc	3 years ago
TianYuan	a8f5990869	fix preprocess bug, test=tts	3 years ago
lym0302	759a9e61e4	update server cli, test=doc	3 years ago
lym0302	603e565ab1	add stream tts server, test=doc	3 years ago
ccrrong	378fe5909f	add ami diarization pipeline, test=doc	3 years ago
xiongxinlei	48b8cc8937	add score method, test=doc	3 years ago
xiongxinlei	ebfe3e6b13	test.py update the CSVDataset, test=doc	3 years ago
xiongxinlei	acebfad7b7	change the vector csv.spk_id to csv.label, test=doc	3 years ago
xiongxinlei	57c11dcab0	add some annotations, test=doc	3 years ago
xiongxinlei	30b5b3cb9e	add vector csv dataset format, test=doc	3 years ago
TianYuan	e366fb6b2f	Merge pull request #1643 from Jackwaterveg/check [Doc] supplement note	3 years ago
huangyuxin	ca860e3d2f	supplement note	3 years ago
TianYuan	828ee14404	add license and reference for some models, test=doc	3 years ago
xiongxinlei	5b05300e53	train process add new voxceleb and rirs dataset, test=doc	3 years ago
xiongxinlei	965f486dd5	add voxceleb and rirs noise dataset	3 years ago
Hui Zhang	36df70cbe6	Merge pull request #1638 from zh794390558/spx_refactor [speechx] refactor audio/data/feature cache	3 years ago
TianYuan	5bff096715	Merge pull request #1634 from yt605155624/cnn_decoder [TTS]Cnn decoder	3 years ago
TianYuan	3aec266ca5	add chunk size and pad size in args, test=doc	3 years ago
Hui Zhang	cb39777a60	format code	3 years ago
TianYuan	4d7cd0e063	add streaming synthesize, test=tts	3 years ago
liangym	602b0b0da3	Merge pull request #1632 from lym0302/develop [server] fix output bug	3 years ago
Hui Zhang	61941d14b0	Merge pull request #1627 from WilliamZhang06/ws-develop [websocket] added online asr engine	3 years ago
WilliamZhang06	2ec8d608bf	fixed comments, test=doc	3 years ago
liangym	21c4132eda	Update paddlespeech_client.py	3 years ago
TianYuan	005aa4066c	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder	3 years ago
TianYuan	0fc79f474d	add CNNDecoder, test=tts	3 years ago
WilliamZhang06	d847fe29cf	added online asr engine , test=doc	3 years ago
TianYuan	318edec303	Merge pull request #1613 from yt605155624/restructure_expand [tts]restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
Hui Zhang	943d4ac1ee	Merge pull request #1612 from Jackwaterveg/update [ASR] Replace kaidi_fbank with paddleaudio	3 years ago
huangyuxin	f47146af49	add docstring, test=asr	3 years ago
huangyuxin	ed490b66cb	update spectrogram, test=asr	3 years ago
Hui Zhang	84d712d493	format code, test=doc	3 years ago
Honei	d60856b1ed	Merge pull request #1614 from Honei/vox12 [vec]change the vector output to numpy.array	3 years ago
xiongxinlei	ed7113f320	change the vector output to numpy.array	3 years ago
TianYuan	bc5ae43d3a	restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
huangyuxin	0ffe1f9114	replace kaidi_fbank with paddleaudio	3 years ago
Hui Zhang	caee809513	Merge pull request #1605 from Honei/vox12 [vec]add speaker verification demo and doc	3 years ago
xiongxinlei	5ae57206f3	add paddlespeech vector modules __init__.py	3 years ago
xiongxinlei	2c9dc0c89b	add some vector cli comments, test=doc	3 years ago
xiongxinlei	ef1bc5e815	vector cli output dim info, test=doc	3 years ago
xiongxinlei	d5142e5e15	add vector cli annotation, test=doc	3 years ago
xiongxinlei	ad2caf2ccb	add speaker verification demo and doc, test=doc	3 years ago
TianYuan	3cc0ec950e	Merge pull request #1604 from lym0302/add_readme [server] update readme	3 years ago
lym0302	829f1e332e	update readme, test=doc	3 years ago
xiongxinlei	0f78d25f76	add vector cli batch and pipeline test demo, test=doc	3 years ago
Honei	305bacdcf2	Merge branch 'develop' into vox12	3 years ago
xiongxinlei	0bb67d8b8e	add vector cli unit test, test=doc	3 years ago
KP	b6e976a860	Merge pull request #1602 from yt605155624/fix_dtype [TTS]fix dtype of window of stft	3 years ago
xiongxinlei	62cbce6915	add vectorwrapper to extract audio embedding	3 years ago
TianYuan	8938483529	Merge pull request #1601 from yt605155624/add_ljspeech_hifigan [TTS] update readme for ljspeech hifigan	3 years ago
TianYuan	5347dbad3f	fix dtype of window of stft, test=tts	3 years ago
TianYuan	342b487383	update readme for ljspeech hifigan, test=tts	3 years ago
Hui Zhang	4051e7b762	fix compliance test bug, and format	3 years ago
TianYuan	26ef47810d	Merge pull request #1593 from windstamp/npu_dev_20220322 [NPU] Add NPU support for TransformerTTS	3 years ago
zhangkeliang	59b3de6a6d	[NPU] test TransformerTTS with NPU	3 years ago
Jackwaterveg	fcc1762048	Merge pull request #1577 from Jackwaterveg/change_init [ASR] change default initializer to kaiming_uniform	3 years ago
huangyuxin	e1b581b622	fix some bug, test=asr	3 years ago
Hui Zhang	b5315657ff	Merge pull request #1509 from qingen/cluster [vec] add clustering of vectors	3 years ago
huangyuxin	6da8465f14	add dist_sampler args, test=asr	3 years ago
TianYuan	e5e8b8a129	Merge pull request #1587 from yt605155624/add_vctk_hifigan [TTS]Add vctk hifigan	3 years ago
TianYuan	6469568d2a	update readme for vctk hifigan, test=tts	3 years ago
huangyuxin	a4f5a68074	fix some format, test=asr	3 years ago
xiongxinlei	d85d1deef5	exec pre-commit in paddlespeech vector, test=doc	3 years ago
xiongxinlei	9874fb7d75	add some comments in code	3 years ago
huangyuxin	e991d82ae7	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init	3 years ago
huangyuxin	d53e1163a6	update the code, test=asr	3 years ago
xiongxinlei	b9eafddd94	change - to _ to distinguish field	3 years ago
xiongxinlei	9c6735f921	add vector voxceleb12 base mode url, test=doc	3 years ago
xiongxinlei	d28ccfa96b	add vector cli component, test=doc	3 years ago
KP	831cadacc7	Add paddleaudio doc.	3 years ago
TianYuan	5ab2601759	update readme for aishell3 hifigan, test=tts	3 years ago
Hui Zhang	6abc5d9f7e	format	3 years ago
huangyuxin	ab16d8ce3c	change default initializer to kaiming_uniform, test=asr	3 years ago
qingen	0f7ede11ef	Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster	3 years ago
qingen	d16ce21d47	[wip][vec] update cluster of diarization, test=doc #1304	3 years ago
xiongxinlei	506d26a957	change the code style to s2t code style, test=doc	3 years ago
xiongxinlei	311fa87a11	add some comments to the code	3 years ago
Hui Zhang	90deeca06f	Merge pull request #1554 from lym0302/develop [server] add server cls	3 years ago
lym0302	89457b273a	modify, test=doc	3 years ago
xiongxinlei	8ed5c287a3	add vox2 data into VoxCeleb class	3 years ago
lym0302	77bad44e8b	modify readme, test=doc	3 years ago
lym0302	8ef92a9495	modify, test=doc	3 years ago
lym0302	89dbda58f6	add cls static model, test=doc	3 years ago
Hui Zhang	40ab05a462	Merge pull request #1552 from yt605155624/format_syn [TTS]format synthesize	3 years ago
lym0302	5187df847f	modify server demo, test=doc	3 years ago
xiongxinlei	584a2c0e39	add ecapa-tdnn config yaml file	3 years ago
lym0302	0a6602c708	modify application.yaml, test=doc	3 years ago
TianYuan	544c372b50	fix cr, test=tts	3 years ago
lym0302	99fa7a8205	add server cls, test=doc	3 years ago
TianYuan	fe8bf2a38c	format synthesize, test=tts	3 years ago
xiongxinlei	993d6783d7	remove unused code, test=doc	3 years ago
xiongxinlei	0e87037f2c	refactor to compilance paddleaudio	3 years ago
xiongxinlei	4473405f82	merge develop to vox12, test=doc	3 years ago
Honei	0dee8f40e9	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
xiongxinlei	60d73bb7bd	add state 0 to prepare the voxcele data and augment data	3 years ago
xiongxinlei	14efbf5b15	check extract embedding result, test=doc	3 years ago
xiongxinlei	386ef3f161	add voxceleb augment unit test, test=doc	3 years ago
Hui Zhang	5147163592	Merge pull request #1544 from yt605155624/add_vctk_hifigan [tts]add vctk hifigan egs	3 years ago
TianYuan	81d964f0a0	add vctk hifigan, test=tts	3 years ago
xiongxinlei	2d89c80e6f	add waveform augment pipeline, test=doc	3 years ago
lym0302	3b304544f6	modify yaml, test=doc	3 years ago
xiongxinlei	ac4967e204	optimize the data prepare process	3 years ago
xiongxinlei	016ed6d69c	repair the code according to the part comment, test=doc	3 years ago
Hui Zhang	2886ab9373	Merge pull request #1530 from lym0302/server_cli [server] add server test	3 years ago
xiongxinlei	1f74af110b	add training log info and comment, test=doc	3 years ago
lym0302	e50c1b3b1d	add server test, test=doc	3 years ago
xiongxinlei	4648059b5f	add training process for sid, test=doc	3 years ago
xiongxinlei	7668f61422	add sid dataloader for training, test=doc	3 years ago
xiongxinlei	6af2bc3d5b	add sid loss wraper for voxceleb, test=doc	3 years ago
xiongxinlei	57c4f4a68c	add sid learning rate and training model	3 years ago
TianYuan	4d2f2191a8	fix gbk encode bug	3 years ago
Honei	1395b5f5fa	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
TianYuan	175c39b4a4	Merge pull request #1511 from yt605155624/pre_fix_for_streaming [TTS]add rtf for synthesize, add more vocoder for synthesize.sh	3 years ago
Hui Zhang	5ba4907c44	Merge pull request #1514 from lym0302/server_cli [server] update server cli	3 years ago
lym0302	85d4a31e04	update application.yaml, test=doc	3 years ago
Jerryuhoo	c116a3a926	fix Speedyspeech multi-speaker inference, test=tts	3 years ago
lym0302	ab04488738	update server cli, test=doc	3 years ago
TianYuan	cb07bd2a94	add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts	3 years ago
Hui Zhang	26d413ce8f	Merge pull request #1510 from lym0302/paddlespeech_stats [server] add paddlespeech_server stats	3 years ago
lym0302	72c0cda30c	add paddlespeech_server stats, test=doc	3 years ago
Hui Zhang	e8f2d8f11b	Merge pull request #1507 from zh794390558/cli [cli] add cli batch/pipe example to readme	3 years ago
Hui Zhang	2517df92a0	Merge pull request #1508 from lym0302/paddlespeech_stats [CLI] modified text sr to lang	3 years ago
TianYuan	b6d33a7fb4	Merge pull request #1506 from yt605155624/fix_frontend [TTS]update text frontend, test=tts	3 years ago
lym0302	395c923dee	modified text sr to lang, test=doc	3 years ago
Hui Zhang	75098698d8	format,test=doc	3 years ago
TianYuan	66a8beb27f	update text frontend, test=tts	3 years ago
lym0302	96abb33b5b	add __call__, test=doc	3 years ago
lym0302	5f1728f855	rm server related, test=doc	3 years ago
xiongxinlei	70d3b01c0d	remove invalid code	3 years ago
xiongxinlei	d7da629302	add kaldi feats egs dataset	3 years ago
xiongxinlei	6f7e9656fe	add kaldi feats ark dataset	3 years ago
lym0302	35357e775e	update, test=doc	3 years ago
lym0302	e5aa24fa5a	resolve setup.py conflicts, test=doc	3 years ago
lym0302	fe6be4a65e	Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats	3 years ago
lym0302	f8375764b9	add paddlespeech stats, test=doc	3 years ago
Hui Zhang	8d474c2658	Merge pull request #1482 from lym0302/servercli_update [server] update server cli	3 years ago
lym0302	162361d878	format code, test=doc	3 years ago
lym0302	434708cff4	set device cpu, test=doc	3 years ago
lym0302	920b2c808c	paras required, test=doc	3 years ago
Hui Zhang	6b1fe70100	format code,test=doc	3 years ago
lym0302	6b2dd16845	update server cli, test=doc	3 years ago
WilliamZhang06	78c9b7342c	deleted wav file , test=doc	3 years ago
WilliamZhang06	a6ec3a26f1	Merge branch 'develop' into server_asr	3 years ago
WilliamZhang06	8b4602f738	added isinstance code, test=doc	3 years ago
lym0302	bb60561c66	update util, test=doc	3 years ago
WilliamZhang06	147018a8b4	added cli changed code, test=doc	3 years ago
lym0302	332009142b	add server demo, test=doc	3 years ago
WilliamZhang06	7ebe904e20	fixed overload , test=doc	3 years ago
Hui Zhang	60c0877e7a	Merge pull request #1472 from KPatr1ck/cli_batch [CLI][Logger]Add cli logger control.	3 years ago
WilliamZhang06	b8f16ac9b0	Merge branch 'develop' into server_asr	3 years ago
WilliamZhang06	da3ea7bb40	added engine type and asr inference , test=doc	3 years ago
Hui Zhang	49f80afe6a	Merge pull request #1381 from PaddlePaddle/server [server] speech server init version	3 years ago
lym0302	b508c4d0cb	add readme, test=doc	3 years ago
KP	d36a4ccfc8	Add cli logger control.	3 years ago
KP	94ed5969fa	Add cli logger control.	3 years ago
lym0302	42cbe313c2	improve cli code, test=doc	3 years ago
lym0302	2bf4b4521f	add cli, test=doc	3 years ago
lym0302	8fd117e4da	add cli, test=doc	3 years ago
lym0302	80b83b7434	add cli, test=doc	3 years ago
KP	7814fba07f	Update batch input.	3 years ago
KP	05288fe1c3	Update batch input and stdin input.	3 years ago
KP	1818b058aa	Support batch input in cls task.	3 years ago
WilliamZhang06	35e3be9ac8	Merge remote-tracking branch 'remote/develop' into server	3 years ago
TianYuan	ae521d3700	Update infer.py	3 years ago
lym0302	07158b2f12	move dir, test=doc	3 years ago
lym0302	76391275fc	move dir, test=doc	3 years ago
TianYuan	67ec6242c3	fix ci for waveflow, test=tts	3 years ago
TianYuan	f51097618b	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow	3 years ago
TianYuan	fc8c0e3ea2	fix ci for waveflow, test=tts	3 years ago
huangyuxin	95d5274aef	fix sortagrad, test=asr	3 years ago
Hui Zhang	718c849f68	Merge pull request #1445 from yt605155624/update_train [TTS]init for all works in train.py when ngpu>1	3 years ago
Hui Zhang	f3ec985aaf	Merge pull request #1439 from Jackwaterveg/tipc [TIPC]Add tipc_benchmark of conformer	3 years ago
TianYuan	4ac7db185e	init for all works in train.py when ngpu>1, test=tts	3 years ago
Jackwaterveg	426bae3de1	Merge pull request #1440 from yt605155624/merge_datasets [TTS]Merge datasets, change style of docstring	3 years ago
TianYuan	2cec8f6c76	update tts cli, test=doc	3 years ago
TianYuan	9699c00769	change the docstring style from numpydoc to google, test=tts	3 years ago
huangyuxin	aefe9e93a7	add tipc benchmark of conformer	3 years ago
TianYuan	683679bec7	merge data and datasets, test=tts	3 years ago
TianYuan	7dc1f2daa3	fix some librosa bugs, test=tts	3 years ago
TianYuan	30085ac229	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2	3 years ago
TianYuan	25347bb6a3	rename tacotron2, test=tts	3 years ago
huangyuxin	9a55783aa0	fix resample	3 years ago
Hui Zhang	dcfc32f1ec	Merge pull request #1379 from yt605155624/new_wavernn [TTS] add wavernn	3 years ago
TianYuan	0747600c95	[TTS]add ljspeech new tacotron2 (#1416 ) * add ljspeech new tacotron2, test=tts * update ljspeech waveflow's synthesize * add config, test=doc Co-authored-by: Hui Zhang <zhtclz@foxmail.com>	3 years ago
TianYuan	348a1a33bf	update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419 )	3 years ago
huangyuxin	f428ec4431	change log of cli/asr/infer	3 years ago
TianYuan	1b0c034134	update wavernn, test=tts	3 years ago
TianYuan	89e69ee10e	[TTS]fix tacotron2 dygraph to static (#1414 ) * fix tacotron2 dygraph to static , test=tts * fix tacotron2 dygraph to static , test=tts * simplify synthesize_e2e.py , test=tts	3 years ago
huangyuxin	2a42421a63	cli add ds2-librispeech offline, fix versionm, test=asr	3 years ago
Hui Zhang	4128f4d61f	fix __version__ error in develop (#1398 )	3 years ago
TianYuan	001afee644	fix wavernn dygraph to static , test=tts	3 years ago
TianYuan	2844f388dc	[doc ]add tacotron2 readme (#1385 ) * add tacotron2 readme, test=doc * update changelog.md, test=doc	3 years ago
TianYuan	2071774d81	add wavernn in synthesize_e2e, test=tts	3 years ago
TianYuan	1cc7905d51	rm csmsc.py, test=tts	3 years ago
TianYuan	4c3e57a23c	align preprocess of wavernn, test=tts	3 years ago
Jackwaterveg	f49cf838a8	Update u2.py (#1378 )	3 years ago
TianYuan	fb0acd40a2	add wavernn, test=tts	3 years ago
Jackwaterveg	d7222c0453	[ASR] Support CTC decoder online (#821 ) * fix the destructer problem for prefixes * unified offline and online in ctcdecoders, test=asr * rename swig_decoders to paddlespeech_ctcdecoders, test=asr * add reset_stage for ctcdecoder * fix some problems * fix ctconline * fix a bug * fix the format * fix 1xt2x	3 years ago
Jerryuhoo	f515416c4a	fix missing model choice, test=doc	3 years ago
Jerryuhoo	a22080130b	Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts	3 years ago
Hui Zhang	97db74ca60	Merge pull request #1314 from yt605155624/add_new_tacotron2 [TTS]Add new tacotron2	3 years ago
huangyuxin	3845804cc9	Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup	3 years ago
TianYuan	96323816e9	fix yamls, change labels to stop_labels, test=tts	3 years ago
TianYuan	1bf1a876ae	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts	3 years ago
TianYuan	3fd7a7790b	add typehit for updater and evaluator, test=tts	3 years ago
huangyuxin	4e31247633	refacto the code	3 years ago
TianYuan	41d24337cb	fix fastspeech2 multi speaker to static, test=tts	3 years ago
TianYuan	1a9e59612a	fix fastspeech2 multi speaker to static, test=tts	3 years ago
huangyuxin	565a63c5ef	refactor the setup in paddleaudio	3 years ago
huangyuxin	eb91ce84f9	refactor the version	3 years ago
Hui Zhang	4a133619a1	Merge pull request #1356 from Jackwaterveg/CLI [CLI] asr, Add Deepspeech2 online and offline model	3 years ago
Hui Zhang	d4acf4704f	Merge pull request #1350 from LittleChenCc/develop [ST] beam search with optimality guarantees	3 years ago
huangyuxin	ab759b16de	Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI	3 years ago
huangyuxin	38edfd1a89	Add Deepspeech2 online and offline in cli	3 years ago
TianYuan	d368d57d67	fix low ips bug of speedyspeech and fastspeech2, test=tts (#1349 )	3 years ago
TianYuan	9c7f0762b0	update racotron2 and transformer tts, test=tts	3 years ago
huangyuxin	8028f33b7f	synchronize the version	3 years ago
Junkun	44408e5211	sync the variable name to others	3 years ago
Junkun	f866059b74	config and formalize	3 years ago
Junkun	43aad7a018	beam search with optimality guarantees	3 years ago
Jackwaterveg	26524031d2	Merge pull request #1343 from Jackwaterveg/fix [ASR] Fix some bugs	3 years ago
huangyuxin	5e7e8a3e24	fix the u2 export, test=asr	3 years ago
TianYuan	a1867c20c3	fix slice bug of speedyspeech expand, test=tts (#1337 )	3 years ago
Hui Zhang	ec1c88ae1a	[s2t] remove nltk (#1332 )	3 years ago
TianYuan	7ae4f7221e	Update length_regulator.py	3 years ago
TianYuan	acfe2b9084	Update duration_predictor.py	3 years ago
TianYuan	caa391f461	fix speedyspeech inference, test=tts (#1322 )	3 years ago
Jackwaterveg	0c4895cd0b	mv the ctcdecoders to third_part (#1313 )	3 years ago
TianYuan	8f507ba4ba	Merge pull request #1302 from jerryuhoo/develop [TTS] Add support for finetuning speedyspeech	3 years ago
Jerryuhoo	111a452378	Fix the code format, test=tts	3 years ago
TianYuan	89e988a69e	add csmsc tacotron2, test=tts	3 years ago
TianYuan	c088b9a304	add csmsc tacotron2	3 years ago
huangyuxin	fe1dc9d211	refactor the cli/st, test=st	3 years ago
TianYuan	27bb76bdb9	fix tone_sandhi of yi, test=tts	3 years ago
Jerryuhoo	be99807d61	Add durations to gen_gta_mel.py inference	3 years ago
KP	52a8b2f320	Add ECAPA_TDNN. (#1301 )	3 years ago
Jerryuhoo	fcc34e3e95	[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts	3 years ago
Jackwaterveg	010aa65b2b	[cli] asr - support English, decode_metod and unified config (#1297 ) * fix config, test=asr * fix config, test=doc_fix * add en and decode_method for cli/asr, test=asr * test=asr * fix, test=doc_fix	3 years ago
KP	c09466ebbe	Add ECAPA_TDNN. (#1295 )	3 years ago
TianYuan	fb238d83f4	update vctk voc1, test=tts (#1294 )	3 years ago
TianYuan	73dc0e2535	fix_ning	3 years ago
billishyahao	ddf184be60	fix some typos	3 years ago
TianYuan	318cc9e539	Merge branch 'develop' into develop	3 years ago
Jackwaterveg	e69abc9265	Merge pull request #1273 from zh794390558/batch_sampler [s2t] Fix Batch sampler set epoch	3 years ago
KP	a810cd4e5c	Add cli logging. (#1274 )	3 years ago
Jerryuhoo	d6e9b76e76	change link_wav.py path, test=tts	3 years ago
Jerryuhoo	c94f346207	move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.	3 years ago
Jerryuhoo	e239ee1cd2	add multi-speaker support for finetuning hifigan vocoder	3 years ago
huangyuxin	07d457859d	use pre-commit, test=doc_fix	3 years ago
Hui Zhang	45832f6770	fix default dist_samlper to False	3 years ago
Hui Zhang	3a2db414e6	format code	3 years ago
Hui Zhang	6f651d762e	fix batch sampler set_epoch when epcoh start	3 years ago
TianYuan	680eac02b9	[tts]Update mb melgan (#1272 ) * update mb melgan * update mb melgan, test=tts	3 years ago
TianYuan	98ce69d0aa	Merge pull request #1259 from jerryuhoo/develop [TTS]Add multi-speaker support for the SpeedySpeech model	3 years ago
huangyuxin	ffadbe22a7	merge the develop, test=asr	3 years ago
JiehangXie	bdc48114a4	Update text_normlization.py	3 years ago
JiehangXie	d88ceef7bc	Fix punctuation bug 修复顿号和英文冒号停顿和分句的问题	3 years ago
huangyuxin	8b63485ce3	fix some bug, test=asr	3 years ago
JiehangXie	6065b1b607	Fix punctuation bug 修复顿号和英文冒号停顿和分句的问题	3 years ago
Jerry	0719698118	Merge branch 'develop' into develop	3 years ago
AdamBear	36c9eaa437	Cache the TextFeaturizer instance for infer speed improvement. (#1260 )	3 years ago
huangyuxin	3e2cc898cb	remove default cfg and fix some bugs,test=asr	3 years ago
Jerryuhoo	2dccd5315d	remove useless "other" dataset	3 years ago
Jerryuhoo	f191d0b022	change speaker embedding position Change speaker embedding position into the encoder.	3 years ago
Jerryuhoo	11991b6d35	add multi-speaker support for speedyspeech	3 years ago
huangyuxin	a1d8ab0f99	merge the develop	3 years ago
huangyuxin	c907a8deda	change all recipes	3 years ago
TianYuan	b9a55262f1	Update fastspeech2.py	3 years ago
Hui Zhang	c81a3f0f83	[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242 ) * batchsampler or distributebatchsampler * format	3 years ago
Junkun Chen	420709e5ce	[st] Distributed sampler and new dataloader with MIMO (#1239 ) * update timit result, test=doc_fix * result update * fix bug * add triplet loader * empty preprocess file * sync to u2, updating * sync to u2 config * fix bugs * code refine * update config * customize decoding batch size * update optimizer and lr scheduler * minor * minor * minor * fix bugs of refs * minor * distributed sampler * minor * refine the loader	3 years ago
TianYuan	fbe3c05137	add style_melgan and hifigan in tts cli, test=tts (#1241 )	3 years ago
TianYuan	a232cd8b12	Update fastspeech2.py	3 years ago
huangyuxin	41eeed0450	add librispeech asr1	3 years ago
huangyuxin	fb6d1e2c11	merge the develop	3 years ago
huangyuxin	2c5902d7c5	rename decoding to decode	3 years ago
TianYuan	42c109216d	[tts]add style melgan pretraied model (#1228 ) * add style melgan pretraied model * add style melgan pretraied model, test=tts Co-authored-by: Hui Zhang <zhtclz@foxmail.com>	3 years ago
Hui Zhang	bb2a370b23	[asr] remove useless conf of librispeech (#1227 ) * remve useless conf * format code * update conf * update conf * update conf	3 years ago
huangyuxin	c40b6f4062	refactor the train and test config,test=asr	3 years ago
TianYuan	5692b0ff04	fix log for t2s (#1219 )	3 years ago
TianYuan	b031ee43c4	Merge pull request #1215 from yt605155624/refactor_punc [text]Refactor punc	3 years ago
TianYuan	e1798e1eeb	update	3 years ago
KP	d362d28d35	Remove logging file in cli api.	3 years ago
TianYuan	15b8904fa1	refactor punc	3 years ago
JiehangXie	927c9bbdb6	Fix a bug when sentence inputed contain English words	3 years ago
KP	1632af7706	Update examples/esc50. (#1203 )	3 years ago
Jerryuhoo	3cbfd7bf35	Add speaker embedding and speaker id for style fastspeech2 inference	3 years ago
Hui Zhang	db121226b8	clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191 )	3 years ago
Hui Zhang	d852aee2ff	[asr] logfbank with dither (#1179 ) * fix logfbank dither * format	3 years ago
KP	9ec2bc8e2e	Update README. test=doc_fix	3 years ago
Jackwaterveg	879857332d	[version]add paddlespeech.__version__ (#1166 ) * add paddlespeech.__version__ * version 0.1.0 is ready	3 years ago
TianYuan	19ef7210a0	[TTS]Add hifigan (#1097 ) * add hifigan * add hifigan * integrate synthesize synthesize_e2e, inference for tts, test=tts * add some python files, test=tts * update readme, test=doc_fix	3 years ago
TianYuan	675cff258b	[TTS]fix praatio version, test=tts (#1158 ) * fix praatio version, test=tts * fix praatio version, test=tts	3 years ago
Jackwaterveg	e9748faa71	[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154 ) * optimize the cli/asr,test=asr * test=doc_fix	3 years ago
Jackwaterveg	2bccde3def	update the version of ctcdecoders and feat,test=doc_fix (#1155 )	3 years ago
Jackwaterveg	0151f2463f	fix bug of pad_sequence in u2,test=asr (#1153 )	3 years ago
Jackwaterveg	68164dd39f	[asr]rename test_hub to test_wav (#1132 ) * add the readme, librispeech_asr1 * fix the test_hub * test=asr	3 years ago
KP	16d6ed3842	Add automatic_video_subtitiles demo.	3 years ago
KP	7394a18732	Add default arguments in cls python api.	3 years ago
TianYuan	f9efbf3063	Update generate_lexicon.py	3 years ago
Jackwaterveg	5b446f6321	[Config]clear the u2 decode config for asr (#1107 ) * clear the u2 decode config * rename the vocab_filepath and cmvn_path	3 years ago
KP	074559fe90	[CLI][Demo][Text]Refactor punctuation_restoration. (#1013 ) * Refactor punctuation_restoration. * Add text cli and punc demo.	3 years ago
Hui Zhang	51d7a07c6d	format and fix pre-commit (#1120 )	3 years ago
TianYuan	5f0f76f249	add eval() for inference model (#1114 )	3 years ago
TianYuan	59e4a34071	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli	3 years ago
TianYuan	3de4130dfc	update am name	3 years ago
TianYuan	9db1710ba7	add conformer demos (#1108 )	3 years ago
TianYuan	3fe75f833d	Merge pull request #1109 from yt605155624/tts_cli [cli]update voc name	3 years ago
TianYuan	ca12a83d5a	update voc name	3 years ago
TianYuan	965a57ef0e	Update README.md	3 years ago
Jackwaterveg	9e31a606d1	set default encoding utf8 for win (#1101 ) Co-authored-by: KP <109694228@qq.com>	3 years ago
Hui Zhang	764a5d4271	Merge branch 'develop' into ctc	3 years ago
Hui Zhang	b1c80c45e0	remove ctc grad norm type in config	3 years ago
huangyuxin	1d4002409f	separate the sox and soxbindings with the requirements	3 years ago
TianYuan	df5fe035e5	Update README.md	3 years ago
TianYuan	a6e0a69da8	Merge pull request #1095 from KPatr1ck/demo [Demo]Add tts demo.	3 years ago
TianYuan	963e906f56	Merge pull request #1068 from yt605155624/add_style_melgan [TTS]add style_melgan	3 years ago
KP	1909f2f620	Add tts demo.	3 years ago
KP	3701fba0be	Update download logic and fix README typos.	3 years ago
TianYuan	f701882b66	update add_style_melgan	3 years ago
gongel	dc60aeb8c2	format	3 years ago
gongel	31510d088c	refactor: rm kaldi_io	3 years ago
TianYuan	2189b46004	add tts cli	3 years ago
KP	70a8a75476	Add st demo.	3 years ago
Hui Zhang	6dedb63e8b	Merge pull request #1087 from Jackwaterveg/setup [ctcdecoders] Separate the ctcdecoders	3 years ago
huangyuxin	9fe0beee54	fix the bug: miss import after install	3 years ago
huangyuxin	cea5ffe0e4	refactor the code	3 years ago
gongel	20d88ec673	refactor: update params/input/output/namestyle	3 years ago
KP	6c1e6e7876	Update recommended model to cnn14 and argument name in __call__.	3 years ago
huangyuxin	ed12db61a6	Separate the ctcdecoders	3 years ago
KP	0b7e0d1e2e	Update tags of pretrained_models.	3 years ago
KP	d08b824d72	Update README.	3 years ago
KP	61e39daccc	Optimize model init.	3 years ago
KP	528c70e515	Remove TODO.	3 years ago
KP	b072453ca8	Fix decompressing problem.	3 years ago
KP	29da318379	Add audio classification cli.	3 years ago
gongel	f5c61ced28	feat: add st cli	3 years ago
Hui Zhang	0818c1601d	add __init__.py	3 years ago
TianYuan	7b2ecb6eed	add style_melgan, test=tts	3 years ago
Hui Zhang	03678c08c5	Merge branch 'develop' into fix_cli	3 years ago
huangyuxin	1b57d05d1b	rm the os.chdir in cli asr	3 years ago
TianYuan	aead853b1d	Update zh_frontend.py	3 years ago
huangyuxin	021311c76b	add transformer to cli infer	3 years ago
TianYuan	a070524d37	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts	3 years ago
TianYuan	dd36eafe34	add style_melgan	3 years ago
KP	54cf048b2a	Merge remote-tracking branch 'update_stream/develop' into cli	3 years ago
huangyuxin	a258a34ec0	revise the convert pcm	3 years ago
Jackwaterveg	8ec576f477	Update infer.py	3 years ago
huangyuxin	b0356ae489	revise	3 years ago
huangyuxin	957f2e3a1c	revise	3 years ago
huangyuxin	aee530af27	revise the sample rate	3 years ago
Junkun	4e31a4445d	eval mode	3 years ago
KP	a19e51d7da	Update python api.	3 years ago
KP	e0642ffc77	Update doc strings.	3 years ago
huangyuxin	90d648a601	support using by __call__	3 years ago
huangyuxin	aecb5f567c	Merge branch 'tmp' into 1048	3 years ago
KP	44e9b032d5	Update inputs and outputs of executor.	3 years ago
huangyuxin	3fadcde5e2	revise the asr infer.py	3 years ago
Hui Zhang	4823892169	Merge pull request #1058 from Jackwaterveg/benchmark [benchmark]fix the benchmark	3 years ago
Junkun	3a14b82844	minor	3 years ago
Junkun	f50a2ab4ca	fix bugs	3 years ago
huangyuxin	cb383a39c3	fix the benchmark	3 years ago
huangyuxin	d0bf506fee	fix the load checkpoint	3 years ago
KP	1707244472	Update device usage.	3 years ago
KP	000294132c	Rename s2t to asr.	3 years ago
huangyuxin	43f4d47bfa	add the call in infer.py	3 years ago
Hui Zhang	39228864bb	format code	3 years ago
Hui Zhang	d395c2b8e3	jsonlines reade manifest file	3 years ago
Hui Zhang	7554b6107a	using visualdl; fix read_manifest	3 years ago
huangyuxin	cdc8520969	add the infer	3 years ago
KP	c94ebdc52c	Add python api for executor.	3 years ago
Junkun	d2fab3238b	fix bugs	3 years ago
Junkun	cdd0845127	add translate function	3 years ago
KP	e9798498d6	Update asr inference in paddlespeech.cli.	3 years ago
huangyuxin	895a086fdd	rename the config.feat_size and the config.vocab.size to input_size and output_size	3 years ago
KP	4d39a7746e	Add paddlespeech.cli.	3 years ago
KP	98f0806353	Add paddlespeech.cli.	3 years ago
TianYuan	6e3257ab8a	Create __init__.py	3 years ago
TianYuan	022f1ce8e9	Merge pull request #1040 from yt605155624/fix_frontend [TTS]update text frontend	3 years ago
TianYuan	a861e56e91	rm space for pure Chinese	3 years ago
TianYuan	dad1cbbcd6	update text frontend	3 years ago
KP	6e1ac1cc15	Add paddlespeech.cls and esc50 example.	3 years ago
KP	33f0e7622c	Add paddlespeech.cls and esc50 example.	3 years ago
KP	2c531d78ac	Add paddlespeech.cls and esc50 example.	3 years ago
KP	bdb3ce23ee	Add paddlespeech.cls and esc50 example.	3 years ago
KP	1189117784	Add paddlespeech.cls and esc50 example.	3 years ago
Hui Zhang	2bbfdbae91	Merge pull request #1015 from yt605155624/fs2_conformer [TTS]fastspeech2 conformer	3 years ago
TianYuan	b0a1d8ab60	fix base	3 years ago
TianYuan	469329221b	refactor encoder, rm old code	3 years ago
Hui Zhang	fe83adfbcb	nproc to ngpu	3 years ago
Hui Zhang	789471bfca	test wav for u2	3 years ago
TianYuan	bc0dd51149	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD	3 years ago
Jackwaterveg	09931d2ccc	Merge pull request #1019 from zh794390558/feat [bugfix] Kaldi Feature using dither in train	3 years ago
huangyuxin	8aebfeac81	fix the prc-commit	3 years ago
Hui Zhang	56480e1033	fix format	3 years ago
Hui Zhang	7ec0ed4aaf	kaldi feat dither when train	3 years ago
Hui Zhang	2ba3f00bbd	Merge branch 'develop' into datapipe	3 years ago
Hui Zhang	b944418d6f	new format data support ds2/st	3 years ago
Hui Zhang	0defc658e1	update aishell/librispeech transformer result; wenetspeech pretrain conformer result	3 years ago
Hui Zhang	d2a05df02e	Merge pull request #1014 from Jackwaterveg/auto_log [asr]hidden the auto_log	3 years ago
huangyuxin	fb6974f950	update the auto_log	3 years ago
TianYuan	4370c5cfa6	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer	3 years ago
Hui Zhang	638b96bf07	check if cmvn_file in config for u2	3 years ago
Hui Zhang	c354e9154b	Merge pull request #1003 from yt605155624/fs2_ge2e [TTS]add fastspeech2 voice cloning in aishell3	3 years ago
TianYuan	133ee7db0b	rename num_speakers	3 years ago
TianYuan	3d5e078c91	add conformer	3 years ago
TianYuan	a97c7b5206	rename spembs	3 years ago
huangyuxin	f646d4c3a1	renew the setup.py for paddlespeech feat and ctcdecoders	3 years ago
huangyuxin	ca06b91fc4	renew the setup.py for paddlespeech feat and ctcdecoders	3 years ago
Hui Zhang	3bd87bc379	add wenet lincense	3 years ago
TianYuan	8d025451de	add fastspeech2 voice cloning in aishell3	3 years ago
TianYuan	c5c9f19091	rename to gen_gta_mel.py, remove stats compute when gen fintune data	3 years ago
TianYuan	a6ac497f8e	add multi-band melgan finetune scripts	3 years ago
Hui Zhang	fe29f74a1c	Merge pull request #992 from yt605155624/fix_docs [TTS] add tts tutorial	3 years ago
TianYuan	30d09b411d	fix style_syn, replace DeepSpeech with PaddleSpeech in readme	3 years ago
TianYuan	0bc9450c51	Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs	3 years ago
Hui Zhang	f9b66d0d97	Remove useless folders (#990 )	3 years ago
Hui Zhang	2d76638d62	more speech domain	3 years ago
TianYuan	0fcc5005a2	add tts tutorial	3 years ago
Hui Zhang	1ae1ead80f	more install scripts	3 years ago
Hui Zhang	51a6845564	Merge pull request #985 from Jackwaterveg/benchmark revise the benchmark	3 years ago
huangyuxin	843ea1c12e	revise the benchmark	3 years ago
Hui Zhang	080b0431f4	format code	3 years ago
Junkun	7c8843448c	add word reward into beam search.	3 years ago
Hui Zhang	9a71c091c5	remove debug info and format code	3 years ago
Hui Zhang	8b0e344c69	fix logfbank using PCM16	3 years ago
Hui Zhang	7ceef6c3f5	format code	3 years ago
Hui Zhang	f9221b4b74	fix ctc align	3 years ago
Hui Zhang	fb853167d3	format code	3 years ago
Hui Zhang	18d9abc7a0	add sox speed pertrub	3 years ago
Hui Zhang	56d06f2aaf	Merge pull request #968 from yt605155624/merge_paddlespeech [TTS] change nprocs to ngpu	3 years ago

... 7 8 9 10 11 ...

863 Commits (368e3e1b591da921c7b98501f8394f788bc31589)