PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
Jackwaterveg	c852776bc6	test=doc	3 years ago
TianYuan	f264b912fc	add warmup for frontend, test=doc	3 years ago
Jackwaterveg	4922e697e1	update cli, test = asr	3 years ago
Jackwaterveg	1c05d03806	test=asr	3 years ago
xiongxinlei	9b5f7f71ac	add part ecapa-tdnn note, test=doc	3 years ago
Hui Zhang	6eed542c08	Merge pull request #1660 from yt605155624/fix_pre [TTS]fix preprocess bug, test=tts	3 years ago
Honei	83310b6379	Merge branch 'develop' into develop	3 years ago
huangyuxin	faf21f033f	add duration limitation for asr	3 years ago
TianYuan	7aecb2c4bb	add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts	3 years ago
xiongxinlei	d064c8196e	update the speaker verification model, test=doc	3 years ago
xiongxinlei	e72912adb9	update the speaker verification model, test=doc	3 years ago
TianYuan	a8f5990869	fix preprocess bug, test=tts	3 years ago
lym0302	759a9e61e4	update server cli, test=doc	3 years ago
lym0302	603e565ab1	add stream tts server, test=doc	3 years ago
ccrrong	378fe5909f	add ami diarization pipeline, test=doc	3 years ago
xiongxinlei	48b8cc8937	add score method, test=doc	3 years ago
xiongxinlei	ebfe3e6b13	test.py update the CSVDataset, test=doc	3 years ago
xiongxinlei	acebfad7b7	change the vector csv.spk_id to csv.label, test=doc	3 years ago
xiongxinlei	57c11dcab0	add some annotations, test=doc	3 years ago
xiongxinlei	30b5b3cb9e	add vector csv dataset format, test=doc	3 years ago
TianYuan	e366fb6b2f	Merge pull request #1643 from Jackwaterveg/check [Doc] supplement note	3 years ago
huangyuxin	ca860e3d2f	supplement note	3 years ago
TianYuan	828ee14404	add license and reference for some models, test=doc	3 years ago
xiongxinlei	5b05300e53	train process add new voxceleb and rirs dataset, test=doc	3 years ago
xiongxinlei	965f486dd5	add voxceleb and rirs noise dataset	3 years ago
Hui Zhang	36df70cbe6	Merge pull request #1638 from zh794390558/spx_refactor [speechx] refactor audio/data/feature cache	3 years ago
TianYuan	5bff096715	Merge pull request #1634 from yt605155624/cnn_decoder [TTS]Cnn decoder	3 years ago
TianYuan	3aec266ca5	add chunk size and pad size in args, test=doc	3 years ago
Hui Zhang	cb39777a60	format code	3 years ago
TianYuan	4d7cd0e063	add streaming synthesize, test=tts	3 years ago
liangym	602b0b0da3	Merge pull request #1632 from lym0302/develop [server] fix output bug	3 years ago
Hui Zhang	61941d14b0	Merge pull request #1627 from WilliamZhang06/ws-develop [websocket] added online asr engine	3 years ago
WilliamZhang06	2ec8d608bf	fixed comments, test=doc	3 years ago
liangym	21c4132eda	Update paddlespeech_client.py	3 years ago
TianYuan	005aa4066c	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder	3 years ago
TianYuan	0fc79f474d	add CNNDecoder, test=tts	3 years ago
WilliamZhang06	d847fe29cf	added online asr engine , test=doc	3 years ago
TianYuan	318edec303	Merge pull request #1613 from yt605155624/restructure_expand [tts]restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
Hui Zhang	943d4ac1ee	Merge pull request #1612 from Jackwaterveg/update [ASR] Replace kaidi_fbank with paddleaudio	3 years ago
huangyuxin	f47146af49	add docstring, test=asr	3 years ago
huangyuxin	ed490b66cb	update spectrogram, test=asr	3 years ago
Hui Zhang	84d712d493	format code, test=doc	3 years ago
Honei	d60856b1ed	Merge pull request #1614 from Honei/vox12 [vec]change the vector output to numpy.array	3 years ago
xiongxinlei	ed7113f320	change the vector output to numpy.array	3 years ago
TianYuan	bc5ae43d3a	restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
huangyuxin	0ffe1f9114	replace kaidi_fbank with paddleaudio	3 years ago
Hui Zhang	caee809513	Merge pull request #1605 from Honei/vox12 [vec]add speaker verification demo and doc	3 years ago
xiongxinlei	5ae57206f3	add paddlespeech vector modules __init__.py	3 years ago
xiongxinlei	2c9dc0c89b	add some vector cli comments, test=doc	3 years ago
xiongxinlei	ef1bc5e815	vector cli output dim info, test=doc	3 years ago
xiongxinlei	d5142e5e15	add vector cli annotation, test=doc	3 years ago
xiongxinlei	ad2caf2ccb	add speaker verification demo and doc, test=doc	3 years ago
TianYuan	3cc0ec950e	Merge pull request #1604 from lym0302/add_readme [server] update readme	3 years ago
lym0302	829f1e332e	update readme, test=doc	3 years ago
xiongxinlei	0f78d25f76	add vector cli batch and pipeline test demo, test=doc	3 years ago
Honei	305bacdcf2	Merge branch 'develop' into vox12	3 years ago
xiongxinlei	0bb67d8b8e	add vector cli unit test, test=doc	3 years ago
KP	b6e976a860	Merge pull request #1602 from yt605155624/fix_dtype [TTS]fix dtype of window of stft	3 years ago
xiongxinlei	62cbce6915	add vectorwrapper to extract audio embedding	3 years ago
TianYuan	8938483529	Merge pull request #1601 from yt605155624/add_ljspeech_hifigan [TTS] update readme for ljspeech hifigan	3 years ago
TianYuan	5347dbad3f	fix dtype of window of stft, test=tts	3 years ago
TianYuan	342b487383	update readme for ljspeech hifigan, test=tts	3 years ago
Hui Zhang	4051e7b762	fix compliance test bug, and format	3 years ago
TianYuan	26ef47810d	Merge pull request #1593 from windstamp/npu_dev_20220322 [NPU] Add NPU support for TransformerTTS	3 years ago
zhangkeliang	59b3de6a6d	[NPU] test TransformerTTS with NPU	3 years ago
Jackwaterveg	fcc1762048	Merge pull request #1577 from Jackwaterveg/change_init [ASR] change default initializer to kaiming_uniform	3 years ago
huangyuxin	e1b581b622	fix some bug, test=asr	3 years ago
Hui Zhang	b5315657ff	Merge pull request #1509 from qingen/cluster [vec] add clustering of vectors	3 years ago
huangyuxin	6da8465f14	add dist_sampler args, test=asr	3 years ago
TianYuan	e5e8b8a129	Merge pull request #1587 from yt605155624/add_vctk_hifigan [TTS]Add vctk hifigan	3 years ago
TianYuan	6469568d2a	update readme for vctk hifigan, test=tts	3 years ago
huangyuxin	a4f5a68074	fix some format, test=asr	3 years ago
xiongxinlei	d85d1deef5	exec pre-commit in paddlespeech vector, test=doc	3 years ago
xiongxinlei	9874fb7d75	add some comments in code	3 years ago
huangyuxin	e991d82ae7	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init	3 years ago
huangyuxin	d53e1163a6	update the code, test=asr	3 years ago
xiongxinlei	b9eafddd94	change - to _ to distinguish field	3 years ago
xiongxinlei	9c6735f921	add vector voxceleb12 base mode url, test=doc	3 years ago
xiongxinlei	d28ccfa96b	add vector cli component, test=doc	3 years ago
KP	831cadacc7	Add paddleaudio doc.	3 years ago
TianYuan	5ab2601759	update readme for aishell3 hifigan, test=tts	3 years ago
Hui Zhang	6abc5d9f7e	format	3 years ago
huangyuxin	ab16d8ce3c	change default initializer to kaiming_uniform, test=asr	3 years ago
qingen	0f7ede11ef	Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster	3 years ago
qingen	d16ce21d47	[wip][vec] update cluster of diarization, test=doc #1304	3 years ago
xiongxinlei	506d26a957	change the code style to s2t code style, test=doc	3 years ago
xiongxinlei	311fa87a11	add some comments to the code	3 years ago
Hui Zhang	90deeca06f	Merge pull request #1554 from lym0302/develop [server] add server cls	3 years ago
lym0302	89457b273a	modify, test=doc	3 years ago
xiongxinlei	8ed5c287a3	add vox2 data into VoxCeleb class	3 years ago
lym0302	77bad44e8b	modify readme, test=doc	3 years ago
lym0302	8ef92a9495	modify, test=doc	3 years ago
lym0302	89dbda58f6	add cls static model, test=doc	3 years ago
Hui Zhang	40ab05a462	Merge pull request #1552 from yt605155624/format_syn [TTS]format synthesize	3 years ago
lym0302	5187df847f	modify server demo, test=doc	3 years ago
xiongxinlei	584a2c0e39	add ecapa-tdnn config yaml file	3 years ago
lym0302	0a6602c708	modify application.yaml, test=doc	3 years ago
TianYuan	544c372b50	fix cr, test=tts	3 years ago
lym0302	99fa7a8205	add server cls, test=doc	3 years ago
TianYuan	fe8bf2a38c	format synthesize, test=tts	3 years ago
xiongxinlei	993d6783d7	remove unused code, test=doc	3 years ago
xiongxinlei	0e87037f2c	refactor to compilance paddleaudio	3 years ago
xiongxinlei	4473405f82	merge develop to vox12, test=doc	3 years ago
Honei	0dee8f40e9	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
xiongxinlei	60d73bb7bd	add state 0 to prepare the voxcele data and augment data	3 years ago
xiongxinlei	14efbf5b15	check extract embedding result, test=doc	3 years ago
xiongxinlei	386ef3f161	add voxceleb augment unit test, test=doc	3 years ago
Hui Zhang	5147163592	Merge pull request #1544 from yt605155624/add_vctk_hifigan [tts]add vctk hifigan egs	3 years ago
TianYuan	81d964f0a0	add vctk hifigan, test=tts	3 years ago
xiongxinlei	2d89c80e6f	add waveform augment pipeline, test=doc	3 years ago
lym0302	3b304544f6	modify yaml, test=doc	3 years ago
xiongxinlei	ac4967e204	optimize the data prepare process	3 years ago
xiongxinlei	016ed6d69c	repair the code according to the part comment, test=doc	3 years ago
Hui Zhang	2886ab9373	Merge pull request #1530 from lym0302/server_cli [server] add server test	3 years ago
xiongxinlei	1f74af110b	add training log info and comment, test=doc	3 years ago
lym0302	e50c1b3b1d	add server test, test=doc	3 years ago
xiongxinlei	4648059b5f	add training process for sid, test=doc	3 years ago
xiongxinlei	7668f61422	add sid dataloader for training, test=doc	3 years ago
xiongxinlei	6af2bc3d5b	add sid loss wraper for voxceleb, test=doc	3 years ago
xiongxinlei	57c4f4a68c	add sid learning rate and training model	3 years ago
TianYuan	4d2f2191a8	fix gbk encode bug	3 years ago
Honei	1395b5f5fa	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
TianYuan	175c39b4a4	Merge pull request #1511 from yt605155624/pre_fix_for_streaming [TTS]add rtf for synthesize, add more vocoder for synthesize.sh	3 years ago
Hui Zhang	5ba4907c44	Merge pull request #1514 from lym0302/server_cli [server] update server cli	3 years ago
lym0302	85d4a31e04	update application.yaml, test=doc	3 years ago
Jerryuhoo	c116a3a926	fix Speedyspeech multi-speaker inference, test=tts	3 years ago
lym0302	ab04488738	update server cli, test=doc	3 years ago
TianYuan	cb07bd2a94	add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts	3 years ago
Hui Zhang	26d413ce8f	Merge pull request #1510 from lym0302/paddlespeech_stats [server] add paddlespeech_server stats	3 years ago
lym0302	72c0cda30c	add paddlespeech_server stats, test=doc	3 years ago
Hui Zhang	e8f2d8f11b	Merge pull request #1507 from zh794390558/cli [cli] add cli batch/pipe example to readme	3 years ago
Hui Zhang	2517df92a0	Merge pull request #1508 from lym0302/paddlespeech_stats [CLI] modified text sr to lang	3 years ago
TianYuan	b6d33a7fb4	Merge pull request #1506 from yt605155624/fix_frontend [TTS]update text frontend, test=tts	3 years ago
lym0302	395c923dee	modified text sr to lang, test=doc	3 years ago
Hui Zhang	75098698d8	format,test=doc	3 years ago
TianYuan	66a8beb27f	update text frontend, test=tts	3 years ago
lym0302	96abb33b5b	add __call__, test=doc	3 years ago
lym0302	5f1728f855	rm server related, test=doc	3 years ago
xiongxinlei	70d3b01c0d	remove invalid code	3 years ago
xiongxinlei	d7da629302	add kaldi feats egs dataset	3 years ago
xiongxinlei	6f7e9656fe	add kaldi feats ark dataset	3 years ago
lym0302	35357e775e	update, test=doc	3 years ago
lym0302	e5aa24fa5a	resolve setup.py conflicts, test=doc	3 years ago
lym0302	fe6be4a65e	Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats	3 years ago
lym0302	f8375764b9	add paddlespeech stats, test=doc	3 years ago
Hui Zhang	8d474c2658	Merge pull request #1482 from lym0302/servercli_update [server] update server cli	3 years ago
lym0302	162361d878	format code, test=doc	3 years ago
lym0302	434708cff4	set device cpu, test=doc	3 years ago
lym0302	920b2c808c	paras required, test=doc	3 years ago
Hui Zhang	6b1fe70100	format code,test=doc	3 years ago
lym0302	6b2dd16845	update server cli, test=doc	3 years ago
WilliamZhang06	78c9b7342c	deleted wav file , test=doc	3 years ago
WilliamZhang06	a6ec3a26f1	Merge branch 'develop' into server_asr	3 years ago
WilliamZhang06	8b4602f738	added isinstance code, test=doc	3 years ago
lym0302	bb60561c66	update util, test=doc	3 years ago
WilliamZhang06	147018a8b4	added cli changed code, test=doc	3 years ago
lym0302	332009142b	add server demo, test=doc	3 years ago
WilliamZhang06	7ebe904e20	fixed overload , test=doc	3 years ago
Hui Zhang	60c0877e7a	Merge pull request #1472 from KPatr1ck/cli_batch [CLI][Logger]Add cli logger control.	3 years ago
WilliamZhang06	b8f16ac9b0	Merge branch 'develop' into server_asr	3 years ago
WilliamZhang06	da3ea7bb40	added engine type and asr inference , test=doc	3 years ago
Hui Zhang	49f80afe6a	Merge pull request #1381 from PaddlePaddle/server [server] speech server init version	3 years ago
lym0302	b508c4d0cb	add readme, test=doc	3 years ago
KP	d36a4ccfc8	Add cli logger control.	3 years ago
KP	94ed5969fa	Add cli logger control.	3 years ago
lym0302	42cbe313c2	improve cli code, test=doc	3 years ago
lym0302	2bf4b4521f	add cli, test=doc	3 years ago
lym0302	8fd117e4da	add cli, test=doc	3 years ago
lym0302	80b83b7434	add cli, test=doc	3 years ago
KP	7814fba07f	Update batch input.	3 years ago
KP	05288fe1c3	Update batch input and stdin input.	3 years ago
KP	1818b058aa	Support batch input in cls task.	3 years ago
WilliamZhang06	35e3be9ac8	Merge remote-tracking branch 'remote/develop' into server	3 years ago
TianYuan	ae521d3700	Update infer.py	3 years ago
lym0302	07158b2f12	move dir, test=doc	3 years ago
lym0302	76391275fc	move dir, test=doc	3 years ago
TianYuan	67ec6242c3	fix ci for waveflow, test=tts	3 years ago
TianYuan	f51097618b	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow	3 years ago
TianYuan	fc8c0e3ea2	fix ci for waveflow, test=tts	3 years ago
huangyuxin	95d5274aef	fix sortagrad, test=asr	3 years ago
Hui Zhang	718c849f68	Merge pull request #1445 from yt605155624/update_train [TTS]init for all works in train.py when ngpu>1	3 years ago
Hui Zhang	f3ec985aaf	Merge pull request #1439 from Jackwaterveg/tipc [TIPC]Add tipc_benchmark of conformer	3 years ago
TianYuan	4ac7db185e	init for all works in train.py when ngpu>1, test=tts	3 years ago
Jackwaterveg	426bae3de1	Merge pull request #1440 from yt605155624/merge_datasets [TTS]Merge datasets, change style of docstring	3 years ago
TianYuan	2cec8f6c76	update tts cli, test=doc	3 years ago
TianYuan	9699c00769	change the docstring style from numpydoc to google, test=tts	3 years ago
huangyuxin	aefe9e93a7	add tipc benchmark of conformer	3 years ago
TianYuan	683679bec7	merge data and datasets, test=tts	3 years ago
TianYuan	7dc1f2daa3	fix some librosa bugs, test=tts	3 years ago
TianYuan	30085ac229	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2	3 years ago
TianYuan	25347bb6a3	rename tacotron2, test=tts	3 years ago
huangyuxin	9a55783aa0	fix resample	3 years ago
Hui Zhang	dcfc32f1ec	Merge pull request #1379 from yt605155624/new_wavernn [TTS] add wavernn	3 years ago
TianYuan	0747600c95	[TTS]add ljspeech new tacotron2 (#1416 ) * add ljspeech new tacotron2, test=tts * update ljspeech waveflow's synthesize * add config, test=doc Co-authored-by: Hui Zhang <zhtclz@foxmail.com>	3 years ago
TianYuan	348a1a33bf	update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419 )	3 years ago
huangyuxin	f428ec4431	change log of cli/asr/infer	3 years ago
TianYuan	1b0c034134	update wavernn, test=tts	3 years ago
TianYuan	89e69ee10e	[TTS]fix tacotron2 dygraph to static (#1414 ) * fix tacotron2 dygraph to static , test=tts * fix tacotron2 dygraph to static , test=tts * simplify synthesize_e2e.py , test=tts	3 years ago
huangyuxin	2a42421a63	cli add ds2-librispeech offline, fix versionm, test=asr	3 years ago
Hui Zhang	4128f4d61f	fix __version__ error in develop (#1398 )	3 years ago

... 2 3 4 5 6 ...

603 Commits (c5fe181405df43f822b7eeab40737a8ecf3d198f)