PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
qingen	6446f72cab	[vec][score] add plda model, test=doc fix #1667	3 years ago
qingen	84576d6956	[vec][score] add plda model, test=doc fix #1667	3 years ago
lym0302	1a3c811f04	code format, test=doc	3 years ago
TianYuan	0d6f5868ea	Merge pull request #1665 from yt605155624/add_onnx [TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan	3 years ago
Honei	f500fa8bde	Merge pull request #1646 from Honei/develop [vec]add speaker verification score method	3 years ago
TianYuan	0282d45c62	remove fill_constant_batch_size_like in static model of speedyspeech, test=tts	3 years ago
TianYuan	c765fca6b4	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx	3 years ago
TianYuan	124eb6af8f	update notes, test=doc	3 years ago
TianYuan	e0d222e674	update notes, test=doc	3 years ago
Hui Zhang	1843bed458	Merge pull request #1666 from Jackwaterveg/cli [CLI] ASR: Add duration limitation for asr	3 years ago
xiongxinlei	a8244dc5b0	update the note, test=doc	3 years ago
Jackwaterveg	c852776bc6	test=doc	3 years ago
TianYuan	f264b912fc	add warmup for frontend, test=doc	3 years ago
Jackwaterveg	4922e697e1	update cli, test = asr	3 years ago
Jackwaterveg	1c05d03806	test=asr	3 years ago
xiongxinlei	9b5f7f71ac	add part ecapa-tdnn note, test=doc	3 years ago
Hui Zhang	6eed542c08	Merge pull request #1660 from yt605155624/fix_pre [TTS]fix preprocess bug, test=tts	3 years ago
Honei	83310b6379	Merge branch 'develop' into develop	3 years ago
huangyuxin	faf21f033f	add duration limitation for asr	3 years ago
TianYuan	7aecb2c4bb	add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts	3 years ago
xiongxinlei	d064c8196e	update the speaker verification model, test=doc	3 years ago
xiongxinlei	e72912adb9	update the speaker verification model, test=doc	3 years ago
TianYuan	a8f5990869	fix preprocess bug, test=tts	3 years ago
lym0302	759a9e61e4	update server cli, test=doc	3 years ago
lym0302	603e565ab1	add stream tts server, test=doc	3 years ago
ccrrong	378fe5909f	add ami diarization pipeline, test=doc	3 years ago
xiongxinlei	48b8cc8937	add score method, test=doc	3 years ago
xiongxinlei	ebfe3e6b13	test.py update the CSVDataset, test=doc	3 years ago
xiongxinlei	acebfad7b7	change the vector csv.spk_id to csv.label, test=doc	3 years ago
xiongxinlei	57c11dcab0	add some annotations, test=doc	3 years ago
xiongxinlei	30b5b3cb9e	add vector csv dataset format, test=doc	3 years ago
TianYuan	e366fb6b2f	Merge pull request #1643 from Jackwaterveg/check [Doc] supplement note	3 years ago
huangyuxin	ca860e3d2f	supplement note	3 years ago
TianYuan	828ee14404	add license and reference for some models, test=doc	3 years ago
xiongxinlei	5b05300e53	train process add new voxceleb and rirs dataset, test=doc	3 years ago
xiongxinlei	965f486dd5	add voxceleb and rirs noise dataset	3 years ago
Hui Zhang	36df70cbe6	Merge pull request #1638 from zh794390558/spx_refactor [speechx] refactor audio/data/feature cache	3 years ago
TianYuan	5bff096715	Merge pull request #1634 from yt605155624/cnn_decoder [TTS]Cnn decoder	3 years ago
TianYuan	3aec266ca5	add chunk size and pad size in args, test=doc	3 years ago
Hui Zhang	cb39777a60	format code	3 years ago
TianYuan	4d7cd0e063	add streaming synthesize, test=tts	3 years ago
liangym	602b0b0da3	Merge pull request #1632 from lym0302/develop [server] fix output bug	3 years ago
Hui Zhang	61941d14b0	Merge pull request #1627 from WilliamZhang06/ws-develop [websocket] added online asr engine	3 years ago
WilliamZhang06	2ec8d608bf	fixed comments, test=doc	3 years ago
liangym	21c4132eda	Update paddlespeech_client.py	3 years ago
TianYuan	005aa4066c	Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder	3 years ago
TianYuan	0fc79f474d	add CNNDecoder, test=tts	3 years ago
WilliamZhang06	d847fe29cf	added online asr engine , test=doc	3 years ago
TianYuan	318edec303	Merge pull request #1613 from yt605155624/restructure_expand [tts]restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
Hui Zhang	943d4ac1ee	Merge pull request #1612 from Jackwaterveg/update [ASR] Replace kaidi_fbank with paddleaudio	3 years ago
huangyuxin	f47146af49	add docstring, test=asr	3 years ago
huangyuxin	ed490b66cb	update spectrogram, test=asr	3 years ago
Hui Zhang	84d712d493	format code, test=doc	3 years ago
Honei	d60856b1ed	Merge pull request #1614 from Honei/vox12 [vec]change the vector output to numpy.array	3 years ago
xiongxinlei	ed7113f320	change the vector output to numpy.array	3 years ago
TianYuan	bc5ae43d3a	restructure expand in length_regulator.py for paddle2onnx, test=tts	3 years ago
huangyuxin	0ffe1f9114	replace kaidi_fbank with paddleaudio	3 years ago
Hui Zhang	caee809513	Merge pull request #1605 from Honei/vox12 [vec]add speaker verification demo and doc	3 years ago
xiongxinlei	5ae57206f3	add paddlespeech vector modules __init__.py	3 years ago
xiongxinlei	2c9dc0c89b	add some vector cli comments, test=doc	3 years ago
xiongxinlei	ef1bc5e815	vector cli output dim info, test=doc	3 years ago
xiongxinlei	d5142e5e15	add vector cli annotation, test=doc	3 years ago
xiongxinlei	ad2caf2ccb	add speaker verification demo and doc, test=doc	3 years ago
TianYuan	3cc0ec950e	Merge pull request #1604 from lym0302/add_readme [server] update readme	3 years ago
lym0302	829f1e332e	update readme, test=doc	3 years ago
xiongxinlei	0f78d25f76	add vector cli batch and pipeline test demo, test=doc	3 years ago
Honei	305bacdcf2	Merge branch 'develop' into vox12	3 years ago
xiongxinlei	0bb67d8b8e	add vector cli unit test, test=doc	3 years ago
KP	b6e976a860	Merge pull request #1602 from yt605155624/fix_dtype [TTS]fix dtype of window of stft	3 years ago
xiongxinlei	62cbce6915	add vectorwrapper to extract audio embedding	3 years ago
TianYuan	8938483529	Merge pull request #1601 from yt605155624/add_ljspeech_hifigan [TTS] update readme for ljspeech hifigan	3 years ago
TianYuan	5347dbad3f	fix dtype of window of stft, test=tts	3 years ago
TianYuan	342b487383	update readme for ljspeech hifigan, test=tts	3 years ago
Hui Zhang	4051e7b762	fix compliance test bug, and format	3 years ago
TianYuan	26ef47810d	Merge pull request #1593 from windstamp/npu_dev_20220322 [NPU] Add NPU support for TransformerTTS	3 years ago
zhangkeliang	59b3de6a6d	[NPU] test TransformerTTS with NPU	3 years ago
Jackwaterveg	fcc1762048	Merge pull request #1577 from Jackwaterveg/change_init [ASR] change default initializer to kaiming_uniform	3 years ago
huangyuxin	e1b581b622	fix some bug, test=asr	3 years ago
Hui Zhang	b5315657ff	Merge pull request #1509 from qingen/cluster [vec] add clustering of vectors	3 years ago
huangyuxin	6da8465f14	add dist_sampler args, test=asr	3 years ago
TianYuan	e5e8b8a129	Merge pull request #1587 from yt605155624/add_vctk_hifigan [TTS]Add vctk hifigan	3 years ago
TianYuan	6469568d2a	update readme for vctk hifigan, test=tts	3 years ago
huangyuxin	a4f5a68074	fix some format, test=asr	3 years ago
xiongxinlei	d85d1deef5	exec pre-commit in paddlespeech vector, test=doc	3 years ago
xiongxinlei	9874fb7d75	add some comments in code	3 years ago
huangyuxin	e991d82ae7	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init	3 years ago
huangyuxin	d53e1163a6	update the code, test=asr	3 years ago
xiongxinlei	b9eafddd94	change - to _ to distinguish field	3 years ago
xiongxinlei	9c6735f921	add vector voxceleb12 base mode url, test=doc	3 years ago
xiongxinlei	d28ccfa96b	add vector cli component, test=doc	3 years ago
KP	831cadacc7	Add paddleaudio doc.	3 years ago
TianYuan	5ab2601759	update readme for aishell3 hifigan, test=tts	3 years ago
Hui Zhang	6abc5d9f7e	format	3 years ago
huangyuxin	ab16d8ce3c	change default initializer to kaiming_uniform, test=asr	3 years ago
qingen	0f7ede11ef	Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster	3 years ago
qingen	d16ce21d47	[wip][vec] update cluster of diarization, test=doc #1304	3 years ago
xiongxinlei	506d26a957	change the code style to s2t code style, test=doc	3 years ago
xiongxinlei	311fa87a11	add some comments to the code	3 years ago
Hui Zhang	90deeca06f	Merge pull request #1554 from lym0302/develop [server] add server cls	3 years ago
lym0302	89457b273a	modify, test=doc	3 years ago
xiongxinlei	8ed5c287a3	add vox2 data into VoxCeleb class	3 years ago
lym0302	77bad44e8b	modify readme, test=doc	3 years ago
lym0302	8ef92a9495	modify, test=doc	3 years ago
lym0302	89dbda58f6	add cls static model, test=doc	3 years ago
Hui Zhang	40ab05a462	Merge pull request #1552 from yt605155624/format_syn [TTS]format synthesize	3 years ago
lym0302	5187df847f	modify server demo, test=doc	3 years ago
xiongxinlei	584a2c0e39	add ecapa-tdnn config yaml file	3 years ago
lym0302	0a6602c708	modify application.yaml, test=doc	3 years ago
TianYuan	544c372b50	fix cr, test=tts	3 years ago
lym0302	99fa7a8205	add server cls, test=doc	3 years ago
TianYuan	fe8bf2a38c	format synthesize, test=tts	3 years ago
xiongxinlei	993d6783d7	remove unused code, test=doc	3 years ago
xiongxinlei	0e87037f2c	refactor to compilance paddleaudio	3 years ago
xiongxinlei	4473405f82	merge develop to vox12, test=doc	3 years ago
Honei	0dee8f40e9	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
xiongxinlei	60d73bb7bd	add state 0 to prepare the voxcele data and augment data	3 years ago
xiongxinlei	14efbf5b15	check extract embedding result, test=doc	3 years ago
xiongxinlei	386ef3f161	add voxceleb augment unit test, test=doc	3 years ago
Hui Zhang	5147163592	Merge pull request #1544 from yt605155624/add_vctk_hifigan [tts]add vctk hifigan egs	3 years ago
TianYuan	81d964f0a0	add vctk hifigan, test=tts	3 years ago
xiongxinlei	2d89c80e6f	add waveform augment pipeline, test=doc	3 years ago
lym0302	3b304544f6	modify yaml, test=doc	3 years ago
xiongxinlei	ac4967e204	optimize the data prepare process	3 years ago
xiongxinlei	016ed6d69c	repair the code according to the part comment, test=doc	3 years ago
Hui Zhang	2886ab9373	Merge pull request #1530 from lym0302/server_cli [server] add server test	3 years ago
xiongxinlei	1f74af110b	add training log info and comment, test=doc	3 years ago
lym0302	e50c1b3b1d	add server test, test=doc	3 years ago
xiongxinlei	4648059b5f	add training process for sid, test=doc	3 years ago
xiongxinlei	7668f61422	add sid dataloader for training, test=doc	3 years ago
xiongxinlei	6af2bc3d5b	add sid loss wraper for voxceleb, test=doc	3 years ago
xiongxinlei	57c4f4a68c	add sid learning rate and training model	3 years ago
TianYuan	4d2f2191a8	fix gbk encode bug	3 years ago
Honei	1395b5f5fa	Merge branch 'PaddlePaddle:develop' into develop	3 years ago
TianYuan	175c39b4a4	Merge pull request #1511 from yt605155624/pre_fix_for_streaming [TTS]add rtf for synthesize, add more vocoder for synthesize.sh	3 years ago
Hui Zhang	5ba4907c44	Merge pull request #1514 from lym0302/server_cli [server] update server cli	3 years ago
lym0302	85d4a31e04	update application.yaml, test=doc	3 years ago
Jerryuhoo	c116a3a926	fix Speedyspeech multi-speaker inference, test=tts	3 years ago
lym0302	ab04488738	update server cli, test=doc	3 years ago
TianYuan	cb07bd2a94	add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts	3 years ago
Hui Zhang	26d413ce8f	Merge pull request #1510 from lym0302/paddlespeech_stats [server] add paddlespeech_server stats	3 years ago
lym0302	72c0cda30c	add paddlespeech_server stats, test=doc	3 years ago
Hui Zhang	e8f2d8f11b	Merge pull request #1507 from zh794390558/cli [cli] add cli batch/pipe example to readme	3 years ago
Hui Zhang	2517df92a0	Merge pull request #1508 from lym0302/paddlespeech_stats [CLI] modified text sr to lang	3 years ago
TianYuan	b6d33a7fb4	Merge pull request #1506 from yt605155624/fix_frontend [TTS]update text frontend, test=tts	3 years ago
lym0302	395c923dee	modified text sr to lang, test=doc	3 years ago
Hui Zhang	75098698d8	format,test=doc	3 years ago
TianYuan	66a8beb27f	update text frontend, test=tts	3 years ago
lym0302	96abb33b5b	add __call__, test=doc	3 years ago
lym0302	5f1728f855	rm server related, test=doc	3 years ago
xiongxinlei	70d3b01c0d	remove invalid code	3 years ago

1 2 3 4 5 ...

564 Commits (651012616a9bda276040ca308e336094cfa55584)