PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
zhuyipin	a9ece28ba6	speedyspeech code adapt for mlu (#3828 ) * speedyspeech code adapt for mlu * fix inference * fix help message	1 year ago
zhuyipin	0b568136d9	speedyspeech code adapt for npu (#3804 ) * speedyspeech code adapt for npu * fix npu inference * fix e2e synthesize * add paddle version control for memory optim config * fix code style * fix code style * fix help message * fix code style * fix help message	1 year ago
Kai Song	9a50647592	Add XPU support for FastSpeech2 (#3514 ) * Add XPU support for FastSpeech2 * optimize	2 years ago
Kai Song	08599b7300	Add XPU support for SpeedySpeech (#3502 ) * Add XPU support for SpeedySpeech * fix typos * update description of nxpu	2 years ago
Hui Zhang	596f71407c	update result	3 years ago
Hui Zhang	0a5cc5556e	rope for streaming decoding	3 years ago
Hui Zhang	3b6b680771	add roformer result	3 years ago
Hui Zhang	55870ffbb3	fix bugs	3 years ago
Hui Zhang	03e9ea9e52	add roformer	3 years ago
Hui Zhang	6b4d1f80ac	add t2s assets	3 years ago
Hui Zhang	83d93da8d0	add scripts for tts code switch	3 years ago
Hui Zhang	2376c14d7b	fix ckpt	3 years ago
Hui Zhang	6e7c71b26c	refactor rhy	3 years ago
Hui Zhang	2fe97f2e3a	Merge pull request #3292 from zh794390558/mfa refactor mfa scripts	3 years ago
jiamingkong	8432e8626f	Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options	3 years ago
Hui Zhang	c9ddc4f832	refactor mfa scripts	3 years ago
jiamingkong	927c60a5c1	Deleted examples/librispeech/asr5/format_rsl.py	3 years ago
jiamingkong	232dcf8660	Adapted wavlmASR model to pretrained weights and CLI	3 years ago
jiamingkong	9ee1205d25	Changed the path for the uploaded weight	3 years ago
jiamingkong	60bd7f202e	Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242	3 years ago
jiamingkong	3b6651ba7c	Adding WavLM implementation	3 years ago
TianHao Zhang	12e3e76092	[ASR] Support Hubert, fintuned on the librispeech dataset (#3088 ) * librispeech hubert, test=asr * librispeech hubert, test=asr * hubert decode * review * copyright, notes, example related * hubert cli * pre-commit format * fix conflicts * fix conflicts * doc related * doc and train config * librispeech.py * support hubert cli	3 years ago
Hui Zhang	8371d14f5d	Merge pull request #3167 from zxcd/amp [ASR] add amp for U2 conformer	3 years ago
Hui Zhang	e3dcfa8815	Merge pull request #3186 from PaddlePaddle/vits_pr [TTS]update lr schedulers from per iter to per epoch for VITS	3 years ago
zxcd	bc365cbb52	Merge branch 'develop' into amp	3 years ago
zxcd	9d8660b2f6	add new aishell model for better CER.	3 years ago
WongLaw	305375c310	VITS learning rate revised, test=tts	3 years ago
WongLaw	fdeb9b88a7	VITS learning rate revised, test=tts	3 years ago
TianYuan	fc670339d1	[TTS]Fix losses of StarGAN v2 VC (#3184 )	3 years ago
ljhzxc	84cc5fc98f	Update pretrained model in README (#3193 )	3 years ago
Hui Zhang	df3be4acae	[s2t] move s2t data preprocess into paddlespeech.dataset (#3189 ) * move s2t data preprocess into paddlespeech.dataset * avg model, compute wer, format rsl into paddlespeech.dataset * fix format rsl * fix avg ckpts	3 years ago
Shuangchi He	8c7859d3bc	Fix some typos. (#3178 ) Signed-off-by: Yulv-git <yulvchi@qq.com>	3 years ago
WongLaw	414de3747c	VITS learning rate revised, test=tts	3 years ago
TianYuan	3ad55a31e7	[TTS]StarGANv2 VC fix some trainer bugs, add add reset_parameters (#3182 )	3 years ago
ljhzxc	dc56c3a10e	[TTS] [黑客松]Add JETS (#3109 )	3 years ago
TianYuan	bd0d69ca74	[TTS]add StarGANv2VC preprocess (#3163 )	3 years ago
lemondy	9c387577fd	fix example/aishell local/train.sh if condition bug, test=asr (#3146 )	3 years ago
longRookie	df37798598	[TTS]【Hackathon + No.190】 + 模型复现：iSTFTNet (#3006 ) * iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN * modify the comment in iSTFT.yaml * add the comments in hifigan * iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN * modify the comment in iSTFT.yaml * add the comments in hifigan * add iSTFTNet.md * modify the format of iSTFTNet.md * modify iSTFT.yaml and hifigan.py * Format code using pre-commit * modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv * update iSTFTNet_csmsc_ckpt.zip download link * modify iSTFTNet.md * modify hifigan.py and iSTFT.yaml * modify iSTFTNet.md	3 years ago
TianYuan	72aa19c32c	[TTS]add starganv2 vc trainer (#3143 ) * add starganv2 vc trainer * fix StarGANv2VCUpdater and losses * fix StarGANv2VCEvaluator * add some typehint	3 years ago
TianYuan	54ef90fcec	[TTS]Fix VITS lite infer (#3098 )	3 years ago
TianYuan	cc02b007fe	fix paddle2onnx's install version, install the newest paddle2onnx in run.sh (#3084 )	3 years ago
liangym	880c172db7	[TTS] add svs frontend (#3062 )	3 years ago
夜雨飘零	31a4562ae8	[ASR]add squeezeformer model (#2755 ) * add squeezeformer model * change CodeStyle, test=asr * change CodeStyle, test=asr * fix subsample rate error, test=asr * merge classes as required, test=asr * change CodeStyle, test=asr * fix missing code, test=asr * split code to new file, test=asr * remove rel_shift, test=asr	3 years ago
zxcd	9bf5471613	optional tokenizer and fix some doc. (#3042 )	3 years ago
liangym	348064de0d	[TTS] add opencpop HIFIGAN example (#3038 ) * add opencpop voc, test=tts * soft link * add opencpop hifigan, test=tts * update	3 years ago
zxcd	4e9bca177a	[ASR] change optimizer and fix import error, test=asr (#3023 ) * mv dataio.py to s2t.io.speechbrain.dataio mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio * remove transformers import. * change optimizer same with released model * add paddlenlp version in RESULT.md. * fix run.sh * fix data.sh step_num. * add adadelta optimizer config. * fix wav2vec2 test_wav.sh run error. * add tokenizer config.	3 years ago
TianYuan	65c3217b8b	Update textnorm_test_cases.txt	3 years ago
liangym	435fc5cc19	[TTS] add opencpop PWGAN example (#3031 ) * add opencpop voc, test=tts * soft link	3 years ago
liangym	1afd14acd9	[TTS]add Diffsinger with opencpop dataset (#3005 )	3 years ago
zxcd	3145325b4e	[ASR] add wav2vec2 aishell model result, test=asr (#3012 ) * Create RESULT.md * add wav2vec2ASR-large-aishell1 finetune model. * update model link and add readme. * fix released model info.	3 years ago
TianYuan	528ae58a67	[TTS]remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D (#3002 ) * remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D * fix variable names * add note	3 years ago
JiehangXie	59cabdc967	[TTS]Cli Cantonese onnx, test=tts (#2990 ) Co-authored-by: TianYuan <white-sky@qq.com>	3 years ago
TianYuan	f7fd111647	[TTS]add StarGANv2-VC model scripts (#2842 )	3 years ago
TianYuan	16d75eaa52	Update synthesize_e2e.sh	3 years ago
TianYuan	f31219c63d	Update synthesize_e2e.sh	3 years ago
TianYuan	ad239eb444	[TTS]add VITS inference (#2972 )	3 years ago
zxcd	5430a366ab	Update RESULTS.md (#2964 )	3 years ago
TianYuan	84f751f529	[TTS]vits dygraph to static (#2883 ) Co-authored-by: 0x45f <wangzhen45@baidu.com>	3 years ago
HuangLiangJie	11bc392617	[TTS]Canton phonetic fix, test=tts (#2950 )	3 years ago
TianYuan	a38e3c3187	Update README.md	3 years ago
HuangLiangJie	2553022396	[TTS]Cantonese TTS checkpoint for e2e, test=tts (#2932 )	3 years ago
HuangLiangJie	1af9bd47d9	[TTS]Cantonese FastSpeech2 e2e infer, test=tts (#2927 )	3 years ago
lance6716	71bda24437	[TTS]Fix canton (#2924 ) * Update run.sh * Update README.md	3 years ago
HuangLiangJie	acfa057dc7	[TTS]Cantonese FastSpeech2 Training, test=tts (#2907 )	3 years ago
HuangLiangJie	c75906462e	[TTS]Cantonese TTS MFA pipeline (#2899 ) Co-authored-by: TianYuan <white-sky@qq.com>	3 years ago
zxcd	047092de8e	add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916 )	3 years ago
HuangLiangJie	3a8ba2e242	[TTS]Update VITS checkpoint , test=tts (#2887 )	3 years ago
Zhou Wei	16d84367c6	fix Tensor.numpy()[0] to float(Tensor) to adapt 0D (#2884 )	3 years ago
TianYuan	2f526c093c	fix data for slim (#2862 )	3 years ago
TianYuan	2b01e40525	[TTS]soft link for shell in example, add skip_copy_wave in norm stage of G… (#2851 ) soft link for shell in example, add skip_copy_wave in norm stage of GANVocoders to save disk	3 years ago
zxcd	e793d267d9	[ASR] add code-switch asr tal_cs recipe (#2796 ) * add tal_cs asr recipe. * add readme and result, and fix some bug. * add commit id and date.	3 years ago
TianYuan	355517d1ca	Update run.sh	3 years ago
TianYuan	553fefff9b	Update run_en.sh	3 years ago
TianYuan	59d4b8a2a8	Update run_mix.sh	3 years ago
TianYuan	3f6afc4834	[TTS]Add slim for TTS (#2729 )	3 years ago
YangZhou	42ff946007	[audio] mv paddlespeech/audio to paddleaudio (#2706 ) * split paddlespeech/audio to paddleaudio. * add sox io ,sox effect, kaldi native fbank to paddleaudio.	3 years ago
TianYuan	001b22cf41	Update README.md	3 years ago
HuangLiangJie	a874d8f325	Add prosody prediction in synthesize_e2e, test=tts (#2693 )	3 years ago
TianYuan	0b4cf2211d	[TTS]Add TTS Paddle-Lite x86 inference (#2667 ) * Add export2lite, test=tts * add tts paddlelite x86 inference, test=tts * update released_model.md, test=tts * add paddlelite in setup.py * update	3 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	3 years ago
liangym	f110928195	[tts ] Chmod run_mix.sh (#2647 )	3 years ago
Zth9730	8d3494320d	[ASR] wav2vec2_en, test=asr (#2637 ) * wav2vec2_en, test=asr * wav2vec2_en, test=asr * wav2vec2_en, test=asr	3 years ago
TianYuan	0727984873	Add export2lite, test=tts (#2636 )	3 years ago
HuangLiangJie	b7312e9f0b	Revised TN qualifier for measure notation, test=tts (#2629 )	3 years ago
HuangLiangJie	f43d026000	Add rhythm tags for MFA, test=tts (#2615 ) * Add rhythm tags for MFA, test=tts	3 years ago
TianYuan	d1e65a4781	Update README.md	3 years ago
TianYuan	39df6b14e5	Update README.md	3 years ago
TianYuan	46fd67e2ce	Update README.md	3 years ago
liangym	d622b8bc5f	[tts] update finetune readme (#2625 ) * updata readme, test=doc * update yaml and readme, test=tts * fix batch_size, test=tts * update readme, test=doc	3 years ago
Hui Zhang	2c34481ea0	[s2t] quant with wav scp (#2568 ) * add quant hint * add paddleslim * using paddleslim 2.3.4 and paddle 2.4	3 years ago
liangym	e18170228c	[tts] add adversarial loss (#2588 )	3 years ago
YangZhou	294d83da82	add kws release model (#2585 ) add kws release model.	3 years ago
TianYuan	91fa195b8b	Update README.md	3 years ago
WongLaw	d27364d141	Added Text Rhythm Prediction, test=tts	3 years ago
HuangLiangJie	872be9c8ce	Merge branch 'PaddlePaddle:develop' into rhy	3 years ago
YangZhou	bbf2401e3e	Merge pull request #2524 from zh794390558/u2 [speechx] add u2/u2pp asr inference	3 years ago
WongLaw	72bbabbf79	Revised structure of rhythm prediction, test=tts	3 years ago
Hui Zhang	caa1dd84d4	Merge pull request #2559 from Zth9730/fix_u2pp_model [doc] update wenetspeech u2pp RESULT.md	3 years ago
tianhao zhang	ce153d915e	update u2pp result.md	3 years ago
WongLaw	fd00a22e37	Revised Rhythm Prediction Function, test=tts	3 years ago

1 2 3 4 5 ...

1119 Commits (develop)