PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
longRookie	df37798598	[TTS]【Hackathon + No.190】 + 模型复现：iSTFTNet (#3006 ) * iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN * modify the comment in iSTFT.yaml * add the comments in hifigan * iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN * modify the comment in iSTFT.yaml * add the comments in hifigan * add iSTFTNet.md * modify the format of iSTFTNet.md * modify iSTFT.yaml and hifigan.py * Format code using pre-commit * modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv * update iSTFTNet_csmsc_ckpt.zip download link * modify iSTFTNet.md * modify hifigan.py and iSTFT.yaml * modify iSTFTNet.md	2 years ago
TianYuan	72aa19c32c	[TTS]add starganv2 vc trainer (#3143 ) * add starganv2 vc trainer * fix StarGANv2VCUpdater and losses * fix StarGANv2VCEvaluator * add some typehint	2 years ago
TianYuan	54ef90fcec	[TTS]Fix VITS lite infer (#3098 )	2 years ago
liangym	e83b491c34	rm unused dep, test=tts (#3097 )	2 years ago
TianYuan	6894a2a77d	[TTS]fix elementwise_floordiv's fill_constant (#3075 ) * fix elementwise_floordiv's fill_constant * add float converter for min_value in attention	2 years ago
TianYuan	0a2e367ff4	[TTS]clean starganv2 vc model code and add docstring (#2987 ) * clean code * add docstring	2 years ago
liangym	880c172db7	[TTS] add svs frontend (#3062 )	2 years ago
TianYuan	d5720e4e7b	fix input dtype of elementwise_mul op from bool to int64 (#3054 )	2 years ago
夜雨飘零	31a4562ae8	[ASR]add squeezeformer model (#2755 ) * add squeezeformer model * change CodeStyle, test=asr * change CodeStyle, test=asr * fix subsample rate error, test=asr * merge classes as required, test=asr * change CodeStyle, test=asr * fix missing code, test=asr * split code to new file, test=asr * remove rel_shift, test=asr	2 years ago
zxcd	9bf5471613	optional tokenizer and fix some doc. (#3042 )	2 years ago
TianYuan	706a68bde9	fix dtype diff of last expand_v2 op of VITS (#3041 )	2 years ago
liangym	348064de0d	[TTS] add opencpop HIFIGAN example (#3038 ) * add opencpop voc, test=tts * soft link * add opencpop hifigan, test=tts * update	2 years ago
zxcd	4e9bca177a	[ASR] change optimizer and fix import error, test=asr (#3023 ) * mv dataio.py to s2t.io.speechbrain.dataio mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio * remove transformers import. * change optimizer same with released model * add paddlenlp version in RESULT.md. * fix run.sh * fix data.sh step_num. * add adadelta optimizer config. * fix wav2vec2 test_wav.sh run error. * add tokenizer config.	2 years ago
liangym	435fc5cc19	[TTS] add opencpop PWGAN example (#3031 ) * add opencpop voc, test=tts * soft link	2 years ago
TianYuan	271112ca69	fix vits reduce_sum's input/output dtype, test=tts (#3028 )	2 years ago
liangym	1afd14acd9	[TTS]add Diffsinger with opencpop dataset (#3005 )	2 years ago
MistEO	319c805968	[TTS] Support set device id for tts prediction, test=tts (#3019 )	2 years ago
zxcd	3145325b4e	[ASR] add wav2vec2 aishell model result, test=asr (#3012 ) * Create RESULT.md * add wav2vec2ASR-large-aishell1 finetune model. * update model link and add readme. * fix released model info.	2 years ago
zxcd	5186319f48	fix load model schedule error, config optional. (#3008 )	2 years ago
TianYuan	528ae58a67	[TTS]remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D (#3002 ) * remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D * fix variable names * add note	2 years ago
JiehangXie	59cabdc967	[TTS]Cli Cantonese onnx, test=tts (#2990 ) Co-authored-by: TianYuan <white-sky@qq.com>	2 years ago
mooncake	c02bc087f6	rearrange-encoder-infer-param (#2983 )	2 years ago
TianYuan	f7fd111647	[TTS]add StarGANv2-VC model scripts (#2842 )	2 years ago
HuangLiangJie	c8196d45ae	[TTS]Canton CLI, test=tts (#2977 )	2 years ago
TianYuan	ad239eb444	[TTS]add VITS inference (#2972 )	2 years ago
TianYuan	84f751f529	[TTS]vits dygraph to static (#2883 ) Co-authored-by: 0x45f <wangzhen45@baidu.com>	2 years ago
HuangLiangJie	11bc392617	[TTS]Canton phonetic fix, test=tts (#2950 )	2 years ago
TianYuan	c8d5a01bdb	[TTS]fix dygraph to static for tacotron2, test=doc (#2426 ) * fix dygraph to static for tacotron2, test=doc * Fix dy2st error for taco2 * Update attentions.py --------- Co-authored-by: 0x45f <wangzhen45@baidu.com>	2 years ago
liangym	d9b041e999	[TTS]Cli male onnx (#2945 )	2 years ago
zxcd	dcf8ef04e0	[ASR] Remove fluid api and useless import, test=asr (#2944 ) * remove fluid api and useless import. * fix variable name	2 years ago
JiehangXie	a5c0bffd2a	add Cantonese test examples (#2937 )	2 years ago
zxcd	a8a353d0ac	[ASR] add python simple adadelta optimizer, test=asr (#2925 ) * add simple adeadelta optimizer. * remove useless log * remove useless and fluid import. * add framework.dygraph_only back	2 years ago
HuangLiangJie	1af9bd47d9	[TTS]Cantonese FastSpeech2 e2e infer, test=tts (#2927 )	2 years ago
zxcd	004a4d6096	[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929 ) * rm transformers import and modify variable name consistent with infer.py * add condition ctc_prefix_beam_search decode.	2 years ago
zxcd	17a7ebddfa	fix dist_sampler AttributeError (#2918 )	2 years ago
HuangLiangJie	acfa057dc7	[TTS]Cantonese FastSpeech2 Training, test=tts (#2907 )	2 years ago
zxcd	047092de8e	add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916 )	2 years ago
艾梦	bcd8e309ec	[TTS]Add diffusion noise clip to optimize sample result (#2902 ) * add diffusion module for training diffsinger * add wavenet denoiser final conv initializer * add diffusion noise clip to optimize sample result	2 years ago
zxcd	6728db5b59	[ASR]Whisper remove audio duration limit, test=asr (#2900 )	2 years ago
zxcd	f6b624ddc8	add encoding=utf8 for text cli. (#2896 )	2 years ago
章宏彬	c764710aa1	[TTS]Avoid using variable "attn_loss" before assignment (#2860 ) * Avoid using variable "attn_loss" before assignment * Update tacotron2_updater.py --------- Co-authored-by: TianYuan <white-sky@qq.com>	2 years ago
TianYuan	a283f8a57e	[TTS]fix open encoding (#2865 )	2 years ago
艾梦	a55fd2e556	[TTS]Fix diffusion wavenet denoiser final conv init param (#2868 ) * add diffusion module for training diffsinger * add wavenet denoiser final conv initializer	2 years ago
QuanZ9	ac3ed3c5a8	Update zh_frontend.py (#2863 )	2 years ago
zxcd	64aeb6dccc	remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859 )	2 years ago
zxcd	31c2c226ca	clean fluid elementwise_max and square api. (#2852 )	2 years ago
HuangLiangJie	140aed4b54	[TTS]VITS init sampler reverse, test=tts (#2843 )	2 years ago
艾梦	57b9d4bca4	add diffusion module for training diffsinger (#2832 )	2 years ago
TianYuan	1fd38c0e8b	fix o (#2831 )	2 years ago
晋东毅	742523fb38	[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese (#2830 ) * 添加.history * [tts]添加中英混合语音合成时对中文SSML的支持	2 years ago

1 2 3 4 5 ...

1133 Commits (9c387577fd9758d04b43844f8297286632333bb3)