PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
JeffLu	02a5f7bce8	bugfix: audio_len should be 1D, not 0D, which will raise list index out of range error in the following decode process (#3490 ) Co-authored-by: Luzhenhui <luzhenhui@mqsz.com>	9 months ago
jiamingkong	8432e8626f	Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options	1 year ago
zoooo0820	17f2944a17	fix error in tts/st	2 years ago
zxcd	caca8e2f12	[ASR] fix asr 0-d tensor. (#3214 ) * fix asr infer.py * add readme.	2 years ago
TianHao Zhang	12e3e76092	[ASR] Support Hubert, fintuned on the librispeech dataset (#3088 ) * librispeech hubert, test=asr * librispeech hubert, test=asr * hubert decode * review * copyright, notes, example related * hubert cli * pre-commit format * fix conflicts * fix conflicts * doc related * doc and train config * librispeech.py * support hubert cli	2 years ago
PiaoYang	5a0103b2ae	[BUG] Fix progress bar unit. (#3177 )	2 years ago
JiehangXie	59cabdc967	[TTS]Cli Cantonese onnx, test=tts (#2990 ) Co-authored-by: TianYuan <white-sky@qq.com>	2 years ago
HuangLiangJie	c8196d45ae	[TTS]Canton CLI, test=tts (#2977 )	2 years ago
liangym	d9b041e999	[TTS]Cli male onnx (#2945 )	2 years ago
zxcd	6728db5b59	[ASR]Whisper remove audio duration limit, test=asr (#2900 )	2 years ago
zxcd	f6b624ddc8	add encoding=utf8 for text cli. (#2896 )	2 years ago
TianYuan	a283f8a57e	[TTS]fix open encoding (#2865 )	2 years ago
zxcd	88fe26f17c	[ASR] add asr code-switch cli and demo, test='asr' (#2816 ) * add asr code-switch cli and demo. * fix some model named problem.	2 years ago
YangZhou	42ff946007	[audio] mv paddlespeech/audio to paddleaudio (#2706 ) * split paddlespeech/audio to paddleaudio. * add sox io ,sox effect, kaldi native fbank to paddleaudio.	2 years ago
Zth9730	c67bf7b4ef	[ASR] support wav2vec2-zh cli, test=asr (#2697 ) * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr	2 years ago
zxcd	4542684694	[ASR] fix Whisper cli model download path error. test=asr (#2679 ) * add all whisper model size support * add choices in parser. * fix Whisper cli model download path error. * fix resource download path. * fix code style	2 years ago
zxcd	b71f1428c7	add all whisper model size support, test=asr (#2677 ) * add all whisper model size support * add choices in parser.	2 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	2 years ago
zxcd	b1d3f59bcb	[s2t] add whisper asr large model (#2640 ) * add whisper asr large model decoding, test=asr * fix code style. * fix json code style. * remove resource and fix code style. * fix yapf * add cli and demos, fix some code style. * fix some problem by comment. * fix yapf	2 years ago
liangym	25b6bf9668	[tts] Add male voice for tts (#2660 )	2 years ago
Zth9730	68134c8436	fix u2pp model (#2549 )	2 years ago
Hui Zhang	467cfd4e75	Merge pull request #2489 from Zth9730/u2++_server [ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)	2 years ago
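ZapBird	7a13b35fe6	For BytesIO input, make sure to seek back to the start position so that repeated reads work correctly, e.g. in the __call__ function. (#2484 ) * For BytesIO input, make sure to seek back to the start position so that repeated reads work correctly, e.g. in the __call__ function. When the audio_file argument of __call__ is a BytesIO, execution fails at self.preprocess(model, audio_file); when audio_file is a BytesIO, audio_file.seek(0) must be called.	2 years ago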
tianhao zhang	5bbe6e9897	support u2pp cli and server, optimiz code of u2pp decode, test=asr	2 years ago
THUzyt21	bdbacd4249	precomited	2 years ago
Zhao Yuting	d5dec46336	Update README.md	2 years ago
Zhao Yuting	18b71dc136	Update README.md	2 years ago
Zhao Yuting	12a11394bd	Update infer.py add a new faster model to infer in cli	2 years ago
Zhao Yuting	fb7f04e021	Update README.md	2 years ago
Zhao Yuting	92d09d5cce	Update README_cn.md	2 years ago
Zhao Yuting	57dcd0d17f	Update infer.py change the infer in order to implement the new faster model for text	2 years ago
TianYuan	35c6ffa90b	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2	2 years ago
TianYuan	e622f42d92	add aishell3 voice cloning with ECAPA-TDNN spk encoder	2 years ago
TianYuan	795eb7bd10	format paddlespeech with pre-commit (#2331 )	2 years ago
TianYuan	0baec4325a	fix stats bugs	2 years ago
TianYuan	5d515f3f3f	update mix tts	2 years ago
TianYuan	8da993bbf8	fix fs bug	2 years ago
TianYuan	788a3062d0	fix onnx am_ckpt from list to item in prtrained_mdoels.py	2 years ago
TianYuan	cd662a08e0	fix for load specified model files	2 years ago
TianYuan	b9ade18055	add onnxruntime infer for cli	2 years ago
TianYuan	510e240c5c	achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts	2 years ago
TianYuan	00e9853f66	add mix tts cli, test=tts	2 years ago
Hui Zhang	5ca05fea20	cli batch process support \t	2 years ago
huangyuxin	05d41523ad	Merge branch 'develop' into webdataset	2 years ago
TianYuan	4b1f82d312	log redundancy in server	2 years ago
TianYuan	bc93bffbb4	replace logger.info with logger.debug in cli, change default log level to INFO	2 years ago
TianYuan	cf846f9ebc	rm extra log	2 years ago
KP	adc7c9b4aa	Fix unnecessary download present in issue #2067 .	2 years ago
huangyuxin	6ec6921255	Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset	2 years ago
Jackwaterveg	6598216b2f	Merge branch 'develop' into webdataset	2 years ago

234 Commits (03022f2170ce76d2ca8385a92aa8df3519e2366b)