PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
YangZhou	42ff946007	[audio] mv paddlespeech/audio to paddleaudio (#2706 ) * split paddlespeech/audio to paddleaudio. * add sox io ,sox effect, kaldi native fbank to paddleaudio.	2 years ago
Zth9730	c67bf7b4ef	[ASR] support wav2vec2-zh cli, test=asr (#2697 ) * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr	2 years ago
zxcd	4542684694	[ASR] fix Whisper cli model download path error. test=asr (#2679 ) * add all whisper model size support * add choices in parser. * fix Whisper cli model download path error. * fix resource download path. * fix code style	2 years ago
zxcd	b71f1428c7	add all whisper model size support, test=asr (#2677 ) * add all whisper model size support * add choices in parser.	2 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	2 years ago
zxcd	b1d3f59bcb	[s2t] add whisper asr large model (#2640 ) * add whisper asr large model decoding, test=asr * fix code style. * fix json code style. * remove resource and fix code style. * fix yapf * add cli and demos, fix some code style. * fix some problem by comment. * fix yapf	2 years ago
liangym	25b6bf9668	[tts] Add male voice for tts (#2660 )	2 years ago
Zth9730	68134c8436	fix u2pp model (#2549 )	2 years ago
Hui Zhang	467cfd4e75	Merge pull request #2489 from Zth9730/u2++_server [ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)	2 years ago
ZapBird	7a13b35fe6	BytesIO类型时，要保证切到初始位置，这样多次读取才能够正常。比如__call__函数。 (#2484 ) * BytesIO类型时，要保证切到初始位置，这样多次读取才能够正常。比如__call__函数。 __call__函数的参数audio_file为BytesIO类型时执行到self.preprocess(model, audio_file)会报错，需要判断audio_file为BytesIO类型时执行audio_file.seek(0)。	2 years ago
tianhao zhang	5bbe6e9897	support u2pp cli and server, optimiz code of u2pp decode, test=asr	2 years ago
THUzyt21	bdbacd4249	precomited	2 years ago
Zhao Yuting	d5dec46336	Update README.md	2 years ago
Zhao Yuting	18b71dc136	Update README.md	2 years ago
Zhao Yuting	12a11394bd	Update infer.py add a new faster model to infer in cli	2 years ago
Zhao Yuting	fb7f04e021	Update README.md	2 years ago
Zhao Yuting	92d09d5cce	Update README_cn.md	2 years ago
Zhao Yuting	57dcd0d17f	Update infer.py change the infer in order to implement the new faster model for text	2 years ago
TianYuan	35c6ffa90b	Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2	2 years ago
TianYuan	e622f42d92	add aishell3 voice cloning with ECAPA-TDNN spk encoder	2 years ago
TianYuan	795eb7bd10	format paddlespeech with pre-commit (#2331 )	2 years ago
TianYuan	0baec4325a	fix stats bugs	2 years ago
TianYuan	5d515f3f3f	update mix tts	2 years ago
TianYuan	8da993bbf8	fix fs bug	2 years ago
TianYuan	788a3062d0	fix onnx am_ckpt from list to item in prtrained_mdoels.py	2 years ago
TianYuan	cd662a08e0	fix for load specified model files	2 years ago
TianYuan	b9ade18055	add onnxruntime infer for cli	2 years ago
TianYuan	510e240c5c	achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts	2 years ago
TianYuan	00e9853f66	add mix tts cli, test=tts	2 years ago
Hui Zhang	5ca05fea20	cli batch process support \t	2 years ago
huangyuxin	05d41523ad	Merge branch 'develop' into webdataset	2 years ago
TianYuan	4b1f82d312	log redundancy in server	2 years ago
TianYuan	bc93bffbb4	replace logger.info with logger.debug in cli, change default log level to INFO	2 years ago
TianYuan	cf846f9ebc	rm extra log	2 years ago
KP	adc7c9b4aa	Fix unnecessary download present in issue #2067 .	2 years ago
huangyuxin	6ec6921255	Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset	2 years ago
Jackwaterveg	6598216b2f	Merge branch 'develop' into webdataset	2 years ago
huangyuxin	aa12b9ab52	replace s2t.transform with audio.transform	2 years ago
KP	b230dfbdec	Add kws cli and demo.	2 years ago
Hui Zhang	d20adb5c89	Merge pull request #2048 from KPatr1ck/import_bug [CLI][Audio] Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio	2 years ago
KP	b452be3d8d	Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio	2 years ago
KP	fe345409bb	Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio	2 years ago
Hui Zhang	59a78f2a46	ds2 wenetspeech to onnx and support streaming asr server	2 years ago
Hui Zhang	c8574c7e35	ds2 inference as sepearte engine for streaming asr	2 years ago
Jackwaterveg	bca014fd92	Merge pull request #2032 from PaddlePaddle/audio_refactoring [audio] Audio refactoring	2 years ago
KP	4aaa8effe8	Refactor paddleaudio to paddlespeech.audio	2 years ago
KP	bf056c013d	Refactor paddleaudio to paddlespeech.audio	2 years ago
huangyuxin	2b5bc6df39	fix cli, test=doc	2 years ago
huangyuxin	06c9eee339	update reademe, add conf file, updata test_cli	2 years ago
huangyuxin	6ebe476532	support editing num_decode_left_chunks in cli and server	2 years ago

1 2 3 4 5

221 Commits (d35dda002012d84b742dfef86b917fd7d3a40b37)