PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
zxcd	c0cf9e9d3d	adapt paddle 3.2, remove some hack code to avoid conflict. (#4120 )	1 week ago
zxcd	af2bdf120f	Reduce precision difference for whipser model (#4116 )	2 weeks ago
zxcd	538f260061	【ASR】whisper large v3 (#4101 ) * whisper large v3 * add convert.py * mv nlp tokenizer to tiktoken. * fix bug * remove convert.py * add new model file. * fix * fix version number * fix version number * fix some bug * fix bug	3 weeks ago
张春乔	f357ec6172	Docker=-= 兼容性验证 (#4018 ) * add docker * fix unit error > Type promotion * fix url * add gpu docker * Update Dockerfile * fix pp3.0 0-d tensor problem * 兼容性验证	6 months ago
Wang Xin	c11b19df90	[Hackathon 7th] updating the paths to utility scripts and modifying method parameters (#3942 ) * fix path error * Update examples/aishell/asr0/local/test.sh * remove some broken symbolic link	9 months ago
megemini	b4c2f3bae3	[Hackathon 7th] 修复 `s2t` 示例错误 (#3950 ) * [Fix] s2t * [Fix] s2t test	9 months ago
megemini	e3c4d4bd7e	[Fix] use reshape instead of view (#3939 )	9 months ago
megemini	ff539ef007	[Fix] transpose use numpy (#3933 )	10 months ago
yinfan98	5b3612f273	【Hackathon 7th】fix whisper at Paddle 3.0 (#3880 ) * fix whisper at Paddle 3.0 * fix whisper at Paddle 3.0 * fix whisper at Paddle 3.0 * fix lint * fix * fix whisper ci * Update TTSCppFrontend * Update utils * Update steps * Update utils * Update __init__.py * Update whisper.py * Update utils * Update utils	10 months ago
Wang Xin	61728f8db6	NumPy compatibility enhancements (#3907 ) * NumPy compatibility enhancements * update code	10 months ago
张春乔	7dc806dc1d	run with aishell/asr3 (#3904 )	10 months ago
zxcd	d32ced7f1a	separate paddle.logsumexp (#3897 )	10 months ago
zxcd	89bfd44293	fix file name (#3895 )	10 months ago
Wang Xin	62c21e951f	replace view with reshape in aishell/asr1 (#3887 )	10 months ago
张春乔	6f44ac92c8	fix the shape error in layer_norm (#3884 )	10 months ago
zxcd	91170bd260	adapt view behavior change, fix KeyError. (#3794 ) * adapt view behavior change, fix KeyError. * fix readme demo run error. * fixed opencc version	1 year ago
Wang Huan	790f183f68	fix develop view confict with model's	2 years ago
Hui Zhang	55870ffbb3	fix bugs	2 years ago
zxcd	9b8ac050de	add dtype param for arange API. (#3302 )	2 years ago
jiamingkong	ba874db5dc	Fixed the transpose usages ignored before	2 years ago
jiamingkong	0e2068e2cf	Code clean up for CIs	2 years ago
jiamingkong	232dcf8660	Adapted wavlmASR model to pretrained weights and CLI	2 years ago
jiamingkong	60bd7f202e	Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242	2 years ago
jiamingkong	3b6651ba7c	Adding WavLM implementation	2 years ago
TianHao Zhang	12e3e76092	[ASR] Support Hubert, fintuned on the librispeech dataset (#3088 ) * librispeech hubert, test=asr * librispeech hubert, test=asr * hubert decode * review * copyright, notes, example related * hubert cli * pre-commit format * fix conflicts * fix conflicts * doc related * doc and train config * librispeech.py * support hubert cli	2 years ago
Shuangchi He	8c7859d3bc	Fix some typos. (#3178 ) Signed-off-by: Yulv-git <yulvchi@qq.com>	2 years ago
夜雨飘零	31a4562ae8	[ASR]add squeezeformer model (#2755 ) * add squeezeformer model * change CodeStyle, test=asr * change CodeStyle, test=asr * fix subsample rate error, test=asr * merge classes as required, test=asr * change CodeStyle, test=asr * fix missing code, test=asr * split code to new file, test=asr * remove rel_shift, test=asr	3 years ago
zxcd	dcf8ef04e0	[ASR] Remove fluid api and useless import, test=asr (#2944 ) * remove fluid api and useless import. * fix variable name	3 years ago
zxcd	004a4d6096	[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929 ) * rm transformers import and modify variable name consistent with infer.py * add condition ctc_prefix_beam_search decode.	3 years ago
zxcd	047092de8e	add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916 )	3 years ago
zxcd	6728db5b59	[ASR]Whisper remove audio duration limit, test=asr (#2900 )	3 years ago
cxumol	a99244d86e	fix: whisper language choice, test=asr (#2828 )	3 years ago
zxcd	ad40dafa85	fix some bug. (#2825 )	3 years ago
zxcd	a8a240d4ef	remove paddle.fluid (#2740 )	3 years ago
YangZhou	12fa8a2d19	[audio]patch:fix tensor_utils error (#2738 ) * fix tensor utils	3 years ago
YangZhou	42ff946007	[audio] mv paddlespeech/audio to paddleaudio (#2706 ) * split paddlespeech/audio to paddleaudio. * add sox io ,sox effect, kaldi native fbank to paddleaudio.	3 years ago
Zth9730	c67bf7b4ef	[ASR] support wav2vec2-zh cli, test=asr (#2697 ) * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr	3 years ago
zxcd	4542684694	[ASR] fix Whisper cli model download path error. test=asr (#2679 ) * add all whisper model size support * add choices in parser. * fix Whisper cli model download path error. * fix resource download path. * fix code style	3 years ago
Zth9730	fc02cd0540	[doc] update wav2vec2 demos README.md, test=doc (#2674 ) * fix wav2vec2 demos, test=doc * fix wav2vec2 demos, test=doc * fix enc_dropout and nor.py, test=asr	3 years ago
zxcd	b71f1428c7	add all whisper model size support, test=asr (#2677 ) * add all whisper model size support * add choices in parser.	3 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	3 years ago
zxcd	b1d3f59bcb	[s2t] add whisper asr large model (#2640 ) * add whisper asr large model decoding, test=asr * fix code style. * fix json code style. * remove resource and fix code style. * fix yapf * add cli and demos, fix some code style. * fix some problem by comment. * fix yapf	3 years ago
Zth9730	8d3494320d	[ASR] wav2vec2_en, test=asr (#2637 ) * wav2vec2_en, test=asr * wav2vec2_en, test=asr * wav2vec2_en, test=asr	3 years ago
Zth9730	8d3464c050	[s2t] Update wav2vec2 license (#2600 )	3 years ago
tianhao zhang	3d994f5c23	format wav2vec2 demo	3 years ago
tianhao zhang	19180d359d	format wav2vec2 demo	3 years ago
tianhao zhang	6e429f0513	support wav2vec2ASR on librispeech	3 years ago
Hui Zhang	1f4f98b171	fix bug	3 years ago
Hui Zhang	2a75405e9a	Merge branch 'develop' into u2pp_export	3 years ago
Hui Zhang	467cfd4e75	Merge pull request #2489 from Zth9730/u2++_server [ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)	3 years ago

1 2 3

128 Commits (develop)