PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
zxcd	9b8ac050de	add dtype param for arange API. (#3302 )	1 year ago
jiamingkong	8432e8626f	Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options	1 year ago
jiamingkong	ba874db5dc	Fixed the transpose usages ignored before	1 year ago
jiamingkong	0e2068e2cf	Code clean up for CIs	1 year ago
jiamingkong	232dcf8660	Adapted wavlmASR model to pretrained weights and CLI	2 years ago
jiamingkong	60bd7f202e	Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242	2 years ago
jiamingkong	3b6651ba7c	Adding WavLM implementation	2 years ago
TianHao Zhang	12e3e76092	[ASR] Support Hubert, fintuned on the librispeech dataset (#3088 ) * librispeech hubert, test=asr * librispeech hubert, test=asr * hubert decode * review * copyright, notes, example related * hubert cli * pre-commit format * fix conflicts * fix conflicts * doc related * doc and train config * librispeech.py * support hubert cli	2 years ago
Hui Zhang	8371d14f5d	Merge pull request #3167 from zxcd/amp [ASR] add amp for U2 conformer	2 years ago
Hui Zhang	225737d4e3	[s2t] fix cli args to config (#3194 ) * fix cli args to config * fix train cli	2 years ago
zxcd	bc365cbb52	Merge branch 'develop' into amp	2 years ago
Hui Zhang	df3be4acae	[s2t] move s2t data preprocess into paddlespeech.dataset (#3189 ) * move s2t data preprocess into paddlespeech.dataset * avg model, compute wer, format rsl into paddlespeech.dataset * fix format rsl * fix avg ckpts	2 years ago
Shuangchi He	8c7859d3bc	Fix some typos. (#3178 ) Signed-off-by: Yulv-git <yulvchi@qq.com>	2 years ago
zxcd	a1e5f27003	mv scaler.unscale_ blow grad_clip.	2 years ago
zxcd	7399d560e7	fix scaler save and load.	2 years ago
zxcd	2f4414a5f8	fix scaler save	2 years ago
zxcd	fbd27aab41	add amp for U2 conformer.	2 years ago
夜雨飘零	31a4562ae8	[ASR]add squeezeformer model (#2755 ) * add squeezeformer model * change CodeStyle, test=asr * change CodeStyle, test=asr * fix subsample rate error, test=asr * merge classes as required, test=asr * change CodeStyle, test=asr * fix missing code, test=asr * split code to new file, test=asr * remove rel_shift, test=asr	2 years ago
zxcd	9bf5471613	optional tokenizer and fix some doc. (#3042 )	2 years ago
zxcd	4e9bca177a	[ASR] change optimizer and fix import error, test=asr (#3023 ) * mv dataio.py to s2t.io.speechbrain.dataio mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio * remove transformers import. * change optimizer same with released model * add paddlenlp version in RESULT.md. * fix run.sh * fix data.sh step_num. * add adadelta optimizer config. * fix wav2vec2 test_wav.sh run error. * add tokenizer config.	2 years ago
zxcd	5186319f48	fix load model schedule error, config optional. (#3008 )	2 years ago
zxcd	dcf8ef04e0	[ASR] Remove fluid api and useless import, test=asr (#2944 ) * remove fluid api and useless import. * fix variable name	2 years ago
zxcd	a8a353d0ac	[ASR] add python simple adadelta optimizer, test=asr (#2925 ) * add simple adeadelta optimizer. * remove useless log * remove useless and fluid import. * add framework.dygraph_only back	2 years ago
zxcd	004a4d6096	[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929 ) * rm transformers import and modify variable name consistent with infer.py * add condition ctc_prefix_beam_search decode.	2 years ago
zxcd	17a7ebddfa	fix dist_sampler AttributeError (#2918 )	2 years ago
zxcd	047092de8e	add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916 )	2 years ago
zxcd	6728db5b59	[ASR]Whisper remove audio duration limit, test=asr (#2900 )	2 years ago
zxcd	64aeb6dccc	remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859 )	2 years ago
zxcd	31c2c226ca	clean fluid elementwise_max and square api. (#2852 )	2 years ago
cxumol	a99244d86e	fix: whisper language choice, test=asr (#2828 )	2 years ago
zxcd	ad40dafa85	fix some bug. (#2825 )	2 years ago
zxcd	a8a240d4ef	remove paddle.fluid (#2740 )	2 years ago
YangZhou	12fa8a2d19	[audio]patch:fix tensor_utils error (#2738 ) * fix tensor utils	2 years ago
YangZhou	42ff946007	[audio] mv paddlespeech/audio to paddleaudio (#2706 ) * split paddlespeech/audio to paddleaudio. * add sox io ,sox effect, kaldi native fbank to paddleaudio.	2 years ago
Zth9730	c67bf7b4ef	[ASR] support wav2vec2-zh cli, test=asr (#2697 ) * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr * support wav2vec2-zh cli, test=asr	2 years ago
zxcd	4542684694	[ASR] fix Whisper cli model download path error. test=asr (#2679 ) * add all whisper model size support * add choices in parser. * fix Whisper cli model download path error. * fix resource download path. * fix code style	2 years ago
Zth9730	fc02cd0540	[doc] update wav2vec2 demos README.md, test=doc (#2674 ) * fix wav2vec2 demos, test=doc * fix wav2vec2 demos, test=doc * fix enc_dropout and nor.py, test=asr	2 years ago
zxcd	b71f1428c7	add all whisper model size support, test=asr (#2677 ) * add all whisper model size support * add choices in parser.	2 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	2 years ago
zxcd	b1d3f59bcb	[s2t] add whisper asr large model (#2640 ) * add whisper asr large model decoding, test=asr * fix code style. * fix json code style. * remove resource and fix code style. * fix yapf * add cli and demos, fix some code style. * fix some problem by comment. * fix yapf	2 years ago
Zth9730	8d3494320d	[ASR] wav2vec2_en, test=asr (#2637 ) * wav2vec2_en, test=asr * wav2vec2_en, test=asr * wav2vec2_en, test=asr	2 years ago
Hui Zhang	2c34481ea0	[s2t] quant with wav scp (#2568 ) * add quant hint * add paddleslim * using paddleslim 2.3.4 and paddle 2.4	2 years ago
Zth9730	8d3464c050	[s2t] Update wav2vec2 license (#2600 )	2 years ago
YangZhou	bbf2401e3e	Merge pull request #2524 from zh794390558/u2 [speechx] add u2/u2pp asr inference	2 years ago
Hui Zhang	eac545e1db	Merge pull request #2544 from Zth9730/fix_attention [s2t] fix attention eval bug, do not compose kv in infer	2 years ago
tianhao zhang	1ea828c30e	fix attention val bug	2 years ago
Hui Zhang	964c22c677	Merge pull request #2532 from Zth9730/wav2vec2.0 [s2t] fix wav2vec2 report loss bug	2 years ago
tianhao zhang	86f65f0b8e	fix wav2vec2 report loss bug	2 years ago
Hui Zhang	f1ca564731	Merge pull request #2518 from Zth9730/wav2vec2.0 [ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech	2 years ago
tianhao zhang	2ae94bd277	freeze wav2vec2=True, change loss report and update README.md	2 years ago

1 2 3 4 5 ...

292 Commits (9727e67a3fbc2779a64ae3372fb0fbd79edefe24)