Commit Graph

292 Commits (9727e67a3fbc2779a64ae3372fb0fbd79edefe24)

Author SHA1 Message Date
zxcd 9b8ac050de
add dtype param for arange API. (#3302)
1 year ago
jiamingkong 8432e8626f Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
1 year ago
jiamingkong ba874db5dc Fixed the transpose usages ignored before
1 year ago
jiamingkong 0e2068e2cf Code clean up for CIs
1 year ago
jiamingkong 232dcf8660 Adapted wavlmASR model to pretrained weights and CLI
2 years ago
jiamingkong 60bd7f202e Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong 3b6651ba7c Adding WavLM implementation
2 years ago
TianHao Zhang 12e3e76092
[ASR] Support Hubert, fintuned on the librispeech dataset (#3088)
2 years ago
Hui Zhang 8371d14f5d
Merge pull request #3167 from zxcd/amp
2 years ago
Hui Zhang 225737d4e3
[s2t] fix cli args to config (#3194)
2 years ago
zxcd bc365cbb52
Merge branch 'develop' into amp
2 years ago
Hui Zhang df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset (#3189)
2 years ago
Shuangchi He 8c7859d3bc
Fix some typos. (#3178)
2 years ago
zxcd a1e5f27003 mv scaler.unscale_ blow grad_clip.
2 years ago
zxcd 7399d560e7 fix scaler save and load.
2 years ago
zxcd 2f4414a5f8 fix scaler save
2 years ago
zxcd fbd27aab41 add amp for U2 conformer.
2 years ago
夜雨飘零 31a4562ae8
[ASR]add squeezeformer model (#2755)
2 years ago
zxcd 9bf5471613
optional tokenizer and fix some doc. (#3042)
2 years ago
zxcd 4e9bca177a
[ASR] change optimizer and fix import error, test=asr (#3023)
2 years ago
zxcd 5186319f48
fix load model schedule error, config optional. (#3008)
2 years ago
zxcd dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr (#2944)
2 years ago
zxcd a8a353d0ac
[ASR] add python simple adadelta optimizer, test=asr (#2925)
2 years ago
zxcd 004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929)
2 years ago
zxcd 17a7ebddfa
fix dist_sampler AttributeError (#2918)
2 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
2 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
2 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
2 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
2 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
2 years ago
zxcd ad40dafa85
fix some bug. (#2825)
2 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
2 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
2 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
2 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
2 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
2 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
2 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
2 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
2 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
2 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
2 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
2 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
2 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
2 years ago
tianhao zhang 1ea828c30e fix attention val bug
2 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
2 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
2 years ago