Commit Graph

267 Commits (047092de8ed344ec391e5492c897395837773765)

Author SHA1 Message Date
zxcd 047092de8e
add wav2vec2_zh aishell recipe, and speechbrain dataloader. (#2916)
2 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
2 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
2 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
2 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
2 years ago
zxcd ad40dafa85
fix some bug. (#2825)
2 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
2 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
2 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
2 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
2 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
2 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
2 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
2 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
2 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
2 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
2 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
2 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
2 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
2 years ago
tianhao zhang 1ea828c30e fix attention val bug
2 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
2 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
2 years ago
tianhao zhang 19180d359d format wav2vec2 demo
2 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
2 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
2 years ago
Hui Zhang c98b5dd173 fix masked_fill, which produces NaN in training
2 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
2 years ago
Hui Zhang 1f4f98b171 fix bug
2 years ago
Hui Zhang e86337a423 fix bug
2 years ago
Hui Zhang 925abcca23 format
2 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
2 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
2 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimize code of u2pp decode, test=asr
2 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang afda7ed7d1 remove useless code
2 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
2 years ago
Hui Zhang feb27e2a84 fuse linear kv
2 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
2 years ago
Hui Zhang 46088c0a16 eliminate attn transpose
2 years ago
Hui Zhang f9e3eaa024 transpose in matmul
2 years ago
Hui Zhang 3d7ca93861 bool type slice
2 years ago
Hui Zhang c2c8a662b1 refactor reshape
2 years ago
Hui Zhang 6de81d74d9 eliminate cast dtype for bool op
2 years ago
Hui Zhang 8e7a315e00 remove comment
2 years ago