Commit Graph

323 Commits (5069111e6dd32308938e03cd2d9457ac0d00864d)

Author SHA1 Message Date
zxcd 4e9bca177a
[ASR] change optimizer and fix import error, test=asr (#3023)
3 years ago
zxcd 5186319f48
fix load model schedule error, config optional. (#3008)
3 years ago
zxcd dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr (#2944)
3 years ago
zxcd a8a353d0ac
[ASR] add python simple adadelta optimizer, test=asr (#2925)
3 years ago
zxcd 004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929)
3 years ago
zxcd 17a7ebddfa
fix dist_sampler AttributeError (#2918)
3 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
3 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
3 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
3 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
3 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
3 years ago
zxcd ad40dafa85
fix some bug. (#2825)
3 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
3 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
3 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
3 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
3 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
3 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
3 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
3 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
3 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
3 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
3 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
3 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
3 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
3 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
3 years ago
tianhao zhang 1ea828c30e fix attention val bug
3 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
3 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
3 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
3 years ago
tianhao zhang 19180d359d format wav2vec2 demo
3 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
3 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
3 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
3 years ago
Hui Zhang c98b5dd173 fix masked_fill which will nan in trainning
3 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
3 years ago
Hui Zhang 1f4f98b171 fix bug
3 years ago
Hui Zhang e86337a423 fix bug
3 years ago
Hui Zhang 925abcca23 format
3 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
3 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
3 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
3 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimiz code of u2pp decode, test=asr
3 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
3 years ago
Hui Zhang afda7ed7d1 remove useless code
3 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
3 years ago
Hui Zhang feb27e2a84 fuse linear kv
3 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
3 years ago
Hui Zhang 46088c0a16 elimiate attn transpose
3 years ago
Hui Zhang f9e3eaa024 transpose in matmul
3 years ago
Hui Zhang 3d7ca93861 bool type slice
3 years ago
Hui Zhang c2c8a662b1 refactor reshape
3 years ago
Hui Zhang 6de81d74d9 elimiete cast dtype for bool op
3 years ago
Hui Zhang 8e7a315e00 remove comment
3 years ago
Hui Zhang b7388ce25a eliminate useless unsqueese
3 years ago
Hui Zhang 1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
3 years ago
tianhao zhang d3e5937591 support bitransformer decoder
3 years ago
Hui Zhang 7382050e21 fix bug on win
3 years ago
Hui Zhang d25871a7b0 format
3 years ago
Hui Zhang b10512eb0e more config or u2pp
3 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
3 years ago
Hui Zhang 309c8d70d9 add reverse weight
3 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
3 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
3 years ago
tianhao zhang 0a95689461 support bitransformer decoder
3 years ago
tianhao zhang 455379b88e support bitransformer decoder
3 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
3 years ago
Hui Zhang 53d6baff0b format
3 years ago
Hui Zhang 549d477592 fix code style
3 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
3 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
3 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
3 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
3 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
3 years ago
Hui Zhang 1c9f238ba0 configurable export
3 years ago
Hui Zhang 63aeb747b0 more comment
3 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
3 years ago
tianhao zhang 663e3ab58e fix dp init
3 years ago
tianhao zhang 6745e9dd6b fix dp init
3 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
3 years ago
tianhao zhang 9560d650db fix dp init
3 years ago
tianhao zhang 82e04d7815 fix trianer
3 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
3 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
3 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
3 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
3 years ago
tianhao zhang ed2819d7af fix format test=asr
3 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
3 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
3 years ago
Hui Zhang c1fbfe928e add test
3 years ago
Hui Zhang 05bc258833 update docstring
3 years ago
Hui Zhang 6149daa221 export ctc_activation
3 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
3 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
3 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
3 years ago
0x45f 4e7106d9e2 Support dy2st
3 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
3 years ago
0x45f e21cceea51 Remove blank line
3 years ago