Commit Graph

304 Commits (1a693448faaa07aa54d1a2a1b89c7c7c4d7427a2)

Author SHA1 Message Date
Zth9730 94a487bd81 [ASR] support wav2vec2 command line and demo (#2658)
2 years ago
zxcd b1d3f59bcb [s2t] add whisper asr large model (#2640)
2 years ago
Zth9730 8d3494320d [ASR] wav2vec2_en, test=asr (#2637)
2 years ago
Hui Zhang 2c34481ea0 [s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050 [s2t] Update wav2vec2 license (#2600)
2 years ago
YangZhou bbf2401e3e Merge pull request #2524 from zh794390558/u2
2 years ago
Hui Zhang eac545e1db Merge pull request #2544 from Zth9730/fix_attention
2 years ago
tianhao zhang 1ea828c30e fix attention val bug
2 years ago
Hui Zhang 964c22c677 Merge pull request #2532 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
2 years ago
Hui Zhang f1ca564731 Merge pull request #2518 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
2 years ago
tianhao zhang 19180d359d format wav2vec2 demo
2 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
2 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
2 years ago
Hui Zhang c98b5dd173 fix masked_fill which will nan in trainning
2 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
2 years ago
Hui Zhang 1f4f98b171 fix bug
2 years ago
Hui Zhang e86337a423 fix bug
2 years ago
Hui Zhang 925abcca23 format
2 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
2 years ago
Hui Zhang 467cfd4e75 Merge pull request #2489 from Zth9730/u2++_server
2 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang afda7ed7d1 remove useless code
2 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
2 years ago
Hui Zhang feb27e2a84 fuse linear kv
2 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
2 years ago
Hui Zhang 46088c0a16 elimiate attn transpose
2 years ago
Hui Zhang f9e3eaa024 transpose in matmul
2 years ago
Hui Zhang 3d7ca93861 bool type slice
2 years ago
Hui Zhang c2c8a662b1 refactor reshape
2 years ago
Hui Zhang 6de81d74d9 elimiete cast dtype for bool op
2 years ago
Hui Zhang 8e7a315e00 remove comment
2 years ago
Hui Zhang b7388ce25a eliminate useless unsqueese
2 years ago
Hui Zhang 1a1ce92cb4 Merge pull request #2415 from Zth9730/u2++_decoder
2 years ago
tianhao zhang d3e5937591 support bitransformer decoder
2 years ago
Hui Zhang 7382050e21 fix bug on win
2 years ago
Hui Zhang d25871a7b0 format
2 years ago
Hui Zhang b10512eb0e more config or u2pp
2 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
2 years ago
Hui Zhang 309c8d70d9 add reverse weight
2 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
2 years ago
tianhao zhang 0a95689461 support bitransformer decoder
2 years ago
tianhao zhang 455379b88e support bitransformer decoder
2 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
2 years ago
Hui Zhang 53d6baff0b format
2 years ago
Hui Zhang 549d477592 fix code style
2 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
2 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
2 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
2 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
2 years ago
Hui Zhang 1c9f238ba0 configurable export
2 years ago
Hui Zhang 63aeb747b0 more comment
2 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang 663e3ab58e fix dp init
2 years ago
tianhao zhang 6745e9dd6b fix dp init
2 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
2 years ago
tianhao zhang 9560d650db fix dp init
2 years ago
tianhao zhang 82e04d7815 fix trianer
2 years ago
Hui Zhang 2bb40c41ba Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
TianYuan 795eb7bd10 format paddlespeech with pre-commit (#2331)
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
Hui Zhang c1fbfe928e add test
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
2 years ago
0x45f 4e7106d9e2 Support dy2st
2 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
2 years ago
0x45f e21cceea51 Remove blank line
2 years ago
0x45f e6ac8881f1 Fix comments
2 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
0x45f 294b7b00bd Supprot dy2st for conformer
2 years ago
huangyuxin 75997d8277 merge
2 years ago
Hui Zhang e81849277e att cache for streaming asr
2 years ago
Hui Zhang fb40602d94 refactor attention cache
2 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
2 years ago
huangyuxin 92d1d08b9a fix scripts
2 years ago
TianYuan e4a8e15334 Merge pull request #2111 from yt605155624/rm_more_log
2 years ago
TianYuan 496e2dd14b fix Pillow's version
2 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
huangyuxin 98cfdc4c05 fix nxpu
2 years ago
huangyuxin 7463df89c5 fix nxpu
2 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg 6598216b2f Merge branch 'develop' into webdataset
2 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
2 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago