Commit Graph

145 Commits (develop)

Author SHA1 Message Date
zxcd 538f260061
【ASR】whisper large v3 (#4101)
3 weeks ago
co63oc c2dc4dae2d
Fix typos (#4021)
6 months ago
megemini 9752f0a03b
[Hackathon 7th] Fix the script path error in `test.sh` for `asr5` (#3941)
9 months ago
Wang Xin c11b19df90
[Hackathon 7th] updating the paths to utility scripts and modifying method parameters (#3942)
9 months ago
megemini b4c2f3bae3
[Hackathon 7th] Fix errors in the `s2t` examples (#3950)
9 months ago
megemini 2d7cf7f0e6
[Hackathon 7th] Fix the redundant argument in `test_wav` for `asr4` (#3940)
10 months ago
张春乔 5069111e6d
[Hackathon 7th] Fix the export issue in deepspeech2online (#3935)
10 months ago
megemini d17361cf8c
[Fix] duplicated arg (#3934)
10 months ago
megemini 890c87ea93
[Fix] import TimeDomainSpecAugment (#3919)
10 months ago
megemini 5e8c727fd6
[Hackathon 7th] Fix 0D tensor to 1D in the `tal_cs` test (#3913)
10 months ago
megemini a397ebe207
[Fix] import print_arguments (#3918)
10 months ago
张春乔 7dc806dc1d
run with aishell/asr3 (#3904)
10 months ago
张春乔 419af4503c
【Hackathon 7th】Remove parser.add_argument (#3878)
10 months ago
张春乔 290d161d8a
【Hackathon 7th】add implemention of strtobool (#3877)
10 months ago
Hui Zhang a2ae6396ef
old grad clip has 0d tensor problem, fix it (#3334)
2 years ago
jiamingkong 8432e8626f Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
2 years ago
jiamingkong 60bd7f202e Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong 3b6651ba7c Adding WavLM implementation
2 years ago
TianHao Zhang 12e3e76092
[ASR] Support Hubert, fintuned on the librispeech dataset (#3088)
2 years ago
Hui Zhang 8371d14f5d
Merge pull request #3167 from zxcd/amp
2 years ago
Hui Zhang 225737d4e3
[s2t] fix cli args to config (#3194)
2 years ago
zxcd bc365cbb52
Merge branch 'develop' into amp
2 years ago
Hui Zhang df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset (#3189)
2 years ago
zxcd a1e5f27003 mv scaler.unscale_ blow grad_clip.
2 years ago
zxcd 7399d560e7 fix scaler save and load.
2 years ago
zxcd fbd27aab41 add amp for U2 conformer.
2 years ago
zxcd 9bf5471613
optional tokenizer and fix some doc. (#3042)
3 years ago
zxcd 4e9bca177a
[ASR] change optimizer and fix import error, test=asr (#3023)
3 years ago
zxcd 5186319f48
fix load model schedule error, config optional. (#3008)
3 years ago
zxcd 004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929)
3 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
3 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
3 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
3 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
3 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
3 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
3 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
3 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
3 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
3 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
3 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
3 years ago
tianhao zhang 19180d359d format wav2vec2 demo
3 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
3 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
3 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
3 years ago
Hui Zhang 1f4f98b171 fix bug
3 years ago
Hui Zhang e86337a423 fix bug
3 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
3 years ago