128 Commits (develop)

Author SHA1 Message Date
zxcd c0cf9e9d3d
adapt paddle 3.2, remove some hack code to avoid conflict. (#4120)
1 week ago
zxcd af2bdf120f
Reduce precision difference for whisper model (#4116)
2 weeks ago
zxcd 538f260061
【ASR】whisper large v3 (#4101)
3 weeks ago
张春乔 f357ec6172
Docker=-= compatibility verification (#4018)
6 months ago
Wang Xin c11b19df90
[Hackathon 7th] updating the paths to utility scripts and modifying method parameters (#3942)
9 months ago
megemini b4c2f3bae3
[Hackathon 7th] Fix `s2t` example errors (#3950)
9 months ago
megemini e3c4d4bd7e
[Fix] use reshape instead of view (#3939)
9 months ago
megemini ff539ef007
[Fix] transpose use numpy (#3933)
10 months ago
yinfan98 5b3612f273
【Hackathon 7th】fix whisper at Paddle 3.0 (#3880)
10 months ago
Wang Xin 61728f8db6
NumPy compatibility enhancements (#3907)
10 months ago
张春乔 7dc806dc1d
run with aishell/asr3 (#3904)
10 months ago
zxcd d32ced7f1a
separate paddle.logsumexp (#3897)
10 months ago
zxcd 89bfd44293
fix file name (#3895)
10 months ago
Wang Xin 62c21e951f
replace view with reshape in aishell/asr1 (#3887)
10 months ago
张春乔 6f44ac92c8
fix the shape error in layer_norm (#3884)
10 months ago
zxcd 91170bd260
adapt view behavior change, fix KeyError. (#3794)
1 year ago
Wang Huan 790f183f68 fix develop view conflict with model's
2 years ago
Hui Zhang 55870ffbb3 fix bugs
2 years ago
zxcd 9b8ac050de
add dtype param for arange API. (#3302)
2 years ago
jiamingkong ba874db5dc Fixed the transpose usages ignored before
2 years ago
jiamingkong 0e2068e2cf Code clean up for CIs
2 years ago
jiamingkong 232dcf8660 Adapted wavlmASR model to pretrained weights and CLI
2 years ago
jiamingkong 60bd7f202e Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong 3b6651ba7c Adding WavLM implementation
2 years ago
TianHao Zhang 12e3e76092
[ASR] Support Hubert, fine-tuned on the librispeech dataset (#3088)
2 years ago
Shuangchi He 8c7859d3bc
Fix some typos. (#3178)
2 years ago
夜雨飘零 31a4562ae8
[ASR]add squeezeformer model (#2755)
3 years ago
zxcd dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr (#2944)
3 years ago
zxcd 004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929)
3 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
3 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
3 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
3 years ago
zxcd ad40dafa85
fix some bug. (#2825)
3 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
3 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
3 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
3 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
3 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
3 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
3 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
3 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
3 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
3 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
3 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
3 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
3 years ago
tianhao zhang 19180d359d format wav2vec2 demo
3 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
3 years ago
Hui Zhang 1f4f98b171 fix bug
3 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
3 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
3 years ago