megemini
e3c4d4bd7e
[Fix] use reshape instead of view ( #3939 )
3 weeks ago
megemini
ff539ef007
[Fix] transpose use numpy ( #3933 )
3 weeks ago
yinfan98
5b3612f273
[Hackathon 7th] fix whisper at Paddle 3.0 ( #3880 )
* fix whisper at Paddle 3.0
* fix whisper at Paddle 3.0
* fix whisper at Paddle 3.0
* fix lint
* fix
* fix whisper ci
* Update TTSCppFrontend
* Update utils
* Update steps
* Update utils
* Update __init__.py
* Update whisper.py
* Update utils
* Update utils
4 weeks ago
Wang Xin
61728f8db6
NumPy compatibility enhancements ( #3907 )
* NumPy compatibility enhancements
* update code
1 month ago
张春乔
7dc806dc1d
run with aishell/asr3 ( #3904 )
1 month ago
zxcd
d32ced7f1a
separate paddle.logsumexp ( #3897 )
1 month ago
zxcd
89bfd44293
fix file name ( #3895 )
1 month ago
Wang Xin
62c21e951f
replace view with reshape in aishell/asr1 ( #3887 )
1 month ago
张春乔
6f44ac92c8
fix the shape error in layer_norm ( #3884 )
1 month ago
zxcd
91170bd260
adapt view behavior change, fix KeyError. ( #3794 )
* adapt view behavior change, fix KeyError.
* fix readme demo run error.
* fixed opencc version
7 months ago
Wang Huan
790f183f68
fix develop view conflict with model's
1 year ago
Hui Zhang
55870ffbb3
fix bugs
1 year ago
zxcd
9b8ac050de
add dtype param for arange API. ( #3302 )
2 years ago
jiamingkong
ba874db5dc
Fixed the transpose usages ignored before
2 years ago
jiamingkong
0e2068e2cf
Code clean up for CIs
2 years ago
jiamingkong
232dcf8660
Adapted wavlmASR model to pretrained weights and CLI
2 years ago
jiamingkong
60bd7f202e
Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong
3b6651ba7c
Adding WavLM implementation
2 years ago
TianHao Zhang
12e3e76092
[ASR] Support Hubert, fine-tuned on the librispeech dataset ( #3088 )
* librispeech hubert, test=asr
* librispeech hubert, test=asr
* hubert decode
* review
* copyright, notes, example related
* hubert cli
* pre-commit format
* fix conflicts
* fix conflicts
* doc related
* doc and train config
* librispeech.py
* support hubert cli
2 years ago
Shuangchi He
8c7859d3bc
Fix some typos. ( #3178 )
Signed-off-by: Yulv-git <yulvchi@qq.com>
2 years ago
夜雨飘零
31a4562ae8
[ASR]add squeezeformer model ( #2755 )
* add squeezeformer model
* change CodeStyle, test=asr
* change CodeStyle, test=asr
* fix subsample rate error, test=asr
* merge classes as required, test=asr
* change CodeStyle, test=asr
* fix missing code, test=asr
* split code to new file, test=asr
* remove rel_shift, test=asr
2 years ago
zxcd
dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr ( #2944 )
* remove fluid api and useless import.
* fix variable name
2 years ago
zxcd
004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr ( #2929 )
* rm transformers import and modify variable name consistent with infer.py
* add condition ctc_prefix_beam_search decode.
2 years ago
zxcd
047092de8e
add wav2vec2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
cxumol
a99244d86e
fix: whisper language choice, test=asr ( #2828 )
2 years ago
zxcd
ad40dafa85
fix some bug. ( #2825 )
2 years ago
zxcd
a8a240d4ef
remove paddle.fluid ( #2740 )
2 years ago
YangZhou
12fa8a2d19
[audio]patch:fix tensor_utils error ( #2738 )
* fix tensor utils
2 years ago
YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
* split paddlespeech/audio to paddleaudio.
* add sox io, sox effect, kaldi native fbank to paddleaudio.
2 years ago
Zth9730
c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr ( #2697 )
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
2 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
2 years ago
Zth9730
fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc ( #2674 )
* fix wav2vec2 demos, test=doc
* fix wav2vec2 demos, test=doc
* fix enc_dropout and nor.py, test=asr
2 years ago
zxcd
b71f1428c7
add all whisper model size support, test=asr ( #2677 )
* add all whisper model size support
* add choices in parser.
2 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
2 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
2 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
2 years ago
Zth9730
8d3464c050
[s2t] Update wav2vec2 license ( #2600 )
2 years ago
tianhao zhang
3d994f5c23
format wav2vec2 demo
2 years ago
tianhao zhang
19180d359d
format wav2vec2 demo
2 years ago
tianhao zhang
6e429f0513
support wav2vec2ASR on librispeech
2 years ago
Hui Zhang
1f4f98b171
fix bug
2 years ago
Hui Zhang
2a75405e9a
Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang
467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
[ASR] support u2pp-based cli and server, optimize code of u2pp decode (reversed_weight parameter)
2 years ago
tianhao zhang
5bbe6e9897
support u2pp cli and server, optimize code of u2pp decode, test=asr
2 years ago
Hui Zhang
6de81d74d9
eliminate cast dtype for bool op
2 years ago
Hui Zhang
1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
[s2t] support bitransformer decoder
2 years ago
Hui Zhang
7382050e21
fix bug on win
2 years ago
Hui Zhang
d25871a7b0
format
2 years ago
Hui Zhang
b10512eb0e
more config for u2pp
2 years ago