zxcd
538f260061
【ASR】whisper large v3 ( #4101 )
...
* whisper large v3
* add convert.py
* mv nlp tokenizer to tiktoken.
* fix bug
* remove convert.py
* add new model file.
* fix
* fix version number
* fix version number
* fix some bug
* fix bug
3 weeks ago
co63oc
c2dc4dae2d
Fix typos ( #4021 )
...
* Fix
* Fix
* Fix
6 months ago
megemini
9752f0a03b
[Hackathon 7th] 修复 `asr5` 的 `test.sh` 脚本路径错误 ( #3941 )
...
* [Fix] test parsearg and test.sh
* [Update] use short path
9 months ago
Wang Xin
c11b19df90
[Hackathon 7th] updating the paths to utility scripts and modifying method parameters ( #3942 )
...
* fix path error
* Update examples/aishell/asr0/local/test.sh
* remove some broken symbolic link
9 months ago
megemini
b4c2f3bae3
[Hackathon 7th] 修复 `s2t` 示例错误 ( #3950 )
...
* [Fix] s2t
* [Fix] s2t test
9 months ago
megemini
2d7cf7f0e6
[Hackathon 7th] 修复 `asr4` 的 `test_wav` 多余的 argument ( #3940 )
...
* [Fix] test_wav parse arg
* [Fix] remove line
10 months ago
张春乔
5069111e6d
[Hackathon 7th] 修复 deepspeech2online 的导出问题 ( #3935 )
...
* Update model.py
* Update model.py
10 months ago
megemini
d17361cf8c
[Fix] duplicated arg ( #3934 )
10 months ago
megemini
890c87ea93
[Fix] import TimeDomainSpecAugment ( #3919 )
10 months ago
megemini
5e8c727fd6
[Hackathon 7th] 修复 `tal_cs` 测试中 0D tensor to 1D ( #3913 )
...
* [Fix] 0D tensor to 1D
* [Update] feat dim
10 months ago
megemini
a397ebe207
[Fix] import print_arguments ( #3918 )
10 months ago
张春乔
7dc806dc1d
run with aishell/asr3 ( #3904 )
10 months ago
张春乔
419af4503c
【Hackathon 7th】Remove parser.add_argument ( #3878 )
...
* Update test_wav.py
* Update export.py
* Update test_export.py
* Update model.py
* Update README.md
* Apply suggestions from code review
* Apply suggestions from code review
* Update README.md
* Update README.md
* Update test.py
* Update README.md
10 months ago
张春乔
290d161d8a
【Hackathon 7th】add implemention of strtobool ( #3877 )
...
* add implemention of strtobool
* Apply suggestions from code review
* Apply suggestions from code review
* Apply suggestions from code review
10 months ago
Hui Zhang
a2ae6396ef
old grad clip has 0d tensor problem, fix it ( #3334 )
2 years ago
jiamingkong
8432e8626f
Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
2 years ago
jiamingkong
60bd7f202e
Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong
3b6651ba7c
Adding WavLM implementation
2 years ago
TianHao Zhang
12e3e76092
[ASR] Support Hubert, fintuned on the librispeech dataset ( #3088 )
...
* librispeech hubert, test=asr
* librispeech hubert, test=asr
* hubert decode
* review
* copyright, notes, example related
* hubert cli
* pre-commit format
* fix conflicts
* fix conflicts
* doc related
* doc and train config
* librispeech.py
* support hubert cli
2 years ago
Hui Zhang
8371d14f5d
Merge pull request #3167 from zxcd/amp
...
[ASR] add amp for U2 conformer
2 years ago
Hui Zhang
225737d4e3
[s2t] fix cli args to config ( #3194 )
...
* fix cli args to config
* fix train cli
2 years ago
zxcd
bc365cbb52
Merge branch 'develop' into amp
2 years ago
Hui Zhang
df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset ( #3189 )
...
* move s2t data preprocess into paddlespeech.dataset
* avg model, compute wer, format rsl into paddlespeech.dataset
* fix format rsl
* fix avg ckpts
2 years ago
zxcd
a1e5f27003
mv scaler.unscale_ blow grad_clip.
2 years ago
zxcd
7399d560e7
fix scaler save and load.
2 years ago
zxcd
fbd27aab41
add amp for U2 conformer.
2 years ago
zxcd
9bf5471613
optional tokenizer and fix some doc. ( #3042 )
3 years ago
zxcd
4e9bca177a
[ASR] change optimizer and fix import error, test=asr ( #3023 )
...
* mv dataio.py to s2t.io.speechbrain.dataio
mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio
* remove transformers import.
* change optimizer same with released model
* add paddlenlp version in RESULT.md.
* fix run.sh
* fix data.sh step_num.
* add adadelta optimizer config.
* fix wav2vec2 test_wav.sh run error.
* add tokenizer config.
3 years ago
zxcd
5186319f48
fix load model schedule error, config optional. ( #3008 )
3 years ago
zxcd
004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr ( #2929 )
...
* rm transformers import and modify variable name consistent with infer.py
* add condition ctc_prefix_beam_search decode.
3 years ago
zxcd
047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
3 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
...
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
3 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
3 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
3 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
...
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
3 years ago
Hui Zhang
2c34481ea0
[s2t] quant with wav scp ( #2568 )
...
* add quant hint
* add paddleslim
* using paddleslim 2.3.4 and paddle 2.4
3 years ago
Zth9730
8d3464c050
[s2t] Update wav2vec2 license ( #2600 )
3 years ago
YangZhou
bbf2401e3e
Merge pull request #2524 from zh794390558/u2
...
[speechx] add u2/u2pp asr inference
3 years ago
Hui Zhang
964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
...
[s2t] fix wav2vec2 report loss bug
3 years ago
tianhao zhang
86f65f0b8e
fix wav2vec2 report loss bug
3 years ago
Hui Zhang
f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
...
[ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech
3 years ago
tianhao zhang
2ae94bd277
freeze wav2vec2=True, change loss report and update README.md
3 years ago
tianhao zhang
3d994f5c23
format wav2vec2 demo
3 years ago
tianhao zhang
19180d359d
format wav2vec2 demo
3 years ago
tianhao zhang
6e429f0513
support wav2vec2ASR on librispeech
3 years ago
Hui Zhang
290c23b9d7
add u2 nnet, u2 nnet main, codelab, and can compile
3 years ago
tianhao zhang
cda440e6f0
use reverse_weight in decode.yaml
3 years ago
Hui Zhang
1f4f98b171
fix bug
3 years ago
Hui Zhang
e86337a423
fix bug
3 years ago
Hui Zhang
2a75405e9a
Merge branch 'develop' into u2pp_export
3 years ago