zhuyipin
a9ece28ba6
speedyspeech code adapt for mlu ( #3828 )
...
* speedyspeech code adapt for mlu
* fix inference
* fix help message
1 year ago
zhuyipin
0b568136d9
speedyspeech code adapt for npu ( #3804 )
...
* speedyspeech code adapt for npu
* fix npu inference
* fix e2e synthesize
* add paddle version control for memory optim config
* fix code style
* fix code style
* fix help message
* fix code style
* fix help message
1 year ago
Kai Song
9a50647592
Add XPU support for FastSpeech2 ( #3514 )
...
* Add XPU support for FastSpeech2
* optimize
2 years ago
Kai Song
08599b7300
Add XPU support for SpeedySpeech ( #3502 )
...
* Add XPU support for SpeedySpeech
* fix typos
* update description of nxpu
2 years ago
Hui Zhang
596f71407c
update result
2 years ago
Hui Zhang
0a5cc5556e
rope for streaming decoding
2 years ago
Hui Zhang
3b6b680771
add roformer result
2 years ago
Hui Zhang
55870ffbb3
fix bugs
2 years ago
Hui Zhang
03e9ea9e52
add roformer
2 years ago
Hui Zhang
6b4d1f80ac
add t2s assets
2 years ago
Hui Zhang
83d93da8d0
add scripts for tts code switch
2 years ago
Hui Zhang
2376c14d7b
fix ckpt
2 years ago
Hui Zhang
6e7c71b26c
refactor rhy
2 years ago
Hui Zhang
2fe97f2e3a
Merge pull request #3292 from zh794390558/mfa
...
refactor mfa scripts
2 years ago
jiamingkong
8432e8626f
Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
2 years ago
Hui Zhang
c9ddc4f832
refactor mfa scripts
2 years ago
jiamingkong
927c60a5c1
Deleted examples/librispeech/asr5/format_rsl.py
2 years ago
jiamingkong
232dcf8660
Adapted wavlmASR model to pretrained weights and CLI
2 years ago
jiamingkong
9ee1205d25
Changed the path for the uploaded weight
2 years ago
jiamingkong
60bd7f202e
Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
jiamingkong
3b6651ba7c
Adding WavLM implementation
2 years ago
TianHao Zhang
12e3e76092
[ASR] Support Hubert, fintuned on the librispeech dataset ( #3088 )
...
* librispeech hubert, test=asr
* librispeech hubert, test=asr
* hubert decode
* review
* copyright, notes, example related
* hubert cli
* pre-commit format
* fix conflicts
* fix conflicts
* doc related
* doc and train config
* librispeech.py
* support hubert cli
2 years ago
Hui Zhang
8371d14f5d
Merge pull request #3167 from zxcd/amp
...
[ASR] add amp for U2 conformer
2 years ago
Hui Zhang
e3dcfa8815
Merge pull request #3186 from PaddlePaddle/vits_pr
...
[TTS]update lr schedulers from per iter to per epoch for VITS
2 years ago
zxcd
bc365cbb52
Merge branch 'develop' into amp
2 years ago
zxcd
9d8660b2f6
add new aishell model for better CER.
2 years ago
WongLaw
305375c310
VITS learning rate revised, test=tts
2 years ago
WongLaw
fdeb9b88a7
VITS learning rate revised, test=tts
2 years ago
TianYuan
fc670339d1
[TTS]Fix losses of StarGAN v2 VC ( #3184 )
2 years ago
ljhzxc
84cc5fc98f
Update pretrained model in README ( #3193 )
2 years ago
Hui Zhang
df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset ( #3189 )
...
* move s2t data preprocess into paddlespeech.dataset
* avg model, compute wer, format rsl into paddlespeech.dataset
* fix format rsl
* fix avg ckpts
2 years ago
Shuangchi He
8c7859d3bc
Fix some typos. ( #3178 )
...
Signed-off-by: Yulv-git <yulvchi@qq.com>
2 years ago
WongLaw
414de3747c
VITS learning rate revised, test=tts
2 years ago
TianYuan
3ad55a31e7
[TTS]StarGANv2 VC fix some trainer bugs, add add reset_parameters ( #3182 )
2 years ago
ljhzxc
dc56c3a10e
[TTS] [黑客松]Add JETS ( #3109 )
2 years ago
TianYuan
bd0d69ca74
[TTS]add StarGANv2VC preprocess ( #3163 )
2 years ago
lemondy
9c387577fd
fix example/aishell local/train.sh if condition bug, test=asr ( #3146 )
2 years ago
longRookie
df37798598
[TTS]【Hackathon + No.190】 + 模型复现:iSTFTNet ( #3006 )
...
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* add iSTFTNet.md
* modify the format of iSTFTNet.md
* modify iSTFT.yaml and hifigan.py
* Format code using pre-commit
* modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv
* update iSTFTNet_csmsc_ckpt.zip download link
* modify iSTFTNet.md
* modify hifigan.py and iSTFT.yaml
* modify iSTFTNet.md
2 years ago
TianYuan
72aa19c32c
[TTS]add starganv2 vc trainer ( #3143 )
...
* add starganv2 vc trainer
* fix StarGANv2VCUpdater and losses
* fix StarGANv2VCEvaluator
* add some typehint
2 years ago
TianYuan
54ef90fcec
[TTS]Fix VITS lite infer ( #3098 )
2 years ago
TianYuan
cc02b007fe
fix paddle2onnx's install version, install the newest paddle2onnx in run.sh ( #3084 )
3 years ago
liangym
880c172db7
[TTS] add svs frontend ( #3062 )
3 years ago
夜雨飘零
31a4562ae8
[ASR]add squeezeformer model ( #2755 )
...
* add squeezeformer model
* change CodeStyle, test=asr
* change CodeStyle, test=asr
* fix subsample rate error, test=asr
* merge classes as required, test=asr
* change CodeStyle, test=asr
* fix missing code, test=asr
* split code to new file, test=asr
* remove rel_shift, test=asr
3 years ago
zxcd
9bf5471613
optional tokenizer and fix some doc. ( #3042 )
3 years ago
liangym
348064de0d
[TTS] add opencpop HIFIGAN example ( #3038 )
...
* add opencpop voc, test=tts
* soft link
* add opencpop hifigan, test=tts
* update
3 years ago
zxcd
4e9bca177a
[ASR] change optimizer and fix import error, test=asr ( #3023 )
...
* mv dataio.py to s2t.io.speechbrain.dataio
mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio
* remove transformers import.
* change optimizer same with released model
* add paddlenlp version in RESULT.md.
* fix run.sh
* fix data.sh step_num.
* add adadelta optimizer config.
* fix wav2vec2 test_wav.sh run error.
* add tokenizer config.
3 years ago
TianYuan
65c3217b8b
Update textnorm_test_cases.txt
3 years ago
liangym
435fc5cc19
[TTS] add opencpop PWGAN example ( #3031 )
...
* add opencpop voc, test=tts
* soft link
3 years ago
liangym
1afd14acd9
[TTS]add Diffsinger with opencpop dataset ( #3005 )
3 years ago
zxcd
3145325b4e
[ASR] add wav2vec2 aishell model result, test=asr ( #3012 )
...
* Create RESULT.md
* add wav2vec2ASR-large-aishell1 finetune model.
* update model link and add readme.
* fix released model info.
3 years ago
TianYuan
528ae58a67
[TTS]remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D ( #3002 )
...
* remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D
* fix variable names
* add note
3 years ago
JiehangXie
59cabdc967
[TTS]Cli Cantonese onnx, test=tts ( #2990 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
3 years ago
TianYuan
f7fd111647
[TTS]add StarGANv2-VC model scripts ( #2842 )
3 years ago
TianYuan
16d75eaa52
Update synthesize_e2e.sh
3 years ago
TianYuan
f31219c63d
Update synthesize_e2e.sh
3 years ago
TianYuan
ad239eb444
[TTS]add VITS inference ( #2972 )
3 years ago
zxcd
5430a366ab
Update RESULTS.md ( #2964 )
3 years ago
TianYuan
84f751f529
[TTS]vits dygraph to static ( #2883 )
...
Co-authored-by: 0x45f <wangzhen45@baidu.com>
3 years ago
HuangLiangJie
11bc392617
[TTS]Canton phonetic fix, test=tts ( #2950 )
3 years ago
TianYuan
a38e3c3187
Update README.md
3 years ago
HuangLiangJie
2553022396
[TTS]Cantonese TTS checkpoint for e2e, test=tts ( #2932 )
3 years ago
HuangLiangJie
1af9bd47d9
[TTS]Cantonese FastSpeech2 e2e infer, test=tts ( #2927 )
3 years ago
lance6716
71bda24437
[TTS]Fix canton ( #2924 )
...
* Update run.sh
* Update README.md
3 years ago
HuangLiangJie
acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts ( #2907 )
3 years ago
HuangLiangJie
c75906462e
[TTS]Cantonese TTS MFA pipeline ( #2899 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
3 years ago
zxcd
047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
3 years ago
HuangLiangJie
3a8ba2e242
[TTS]Update VITS checkpoint , test=tts ( #2887 )
3 years ago
Zhou Wei
16d84367c6
fix Tensor.numpy()[0] to float(Tensor) to adapt 0D ( #2884 )
3 years ago
TianYuan
2f526c093c
fix data for slim ( #2862 )
3 years ago
TianYuan
2b01e40525
[TTS]soft link for shell in example, add skip_copy_wave in norm stage of G… ( #2851 )
...
soft link for shell in example, add skip_copy_wave in norm stage of GANVocoders to save disk
3 years ago
zxcd
e793d267d9
[ASR] add code-switch asr tal_cs recipe ( #2796 )
...
* add tal_cs asr recipe.
* add readme and result, and fix some bug.
* add commit id and date.
3 years ago
TianYuan
355517d1ca
Update run.sh
3 years ago
TianYuan
553fefff9b
Update run_en.sh
3 years ago
TianYuan
59d4b8a2a8
Update run_mix.sh
3 years ago
TianYuan
3f6afc4834
[TTS]Add slim for TTS ( #2729 )
3 years ago
YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
...
* split paddlespeech/audio to paddleaudio.
* add sox io ,sox effect, kaldi native fbank to paddleaudio.
3 years ago
TianYuan
001b22cf41
Update README.md
3 years ago
HuangLiangJie
a874d8f325
Add prosody prediction in synthesize_e2e, test=tts ( #2693 )
3 years ago
TianYuan
0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference ( #2667 )
...
* Add export2lite, test=tts
* add tts paddlelite x86 inference, test=tts
* update released_model.md, test=tts
* add paddlelite in setup.py
* update
3 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
3 years ago
liangym
f110928195
[tts ] Chmod run_mix.sh ( #2647 )
3 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
...
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
3 years ago
TianYuan
0727984873
Add export2lite, test=tts ( #2636 )
3 years ago
HuangLiangJie
b7312e9f0b
Revised TN qualifier for measure notation, test=tts ( #2629 )
3 years ago
HuangLiangJie
f43d026000
Add rhythm tags for MFA, test=tts ( #2615 )
...
* Add rhythm tags for MFA, test=tts
3 years ago
TianYuan
d1e65a4781
Update README.md
3 years ago
TianYuan
39df6b14e5
Update README.md
3 years ago
TianYuan
46fd67e2ce
Update README.md
3 years ago
liangym
d622b8bc5f
[tts] update finetune readme ( #2625 )
...
* updata readme, test=doc
* update yaml and readme, test=tts
* fix batch_size, test=tts
* update readme, test=doc
3 years ago
Hui Zhang
2c34481ea0
[s2t] quant with wav scp ( #2568 )
...
* add quant hint
* add paddleslim
* using paddleslim 2.3.4 and paddle 2.4
3 years ago
liangym
e18170228c
[tts] add adversarial loss ( #2588 )
3 years ago
YangZhou
294d83da82
add kws release model ( #2585 )
...
add kws release model.
3 years ago
TianYuan
91fa195b8b
Update README.md
3 years ago
WongLaw
d27364d141
Added Text Rhythm Prediction, test=tts
3 years ago
HuangLiangJie
872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
3 years ago
YangZhou
bbf2401e3e
Merge pull request #2524 from zh794390558/u2
...
[speechx] add u2/u2pp asr inference
3 years ago
WongLaw
72bbabbf79
Revised structure of rhythm prediction, test=tts
3 years ago
Hui Zhang
caa1dd84d4
Merge pull request #2559 from Zth9730/fix_u2pp_model
...
[doc] update wenetspeech u2pp RESULT.md
3 years ago
tianhao zhang
ce153d915e
update u2pp result.md
3 years ago
WongLaw
fd00a22e37
Revised Rhythm Prediction Function, test=tts
3 years ago