Commit Graph

1278 Commits (develop)

Author SHA1 Message Date
Hui Zhang 9727e67a3f add ssml unit test
2 years ago
Hui Zhang 4d867700eb move ssl into t2s.frontend; fix spk_id for 0-D tensor;
2 years ago
Hui Zhang 42f2186d71 more comment on tts frontend
2 years ago
Hui Zhang 8aa9790c75
Merge pull request #3305 from zh794390558/tts
2 years ago
Hui Zhang 46de1b0379
Merge pull request #3268 from shuishu/patch-1
2 years ago
Hui Zhang 6b4d1f80ac add t2s assets
2 years ago
zxcd 9b8ac050de
add dtype param for arange API. (#3302)
2 years ago
Hui Zhang 6e7c71b26c refactor rhy
2 years ago
jiamingkong 8432e8626f Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
2 years ago
jiamingkong ba874db5dc Fixed the transpose usages ignored before
2 years ago
jiamingkong 0e2068e2cf Code clean up for CIs
2 years ago
jiamingkong 3ef28dee45
Merge branch 'PaddlePaddle:develop' into develop
2 years ago
Hui Zhang 4453430ac0
Merge pull request #3265 from zoooo0820/fix_0d_error
2 years ago
jiamingkong 2ea00755f7 Changed the MD5 of the pretrained tar file due to bug fixes
2 years ago
jiamingkong 232dcf8660 Adapted wavlmASR model to pretrained weights and CLI
2 years ago
shuishu 1f7eabee0f
Update phonecode.py
2 years ago
zoooo0820 17f2944a17 fix error in tts/st
2 years ago
jiamingkong 60bd7f202e Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242
2 years ago
zxcd b1b8859290 fix model m5s
2 years ago
jiamingkong 3b6651ba7c Adding WavLM implementation
2 years ago
guanyc 5f53e902e1
fix: 🐛 fix server-side python ASREngine being unable to use the conformer_talcs model (#3230)
2 years ago
zxcd caca8e2f12
[ASR] fix asr 0-d tensor. (#3214)
2 years ago
TianHao Zhang 12e3e76092
[ASR] Support Hubert, fine-tuned on the librispeech dataset (#3088)
2 years ago
Hui Zhang 8371d14f5d
Merge pull request #3167 from zxcd/amp
2 years ago
Hui Zhang 225737d4e3
[s2t] fix cli args to config (#3194)
2 years ago
Hui Zhang e3dcfa8815
Merge pull request #3186 from PaddlePaddle/vits_pr
2 years ago
zxcd bc365cbb52
Merge branch 'develop' into amp
2 years ago
zxcd 9d8660b2f6 add new aishell model for better CER.
2 years ago
WongLaw 305375c310 VITS learning rate revised, test=tts
2 years ago
WongLaw fdeb9b88a7 VITS learning rate revised, test=tts
2 years ago
TianYuan fc670339d1
[TTS]Fix losses of StarGAN v2 VC (#3184)
2 years ago
Hui Zhang df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset (#3189)
2 years ago
Shuangchi He 8c7859d3bc
Fix some typos. (#3178)
2 years ago
Hui Zhang 35d874c532
[s2t] mv dataset into paddlespeech.dataset (#3183)
2 years ago
WongLaw 47e31f46cb VITS learning rate revised, test=tts
2 years ago
WongLaw 414de3747c VITS learning rate revised, test=tts
2 years ago
TianYuan 3ad55a31e7
[TTS]StarGANv2 VC fix some trainer bugs, add reset_parameters (#3182)
2 years ago
PiaoYang 5a0103b2ae
[BUG] Fix progress bar unit. (#3177)
2 years ago
ljhzxc dc56c3a10e
[TTS] [Hackathon]Add JETS (#3109)
2 years ago
TianYuan bd0d69ca74
[TTS]add StarGANv2VC preprocess (#3163)
2 years ago
zxcd a1e5f27003 mv scaler.unscale_ below grad_clip.
2 years ago
zxcd 7399d560e7 fix scaler save and load.
2 years ago
zxcd 2f4414a5f8 fix scaler save
2 years ago
zxcd fbd27aab41 add amp for U2 conformer.
2 years ago
TianYuan c7d24ba42c
fix some preprocess bugs (#3155)
2 years ago
longRookie df37798598
[TTS][Hackathon + No.190] + Model reproduction: iSTFTNet (#3006)
2 years ago
TianYuan 72aa19c32c
[TTS]add starganv2 vc trainer (#3143)
2 years ago
TianYuan 54ef90fcec
[TTS]Fix VITS lite infer (#3098)
2 years ago
liangym e83b491c34
rm unused dep, test=tts (#3097)
2 years ago
TianYuan 6894a2a77d
[TTS]fix elementwise_floordiv's fill_constant (#3075)
2 years ago
TianYuan 0a2e367ff4
[TTS]clean starganv2 vc model code and add docstring (#2987)
2 years ago
liangym 880c172db7
[TTS] add svs frontend (#3062)
2 years ago
TianYuan d5720e4e7b
fix input dtype of elementwise_mul op from bool to int64 (#3054)
3 years ago
夜雨飘零 31a4562ae8
[ASR]add squeezeformer model (#2755)
3 years ago
zxcd 9bf5471613
optional tokenizer and fix some doc. (#3042)
3 years ago
TianYuan 706a68bde9
fix dtype diff of last expand_v2 op of VITS (#3041)
3 years ago
liangym 348064de0d
[TTS] add opencpop HIFIGAN example (#3038)
3 years ago
zxcd 4e9bca177a
[ASR] change optimizer and fix import error, test=asr (#3023)
3 years ago
liangym 435fc5cc19
[TTS] add opencpop PWGAN example (#3031)
3 years ago
TianYuan 271112ca69
fix vits reduce_sum's input/output dtype, test=tts (#3028)
3 years ago
liangym 1afd14acd9
[TTS]add Diffsinger with opencpop dataset (#3005)
3 years ago
MistEO 319c805968
[TTS] Support set device id for tts prediction, test=tts (#3019)
3 years ago
zxcd 3145325b4e
[ASR] add wav2vec2 aishell model result, test=asr (#3012)
3 years ago
zxcd 5186319f48
fix load model schedule error, config optional. (#3008)
3 years ago
TianYuan 528ae58a67
[TTS]remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D (#3002)
3 years ago
JiehangXie 59cabdc967
[TTS]Cli Cantonese onnx, test=tts (#2990)
3 years ago
mooncake c02bc087f6
rearrange-encoder-infer-param (#2983)
3 years ago
TianYuan f7fd111647
[TTS]add StarGANv2-VC model scripts (#2842)
3 years ago
HuangLiangJie c8196d45ae
[TTS]Canton CLI, test=tts (#2977)
3 years ago
TianYuan ad239eb444
[TTS]add VITS inference (#2972)
3 years ago
TianYuan 84f751f529
[TTS]vits dygraph to static (#2883)
3 years ago
HuangLiangJie 11bc392617
[TTS]Canton phonetic fix, test=tts (#2950)
3 years ago
TianYuan c8d5a01bdb
[TTS]fix dygraph to static for tacotron2, test=doc (#2426)
3 years ago
liangym d9b041e999
[TTS]Cli male onnx (#2945)
3 years ago
zxcd dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr (#2944)
3 years ago
JiehangXie a5c0bffd2a
add Cantonese test examples (#2937)
3 years ago
zxcd a8a353d0ac
[ASR] add python simple adadelta optimizer, test=asr (#2925)
3 years ago
HuangLiangJie 1af9bd47d9
[TTS]Cantonese FastSpeech2 e2e infer, test=tts (#2927)
3 years ago
zxcd 004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929)
3 years ago
zxcd 17a7ebddfa
fix dist_sampler AttributeError (#2918)
3 years ago
HuangLiangJie acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts (#2907)
3 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
3 years ago
艾梦 bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result (#2902)
3 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
3 years ago
zxcd f6b624ddc8
add encoding=utf8 for text cli. (#2896)
3 years ago
章宏彬 c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment (#2860)
3 years ago
TianYuan a283f8a57e
[TTS]fix open encoding (#2865)
3 years ago
艾梦 a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param (#2868)
3 years ago
QuanZ9 ac3ed3c5a8
Update zh_frontend.py (#2863)
3 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
3 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
3 years ago
HuangLiangJie 140aed4b54
[TTS]VITS init sampler reverse, test=tts (#2843)
3 years ago
艾梦 57b9d4bca4
add diffusion module for training diffsinger (#2832)
3 years ago
TianYuan 1fd38c0e8b
fix o (#2831)
3 years ago
晋东毅 742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese (#2830)
3 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
3 years ago
zxcd ad40dafa85
fix some bug. (#2825)
3 years ago
HuangLiangJie faa2f86651
[TTS]update VITS init method (#2809)
3 years ago
zxcd 88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' (#2816)
3 years ago
HuangLiangJie 964211a81b
Change optimizer for vits, test=tts (#2791)
3 years ago
liangym 96d76c83ad
multi-spk tts static model (#2779)
3 years ago
HuangLiangJie 2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths (#2770)
3 years ago
TianYuan 6725bcd823
revise paddlenlp's version (#2767)
3 years ago
TianYuan 979bbd9dcb
add mkldnn and trt config for paddleInference (#2748)
3 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
3 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
3 years ago
TianYuan 3f6afc4834
[TTS]Add slim for TTS (#2729)
3 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
3 years ago
HuangLiangJie a874d8f325
Add prosody prediction in synthesize_e2e, test=tts (#2693)
3 years ago
TianYuan 62357d876c
[TTS]rm paddlelite in setup.py (#2713)
3 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
3 years ago
David An (An Hongliang) bd01bc155d
add greek char and fix issue2571 (#2683)
3 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
3 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
3 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
3 years ago
TianYuan 0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference (#2667)
3 years ago
David An (An Hongliang) 1c3d2cb89e
add double byte char for zh normalization (#2661)
3 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
3 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
3 years ago
kFoodie dc9d3baf51
Update onnx_api.py (#2664)
3 years ago
liangym 25b6bf9668
[tts] Add male voice for tts (#2660)
3 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
3 years ago
HuangLiangJie b7312e9f0b
Revised TN qualifier for measure notation, test=tts (#2629)
3 years ago
Zth9730 e6d20888c5
Changes needed to support 0-D Tensor (#2621)
3 years ago
David An (An Hongliang) 8a5fe83e1d
add ssml sentences.txt (#2620)
3 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
3 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
3 years ago
liangym e18170228c
[tts] add adversarial loss (#2588)
3 years ago
TianYuan 9aab706cba
fix frontend bug, test=tts (#2606)
3 years ago
WongLaw e348aa825d Added Rhythm Prediction, test=tts
3 years ago
WongLaw b96fb1d57e Added Rhythm Prediction, test=tts
3 years ago
WongLaw d27364d141 Added Text Rhythm Prediction, test=tts
3 years ago
HuangLiangJie 872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
3 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
3 years ago
WongLaw 72bbabbf79 Revised structure of rhythm prediction, test=tts
3 years ago
david.95 ed0138c6e3 add condition check if a ssml input and filter space line, test=tts
3 years ago
David An (An Hongliang) 21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
TianYuan 63c80121e2 fix uvicorn's bug
3 years ago
TianYuan 2a60c3d854
Merge pull request #2554 from dahu1/develop
3 years ago
david.95 3ac7ac253f fix review issue,test=tts
3 years ago
David An (An Hongliang) 0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
Zth9730 68134c8436
fix u2pp model (#2549)
3 years ago
dahu1 cb76e66401 1. do not hard-code the token config, 2. display text without garbled characters, test=asr
3 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
3 years ago
tianhao zhang 1ea828c30e fix attention val bug
3 years ago
David An (An Hongliang) 103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
TianYuan 2d71577e75
fix g2p (#2539)
3 years ago
david.95 f295d2d445 remove useless code
3 years ago
david.95 89e9ea69eb modify __init__
3 years ago
david.95 1067088deb modify __init__
3 years ago