Commit Graph

1096 Commits (bcd8e309ec3fade62971067de6d5607027c254e4)

Author SHA1 Message Date
艾梦 bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result (#2902)
2 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
2 years ago
zxcd f6b624ddc8
add encoding=utf8 for text cli. (#2896)
2 years ago
章宏彬 c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment (#2860)
2 years ago
TianYuan a283f8a57e
[TTS]fix open encoding (#2865)
2 years ago
艾梦 a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param (#2868)
2 years ago
QuanZ9 ac3ed3c5a8
Update zh_frontend.py (#2863)
2 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
2 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
2 years ago
HuangLiangJie 140aed4b54
[TTS]VITS init sampler reverse, test=tts (#2843)
2 years ago
艾梦 57b9d4bca4
add diffusion module for training diffsinger (#2832)
2 years ago
TianYuan 1fd38c0e8b
fix o (#2831)
2 years ago
晋东毅 742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese (#2830)
2 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
2 years ago
zxcd ad40dafa85
fix some bug. (#2825)
2 years ago
HuangLiangJie faa2f86651
[TTS]update VITS init method (#2809)
2 years ago
zxcd 88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' (#2816)
2 years ago
HuangLiangJie 964211a81b
Change optimizer for vits, test=tts (#2791)
2 years ago
liangym 96d76c83ad
multi-spk tts static model (#2779)
2 years ago
HuangLiangJie 2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths (#2770)
2 years ago
TianYuan 6725bcd823
revise paddlenlp's version (#2767)
2 years ago
TianYuan 979bbd9dcb
add mkldnn and trt config for paddleInference (#2748)
2 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
2 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
2 years ago
TianYuan 3f6afc4834
[TTS]Add slim for TTS (#2729)
2 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
2 years ago
HuangLiangJie a874d8f325
Add prosody prediction in synthesize_e2e, test=tts (#2693)
2 years ago
TianYuan 62357d876c
[TTS]rm paddlelite in setup.py (#2713)
2 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
2 years ago
David An (An Hongliang) bd01bc155d
add greek char and fix issue2571 (#2683)
2 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
2 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
2 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
2 years ago
TianYuan 0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference (#2667)
2 years ago
David An (An Hongliang) 1c3d2cb89e
add double byte char for zh normalization (#2661)
2 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
2 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
2 years ago
kFoodie dc9d3baf51
Update onnx_api.py (#2664)
2 years ago
liangym 25b6bf9668
[tts] Add male voice for tts (#2660)
2 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
2 years ago
HuangLiangJie b7312e9f0b
Revised TN qualifier for measure notation, test=tts (#2629)
2 years ago
Zth9730 e6d20888c5
Changes needed to support 0-dimensional Tensors (#2621)
2 years ago
David An (An Hongliang) 8a5fe83e1d
add ssml sentences.txt (#2620)
2 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
2 years ago
liangym e18170228c
[tts] add adversarial loss (#2588)
2 years ago
TianYuan 9aab706cba
fix frontend bug, test=tts (#2606)
2 years ago
WongLaw e348aa825d
Added Rhythm Prediction, test=tts
2 years ago
WongLaw b96fb1d57e
Added Rythm Prediction, test=tts
2 years ago
WongLaw d27364d141
Added Text Rhythm Prediction, test=tts
2 years ago