TianYuan
a38e3c3187
Update README.md
2 years ago
HuangLiangJie
2553022396
[TTS]Cantonese TTS checkpoint for e2e, test=tts ( #2932 )
2 years ago
HuangLiangJie
1af9bd47d9
[TTS]Cantonese FastSpeech2 e2e infer, test=tts ( #2927 )
2 years ago
zxcd
004a4d6096
[ASR] rm transformers import and make variable names consistent with infer.py, test=asr ( #2929 )
* rm transformers import and make variable names consistent with infer.py
* add condition ctc_prefix_beam_search decode.
2 years ago
lance6716
71bda24437
[TTS]Fix canton ( #2924 )
* Update run.sh
* Update README.md
2 years ago
TianYuan
9db75af27d
Update README_cn.md
2 years ago
Bing Wong
afcf623351
Update README_cn.md, FastAPI-PaddleSpeech-Audio-To-Text ( #2922 )
2 years ago
zxcd
17a7ebddfa
fix dist_sampler AttributeError ( #2918 )
2 years ago
HuangLiangJie
acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts ( #2907 )
2 years ago
HuangLiangJie
c75906462e
[TTS]Cantonese TTS MFA pipeline ( #2899 )
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
zxcd
047092de8e
add wav2vec2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
2 years ago
ZhengZZZ
66a9cf8ebc
modify readme ( #2915 )
2 years ago
艾梦
bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result ( #2902 )
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
* add diffusion noise clip to optimize sample result
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
Hui Zhang
8cbf6a2c9a
Update setup.py ( #2879 )
2 years ago
zxcd
f6b624ddc8
add encoding=utf8 for text cli. ( #2896 )
2 years ago
TianYuan
2f8aad95e0
Update .mergify.yml
2 years ago
TianYuan
b1d0658ef8
Update stale.yml
2 years ago
YangZhou
8e1b4cd513
[engine] rename speechx ( #2892 )
* rename speechx
* fix wfst decode error
* replace reset with make_unique
2 years ago
YangZhou
21183d48b6
add wfst decoder ( #2886 )
2 years ago
HuangLiangJie
3a8ba2e242
[TTS]Update VITS checkpoint , test=tts ( #2887 )
2 years ago
Zhou Wei
16d84367c6
fix Tensor.numpy()[0] to float(Tensor) to adapt 0D ( #2884 )
2 years ago
TianYuan
089c060756
fix pwgan tipc ( #2882 )
2 years ago
TianYuan
6b00ad6064
[Install]clean dependencies ( #2871 )
* clean dependencies
* update paddleaudio's version
* rm dependency in librosa and paddlenlp
* rm dependency in paddlepaddle
* rm dependency in speech_web
2 years ago
章宏彬
c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment ( #2860 )
* Avoid using variable "attn_loss" before assignment
* Update tacotron2_updater.py
---------
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
a283f8a57e
[TTS]fix open encoding ( #2865 )
2 years ago
艾梦
a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param ( #2868 )
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
2 years ago
TianYuan
896da6dcd1
remove utils and third_party in paddlespeech's site-packages ( #2867 )
2 years ago
YangZhou
5042a1686a
[speechx] add batch recognizer decode. ( #2866 )
* add recognizer_batch
2 years ago
QuanZ9
ac3ed3c5a8
Update zh_frontend.py ( #2863 )
2 years ago
TianYuan
2f526c093c
fix data for slim ( #2862 )
2 years ago
zxcd
64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). ( #2859 )
2 years ago
TianYuan
b5764e9f74
[Install]rm protobuf in setup.py ( #2853 )
* rm protobuf in setup.py && rm audio's dependencies in setup.py
2 years ago
zxcd
31c2c226ca
clean fluid elementwise_max and square api. ( #2852 )
2 years ago
TianYuan
2b01e40525
[TTS]soft link for shell in example, add skip_copy_wave in norm stage of G… ( #2851 )
soft link for shell in example, add skip_copy_wave in norm stage of GANVocoders to save disk
2 years ago
HuangLiangJie
140aed4b54
[TTS]VITS init sampler reverse, test=tts ( #2843 )
2 years ago
YangZhou
8a225b1708
[speechx] thread decode ( #2839 )
* fix nnet thread crash && rescore cost time
* add nnet thread main
2 years ago
Ming
478fd2593e
update QR Code in README, test=doc ( #2841 )
2 years ago
TianYuan
2f3ca4ac48
Update README.md ( #2840 )
* Update README.md
* Update README_cn.md
2 years ago
艾梦
57b9d4bca4
add diffusion module for training diffsinger ( #2832 )
2 years ago
TianYuan
1fd38c0e8b
fix o ( #2831 )
2 years ago
晋东毅
742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese ( #2830 )
* Add .history
* [tts]Add Chinese SSML support for mixed Chinese-English speech synthesis
2 years ago
cxumol
a99244d86e
fix: whisper language choice, test=asr ( #2828 )
2 years ago
zxcd
ad40dafa85
fix some bugs. ( #2825 )
2 years ago
YangZhou
ee7c266f13
[speechx] rm openblas && refactor kaldi-matrix, kaldi-vector ( #2824 )
* rm openblas && refactor kaldi-matrix kaldi-vector
2 years ago
HuangLiangJie
faa2f86651
[TTS]update VITS init method ( #2809 )
2 years ago
zxcd
88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' ( #2816 )
* add asr code-switch cli and demo.
* fix some model naming problems.
2 years ago
YangZhou
2c4c141de5
[audio] fix load paddleaudio fail ( #2815 )
* fix paddleaudio import fail
2 years ago
zxcd
e793d267d9
[ASR] add code-switch asr tal_cs recipe ( #2796 )
* add tal_cs asr recipe.
* add readme and result, and fix some bugs.
* add commit id and date.
2 years ago
YangZhou
25dcad3de7
update paddleaudio readme, test=doc ( #2801 )
2 years ago