TianYuan
9db75af27d
Update README_cn.md
2 years ago
Bing Wong
afcf623351
Update README_cn.md, FastAPI-PaddleSpeech-Audio-To-Text ( #2922 )
2 years ago
zxcd
17a7ebddfa
fix dist_sampler AttributeError ( #2918 )
2 years ago
HuangLiangJie
acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts ( #2907 )
2 years ago
HuangLiangJie
c75906462e
[TTS]Cantonese TTS MFA pipeline ( #2899 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
zxcd
047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
2 years ago
ZhengZZZ
66a9cf8ebc
modify readme ( #2915 )
2 years ago
艾梦
bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result ( #2902 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
* add diffusion noise clip to optimize sample result
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
Hui Zhang
8cbf6a2c9a
Update setup.py ( #2879 )
2 years ago
zxcd
f6b624ddc8
add encoding=utf8 for text cli. ( #2896 )
2 years ago
TianYuan
2f8aad95e0
Update .mergify.yml
2 years ago
TianYuan
b1d0658ef8
Update stale.yml
2 years ago
YangZhou
8e1b4cd513
[engine] rename speechx ( #2892 )
...
* rename speechx
* fix wfst decode error
* replace reset with make_unique
2 years ago
YangZhou
21183d48b6
add wfst decoder ( #2886 )
2 years ago
HuangLiangJie
3a8ba2e242
[TTS]Update VITS checkpoint , test=tts ( #2887 )
2 years ago
Zhou Wei
16d84367c6
fix Tensor.numpy()[0] to float(Tensor) to adapt 0D ( #2884 )
2 years ago
TianYuan
089c060756
fix pwgan tipc ( #2882 )
2 years ago
TianYuan
6b00ad6064
[Install]clean dependencies ( #2871 )
...
* clean dependencies
* update paddleaudio's version
* rm dependency in librosa and paddlenlp
* rm dependency in paddlepaddle
* rm dependency in speech_web
2 years ago
章宏彬
c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment ( #2860 )
...
* Avoid using variable "attn_loss" before assignment
* Update tacotron2_updater.py
---------
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
a283f8a57e
[TTS]fix open encoding ( #2865 )
2 years ago
艾梦
a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param ( #2868 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
2 years ago
TianYuan
896da6dcd1
remove utils and third_party in paddlespeech's site-packages ( #2867 )
2 years ago
YangZhou
5042a1686a
[speechx] add batch recognizer decode. ( #2866 )
...
* add recognizer_batch
2 years ago
QuanZ9
ac3ed3c5a8
Update zh_frontend.py ( #2863 )
2 years ago
TianYuan
2f526c093c
fix data for slim ( #2862 )
2 years ago
zxcd
64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). ( #2859 )
2 years ago
TianYuan
b5764e9f74
[Install]rm protobuf in setup.py ( #2853 )
...
* rm protobuf in setup.py && rm audio's dependances in setup.py
2 years ago
zxcd
31c2c226ca
clean fluid elementwise_max and square api. ( #2852 )
2 years ago
TianYuan
2b01e40525
[TTS]soft link for shell in example, add skip_copy_wave in norm stage of G… ( #2851 )
...
soft link for shell in example, add skip_copy_wave in norm stage of GANVocoders to save disk
2 years ago
HuangLiangJie
140aed4b54
[TTS]VITS init sampler reverse, test=tts ( #2843 )
2 years ago
YangZhou
8a225b1708
[speechx] thread decode ( #2839 )
...
* fix nnet thread crash && rescore cost time
* add nnet thread main
2 years ago
Ming
478fd2593e
update QR Code in README, test=doc ( #2841 )
2 years ago
TianYuan
2f3ca4ac48
Update README.md ( #2840 )
...
* Update README.md
* Update README_cn.md
2 years ago
艾梦
57b9d4bca4
add diffusion module for training diffsinger ( #2832 )
2 years ago
TianYuan
1fd38c0e8b
fix o ( #2831 )
2 years ago
晋东毅
742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese ( #2830 )
...
* 添加.history
* [tts]添加中英混合语音合成时对中文SSML的支持
2 years ago
cxumol
a99244d86e
fix: whisper language choice, test=asr ( #2828 )
2 years ago
zxcd
ad40dafa85
fix some bug. ( #2825 )
2 years ago
YangZhou
ee7c266f13
[speechx] rm openblas && refactor kaldi-matrix, kaldi-vector ( #2824 )
...
* rm openblas && refactor kaldi-matrix kaldi-vector
2 years ago
HuangLiangJie
faa2f86651
[TTS]update VITS init method ( #2809 )
2 years ago
zxcd
88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' ( #2816 )
...
* add asr code-switch cli and demo.
* fix some model named problem.
2 years ago
YangZhou
2c4c141de5
[audio] fix load paddleaudio fail ( #2815 )
...
* fix paddleaudio import fail
2 years ago
zxcd
e793d267d9
[ASR] add code-switch asr tal_cs recipe ( #2796 )
...
* add tal_cs asr recipe.
* add readme and result, and fix some bug.
* add commit id and date.
2 years ago
YangZhou
25dcad3de7
update paddleaudio readme, test=doc ( #2801 )
2 years ago
TianYuan
4a11302e35
Update setup.py
2 years ago
YangZhou
d7a6268bcc
[audio]replace kaldi fbank with kaldi-native-fbank in paddleaudio ( #2799 )
...
* replace kaldi_fbank with kaldi-native-fbank in paddleaudio
* fix mac
2 years ago
YangZhou
c1b1ae0515
[speechx]add kaldi-native-fbank && refactor frontend ( #2794 )
...
* replace kaldi-fbank with kaldi-native-fbank
* make kaldi-native-fbank work
2 years ago
HuangLiangJie
964211a81b
Change optimizer for vits, test=tts ( #2791 )
2 years ago
YangZhou
acf1d27230
[speechx] rm ds2 && rm boost ( #2786 )
...
* fix openfst download error
* add acknowledgments of openfst
* refactor directory
* clean ctc_decoders dir
* add nnet cache && make 2 thread work
* do not compile websocket
* rm ds2 && rm boost
* rm ds2 example
2 years ago