PiaoYang
6ffd3e4f7f
Update quick_start.md ( #3175 )
2 years ago
ljhzxc
dc56c3a10e
[TTS] [黑客松]Add JETS ( #3109 )
2 years ago
TianYuan
bd0d69ca74
[TTS]add StarGANv2VC preprocess ( #3163 )
2 years ago
YangZhou
b05ead51d7
[engine]add recognizer api && clean params && make a shared decoder resource ( #3165 )
2 years ago
zxcd
a1e5f27003
mv scaler.unscale_ blow grad_clip.
2 years ago
zxcd
7399d560e7
fix scaler save and load.
2 years ago
zxcd
2f4414a5f8
fix scaler save
2 years ago
zxcd
fbd27aab41
add amp for U2 conformer.
2 years ago
TianYuan
c7d24ba42c
fix some preprocess bugs ( #3155 )
2 years ago
masimeng1994
11ce08b260
[engine] replace onnx with fastdeploy ( #3150 )
...
* onnxruntime change to fastdeploy
2 years ago
lemondy
9c387577fd
fix example/aishell local/train.sh if condition bug, test=asr ( #3146 )
2 years ago
twoDogy
c0cc850776
add function for generating srt file ( #3123 )
...
* add function for generating srt file
在原来websocket_client.py的基础上,增加了由wav或mp3格式的音频文件生成对应srt格式字幕文件的功能
* add function for generating srt file
在原来websocket_client.py的基础上,增加了由wav或mp3格式的音频文件生成对应srt格式字幕文件的功能
* keep origin websocket_client.py
恢复原本的websocket_client.py文件
* add generating subtitle function into README
* add generate subtitle funciton into README
* add subtitle generation function
* add subtitle generation function
2 years ago
masimeng1994
d03ebe872a
add vad interface GetVadResult ( #3140 )
...
* add vad interface GetVadResult
* fix comment
2 years ago
longRookie
df37798598
[TTS]【Hackathon + No.190】 + 模型复现:iSTFTNet ( #3006 )
...
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* add iSTFTNet.md
* modify the format of iSTFTNet.md
* modify iSTFT.yaml and hifigan.py
* Format code using pre-commit
* modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv
* update iSTFTNet_csmsc_ckpt.zip download link
* modify iSTFTNet.md
* modify hifigan.py and iSTFT.yaml
* modify iSTFTNet.md
2 years ago
TianYuan
72aa19c32c
[TTS]add starganv2 vc trainer ( #3143 )
...
* add starganv2 vc trainer
* fix StarGANv2VCUpdater and losses
* fix StarGANv2VCEvaluator
* add some typehint
2 years ago
YangZhou
f35a87ab89
[Engine] recognizer controller refactor ( #3139 )
...
* refactor recognizer_controller
* clean frontend file
2 years ago
TianYuan
54ef90fcec
[TTS]Fix VITS lite infer ( #3098 )
2 years ago
TianYuan
d3d86f59aa
Update bug-report-tts.md ( #3120 )
2 years ago
masimeng1994
591b957b96
[runtime] fix linux && android cmake bug ( #3112 )
...
* support vad ios compile
* support onnx model recognize
* add build ios sh
2 years ago
masimeng1994
9e5a39cae0
[runtime] support onnx runtime && support ios compile ( #3101 )
...
* support vad ios compile
* support onnx model recognize
* add build ios sh
2 years ago
liangym
e83b491c34
rm unused dep, test=tts ( #3097 )
2 years ago
YangZhou
767f6dd4e2
[engine] add recognizer_controller && fix build bugs ( #3086 )
...
* fix asr compile
2 years ago
liangym
793effa122
[TTS] update svs_music_score.md ( #3085 )
2 years ago
TianYuan
cc02b007fe
fix paddle2onnx's install version, install the newest paddle2onnx in run.sh ( #3084 )
2 years ago
TianYuan
6894a2a77d
[TTS]fix elementwise_floordiv's fill_constant ( #3075 )
...
* fix elementwise_floordiv's fill_constant
* add float converter for min_value in attention
2 years ago
YangZhou
2be7e5725f
[engine]fix asr compile ( #3078 )
...
* fix asr compile
* add pybind
2 years ago
jlqian98
ab4217c2e4
[Engine] add TN/ITN functions ( #3047 )
...
* add AddBlk, ReverseFrac function
* rename text processing functions
2 years ago
liangym
1a272e11de
get music score, test=doc ( #3070 )
2 years ago
masimeng1994
704e363a2d
fix asr cmake ( #3071 )
2 years ago
zxcd
ca575bdda3
[Doc] change define asr server config to chunk asr config, test=doc ( #3067 )
...
* Update README.md
* Update README_cn.md
2 years ago
TianYuan
0a2e367ff4
[TTS]clean starganv2 vc model code and add docstring ( #2987 )
...
* clean code
* add docstring
2 years ago
liangym
880c172db7
[TTS] add svs frontend ( #3062 )
2 years ago
masimeng1994
f34d4ad4df
[runtime] fix vad and cls cmake ( #3050 )
...
* fix vad and cls cmake
2 years ago
TianYuan
46334ae045
Update setup.py ( #3056 )
2 years ago
TianYuan
d5720e4e7b
fix input dtype of elementwise_mul op from bool to int64 ( #3054 )
2 years ago
夜雨飘零
31a4562ae8
[ASR]add squeezeformer model ( #2755 )
...
* add squeezeformer model
* change CodeStyle, test=asr
* change CodeStyle, test=asr
* fix subsample rate error, test=asr
* merge classes as required, test=asr
* change CodeStyle, test=asr
* fix missing code, test=asr
* split code to new file, test=asr
* remove rel_shift, test=asr
2 years ago
zxcd
9bf5471613
optional tokenizer and fix some doc. ( #3042 )
2 years ago
TianYuan
b07f87b42e
Update README.md
2 years ago
TianYuan
78a0ded1b2
Update README_cn.md
2 years ago
TianYuan
596f5c28f2
Update README.md
2 years ago
TianYuan
b32b005037
Update README_cn.md
2 years ago
TianYuan
03ebbc335b
Update README.md
2 years ago
TianYuan
5270fda5bf
[docs]update readme ( #3043 )
2 years ago
TianYuan
706a68bde9
fix dtype diff of last expand_v2 op of VITS ( #3041 )
2 years ago
liangym
348064de0d
[TTS] add opencpop HIFIGAN example ( #3038 )
...
* add opencpop voc, test=tts
* soft link
* add opencpop hifigan, test=tts
* update
2 years ago
zxcd
4e9bca177a
[ASR] change optimizer and fix import error, test=asr ( #3023 )
...
* mv dataio.py to s2t.io.speechbrain.dataio
mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio
* remove transformers import.
* change optimizer same with released model
* add paddlenlp version in RESULT.md.
* fix run.sh
* fix data.sh step_num.
* add adadelta optimizer config.
* fix wav2vec2 test_wav.sh run error.
* add tokenizer config.
2 years ago
TianYuan
65c3217b8b
Update textnorm_test_cases.txt
2 years ago
liangym
435fc5cc19
[TTS] add opencpop PWGAN example ( #3031 )
...
* add opencpop voc, test=tts
* soft link
2 years ago
TianYuan
1aa7495dab
[TTS]Add license and reformat for TTSCppFrontend ( #3030 )
2 years ago
jlqian98
b9bdeca6c5
add text blank preprocess, test=asr ( #3025 )
2 years ago