YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
...
* split paddlespeech/audio to paddleaudio.
* add sox io ,sox effect, kaldi native fbank to paddleaudio.
2 years ago
Zth9730
c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr ( #2697 )
...
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
2 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
...
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
2 years ago
zxcd
b71f1428c7
add all whisper model size support, test=asr ( #2677 )
...
* add all whisper model size support
* add choices in parser.
2 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
2 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
2 years ago
liangym
25b6bf9668
[tts] Add male voice for tts ( #2660 )
2 years ago
Zth9730
68134c8436
fix u2pp model ( #2549 )
2 years ago
Hui Zhang
467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
...
[ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)
2 years ago
ZapBird
7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 ( #2484 )
...
* BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。
__call__函数的参数audio_file为BytesIO类型时执行到self.preprocess(model, audio_file)会报错,需要判断audio_file为BytesIO类型时执行audio_file.seek(0)。
2 years ago
tianhao zhang
5bbe6e9897
support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
THUzyt21
bdbacd4249
precomited
2 years ago
Zhao Yuting
d5dec46336
Update README.md
2 years ago
Zhao Yuting
18b71dc136
Update README.md
2 years ago
Zhao Yuting
12a11394bd
Update infer.py
...
add a new faster model to infer in cli
2 years ago
Zhao Yuting
fb7f04e021
Update README.md
2 years ago
Zhao Yuting
92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting
57dcd0d17f
Update infer.py
...
change the infer in order to implement the new faster model for text
2 years ago
TianYuan
35c6ffa90b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan
e622f42d92
add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan
795eb7bd10
format paddlespeech with pre-commit ( #2331 )
2 years ago
TianYuan
0baec4325a
fix stats bugs
2 years ago
TianYuan
5d515f3f3f
update mix tts
2 years ago
TianYuan
8da993bbf8
fix fs bug
2 years ago
TianYuan
788a3062d0
fix onnx am_ckpt from list to item in prtrained_mdoels.py
2 years ago
TianYuan
cd662a08e0
fix for load specified model files
2 years ago
TianYuan
b9ade18055
add onnxruntime infer for cli
2 years ago
TianYuan
510e240c5c
achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan
00e9853f66
add mix tts cli, test=tts
2 years ago
Hui Zhang
5ca05fea20
cli batch process support \t
2 years ago
huangyuxin
05d41523ad
Merge branch 'develop' into webdataset
2 years ago
TianYuan
4b1f82d312
log redundancy in server
2 years ago
TianYuan
bc93bffbb4
replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
TianYuan
cf846f9ebc
rm extra log
2 years ago
KP
adc7c9b4aa
Fix unnecessary download present in issue #2067 .
2 years ago
huangyuxin
6ec6921255
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg
6598216b2f
Merge branch 'develop' into webdataset
2 years ago
huangyuxin
aa12b9ab52
replace s2t.transform with audio.transform
2 years ago
KP
b230dfbdec
Add kws cli and demo.
2 years ago
Hui Zhang
d20adb5c89
Merge pull request #2048 from KPatr1ck/import_bug
...
[CLI][Audio] Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP
b452be3d8d
Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP
fe345409bb
Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
Hui Zhang
59a78f2a46
ds2 wenetspeech to onnx and support streaming asr server
2 years ago
Hui Zhang
c8574c7e35
ds2 inference as sepearte engine for streaming asr
2 years ago
Jackwaterveg
bca014fd92
Merge pull request #2032 from PaddlePaddle/audio_refactoring
...
[audio] Audio refactoring
2 years ago
KP
4aaa8effe8
Refactor paddleaudio to paddlespeech.audio
2 years ago
KP
bf056c013d
Refactor paddleaudio to paddlespeech.audio
2 years ago
huangyuxin
2b5bc6df39
fix cli, test=doc
2 years ago
huangyuxin
06c9eee339
update reademe, add conf file, updata test_cli
2 years ago
huangyuxin
6ebe476532
support editing num_decode_left_chunks in cli and server
2 years ago