david.95
|
f56cc08b18
|
add license content, test=tts
|
2 years ago |
david.95
|
29508f400b
|
to fix CI issue, test=tts
|
2 years ago |
david.95
|
60801d8f14
|
Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
|
2 years ago |
David An (An Hongliang)
|
ce21f9bc41
|
Merge branch 'PaddlePaddle:develop' into hongliang1014
|
2 years ago |
david.95
|
278c7a41a8
|
add module define to fix ci, test=tts
|
2 years ago |
Hui Zhang
|
964c22c677
|
Merge pull request #2532 from Zth9730/wav2vec2.0
[s2t] fix wav2vec2 report loss bug
|
2 years ago |
tianhao zhang
|
86f65f0b8e
|
fix wav2vec2 report loss bug
|
2 years ago |
david.95
|
13a7fa9808
|
enable chinese words' pinyin specified in text of ssml formats, test=tts
|
2 years ago |
Hui Zhang
|
f1ca564731
|
Merge pull request #2518 from Zth9730/wav2vec2.0
[ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech
|
2 years ago |
tianhao zhang
|
2ae94bd277
|
freeze wav2vec2=True, change loss report and update README.md
|
2 years ago |
tianhao zhang
|
3d994f5c23
|
format wav2vec2 demo
|
2 years ago |
Hui Zhang
|
c6f9764ed6
|
Merge pull request #2510 from Zth9730/u2pp_jit_export
[s2t] use reverse_weight in decode.yaml
|
2 years ago |
tianhao zhang
|
19180d359d
|
format wav2vec2 demo
|
2 years ago |
tianhao zhang
|
6e429f0513
|
support wav2vec2ASR on librispeech
|
2 years ago |
Hui Zhang
|
290c23b9d7
|
add u2 nnet, u2 nnet main, codelab, and can compile
|
2 years ago |
tianhao zhang
|
e367242765
|
update dependency of paddle
|
2 years ago |
tianhao zhang
|
5a66a14659
|
fix u2pp model version number
|
2 years ago |
tianhao zhang
|
cda440e6f0
|
use reverse_weight in decode.yaml
|
2 years ago |
Zth9730
|
c9b0c96b7b
|
Merge pull request #2502 from zh794390558/u2pp_export
[s2t] streaming conformer u2 and u2pp jit export
|
2 years ago |
Hui Zhang
|
c98b5dd173
|
fix masked_fill which will nan in trainning
|
2 years ago |
Hui Zhang
|
9277fcb8a8
|
fix attn can not train
|
2 years ago |
Hui Zhang
|
1f4f98b171
|
fix bug
|
2 years ago |
liangym
|
0359c3f6b5
|
Fix mix front (#2493)
* update mix frontend, test=tts
|
2 years ago |
Hui Zhang
|
e86337a423
|
fix bug
|
2 years ago |
Hui Zhang
|
925abcca23
|
format
|
2 years ago |
Hui Zhang
|
2a75405e9a
|
Merge branch 'develop' into u2pp_export
|
2 years ago |
Hui Zhang
|
3ed24474d2
|
wenetspeech asr1 quant
|
2 years ago |
Hui Zhang
|
467cfd4e75
|
Merge pull request #2489 from Zth9730/u2++_server
[ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)
|
2 years ago |
tianhao zhang
|
5b5167b586
|
support u2pp cli and server, optimiz code of u2pp decode, test=asr
|
2 years ago |
YangZhou
|
3507829a6d
|
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
[Server]Deploy text model in server
|
2 years ago |
ZapBird
|
7a13b35fe6
|
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 (#2484)
* BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。
__call__函数的参数audio_file为BytesIO类型时执行到self.preprocess(model, audio_file)会报错,需要判断audio_file为BytesIO类型时执行audio_file.seek(0)。
|
2 years ago |
tianhao zhang
|
5bbe6e9897
|
support u2pp cli and server, optimiz code of u2pp decode, test=asr
|
2 years ago |
Hui Zhang
|
bdf876ea7b
|
Merge branch 'develop' into u2pp_export
|
2 years ago |
Zhao Yuting
|
304dc2603c
|
Update text_engine.py
|
2 years ago |
Zhao Yuting
|
8c945c073d
|
Update application.yaml
|
2 years ago |
Zhao Yuting
|
b9693a0e8e
|
Update text_engine.py
|
2 years ago |
Zhao Yuting
|
8ecf6796f3
|
Update text_engine.py
|
2 years ago |
Hui Zhang
|
afda7ed7d1
|
remove useless code
|
2 years ago |
YangZhou
|
4841f94298
|
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
[CLI]Deploy fast text model for cli
|
2 years ago |
Hui Zhang
|
b20bf7d5de
|
masked_fill by multiply, remove while
|
2 years ago |
Zhao Yuting
|
d2da7f50d2
|
Update text_engine.py
precommihted already
|
2 years ago |
Zhao Yuting
|
82f731c153
|
Update application.yaml
change model
|
2 years ago |
Hui Zhang
|
feb27e2a84
|
fuse linear kv
|
2 years ago |
Hui Zhang
|
3adb20b468
|
eliminate shape and slice
|
2 years ago |
Hui Zhang
|
46088c0a16
|
elimiate attn transpose
|
2 years ago |
Hui Zhang
|
f9e3eaa024
|
transpose in matmul
|
2 years ago |
Hui Zhang
|
3d7ca93861
|
bool type slice
|
2 years ago |
Hui Zhang
|
c2c8a662b1
|
refactor reshape
|
2 years ago |
Hui Zhang
|
6de81d74d9
|
elimiete cast dtype for bool op
|
2 years ago |
Hui Zhang
|
8e7a315e00
|
remove comment
|
2 years ago |
Hui Zhang
|
c4a5ae3825
|
eliminate mul
|
2 years ago |
Hui Zhang
|
b7388ce25a
|
eliminate useless unsqueese
|
2 years ago |
Hui Zhang
|
1a1ce92cb4
|
Merge pull request #2415 from Zth9730/u2++_decoder
[s2t] support bitransformer decoder
|
2 years ago |
TianYuan
|
52af86fcc3
|
fix ERNIE-SAT bug when edit the end of sentenses, test=doc (#2432)
|
2 years ago |
tianhao zhang
|
d3e5937591
|
support bitransformer decoder
|
2 years ago |
Hui Zhang
|
7382050e21
|
fix bug on win
|
2 years ago |
TianYuan
|
b14da765e8
|
frm random spk embedding in voice cloning, test=doc (#2429)
|
2 years ago |
Hui Zhang
|
d25871a7b0
|
format
|
2 years ago |
Hui Zhang
|
b10512eb0e
|
more config or u2pp
|
2 years ago |
Hui Zhang
|
00b2c1c8fb
|
fix forward attention decoder caller
|
2 years ago |
zhoupc2015
|
2ae0f66d0d
|
Solve "unknown format: 3" (#2422)
* Solve execute the following code with return wav:
iob = io.BytesIO(wav)
wave.open(iob, 'rb')
will throw an "unknown format: 3" exception
|
2 years ago |
Hui Zhang
|
309c8d70d9
|
add reverse weight
|
2 years ago |
Hui Zhang
|
9b66680ea4
|
Merge branch 'u2++_decoder' into u2pp_export
|
2 years ago |
tianhao zhang
|
027535dec1
|
support bitransformer decoder, test=asr
|
2 years ago |
THUzyt21
|
bdbacd4249
|
precomited
|
2 years ago |
Zhao Yuting
|
d5dec46336
|
Update README.md
|
2 years ago |
Zhao Yuting
|
18b71dc136
|
Update README.md
|
2 years ago |
tianhao zhang
|
0a95689461
|
support bitransformer decoder
|
2 years ago |
tianhao zhang
|
455379b88e
|
support bitransformer decoder
|
2 years ago |
Zhao Yuting
|
a63a0b1350
|
Update pretrained_models.py
|
2 years ago |
Zhao Yuting
|
12a11394bd
|
Update infer.py
add a new faster model to infer in cli
|
2 years ago |
Zhao Yuting
|
fb7f04e021
|
Update README.md
|
2 years ago |
Zhao Yuting
|
92d09d5cce
|
Update README_cn.md
|
2 years ago |
Zhao Yuting
|
57dcd0d17f
|
Update infer.py
change the infer in order to implement the new faster model for text
|
2 years ago |
Zhao Yuting
|
b627666ce9
|
Update model_alias.py
Add a new model for faster text process in cli
|
2 years ago |
Zhao Yuting
|
a02654660a
|
Update pretrained_models.py
Add a new model for faster text process
|
2 years ago |
tianhao zhang
|
ecbf324286
|
support bitransformer decoder, test=asr
|
2 years ago |
tianhao zhang
|
1a56a6e42b
|
add bitransformer decoder, test=asr
|
2 years ago |
Hui Zhang
|
53d6baff0b
|
format
|
2 years ago |
Hui Zhang
|
549d477592
|
fix code style
|
2 years ago |
Hui Zhang
|
4d5cfd4003
|
export param from cnofig
|
2 years ago |
Hui Zhang
|
e3298c79ce
|
Merge branch 'develop' into u2_export
|
2 years ago |
Hui Zhang
|
260752aa2a
|
using forward_attention_decoder
|
2 years ago |
TianYuan
|
5e714ecb4a
|
[doc]update api docs (#2406)
* update apt docs, test=doc
|
2 years ago |
TianYuan
|
eac362057c
|
add typehint for g2pw (#2390)
|
2 years ago |
Hui Zhang
|
0d7d87120b
|
simplify feature pipeline graph
|
2 years ago |
WongLaw
|
324b166c52
|
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts (#2380)
* Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine.
|
2 years ago |
TianYuan
|
80b180217d
|
[TTS] fix some bugs of ERNIE-SAT (#2378)
* fix ernie_sat, test=tts
* fix for comments, test=tts
|
2 years ago |
Hui Zhang
|
8690a00bd8
|
add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
|
2 years ago |
Hui Zhang
|
07f566e0a5
|
Merge pull request #2372 from Zth9730/fix_dp_init
[s2t] DataParallel init method changed, fixed conformer could not multi-gpu training and don't affect dy2st
|
2 years ago |
Hui Zhang
|
3a8869fba4
|
rm to_static decarator; configure jit save for ctc_activation
|
2 years ago |
Hui Zhang
|
1c9f238ba0
|
configurable export
|
2 years ago |
Hui Zhang
|
63aeb747b0
|
more comment
|
2 years ago |
Hui Zhang
|
a7c6c54e75
|
fix
|
2 years ago |
Hui Zhang
|
d638325c46
|
do not jit save forward; using slice for zeros([0,0,0,0]) tensor
|
2 years ago |
tianhao zhang
|
663e3ab58e
|
fix dp init
|
2 years ago |
tianhao zhang
|
6745e9dd6b
|
fix dp init
|
2 years ago |
tianhao zhang
|
598eb1a5ef
|
Merge branch 'develop' into fix_dp_init
|
2 years ago |
WongLaw
|
989b755e8e
|
Revised must_neural_tone_words, test=doc. (#2370)
* Revised must_neural_tone_words.
|
2 years ago |
tianhao zhang
|
9560d650db
|
fix dp init
|
2 years ago |