Hui Zhang
3adb20b468
eliminate shape and slice
2 years ago
Hui Zhang
46088c0a16
eliminate attn transpose
2 years ago
Hui Zhang
f9e3eaa024
transpose in matmul
2 years ago
Hui Zhang
3d7ca93861
bool type slice
2 years ago
Hui Zhang
c2c8a662b1
refactor reshape
2 years ago
Hui Zhang
6de81d74d9
eliminate cast dtype for bool op
2 years ago
Hui Zhang
8e7a315e00
remove comment
2 years ago
Hui Zhang
c4a5ae3825
eliminate mul
2 years ago
Hui Zhang
b7388ce25a
eliminate useless unsqueeze
2 years ago
Hui Zhang
1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
...
[s2t] support bitransformer decoder
2 years ago
TianYuan
52af86fcc3
fix ERNIE-SAT bug when editing the end of sentences, test=doc ( #2432 )
2 years ago
tianhao zhang
d3e5937591
support bitransformer decoder
2 years ago
Hui Zhang
7382050e21
fix bug on win
2 years ago
TianYuan
b14da765e8
frm random spk embedding in voice cloning, test=doc ( #2429 )
2 years ago
Hui Zhang
d25871a7b0
format
2 years ago
Hui Zhang
b10512eb0e
more config for u2pp
2 years ago
Hui Zhang
00b2c1c8fb
fix forward attention decoder caller
2 years ago
zhoupc2015
2ae0f66d0d
Solve "unknown format: 3" ( #2422 )
...
* Fix: executing the following code on the returned wav:
iob = io.BytesIO(wav)
wave.open(iob, 'rb')
throws an "unknown format: 3" exception
2 years ago
Hui Zhang
309c8d70d9
add reverse weight
2 years ago
Hui Zhang
9b66680ea4
Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang
027535dec1
support bitransformer decoder, test=asr
2 years ago
THUzyt21
bdbacd4249
precomited
2 years ago
Zhao Yuting
d5dec46336
Update README.md
2 years ago
Zhao Yuting
18b71dc136
Update README.md
2 years ago
tianhao zhang
0a95689461
support bitransformer decoder
2 years ago
tianhao zhang
455379b88e
support bitransformer decoder
2 years ago
Zhao Yuting
a63a0b1350
Update pretrained_models.py
2 years ago
Zhao Yuting
12a11394bd
Update infer.py
...
add a new faster model to infer in cli
2 years ago
Zhao Yuting
fb7f04e021
Update README.md
2 years ago
Zhao Yuting
92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting
57dcd0d17f
Update infer.py
...
change the infer in order to implement the new faster model for text
2 years ago
Zhao Yuting
b627666ce9
Update model_alias.py
...
Add a new model for faster text process in cli
2 years ago
Zhao Yuting
a02654660a
Update pretrained_models.py
...
Add a new model for faster text process
2 years ago
tianhao zhang
ecbf324286
support bitransformer decoder, test=asr
2 years ago
tianhao zhang
1a56a6e42b
add bitransformer decoder, test=asr
2 years ago
Hui Zhang
53d6baff0b
format
2 years ago
Hui Zhang
549d477592
fix code style
2 years ago
Hui Zhang
4d5cfd4003
export param from config
2 years ago
Hui Zhang
e3298c79ce
Merge branch 'develop' into u2_export
2 years ago
Hui Zhang
260752aa2a
using forward_attention_decoder
2 years ago
TianYuan
5e714ecb4a
[doc]update api docs ( #2406 )
...
* update api docs, test=doc
2 years ago
TianYuan
eac362057c
add typehint for g2pw ( #2390 )
2 years ago
Hui Zhang
0d7d87120b
simplify feature pipeline graph
2 years ago
WongLaw
324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts ( #2380 )
...
* Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine.
2 years ago
TianYuan
80b180217d
[TTS] fix some bugs of ERNIE-SAT ( #2378 )
...
* fix ernie_sat, test=tts
* fix for comments, test=tts
2 years ago
Hui Zhang
8690a00bd8
add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang
07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
...
[s2t] change DataParallel init method; fix conformer multi-gpu training without affecting dy2st
2 years ago
Hui Zhang
3a8869fba4
rm to_static decorator; configure jit save for ctc_activation
2 years ago
Hui Zhang
1c9f238ba0
configurable export
2 years ago
Hui Zhang
63aeb747b0
more comment
2 years ago
Hui Zhang
a7c6c54e75
fix
2 years ago
Hui Zhang
d638325c46
do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang
663e3ab58e
fix dp init
2 years ago
tianhao zhang
6745e9dd6b
fix dp init
2 years ago
tianhao zhang
598eb1a5ef
Merge branch 'develop' into fix_dp_init
2 years ago
WongLaw
989b755e8e
Revised must_neural_tone_words, test=doc. ( #2370 )
...
* Revised must_neural_tone_words.
2 years ago
tianhao zhang
9560d650db
fix dp init
2 years ago
TianYuan
7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
...
[TTS]add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
tianhao zhang
82e04d7815
fix trainer
2 years ago
TianYuan
f7873773bf
add __init__.py for VITS, test=tts ( #2362 )
2 years ago
TianYuan
35c6ffa90b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan
e622f42d92
add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan
1c30cff1bf
fix gpus of ernie_sat, test=tts ( #2355 )
2 years ago
Hui Zhang
2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
...
[s2t] fix deepspeech2 decode_wav
2 years ago
tianhao zhang
ab92e2c98c
fix deepspeech2 decode_wav
2 years ago
艾梦
ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 ( #2268 )
...
* code for training vits voice clone on aishell3.
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
795eb7bd10
format paddlespeech with pre-commit ( #2331 )
2 years ago
TianYuan
5d5888af8e
fix tone, update readme ( #2335 )
2 years ago
贾晓
0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
...
[s2t] fix format test=asr
2 years ago
tianhao zhang
cdcb1a5316
s2t: fix encoder.py
2 years ago
tianhao zhang
ed2819d7af
fix format test=asr
2 years ago
Hui Zhang
58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
...
[s2t] fix asr_engine.py
2 years ago
tianhao zhang
1dfca4ef73
fix multigpu training
2 years ago
Hui Zhang
94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
...
[s2t] fix conformer/transformer multi-gpu training, maybe impact dy2st
2 years ago
tianhao zhang
ed80b0e2c3
fix multigpu training test=asr
2 years ago
tianhao zhang
733ec7f2bc
fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang)
f5367f5efb
[TTS]fix bug of tone modify ( #2323 )
...
* add special tone modified case
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
Zhao Yuting
c28064fec2
Update asr_engine.py ( #2302 )
...
* Update asr_engine.py
* Update asr_engine.py
* Update application.yaml
must add parameter "num_decoding_left_chunks" so as to modify this in other scenarios.
* Update asr_engine.py
* Update application.yaml
* Update application.yaml
* Update asr_engine.py
2 years ago
TianYuan
7b864e8f38
clean old ernie sat inference scripts ( #2316 )
2 years ago
David An (An Hongliang)
c7163abffa
add thanks into readme, append data for chinese unit ( #2312 )
...
* add chinese words correct phonic,test=tts
* added thanks into readme. add data of unit, test=tts
* added thanks into readme. add data of unit, test=tts
* modify data of unit, test=tts
* modify thanks, test=tts
2 years ago
彭震东
c9de22eaa8
[TN] Update quantifiers ( #2308 )
2 years ago
TianYuan
d1c70a7809
fix g2pw model ( #2304 )
2 years ago
liangym
043b21d3b4
fix mix frontend, test=tts ( #2299 )
2 years ago
David An (An Hongliang)
25b96405df
add chinese words correct phonic,test=tts ( #2300 )
2 years ago
TianYuan
c1d4551055
add ernie sat synthesize_e2e, test=tts ( #2287 )
2 years ago
李子
5a58a27492
[TTS]Specify the input data type for G2PW, test=tts ( #2288 )
...
* fix ONNXRuntimeError Specify data type (int64),test=tts
* Tactron2→Tacotron2 ,test=doc
2 years ago
TianYuan
3f9339edff
Update polyphonic.yaml
2 years ago
TianYuan
f9a6970a62
Merge pull request #2263 from oyjxer/pc
...
[TTS]add ernie-sat sampler
2 years ago
lym0302
677e0961a8
fix point bug, test=tts
2 years ago
TianYuan
4a59702d60
Merge pull request #2255 from lym0302/develop
...
[tts] fix point bug
2 years ago
TianYuan
0baec4325a
fix stats bugs
2 years ago
TianYuan
f7780658db
fix tone sandhi bugs for Chinese frontend
2 years ago
pangchao04
b9be2bd64a
add ernie-sat sampler
2 years ago
lym0302
f8f73e41f0
fix point bug, test=tts
2 years ago
TianYuan
5de2c2dab5
format g2pw
2 years ago
TianYuan
5d515f3f3f
update mix tts
2 years ago
TianYuan
a75b2a5bab
Merge pull request #2230 from BarryKCL/develop_g2pW
...
Add g2pW to Chinese frontend
2 years ago
TianYuan
db89cfe829
Merge pull request #2234 from lym0302/mix_example
...
[tts] add zh_en mix example
2 years ago
TianYuan
8dbefc0165
fix preprocess bug, add hifigan_csmsc decoder, update readme
2 years ago
BarryKCL
a84b40ef79
update g2pW dict
2 years ago
Zhao Yuting
d02e04d532
Update audio_handler.py
2 years ago
BarryKCL
6593c24968
set window_size None
2 years ago
BarryKCL
5e63ac1e60
Fix a bug in g2pW
2 years ago
TianYuan
0eb598b876
Merge pull request #2235 from david-95/hongliang-dev
...
add filter for double punctuation in sentences; add homonym, test=tts
2 years ago
david.95
0df7fc8fbf
remove comment
2 years ago
david.95
7ba74f175f
remove comment
2 years ago
david.95
f52a87b8d0
remove useless fix, test=tts
2 years ago
david.95
a48e4f249f
add filter for double punctuation, revise comment ;
...
add homonym, fix mistakes
2 years ago
BarryKCL
aecf8fd384
add onnxruntime sess_options
2 years ago
lym0302
368e3e1b59
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
2 years ago
lym0302
894556f871
add zh_en mix example, test=tts
2 years ago
david.95
1edd474bcb
add filter for double punctuation in sentences; add homonym, test=tts
2 years ago
BarryKCL
61dd92e49c
update
2 years ago
BarryKCL
de0f99150a
change G2PWModel download
2 years ago
BarryKCL
744ea44279
add comment
2 years ago
BarryKCL
7b0f2a796d
change transformers to paddlenlp.transformers
2 years ago
BarryKCL
e60a63fbdd
Rollback "get_input_ids"
2 years ago
BarryKCL
ab2a1219c8
Add g2pW to Chinese frontend
2 years ago
TianYuan
2f9bdf2306
Merge pull request #2222 from yt605155624/add_onnx_cli
...
[CLI]add onnxruntime infer for cli
2 years ago
TianYuan
c3d47441cf
fix fs bug in inference.py (change fixed 24000 to variable for ljspeech)
2 years ago
TianYuan
8da993bbf8
fix fs bug
2 years ago
TianYuan
788a3062d0
fix onnx am_ckpt from list to item in pretrained_models.py
2 years ago
TianYuan
c6b25c05f4
change logger.debug to logger.info for streaming asr
2 years ago
Hui Zhang
c1fbfe928e
add test
2 years ago
TianYuan
cd662a08e0
fix for load specified model files
2 years ago
TianYuan
b9ade18055
add onnxruntime infer for cli
2 years ago
Hui Zhang
05bc258833
update docstring
2 years ago
Hui Zhang
6149daa221
export ctc_activation
2 years ago
huangyuxin
923b0b873e
fix import kws.exps.mdtc
2 years ago
huangyuxin
060e337623
fix dataloader factory, test=asr
2 years ago
TianYuan
b0b3222f9a
Merge pull request #2213 from yt605155624/fix_name_bug
...
[server]fix readme, test=doc
2 years ago
TianYuan
354601d0e9
fix readme, test=doc
2 years ago
Hui Zhang
812d80ab1c
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang
e5a6c243f1
fix jit save for conformer
2 years ago
TianYuan
510e240c5c
change default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan
00e9853f66
add mix tts cli, test=tts
2 years ago
0x45f
4e7106d9e2
Support dy2st
2 years ago
TianYuan
1f128a0817
Merge pull request #2117 from yt605155624/ernie_sat_trainer
...
[TTS]add ernie sat trainer
2 years ago
TianYuan
1bf78fa5c7
update batch_fn train.py, test=doc
2 years ago
TianYuan
9d4161ce5f
update config, test=doc
2 years ago
lym0302
e1f8695456
add mix tts, test=tts
2 years ago
Betterman-qs
e2dc204d4d
update engine_warmup.py, test=tts
2 years ago
Betterman-qs
cf1b873528
update engine_warmup.py, test=tts
2 years ago
Hui Zhang
ef37f73a01
fix cnn cache dy2st shape
2 years ago
0x45f
e21cceea51
Remove blank line
2 years ago
0x45f
e6ac8881f1
Fix comments
2 years ago
0x45f
ac680aa783
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
Hui Zhang
d098e027ca
Merge pull request #2155 from Jackwaterveg/develop_dev
...
[ASR] remove fluid(except gradclip)
2 years ago
0x45f
294b7b00bd
Support dy2st for conformer
2 years ago
TianYuan
97965f4c37
fix mlm_prob, test=tts
2 years ago
huangyuxin
7c9208765f
fix audio,test=doc
2 years ago
huangyuxin
75997d8277
merge
2 years ago
TianYuan
72fa8176ca
fix for mix_lang
2 years ago
TianYuan
5503c8bd6b
add ernie_sat synthesize script for metadata.jsonl, test=tts
2 years ago
TianYuan
f4ac0c79d9
Merge pull request #2143 from lym0302/mix_front
...
[tts] add mix frontend
2 years ago
Jackwaterveg
ae7a73bc11
Merge pull request #2138 from zh794390558/demos
...
[demos] more cli for speech demos
2 years ago
lym0302
207bb5d93b
add mix frontend, test=tts
2 years ago
Hui Zhang
e62cbc464e
Merge pull request #2124 from zh794390558/new_api
...
[asr] optimize attention cache used for attention ; 0-dim tensor for model export
2 years ago
Hui Zhang
8376f3d40d
Merge pull request #2128 from zh794390558/endpoint
...
[server][asr]more accuracy decoding something
2 years ago
Hui Zhang
caaa5cd502
more cli for speech demos
2 years ago
Hui Zhang
1edf120506
fix comment error
2 years ago
Hui Zhang
d142d3a7c0
add docstring
2 years ago
Hui Zhang
f8450c39e5
rename n_v_s to n_v_b, n_v_ns to n_v_nb
3 years ago
Hui Zhang
f4b11b19e5
rename time_s and time_ns to time_b and time_nb
3 years ago
liangym
45f51651bf
Merge pull request #2129 from lym0302/onnx_gpu
...
[server]specify id
3 years ago
lym0302
3d5ed00c60
specify id, test=doc
3 years ago
Hui Zhang
98eed53e6d
more accuracy decoding something
3 years ago
TianYuan
028742b69a
update lr scheduler
3 years ago
TianYuan
94688264c7
add ernie sat model file and config
3 years ago
Hui Zhang
e81849277e
att cache for streaming asr
3 years ago
Hui Zhang
5ca05fea20
cli batch process support \t
3 years ago
Hui Zhang
fb40602d94
refactor attention cache
3 years ago
liangym
e153495519
Merge pull request #2122 from yt605155624/rm_server_log
...
[TTS]add stream_play_tts.py, test=doc
3 years ago
TianYuan
6bbe6de1ec
add stream_play_tts.py, test=doc
3 years ago
lym0302
d66d6a05c7
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into develop
3 years ago
lym0302
5b06b76ebc
change sr, test=doc
3 years ago
huangyuxin
05d41523ad
Merge branch 'develop' into webdataset
3 years ago
huangyuxin
92d1d08b9a
fix scripts
3 years ago
TianYuan
f6d1c545ac
format doc_string
3 years ago
TianYuan
4b1f82d312
log redundancy in server
3 years ago
TianYuan
e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
...
[CLI]replace logger.info with logger.debug in cli, change default log leve…
3 years ago
TianYuan
496e2dd14b
fix Pillow's version
3 years ago
TianYuan
bc93bffbb4
replace logger.info with logger.debug in cli, change default log level to INFO
3 years ago
TianYuan
f76bd9fe51
Merge pull request #2109 from raycool/fix_log
...
[Server]Fix log error
3 years ago
TianYuan
e10eaa397e
Merge pull request #2100 from Jackwaterveg/develop_dev
...
[s2t] support --nxpu for all s2t models
3 years ago
huangyuxin
98cfdc4c05
fix nxpu
3 years ago
huguanghui
ddf14662ca
fix log issue #2070
3 years ago
huguanghui
20a9a67925
fix log issue #2070
3 years ago
TianYuan
cf846f9ebc
rm extra log
3 years ago
KP
527744d5f0
Fix unnecessary download present in issue #2067 .
3 years ago
KP
adc7c9b4aa
Fix unnecessary download present in issue #2067 .
3 years ago
huangyuxin
7463df89c5
fix nxpu
3 years ago
huangyuxin
6ec6921255
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
3 years ago
huangyuxin
429221dc03
adopt multi machine training
3 years ago
huangyuxin
ac1b301657
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
3 years ago
TianYuan
d8a0ba5913
Merge pull request #2085 from yt605155624/fix_tts_cli_log
...
[TTS]fix int32 warning in tts
3 years ago
Jackwaterveg
6598216b2f
Merge branch 'develop' into webdataset
3 years ago
TianYuan
c0f126ecd9
fix int32 warning in tts, test=tts
3 years ago
huangyuxin
9b5655f6ad
fix 'print log' in cli
3 years ago
huangyuxin
aa12b9ab52
replace s2t.transform with audio.transform
3 years ago