TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc ]add tacotron2 readme ( #1385 )
...
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
1cc7905d51
rm csmsc.py, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructer problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Jerryuhoo
f515416c4a
fix missing model choice, test=doc
3 years ago
Jerryuhoo
a22080130b
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang
97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
...
[TTS]Add new tacotron2
3 years ago
huangyuxin
3845804cc9
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
3fd7a7790b
add typehit for updater and evaluator, test=tts
3 years ago
huangyuxin
4e31247633
refacto the code
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin
565a63c5ef
refactor the setup in paddleaudio
3 years ago
huangyuxin
eb91ce84f9
refactor the version
3 years ago
Hui Zhang
4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
...
[CLI] asr, Add Deepspeech2 online and offline model
3 years ago
Hui Zhang
d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
...
[ST] beam search with optimality guarantees
3 years ago
huangyuxin
ab759b16de
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin
38edfd1a89
Add Deepspeech2 online and offline in cli
3 years ago
TianYuan
d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts ( #1349 )
3 years ago
TianYuan
9c7f0762b0
update racotron2 and transformer tts, test=tts
3 years ago
huangyuxin
8028f33b7f
synchronize the version
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
TianYuan
a1867c20c3
fix slice bug of speedyspeech expand, test=tts ( #1337 )
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
TianYuan
7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan
acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan
caa391f461
fix speedyspeech inference, test=tts ( #1322 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
...
[TTS] Add support for finetuning speedyspeech
3 years ago
Jerryuhoo
111a452378
Fix the code format, test=tts
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
huangyuxin
fe1dc9d211
refactor the cli/st, test=st
3 years ago
TianYuan
27bb76bdb9
fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo
be99807d61
Add durations to gen_gta_mel.py inference
3 years ago
KP
52a8b2f320
Add ECAPA_TDNN. ( #1301 )
3 years ago
Jerryuhoo
fcc34e3e95
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_metod and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
KP
c09466ebbe
Add ECAPA_TDNN. ( #1295 )
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
TianYuan
73dc0e2535
fix_ning
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
TianYuan
318cc9e539
Merge branch 'develop' into develop
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
KP
a810cd4e5c
Add cli logging. ( #1274 )
3 years ago
Jerryuhoo
d6e9b76e76
change link_wav.py path, test=tts
3 years ago
Jerryuhoo
c94f346207
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
...
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
3 years ago
Jerryuhoo
e239ee1cd2
add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
Hui Zhang
45832f6770
fix default dist_samlper to False
3 years ago
Hui Zhang
3a2db414e6
format code
3 years ago
Hui Zhang
6f651d762e
fix batch sampler set_epoch when epcoh start
3 years ago
TianYuan
680eac02b9
[tts]Update mb melgan ( #1272 )
...
* update mb melgan
* update mb melgan, test=tts
3 years ago
TianYuan
98ce69d0aa
Merge pull request #1259 from jerryuhoo/develop
...
[TTS]Add multi-speaker support for the SpeedySpeech model
3 years ago
huangyuxin
ffadbe22a7
merge the develop, test=asr
3 years ago
JiehangXie
bdc48114a4
Update text_normlization.py
3 years ago
JiehangXie
d88ceef7bc
Fix punctuation bug
...
修复顿号和英文冒号停顿和分句的问题
3 years ago
huangyuxin
8b63485ce3
fix some bug, test=asr
3 years ago
JiehangXie
6065b1b607
Fix punctuation bug
...
修复顿号和英文冒号停顿和分句的问题
3 years ago
Jerry
0719698118
Merge branch 'develop' into develop
3 years ago
AdamBear
36c9eaa437
Cache the TextFeaturizer instance for infer speed improvement. ( #1260 )
3 years ago
huangyuxin
3e2cc898cb
remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo
2dccd5315d
remove useless "other" dataset
3 years ago
Jerryuhoo
f191d0b022
change speaker embedding position
...
Change speaker embedding position into the encoder.
3 years ago
Jerryuhoo
11991b6d35
add multi-speaker support for speedyspeech
3 years ago
huangyuxin
a1d8ab0f99
merge the develop
3 years ago
huangyuxin
c907a8deda
change all recipes
3 years ago
TianYuan
b9a55262f1
Update fastspeech2.py
3 years ago
Hui Zhang
c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler ( #1242 )
...
* batchsampler or distributebatchsampler
* format
3 years ago
Junkun Chen
420709e5ce
[st] Distributed sampler and new dataloader with MIMO ( #1239 )
...
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
3 years ago
TianYuan
fbe3c05137
add style_melgan and hifigan in tts cli, test=tts ( #1241 )
3 years ago
TianYuan
a232cd8b12
Update fastspeech2.py
3 years ago
huangyuxin
41eeed0450
add librispeech asr1
3 years ago
huangyuxin
fb6d1e2c11
merge the develop
3 years ago
huangyuxin
2c5902d7c5
rename decoding to decode
3 years ago
TianYuan
42c109216d
[tts]add style melgan pretraied model ( #1228 )
...
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
Hui Zhang
bb2a370b23
[asr] remove useless conf of librispeech ( #1227 )
...
* remve useless conf
* format code
* update conf
* update conf
* update conf
3 years ago
huangyuxin
c40b6f4062
refactor the train and test config,test=asr
3 years ago
TianYuan
5692b0ff04
fix log for t2s ( #1219 )
3 years ago
TianYuan
b031ee43c4
Merge pull request #1215 from yt605155624/refactor_punc
...
[text]Refactor punc
3 years ago
TianYuan
e1798e1eeb
update
3 years ago
KP
d362d28d35
Remove logging file in cli api.
3 years ago
TianYuan
15b8904fa1
refactor punc
3 years ago
JiehangXie
927c9bbdb6
Fix a bug when sentence inputed contain English words
3 years ago
KP
1632af7706
Update examples/esc50. ( #1203 )
3 years ago
Jerryuhoo
3cbfd7bf35
Add speaker embedding and speaker id for style fastspeech2 inference
3 years ago
Hui Zhang
db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch ( #1191 )
3 years ago
Hui Zhang
d852aee2ff
[asr] logfbank with dither ( #1179 )
...
* fix logfbank dither
* format
3 years ago
KP
9ec2bc8e2e
Update README. test=doc_fix
3 years ago
Jackwaterveg
879857332d
[version]add paddlespeech.__version__ ( #1166 )
...
* add paddlespeech.__version__
* version 0.1.0 is ready
3 years ago
TianYuan
19ef7210a0
[TTS]Add hifigan ( #1097 )
...
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
3 years ago
TianYuan
675cff258b
[TTS]fix praatio version, test=tts ( #1158 )
...
* fix praatio version, test=tts
* fix praatio version, test=tts
3 years ago
Jackwaterveg
e9748faa71
[Cli]optimize the cli, add --yes, and delete transformer_aishell ( #1154 )
...
* optimize the cli/asr,test=asr
* test=doc_fix
3 years ago
Jackwaterveg
2bccde3def
update the version of ctcdecoders and feat,test=doc_fix ( #1155 )
3 years ago
Jackwaterveg
0151f2463f
fix bug of pad_sequence in u2,test=asr ( #1153 )
3 years ago
Jackwaterveg
68164dd39f
[asr]rename test_hub to test_wav ( #1132 )
...
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
3 years ago
KP
16d6ed3842
Add automatic_video_subtitiles demo.
3 years ago
KP
7394a18732
Add default arguments in cls python api.
3 years ago
TianYuan
f9efbf3063
Update generate_lexicon.py
3 years ago
Jackwaterveg
5b446f6321
[Config]clear the u2 decode config for asr ( #1107 )
...
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
3 years ago
KP
074559fe90
[CLI][Demo][Text]Refactor punctuation_restoration. ( #1013 )
...
* Refactor punctuation_restoration.
* Add text cli and punc demo.
3 years ago
Hui Zhang
51d7a07c6d
format and fix pre-commit ( #1120 )
3 years ago
TianYuan
5f0f76f249
add eval() for inference model ( #1114 )
3 years ago
TianYuan
59e4a34071
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli
3 years ago
TianYuan
3de4130dfc
update am name
3 years ago
TianYuan
9db1710ba7
add conformer demos ( #1108 )
3 years ago
TianYuan
3fe75f833d
Merge pull request #1109 from yt605155624/tts_cli
...
[cli]update voc name
3 years ago
TianYuan
ca12a83d5a
update voc name
3 years ago
TianYuan
965a57ef0e
Update README.md
3 years ago
Jackwaterveg
9e31a606d1
set default encoding utf8 for win ( #1101 )
...
Co-authored-by: KP <109694228@qq.com>
3 years ago
Hui Zhang
764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang
b1c80c45e0
remove ctc grad norm type in config
3 years ago
huangyuxin
1d4002409f
separate the sox and soxbindings with the requirements
3 years ago
TianYuan
df5fe035e5
Update README.md
3 years ago
TianYuan
a6e0a69da8
Merge pull request #1095 from KPatr1ck/demo
...
[Demo]Add tts demo.
3 years ago
TianYuan
963e906f56
Merge pull request #1068 from yt605155624/add_style_melgan
...
[TTS]add style_melgan
3 years ago
KP
1909f2f620
Add tts demo.
3 years ago
KP
3701fba0be
Update download logic and fix README typos.
3 years ago
TianYuan
f701882b66
update add_style_melgan
3 years ago
gongel
dc60aeb8c2
format
3 years ago
gongel
31510d088c
refactor: rm kaldi_io
3 years ago
TianYuan
2189b46004
add tts cli
3 years ago
KP
70a8a75476
Add st demo.
3 years ago
Hui Zhang
6dedb63e8b
Merge pull request #1087 from Jackwaterveg/setup
...
[ctcdecoders] Separate the ctcdecoders
3 years ago
huangyuxin
9fe0beee54
fix the bug: miss import after install
3 years ago
huangyuxin
cea5ffe0e4
refactor the code
3 years ago
gongel
20d88ec673
refactor: update params/input/output/namestyle
3 years ago
KP
6c1e6e7876
Update recommended model to cnn14 and argument name in __call__.
3 years ago
huangyuxin
ed12db61a6
Separate the ctcdecoders
3 years ago
KP
0b7e0d1e2e
Update tags of pretrained_models.
3 years ago
KP
d08b824d72
Update README.
3 years ago
KP
61e39daccc
Optimize model init.
3 years ago
KP
528c70e515
Remove TODO.
3 years ago
KP
b072453ca8
Fix decompressing problem.
3 years ago
KP
29da318379
Add audio classification cli.
3 years ago
gongel
f5c61ced28
feat: add st cli
3 years ago
Hui Zhang
0818c1601d
add __init__.py
3 years ago
TianYuan
7b2ecb6eed
add style_melgan, test=tts
3 years ago
Hui Zhang
03678c08c5
Merge branch 'develop' into fix_cli
3 years ago
huangyuxin
1b57d05d1b
rm the os.chdir in cli asr
3 years ago
TianYuan
aead853b1d
Update zh_frontend.py
3 years ago
huangyuxin
021311c76b
add transformer to cli infer
3 years ago
TianYuan
a070524d37
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
3 years ago
TianYuan
dd36eafe34
add style_melgan
3 years ago
KP
54cf048b2a
Merge remote-tracking branch 'update_stream/develop' into cli
3 years ago
huangyuxin
a258a34ec0
revise the convert pcm
3 years ago
Jackwaterveg
8ec576f477
Update infer.py
3 years ago
huangyuxin
b0356ae489
revise
3 years ago
huangyuxin
957f2e3a1c
revise
3 years ago
huangyuxin
aee530af27
revise the sample rate
3 years ago
Junkun
4e31a4445d
eval mode
3 years ago
KP
a19e51d7da
Update python api.
3 years ago
KP
e0642ffc77
Update doc strings.
3 years ago
huangyuxin
90d648a601
support using by __call__
3 years ago
huangyuxin
aecb5f567c
Merge branch 'tmp' into 1048
3 years ago
KP
44e9b032d5
Update inputs and outputs of executor.
3 years ago
huangyuxin
3fadcde5e2
revise the asr infer.py
3 years ago
Hui Zhang
4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
...
[benchmark]fix the benchmark
3 years ago
Junkun
3a14b82844
minor
3 years ago
Junkun
f50a2ab4ca
fix bugs
3 years ago
huangyuxin
cb383a39c3
fix the benchmark
3 years ago
huangyuxin
d0bf506fee
fix the load checkpoint
3 years ago
KP
1707244472
Update device usage.
3 years ago
KP
000294132c
Rename s2t to asr.
3 years ago
huangyuxin
43f4d47bfa
add the call in infer.py
3 years ago
Hui Zhang
39228864bb
format code
3 years ago
Hui Zhang
d395c2b8e3
jsonlines reade manifest file
3 years ago
Hui Zhang
7554b6107a
using visualdl; fix read_manifest
3 years ago
huangyuxin
cdc8520969
add the infer
3 years ago
KP
c94ebdc52c
Add python api for executor.
3 years ago
Junkun
d2fab3238b
fix bugs
3 years ago
Junkun
cdd0845127
add translate function
3 years ago
KP
e9798498d6
Update asr inference in paddlespeech.cli.
3 years ago
huangyuxin
895a086fdd
rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
KP
4d39a7746e
Add paddlespeech.cli.
3 years ago
KP
98f0806353
Add paddlespeech.cli.
3 years ago
TianYuan
6e3257ab8a
Create __init__.py
3 years ago
TianYuan
022f1ce8e9
Merge pull request #1040 from yt605155624/fix_frontend
...
[TTS]update text frontend
3 years ago
TianYuan
a861e56e91
rm space for pure Chinese
3 years ago
TianYuan
dad1cbbcd6
update text frontend
3 years ago
KP
6e1ac1cc15
Add paddlespeech.cls and esc50 example.
3 years ago
KP
33f0e7622c
Add paddlespeech.cls and esc50 example.
3 years ago
KP
2c531d78ac
Add paddlespeech.cls and esc50 example.
3 years ago
KP
bdb3ce23ee
Add paddlespeech.cls and esc50 example.
3 years ago
KP
1189117784
Add paddlespeech.cls and esc50 example.
3 years ago
Hui Zhang
2bbfdbae91
Merge pull request #1015 from yt605155624/fs2_conformer
...
[TTS]fastspeech2 conformer
3 years ago
TianYuan
b0a1d8ab60
fix base
3 years ago
TianYuan
469329221b
refactor encoder, rm old code
3 years ago
Hui Zhang
fe83adfbcb
nproc to ngpu
3 years ago
Hui Zhang
789471bfca
test wav for u2
3 years ago