Honei
48e0177767
Merge pull request #1630 from Honei/vox12
[vec]voxceleb convert dataset format to paddlespeech
3 years ago
qingen
fc72295334
Merge pull request #1651 from ccrrong/ami
[vec] add speaker diarization pipeline
3 years ago
ccrrong
995436c6f1
delete unused file ami_dataset.py, compute_der.py, test=doc
3 years ago
Hui Zhang
44ee5cd805
Merge pull request #1677 from PaddlePaddle/Jackwaterveg-patch-1
[Doc] update readme for aishell/asr0
3 years ago
ccrrong
bc53f726fe
convert dataset format to paddlespeech, test=doc
3 years ago
TianYuan
c674e59b91
update readme, test=doc
3 years ago
TianYuan
0282d45c62
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
3 years ago
TianYuan
30628f6832
update readme, test=doc
3 years ago
TianYuan
c765fca6b4
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
3 years ago
TianYuan
21c75684ac
add paddle2onnx, test=tts
3 years ago
Jackwaterveg
75c9dc773b
test=doc
3 years ago
Jackwaterveg
3c93953550
test=doc
3 years ago
Jackwaterveg
f71b9b915d
test=doc
3 years ago
Jackwaterveg
1a67038616
test=doc
3 years ago
Jackwaterveg
88f5595bd7
test=doc
3 years ago
Jackwaterveg
ee96fb40f0
test=doc
3 years ago
Jackwaterveg
a22f29ba10
test=doc
3 years ago
Jackwaterveg
ae1b22273f
[Doc] update readme for aishell/asr0, test=doc
3 years ago
xiongxinlei
a8244dc5b0
update the note, test=doc
3 years ago
KP
80b1fb9839
Update RESULTS.md. test=doc
3 years ago
KP
34b77a9db1
Update RESULTS.md. test=doc
3 years ago
huangyuxin
fd7a50d5a0
add new cer tools, test=asr
3 years ago
Hui Zhang
2b7ca6f261
Update RESULTS.md
3 years ago
Hui Zhang
7ca40ff008
Merge pull request #1668 from PaddlePaddle/Jackwaterveg-patch-1
[ASR] update ds2 online model
3 years ago
Honei
89791d7aca
Merge pull request #1663 from Honei/model
[vec]update the speaker verification model
3 years ago
Jackwaterveg
82cd7015d7
test=doc
3 years ago
Jackwaterveg
5bb36472e8
test=doc
3 years ago
Jackwaterveg
eeae00cc04
test=doc
3 years ago
TianYuan
d592f25279
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
TianYuan
7aecb2c4bb
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei
d064c8196e
update the speaker verification model, test=doc
3 years ago
KP
079ac5caa0
Update README.md
3 years ago
xiongxinlei
38e4e9c893
refactor voxceleb2 data download, test=doc
3 years ago
ccrrong
7a03f36548
code format, test=doc
3 years ago
ccrrong
378fe5909f
add ami diarization pipeline, test=doc
3 years ago
xiongxinlei
acebfad7b7
change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei
57c11dcab0
add some annotations, test=doc
3 years ago
xiongxinlei
30b5b3cb9e
add vector csv dataset format, test=doc
3 years ago
xiongxinlei
5b05300e53
train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei
9944fec3d4
convert rirs noise to csv file
3 years ago
TianYuan
78219cef7b
add cnndecoder pretrained model, test=doc
3 years ago
TianYuan
4d7cd0e063
add streaming synthesize, test=tts
3 years ago
xiongxinlei
ec24a169ee
convert jsonfile to csv file
3 years ago
TianYuan
005aa4066c
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan
0fc79f474d
add CNNDecoder, test=tts
3 years ago
TianYuan
318edec303
Merge pull request #1613 from yt605155624/restructure_expand
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
Hui Zhang
943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
[ASR] Replace kaldi_fbank with paddleaudio
3 years ago
huangyuxin
ed490b66cb
update spectrogram, test=asr
3 years ago
Hui Zhang
84d712d493
format code, test=doc
3 years ago
Honei
d60856b1ed
Merge pull request #1614 from Honei/vox12
[vec]change the vector output to numpy.array
3 years ago
xiongxinlei
ed7113f320
change the vector output to numpy.array
3 years ago
Jackwaterveg
5db7e6382a
test=doc
3 years ago
TianYuan
e52fc08c58
update readme, test=doc
3 years ago
TianYuan
bc5ae43d3a
restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin
0ffe1f9114
replace kaldi_fbank with paddleaudio
3 years ago
Jackwaterveg
64e12e949a
Update RESULTS.md
3 years ago
Jackwaterveg
1e35007925
test=doc
3 years ago
xiongxinlei
ef1bc5e815
vector cli output dim info, test=doc
3 years ago
xiongxinlei
1fdb36f757
add model emb dim info, test=doc
3 years ago
xiongxinlei
ad2caf2ccb
add speaker verification demo and doc, test=doc
3 years ago
Honei
305bacdcf2
Merge branch 'develop' into vox12
3 years ago
TianYuan
8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
[TTS] update readme for ljspeech hifigan
3 years ago
TianYuan
342b487383
update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang
4051e7b762
fix compliance test bug, and format
3 years ago
xiongxinlei
e2684e71f2
refactor the data prepare process
3 years ago
Jackwaterveg
5c1283289e
[Doc] Update doc
3 years ago
Jackwaterveg
c07d248afd
test=doc
3 years ago
Jackwaterveg
13ac21b705
Update RESULTS.md
3 years ago
xiongxinlei
5221c2797f
add voxceleb dataset and trial info, test=doc
3 years ago
Jackwaterveg
5d3c760eae
Update RESULTS.md
3 years ago
Jackwaterveg
fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
[ASR] change default initializer to kaiming_uniform
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
Hui Zhang
b5315657ff
Merge pull request #1509 from qingen/cluster
[vec] add clustering of vectors
3 years ago
TianYuan
490300f84f
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_ljspeech_hifigan
3 years ago
TianYuan
c36039ce32
update readme for ljspeech hifigan, test=tts
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
TianYuan
6469568d2a
update readme for vctk hifigan, test=tts
3 years ago
TianYuan
9497c93fb0
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vctk_hifigan
3 years ago
TianYuan
d9127601b6
update readme for vctk hifigan, test=tts
3 years ago
huangyuxin
e991d82ae7
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
xiongxinlei
d28ccfa96b
add vector cli component, test=doc
3 years ago
TianYuan
5ab2601759
update readme for aishell3 hifigan, test=tts
3 years ago
TianYuan
c4035f8c43
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_aishell3_hifigan
3 years ago
TianYuan
8d9197817a
add hifigan in syn of aishell3, test=tts
3 years ago
TianYuan
13242d015e
update run.sh, test=doc
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
TianYuan
4c517fa8a6
update preprocess.sh in aishell3 vc0, test=doc
3 years ago
TianYuan
bf587ba879
update synthesize_e2e.sh, test=tts
3 years ago
qingen
0f7ede11ef
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen
d16ce21d47
[wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei
506d26a957
change the code style to s2t code style, test=doc
3 years ago
xiongxinlei
7eb8fa72a1
convert save_freq to save_interval, test=doc
3 years ago
xiongxinlei
311fa87a11
add some comments to the code
3 years ago
xiongxinlei
8ed5c287a3
add vox2 data into VoxCeleb class
3 years ago
xiongxinlei
584a2c0e39
add ecapa-tdnn config yaml file
3 years ago
Hui Zhang
67fc073b01
Merge pull request #1550 from yt605155624/fix_ss_dump_bug
[TTS]fix synthesize bug for speedyspeech
3 years ago
TianYuan
589f780850
fix synthesize bug for speedyspeech, test=tts
3 years ago
xiongxinlei
993d6783d7
remove unused code, test=doc
3 years ago
xiongxinlei
0e87037f2c
refactor to comply with paddleaudio
3 years ago
xiongxinlei
4473405f82
merge develop to vox12, test=doc
3 years ago
Honei
0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei
60d73bb7bd
add state 0 to prepare the voxceleb data and augment data
3 years ago
TianYuan
a151935eaf
add ljspeech hifigan, test=tts
3 years ago
xiongxinlei
14efbf5b15
check extract embedding result, test=doc
3 years ago
Hui Zhang
67dcff2f3f
Merge pull request #1545 from yt605155624/add_aishell3_hifigan
[tts]add aishell3 hifigan egs
3 years ago
TianYuan
81d964f0a0
add vctk hifigan, test=tts
3 years ago
TianYuan
1410a84054
add aishell3 hifigan, test=tts
3 years ago
xiongxinlei
7db7eb8993
add extract audio embedding api, test=doc
3 years ago
xiongxinlei
2d89c80e6f
add waveform augment pipeline, test=doc
3 years ago
qingen
ff47ab1779
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
xiongxinlei
016ed6d69c
repair the code according to the part comment, test=doc
3 years ago
xiongxinlei
97ec01260b
add speaker verification using cosine score, test=doc
3 years ago
xiongxinlei
1f74af110b
add training log info and comment, test=doc
3 years ago
xiongxinlei
4648059b5f
add training process for sid, test=doc
3 years ago
xiongxinlei
7668f61422
add sid dataloader for training, test=doc
3 years ago
xiongxinlei
6af2bc3d5b
add sid loss wrapper for voxceleb, test=doc
3 years ago
xiongxinlei
57c4f4a68c
add sid learning rate and training model
3 years ago
xiongxinlei
3a943ca95b
repair the variable name bug
3 years ago
xiongxinlei
0780d181d2
remove personal code test=doc
3 years ago
xiongxinlei
7ef60ebae2
add voxceleb1 data prepare
3 years ago
Honei
1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan
175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
3 years ago
TianYuan
641984ae30
add code annotation, test=tts
3 years ago
TianYuan
cb07bd2a94
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
qingen
d01d7fedce
[wip][vec] add clustering of vectors #1304
3 years ago
qingen
c962eec51d
[wip][vec] add clustering of vectors #1304
3 years ago
TianYuan
66a8beb27f
update text frontend, test=tts
3 years ago
xiongxinlei
35b7968ed1
remove invalid directory
3 years ago
xiongxinlei
16108de71e
add voxceleb1 dataset prepare process
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
Junkun
97e2015242
update run sh
3 years ago
Junkun
1169ffa480
add config files
3 years ago
Junkun
70166c2026
mv json_to_manifest to utils
3 years ago
Junkun
af2b20650e
update mustc v1
3 years ago
Junkun
0165c450ad
update script
3 years ago
TianYuan
b5a7c2d080
update readme for aishell3_vc0, test=doc
3 years ago
TianYuan
ebef10efcd
Update README.md
3 years ago
TianYuan
a8cac30bd6
Create README.md
3 years ago
Hui Zhang
d07fd5bd43
update data info
3 years ago
TianYuan
b6fbacdd9b
Merge pull request #1436 from yt605155624/rename_tacotron2
[TTS]Rename tacotron2
3 years ago
TianYuan
e7adc854d4
Update README.md
3 years ago
TianYuan
6a19de44ff
Update README.md
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
Hui Zhang
8eb708e754
Merge pull request #1417 from Honei/develop
[vector] voxceleb trial: convert voxceleb trial to kaldi format trial
3 years ago
TianYuan
25347bb6a3
rename tacotron2, test=tts
3 years ago
xiongxinlei
d7a09ff71c
repair the annotation of make voxceleb trial script
3 years ago
TianYuan
ea29275acd
fix dead links, test=doc
3 years ago
TianYuan
270fe4fdfc
Merge pull request #1430 from kslz/develop
Add Chinese docs
3 years ago
Hui Zhang
dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
[TTS] add wavernn
3 years ago
TianYuan
0747600c95
[TTS]add ljspeech new tacotron2 ( #1416 )
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
TianYuan
348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts ( #1419 )
3 years ago
lizi
be2fc2cc11
Modify typesetting, test=doc
3 years ago
lizi
5e34cdbd6e
Modify typesetting, test=doc
3 years ago
lizi
06e8bdf0d7
add Chinese doc for "FastSpeech2 with CSMSC", test=doc
3 years ago
Honei
940602adbe
convert voxceleb trial to kaldi format trial
3 years ago
TianYuan
1b0c034134
update wavernn, test=tts
3 years ago
TianYuan
89e69ee10e
[TTS]fix tacotron2 dygraph to static ( #1414 )
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
3 years ago
TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc] add tacotron2 readme ( #1385 )
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
qingen
7413c9e48a
Merge pull request #1335 from qingen/test-pr
[vector] add DER scripts, AMI data preparation scripts
3 years ago
qingen
9c2a23e15e
[vector] add AMI data preparation scripts
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
* fix the destructor problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
qingen
1899200cae
[vector] add AMI data preparation scripts
3 years ago
qingen
9d32f62f48
[vector] add AMI data preparation scripts
3 years ago
TianYuan
49fd55dc16
Merge pull request #1366 from Jackwaterveg/fix
[Others]fix g2p
3 years ago
TianYuan
9764535d3d
Update run.sh
3 years ago
Jackwaterveg
89a5c4ec5b
Update run.sh
3 years ago
huangyuxin
baccedee54
fix g2p, test=doc
3 years ago
Hui Zhang
b4f621b9d5
add esc50 reference
3 years ago
Jackwaterveg
2082b89d12
Update chunk_decode.yaml
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
TianYuan
d50d195145
update frontend readme, test=doc
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
[TTS] Add support for finetuning speedyspeech
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
Jackwaterveg
e7189b216c
Update chunk_decode.yaml
3 years ago
Jerryuhoo
75c2bd5faf
fix link_wav.py path, test=tts
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
Jackwaterveg
9c1e098693
[Asr][Config] fix config ( #1293 )
* fix config, test=asr
* fix config, test=doc_fix
3 years ago
Jackwaterveg
494d6f8b6b
[ASR][Config]fix config ( #1290 )
* fix config, test=asr
* fix config, test=doc_fix
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
Jerryuhoo
d6e9b76e76
change link_wav.py path, test=tts
3 years ago
Jerryuhoo
ea8977555f
Simplify link_wav.py path
3 years ago
Jerryuhoo
c94f346207
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
3 years ago
Jerryuhoo
76f98c6f69
add --dataset and --rootdir to voc3 finetune script
add --dataset and --rootdir to voc3 finetune script, also remove --speaker-dict in voc5.
3 years ago
Jerryuhoo
e239ee1cd2
add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
TianYuan
680eac02b9
[tts]Update mb melgan ( #1272 )
* update mb melgan
* update mb melgan, test=tts
3 years ago
Jackwaterveg
66a615555d
revise aishell_asr0 Result, test=doc_fix
3 years ago
Hui Zhang
2316e5cb8a
Update README.md
3 years ago
TianYuan
98ce69d0aa
Merge pull request #1259 from jerryuhoo/develop
[TTS]Add multi-speaker support for the SpeedySpeech model
3 years ago
huangyuxin
455bf477a4
fix some bug, test=asr
3 years ago