Jackwaterveg
8d1ee8262e
Merge branch 'develop' into CER
3 years ago
xiongxinlei
ff4ddd229e
fix the unuseful code, test=doc
3 years ago
xiongxinlei
9c03280ca6
remove debug info, test=doc
3 years ago
xiongxinlei
48fa84bee9
fix the asr online client bug, return None, test=doc
3 years ago
huangyuxin
6e80618e3d
add ds2
3 years ago
Honei
9d20a10b5a
Merge branch 'develop' into server
3 years ago
Hui Zhang
0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
...
[asr][websocket]fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei
efc269b75f
remove unuseful code, test=doc
3 years ago
xiongxinlei
89b102a7dd
fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei
d21ccd0287
add conformer online server, test=doc
3 years ago
buchongyu
48358055d0
修改hack 单词拼写错误
3 years ago
huangyuxin
ca860e3d2f
supplement note
3 years ago
Hui Zhang
cb39777a60
format code
3 years ago
Hui Zhang
61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
...
[websocket] added online asr engine
3 years ago
WilliamZhang06
d847fe29cf
added online asr engine , test=doc
3 years ago
Hui Zhang
943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
...
[ASR] Replace kaidi_fbank with paddleaudio
3 years ago
huangyuxin
f47146af49
add docstring, test=asr
3 years ago
huangyuxin
ed490b66cb
update spectrogram, test=asr
3 years ago
Hui Zhang
84d712d493
format code, test=doc
3 years ago
huangyuxin
0ffe1f9114
replace kaidi_fbank with paddleaudio
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
huangyuxin
a4f5a68074
fix some format, test=asr
3 years ago
huangyuxin
d53e1163a6
update the code, test=asr
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
Hui Zhang
75098698d8
format,test=doc
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructer problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_metod and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
Hui Zhang
45832f6770
fix default dist_samlper to False
3 years ago
Hui Zhang
3a2db414e6
format code
3 years ago
Hui Zhang
6f651d762e
fix batch sampler set_epoch when epcoh start
3 years ago
huangyuxin
8b63485ce3
fix some bug, test=asr
3 years ago
huangyuxin
3e2cc898cb
remove default cfg and fix some bugs,test=asr
3 years ago
huangyuxin
a1d8ab0f99
merge the develop
3 years ago
huangyuxin
c907a8deda
change all recipes
3 years ago
Hui Zhang
c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler ( #1242 )
...
* batchsampler or distributebatchsampler
* format
3 years ago
Junkun Chen
420709e5ce
[st] Distributed sampler and new dataloader with MIMO ( #1239 )
...
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
3 years ago
huangyuxin
41eeed0450
add librispeech asr1
3 years ago
huangyuxin
2c5902d7c5
rename decoding to decode
3 years ago
Hui Zhang
bb2a370b23
[asr] remove useless conf of librispeech ( #1227 )
...
* remve useless conf
* format code
* update conf
* update conf
* update conf
3 years ago
huangyuxin
c40b6f4062
refactor the train and test config,test=asr
3 years ago
TianYuan
5692b0ff04
fix log for t2s ( #1219 )
3 years ago
KP
d362d28d35
Remove logging file in cli api.
3 years ago
Hui Zhang
db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch ( #1191 )
3 years ago
Hui Zhang
d852aee2ff
[asr] logfbank with dither ( #1179 )
...
* fix logfbank dither
* format
3 years ago
Jackwaterveg
2bccde3def
update the version of ctcdecoders and feat,test=doc_fix ( #1155 )
3 years ago
Jackwaterveg
0151f2463f
fix bug of pad_sequence in u2,test=asr ( #1153 )
3 years ago
Jackwaterveg
68164dd39f
[asr]rename test_hub to test_wav ( #1132 )
...
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
3 years ago
Jackwaterveg
5b446f6321
[Config]clear the u2 decode config for asr ( #1107 )
...
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
3 years ago
Hui Zhang
51d7a07c6d
format and fix pre-commit ( #1120 )
3 years ago
Hui Zhang
764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang
b1c80c45e0
remove ctc grad norm type in config
3 years ago
huangyuxin
1d4002409f
separate the sox and soxbindings with the requirements
3 years ago
TianYuan
2189b46004
add tts cli
3 years ago
huangyuxin
9fe0beee54
fix the bug: miss import after install
3 years ago
huangyuxin
cea5ffe0e4
refactor the code
3 years ago
huangyuxin
ed12db61a6
Separate the ctcdecoders
3 years ago
Hui Zhang
0818c1601d
add __init__.py
3 years ago
Junkun
4e31a4445d
eval mode
3 years ago
Hui Zhang
4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
...
[benchmark]fix the benchmark
3 years ago
Junkun
3a14b82844
minor
3 years ago
Junkun
f50a2ab4ca
fix bugs
3 years ago
huangyuxin
cb383a39c3
fix the benchmark
3 years ago
huangyuxin
d0bf506fee
fix the load checkpoint
3 years ago
Hui Zhang
39228864bb
format code
3 years ago
Hui Zhang
d395c2b8e3
jsonlines reade manifest file
3 years ago
Hui Zhang
7554b6107a
using visualdl; fix read_manifest
3 years ago
Junkun
d2fab3238b
fix bugs
3 years ago
Junkun
cdd0845127
add translate function
3 years ago
huangyuxin
895a086fdd
rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
Hui Zhang
fe83adfbcb
nproc to ngpu
3 years ago
Hui Zhang
789471bfca
test wav for u2
3 years ago
Jackwaterveg
09931d2ccc
Merge pull request #1019 from zh794390558/feat
...
[bugfix] Kaldi Feature using dither in train
3 years ago
huangyuxin
8aebfeac81
fix the prc-commit
3 years ago
Hui Zhang
56480e1033
fix format
3 years ago
Hui Zhang
7ec0ed4aaf
kaldi feat dither when train
3 years ago
Hui Zhang
2ba3f00bbd
Merge branch 'develop' into datapipe
3 years ago
Hui Zhang
b944418d6f
new format data support ds2/st
3 years ago
Hui Zhang
0defc658e1
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
3 years ago
Hui Zhang
d2a05df02e
Merge pull request #1014 from Jackwaterveg/auto_log
...
[asr]hidden the auto_log
3 years ago
huangyuxin
fb6974f950
update the auto_log
3 years ago
Hui Zhang
638b96bf07
check if cmvn_file in config for u2
3 years ago
huangyuxin
f646d4c3a1
renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
huangyuxin
ca06b91fc4
renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
Hui Zhang
3bd87bc379
add wenet lincense
3 years ago
Hui Zhang
1ae1ead80f
more install scripts
3 years ago
Hui Zhang
51a6845564
Merge pull request #985 from Jackwaterveg/benchmark
...
revise the benchmark
3 years ago
huangyuxin
843ea1c12e
revise the benchmark
3 years ago
Hui Zhang
080b0431f4
format code
3 years ago
Junkun
7c8843448c
add word reward into beam search.
3 years ago
Hui Zhang
9a71c091c5
remove debug info and format code
3 years ago
Hui Zhang
8b0e344c69
fix logfbank using PCM16
3 years ago
Hui Zhang
7ceef6c3f5
format code
3 years ago
Hui Zhang
f9221b4b74
fix ctc align
3 years ago
Hui Zhang
fb853167d3
format code
3 years ago
Hui Zhang
18d9abc7a0
add sox speed pertrub
3 years ago
Hui Zhang
000fac53fe
Merge pull request #966 from Jackwaterveg/dev
...
change the lm dataset dir, add the 'LM_BIN_DIR' in s2 path.sh
3 years ago
Hui Zhang
6a7e0265cd
add josn global cmvn
3 years ago
Hui Zhang
9cdd2643b1
fix bug for batch dataloader using
3 years ago
Hui Zhang
69bccb4f02
fix ctc align
3 years ago
Hui Zhang
69055698a2
transformer using batch data loader
3 years ago
huangyuxin
d647cde870
change the lm dataset dir
3 years ago
Hui Zhang
38cf56295a
fix reference format
3 years ago
Hui Zhang
c463a00f81
add reference code license
3 years ago
Hui Zhang
2a66c2c13b
format code
3 years ago
Hui Zhang
e2bcaee4f1
merge deepspeech, parakeet and text_processing into paddlespeech
3 years ago