Hui Zhang
ef37f73a01
fix cnn cache dy2st shape
2 years ago
0x45f
e21cceea51
Remove blank line
2 years ago
0x45f
e6ac8881f1
Fix comments
2 years ago
0x45f
ac680aa783
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
0x45f
294b7b00bd
Support dy2st for conformer
2 years ago
huangyuxin
75997d8277
merge
2 years ago
Hui Zhang
e81849277e
att cache for streaming asr
2 years ago
Hui Zhang
fb40602d94
refactor attention cache
2 years ago
huangyuxin
05d41523ad
Merge branch 'develop' into webdataset
2 years ago
huangyuxin
92d1d08b9a
fix scripts
2 years ago
TianYuan
e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
...
[CLI]replace logger.info with logger.debug in cli, change default log leve…
2 years ago
TianYuan
496e2dd14b
fix Pillow's version
2 years ago
TianYuan
bc93bffbb4
replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
huangyuxin
98cfdc4c05
fix nxpu
2 years ago
huangyuxin
7463df89c5
fix nxpu
2 years ago
huangyuxin
6ec6921255
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg
6598216b2f
Merge branch 'develop' into webdataset
2 years ago
huangyuxin
9b5655f6ad
fix 'print log' in cli
2 years ago
huangyuxin
aa12b9ab52
replace s2t.transform with audio.transform
2 years ago
huangyuxin
0c7abc1f17
add training scripts
2 years ago
huangyuxin
c7a7b113c8
support multi-gpu training with webdataset
2 years ago
KP
bf056c013d
Refactor paddleaudio to paddlespeech.audio
2 years ago
Hui Zhang
dfdf450b22
fix #2013 ; and format
2 years ago
huangyuxin
e48e1d5e81
fix tiny and local script, test=asr
3 years ago
huangyuxin
47dd61e5b2
refactor ds2, cli, server
3 years ago
TianYuan
aa3d151d1d
Merge pull request #1994 from Jackwaterveg/develop
...
[ASR] not install ctc on win
3 years ago
huangyuxin
10819e0fa2
not install ctc on win, test=asr
3 years ago
Hui Zhang
42fba661c9
more detail of copyright
3 years ago
Hui Zhang
3d88ac4e68
Merge pull request #1950 from Jackwaterveg/develop
...
[ASR] fix pad_sequence, remove paddle.size, paddle.static.Variable.size, using paddle.shape()
3 years ago
Hui Zhang
f07f57a3a8
Merge pull request #1945 from PaddlePaddle/asr_line
...
[server][asr] refactor asr streaming server and remove useless code
3 years ago
huangyuxin
b23bde8ec5
tensor.shape => paddle.shape(tensor)
3 years ago
huangyuxin
4c09927f61
fix
3 years ago
huangyuxin
e1888f9ae6
remove size,test=asr
3 years ago
Zhangjingyu06
acb19cf465
deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06
b0eaeccd67
deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06
1e91f7da35
deepspeech2 modify for kunlun
3 years ago
huangyuxin
1cdd41bd03
fix pad_sequence, test=asr
3 years ago
Hui Zhang
c15278ed80
format
3 years ago
xiongxinlei
b1ef434983
update the max len compute method, test=doc
3 years ago
xiongxinlei
0ea39f837b
add asr time limit configuration, test=doc
3 years ago
root
9f389a7a33
support cpu, test=asr
3 years ago
root
864041085f
replace dist.spawn with dist.launch, test=asr
3 years ago
Hui Zhang
fc96130fdc
fix speechx core dump when stop immediately after start
3 years ago
Hui Zhang
ebde26030b
patch func to var
3 years ago
huangyuxin
0df8d80833
remove logfbank from python_speech_features, test=asr
3 years ago
huangyuxin
fcdaef6cb4
replace fbank, test=asr
3 years ago
huangyuxin
5912ba53e4
fix log_interval and lr when resume training, test=asr
3 years ago
Hui Zhang
91e24b0480
format code
3 years ago
Jackwaterveg
85b50c4700
Merge pull request #1741 from Jackwaterveg/debug
...
[ASR] remove redundant log
3 years ago
Hui Zhang
c7d9b11529
format
3 years ago
huangyuxin
8e37a7c7f0
remove redundant log, test=doc
3 years ago
Jackwaterveg
8d1ee8262e
Merge branch 'develop' into CER
3 years ago
xiongxinlei
ff4ddd229e
fix the useless code, test=doc
3 years ago
xiongxinlei
9c03280ca6
remove debug info, test=doc
3 years ago
xiongxinlei
48fa84bee9
fix the asr online client bug, return None, test=doc
3 years ago
huangyuxin
6e80618e3d
add ds2
3 years ago
Honei
9d20a10b5a
Merge branch 'develop' into server
3 years ago
Hui Zhang
0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
...
[asr][websocket]fix the ws send bug, cache buffer, test=doc
3 years ago
xiongxinlei
efc269b75f
remove useless code, test=doc
3 years ago
xiongxinlei
89b102a7dd
fix the ws send bug, cache buffer, test=doc
3 years ago
xiongxinlei
d21ccd0287
add conformer online server, test=doc
3 years ago
buchongyu
48358055d0
Fix the spelling of the word "hack"
3 years ago
huangyuxin
ca860e3d2f
supplement note
3 years ago
Hui Zhang
cb39777a60
format code
3 years ago
Hui Zhang
61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
...
[websocket] added online asr engine
3 years ago
WilliamZhang06
d847fe29cf
added online asr engine , test=doc
3 years ago
Hui Zhang
943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
...
[ASR] Replace kaldi_fbank with paddleaudio
3 years ago
huangyuxin
f47146af49
add docstring, test=asr
3 years ago
huangyuxin
ed490b66cb
update spectrogram, test=asr
3 years ago
Hui Zhang
84d712d493
format code, test=doc
3 years ago
huangyuxin
0ffe1f9114
replace kaldi_fbank with paddleaudio
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
huangyuxin
a4f5a68074
fix some format, test=asr
3 years ago
huangyuxin
d53e1163a6
update the code, test=asr
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
Hui Zhang
75098698d8
format,test=doc
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix version, test=asr
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructor problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_party ( #1313 )
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_method and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
Hui Zhang
45832f6770
fix default dist_sampler to False
3 years ago
Hui Zhang
3a2db414e6
format code
3 years ago
Hui Zhang
6f651d762e
fix batch sampler set_epoch when epoch starts
3 years ago
huangyuxin
8b63485ce3
fix some bug, test=asr
3 years ago
huangyuxin
3e2cc898cb
remove default cfg and fix some bugs,test=asr
3 years ago
huangyuxin
a1d8ab0f99
merge the develop
3 years ago
huangyuxin
c907a8deda
change all recipes
3 years ago
Hui Zhang
c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributedBatchSampler ( #1242 )
...
* batchsampler or distributedbatchsampler
* format
3 years ago
Junkun Chen
420709e5ce
[st] Distributed sampler and new dataloader with MIMO ( #1239 )
...
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
3 years ago
huangyuxin
41eeed0450
add librispeech asr1
3 years ago
huangyuxin
2c5902d7c5
rename decoding to decode
3 years ago
Hui Zhang
bb2a370b23
[asr] remove useless conf of librispeech ( #1227 )
...
* remove useless conf
* format code
* update conf
* update conf
* update conf
3 years ago
huangyuxin
c40b6f4062
refactor the train and test config,test=asr
3 years ago
TianYuan
5692b0ff04
fix log for t2s ( #1219 )
3 years ago
KP
d362d28d35
Remove logging file in cli api.
3 years ago
Hui Zhang
db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch ( #1191 )
3 years ago
Hui Zhang
d852aee2ff
[asr] logfbank with dither ( #1179 )
...
* fix logfbank dither
* format
3 years ago
Jackwaterveg
2bccde3def
update the version of ctcdecoders and feat,test=doc_fix ( #1155 )
3 years ago
Jackwaterveg
0151f2463f
fix bug of pad_sequence in u2,test=asr ( #1153 )
3 years ago
Jackwaterveg
68164dd39f
[asr]rename test_hub to test_wav ( #1132 )
...
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
3 years ago
Jackwaterveg
5b446f6321
[Config]clear the u2 decode config for asr ( #1107 )
...
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
3 years ago
Hui Zhang
51d7a07c6d
format and fix pre-commit ( #1120 )
3 years ago
Hui Zhang
764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang
b1c80c45e0
remove ctc grad norm type in config
3 years ago
huangyuxin
1d4002409f
separate the sox and soxbindings with the requirements
3 years ago
TianYuan
2189b46004
add tts cli
3 years ago
huangyuxin
9fe0beee54
fix the bug: miss import after install
3 years ago
huangyuxin
cea5ffe0e4
refactor the code
3 years ago
huangyuxin
ed12db61a6
Separate the ctcdecoders
3 years ago
Hui Zhang
0818c1601d
add __init__.py
3 years ago
Junkun
4e31a4445d
eval mode
3 years ago
Hui Zhang
4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
...
[benchmark]fix the benchmark
3 years ago
Junkun
3a14b82844
minor
3 years ago
Junkun
f50a2ab4ca
fix bugs
3 years ago
huangyuxin
cb383a39c3
fix the benchmark
3 years ago
huangyuxin
d0bf506fee
fix the load checkpoint
3 years ago
Hui Zhang
39228864bb
format code
3 years ago
Hui Zhang
d395c2b8e3
jsonlines read manifest file
3 years ago
Hui Zhang
7554b6107a
using visualdl; fix read_manifest
3 years ago
Junkun
d2fab3238b
fix bugs
3 years ago
Junkun
cdd0845127
add translate function
3 years ago
huangyuxin
895a086fdd
rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
Hui Zhang
fe83adfbcb
nproc to ngpu
3 years ago
Hui Zhang
789471bfca
test wav for u2
3 years ago
Jackwaterveg
09931d2ccc
Merge pull request #1019 from zh794390558/feat
...
[bugfix] Kaldi Feature using dither in train
3 years ago
huangyuxin
8aebfeac81
fix the pre-commit
3 years ago
Hui Zhang
56480e1033
fix format
3 years ago
Hui Zhang
7ec0ed4aaf
kaldi feat dither when train
3 years ago
Hui Zhang
2ba3f00bbd
Merge branch 'develop' into datapipe
3 years ago
Hui Zhang
b944418d6f
new format data support ds2/st
3 years ago
Hui Zhang
0defc658e1
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
3 years ago
Hui Zhang
d2a05df02e
Merge pull request #1014 from Jackwaterveg/auto_log
...
[asr]hide the auto_log
3 years ago
huangyuxin
fb6974f950
update the auto_log
3 years ago