Commit Graph

331 Commits (a107b75bac60112fe535aef26e6981606675eba7)

Author SHA1 Message Date
Hui Zhang 6e4a3aff07 float_mul_bool type promote, lhs type promote to rhs type, https://github.com/PaddlePaddle/Paddle/pull/29265
3 years ago
Hui Zhang c29ee83a46 add timer
3 years ago
Hui Zhang 244132c1c4 fix activation
3 years ago
Hui Zhang 7e136d0893 support no_sync for backward; ds support accum grad
3 years ago
Hui Zhang 184d30dd9c relase librispeech audio max len to 30 second
3 years ago
huangyuxin 04d9db199f add blank_id parameter
3 years ago
Hui Zhang f54dc983b6 using bw rnn in ds2
3 years ago
Hui Zhang 797ca389fc paddle support some bool op
3 years ago
Hui Zhang e7b71d7860 import ctcdecoder when needed
3 years ago
Hui Zhang 7181e427af
Merge pull request #786 from Jackwaterveg/ds2_online
3 years ago
Jackwaterveg 5890c84c91
Merge pull request #793 from PaddlePaddle/seed
3 years ago
Hui Zhang 341038b626 ds2 offline cer 6p4287
3 years ago
TianYuan 5972955f62 fix glu
3 years ago
huangyuxin 2451a177b0 fix paddling len bug
3 years ago
huangyuxin 317ffea5e5 simplify the code
3 years ago
huangyuxin 2b3b985227 fix paddling len
3 years ago
huangyuxin 1f050a4d01 make the code simple
3 years ago
huangyuxin 7ab022e1cc Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
3 years ago
Hui Zhang 673cc4a081 seed all with log; and format
3 years ago
huangyuxin 2e77c3c378 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
3 years ago
huangyuxin 0d0b581181 add static_forward_online and static_forward_offline
3 years ago
Hui Zhang 14ac780658 fix trainer when dataloader not using batch_sampler
3 years ago
Hui Zhang cfdca210ff chaner style updater
3 years ago
huangyuxin 92617f0802 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
3 years ago
huangyuxin db042a2974 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
3 years ago
Hui Zhang d1db859657 fix dataloader pickle bugs
3 years ago
huangyuxin 564b6b6824 fix conflict
3 years ago
huangyuxin 40466ef669 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online
3 years ago
huangyuxin b3d27e4bbb merge the develop
3 years ago
huangyuxin b585684bf4 add function: test export
3 years ago
Hui Zhang 8215bd0e79 fix load vocab; zero W for not warptime
3 years ago
Hui Zhang fd3491ba1b fix dataloader batchsize and minibatchsize
3 years ago
Hui Zhang b56f899b76
Merge pull request #782 from PaddlePaddle/espnet
3 years ago
huangyuxin 2d3b2aed05 add seed in argparse
3 years ago
Hui Zhang 561d5cf085 refactor feature, dict and argument for new config format
3 years ago
TianYuan 2c75c923b9 fix_mfa
3 years ago
Jackwaterveg 9ac6d65a2a
Merge pull request #780 from Jackwaterveg/ds2_online
3 years ago
Hui Zhang 27daa92a81 using to_static
3 years ago
Hui Zhang aab02997f9 fix specaug config
3 years ago
huangyuxin 9068c0d4f9 Merge branch 'HEAD_1' into ds2_online
3 years ago
Hui Zhang 782f6be42d (D,T) to (T, D); time warp
3 years ago
Hui Zhang d9a3864072
Merge pull request #776 from PaddlePaddle/aug
3 years ago
Hui Zhang 50f10f37ae support replace with mean by aug
3 years ago
huangyuxin d065824bd3 fix the bug of 'import path error' for ds2
3 years ago
huangyuxin 718407b77d add seed
3 years ago
Hui Zhang d64cdc7838 fix
3 years ago
Hui Zhang c484d537c2 add assert
3 years ago
Hui Zhang a3e86dd8b5 fix call
3 years ago
Hui Zhang c81743403a fix
3 years ago
Hui Zhang c09b0e8940 fix specaug
3 years ago
Hui Zhang 4725bace4e fix
3 years ago
Hui Zhang 9de0343807 fix augment
3 years ago
Hui Zhang 9dace62581 fix augmentation
3 years ago
Hui Zhang 0ab299a842 test bin
3 years ago
Hui Zhang ab23eb5710 fix for kaldi
3 years ago
Hui Zhang f05f367cc5
Merge pull request #756 from PaddlePaddle/filter
3 years ago
Hui Zhang 0c4caa65d5 fix docstring
3 years ago
Hui Zhang 4af774d8f0 add dataloader; check augmenter base class type
3 years ago
Hui Zhang 64cf538e17 refactor converter
3 years ago
Hui Zhang 7d133368e5 fix bugs
3 years ago
Hui Zhang 44ec19317f refactor io
3 years ago
Hui Zhang 8939994d75 refactor augmentation interface
3 years ago
Hui Zhang 5ae639196c fix dataloader
3 years ago
Hui Zhang e3d73acd37 fix io; add test
3 years ago
Hui Zhang 4b5410eecd remove fixed hack api
3 years ago
Hui Zhang 86e42f3d21 more data utils
3 years ago
Hui Zhang 7b649af8d7 add batchfy
3 years ago
Hui Zhang 99dfe04515 test w/ all example
3 years ago
huangyuxin e1a2cfef7f fix the resume bug: the lr is not related to iteration, but epoch
3 years ago
huangyuxin 61fe292c47 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online
3 years ago
huangyuxin 718ae52e3f add from_config function to ds2_oneline and ds2
3 years ago
huangyuxin 7a3d164122 fix the bidirect rnn, add deepspeech2.yaml for aishell, tiny, librispeech
3 years ago
huangyuxin 85d5021475 reconstruct the exp/model.py and the model.export()
3 years ago
Hui Zhang e76123d418 rm useless
3 years ago
Hui Zhang 820b4db287 with all args for scheduler
3 years ago
Hui Zhang c4da9a7f3a filter key by class signature, no print tensor
3 years ago
Hui Zhang 3912c255ef support noam lr and opt
3 years ago
Hui Zhang 1cd4d4bf83 fix tiny conf and refactor optimizer and scheduler
3 years ago
Hui Zhang cc813b18d3 fix install and format code
3 years ago
huangyuxin 319228653e fix some small mistakes
3 years ago
huangyuxin 61d8540451 reconstruct the export function and the run.sh in aishell and librispeech
3 years ago
Junkun 515497ae1f refine the code
3 years ago
Junkun ac0ae57ef2 add collactor and evaluation code for ST
3 years ago
huangyuxin 722c55e4c5 reconstruct the rnn state, from list to tensor
3 years ago
huangyuxin 8f062cad6b fixed the small problems
3 years ago
Junkun 0323151912 add u2 st
3 years ago
huangyuxin 3fb9f6885a complete model export for ds2_online
3 years ago
Hui Zhang ccdfd5b342 format
3 years ago
huangyuxin e8a3913422 merge develop_ds2_online
3 years ago
huangyuxin 2f64ae6495 not change decoder
3 years ago
huangyuxin 6c484923a4 solve the conflicts
3 years ago
huangyuxin 4b5cbe9a12 ds2_online alignment, include prob_chunk_forward, prob_chunk_by_chunk_forward
3 years ago
huangyuxin fccecf9976 add strip for CUDA_VISIBLE_DEVICES
3 years ago
huangyuxin eacad8cf60 fix the bug: can not use the CPU to test the model
3 years ago
huangyuxin d398270f95 add chunk_by_chunk; preliminary tests pass
3 years ago
huangyuxin 2537221b61 Complete the modification according to the comments
3 years ago
huangyuxin 745df04f28 complete the pipline of tiny
3 years ago
huangyuxin e4ef8ed31e add the subsampling as conv
3 years ago
huangyuxin 6baf9f0620 get the deeppseech_online pipeline running end to end
3 years ago
Jackwaterveg 8716386464
Update model.py
3 years ago
huangyuxin 2c8d28111a fix some small mistakes
3 years ago
huangyuxin 5dd9e2f8ec do not expose online for now
3 years ago
huangyuxin 6079a2495d restore deepspeech2.py in ds2
3 years ago
huangyuxin 66c59cdeae adding pre-commit
3 years ago
huangyuxin 7b201ba457 add the online model; tests pass; the accompanying experiment pipeline code still needs to be built
3 years ago
huangyuxin 4f392e28b1 complete the encoder of ds_online
3 years ago
huangyuxin 269eecb3be create the ds2_online folder
3 years ago
huangyuxin 2cacbaf48e modify part of the LSTM and GRU code in deepspeech2.py; add LayerNorm
3 years ago
huangyuxin ce1e8ab5b6 change the dir
3 years ago
Hui Zhang fd8a4ec179
Merge pull request #729 from PaddlePaddle/fst
3 years ago
Hui Zhang ab5411ec16
Merge pull request #698 from yt605155624/thchs30_MFA
3 years ago
Hui Zhang 104743cccc TLG build pass
3 years ago
Hui Zhang b076d3e9bb fix autolog install; only autolog in test, or will hangup
3 years ago
Jackwaterveg ec19248f38 autoLog
3 years ago
Jackwaterveg 48e877375d change autoLog
3 years ago
huangyuxin d2db706384 added autolog
3 years ago
huangyuxin 3fffd57e8b added autoLog, but gpu_util is always 0.0%
3 years ago
huangyuxin fc88745782 revise load parameters
3 years ago
Hui Zhang 259781768e comment u2 model for easy understand
3 years ago
Hui Zhang 2820537fcc fix load param
3 years ago
TianYuan c0ee57d400 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into thchs30_MFA
3 years ago
TianYuan 7d4eff2b86 add MFA example for THCHS30
3 years ago
Hui Zhang 20117d99ee fix ckpt load
3 years ago
Hui Zhang 43b52082c3
Merge pull request #629 from PaddlePaddle/align
3 years ago
Hui Zhang 6ee67785f6 fix ctc alignment
3 years ago
Hui Zhang 717fe1e4bd
Merge pull request #680 from PaddlePaddle/checkpoint
3 years ago
Haoxin Ma c0f7aac8fc revise conf/*.yaml
3 years ago
Haoxin Ma 08b6213bc8 fix private function
3 years ago
Hui Zhang 7ec623f7ae Merge branch 'develop' into align
3 years ago
Haoxin Ma 6d92417edd optimize the function
3 years ago
Hui Zhang 9b3acddd5d fix conf for new datapipe; u2 export inputspec
3 years ago
Hui Zhang 9c0b6c5bb0 fix audio shape bug for audio len
3 years ago
Haoxin Ma 16210c0587 fix bug
3 years ago
Haoxin Ma 91e70a2857 multi gpus
3 years ago
Haoxin Ma 3965dbc2c3 runtime.py
3 years ago
Hui Zhang 90788b116d more comment; fix datapipe of align
3 years ago
Hui Zhang 1e2a5887aa Merge branch 'develop' into align
3 years ago
Haoxin Ma 340e622953 fix runtime and server
3 years ago
Haoxin Ma c753b9ddf2 fix runtime.py and server.py
3 years ago
Haoxin Ma d55e6b5a0a revise from_pretrained function
3 years ago
Haoxin Ma 8af2eb073a revise config
3 years ago
Haoxin Ma 68bcc46940 save best and test on tiny/s0
3 years ago
Haoxin Ma 3652b87f33 fix
3 years ago
Haoxin Ma 3a743f3717 fix pre-commit
3 years ago
Haoxin Ma 089a8ed602 fix deepspeech2/model.py and deepspeech2/config.py
3 years ago
Haoxin Ma 557427736e move redundant params
3 years ago
Haoxin Ma 698d7a9bdb move batch_size, work_nums, shuffle_method, sortagrad to collator
3 years ago
Haoxin Ma 89a00eabeb revise deepspeech/exps/u2/model.py
3 years ago
Haoxin Ma 6ee3033cc4 finish aishell/s0
3 years ago
Haoxin Ma 7bae32f384 revise example/ting/s1
3 years ago
Haoxin Ma b9110af9d3 feat_dim, vocab_size
3 years ago
Haoxin Ma 3855522ee3 config
3 years ago
Haoxin Ma a1c6ee5ca1 merge
3 years ago
Haoxin Ma 3d5f294363 dataset
3 years ago
Blank 875139ca04
Merge branch 'develop' into spec_aug
3 years ago
Hui Zhang 1cd88d2619
Merge pull request #657 from PaddlePaddle/add_utt
3 years ago
Haoxin Ma 2b51d612dd delete _instance_reader_creator func in dataset
3 years ago
Haoxin Ma b4bda290aa fix bugs
3 years ago
Haoxin Ma c706dfec2a fix bug
3 years ago
Haoxin Ma 279348d786 move process utt to collator
3 years ago
Haoxin Ma 8781ab58cf fix export and run.sh
3 years ago
Haoxin Ma a58b1cb30a add result output
3 years ago
Haoxin Ma f3c9f32c9a add utt to train and test 0607
3 years ago
Haoxin Ma c8368410e2 utt datapipeline
3 years ago
Hui Zhang 69dfc2a5fa fix mask for bool type; fix other
3 years ago
Hui Zhang 4acaaba349 replace list zip by stack
3 years ago
Hui Zhang d05ae8eeb0 Merge branch 'develop' into align
3 years ago
Hui Zhang 34689bd1df add crf
3 years ago
Hui Zhang b3bc451328
remove sequnce_mask and change ds2 export audio shape to [B,T,D] (#639)
3 years ago
Hui Zhang 92381451fb format
3 years ago
Hui Zhang 30aba26693 add align code
3 years ago
Hui Zhang 0a7958b3f1
add tarball utils (#626)
3 years ago
Hui Zhang 0a3a840bee
more decoding method (#618)
3 years ago
Hui Zhang 295f8bdad5
train ds2 model (#622)
3 years ago
Hui Zhang d0635c6592
using soxbinddings (#619)
3 years ago
Hui Zhang 71e046b0ba
E2E/Streaming Transformer/Conformer ASR (#578)
3 years ago
Hui Zhang e0a87a5ab1
batch average ctc loss (#567)
3 years ago
Hui Zhang 258307df9b
fix egs bugs (#552)
3 years ago
Hui Zhang 1539f3e0a3
Refactor CTC module, add embedding and fix log (#549)
4 years ago
Hui Zhang 00889bfaf2
add decoder reference doc (#547)
4 years ago
Hui Zhang d7e753546a
Support paddle 2.x (#538)
4 years ago