huangyuxin
2f92a5d9a7
add conda init, use gcc 8.4.0
3 years ago
huangyuxin
e84690f6f0
add conda install pipeline
3 years ago
Hui Zhang
3d77f49c0b
using pip install -e ., remove setup.sh
3 years ago
Hui Zhang
e2bcaee4f1
merge deepspeech, parakeet and text_processing into paddlespeech
3 years ago
Hui Zhang
a107b75bac
transform; librispeech/s2 data process ok
3 years ago
Hui Zhang
ba28c92b41
setup with ubuntu deps
3 years ago
Hui Zhang
69bd17dcb2
refactor raw ctc decoder into ctcdecoder
3 years ago
Hui Zhang
cc813b18d3
fix install and format code
3 years ago
Hui Zhang
74f987aa8f
setup.sh with tools/ make
3 years ago
Hui Zhang
571b13c53f
add dur utils
3 years ago
Hui Zhang
b076d3e9bb
fix autolog install; only use autolog in test, or it will hang
3 years ago
Jackwaterveg
1690bd5911
added the way of installing auto_log into setup.sh
3 years ago
Hui Zhang
538bf271eb
chinese char/word ngram lm ( #613 )
...
* add ngram lm egs
* add zhon repo
* install kenlm, zhon
* format
* add chinese_text_normalization repo
* add ngram lm egs
4 years ago
Hui Zhang
71e046b0ba
E2E/Streaming Transformer/Conformer ASR ( #578 )
...
* add cmvn and label smoothing loss layer
* add layer for transformer
* add glu and conformer conv
* add torch compatible hack, mask funcs
* not hack size since it exists
* add test; attention
* add attention, common utils, hack paddle
* add audio utils
* conformer batch padding mask bug fix #223
* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2
* fix ci
* fix ci
* add encoder
* refactor egs
* add decoder
* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils
* refactor docs
* add fix
* fix readme
* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs
* fix docstring
* refactor data feed order
* add u2 model
* refactor cmvn, test
* add utils
* add u2 config
* fix bugs
* fix bugs
* fix autograd maybe has problem when using inplace operation
* refactor data, build vocab; add format data
* fix text featurizer
* refactor build vocab
* add fbank, refactor feature of speech
* refactor audio feat
* refactor data prepare
* refactor data
* model init from config
* add u2 bins
* flake8
* can train
* fix bugs, add coverage, add scripts
* test can run
* fix data
* speed perturb with sox
* add spec aug
* fix for train
* fix train logic
* fix logger
* log valid loss, time dataset process
* using np for speed perturb, remove some debug log of grad clip
* fix logger
* fix build vocab
* fix logger name
* using module logger as default
* fix
* fix install
* reorder imports
* fix board logger
* fix logger
* kaldi fbank and mfcc
* fix cmvn and print params
* fix add_eos_sos and cmvn
* fix cmvn compute
* fix logger and cmvn
* fix subsampling, label smoothing loss, remove useless
* add notebook test
* fix log
* fix tb logger
* multi gpu valid
* fix log
* fix log
* fix config
* fix compute cmvn, need paddle 2.1
* add cmvn notebook
* fix layer tools
* fix compute cmvn
* add rtf
* fix decoding
* fix layer tools
* fix log, add avg script
* more avg and test info
* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;
* add vimrc
* refactor tiny script, add transformer and stream conf
* spm demo; librispeech scripts and confs
* fix log
* add librispeech scripts
* refactor data pipe; fix conf; fix u2 default params
* fix bugs
* refactor aishell scripts
* fix test
* fix cmvn
* fix s0 scripts
* fix ds2 scripts and bugs
* fix dev & test dataset filter
* fix dataset filter
* filter dev
* fix ckpt path
* filter test set, since librispeech causes OOM; all test WER will be worse because train and test are mismatched
* add comment
* add syllable doc
* fix ds2 configs
* add doc
* add pypinyin tools
* fix decoder using blank_id=0
* mmseg with pybind11
* format code
4 years ago
Hui Zhang
3a2de9e461
Fix ( #594 )
...
* fix install
* rm feature request
4 years ago
Hui Zhang
d4e84f9b9d
fix doc link and enhance install ( #570 )
...
* fix doc link
* fix install
* fix install doc
* fix typo
* fix lm doc
4 years ago
Hui Zhang
d7e753546a
Support paddle 2.x ( #538 )
...
* 2.x model
* model test pass
* fix data
* fix soundfile with flac support
* one thread dataloader test pass
* export feature size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist
* add venv; test under venv
* fix unittest; train and valid
* add train and config
* add config and train script
* fix ctc cuda memcopy error
* fix imports
* fix train valid log
* fix dataset batch shuffle shift start from 1
fix rank_zero_only decorator error
close tensorboard when train over
add decoding config and code
* test process can run
* test with decoding
* test and infer with decoding
* fix infer
* fix ctc loss
lr schedule
sortagrad
logger
* aishell egs
* refactor train
add aishell egs
* fix dataset batch shuffle and add batch sampler log
print model parameter
* fix model and ctc
* sequence_mask makes all inputs zero, which causes the grad to be zero; this is a bug in LessThanOp
add grad clip by global norm
add model train test notebook
* ctc loss
remove run prefix
using ord value as text id
* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer
* fix tester
* add lr_decay
refactor code
* fix tools
* fix ci
add tune
fix gru model bugs
add dataset and model test
* fix decoding
* refactor repo
fix decoding
* fix musan and rir dataset
* refactor io, loss, conv, rnn, gradclip, model, utils
* fix ci and import
* refactor model
add export jit model
* add deploy bin and test it
* rm useless egs
* add layer tools
* refactor socket server
new model from pretrain
* remove useless
* fix instability loss and grad nan or inf for librispeech training
* fix sampler
* fix libri train.sh
* fix doc
* add license on cpp
* fix doc
* fix libri script
* fix install
* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49
4 years ago
Hui Zhang
126677a35c
support py3
4 years ago
Yibing Liu
9f0c3467e5
fix decoders: force indices in FST starting from one & add version check in setup
7 years ago
Yibing Liu
1471103daa
use cd instead of pushd in setup.sh
7 years ago
Yibing Liu
e92d01e562
disable the make output of libsndfile in setup
7 years ago
Yibing Liu
acf2072ed4
Merge branch 'develop' of upstream into ctc_decoder_deploy
7 years ago
yangyaming
1b206b3390
fix bugs for model.py and demo_server.py.
7 years ago
Yibing Liu
a18e6a7eda
refine by following review comments
7 years ago
Luo Tao
638fae13f4
use wget to download
7 years ago
Xinghai Sun
526e18b119
Add function docs for layer.py and model.py and update other details.
7 years ago
Luo Tao
b72b70e54c
add soundfile read/write unit test
7 years ago
Luo Tao
c7676286ab
install libsndfile from /usr to thirdparty
7 years ago
xushaoyong
286228c219
Merge pull request #127 from pkuyym/fix-soundfile
...
Refine SoundFile installation process.
7 years ago
yangyaming
6d6cdf4057
Refine SoundFile installation process.
...
1. Install libsndfile first.
2. Install SoundFile using pip.
7 years ago
Xinghai Sun
13f708739b
Improve audio featurizer and add shift augmentor.
...
1. Improve audio featurizer.
2. Add shift augmentor.
3. Update default argument to be the current best suggestion.
4. Add checkpoints with pass id.
7 years ago
yangyaming
a5dcd23bf2
Follow comments.
7 years ago
yangyaming
d35b747175
Fix ci following:
...
1. Unify the dependency installation process in setup.sh.
2. Change the version of package scipy from 0.13.0b1 to 0.13.1
7 years ago