# [SentencePiece Model](https://github.com/google/sentencepiece)

Train an `spm` (SentencePiece) model for the English tokenizer.

```
bash run.sh
```