SpeechX -- All in One Speech Task Inference

Environment

We develop under:

docker - registry.baidubce.com/paddlepaddle/paddle:2.1.1-gpu-cuda10.2-cudnn7
os - Ubuntu 16.04.7 LTS
gcc/g++ - 8.2.0
cmake - 3.16.0

We make sure everything works well under docker, and recommend using it for both development and deployment.

Build

First, launch the docker container.

nvidia-docker run --privileged  --net=host --ipc=host -it --rm -v $PWD:/workspace --name=dev registry.baidubce.com/paddlepaddle/paddle:2.1.1-gpu-cuda10.2-cudnn7 /bin/bash

More Paddle docker images can be found here.
If you only want to work on CPU, please download the corresponding image and use docker instead of nvidia-docker.

Build speechx and examples.

pushd /path/to/speechx
./build.sh

Go to examples to have fun.

For more details, please see README.md under examples.

Valgrind (Optional)

If using docker, please check that --privileged is set when running docker run.

Fatal error at startup: "a function redirection which is mandatory for this platform-tool combination cannot be set up". To fix it, install libc6-dbg:

apt-get install libc6-dbg

Install

pushd tools
./setup_valgrind.sh
popd

TODO

DecibelNormalizer: there is a small difference between offline and online dB normalization. The online dB norm reads the feature chunk by chunk, so the feature size it sees differs from that of the offline dB norm. In normalizer.cc:73, samples.size() is therefore different, which causes the difference in the results.
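
The effect can be illustrated with a minimal Python sketch. It is not the code in normalizer.cc: it assumes a simplified dB normalization that scales the samples so that their RMS level matches a target, and the function names, the target_db value, and the test signal are all hypothetical. Because the mean square is divided by the number of samples the function sees, normalizing chunk by chunk yields a different gain per chunk than normalizing the whole utterance at once.

```python
import math


def db_normalize(samples, target_db=-20.0):
    """Scale samples so that their RMS level equals target_db (simplified sketch).

    Assumes at least one non-zero sample; silent input is not handled here.
    """
    # The mean square depends on how many samples are visible in this call.
    mean_square = sum(s * s for s in samples) / len(samples)
    rms_db = 10.0 * math.log10(mean_square)
    gain = 10.0 ** ((target_db - rms_db) / 20.0)
    return [s * gain for s in samples]


def db_normalize_chunked(samples, chunk_size, target_db=-20.0):
    """Apply db_normalize independently to each chunk, as an online reader would."""
    out = []
    for start in range(0, len(samples), chunk_size):
        out.extend(db_normalize(samples[start:start + chunk_size], target_db))
    return out


# Hypothetical signal: a quiet first half followed by a loud second half.
signal = [0.01] * 4 + [0.5] * 4

offline = db_normalize(signal)                       # one gain for all 8 samples
online = db_normalize_chunked(signal, chunk_size=4)  # one gain per 4-sample chunk

max_diff = max(abs(a - b) for a, b in zip(offline, online))
print("max difference between offline and online:", max_diff)
```

In this sketch the two results agree only when the chunk covers the whole input, which suggests that an online normalizer would need statistics accumulated over the full utterance to match the offline result.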