This is an implementation of voice activity detection based on Google's WebRTC.
A VAD (<https://en.wikipedia.org/wiki/Voice_activity_detection>) classifies a piece of audio data as being voiced or unvoiced. It can be useful for telephony and speech recognition. The VAD that Google developed for the WebRTC project (<https://webrtc.org/>) is reportedly one of the best available, being fast, modern and free.
Install all required packages with `pip install -r requirements.txt`.
Check that C++ library support is available on your PC.
Put your raw audio file into the webrtcvad folder, then run the VAD with `python vad.py <aggressiveness level (0/1/2/3)> <frame length (10/20/30)> <padding duration (100-5000 recommended, depending on your purpose)> <filepath>`.
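For reference, the frame-level classification that `vad.py` builds on is provided by the py-webrtcvad package. The sketch below shows the core API call on 30 ms frames of 16-bit mono PCM; the file name and frame handling are illustrative assumptions, not the repo's actual code.

```python
import wave

import webrtcvad

# Minimal sketch of frame-level classification with py-webrtcvad.
# Assumes a 16-bit *mono* WAV at 8000/16000/32000 Hz; the file name
# below is hypothetical. vad.py adds chunking and padding on top.
vad = webrtcvad.Vad(0)  # aggressiveness mode 0 (least aggressive) to 3

with wave.open("example_mono_16000.wav", "rb") as wf:
    sample_rate = wf.getframerate()
    pcm = wf.readframes(wf.getnframes())

frame_ms = 30  # 10, 20, or 30 ms frames are accepted
frame_bytes = int(sample_rate * frame_ms / 1000) * 2  # 2 bytes per sample

for offset in range(0, len(pcm) - frame_bytes + 1, frame_bytes):
    frame = pcm[offset:offset + frame_bytes]
    print("voiced" if vad.is_speech(frame, sample_rate) else "unvoiced")
```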
- Multichannel audio is accepted.
- Sample rates of 8000 Hz, 16000 Hz, and 32000 Hz are accepted. If your sample rate is not among these three, resample your file with `resample.py` (see the resampling sketch after this list).
- `.pcm` and `.wav` files are accepted. Convert any other format before running the VAD.
- Choose your aggressiveness level, frame duration, and padding duration carefully; they strongly affect the final result.
- One second of silence is attached at the beginning and the end of each chunk file. For a different length, adjust the parameter of the `padding` function in `vad.py` (see the padding sketch after this list).
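For reference, resampling to one of the accepted rates can be done along these lines. This is a sketch using scipy; the actual `resample.py` in this repo may be implemented differently, and the file names are placeholders.

```python
from math import gcd

import numpy as np
from scipy.io import wavfile
from scipy.signal import resample_poly

# Sketch: resample a WAV file to 16000 Hz so the VAD accepts it.
# resample.py in this repo may differ; file names are placeholders.
target_rate = 16000
src_rate, data = wavfile.read("input_44100.wav")

g = gcd(target_rate, src_rate)
resampled = resample_poly(data, target_rate // g, src_rate // g, axis=0)

wavfile.write("input_16000.wav", target_rate,
              np.round(resampled).astype(np.int16))
```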
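The one-second silence attached to each chunk could be produced along the lines of the sketch below; this is only illustrative, and the actual `padding` code in `vad.py` may differ.

```python
def pad_with_silence(chunk: bytes, sample_rate: int, channels: int = 1,
                     pad_seconds: float = 1.0) -> bytes:
    """Prepend and append `pad_seconds` of silence to 16-bit PCM bytes.

    Sketch only: the real padding logic in vad.py may differ. Change
    pad_seconds to get a different silence length.
    """
    silence = b"\x00" * (int(sample_rate * pad_seconds) * 2 * channels)
    return silence + chunk + silence
```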
`king_32000.wav` is a 2-channel, 32000 Hz sample rate audio file. Run the VAD with `python vad.py 0 30 500 king_32000.wav`. Ten parts are separated out from the original file. For noise parts mistakenly recognized as "voiced", we can either throw them away manually or filter them out with additional code (see the sketch below).
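One simple way to filter out mistakenly voiced noise chunks is an RMS energy threshold, as in the sketch below; the chunk file pattern and threshold value are assumptions to be adapted to your own output.

```python
import glob
import os
import wave

import numpy as np

# Sketch: delete chunk files whose RMS energy is below a threshold.
# The glob pattern and threshold value are illustrative assumptions.
RMS_THRESHOLD = 500.0  # tune this for your recordings

for path in sorted(glob.glob("chunk-*.wav")):
    with wave.open(path, "rb") as wf:
        samples = np.frombuffer(wf.readframes(wf.getnframes()), dtype=np.int16)
    rms = float(np.sqrt(np.mean(samples.astype(np.float64) ** 2)))
    if rms < RMS_THRESHOLD:
        os.remove(path)  # or move it to a "rejected" folder instead
        print(f"removed low-energy chunk: {path} (rms={rms:.1f})")
```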