Docker-based PocketSphinx setup that takes an A/V input file, creates text from the audio, and extracts keywords and entities from the results.
Once SETUP is done, it lets you run speech-to-text and text analysis on an A/V file locally on a Mac, with the network disabled and no cloud services involved.
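One way to check the "network disabled" claim after the image has been built is to start the container with Docker's `--network none` option. The following is a minimal sketch, not part of this repository: the helper name `transcribe_offline` and the `A2T_RUN` variable are hypothetical, and it assumes the `audio2text` image needs no network access at run time.

```shell
# Hypothetical helper, not part of the repository.
# A2T_RUN holds the container command; it can be overridden, for example
# to try the helper without Docker. "--network none" is an assumption
# that the image needs no network access at run time.
A2T_RUN="${A2T_RUN:-docker run --rm -i --network none audio2text}"

# transcribe_offline FILE [OUTDIR]
# Feeds FILE to the container on stdin and unpacks the tar stream the
# container writes to stdout into OUTDIR (default: current directory).
transcribe_offline() {
  input="$1"
  outdir="${2:-.}"
  [ -r "$input" ] || { echo "cannot read: $input" >&2; return 1; }
  mkdir -p "$outdir" || return 1
  # $A2T_RUN is intentionally unquoted so it splits into command + arguments.
  $A2T_RUN < "$input" | tar xf - -C "$outdir"
}
```

With the image built, `transcribe_offline test.mp3 results` would unpack whatever the container produces into `results/`. If the container fails under `--network none`, that suggests it does need network access after all.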
Docker (eg: )
git (brew or Xcode setups have you covered ;-)
git clone
cd audio2text
docker build -t audio2text .
( docker run --rm -i audio2text |tar xf - ) < test.mp3
will make (click each to see the results):