Command-line tools for speech and intent recognition on Linux
voice2json is a collection of command-line tools for offline speech/intent recognition on Linux. It is free, open source (MIT), and supports 18 human languages.
From the command line:
$ voice2json -p en transcribe-wav \
      < turn-on-the-light.wav | \
  voice2json -p en recognize-intent | \
  jq .
produces a JSON event like:
{
  "text": "turn on the light",
  "intent": {
    "name": "LightState"
  },
  "slots": {
    "state": "on"
  }
}
when trained with this template:
[LightState]
states = (on | off)
turn (<states>){state} [the] light
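To see what this template covers, the sketch below enumerates by hand the sentences it appears to accept, assuming that (on | off) lists alternatives and that [the] marks an optional word, as the example event suggests. The function name expand_light_state is hypothetical, and this is only an illustration, not how voice2json itself expands templates.

```sh
# Enumerate by hand the sentences the [LightState] template appears to accept.
# Illustration only: voice2json does not work this way internally.
expand_light_state() {
  for state in on off; do            # states = (on | off)
    for article in "the " ""; do     # [the] is optional
      printf 'turn %s %slight\n' "$state" "$article"
    done
  done
}

expand_light_state
```

This prints four sentences: "turn on the light", "turn on light", "turn off the light", and "turn off light". In each of them, the word matched by the states rule is what ends up in the "state" slot of the JSON event.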
voice2json is optimized for:
It can be used to:
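Supported speech to text systems include Kaldi, Pocketsphinx, DeepSpeech, and Julius, as reflected in the profile names below.

Supported languages, with their codes and, where listed, named profiles:
Catalan (ca)
Czech (cs)
German (de)
Greek (el)
English (en)
Spanish (es)
French (fr)
Hindi (hi)
Italian (it)
Korean (ko)
Kazakh (kz)
Dutch (nl)
    nl_kaldi-cgn (default)
    nl_kaldi-rhasspy
    nl_pocketsphinx-cmu
Polish (pl)
    pl_deepspeech-jaco (default)
    pl_julius-github
Portuguese (pt)
Russian (ru)
    ru_kaldi-rhasspy (default)
    ru_pocketsphinx-cmu
Swedish (sv)
    sv_kaldi-montreal
    sv_kaldi-rhasspy (default)
Vietnamese (vi)
Chinese (zh)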
voice2json is more than just a wrapper around open source speech to text systems!
With voice2json’s templating language, you get more than just transcriptions for free.
The --open option is for unrestricted speech to text.
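To make "more than just transcriptions" concrete, the sketch below acts on the intent name and slot value from the LightState event shown earlier instead of on the raw text. The handle_intent function and its "light -> on" messages are hypothetical placeholders for whatever your application does. The event is pasted in as a literal so the example runs without voice2json installed, and jq is only used if it is available.

```sh
# Hypothetical handler: map a recognized intent and slot value to an action.
handle_intent() {
  intent_name=$1
  state=$2
  case "$intent_name:$state" in
    LightState:on)  echo "light -> on" ;;
    LightState:off) echo "light -> off" ;;
    *)              echo "unhandled intent: $intent_name" >&2; return 1 ;;
  esac
}

# Sample event copied from the example output above (not produced live here).
event='{"text": "turn on the light", "intent": {"name": "LightState"}, "slots": {"state": "on"}}'

# Extract the fields with jq, which the example pipeline already uses, if present.
if command -v jq >/dev/null 2>&1; then
  handle_intent "$(printf '%s' "$event" | jq -r '.intent.name')" \
                "$(printf '%s' "$event" | jq -r '.slots.state')"
fi
```

In a real pipeline, the literal event would presumably be replaced by the output of voice2json recognize-intent. Dispatching on the intent name and slot keeps the handler independent of the exact wording the speaker used.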