A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.
A speech-to-text bot for Discord written in Node.js.
YouTube: https://www.youtube.com/watch?v=IKIlnaCDZcI
Try the bot for yourself on our Discord server: https://discord.gg/ApdTMG9
In your Discord Developers Bot settings, you need to enable these intents:
You need Node.js version 16 or later with npm on your machine. Run node -v to check your version.
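The version requirement can also be checked from inside Node itself. The following is a minimal sketch; the helper name isSupportedNode is hypothetical and not part of the bot.

```javascript
// Return true when the given Node.js version string meets the minimum major
// version. Defaults to the running interpreter and the README's stated
// minimum of 16. The function name is hypothetical, for illustration only.
function isSupportedNode(version = process.versions.node, minMajor = 16) {
  const major = Number.parseInt(version.split('.')[0], 10);
  return Number.isInteger(major) && major >= minMajor;
}

if (!isSupportedNode()) {
  console.error(`Node.js 16 or later is required, found ${process.versions.node}`);
}
```

Calling isSupportedNode() with no arguments inspects the running interpreter, so the check could sit at the top of a startup script.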
Execute the following commands:
git clone https://github.com/healzer/DiscordEarsBot.git
cd DiscordEarsBot
npm install
Provide the Discord API token using the DISCORD_TOK environment variable or in settings.json.
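As an illustration of the two ways to supply the token, here is a minimal sketch of a lookup that prefers the environment variable and falls back to settings.json. The key name DISCORD_TOK inside settings.json, the precedence order, and the function name resolveDiscordToken are assumptions for this example, not a description of the bot's own loader.

```javascript
const fs = require('fs');

// Resolve the Discord token from the environment first, then from a JSON
// settings file. Assumptions: settings.json stores the token under a
// DISCORD_TOK key, and the environment variable takes precedence.
function resolveDiscordToken(env = process.env, settingsPath = 'settings.json') {
  if (env.DISCORD_TOK) {
    return env.DISCORD_TOK;
  }
  if (fs.existsSync(settingsPath)) {
    const settings = JSON.parse(fs.readFileSync(settingsPath, 'utf8'));
    if (settings.DISCORD_TOK) {
      return settings.DISCORD_TOK;
    }
  }
  throw new Error('No Discord token found: set DISCORD_TOK or add it to settings.json');
}
```

Failing loudly when neither source provides a token tends to be easier to debug than a login error from Discord later on.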
Finally, run node index.js. You can also use pm2 or nodemon to keep the bot running 24/7.
If you don’t have a Linux server or machine, you can use a hosting platform such as Heroku to run your bot 24/7.
By now you have a Discord server, and DiscordEarsBot is running and is part of your server. Make sure your server has a text channel and a voice channel.
Type *join and the bot will join the voice channel.
Type *leave to make the bot leave the voice channel.
Type *help for a list of commands.

YouTube comparison and tutorial for developers on choosing the right Speech API: https://www.youtube.com/watch?v=fQcEZIgw_LA
This is our default Speech-to-Text method. The Vosk API is a free and open-source solution that runs locally (offline). By default only English is enabled. Developers can change or add language models from here: https://alphacephei.com/vosk/models
Installation:
witai
WitAI supports over 120 languages (https://wit.ai/faq), but only one language can be used at a time.
If you’re not speaking English on Discord, then change your default language on WitAI under “settings” for your app.
You can also change the language using the following bot command: *lang <code>, where <code> is an ISO 639-1 language code (two letters): https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes
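To make the expected shape of the command concrete, here is a small sketch of how a *lang message could be parsed and checked. The function name parseLangCommand is hypothetical, and the check only confirms that the code is two letters, not that it is an assigned ISO 639-1 code or one WitAI supports.

```javascript
// Parse a chat message of the form "*lang <code>" and return the lowercase
// two-letter code, or null when the message is not a well-formed *lang
// command. Hypothetical helper: this is not the bot's actual command handler.
function parseLangCommand(message) {
  const match = /^\*lang\s+([A-Za-z]{2})$/.exec(message.trim());
  return match ? match[1].toLowerCase() : null;
}
```

For example, parseLangCommand('*lang nl') yields 'nl', while '*lang eng' is rejected because three-letter codes belong to ISO 639-2, not ISO 639-1.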
You can use Google’s Speech-to-Text API as follows:
google
In index.js, inside the function transcribe_gspeech, change the value of languageCode.
gspeech_key.json
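For orientation, the sketch below shows where languageCode sits in a request object shaped for the Google Cloud Speech-to-Text recognize call. It is not the body of transcribe_gspeech: the function name buildRecognizeRequest is hypothetical, and the encoding and sample rate are assumed values that must match the audio actually being sent.

```javascript
// Build a request object in the shape used by Google Cloud Speech-to-Text
// (an `audio` part plus a `config` part). Assumptions for this sketch:
// 16-bit linear PCM at 48 kHz; adjust both to match the real audio stream.
function buildRecognizeRequest(audioBuffer, languageCode = 'en-US') {
  return {
    audio: { content: audioBuffer.toString('base64') },
    config: {
      encoding: 'LINEAR16',
      sampleRateHertz: 48000,
      languageCode, // BCP-47 tag such as 'en-US', 'nl-NL' or 'fr-FR'
    },
  };
}
```

Note that Google expects BCP-47 tags with a region (for example nl-NL), unlike the two-letter ISO 639-1 codes used for WitAI.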
Using Mozilla DeepSpeech for speech recognition, tutorial.
For enquiries or issues get in touch with me:
Name: Ilya Nevolin
Email: [email protected]
Discord: https://discord.gg/ApdTMG9