Hey, I am pretty new to NodeRed, and I am trying to build in Speech to text functionality. Unfortunately, it seems i have little clue as to the inputs and outputs that I should be using and then getting.
This is the flow that I have right now, I am converting audio bytes that come in arrays into Base64 strings, as per the documentation of the Google STT node node-red-contrib-google-stt (node) - Node-RED,
nvm, fixed lmao
idk what changed, but i tried something I did before, which was to change the input type to "Content" and then also change the input format to "Linear16", and then it worked. Wish i knew why it didn't work earlier, but eh
You could try to use Rhasspy that has local speech to text engine and that is not bad at all. I am running Rhaspy on Raspberry along with Node-Red. It recognizes activation keyword (in my case Alexa), then recognizes the intent and then the intent is sent to Node-Red where I am constructing feedback TTS and of course the implementation of the received command..
I considered this too, and I wanted to try this so I spent a few hours trying to figure it out, I was on a bit of a crunch so I gave up. If I ever revisit this project, I'll def give this a more fair shot.