Speech to text in NodeRed, undesired outputs and confused on inputs

Hey, I am pretty new to NodeRed, and I am trying to build in Speech to text functionality. Unfortunately, it seems i have little clue as to the inputs and outputs that I should be using and then getting.


This is the flow that I have right now, I am converting audio bytes that come in arrays into Base64 strings, as per the documentation of the Google STT node node-red-contrib-google-stt (node) - Node-RED,

I see in other flows that other people are probably using "node-red-contrib-wstt-stream-fixed" for their STT node, but I can't download that one

I am in NodeRed v4.0.2

Thanks for your time and attention guys

nvm, fixed lmao
idk what changed, but i tried something I did before, which was to change the input type to "Content" and then also change the input format to "Linear16", and then it worked. Wish i knew why it didn't work earlier, but eh

haha, now i REALLY wish i knew why it didn't work earlier, because it stopped working after I changed my NodeRed environment

You could try to use Rhasspy that has local speech to text engine and that is not bad at all. I am running Rhaspy on Raspberry along with Node-Red. It recognizes activation keyword (in my case Alexa), then recognizes the intent and then the intent is sent to Node-Red where I am constructing feedback TTS and of course the implementation of the received command..

1 Like

I considered this too, and I wanted to try this so I spent a few hours trying to figure it out, I was on a bit of a crunch so I gave up. If I ever revisit this project, I'll def give this a more fair shot.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.