Help with simple OCR of two-digit numbers

grant1 · 21 June 2023 17:20

Hello and thanks in advance for anyone's help here...

See these 3 photos of my setup and flow: https://photos.app.goo.gl/g8XBTwsx5xYe1NXc6

I have set up this OCR node to read a two-digit value via a webcam. I convert it to grayscale and change the contrast, but after about 15 seconds of the yellow dot "processing", the OCR node "stalls out" whereby I get the red message as shown ("Lost connection to server, reconnecting...") and I never get the output of the node.

My understanding is the OCR node goes online and uses the tesseract library.

Is there a better way to approach this problem? FWIW, the digits that I am reading are always ranging from about 25 to 55.

ralphwetzel · 21 June 2023 17:27

This could indicate that this node raised an error that lead to a termination of the Node-RED (client or) server.
Have you checked the two consoles (in your browser & of the server) for any issues?

grant1 · 21 June 2023 17:38

Thanks for the tip. There are indeed a host of errors being thrown by that node. It seems the help file for the node did not mention any of these parameters as being required inputs.

(sorry for the screenshot instead of the error code copied/pasted)

ralphwetzel · 21 June 2023 18:09

Looking at the Tesseract repo, no additional parameters should be necessary for a successful recognition.

There's a relevant remark regarding supported image format / data types:

Note: images must be a supported image format and a supported data type. For example, a buffer containing a png image is supported. A buffer containing raw pixel data is not supported.

You could check that your image data buffer fulfills this demand...

grant1 · 21 June 2023 18:34

Thank you @ralphwetzel

I changed to PNG with output as buffer and it works. Will continue testing / tweaking to make sure I can cover all the two-digit combinations.

grant1 · 23 June 2023 02:38

After a few days of experimenting, the ability of the Tesseract.js to correctly perform seven segment optical character recognition is not reliable or repeatable. So I went back to searching and found some interesting attempts & technologies, but the best was a standalone program called SSOCR (seven segment optical character recognition). After using the standard image cleanup tools available, it has done a very nice job of interpreting the numbers. My test flow is shown below, where you can see I needed to trim the output and convert from string to number. I will incorporate this into the overall flow whereby the image is grabbed via the webcam, cleaned up, saved to the hard drive, and then read by SSOCR via the Exec node.

syoma755 · 23 June 2023 04:51

You probably could use some AI-based image recognition node, which you should train on your 7-segment indicator images. You don't have so many combinations, so this shall be possible. Basically AI will just compare webcam picture with stored image.
See Image analysis and comparison - #5 by Jotarod

system · 7 July 2023 04:52

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Extract numbers from image General	15	3380	7 June 2021
Advice for creating first node Developing Nodes	4	225	22 August 2023
Tesseract - How to send input in tesseract node when we have to read data from Image General	2	448	19 September 2020
[ANNOUNCE] node-red-contrib-plate-recognizer Share Your Nodes	41	3086	23 April 2020
Uploading picture but failed to get output General http-request	2	296	15 November 2021

Help with simple OCR of two-digit numbers

Related topics