MQTT - Storing data when disconnected

pjp8w · 22 March 2019 08:56

My use case is for a mobile device that can go in and out of network coverage. I have a constant data feed from sensors and need the data to backfill/send when network is reconnected.
So my MQTT node goes from connected to disconnected to connecting, etc. and eventually back to connected. My QoS is 1.

My only solution to this is to change the MQTT core node to always try to publish when using a QoS greater than 0.

github.com

node-red/node-red/blob/master/packages/node_modules/@node-red/nodes/core/io/10-mqtt.js#L336


        if (Object.keys(sub).length === 0) {
            delete node.subscriptions[topic];
            if (node.connected) {
                node.client.unsubscribe(topic);
            }
        }
    }
};


this.publish = function (msg) {
    if (node.connected) {
        if (msg.payload === null || msg.payload === undefined) {
            msg.payload = "";
        } else if (!Buffer.isBuffer(msg.payload)) {
            if (typeof msg.payload === "object") {
                msg.payload = JSON.stringify(msg.payload);
            } else if (typeof msg.payload !== "string") {
                msg.payload = "" + msg.payload;
            }
        }

Change: if (node.connected) {
To: if (node.connected || msg.qos > 0) {

In the client code, there is a test for connected and the msg will be stored in a map (max size ~65000) if not connected. The map will be flushed and sent once connected again. I've tested going over 65000 in the map and the node does not crash.

I've tried this and it works as a solution. I'm putting it out there to ask if there is an alternative solution though rather than creating my own MQTT node with this small code change.

cymplecy · 22 March 2019 10:18

Could you explain your setup a bit more
Where is NodeRED running on?
Your (fixed) devices or on your mobile device (laptop? tablet? phone?)

TotallyInformation · 22 March 2019 10:22

Whilst the code is small, the logic change is not small.

Personally, I would think that this would have to be an option that could be turned on - defaulting to off since this is a significant change. I think that you would also need options for sizing since it would be impossible to second-guess whether a 65k element array of objects would break any individual deployment of Node-RED.

I also wonder about unintended consequences but I don't know the internals of the Node-RED codebase to comment sensibly.

pjp8w · 22 March 2019 10:32

Node red running on a mobile mini ubuntu PC with a sim card. Sends data to a cloud MQTT service.

TotallyInformation · 22 March 2019 10:36

In that case, there is no problem at all!

Run Mosquitto locally, get Node-RED to write to that and then configure it to sync topics to your remote broker. It should take care of everything.

cymplecy · 22 March 2019 10:49

I don't think any other solution (e.g handshaking each message) is going to be as simple as your code change

pjp8w · 22 March 2019 11:50

Running mosquitto as a bridge with persistence was an option I considered. I'd like to do it all within node-red though. Less moving parts.

TotallyInformation · 22 March 2019 13:09

A point. Though I would counter that by saying that I think trying to handle such a caching process in Node-RED is very fragile. There are too many variables involved that could result in consequences such as running out of RAM.

While the code change works in your environment and with your data size/throughput, you couldn't say that of other people's configurations.

paul404 · 23 March 2019 10:10

So the Node-RED mqtt node has made qos 1 and 2 redundant by having a connection check around publish. The underlying mqtt client lib already manages messages correctly based on qos. In fact the mqtt client can be configured with a storage policy. The default policy is es6-map.

If Node-RED is running in a constrained environment where qos>0 can not be supported. Then I would expect qos to be set to 0 in that environment.

Colin · 23 March 2019 20:10

I don't think that is completely correct, in what way is qos 2, in particular, redundant?

knolleary · 23 March 2019 23:41

This discussion comes up on occasion and is something we need to improve with the MQTT nodes. The connected/disconnect events you can get from the status node do help to create a flow that can redirect messages when the connection is down. But it is very far from perfect - there is a window when the client doesn't yet know it is disconnected when it will buffer the messages internally, but they aren't persisted so will be lost if NR restarts. It is also actually quite hard to build a flow that does the right thing in all cases.

When the nodes were first written, the mqtt client library we used didn't provide much in this area. But it has moved on a long way and we haven't taken advantage of what it can provide.

I definitely think there is a place to have some options exposed on the broker node as to how it should handle offline messages. We need to distill it down to the core set of configuration properties so we can provide enough flexibility without overwhelming the user with choice. We need to be conscious of the different environments we run in - for example, running in IBM Cloud you don't have a persisted file system so can rely on that for storage. Quite how we handle those sorts of cases I don't know - but maybe we don't try to solve it for every case.

If someone wants to take a stab and listing what options it needs to expose - and doing so with awareness of what the underlying client is capable of - that would help to move this forward.

paul404 · 24 March 2019 16:02

That makes sense as to why the connection is check before the underlying client publish function is called.

The underlying client has moved on and allows for a storage mechanism to set and manages messages based on the qos. For qos 0 messages are never stored. For qos 1 and 2 the message are stored.

If the mqtt node removed the connection check on publish and allowed for the configuration of the storage mechanism, then most uses cases could be meet.

pjp8w · 25 March 2019 12:57

Thanks Nick for that clear explanation. Makes perfect sense.
I'm going to go down the road of creating my own node for our use-case/hardware in the meantime.

knolleary · 25 March 2019 15:11

Fair enough. It would be nice if someone with this need picked up the task of helping improve the core nodes rather than create another fork.

mako · 20 July 2019 13:21

Not sure if this is a solution.

Works a treat.

Topic		Replies	Views
Issues in persisting messages in MQTT Broker using NodeRed MQTT Nodes General	26	5632	10 March 2022
MQTT Caching in the event of a WAN failure General	28	3007	6 April 2020
MQTT node with disconnection - Node-RED General	6	1716	12 September 2020
MQTT data loss at reconnection General	20	3266	6 October 2021
MQTT data persistence not possible due to automatic unsubscribe before disconnect General mqtt	9	1036	9 May 2022

MQTT - Storing data when disconnected

Related topics