I have a multi-server setup 18 machines. LAPI-server with mariadb.
Since the latest update to v1.6.3-rpm-pragmatic-amd64-4851945a (all machines are on the same version, also may have happened in the previous version, but I just noticed it now), I often see some of them in ‘cscli machines list’ with a heartbeat of serveral hours behind. This clears within some minutes, but keeps happening again and again.
Until now I could not find any pattern for this behaviour. It look like all machines are affected.
Just now:
2024-10-11T07:52:00Z v1.6.3-rpm-pragmatic-amd64-4851945a Red Hat Enterprise Linux/8.10 password 6h30m17s
and about a minute later:
2024-10-11T07:53:59Z v1.6.3-rpm-pragmatic-amd64-4851945a Red Hat Enterprise Linux/8.10 password 15s
Also the version-string changes often from ‘v1.6.3-rpm-pragmatic-amd64-4851945a’ to ‘v1.6.3-rpm-pragmatic-amd64-4851945a-linux’ and back.
Now that you are asking… I have indeed identified one clone of VM where still a crowdsec-firewall-bouncer.service was active/running and is now deactivated. For the rest I am not aware that I have any other clones running, but I will check.
But the old heartbeats still show up on machines where I am sure I never had any clones running.
More data: I started checking cscli machines list every 30 seconds and the heartbeat of all the machines is shown as old every 30 minutes.
Next I will have a look at the incoming packets from one of the machines.