Looking for a simple server health monitoring/alert system.

I had a runaway process that wrote 100GB+ of logfiles in an hour or two and completely filled the disk, causing all kinds of processes to die, including my lightning node.

Use-case: alerts on changes in disk activity/cpu load/network traffic, maybe some other bits and bobs.
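
For a rough idea of what the bare minimum of such an alert could look like, here is a small sketch using only the Python standard library. The thresholds and the function names (`evaluate`, `read_and_evaluate`) are made up for illustration, the load check relies on `os.getloadavg()` and so only works on Unix-like systems, and it would need to be run from cron or a timer and wired to some notification channel to be useful.

```python
import os
import shutil

# Hypothetical thresholds; tune these for your own server.
DISK_USED_LIMIT = 0.90     # alert when the disk is more than 90% full
LOAD_PER_CPU_LIMIT = 2.0   # alert when 1-minute load exceeds 2x the CPU count


def evaluate(disk_used_fraction, load_1min, cpu_count):
    """Return a list of alert messages for the given readings."""
    alerts = []
    if disk_used_fraction > DISK_USED_LIMIT:
        alerts.append(f"disk {disk_used_fraction:.0%} full")
    if load_1min > LOAD_PER_CPU_LIMIT * cpu_count:
        alerts.append(f"load {load_1min:.2f} on {cpu_count} CPUs")
    return alerts


def read_and_evaluate(path="/"):
    """Read current disk usage and load average, then check them."""
    usage = shutil.disk_usage(path)
    load_1min = os.getloadavg()[0]  # Unix only
    return evaluate(usage.used / usage.total, load_1min, os.cpu_count() or 1)


if __name__ == "__main__":
    for message in read_and_evaluate():
        print("ALERT:", message)
```

A check like this only catches a disk that is already nearly full; spotting a runaway log writer early would also need the rate of change, which is the kind of thing a dedicated monitoring tool tracks for you.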

Discussion

netdata

+1 for netdata

i had this one thing once that had a really pretty, live updating web interface on it. maybe ... yeah i think nostr:nprofile1qqsx8lnrrrw9skpulctgzruxm5y7rzlaw64tcf9qpqww9pt0xvzsfmgpr9mhxue69uhhyetvv9ujuumwdae8gtnnda3kjctv9uqsuamnwvaz7tmwdaejumr0dshsz9thwden5te0wfjkccte9ejxzmt4wvhxjme0y60vf7 is right, netdata. looks like what i remembered having on it.

Nagios

He said simple 🤣🤣🤣

Maybe NetData or dead simple Ward.

If you like Python, there is always

https://nicolargo.github.io/glances/

I only have experience with nagios 🥹 Been using it for the better part of a decade.

I went from Nagios to Zabbix, both pretty time-consuming. Cloud providers have their own CloudWatch or equivalent, so I use something like Uptime Kuma for remote checks. At home I use Glances now. 🤙

Nagios is good. But I was thinking that for just a few servers, he needs a rifle, not a howitzer 🤣

In this case +1 netdata

Prometheus, Prometheus node_explorer and Grafana for visualization.

s/node_explorer/node_exporter/

Thanks a lot for the sats, but if you want something more professional, try Prometheus + Alertmanager + Grafana; it's a sure win.

Uptime Kuma

Oh nvm, somehow I missed your last sentence. Uptime Kuma is mainly interesting for very simple server checks