OK Question for the sysadmins / homelab nerds out there.

So I'm at a point where Ive go so much local and remote stuff running that I really would like to have a way to get system stats and set up alerts for events like some process in systemd crashing or traffic / resource use spikes.

I know of Nagios and Ive been looking over the docs for Zabbix as well. I guess I'm just wondering if there are any other systems that people recommend before I get to deep into one system.

Reply to this note

Please Login to reply.

Discussion

Interested in this as well. Right now, I still do everything manually like a fucking amateur

I have a half assed grafana set up with Prometheus and some exporters. Some work, some don't.

Gotta be a better way.

I like using influxdb, telegraf, grafana..

https://www.influxdata.com/time-series-platform/telegraf/

Ill take a look! Thanks.

Prometheus might be another one to look into. I’m looking into this as well.

I have grafana / Prometheus set up and doing some stuff localy. But I keep fighting exporters. Might have to do a deeper look at Prometheus though.