Network monitoring

Optimizing UPS power use when the power goes out

This post is mostly for myself. I’m just putting my Sunday morning thoughts down for later review and consideration. When selecting replacement batteries for my Eaton 5PX UPS (5PX2200RT & 5PXEBM48RT), I began to think about: when the power goes off, why can’t I keep using my laptop? Usually, I still have an Internet connection: […]

Nagios 3: moving from nrpe 3 to nrpe 4 – what needs changing?

Yesterday, I noticed this message in one of the “daily security run output” emails which a FreeBSD host can send out. I’ve used net-mgmt/nrpe3 for several years. It checks remote hosts and runs any number of predetermined commands and returns the results. It’s stable, highly configurable, and just keeps running. I had a look at the

Listen queue overflow

The R720 is showing a message like this from time to time:
Jan 1 07:42:20 r720-01 kernel: sonewconn: pcb 0xfffff835e785d5b8: Listen queue overflow: 8 already in queue awaiting acceptance (1 occurrences)
Jan 1 08:02:21 r720-01 syslogd: last message repeated 1 times
Jan 1 08:27:22 r720-01 kernel: sonewconn: pcb 0xfffff835e785d5b8: Listen queue overflow: 8 already in

Munin

This is an old post I wrote, but never published, back in 2010. I’ve started using Munin for some statistical monitoring. Using the hddtemp_smartctl plugin, I was getting some permission errors. After printing the output of the command, I noticed these in the logs: 2010/03/11-18:30:05 [60845] [ERROR] Command /usr/local/sbin/smartctl -A /dev/ad8 on drive ad8 failed:

Monitoring backups via Nagios and a shell script

Backups are useless without restores. I’ve written a few posts about Nagios, my current monitoring tool of choice. Included with Nagios are a number of plugins and you can even write your own plugins. In this post, I’ll show you a shell script I wrote to make sure my backup files turn up where they

ntp wasn’t running but Nagios didn’t notice

Earlier today, I noticed the following output from a Bacula job:
24-Sep 14:14 bacula-dir JobId 38548: Start Backup JobId 38548, Job=latens_home.2010-09-24_14.12.38_31
24-Sep 14:14 bacula-dir JobId 38548: Using Device “MegaFile-latens”
24-Sep 14:09 latens-fd JobId 38548: DIR and FD clocks differ by -307 seconds, FD automatically compensating.
That’s 5 minutes. It shouldn’t be varying by that much.

NRPE: Unable to read output

After rebooting kraken to take a photo, I found Nagios was displaying an error for my smartmon checks: NRPE: Unable to read output. Running the command by hand on the Nagios server, I found:
$ /usr/local/libexec/nagios/check_nrpe2 -H kraken -c check_smartmon_ad24
NRPE: Unable to read output
But from the remote server I got:
# /usr/local/libexec/nagios/check_smartmon -d
