Network monitoring

Nagios 3: moving from nrpe 3 to nrpe 4 – what needs changing?

Yesterday, I noticed this message in one of the “daily security run output” emails which FreeBSD host can send out. I’ve used net-mgmt/nrpe3 for several years. It checks remote hosts and runs any number of predetermined commands and returns the results. It’s stable, highly configurable, and just keeps running. I had a look at the […]

Nagios 3: moving from nrpe 3 to nrpe 4 – what needs changing? Read More »

Listen queue overflow

The R720 is showing a message like this from time to time: Jan 1 07:42:20 r720-01 kernel: sonewconn: pcb 0xfffff835e785d5b8: Listen queue overflow: 8 already in queue awaiting acceptance (1 occurrences) Jan 1 08:02:21 r720-01 syslogd: last message repeated 1 times Jan 1 08:27:22 r720-01 kernel: sonewconn: pcb 0xfffff835e785d5b8: Listen queue overflow: 8 already in

Listen queue overflow Read More »

Munin

This is an old post I wrote, but never published, back in 2010. I’ve started using Munin for some statistical monitoring. Using the hddtemp_smartctl plugin, I was getting some permission errors. After printing the output of the command, I noticed these in the logs: 2010/03/11-18:30:05 [60845] [ERROR] Command /usr/local/sbin/smartctl -A /dev/ad8 on drive ad8 failed:

Munin Read More »

Monitoring backups via Nagios and a shell script

Backups are useless without restores. I’ve written a few posts about Nagios, my current monitoring tool of choice. Included with Nagios are a number of plugins and you can even write your own plugins. In this post, I’ll show you a shell script I wrote to make sure my backup files turn up where they

Monitoring backups via Nagios and a shell script Read More »

ntp wasn’t running but Nagios didn’t notice

Earlier today, I noticed the following output from a Bacula job: 24-Sep 14:14 bacula-dir JobId 38548: Start Backup JobId 38548, Job=latens_home.2010-09-24_14.12.38_31 24-Sep 14:14 bacula-dir JobId 38548: Using Device “MegaFile-latens” 24-Sep 14:09 latens-fd JobId 38548: DIR and FD clocks differ by -307 seconds, FD automatically compensating. That’s 5 minutes. It shouldn’t be varying by that much.

ntp wasn’t running but Nagios didn’t notice Read More »

NRPE: Unable to read output

After rebooting kraken to take a photo, I found nagios was displaying an error for my smartmon checks: NRPE: Unable to read output. Running the command by hand on the nagios server, I found: $ /usr/local/libexec/nagios/check_nrpe2 -H kraken -c check_smartmon_ad24 NRPE: Unable to read output But from the remote server I got: # /usr/local/libexec/nagios/check_smartmon -d

NRPE: Unable to read output Read More »

Scroll to Top