Yesterday and today, none of my Bacula copy-to-tape jobs completed because my tape drive was missing. Today, I reseated all the cables, and power cycled the tape library.
In my ssh session, I did this:
$ sudo camcontrol devlist <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 2 lun 0 (da0,pass0) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 10 lun 0 (da1,pass1) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 11 lun 0 (da2,pass2) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 12 lun 0 (da3,pass3) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 13 lun 0 (da4,pass4) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 14 lun 0 (da5,pass5) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 16 lun 0 (da6,pass6) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 17 lun 0 (da7,pass7) <TOSHIBA DT01ACA300 MX6OABB0> at scbus5 target 0 lun 0 (ada0,pass8) <TOSHIBA DT01ACA300 MX6OABB0> at scbus5 target 1 lun 0 (ada1,pass9) <COMPAQ MSL5000 Series 0520> at scbus7 target 3 lun 0 (ch0,pass11)
.
There is no tape; we should be able to see sa0.
I did a rescan:
$ sudo camcontrol rescan 7 Re-scan of bus 7 was successful
In /var/log/messages, I saw:
Dec 31 14:49:54 knew kernel: sa0 at sym0 bus 0 scbus7 target 1 lun 0 Dec 31 14:49:54 knew kernel: sa0: <COMPAQ SuperDLT1 5F5F> Removable Sequential Access SCSI-2 device Dec 31 14:49:54 knew kernel: sa0: Serial Number CXB37H0071 Dec 31 14:49:54 knew kernel: sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Rerunning the original command, I see:
$ sudo camcontrol devlist <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 2 lun 0 (da0,pass0) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 10 lun 0 (da1,pass1) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 11 lun 0 (da2,pass2) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 12 lun 0 (da3,pass3) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 13 lun 0 (da4,pass4) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 14 lun 0 (da5,pass5) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 16 lun 0 (da6,pass6) <ATA TOSHIBA DT01ACA3 ABB0> at scbus0 target 17 lun 0 (da7,pass7) <TOSHIBA DT01ACA300 MX6OABB0> at scbus5 target 0 lun 0 (ada0,pass8) <TOSHIBA DT01ACA300 MX6OABB0> at scbus5 target 1 lun 0 (ada1,pass9) <COMPAQ SuperDLT1 5F5F> at scbus7 target 1 lun 0 (pass10,sa0) <COMPAQ MSL5000 Series 0520> at scbus7 target 3 lun 0 (ch0,pass11)
There’s my tape drive, on the 2nd line from the bottom!
Shortly afterwards, Nagios noticed and told me so: ** RECOVERY alert – knew/check_sa0 is OK **