I'm running Ubuntu 16.04 on a tiny device that the techs have placed on the very top of a server rack, it has become unresponsive 3 times in the last 4 days and with nothing logged saying it was shutting off.
The power light stayed on, etc... But I had no indication that the OS was functioning while it was not working (I only have atop and rsyslog actively logging).
I'm going to give them back the device with "sensors" running in a cronjob logging to a file until I can prove this is why it's getting turned off. But, before I do, should I have known that the machine was turning off because of some acpi shutdown trigger that ought to have been logged.
I'm guessing this might be hardware specific, but it seems strange that I'm not getting any sort of trigger from the kernel that it's about to go casters up.
sudo lshw
might be useful? – Elder Geek Mar 14 '17 at 14:02