I have a lab setup with 16 HP Z620 systems, all alike (purchased at the same time), with exactly the same Ubuntu 12.04 installation with current kernel 3.13.0-44-generic. Well, not quite all alike: 15 of these have BIOS version J61 v03.06, and the 16th has BIOS version J61 v03.18. All have static IP address with network-manager, avahi-daemon, and cups-browsed disabled.
The bizarre thing is that the 15 systems show load averages much less than 1 (as I write this, uptime shows a load average of 0.00), but the 16th system always shows a load average of 1.00 or above. Here's a top snapshot:
top - 13:13:04 up 25 min, 3 users, load average: 1.00, 1.03, 0.91
Tasks: 203 total, 1 running, 202 sleeping, 0 stopped, 0 zombie
%Cpu(s): 0.9 us, 0.3 sy, 0.0 ni, 97.5 id, 1.3 wa, 0.0 hi, 0.0 si, 0.0 st
KiB Mem: 12232332 total, 1583716 used, 10648616 free, 63148 buffers
KiB Swap: 12505084 total, 0 used, 12505084 free. 626708 cached Mem
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1 root 20 0 33772 3024 1468 S 0.0 0.0 0:00.79 init
2 root 20 0 0 0 0 S 0.0 0.0 0:00.00 kthreadd
3 root 20 0 0 0 0 S 0.0 0.0 0:00.10 ksoftirqd/0
4 root 20 0 0 0 0 S 0.0 0.0 0:00.00 kworker/0:0
5 root 0 -20 0 0 0 S 0.0 0.0 0:00.00 kworker/0:+
7 root 20 0 0 0 0 S 0.0 0.0 0:01.64 rcu_sched
8 root 20 0 0 0 0 S 0.0 0.0 0:00.28 rcuos/0
9 root 20 0 0 0 0 S 0.0 0.0 0:00.23 rcuos/1
10 root 20 0 0 0 0 S 0.0 0.0 0:00.20 rcuos/2
11 root 20 0 0 0 0 S 0.0 0.0 0:01.95 rcuos/3
12 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcu_bh
13 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcuob/0
14 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcuob/1
15 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcuob/2
16 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcuob/3
I'm baffled as to why the load average on this one box is always 1.00 or above. Any suggestions?
BTW, I upgraded the BIOS on system 16 to version 3.85, but this didn't change anything. I also installed Ubuntu 14.04, but I still get the same behavior.
1 0 0 147184 197576 10454836 0 0 30 17 42 72 0 0 100 0 0 – Timothy Fossum Jan 31 '15 at 19:53
ps -e -o ... | grep ^R
command produces no output, repeatedly.vmstat 10 10
produces lines with zeros in theb
column and mostly zeros in ther
column. System 16 was loaded with the same software as the other 15 systems, and has the same usage pattern. How can this system always report a load average at or above 1.00 when the others report load averages near zero? – Timothy Fossum Feb 01 '15 at 20:03ps ax
entry:43 ? S 0.00 [khubd] ...
, but the "abnormal" system has43 ? D 3.09 [khubd] ...
. I have no idea why this didn't turn up earlier when I looked forD
entries. So now, how can I get rid of this?? – Timothy Fossum Feb 02 '15 at 00:20