VirtualBox

Opened 10 years ago

Closed 7 years ago

#13762 closed defect (obsolete)

"Not syncing: An NMI occurred" kernel panic

Reported by: Fiorangelo Peluso Owned by:
Component: guest control Version: VirtualBox 4.3.20
Keywords: kernel panic vboxdrv Cc:
Guest type: Linux Host type: all

Description

Hi all, we're experiencing several kernel panics with vboxdrv on a brand new HP ProLiant DL560 G8. We currently have VirtualBox 4.3.20 installed, with about 15-20 VMs running both Windows and Linux. Attached you'll find screenshots of the panic errors captured from the iLO Remote Console. We also tried reinstalling different x86_64 Linux distributions (CentOS 6.6, Oracle Linux 6.6, Red Hat 6.5) and different VirtualBox packages (4.3.10, 4.3.18, 4.3.20), but the panics still occurred. The panic is random and we're unable to reproduce it on demand. It occurs after a few days of activity, and we would be very reluctant to give up on this. Please let us know if any further info is needed. Thanks in advance.

Attachments (15)

vbox_err_3.JPG (88.4 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_centos.JPG (93.4 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_oracle.JPG (102.9 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_oracle_2.JPG (85.2 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_oracle_3.JPG (98.9 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_oracle_4.JPG (92.8 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_oracle_5.JPG (79.4 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_redhat6.5_1.png (97.0 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
vbox_err_redhat6.5_2.PNG (104.6 KB) - added by Fiorangelo Peluso 10 years ago.
Kernel Panic Screenshot
cpuinfo (56.8 KB) - added by Fiorangelo Peluso 10 years ago.
/proc/cpuinfo file
vbox_logs.tar.gz (48.2 KB) - added by Fiorangelo Peluso 10 years ago.
VBox.log files of one VM
PD Project messages.guest.tar.gz (58.7 KB) - added by nuess0r 10 years ago.
Log file of the crashing guest
PD Project messages.host (63.9 KB) - added by nuess0r 10 years ago.
Log file of the host (DL360 G5)
PD Project VBox.log.1 (108.5 KB) - added by nuess0r 10 years ago.
publicdomainwiki.vbox (19.8 KB) - added by nuess0r 10 years ago.


Change History (31)

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_3.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_centos.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_oracle.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_oracle_2.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_oracle_3.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_oracle_4.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_err_oracle_5.JPG added

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Kernel Panic Screenshot

Changed 10 years ago by Fiorangelo Peluso

Kernel Panic Screenshot

comment:1 Changed 10 years ago by Fiorangelo Peluso

The guest kernel (currently 2.6.32-431.el6.x86_64, Red Hat 6.5) boots with the "intel_idle.max_cstate=0 idle=poll" kernel parameters set in the grub.conf file.
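For anyone comparing setups, a minimal sketch of how one might confirm that these parameters are actually active on a running kernel, by reading /proc/cmdline. The helper name has_param is made up for this example and is not part of any tool.

```shell
#!/bin/sh
# has_param CMDLINE PARAM (hypothetical helper):
# succeeds if PARAM appears as a whole word in CMDLINE.
has_param() {
    case " $1 " in
        *" $2 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# On a running Linux system the active kernel command line is in /proc/cmdline.
cmdline=$(cat /proc/cmdline 2>/dev/null || echo "")

for p in intel_idle.max_cstate=0 idle=poll; do
    if has_param "$cmdline" "$p"; then
        echo "$p: active"
    else
        echo "$p: missing"
    fi
done
```

Matching on whole words avoids a false positive when one parameter is a prefix of another.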

comment:2 Changed 10 years ago by Fiorangelo Peluso

Searching for this error, we found that a possible solution is to disable nmi_watchdog (we used the command "echo 0 > /proc/sys/kernel/nmi_watchdog"), since this could be an NMI storm issue (this article is quite old but still interesting: http://x86vmm.blogspot.it/2005/10/linux-nmis-on-intel-64-bit-hardware.html ).

Next steps:

  • use kernel boot parameter "divider=10" since it's suggested in VirtualBox troubleshooting for Red Hat / Fedora guests;
  • Disable Hyper Threading

Any help is truly appreciated anyway.
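A minimal sketch of checking the current watchdog state before and after applying the workaround above. The function name nmi_watchdog_state is made up for this example; the optional file argument exists only so the logic can be tried against a test file.

```shell
#!/bin/sh
# nmi_watchdog_state [FILE] (hypothetical helper):
# prints "enabled", "disabled" or "unknown" based on the sysctl file content.
nmi_watchdog_state() {
    file=${1:-/proc/sys/kernel/nmi_watchdog}
    if [ ! -r "$file" ]; then
        echo unknown
        return
    fi
    case "$(cat "$file")" in
        0) echo disabled ;;
        1) echo enabled ;;
        *) echo unknown ;;
    esac
}

echo "NMI watchdog: $(nmi_watchdog_state)"

# To turn it off until the next reboot (as root), the command quoted above:
#   echo 0 > /proc/sys/kernel/nmi_watchdog
# To keep it off across reboots, one common option is a sysctl entry,
# for example in /etc/sysctl.conf:
#   kernel.nmi_watchdog = 0
```

The echo into /proc does not survive a reboot, so on a host that panics after several days it is worth making the setting persistent.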

comment:3 Changed 10 years ago by Klaus Espenlaub

Normally VBox should disable the NMI watchdog automatically, as it interferes with VM execution (guest execution can take quite a while, and during that time the watchdog isn't reset). No immediate idea why this doesn't work...

comment:4 Changed 10 years ago by Klaus Espenlaub

Could it be that you enabled some watchdog in the BIOS? Scratching my head why we don't get many reports like yours. Must be a very special config.

comment:5 Changed 10 years ago by Fiorangelo Peluso

I started with the default BIOS configuration. I really wonder if there is any watchdog enabled in the BIOS. Anyway, the server is now working with nmi_watchdog manually disabled. So far so good; I really hope this is the key.

comment:6 Changed 10 years ago by Frank Mehnert

Please attach a VBox.log file of such a VM session. What I'm after is the exact CPU specification of your host. Thank you!

comment:7 Changed 10 years ago by Fiorangelo Peluso

Which VBox.log should I attach? The panic occurs on the guest and I don't know the cause (it doesn't seem that a single VM is responsible). Attached you'll find /proc/cpuinfo.

Thanks for your feedback

Changed 10 years ago by Fiorangelo Peluso

Attachment: cpuinfo added

/proc/cpuinfo file

comment:8 Changed 10 years ago by Frank Mehnert

Please add a VBox.log file from such a VM anyway. It doesn't matter which one.

Changed 10 years ago by Fiorangelo Peluso

Attachment: vbox_logs.tar.gz added

VBox.log files of one VM

comment:9 Changed 10 years ago by Frank Mehnert

Thank you for the log. I had an idea what it could be, but the log file proved my idea wrong. Disabling the NMI watchdog on the host might help, as in #13506. Of course that's only a workaround.

comment:10 Changed 10 years ago by Fiorangelo Peluso

Thanks for your feedback. Is it safe to disable the NMI watchdog on the host?

comment:11 Changed 10 years ago by Frank Mehnert

Yes. The watchdog is only there for detecting unusual behavior.

comment:12 in reply to: description Changed 10 years ago by nuess0r

I'm having similar issues with my HP DL360 G5 server. I got it second hand and installed it in December. Before that, I ran memtest successfully.

Host running Debian 7 Stable, Linux 3.2.0-4-amd64 #1 SMP Debian 3.2.65-1 x86_64 GNU/Linux
Guest also running Debian 7 Stable, Linux 3.2.0-4-amd64 #1 SMP Debian 3.2.65-1 x86_64 GNU/Linux

VirtualBox: 4.3.20

In my case I have three VMs which I migrated from the old system. These VMs are still 32-bit, with only one CPU each. These three VMs are running absolutely fine, as they did on the old server.

One VM is newly created to provide more computing power; it is configured with SMP, six CPUs and 24 GB of RAM. This VM crashes often. Sometimes it takes half a day, sometimes it runs for three days.

The symptom is that the guest stops responding to anything. A new VRDP connection stays black and empty. On the host, six CPUs are running at 100%; the rest of the host is fine and works normally. There is also no information about the guest crash available on the host (VBox.log or syslog).

After a restart of the guest, the entries in /var/log/messages look the same most of the time. Only the name of the affected process differs: sometimes the PID belongs to mysql, sometimes to apache, etc.

I attached the messages logs from the host and the guest (each including the boot process), the config of the crashing VM and the log of the crashing VM.

One thing to add, though I don't know if it has an effect: booting Linux on this HP DL360 G5 gives a warning message during boot saying that the BIOS is somehow buggy and that I should complain to the vendor.

Changed 10 years ago by nuess0r

Log file of the crashing guest

Changed 10 years ago by nuess0r

Log file of the host (DL360 G5)

Changed 10 years ago by nuess0r

Attachment: PD Project VBox.log.1 added

Changed 10 years ago by nuess0r

Attachment: publicdomainwiki.vbox added

comment:13 Changed 10 years ago by nuess0r

Yesterday I upgraded our server to VirtualBox 4.3.22r98236

Today the problematic VM crashed again, so this bug is still valid.

comment:14 Changed 10 years ago by nj

I raised ticket #14034, which seems to bear some similarity to this ticket.

Last edited 10 years ago by Frank Mehnert

comment:15 Changed 10 years ago by Frank Mehnert

Actually I'm not sure if this is a VBox bug at all. See this Ubuntu ticket. Could you try to blacklist the hpwdt module as suggested there and check if this resolves your problem as well?
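A minimal sketch of checking whether the hpwdt module is currently loaded on the host, followed by the usual modprobe blacklist entry shown in comments. The helper name module_loaded and the file name blacklist-hpwdt.conf are assumptions made for this example; the exact steps in the referenced Ubuntu ticket may differ.

```shell
#!/bin/sh
# module_loaded NAME [FILE] (hypothetical helper):
# succeeds if NAME is listed in /proc/modules (or in FILE, for testing).
module_loaded() {
    grep -q "^$1 " "${2:-/proc/modules}" 2>/dev/null
}

if module_loaded hpwdt; then
    echo "hpwdt is loaded"
else
    echo "hpwdt is not loaded"
fi

# To blacklist the module (as root), add a "blacklist" line to a .conf file
# under /etc/modprobe.d/; the file name below is only an example:
#   echo "blacklist hpwdt" > /etc/modprobe.d/blacklist-hpwdt.conf
# The entry takes effect the next time the module would be auto-loaded,
# typically after a reboot.
```

If the module is already loaded, the blacklist entry alone does not unload it.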

comment:16 Changed 7 years ago by Frank Mehnert

Status: new → closed
Resolution: obsolete