Proxmox 5.3 Server startet einfach neu

mathschut

Well-Known Member
Sep 5, 2017
30
0
46
38
Hallo,

ich habe seit ein paar Tagen das Problem, dass mein PVE 5.3 Server manchmal einfach neustartet. Wie bekomme ich denn raus, warum er das macht und was er für ein Problem hat?
 
Wie bekomme ich denn raus, warum er das macht und was er für ein Problem hat?

Die Log-Files unter `/var/log/syslog` und/oder `/var/log/syslog.1` usw. checken. Nach "shutdown" o.Ä. suchen.
 
Und falls die syslog Dateien zu keiner Weisheit führen,

> dmesg

nach eventuellen kernel panics, OOMs u.ä. absuchen.

Ich hatte einmal etwas ähnliches, als ich im BIOS den Watchdog eingeschaltet habe und der mir eine Maschine in nicht nachvollziehbare Weise - natürlich zum ungünstigsten Zeitpunkt - per Reset geerdet hat.
 
Hi,

ich habe folgende Fehlermeldungen im Log gefunden, kann das damit zusammen hängen?

Code:
Apr  1 02:19:10 pve kernel: [    0.062419] ACPI: Power Resource [PG00] (on)
Apr  1 02:19:10 pve kernel: [    0.062761] ACPI: Power Resource [PG01] (on)
Apr  1 02:19:10 pve kernel: [    0.063078] ACPI: Power Resource [PG02] (on)
Apr  1 02:19:10 pve kernel: [    0.065503] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.065788] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.066073] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.066358] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.066641] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.066922] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.067208] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.067490] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.067773] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.068063] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.068346] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.068628] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.068909] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.069193] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.069478] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.069760] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.070043] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.070331] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.070613] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.070895] ACPI: Power Resource [WRST] (on)
Apr  1 02:19:10 pve kernel: [    0.083327] ACPI: Power Resource [FN00] (off)
Apr  1 02:19:10 pve kernel: [    0.083405] ACPI: Power Resource [FN01] (off)
Apr  1 02:19:10 pve kernel: [    0.083482] ACPI: Power Resource [FN02] (off)
Apr  1 02:19:10 pve kernel: [    0.083556] ACPI: Power Resource [FN03] (off)
Apr  1 02:19:10 pve kernel: [    0.083629] ACPI: Power Resource [FN04] (off)
Apr  1 02:19:10 pve kernel: [    0.084707] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-3e])
Apr  1 02:19:10 pve kernel: [    0.084711] acpi PNP0A08:00: _OSC: OS supports [ExtendedConfig ASPM ClockPM Segments MSI]
Apr  1 02:19:10 pve kernel: [    0.084743] acpi PNP0A08:00: _OSC failed (AE_ERROR); disabling ASPM
Apr  1 02:19:10 pve kernel: [    0.085390] PCI host bridge to bus 0000:00
Apr  1 02:19:10 pve kernel: [    0.085392] pci_bus 0000:00: root bus resource [io  0x0000-0x0cf7 window]
Apr  1 02:19:10 pve kernel: [    0.085393] pci_bus 0000:00: root bus resource [io  0x0d00-0xffff window]
Apr  1 02:19:10 pve kernel: [    0.085394] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
Apr  1 02:19:10 pve kernel: [    0.085395] pci_bus 0000:00: root bus resource [mem 0xe0000000-0xf7ffffff window]
Apr  1 02:19:10 pve kernel: [    0.085395] pci_bus 0000:00: root bus resource [mem 0xfd000000-0xfe7fffff window]
 
Die Frage wäre wie alt der Server/das mainboard ist und ob da ne aktuelle - bzw. die neueste verfügbare - BIOS Version drauf ist.

Du könntest (GRRR copy & paste geht wieder nicht in dem /"&§("/$ Forum hier) "acpi=force reboot=acpi" dem Grub mitgeben.
Falls Du da ein neues, relativ fehlerfreies BIOS hast. Ansonsten hart "acpi=off". Das ist erstmal die Knüppel-Methode. Wenn der Workaround funktioniert, kann man immer noch nach sanfteren Alternativen Ausschau halten.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!