Debian

Proxmox-Server 偶爾重啟

  • July 2, 2016

我剛剛提到我的 proxmox 系統在執行備份任務時不時重新啟動(bacula,而不是 proxmox-backup-system)。

日誌顯示以下內容:

Mar 8 02:30:01 proxmox-server kernel: [1922972.974633] audit: type=1400 audit(1457400601.487:129743): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22456 comm="atop" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:30:01 proxmox-server kernel: [1922972.974665] audit: type=1400 audit(1457400601.487:129744): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22456 comm="atop" requested_mask="read" denied_mask="read" peer="unconfined"

Mar 8 02:30:01 proxmox-server kernel: [1922973.110758] audit: type=1400 audit(1457400601.623:129746): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22532 comm="atop" requested_mask="read" denied_mask="read" peer="unconfined"

Mar 8 02:30:01 proxmox-server kernel: [1922973.111619] audit: type=1400 audit(1457400601.627:129747): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22532 comm="atop" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:31:33 proxmox-server kernel: [1923064.989437] audit: type=1400 audit(1457400693.511:129762): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11598 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:31:33 proxmox-server kernel: [1923064.989461] audit: type=1400 audit(1457400693.511:129763): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11598 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:31:34 proxmox-server puppet-agent[31158]: Finished catalog run in 0.63 seconds

Mar 8 02:33:09 proxmox-server kernel: [1923160.542757] audit: type=1400 audit(1457400789.071:129768): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:33:09 proxmox-server kernel: [1923160.542962] audit: type=1400 audit(1457400789.071:129769): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:34:23 proxmox-server kernel: [1923235.217380] audit: type=1400 audit(1457400863.751:129770): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=31188 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:36:11 proxmox-server kernel: [1923342.872770] audit: type=1400 audit(1457400971.415:129771): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=9471 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:36:13 proxmox-server kernel: [1923344.924216] audit: type=1400 audit(1457400973.463:129772): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11037 comm="sw-collectd" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:36:13 proxmox-server kernel: [1923344.924251] audit: type=1400 audit(1457400973.463:129773): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11037 comm="sw-collectd" requested_mask="read" denied_mask="read" peer="unconfined"

Mar 8 02:36:28 proxmox-server kernel: [1923360.048495] audit: type=1400 audit(1457400988.591:129777): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=15979 comm="sw-collectd" requested_mask="read" denied_mask="read" peer="unconfined"

Mar 8 02:36:30 proxmox-server kernel: [1923361.994794] audit: type=1400 audit(1457400990.536:129778): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=16377 comm="sw-collectd" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:36:49 proxmox-server kernel: [1923380.457157] audit: type=1400 audit(1457401009.000:129784): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=2842 comm="ps" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:36:49 proxmox-server kernel: [1923380.821127] audit: type=1400 audit(1457401009.364:129790): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=2872 comm="ps" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:38:11 proxmox-server kernel: [1923462.649203] audit_printk_skb: 54 callbacks suppressed

Mar 8 02:38:11 proxmox-server kernel: [1923462.649206] audit: type=1400 audit(1457401091.200:129810): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:38:11 proxmox-server kernel: [1923462.649433] audit: type=1400 audit(1457401091.200:129811): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined"

Mar 8 02:41:25 proxmox-server rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="920" x-info="http://www.rsyslog.com"] start

Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Module 'fuse' is builtin

Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Module 'loop' is builtin

Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Inserted module 'vhost_net'

Mar 8 02:41:25 proxmox-server hdparm[324]: Setting parameters of disc: (none).

Mar 8 02:41:25 proxmox-server keyboard-setup[325]: Setting preliminary keymap...done.

Mar 8 02:41:25 proxmox-server systemd-fsck[482]: /dev/sda3: Journal wird wiederhergestellt

Mar 8 02:41:25 proxmox-server systemd-fsck[482]: /dev/sda3: sauber, 331/62592 Dateien, 64258/250112 Blöcke

這是我的 proxmox 設置:

proxmox-ve: 4.1-39 (running kernel: 4.2.8-1-pve)
pve-manager: 4.1-15 (running version: 4.1-15/8cd55b52)
pve-kernel-4.2.6-1-pve: 4.2.6-36
pve-kernel-2.6.32-43-pve: 2.6.32-166
pve-kernel-4.2.8-1-pve: 4.2.8-39
pve-kernel-4.2.2-1-pve: 4.2.2-16
pve-kernel-2.6.32-26-pve: 2.6.32-114
pve-kernel-4.2.3-2-pve: 4.2.3-22
lvm2: 2.02.116-pve2
corosync-pve: 2.3.5-2
libqb0: 1.0-1
pve-cluster: 4.0-33
qemu-server: 4.0-62
pve-firmware: 1.1-7
libpve-common-perl: 4.0-49
libpve-access-control: 4.0-11
libpve-storage-perl: 4.0-42
pve-libspice-server1: 0.12.5-2
vncterm: 1.2-1
pve-qemu-kvm: 2.5-8
pve-container: 1.0-46
pve-firewall: 2.0-18
pve-ha-manager: 1.0-23
ksm-control-daemon: 1.2-1
glusterfs-client: 3.5.2-2+deb8u1
lxc-pve: 1.1.5-7
lxcfs: 2.0.0-pve1
cgmanager: 0.39-pve1
criu: 1.6.0-1

有什麼想法嗎?

通常當某些東西偶爾重啟時,它不是作業系統,而是物理問題。

我會從電源開始。是好還是便宜?伺服器上是否有冗餘電源,或者是低端伺服器或台式機。電源是否過載?

有可能存在一堆磁碟,並且這些磁碟在執行備份時都在旋轉和搜尋,從而使功耗達到峰值。這可能會導致電源的一個特定軌道(又名電壓匯流排)過載,儘管整個 PSU 都很好。

例如,如果 +5V 上的最大功率為 20 安培,並且您有 10 個磁碟,每個磁碟的電流為 2.1 安培,那麼您將過載電源,它將關閉或重新啟動。

引用自:https://serverfault.com/questions/762321