Ubuntu

ubuntu 伺服器通過 web、ftp、ssh 等變得無響應

  • March 1, 2013

我託管了一個 ubuntu 伺服器,有幾次它對所有內容都沒有響應,直到硬重啟完成.. 我已經提取了日誌,但我需要一點幫助來弄清楚它們的含義.. 如果它們真的是相關的,或者如果您認為這可能是硬體問題:

系統日誌

Mar  1 15:11:01 xxxxxxxx CRON[24473]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:12:01 xxxxxxxx CRON[24530]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:13:01 xxxxxxxx CRON[24585]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:14:01 xxxxxxxx CRON[24654]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:15:01 xxxxxxxx CRON[24713]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:16:01 xxxxxxxx CRON[24770]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:17:01 xxxxxxxx CRON[24827]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:17:01 xxxxxxxx CRON[24828]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
Mar  1 15:17:05 xxxxxxxx postfix/pickup[23311]: 3CFEE2E3CF: uid=0 from=<root>
Mar  1 15:17:05 xxxxxxxx postfix/cleanup[24880]: 3CFEE2E3CF: message-id=<20130301151705.3CFEE2E3CF@xxxxxxxx.kimsufi.com>
Mar  1 15:17:05 xxxxxxxx postfix/qmgr[3886]: 3CFEE2E3CF: from=<root@xxxxxxxx.kimsufi.com>, size=2080, nrcpt=1 (queue active)
Mar  1 15:17:05 xxxxxxxx postfix/smtp[24882]: 3CFEE2E3CF: to=<danny@xxxxxxxxxxxxxx.com>, relay=xxxxxxxxxxxxxx.dyndns.org[xxx.xxx.xxx.xxx]:25, delay=0.56, delays=0.08/0/0.21/0.26, dsn=2.6.0, status=sent (250 2.6.0  <20130301151705.3CFEE2E3CF@xxxxxxxx.kimsufi.com> Queued mail for delivery)
Mar  1 15:17:05 xxxxxxxx postfix/qmgr[3886]: 3CFEE2E3CF: removed
Mar  1 15:18:01 xxxxxxxx CRON[24897]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:19:01 xxxxxxxx CRON[24944]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:20:01 xxxxxxxx CRON[24999]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 15:21:01 xxxxxxxx CRON[25046]: (root) CMD (/usr/local/rtm/bin/rtm 35 > /dev/null 2> /dev/null)
Mar  1 16:02:40 xxxxxxxx kernel: imklog 4.6.4, log source = /proc/kmsg started.
Mar  1 16:02:40 xxxxxxxx rsyslogd: [origin software="rsyslogd" swVersion="4.6.4" x-pid="3425" x-info="http://www.rsyslog.com"] (re)start
Mar  1 16:02:40 xxxxxxxx rsyslogd: rsyslogd's groupid changed to 103
Mar  1 16:02:40 xxxxxxxx rsyslogd: rsyslogd's userid changed to 101
Mar  1 16:02:40 xxxxxxxx rsyslogd-2039: Could no open output pipe '/dev/xconsole' [try http://www.rsyslog.com/e/2039 ]
Mar  1 16:02:40 xxxxxxxx kernel: Initializing cgroup subsys cpuset
Mar  1 16:02:40 xxxxxxxx kernel: Linux version 3.2.13-grsec-xxxx-grs-ipv6-64 (root@kernel-64.ovh.net) (gcc version 4.3.2 (Debian 4.3.2-1.1) ) #1 SMP Thu Mar 29 09:48:59 UTC 2012
Mar  1 16:02:40 xxxxxxxx kernel: Command line: root=/dev/sda1 console=tty0 BOOT_IMAGE=bzImage-2.6-xxxx-grs-ipv6-64 
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-provided physical RAM map:
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 0000000000000000 - 000000000009d800 (usable)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 000000000009d800 - 00000000000a0000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 0000000000100000 - 00000000df790000 (usable)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000df790000 - 00000000df79e000 (ACPI data)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000df79e000 - 00000000df7d0000 (ACPI NVS)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000df7d0000 - 00000000df7e0000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000df7ec000 - 00000000f0000000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 00000000ffc00000 - 0000000100000000 (reserved)
Mar  1 16:02:40 xxxxxxxx kernel: BIOS-e820: 0000000100000000 - 0000000620000000 (usable)
Mar  1 16:02:40 xxxxxxxx kernel: NX (Execute Disable) protection: active
Mar  1 16:02:40 xxxxxxxx kernel: DMI present.

如您所見,伺服器在 cronjob 執行後立即停止。這裡沒有執行複雜的作業。

你有什麼可以給我診斷問題的建議嗎?

謝謝

rsyslogd-2039:無法打開輸出管道“/dev/xconsole”[嘗試http://www.rsyslog.com/e/2039 ]

這將是您的 Ubuntu 版本中已確認的錯誤,所以我希望這是導致問題的原因,並嘗試首先解決它。

您可以升級以繞過它,或者嘗試這裡的建議,這將是編輯您的/etc/rsyslog.d/50-default.conf文件(或執行apt-get upgrade)。

如果做不到這一點,請停止執行在伺服器掛起之前發生的 cron 作業並查看它,看看它可能在做什麼可能導致您的伺服器掛起。如果不出意外,修復rsyslog錯誤可能會讓您擷取一些有用的日誌記錄資訊,這些資訊可以為您指明正確的方向。

引用自:https://serverfault.com/questions/483829