Hardware
這個硬碟死了嗎?
不確定,這個 Q 的網站是否正確,但讓我試試
上次我的硬碟有問題。有時它會發出奇怪的聲音,我從日誌中得到它:
$dmesg | grep ata4 [29409.945516] ata4.00: exception Emask 0x10 SAct 0xf SErr 0x90202 action 0xe frozen [29409.945529] ata4.00: irq_stat 0x00400000, PHY RDY changed [29409.945538] ata4: SError: { RecovComm Persist PHYRdyChg 10B8B } [29409.945546] ata4.00: failed command: READ FPDMA QUEUED [29409.945562] ata4.00: cmd 60/30:00:56:22:5f/00:00:00:00:00/40 tag 0 ncq 24576 in [29409.945573] ata4.00: status: { DRDY } [29409.945580] ata4.00: failed command: READ FPDMA QUEUED [29409.945594] ata4.00: cmd 60/18:08:8e:22:5f/00:00:00:00:00/40 tag 1 ncq 12288 in [29409.945605] ata4.00: status: { DRDY } [29409.945611] ata4.00: failed command: READ FPDMA QUEUED [29409.945625] ata4.00: cmd 60/08:10:46:02:66/00:00:00:00:00/40 tag 2 ncq 4096 in [29409.945635] ata4.00: status: { DRDY } [29409.945641] ata4.00: failed command: READ FPDMA QUEUED [29409.945656] ata4.00: cmd 60/80:18:ee:04:66/00:00:00:00:00/40 tag 3 ncq 65536 in [29409.945666] ata4.00: status: { DRDY } [29409.945679] ata4: hard resetting link [29413.976083] ata4: softreset failed (device not ready) [29413.976097] ata4: applying SB600 PMP SRST workaround and retrying [29414.148070] ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 300) [29414.184986] ata4.00: SB600 AHCI: limiting to 255 sectors per cmd [29414.243280] ata4.00: SB600 AHCI: limiting to 255 sectors per cmd [29414.243292] ata4.00: configured for UDMA/133 [29414.243324] ata4: EH complete [680674.804563] ata4: exception Emask 0x50 SAct 0x0 SErr 0x90a02 action 0xe frozen [680674.804575] ata4: irq_stat 0x00400000, PHY RDY changed [680674.804584] ata4: SError: { RecovComm Persist HostInt PHYRdyChg 10B8B } [680674.804603] ata4: hard resetting link [680678.840561] ata4: softreset failed (device not ready)
這個ata4 sata硬碟死了嗎?我必須盡快改變它嗎?需要我指定更多資訊嗎?
立即更換驅動器,尤其是當“奇怪的聲音”是咔噠聲時。
點擊當然是一個問題,但我發現這個問題正在調查我們伺服器上的類似錯誤,在我們的例子中,問題不是驅動器,而是 WD(Western Digital)和錯誤的 NCQ 實施的問題!
您可以在這些連結上閱讀它,看看它是否適合您的問題:
- https://superuser.com/questions/284952/troublesome-hard-drive-in-lvm-is-it-broken
- http://www.axelog.de/2010/05/9-sata-phyrdychg-exception/
- https://bugzilla.kernel.org/show_bug.cgi?id=8627
但是錯誤(特別是這部分)看起來很相似,以至於我發現了這個問題:)
ata6.00: exception Emask 0x10 SAct 0x0 SErr 0x1810000 action 0xe frozen ata6.00: irq_stat 0x00400000, PHY RDY changed ata6: SError: { PHYRdyChg LinkSeq TrStaTrns }
對我們來說,短期修復/測試是這樣的:
echo 1 > /sys/block/sd{a,b,c,d}/device/queue_depth
長期將其添加到黑名單中,您可以在參考資料中閱讀。我沒有這方面的經驗,但連結 2 說更新檔看起來像這樣:
--- a/drivers/ata/libata-core.c 2010-05-20 20:39:08.000000000 +0200 +++ b/drivers/ata/libata-core.c 2010-05-20 20:43:54.000000000 +0200 @@ -3924,6 +3924,7 @@ { "Maxtor 7V300F0", "VA111630", ATA_HORKAGE_NONCQ }, { "ST380817AS", "3.42", ATA_HORKAGE_NONCQ }, { "ST3160023AS", "3.42", ATA_HORKAGE_NONCQ }, + { "WDC WD2502ABYS-5*", NULL, ATA_HORKAGE_NONCQ },