Hardware

這個硬碟死了嗎?

  • February 2, 2017

不確定,這個 Q 的網站是否正確,但讓我試試

上次我的硬碟有問題。有時它會發出奇怪的聲音,我從日誌中得到它:

$dmesg | grep ata4

[29409.945516] ata4.00: exception Emask 0x10 SAct 0xf SErr 0x90202 action 0xe frozen

[29409.945529] ata4.00: irq_stat 0x00400000, PHY RDY changed

[29409.945538] ata4: SError: { RecovComm Persist PHYRdyChg 10B8B }

[29409.945546] ata4.00: failed command: READ FPDMA QUEUED

[29409.945562] ata4.00: cmd 60/30:00:56:22:5f/00:00:00:00:00/40 tag 0 ncq 24576 
in
[29409.945573] ata4.00: status: { DRDY }

[29409.945580] ata4.00: failed command: READ FPDMA QUEUED

[29409.945594] ata4.00: cmd 60/18:08:8e:22:5f/00:00:00:00:00/40 tag 1 ncq 12288 
in
[29409.945605] ata4.00: status: { DRDY }

[29409.945611] ata4.00: failed command: READ FPDMA QUEUED

[29409.945625] ata4.00: cmd 60/08:10:46:02:66/00:00:00:00:00/40 tag 2 ncq 4096 
in
[29409.945635] ata4.00: status: { DRDY }

[29409.945641] ata4.00: failed command: READ FPDMA QUEUED

[29409.945656] ata4.00: cmd 60/80:18:ee:04:66/00:00:00:00:00/40 tag 3 ncq 65536 
in
[29409.945666] ata4.00: status: { DRDY }

[29409.945679] ata4: hard resetting link

[29413.976083] ata4: softreset failed (device not ready)

[29413.976097] ata4: applying SB600 PMP SRST workaround and retrying

[29414.148070] ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 300)

[29414.184986] ata4.00: SB600 AHCI: limiting to 255 sectors per cmd

[29414.243280] ata4.00: SB600 AHCI: limiting to 255 sectors per cmd

[29414.243292] ata4.00: configured for UDMA/133

[29414.243324] ata4: EH complete

[680674.804563] ata4: exception Emask 0x50 SAct 0x0 SErr 0x90a02 action 0xe 
frozen
[680674.804575] ata4: irq_stat 0x00400000, PHY RDY changed

[680674.804584] ata4: SError: { RecovComm Persist HostInt PHYRdyChg 10B8B }

[680674.804603] ata4: hard resetting link

[680678.840561] ata4: softreset failed (device not ready)

這個ata4 sata硬碟死了嗎?我必須盡快改變它嗎?需要我指定更多資訊嗎?

立即更換驅動器,尤其是當“奇怪的聲音”是咔噠聲時。

點擊當然是一個問題,但我發現這個問題正在調查我們伺服器上的類似錯誤,在我們的例子中,問題不是驅動器,而是 WD(Western Digital)和錯誤的 NCQ 實施的問題!

您可以在這些連結上閱讀它,看看它是否適合您的問題:

但是錯誤(特別是這部分)看起來很相似,以至於我發現了這個問題:)

ata6.00: exception Emask 0x10 SAct 0x0 SErr 0x1810000 action 0xe frozen
ata6.00: irq_stat 0x00400000, PHY RDY changed
ata6: SError: { PHYRdyChg LinkSeq TrStaTrns }

對我們來說,短期修復/測試是這樣的:

echo 1 > /sys/block/sd{a,b,c,d}/device/queue_depth

長期將其添加到黑名單中,您可以在參考資料中閱讀。我沒有這方面的經驗,但連結 2 說更新檔看起來像這樣:

--- a/drivers/ata/libata-core.c 2010-05-20 20:39:08.000000000 +0200
+++ b/drivers/ata/libata-core.c 2010-05-20 20:43:54.000000000 +0200
@@ -3924,6 +3924,7 @@
       { "Maxtor 7V300F0",     "VA111630",     ATA_HORKAGE_NONCQ },
       { "ST380817AS",         "3.42",         ATA_HORKAGE_NONCQ },
       { "ST3160023AS",        "3.42",         ATA_HORKAGE_NONCQ },
+       { "WDC WD2502ABYS-5*",  NULL,           ATA_HORKAGE_NONCQ },

引用自:https://serverfault.com/questions/330675