Todo o meu disco rígido principal Barracuda 7200.12 SATA 6Gb / s 250GB ST3250312AS (2 anos) vai à loucura. Começou em 28 de junho de 2014.
Partições sda1 (root), sda5 (home), sda7 (var) capturaram isso de tempos em tempos em arquivos aleatórios. Às vezes é a minha partição inicial, quando eu baixo algo via navegador. Às vezes é / var, onde minhas bases de .rrd:
[82845.340334] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
[82845.340346] ata1.00: irq_stat 0x40000001
[82845.340347] ata1.00: failed command: READ DMA
[82845.340351] ata1.00: cmd c8/00:40:3e:29:0f/00:00:00:00:00/ec tag 0 dma 32768 in
[82845.340351] res 51/40:00:3e:29:0f/00:00:0c:00:00/0c Emask 0x9 (media error)
[82845.340352] ata1.00: status: { DRDY ERR }
[82845.340353] ata1.00: error: { UNC }
[82845.417764] ata1.00: configured for UDMA/133
[82845.417775] sd 0:0:0:0: [sda] Unhandled sense code
[82845.417777] sd 0:0:0:0: [sda]
[82845.417778] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[82845.417779] sd 0:0:0:0: [sda]
[82845.417780] Sense Key : Medium Error [current] [descriptor]
[82845.417782] Descriptor sense data with sense descriptors (in hex):
[82845.417783] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[82845.417789] 0c 0f 29 3e
[82845.417791] sd 0:0:0:0: [sda]
[82845.417792] Add. Sense: Unrecovered read error - auto reallocate failed
[82845.417793] sd 0:0:0:0: [sda] CDB:
[82845.417794] Read(10): 28 00 0c 0f 29 3e 00 00 40 00
[82845.417799] end_request: I/O error, dev sda, sector 202320190
[82845.417809] ata1: EH complete
[82848.253943] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
[82848.253946] ata1.00: irq_stat 0x40000001
[82848.253955] ata1.00: failed command: READ DMA
[82848.253958] ata1.00: cmd c8/00:08:3e:29:0f/00:00:00:00:00/ec tag 0 dma 4096 in
[82848.253958] res 51/40:00:3e:29:0f/00:00:0c:00:00/0c Emask 0x9 (media error)
[82848.253960] ata1.00: status: { DRDY ERR }
[82848.253961] ata1.00: error: { UNC }
[82848.264595] ata1.00: configured for UDMA/133
[82848.264610] sd 0:0:0:0: [sda] Unhandled sense code
[82848.264611] sd 0:0:0:0: [sda]
[82848.264612] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[82848.264614] sd 0:0:0:0: [sda]
[82848.264615] Sense Key : Medium Error [current] [descriptor]
[82848.264617] Descriptor sense data with sense descriptors (in hex):
[82848.264618] 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
[82848.264623] 0c 0f 29 3e
[82848.264626] sd 0:0:0:0: [sda]
[82848.264627] Add. Sense: Unrecovered read error - auto reallocate failed
[82848.264628] sd 0:0:0:0: [sda] CDB:
[82848.264629] Read(10): 28 00 0c 0f 29 3e 00 00 08 00
[82848.264634] end_request: I/O error, dev sda, sector 202320190
[82848.264648] ata1: EH complete
Eu concluí o backup. Não tenho medo.
Estou no Ubuntu mais recente e no kernel mais recente.
Linux Ubuntu 3.13.0-30-lowlatency #54-Ubuntu SMP PREEMPT Mon Jun 9 23:14:29 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
O Google encontrou para mim muitas perguntas semelhantes e respostas perfeitas em muitos fóruns diferentes.
Estou tentando descobrir, é uma falha de hardware ou é este patch (30 horas atrás) para o kernel do linux.
Alguma sugestão?
Edit1: encontrei também toools perfeitos:
# sdparm --page=rw --long --long --long /dev/sda
/dev/sda: ATA ST3250312AS
Direct access device specific parameters: WP=0 DPOFUA=0
Read write error recovery [rw] mode page [PS=0]:
AWRE 1 [cha: n, def: 1] Automatic write reallocation enabled
ARRE 0 [cha: n, def: 0] Automatic read reallocation enabled
TB 0 [cha: n, def: 0] Transfer block
RC 0 [cha: n, def: 0] Read continuous
0: error recovery may cause delays
1: transfer data without waiting for error recovery
EER 0 [cha: n, def: 0] Enable early recovery
1: increase chance of mis-detection or mis-correction of error
PER 0 [cha: n, def: 0] Post error
0: do not post recovered errors
1: report recovered errors (via sense key: recovered error)
DTE 0 [cha: n, def: 0] Data terminate on error
1: terminate data transfer when recovered error detected
DCR 0 [cha: n, def: 0] Disable correction
RRC 0 [cha: n, def: 0] Read retry count
COR_S 0 [cha: n, def: 0] Correction span (obsolete)
HOC 0 [cha: n, def: 0] Head offset count (obsolete)
DSOC 0 [cha: n, def: 0] Data strobe offset count (obsolete)
LBPERE 0 [cha: n, def: 0] Logical block provisioning error reporting enabled
WRC 0 [cha: n, def: 0] Write retry count
RTL 0 [cha: n, def: 0] Recovery time limit (ms)
0: default, -1: 65.5 seconds
Editar2 : smartctl -a / dev / sda - > pastebin.com/VsFEbAAQ
Edit3 : Por que eu espero que isso não seja um hw, mas Ext4: Porque, leia este tópico bbs.archlinux.org/viewtopic.php?id=151341
Ted Ts'o, a mesma pessoa que cometeu o patch anterior, por volta de 2012-10-24.