Forums Neueste Beiträge
 

[gentoo-user-de] Kaputte Platte?

26/02/2008 - 15:30 von André Glücksmann | Report spam
Hash: RIPEMD160


Guten Tag - liebe Gentoo-liste-leute,

nach dem Missverstàndnis von neulich, bei dem nicht nur mein Postfach
mit unsinnigen Mails befüllt wurde, wollte ich mal wieder zu etwas
Ernstem kommen...

Für eine meiner unzàhligen Platten steht regelmàßig im Syslog:
"Feb 26 14:08:27 [smartd] Device: /dev/sdf, 1 Offline uncorrectable
sectors_"

Weiter unten habe ich noch mal die komplette Ausgabe von
"smartctl --all /dev/sdf" angehàngt.

Den Fehler habe ich bestimmt schon seit ein paar Monaten, ich habe immer
drauf gewartet, dass die Platte komplett den Geist aufgibt um sie dann
auszuwechseln und den Raid1-Verbund wieder herzustellen, allerdings
làuft sie nun ja schon seit Monaten...

Also was meint ihr? Soll ich sie raussschmeissen? Wegschmeissen und neu
kaufen? Habt ihr Erfahrungen mit solchen Fehlern?
Ich habe smartd erst vor ein paar Wochen bei mir installiert und vorher
immer erst Platten gewechselt, wenn sie halt nciht mehr liefen ;-)

Gruß
André
==
saturn ~ # smartctl --all /dev/sdf

~ smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C)
2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

START OF INFORMATION SECTION ==Device Model: SAMSUNG SP1614C
Serial Number: XXXXXXXXXXXXXXX
Firmware Version: SW100-25
User Capacity: 160,041,885,696 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 7
ATA Standard is: ATA/ATAPI-7 T13 1532D revision 0
Local Time is: Tue Feb 26 15:23:14 2008 CET

==> WARNING: May need -F samsung2 disabled; see manual for details.

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

START OF READ SMART DATA SECTION ==SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x80) Offline data collection activity
~ was never started.
~ Auto Offline Data Collection:
Enabled.
Self-test execution status: ( 244) Self-test routine in progress...
~ 40% of test remaining.
Total time to complete Offline
data collection: (5760) seconds.
Offline data collection
capabilities: (0x1b) SMART execute Offline immediate.
~ Auto Offline data collection
on/off su
~ pport.
~ Suspend Offline collection upon new
~ command.
~ Offline surface scan supported.
~ Self-test supported.
~ No Conveyance Self-test supported.
~ No Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
~ power-saving mode.
~ Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
~ No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 96) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE
UPDATED WHE
~ N_FAILED RAW_VALUE
~ 1 Raw_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always

~ - 0
~ 3 Spin_Up_Time 0x0007 067 056 000 Pre-fail Always

~ - 5696
~ 4 Start_Stop_Count 0x0032 098 098 000 Old_age Always

~ - 3002
~ 5 Reallocated_Sector_Ct 0x0033 253 253 010 Pre-fail Always

~ - 0
~ 7 Seek_Error_Rate 0x000b 253 253 051 Pre-fail Always

~ - 0
~ 8 Seek_Time_Performance 0x0024 253 253 000 Old_age
Offline
~ - 0
~ 9 Power_On_Half_Minutes 0x0032 100 100 000 Old_age Always

~ - 2787h+44m
~ 10 Spin_Retry_Count 0x0013 253 253 049 Pre-fail Always

~ - 0
~ 12 Power_Cycle_Count 0x0032 099 099 000 Old_age Always

~ - 1765
194 Temperature_Celsius 0x0022 157 085 000 Old_age Always

~ - 27
195 Hardware_ECC_Recovered 0x000a 100 100 000 Old_age Always

~ - 89102473
196 Reallocated_Event_Count 0x0012 100 100 000 Old_age Always

~ - 1
197 Current_Pending_Sector 0x0033 253 253 010 Pre-fail Always

~ - 0
198 Offline_Uncorrectable 0x0031 100 100 010 Pre-fail
Offline
~ - 1
199 UDMA_CRC_Error_Count 0x000b 100 100 051 Pre-fail Always

~ - 0
200 Multi_Zone_Error_Rate 0x000b 100 100 051 Pre-fail Always

~ - 0
201 Soft_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always

~ - 0

SMART Error Log Version: 1
ATA Error Count: 41728 (device log contains only the most recent five
errors)
~ CR = Command Register [HEX]
~ FR = Features Register [HEX]
~ SC = Sector Count Register [HEX]
~ SN = Sector Number Register [HEX]
~ CL = Cylinder Low Register [HEX]
~ CH = Cylinder High Register [HEX]
~ DH = Device/Head Register [HEX]
~ DC = Device Command Register [HEX]
~ ER = Error register [HEX]
~ ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 41728 occurred at disk power-on lifetime: 2153 hours (89 days + 17
hours
~ )
~ When the command that caused the error occurred, the device was active
or id
~ le.

~ After command completion occurred, registers were:
~ ER ST SC SN CL CH DH
~ -- -- -- -- -- -- --
~ 40 51 00 26 b1 f0 e0 Error: UNC at LBA = 0x00f0b126 = 15773990

~ Commands leading to the command that caused the error were:
~ CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
~ -- -- -- -- -- -- -- -- - --
~ c8 00 00 26 b1 f0 e0 00 02:52:36.313 READ DMA
~ c8 00 b8 6e b0 f0 e0 00 02:52:36.313 READ DMA
~ c8 00 08 8e 83 ef e0 00 02:52:36.313 READ DMA
~ c8 00 48 26 b0 f0 e0 00 02:52:36.313 READ DMA
~ c8 00 00 26 af f0 e0 00 02:52:36.313 READ DMA

Error 41727 occurred at disk power-on lifetime: 881 hours (36 days + 17
hours)
~ When the command that caused the error occurred, the device was active
or id
~ le.

~ After command completion occurred, registers were:
~ ER ST SC SN CL CH DH
~ -- -- -- -- -- -- --
~ 40 51 00 77 b1 f0 e0 Error: UNC at LBA = 0x00f0b177 = 15774071

~ Commands leading to the command that caused the error were:
~ CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
~ -- -- -- -- -- -- -- -- - --
~ c8 00 00 77 b1 f0 e0 00 02:58:46.188 READ DMA
~ ec 00 00 00 00 00 a0 00 02:58:46.188 IDENTIFY DEVICE
~ ef 03 46 00 00 00 a0 00 02:58:46.188 SET FEATURES [Set transfer
mode]
~ ec 00 00 00 00 00 a0 00 02:58:46.188 IDENTIFY DEVICE

Error 41726 occurred at disk power-on lifetime: 881 hours (36 days + 17
hours)
~ When the command that caused the error occurred, the device was active
or id
~ le.

~ After command completion occurred, registers were:
~ ER ST SC SN CL CH DH
~ -- -- -- -- -- -- --
~ 40 51 00 77 b1 f0 e0 Error: UNC at LBA = 0x00f0b177 = 15774071

~ Commands leading to the command that caused the error were:
~ CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
~ -- -- -- -- -- -- -- -- - --
~ c8 00 00 77 b1 f0 e0 00 02:58:45.125 READ DMA
~ ec 00 00 00 00 00 a0 00 02:58:45.125 IDENTIFY DEVICE
~ ef 03 46 00 00 00 a0 00 02:58:45.125 SET FEATURES [Set transfer
mode]
~ ec 00 00 00 00 00 a0 00 02:58:45.125 IDENTIFY DEVICE

Error 41725 occurred at disk power-on lifetime: 881 hours (36 days + 17
hours)
~ When the command that caused the error occurred, the device was active
or id
~ le.

~ After command completion occurred, registers were:
~ ER ST SC SN CL CH DH
~ -- -- -- -- -- -- --
~ 40 51 00 77 b1 f0 e0 Error: UNC at LBA = 0x00f0b177 = 15774071

~ Commands leading to the command that caused the error were:
~ CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
~ -- -- -- -- -- -- -- -- - --
~ c8 00 00 77 b1 f0 e0 00 02:58:44.125 READ DMA
~ ec 00 00 00 00 00 a0 00 02:58:44.125 IDENTIFY DEVICE
~ ef 03 46 00 00 00 a0 00 02:58:44.125 SET FEATURES [Set transfer
mode]
~ ec 00 00 00 00 00 a0 00 02:58:44.125 IDENTIFY DEVICE

Error 41724 occurred at disk power-on lifetime: 881 hours (36 days + 17
hours)
~ When the command that caused the error occurred, the device was active
or id
~ le.

~ After command completion occurred, registers were:
~ ER ST SC SN CL CH DH
~ -- -- -- -- -- -- --
~ 40 51 00 77 b1 f0 e0 Error: UNC at LBA = 0x00f0b177 = 15774071

~ Commands leading to the command that caused the error were:
~ CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
~ -- -- -- -- -- -- -- -- - --
~ c8 00 00 77 b1 f0 e0 00 02:58:43.063 READ DMA
~ ec 00 00 00 00 00 a0 00 02:58:43.063 IDENTIFY DEVICE
~ ef 03 46 00 00 00 a0 00 02:58:43.063 SET FEATURES [Set transfer
mode]
~ ec 00 00 00 00 00 a0 00 02:58:43.063 IDENTIFY DEVICE

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining
LifeTime(hours) L
~ BA_of_first_error
# 1 Short offline Completed without error 00% 2786
~ -
# 2 Short offline Completed without error 00% 2777
~ -
# 3 Short offline Completed without error 00% 2777
~ -
# 4 Short offline Completed without error 00% 2776
~ -
# 5 Short offline Completed without error 00% 2714
~ -
# 6 Short offline Completed without error 00% 2700
~ -
# 7 Short offline Completed without error 00% 2693
~ -
# 8 Short offline Completed without error 00% 2687
~ -
# 9 Extended offline Completed without error 00% 2676
~ -
#10 Short offline Completed without error 00% 2662
~ -
#11 Short offline Completed without error 00% 2653
~ -
#12 Short offline Completed without error 00% 2652
~ -
#13 Short offline Completed without error 00% 2637
~ -
#14 Short offline Completed without error 00% 2622
~ -
#15 Short offline Completed without error 00% 2611
~ -
#16 Short offline Completed without error 00% 2598
~ -
#17 Extended offline Completed without error 00% 2585
~ -
#18 Short offline Completed without error 00% 2570
~ -
#19 Short offline Completed without error 00% 2557
~ -
#20 Short offline Completed without error 00% 2541
~ -
#21 Short offline Completed without error 00% 2529
~ -

Device does not support Selective Self Tests/Logging

email@smixx.de - GnuPG Key-ID: 0x1489FF7D
gentoo-user-de@lists.gentoo.org mailing list
 

Lesen sie die antworten

#1 Jörg Frings-Fürst
27/02/2008 - 22:40 | Warnen spam
André Glücksmann schrieb:

Guten Tag - liebe Gentoo-liste-leute,

nach dem Missverstàndnis von neulich, bei dem nicht nur mein Postfach
mit unsinnigen Mails befüllt wurde, wollte ich mal wieder zu etwas
Ernstem kommen...

Für eine meiner unzàhligen Platten steht regelmàßig im Syslog:
"Feb 26 14:08:27 [smartd] Device: /dev/sdf, 1 Offline uncorrectable
sectors_"

Weiter unten habe ich noch mal die komplette Ausgabe von
"smartctl --all /dev/sdf" angehàngt.

Den Fehler habe ich bestimmt schon seit ein paar Monaten, ich habe immer
drauf gewartet, dass die Platte komplett den Geist aufgibt um sie dann
auszuwechseln und den Raid1-Verbund wieder herzustellen, allerdings
làuft sie nun ja schon seit Monaten...

Also was meint ihr? Soll ich sie raussschmeissen? Wegschmeissen und neu
kaufen? Habt ihr Erfahrungen mit solchen Fehlern?
Ich habe smartd erst vor ein paar Wochen bei mir installiert und vorher
immer erst Platten gewechselt, wenn sie halt nciht mehr liefen ;-)

Gruß
André


[...]

Hallo Andre,

wenn deine Platte schon làngere Zeit den Fehler hat würde ich mir keine
großen Gedanken über einen Austausch machen. Ich habe 2 Platten die
jeweils 1 und 3 "uncorrectable sectors" haben und mit dem Fehler schon
über 2 Jahre ohne Probs laufen.

Auf der anderen Seite habe ich schon fehlerhafte Platten gehabt, wo die
errors schneller hochliefen wie ich Kaffee holen und trinken konnte und
dann auch recht schnell ganz ausgefallen sind.

Gruß Jörg


mailing list

Ähnliche fragen