List Info

Thread: SMART attributes: dead hard disk ?




SMART attributes: dead hard disk ?
user name
2006-10-30 22:17:40
Hi.

Probably you should reduce the watched attributes.
You must tweak a bit, but yes, the man page is
overwhelming...

My smartd.conf looks as follows:

/dev/hdb -m sysadminfixe-post.de,testfixe-post.de 
-M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o
on -S on -H 
-s (S/../.././18|L/../01/./03)

/dev/twe0 -d 3ware,1 -m sysadminfixe-post.de,testfixe-post.de 
-M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o
on -S on -H 
-s (S/../.././19|L/../01/./05) -p -R 194

/dev/twe0 -d 3ware,2 -m sysadminfixe-post.de,testfixe-post.de 
-M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o
on -S on -H 
-s (S/../.././20|L/../01/./03) -p -R 194

and smartmon_mail is:

#! /bin/bash
my_hostname=`/usr/bin/hostname`

/usr/bin/echo -e "Subject:
$2nn$SMARTD_FULLMESSAGEnnFirst reported:
$SMARTD_TFIRST " | /usr/sbin
/sendmail -F "smartmon$my_hostname"
sysadminfixe-post.de

/usr/bin/echo -e "Subject:
$2nn$SMARTD_FULLMESSAGEnnFirst reported:
$SMARTD_TFIRST " | /usr/sbin
/sendmail -F "smartmon$my_hostname" testfixe-post.de


> Yeah, I think you're right Peter.
>
> I've been meaning to look up what the numbers mean in a
smartd log
> message. I generally see stuff like this:
>
>      Usage: Seek_Time_Performance (8) changed to 91
>      Usage: Temperature_Celsius (194) changed to 154,
163, 169
>
> but I certainly hope my drive isn't 169 degrees
Celsius!
>
> It's probably in the man page somewhere, but I've been
too lazy to look
> it up. I just hope that the smartd devs were nice
enough to make the
> message of an actual imminent failure quite alarming.
>
> b
>
> Peter Abrahamsen wrote:
>> I speak on no particular authority, but I think
these are normal
>> messages, and nothing to be too concerned about.
>>
>> P
>>
>> On 10/25/06, Nicolas MASSÉ <nicolas27.masselaposte.net> wrote:
>>> Hello,
>>>
>>> I have one RAID 1 array with two ATA hard
disks. I noticed a lots of
>>> messages
>>> from smartd in my logs.
>>>
>>> First, the CRC error count of hdc "goes
up" and then "down". What do
>>> you think
>>> of this ?
>>>
>>> SMART Usage Attribute: 199 UDMA_CRC_Error_Count
changed from 195 to 199
>>> SMART Usage Attribute: 199 UDMA_CRC_Error_Count
changed from 199 to 198
>>>
>>> Then, I have some messages regarding my first
disk (hda) :
>>>
>>> SMART Prefailure Attribute: 8
Seek_Time_Performance changed from 242
>>> to 245
>>>
>>> This message is repeated about ten times a day.
>>>
>>> Do you think my disks are going to die in the
next days or is smartd
>>> confused ?
>>>
>>> Regards,
>>>
>>> --
>>> Nicolas MASSÉ
>>> Pour récupérer ma clef GPG:
>>> gpg --keyserver wwwkeys.eu.pgp.net --recv-keys
0x2A18C433
>>> Key fingerprint: 6621 FC23 5DC7 54BA B952 316A
50B1 BC3F 2A18 C433
>>>
>>>
>>>
>>
> --
> gentoo-servergentoo.org mailing list
>
>
>


-- 
gentoo-servergentoo.org mailing list

[1]

about | contact  Other archives ( Real Estate discussion Medical topics )