Bug 658663 - Base:System/smartmontools: Bug: Freeze with Intel X18-M/X25-M/X25-V G2 SSDs
Status: RESOLVED INVALID
Alias: None
Product: openSUSE 11.3
Classification: openSUSE
Component: Other
Version: Final
Hardware: x86-64 openSUSE 11.3
Importance: P5 - None : Minor with 1 vote
Target Milestone: ---
Assignee: Stanislav Brabec
QA Contact: E-mail List
Reported: 2010-12-09 23:16 UTC by Martin Schröder
Modified: 2011-02-22 16:59 UTC

Description Martin Schröder 2010-12-09 23:16:24 UTC
I just had an INTEL SSDSA2M080G2GN freeze up after running
"smartctl -a /dev/sda":
-----------------
Dec  9 23:55:57 dream sudo:       ms : TTY=pts/11 ; PWD=/etc ; USER=root ; COMMAND=/usr/sbin/smartctl -a /dev/sda
Dec  9 23:56:18 dream kernel: [18038.930588] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Dec  9 23:56:18 dream kernel: [18038.930597] ata1.00: failed command: SMART
Dec  9 23:56:18 dream kernel: [18038.930606] ata1.00: cmd b0/d5:01:06:4f:c2/00:00:00:00:00/00 tag 0 pio 512 in
Dec  9 23:56:18 dream kernel: [18038.930608]          res 40/00:04:80:cd:b3/00:00:05:00:00/40 Emask 0x4 (timeout)
Dec  9 23:56:18 dream kernel: [18038.930613] ata1.00: status: { DRDY }
Dec  9 23:56:18 dream kernel: [18038.930619] ata1: hard resetting link
Dec  9 23:56:18 dream kernel: [18039.234710] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Dec  9 23:56:18 dream kernel: [18039.235427] ata1.00: configured for UDMA/133
Dec  9 23:56:18 dream kernel: [18039.235446] ata1: EH complete
Dec  9 23:56:38 dream kernel: [18059.892334] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Dec  9 23:56:38 dream kernel: [18059.892341] ata1.00: failed command: SMART
Dec  9 23:56:38 dream kernel: [18059.892350] ata1.00: cmd b0/d5:01:09:4f:c2/00:00:00:00:00/00 tag 0 pio 512 in
Dec  9 23:56:38 dream kernel: [18059.892352]          res 40/00:04:80:cd:b3/00:00:05:00:00/40 Emask 0x4 (timeout)
Dec  9 23:56:38 dream kernel: [18059.892357] ata1.00: status: { DRDY }
Dec  9 23:56:38 dream kernel: [18059.892363] ata1: hard resetting link
Dec  9 23:56:39 dream kernel: [18060.196645] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Dec  9 23:56:39 dream kernel: [18060.197318] ata1.00: configured for UDMA/133
Dec  9 23:56:39 dream kernel: [18060.197339] ata1: EH complete
-----------------
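The two failed commands in the log above can be decoded by hand. A minimal sketch, assuming the usual libata layout of the "cmd" line (command/feature:nsect:lbal:lbam:lbah/...) and the ATA SMART conventions (command 0xB0, feature 0xD5 for SMART READ LOG, log address in the LBA-low byte, signature 0x4F/0xC2 in LBA mid/high). The function name and the log-name table are illustrative only and are not part of smartmontools or the kernel:

```python
import re

# libata prints the taskfile as
#   cmd COMMAND/FEATURE:NSECT:LBAL:LBAM:LBAH/<five HOB bytes>/DEVICE
CMD_RE = re.compile(r"cmd ([0-9a-f]{2})/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})/")

SMART_COMMAND = 0xB0    # ATA SMART command opcode
SMART_READ_LOG = 0xD5   # SMART subcommand (feature register): READ LOG

# Log addresses relevant to this report (illustrative subset, not a full table).
LOG_NAMES = {
    0x01: "summary SMART error log",
    0x06: "SMART self-test log",
    0x09: "selective self-test log",
}


def decode_smart_cmd(line):
    """Return a description of a SMART taskfile in a kernel log line, else None."""
    m = CMD_RE.search(line)
    if m is None:
        return None
    command = int(m.group(1), 16)
    feature, nsect, lbal, lbam, lbah = (int(b, 16) for b in m.group(2).split(":"))
    # SMART commands carry the signature 0x4F/0xC2 in LBA mid/high.
    if command != SMART_COMMAND or (lbam, lbah) != (0x4F, 0xC2):
        return None
    if feature != SMART_READ_LOG:
        return f"SMART feature 0x{feature:02x}"
    return "SMART READ LOG: " + LOG_NAMES.get(lbal, f"log 0x{lbal:02x}")


if __name__ == "__main__":
    for taskfile in ("b0/d5:01:06:4f:c2", "b0/d5:01:09:4f:c2"):
        print(decode_smart_cmd(f"ata1.00: cmd {taskfile}/00:00:00:00:00/00 tag 0 pio 512 in"))
```

Read this way, the first timeout is a read of the self-test log and the second a read of the selective self-test log, which would match the two "Read failed" errors at the end of the smartctl output.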
> sudo /usr/sbin/smartctl -a /dev/sda
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Intel X18-M/X25-M/X25-V G2 SSDs
Device Model:     INTEL SSDSA2M080G2GN
Serial Number:    CVPO017402JP080JGN
Firmware Version: 2CV102HD
User Capacity:    80.026.361.856 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 1
Local Time is:    Thu Dec  9 23:55:57 2010 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                 (   1) seconds.
Offline data collection
capabilities:                    (0x75) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Abort Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (   1) minutes.
Conveyance self-test routine
recommended polling time:        (   1) minutes.

SMART Attributes Data Structure revision number: 5
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  3 Spin_Up_Time            0x0020   100   100   000    Old_age   Offline      -       0
  4 Start_Stop_Count        0x0030   100   100   000    Old_age   Offline      -       0
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       163
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       67
192 Unsafe_Shutdown_Count   0x0032   100   100   000    Old_age   Always       -       6
225 Host_Writes_32MiB       0x0030   200   200   000    Old_age   Offline      -       5330
226 Workld_Media_Wear_Indic 0x0032   100   100   000    Old_age   Always       -       7397
227 Workld_Host_Reads_Perc  0x0032   100   100   000    Old_age   Always       -       0
228 Workload_Minutes        0x0032   100   100   000    Old_age   Always       -       3651502743
232 Available_Reservd_Space 0x0033   100   100   010    Pre-fail  Always       -       0
233 Media_Wearout_Indicator 0x0032   099   099   000    Old_age   Always       -       0
184 End-to-End_Error        0x0033   100   100   099    Pre-fail  Always       -       0

SMART Error Log Version: 1
No Errors Logged

Error SMART Error Self-Test Log Read failed: Input/output error
Smartctl: SMART Self Test Log Read Failed
Error SMART Read Selective Self-Test Log failed: Input/output error
Smartctl: SMART Selective Self Test Log Read Failed
-----------------
> rpm -qi smartmontools
Name        : smartmontools                Relocations: (not relocatable)
Version     : 5.40                              Vendor: obs://build.opensuse.org/Base
Release     : 20.1                          Build Date: Fri 22 Oct 2010 20:45:52 CEST
Install Date: Sun 28 Nov 2010 19:16:13 CET     Build Host: build37
Group       : Hardware/Other                Source RPM: smartmontools-5.40-20.1.src.rpm
Size        : 1299964                          License: GPLv2+
Signature   : DSA/SHA1, Fri 22 Oct 2010 20:46:10 CEST, Key ID cc6060f35d053b4f
URL         : http://smartmontools.sourceforge.net/
Summary     : Monitor for S.M.A.R.T. Disks and Devices
Description :
SMARTmontools controls and monitors storage devices using the
Self-Monitoring, Analysis, and Reporting Technology System (S.M.A.R.T.)
built into ATA, SATA and SCSI Hard Drives. This is used to check the
hard drive reliability and to predict drive failures. The suite
contains two utilities. The first, smartctl, is a command line utility
designed to perform simple S.M.A.R.T. tasks. The second, smartd, is a
daemon that periodically monitors the smart status and reports errors
to syslog. The package is compatible with the ATA/ATAPI-3 to -7
specification. The package is intended to incorporate as much "vendor
specific" and "reserved" information as possible about disk drives. The
commands man smartctl and man smartd will provide more information.



Authors:
--------
    Bruce Allen <smartmontools-support@lists.sourceforge.net>
Distribution: Base:System / openSUSE_11.3
-----------------
This seems to be a common problem with Intel SSDs:
https://bugs.launchpad.net/ubuntu/+source/smartmontools/+bug/597518
http://sourceforge.net/apps/trac/smartmontools/browser/tags/RELEASE_5_40/smartmontools/NEWS
http://forums.gentoo.org/viewtopic-p-6496792.html?sid=868568d3e5153dabe659c0ac9bafe848
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560701

Should I report this upstream?
Comment 1 Martin Schröder 2010-12-20 20:03:12 UTC
Ping...
Comment 2 Martin Schröder 2010-12-28 20:46:19 UTC
Since Base:System / openSUSE_11.3 seems to be gone, I downgraded to the version in 11.3 (5.39.1-3.1); I can reproduce the problem there. I've also reported the bug upstream: https://sourceforge.net/apps/trac/smartmontools/ticket/137
Comment 3 Martin Schröder 2011-01-05 19:53:32 UTC
Since I started using 5.39.1-3.1 with the latest kernel, the problem seems to be gone; I'll keep watching for it.
Comment 4 Cristian Rodríguez 2011-02-22 16:59:26 UTC
As I commented on the openSUSE Factory mailing list, this is a bug in the SSD firmware; it has been fixed by Intel:


"8 January 2011

Firmware version : 2CV102M3

This firmware revision fixes enumeration and slow-boot issues on SATA 6Gb/s
controllers, adds improvements to S.M.A.R.T. attributes for more accurate
reporting of drive health, improves NCQ capability, and fixes possible
drive hangs when reading S.M.A.R.T. self-test log."
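
Until the firmware is updated, one possible workaround is to avoid the self-test log reads. A minimal sketch, assuming that skipping those reads is enough to avoid the hang (the release note suggests this, but it is not confirmed in this report). It relies on the documented expansion of "smartctl -a" for ATA devices into "-H -i -c -A -l error -l selftest -l selective". The helper names are hypothetical, and only the exact firmware version from this report is treated as affected; other versions may behave the same way.

```python
import re

AFFECTED_FIRMWARE = "2CV102HD"  # version reported in this bug
FIXED_FIRMWARE = "2CV102M3"     # version named in Intel's release note


def firmware_version(info_text):
    """Extract the 'Firmware Version' field from 'smartctl -i' output."""
    m = re.search(r"^Firmware Version:\s*(\S+)", info_text, re.MULTILINE)
    return m.group(1) if m else None


def smartctl_args(info_text, device="/dev/sda"):
    """Build a smartctl command line that skips the self-test logs on affected firmware."""
    if firmware_version(info_text) == AFFECTED_FIRMWARE:
        # Same as -a for ATA devices, minus "-l selftest -l selective".
        return ["smartctl", "-H", "-i", "-c", "-A", "-l", "error", device]
    return ["smartctl", "-a", device]


if __name__ == "__main__":
    sample = "Device Model:     INTEL SSDSA2M080G2GN\nFirmware Version: 2CV102HD\n"
    print(" ".join(smartctl_args(sample)))
```

The returned list can be passed to subprocess.run() as is. Once the drive reports the fixed firmware version, the plain "-a" form is returned again.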