TY - GEN
T1 - SSD failures in datacenters
T2 - 9th ACM International Systems and Storage Conference, SYSTOR 2016
AU - Narayanan, Iyswarya
AU - Wang, Di
AU - Jeon, Myeongjae
AU - Sharma, Bikash
AU - Caulfield, Laura
AU - Sivasubramaniam, Anand
AU - Cutler, Ben
AU - Liu, Jie
AU - Khessib, Badriddine
AU - Vaid, Kushagra
N1 - Publisher Copyright: Copyright © 2016 ACM.
PY - 2016/6/6
Y1 - 2016/6/6
N2 - Despite the growing popularity of Solid State Disks (SSDs) in the datacenter, little is known about their reliability characteristics in the field. What little is known is mainly vendor supplied, and such information does not really help in understanding how SSD failures manifest and impact the operation of production systems, which is needed in order to take appropriate remedial measures. Besides actual failure data and the symptoms exhibited by SSDs before failing, a detailed characterization effort requires a wide set of data about the factors influencing SSD failures, from provisioning factors to operational ones. This paper presents an extensive SSD failure characterization by analyzing a wide spectrum of data from over half a million SSDs that span multiple generations and are spread across several datacenters, which host a wide spectrum of workloads, over nearly 3 years. By studying the effect of a diverse set of design, provisioning and operational factors on failures, and their symptoms, our work provides the first comprehensive analysis of the what, when and why characteristics of SSD failures in production datacenters.
AB - Despite the growing popularity of Solid State Disks (SSDs) in the datacenter, little is known about their reliability characteristics in the field. What little is known is mainly vendor supplied, and such information does not really help in understanding how SSD failures manifest and impact the operation of production systems, which is needed in order to take appropriate remedial measures. Besides actual failure data and the symptoms exhibited by SSDs before failing, a detailed characterization effort requires a wide set of data about the factors influencing SSD failures, from provisioning factors to operational ones. This paper presents an extensive SSD failure characterization by analyzing a wide spectrum of data from over half a million SSDs that span multiple generations and are spread across several datacenters, which host a wide spectrum of workloads, over nearly 3 years. By studying the effect of a diverse set of design, provisioning and operational factors on failures, and their symptoms, our work provides the first comprehensive analysis of the what, when and why characteristics of SSD failures in production datacenters.
UR - http://www.scopus.com/inward/record.url?scp=84978795494&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84978795494&partnerID=8YFLogxK
U2 - 10.1145/2928275.2928278
DO - 10.1145/2928275.2928278
M3 - Conference contribution
AN - SCOPUS:84978795494
T3 - SYSTOR 2016 - Proceedings of the 9th ACM International Systems and Storage Conference
BT - SYSTOR 2016 - Proceedings of the 9th ACM International Systems and Storage Conference
PB - Association for Computing Machinery, Inc
Y2 - 6 June 2016 through 8 June 2016
ER -