Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System

I describe in this report an experimental system for using classification and regression trees to generate predictions of disk failures in a NetApp-based storage system at the European Organisation for Nuclear Research (CERN) based on a mixture of SMART data, system logs, and low-level system perfor...

Full description

Bibliographic Details
Main Author: Stjerna, Albin
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för informationsteknologi 2017
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-337638
id ndltd-UPSALLA1-oai-DiVA.org-uu-337638
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-3376382018-01-18T05:38:54ZMedium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage SystemengStjerna, AlbinUppsala universitet, Institutionen för informationsteknologi2017Engineering and TechnologyTeknik och teknologierI describe in this report an experimental system for using classification and regression trees to generate predictions of disk failures in a NetApp-based storage system at the European Organisation for Nuclear Research (CERN) based on a mixture of SMART data, system logs, and low-level system performance dataparticular to NetApp's storage solutions. Additionally, I make an attempt at profiling the system's built-in failure prediction method, and compiling statistics on historical complete-disk failures as well as bad blocks developed. Finally, I experiment with various parameters for producing classification trees and end up with two candidate models which have a true-positive rate of 86% with a false-alarm rate of 4% or atrue-positive rate of 71% and a false-alarm rate of 0.9% respectively, illustrating that classification trees might be a viable method for predicting real-life disk failures in CERNs storage systems. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-337638IT ; 17081application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Engineering and Technology
Teknik och teknologier
spellingShingle Engineering and Technology
Teknik och teknologier
Stjerna, Albin
Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
description I describe in this report an experimental system for using classification and regression trees to generate predictions of disk failures in a NetApp-based storage system at the European Organisation for Nuclear Research (CERN) based on a mixture of SMART data, system logs, and low-level system performance dataparticular to NetApp's storage solutions. Additionally, I make an attempt at profiling the system's built-in failure prediction method, and compiling statistics on historical complete-disk failures as well as bad blocks developed. Finally, I experiment with various parameters for producing classification trees and end up with two candidate models which have a true-positive rate of 86% with a false-alarm rate of 4% or atrue-positive rate of 71% and a false-alarm rate of 0.9% respectively, illustrating that classification trees might be a viable method for predicting real-life disk failures in CERNs storage systems.
author Stjerna, Albin
author_facet Stjerna, Albin
author_sort Stjerna, Albin
title Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
title_short Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
title_full Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
title_fullStr Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
title_full_unstemmed Medium Data on Big Data Predicting Disk Failures in CERNs NetApp-based Data Storage System
title_sort medium data on big data predicting disk failures in cerns netapp-based data storage system
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2017
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-337638
work_keys_str_mv AT stjernaalbin mediumdataonbigdatapredictingdiskfailuresincernsnetappbaseddatastoragesystem
_version_ 1718611420336095232