Fault tolerant arrays

We investigate the existence and computability of a fault cover for a configuration architecture; that is, a setting of switches that achieves an array despite the presence of faulty elements and broken interconnect. The switches may be stuck open or closed. For a preponderance of architectures thes...

Full description

Bibliographic Details
Main Author: LaForge, Laurence E. (Laurence Edward)
Format: Others
Language:en
Published: McGill University 1991
Subjects:
Online Access:http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=74630
id ndltd-LACETR-oai-collectionscanada.gc.ca-QMM.74630
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-QMM.746302014-02-13T03:55:22ZFault tolerant arraysLaForge, Laurence E. (Laurence Edward)Computer Science.We investigate the existence and computability of a fault cover for a configuration architecture; that is, a setting of switches that achieves an array despite the presence of faulty elements and broken interconnect. The switches may be stuck open or closed. For a preponderance of architectures these questions are NP-complete. This is not the case with local sparing, a fundamental approach that can, in fact, be applied to any nominal architecture. We give an algorithm that decides and computes a fault cover in time that is subcubic in the size of a locally spared array whose neighboring blocks of h elements each can be connected in any of h x h ways.We measure scaling in terms of the probability of a fault cover (coverage), the fraction of faults we can tolerate (tolerance), and the ratio of the size of the architecture to the size of the nominal array (redundancy). We establish a threshold tolerance to faults such that, for arbitrary coverage less than 1, a local spares fault cover exists. We prove that the stochastic tolerance of a two-dimensional array spared by local blocks is much better than when spares are arranged by rows and columns. As to the latter, we treat two architectures: homogeneous and dedicated spares, the worst-case tolerance of which is much better than that of local spares. The fault tolerance of each of these architectures is inferior to the general grid scheme of (Leighton and Leiserson 1985). However, the longest wire in an array configured from a general grid is almost never shorter than that of an array configured by local spares.Although it is not the most tolerant configuration architecture, the combined test and configuration tolerance of local spares is essentially the same as the configuration tolerance alone. Moreover, the stochastic test redundancy of local spares matches the lower bound of (Scheinerman 1987) and (Blough 1988).We show that convergent hypergeometric and binomial distributions of faults do not imply convergent coverage. We characterize homogeneous spares whose complementary external graph consists of a vertex-disjoint packing of star graphs; this solves exactly a subset of the problem of Zarankiewicz.McGill University1991Electronic Thesis or Dissertationapplication/pdfenalephsysno: 001237029proquestno: AAINN67686Theses scanned by UMI/ProQuest.All items in eScholarship@McGill are protected by copyright with all rights reserved unless otherwise indicated.Doctor of Philosophy (School of Computer Science.) http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=74630
collection NDLTD
language en
format Others
sources NDLTD
topic Computer Science.
spellingShingle Computer Science.
LaForge, Laurence E. (Laurence Edward)
Fault tolerant arrays
description We investigate the existence and computability of a fault cover for a configuration architecture; that is, a setting of switches that achieves an array despite the presence of faulty elements and broken interconnect. The switches may be stuck open or closed. For a preponderance of architectures these questions are NP-complete. This is not the case with local sparing, a fundamental approach that can, in fact, be applied to any nominal architecture. We give an algorithm that decides and computes a fault cover in time that is subcubic in the size of a locally spared array whose neighboring blocks of h elements each can be connected in any of h x h ways. === We measure scaling in terms of the probability of a fault cover (coverage), the fraction of faults we can tolerate (tolerance), and the ratio of the size of the architecture to the size of the nominal array (redundancy). We establish a threshold tolerance to faults such that, for arbitrary coverage less than 1, a local spares fault cover exists. We prove that the stochastic tolerance of a two-dimensional array spared by local blocks is much better than when spares are arranged by rows and columns. As to the latter, we treat two architectures: homogeneous and dedicated spares, the worst-case tolerance of which is much better than that of local spares. The fault tolerance of each of these architectures is inferior to the general grid scheme of (Leighton and Leiserson 1985). However, the longest wire in an array configured from a general grid is almost never shorter than that of an array configured by local spares. === Although it is not the most tolerant configuration architecture, the combined test and configuration tolerance of local spares is essentially the same as the configuration tolerance alone. Moreover, the stochastic test redundancy of local spares matches the lower bound of (Scheinerman 1987) and (Blough 1988). === We show that convergent hypergeometric and binomial distributions of faults do not imply convergent coverage. We characterize homogeneous spares whose complementary external graph consists of a vertex-disjoint packing of star graphs; this solves exactly a subset of the problem of Zarankiewicz.
author LaForge, Laurence E. (Laurence Edward)
author_facet LaForge, Laurence E. (Laurence Edward)
author_sort LaForge, Laurence E. (Laurence Edward)
title Fault tolerant arrays
title_short Fault tolerant arrays
title_full Fault tolerant arrays
title_fullStr Fault tolerant arrays
title_full_unstemmed Fault tolerant arrays
title_sort fault tolerant arrays
publisher McGill University
publishDate 1991
url http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=74630
work_keys_str_mv AT laforgelaurenceelaurenceedward faulttolerantarrays
_version_ 1716641646118961152