TY - GEN

T1 - Distributed probabilistic fault diagnosis for multiprocessor systems

AU - Berman, Piotr

AU - Pelc, Andrzej

PY - 1990

Y1 - 1990

N2 - A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.

AB - A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.

UR - http://www.scopus.com/inward/record.url?scp=0025665993&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025665993&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0025665993

SN - 081862051X

T3 - Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)

SP - 340

EP - 346

BT - Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)

PB - Publ by IEEE

T2 - 20th International Symposium on Fault-Tolerant Computing - FTCS 20

Y2 - 26 June 1990 through 28 June 1990

ER -