TY - GEN
T1 - Distributed probabilistic fault diagnosis for multiprocessor systems
AU - Berman, Piotr
AU - Pelc, Andrzej
PY - 1990
Y1 - 1990
N2 - A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.
AB - A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.
UR - http://www.scopus.com/inward/record.url?scp=0025665993&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0025665993&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0025665993
SN - 081862051X
T3 - Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)
SP - 340
EP - 346
BT - Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)
PB - Publ by IEEE
T2 - 20th International Symposium on Fault-Tolerant Computing - FTCS 20
Y2 - 26 June 1990 through 28 June 1990
ER -