Detecting and localizing large-scale router failures using active probes

Qiang Zheng, Guohong Cao, Tom La Porta, Ananthram Swami

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

Detecting the occurrence of large-scale router failures and localizing the failed routers are critical to enhancing network reliability. We propose a two-phase approach for detecting and localizing large-scale router failures using traceroute-like active probes. To detect large-scale router failures, the detection phase is periodically invoked to probe all routers. When large-scale router failures are detected, the localization phase is triggered to identify the failed routers. We reduce the probing cost by avoiding three types of useless probes. For the routers whose status cannot be identified by probes, we develop a distance-based method to estimate their failure probability. Experimental results based on ISP topologies show that the accuracy of our approach is higher than 96.5%, even when only 10% of routers are connected to end systems for probing. Compared with prior work, the proposed approach achieves much higher accuracy with lower probing cost.
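
The abstract does not give the algorithm's details, so the following is only a rough illustration of the general idea, not the authors' method. It simulates traceroute-like probes over known paths, classifies routers as up, down, or unknown, and then scores the unknown routers by hop distance to the nearest router observed to be down. The function names, the toy topology, and the scoring rule 1 / (1 + distance) are all assumptions made for this sketch.

```python
from collections import deque


def traceroute_probe(path, failed):
    """Simulate one traceroute-like probe along `path` (a list of router ids).

    Returns the routers that responded and the first silent router (or None).
    `failed` is the ground-truth failure set, used only by the simulation.
    """
    responded = []
    for router in path:
        if router in failed:
            return responded, router
        responded.append(router)
    return responded, None


def classify(paths, failed):
    """Probe every path and split routers into known-up, known-down, unknown."""
    up, down, seen = set(), set(), set()
    for path in paths:
        seen.update(path)
        responded, silent = traceroute_probe(path, failed)
        up.update(responded)
        if silent is not None:
            down.add(silent)
    return up, down, seen - up - down


def hop_distances(adjacency, sources):
    """Multi-source breadth-first search: hop count to the nearest source."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        node = queue.popleft()
        for neighbor in adjacency.get(node, ()):
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def estimate_failure_probability(adjacency, down, unknown):
    """Assumed scoring rule (not from the paper): closer to a failure, higher score."""
    dist = hop_distances(adjacency, down)
    return {r: 1.0 / (1 + dist[r]) if r in dist else 0.0 for r in unknown}


# Toy line topology a - b - c - d - e, probed from an end system attached to "a".
adjacency = {
    "a": ["b"],
    "b": ["a", "c"],
    "c": ["b", "d"],
    "d": ["c", "e"],
    "e": ["d"],
}
paths = [["a", "b", "c", "d", "e"]]
failed = {"c", "d"}  # hidden ground truth for the simulation

up, down, unknown = classify(paths, failed)
probs = estimate_failure_probability(adjacency, down, unknown)
```

In this toy run the probe stops at router c, so d and e cannot be observed directly; the assumed rule then ranks d (one hop from c) as more likely failed than e (two hops away).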

Original language: English (US)
Title of host publication: 2010 Military Communications Conference, MILCOM 2010
Pages: 1170-1175
Number of pages: 6
State: Published - 2011
Event: 2011 IEEE Military Communications Conference, MILCOM 2011 - Baltimore, MD, United States
Duration: Nov 7 2011 → Nov 10 2011

Publication series

Name: Proceedings - IEEE Military Communications Conference MILCOM

Other

Other: 2011 IEEE Military Communications Conference, MILCOM 2011
Country/Territory: United States
City: Baltimore, MD
Period: 11/7/11 → 11/10/11

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
