By Topic

Analyzing causes of failures in the Global Research Network using active measurements

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

7 Author(s)
Eugene S. Myakotnykh ; Centre for Quantifiable Quality of Service in Communication Systems (Q2S), Norwegian University of Science and Technology, Trondheim, Norway ; Bjarne E. Helvik ; Jon Kåre Hellan ; Olav Kvittem
more authors

With the objective to better understand how the global Internet should achieve an availability in the order of five nines, i.e. be available 0.99999 of the time, active measurements were performed between Norway and China through the Global Research Network. End-to-end downtime statistics was continuously collected during a 3-month period up to mid February 2010. In addition to periodically sending probe packets between the two measurement systems, traceroute was used every two minutes to identify an exact IP-level path between the end-points. Also, TTL (time-to-live) counter in the IP-header, which is reduced by one on every hop, was analyzed for each packet. Causes of the observed network failures based on the collected data were identified and insight is gained into processes preceding and following communication downtimes. We distinguish inter- and intradomain failures and, when possible, identify an exact link or an Autonomous System where a certain event has happened. The study shows that the end-to-end path availability is mainly affected by interdomain failures and long BGP convergence time as well as series of events not straight forwardly explained by the anticipated (re)routing behavior.

Published in:

Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), 2010 International Congress on

Date of Conference:

18-20 Oct. 2010