Visible to the public Modeling, Monitoring and Scheduling Techniques for Network Recovery from Massive Failures

TitleModeling, Monitoring and Scheduling Techniques for Network Recovery from Massive Failures
Publication TypeConference Paper
Year of Publication2019
AuthorsTootaghaj, Diman Zad, La Porta, Thomas, He, Ting
Conference Name2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM)
Date Publishedapr
PublisherIEEE
ISBN Number978-3-903176-15-7
KeywordsCascading Failures, communication infrastructure, communication network, Communication networks, congested links, disruptive routing framework, failure locations, failure recovery, interdependent networks, Knowledge engineering, large-scale failures, maintenance engineering, Massive Disruption, massive failures, Monitoring, multiple interconnected networks, multistage recovery approach, natural disasters, network monitoring, network monitoring techniques, network recovery, Optimization, power grid, power grids, power system faults, pubcrawl, recovery schedule, rescue missions, resilience, Resiliency, soft failures, software defined networking, Software Defined Networks, software-defined networking, System recovery, telecommunication network reliability, telecommunication network routing, telecommunication scheduling, telecommunication security, Uncertainty
Abstract

Large-scale failures in communication networks due to natural disasters or malicious attacks can severely affect critical communications and threaten lives of people in the affected area. In the absence of a proper communication infrastructure, rescue operation becomes extremely difficult. Progressive and timely network recovery is, therefore, a key to minimizing losses and facilitating rescue missions. To this end, we focus on network recovery assuming partial and uncertain knowledge of the failure locations. We proposed a progressive multi-stage recovery approach that uses the incomplete knowledge of failure to find a feasible recovery schedule. Next, we focused on failure recovery of multiple interconnected networks. In particular, we focused on the interaction between a power grid and a communication network. Then, we focused on network monitoring techniques that can be used for diagnosing the performance of individual links for localizing soft failures (e.g. highly congested links) in a communication network. We studied the optimal selection of the monitoring paths to balance identifiability and probing cost. Finally, we addressed, a minimum disruptive routing framework in software defined networks. Extensive experimental and simulation results show that our proposed recovery approaches have a lower disruption cost compared to the state-of-the-art while we can configure our choice of trade-off between the identifiability, execution time, the repair/probing cost, congestion and the demand loss.

URLhttps://ieeexplore.ieee.org/document/8717793
Citation Keytootaghaj_modeling_2019