Modeling, Monitoring and Scheduling Techniques for Network Recovery from Massive Failures
Title | Modeling, Monitoring and Scheduling Techniques for Network Recovery from Massive Failures |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Tootaghaj, Diman Zad, La Porta, Thomas, He, Ting |
Conference Name | 2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM) |
Date Published | apr |
Publisher | IEEE |
ISBN Number | 978-3-903176-15-7 |
Keywords | Cascading Failures, communication infrastructure, communication network, Communication networks, congested links, disruptive routing framework, failure locations, failure recovery, interdependent networks, Knowledge engineering, large-scale failures, maintenance engineering, Massive Disruption, massive failures, Monitoring, multiple interconnected networks, multistage recovery approach, natural disasters, network monitoring, network monitoring techniques, network recovery, Optimization, power grid, power grids, power system faults, pubcrawl, recovery schedule, rescue missions, resilience, Resiliency, soft failures, software defined networking, Software Defined Networks, software-defined networking, System recovery, telecommunication network reliability, telecommunication network routing, telecommunication scheduling, telecommunication security, Uncertainty |
Abstract | Large-scale failures in communication networks due to natural disasters or malicious attacks can severely affect critical communications and threaten lives of people in the affected area. In the absence of a proper communication infrastructure, rescue operation becomes extremely difficult. Progressive and timely network recovery is, therefore, a key to minimizing losses and facilitating rescue missions. To this end, we focus on network recovery assuming partial and uncertain knowledge of the failure locations. We proposed a progressive multi-stage recovery approach that uses the incomplete knowledge of failure to find a feasible recovery schedule. Next, we focused on failure recovery of multiple interconnected networks. In particular, we focused on the interaction between a power grid and a communication network. Then, we focused on network monitoring techniques that can be used for diagnosing the performance of individual links for localizing soft failures (e.g. highly congested links) in a communication network. We studied the optimal selection of the monitoring paths to balance identifiability and probing cost. Finally, we addressed, a minimum disruptive routing framework in software defined networks. Extensive experimental and simulation results show that our proposed recovery approaches have a lower disruption cost compared to the state-of-the-art while we can configure our choice of trade-off between the identifiability, execution time, the repair/probing cost, congestion and the demand loss. |
URL | https://ieeexplore.ieee.org/document/8717793 |
Citation Key | tootaghaj_modeling_2019 |
- soft failures
- optimization
- Power Grid
- power grids
- power system faults
- pubcrawl
- recovery schedule
- rescue missions
- resilience
- Resiliency
- network recovery
- software defined networking
- Software Defined Networks
- software-defined networking
- System recovery
- telecommunication network reliability
- telecommunication network routing
- telecommunication scheduling
- telecommunication security
- uncertainty
- large-scale failures
- communication infrastructure
- Communication Network
- Communication networks
- congested links
- disruptive routing framework
- failure locations
- failure recovery
- interdependent networks
- Knowledge engineering
- Cascading Failures
- maintenance engineering
- Massive Disruption
- massive failures
- Monitoring
- multiple interconnected networks
- multistage recovery approach
- natural disasters
- Network Monitoring
- network monitoring techniques