Applied Mathematics
Vol.07 No.03(2016), Article ID:64026,17 pages
10.4236/am.2016.73022
Non-Stationary Random Process for Large-Scale Failure and Recovery of Power Distribution
Yun Wei1, Chuanyi Ji1, Floyd Galvan2, Stephen Couvillon2, George Orellana2, James Momoh3
1Georgia Institute of Technology, Atlanta, GA, USA
2Entergy Services, Inc., New Orleans, LA, USA
3Howard University, Washington DC, USA

Copyright © 2016 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/


Received 11 May 2015; accepted 26 February 2016; published 29 February 2016
ABSTRACT
This work applies non-stationary random processes to resilience of power distribution under severe weather. Power distribution, the edge of the energy infrastructure, is susceptible to external hazards from severe weather. Large-scale power failures often occur, resulting in millions of people without electricity for days. However, the problem of large-scale power failure, recovery and resilience has not been formulated rigorously nor studied systematically. This work studies the resilience of power distribution from three aspects. First, we derive non-stationary random processes to model large-scale failures and recoveries. Transient Little’s Law then provides a simple approximation of the entire life cycle of failure and recovery through a
queue at the network-level. Second, we define time-varying resilience based on the non-stationary model. The resilience metric characterizes the ability of power distribution to remain operational and recover rapidly upon failures. Third, we apply the non-stationary model and the resilience metric to large-scale power failures caused by Hurricane Ike. We use the real data from the electric grid to learn time-varying model parameters and the resilience metric. Our results show non-statio- nary evolution of failure rates and recovery times, and how the network resilience deviates from that of normal operation during the hurricane.
Keywords:
Resilience, Non-Stationary Random Process, Power Distribution, Dynamic Queue, Transient Little’s Law, Real Data

1. Introduction
The power grid is a vast interconnected network that delivers electricity to customers. Power distribution system lies at the edge of the power grid [1] . Power distribution provides medium and low voltages to residences and organizations. Distribution networks consist of a large numbers and diverse types of components, such as substations, power lines, poles, feeders, and transformers. Many such components are distributed in the open and exposed to external hazards such as hurricanes, derechoes, and ice storms [2] . About 90% of total failures occurred at power distribution [3] . Thus power distribution is particularly susceptible to external disruptions from severe weather [3] .
A fundamental research issue pertaining to this real problem is the resilience of power distribution to large- scale external disruptions. Here, resilience corresponds to the ability of power distribution to reduce failures and recover rapidly when failed [4] . Until now, tremendous efforts have been directed to resilience of the core that consists of the major power generation and transmission of high voltages [5] - [7] . For example, cascading failures at transmission networks have been studied widely [6] [8] - [11] . However, the majority of the failures occurred during severe storms are often at power distribution rather than transmission networks [12] . As the demand for energy grows, the edge of the grid becomes more and more important. For example, a utility (dis- tribution system operator) can serve millions of customers in America. Damages from such hazards as severe storms to power distribution can profoundly impact a large number of users. Hence, the resilience of power distribution requires significant study. Resilience of power distribution under severe weather poses unique challenges:
・ Randomness and dynamics (i.e., non-stationarity) of failures and recoveries,
・ Time-varying resilience at the network level,
・ Estimation of non-stationarity and the resilience using real data from the electric grid.
A pertinent first step is to model large-scale failures and recoveries. Such a model is a prerequisite for deriving a resilience metric at the network level. The metric needs to reflect the intrinsic characteristics of large- scale failures and recoveries. Severe weather disruptions such as hurricanes evolve randomly and dynamically. So do large-scale failure and recovery at power distribution. For example, failures occur and recover depending on random factors such as the intensity of a storm and dynamic allocations of repair crews. These factors vary with time. Hence, it is appropriate to model based on non-stationary random processes.
Prior approaches account for randomness of failures but rarely dynamics [6] [9] [13] - [15] . Models for failures are widely studied in computer-communication, e.g., finite state Markov process [16] and reliability of other multi-component systems [17] [18] . These models are stochastic but assume stationary properties or distributions. Stochastic models and random processes are also studied for other aspects of power distribution systems (e.g., forecasting and modeling renewable resources and power generations [19] - [21] , stability of power flows [22] , demand response [23] , maintenance process [24] , and degradation process [25] ). We develop an approach to learn from data for non-stationary stochastic processes on weather-induced failures and recoveries [26] .
Another challenge is how to quantify the resilience of power distribution. Resilience in this work measures the performance of power distribution during severe weather. In principle, such a resilience metric should manifest the difference between the performance in severe weather and normal operations [4] . Various reliability metrics are developed and widely used, including the IEEE standard indices for power systems [27] . The reliability metrics are for daily operations where disruptive events (e.g., severe weather) are excluded [27] . It is thus infeasible for the reliability measures to be used to study resilience that focuses on power failures and recoveries induced by severe weather events. During a severe weather event such as a hurricane, large-scale failures and recoveries can occur, which is significantly different from that in daily operations. Resilience metrics are thus open and much needed for both industry and communities as advocated by the recent reports in the nation [3] [28] . Several resilience measures are developed by the prior works, including a static metric of fragility [29] and dynamic metrics of functionality or quality [4] . How resilience evolves with time is modeled by brute force [4] [30] or averaging over time [31] [32] . In principle, resilience as a performance measure at the system-level is lacking. Such a resilience metric needs to be derived from a system model of failures and recoveries.
A third challenge is that unknown parameters of non-stationary models and a resilience metric need to be estimated from real data [33] . Prior works studied the power failures using historical data from previous storms [13] [34] - [36] . As the models were static, the learned parameters provided the static characteristics of power failures. Dynamic characteristics of power failures have been studied little from real data. Recovery is rarely studied using real data. A challenge is to learn from one external disruption such as a hurricane, as real data is rare and often unavailable from many large-scale weather disruptions.
The contribution of this work is to address the above three challenges, which are:
・ To develop a model based on non-stationary random processes,
・ To derive a dynamic resilience metric based on the model,
・ To learn time-varying model parameters and resilience metric using large-scale real data.
We first formulate, from bottom up, an entire life cycle of large-scale failure and recovery. The problem formulation begins at the finest level of network nodes based on temporal-spatial stochastic processes. Since each external disruption results in one snapshot of nodal states (failed and normal), information from one weather event is insufficient for completely specifying a temporal-spatial model [37] . Thus we focus on temporal models by aggregating spatial variables. Such an aggregation enables this work to focus on the non-stationary nature of failure and recovery at a moderate time scale, e.g., minutes and beyond. Such a time scale concurs with that of an evolution of a severe storm [38] . Failure and self-recovery that occur in seconds or shorter within power distribution systems are studied in other contexts [39] .
A resulting temporal model can be approximated by a
queue [40] . The arrival process to the queue characterizes failures with a general time-varying distribution
. The service time of the queue corresponds to the delay for failures to recover, and has a general time-varying distribution
.
means that it is possible for failures to recover immediately. Note that such a queue is easily extensible to multiple queues at different geo-locations [26] without loss of generality. Such a queuing model is an approximation of recovery in practice as first-come-first-service policy is assumed. When recovery is conducted with certain optimality, intuitively such a model services provides a “lower bound” for the performance of restoration.
We study an analytically tractable case of
queue through the Transient Little’s Law [40] , which characterizes an entire non-stationary life-cycle of large-scale failures. The importance of Transient Little’s Law is that two simple quantities, failure rate and probability distribution of failure duration, completely quantify the dynamic model to the first moments. This simplifies definition of dynamic resilience and estimation of models parameters from data. We define a dynamic resilience metric that includes not only resistance to failures but also fast recovery as one additional attribute. Such a dynamic metric shows the time-evolution of network resilience during an external disruption. Finally, the non-stationary model and the resilience metric are applied to a real life example of large scale failures of power distribution. The failures occurred during Hurricane Ike in 2008. Real data from power distribution is used to study failure and recovery processes as well as resilience.
The rest of the paper is organized as follows. Section 2 provides background knowledge and an example of large-scale failures at power distribution. Section 3 develops a problem formulation from nodes (components) to
queue at the network level. Transient Little’s Law is applied to obtain the non-stationary failure and recovery rates. Section 4 defines resilience based on the non-stationary model. Section 5 estimates pertinent model parameters and the resilience metric using large-scale real data. Section 6 discusses our findings and concludes the paper.
2. Background and Example
To provide intuition for modeling on failures and recoveries induced by severe weather, we begin with two examples.
2.1. Synthetic Example
The first example illustrates how failures can be induced by severe weather. The example is on a small section of a power distribution system drawn from [13] and shown in Figure 1(a). The section consists of two sources
and
, seven components
and five loads
. A component can be a transformer, a feeder, a pole, or a circuit. Links correspond to power lines. Assume either a source or a component or a link can fail during a hurricane. Assume primary source
is used in normal operations; and back-up source
is used if
fails. The following scenarios can occur:
a) If any of the components
fails independently due to an external disruption, and if both the primary and the back-up sources can be in operation, there is no electricity supply to the loads that are connected to the failed components. Thus, the components and the loads experience independent failures. Such independent failures can be caused directly by external factors at a time scale of a minute or beyond.

Figure 1. (a) A section in a distribution system. (b) Empirical distribution of failure time and duration. Failures and their durations are plotted at the time scale of hours for ease of illustration. 

b) If 

c) Recovery depends on the types of failures, restoration schemes, as well as the terrene conditions. For example, if either source 

In summary, failures and self-recoveries in a small time-scale of seconds depend on detailed topology and self-recovery schemes. Failure and recovery at a larger time scale of minutes and beyond are often due to external disruptions that evolve dynamically and randomly.
2.2. Real Data
The second example illustrates non-stationary failures and recoveries. The example uses real data on an operational power distribution system during Hurricane Ike. Hurricane Ike occurred in 2008 and affected more than 2 million customers at densely populated areas in Texas and Louisiana. Figure 1(b) shows a histogram on failure occurrence time and duration at the distribution network before, during and after the hurricane (see Section 5 for details on the data). The histogram demonstrates the non-stationarity of the power failures and recoveries during the hurricane:
a) Failure occurrence was time-varying and random. More failures occurred during the hurricane than those that occurred before and after.
b) Recovery time was also time-varying and random. Recovery time was different for failures occurred at different time. For example, more failures occurring during the hurricane recovered slowly than those that occurred before and after.
As the result, the probability distributions of failure-occurrence and failure duration vary with time in minutes and hours. Note that information on root causes of failures and recoveries is unavailable, which is beyond the scope of this work.
3. Stochastic Model
We now formulate time evolution of large-scale failure and recovery as a non-stationary random process. We begin with detailed information on nodal states (failure and normal). We then aggregate the spatial variables of nodes to obtain the temporal evolution of failure and recovery of an entire network.
3.1. Failure and Recovery Probability
A spatial-temporal random process provides theoretical basis for modeling large-scale failures at the finest scale of nodes (component). The shorthand notation i is used to specify both the index of a node and its corresponding geo-location, where 
Let 





Failures caused by external disruptions exhibit randomness. Whether and when a node fails is random. Whether and when a failed node recovers is also random. Given time





Equation (1) models an individual node in a network. The model includes Markov temporal dependence for nodal states which is a simple assumption for state transitions. Such a model can be applied to a heterogeneous grid where nodes experience different failure and recovery processes in general. There are no assumptions on an underlying network topology nor independence/dependence of failures. Such n equations for n nodes together form a spatial-temporal model for a network.
Each severe weather event generates one snapshot of network states. Information available on failures and recoveries is often from one or a few events. Such information is insufficient for specifying the spatial-temporal model. Hence, we derive a temporal model in this work by considering an entire network as a whole.
3.2. Temporal Process
Our temporal model aggregates spatial variables from Equation (1),

The probability can be further related to an indicator function, e.g.,
Definition 1. Let 





Let 


An increment 
Definition 2. Failure process 
Now we assume that failure 


Definition 3. Recovery process 
Assume 


Similarly, assume that, at most one recovery occurs during 



Here recovery time (or failure duration) is also assumed at the time scale of minutes. Equation (2) can be rewritten as





Hence, the expected number of nodes in the failure state equals the difference between the expected failures and the expected recoveries. The time-scale of a minute enables this work to focus on modeling failures that are induced by external disruptions and the recoveries that can not be accomplished by instant self-healing schemes. The aggregation conceals spatial variables [26] , network topology [39] and automated reconfiguration that are not discussed in this work.
3.3. Dynamic Queuing Model and Transient Little’s Law
Failure 


1) The arrival process to the queue corresponds to the number of failures 

2) A failure that occurs in 


3) The departure process of the queue corresponds to the number of recoveries

A 
Figure 2. Application of 
Transient Little’s Law provides an analytically tractable case of 
Theorem 1. Transient Little’s Law [40]
Consider 




where 

Consider an increment of arrivals as new failures, an arrival rate as a failure rate, a delay as a failure duration, and departures as recoveries. Assume that recoveries occur following first-in-first-out (FIFO) policy. Transient Little’s Law can then be directly applied to our problem. The theorem has an intuitive explanation: 





Define recovery rate 

Applying Transient Little’s Law, the recovery rate can be related to a failure rate and a recovery time distribution by the corollary below.
Corollary 1. Let 







The proof of the corollary is in Appendix 1. In summary, two pertinent quantities completely determine the expected number of failures and recoveries: Failure rate 

4. Resilience
We now derive a resilience metric using the pertinent parameters for an entire life cycle of non-stationary failure and recovery. While resilience can be characterized from multiple dimensions [4] , the infrastructure of power distribution is where failures occur. Hence we quantify the so-called (system) resilience by characterizing failures and recoveries of all nodes in a distribution network.
4.1. Infant and Aging Recovery
For an non-stationary recovery process, a failure duration depends on when the failure occurs (Figure 1). Given threshold



When 
Definition 4. Infant and aging recovery
Let 



Note here 
4.2. Dynamic Resilience Metric
As failure and recovery processes are dynamic, a resilience metric should be dynamic also. Furthermore, how resilience varies with time should result from the dynamic model of failure-recovery processes. Following such a principle, we define resilience from bottom-up, starting with one node. Probability 

Definition 5. Resilience of a node
Given threshold value 


Aggregating the resilience of nodes over an entire network, (system) resilience 

Definition 6. Resilience of a network
Given threshold value 


Hence, aggregating over spatial variables, network topology and automated reconfiguration, the resilience of a network is an average resilience of all network nodes:


1) Resilience is a property of a distribution network as a whole to survive large-scale external disruptions.
2) Resilience is a function of time that reflects temporal evolutions of failures and recoveries in a network.
3) Resilience shows the ability of a distribution network to resist failures and recover rapidly.
4) Resilience depends on threshold 


4.3. Resilience Parameters
The resilience metric can be characterized by the parameters of the model, i.e., non-stationary random processes in Section 1. In particular, the resilience metric (Equation (13)) can be represented through a simple expressions owing to Transient Little’s Law,

The second term corresponds to the aging recoveries at time t. Let 


The above expression shows that given threshold


4.4. Resilience of Non-Homogeneous Poisson Processes
A special case of resilience is when the failure process is a Non-Homogeneous Poisson Process (NHPP). As a commonly-used failure process [47] , a non-homogeneous Poisson Process 


When a failure process is an NHPP, a 





4.5. Threshold
Threshold 















where









where

In general, a failure-recovery process can be regarded as a combination of these two special cases. At time







5. Numerate Results of Real Data
We apply the non-stationary failure-recovery processes to a real-life example of large-scale failures caused by a hurricane. Our focus is on estimating the three pertinent quantities


5.1. Real Data and Processing
Hurricane Ike was one of the strongest hurricanes that occurred in 2008. Ike caused large scale power failures, resulted in more than two million customers without electricity, and was considered by many as the second costliest Atlantic hurricane of all time [49] [50] .
Reported by the National Hurricane Center [38] , the storm started to cause power outages across the onshore areas in Louisiana and Texas on September 12, 2008 prior to the landfall. Ike then made the landfall at Galveston, Texas at 2:10 a.m. Central Daylight Time (CDT), September 13, 2008, causing strong winds, flooding, and heavy rains across Texas. The hurricane weakened to a tropical storm at 1:00 p.m. September 13 and passed Texas by 2:00 a.m. September 14.
Widespread power failures were reported across Louisiana and Texas starting September 12 [50] . A major utility collected data on power failures from more than ten counties. The outages included various component failures in the distribution network such as failed circuits, fallen trees/poles, and non-operational substations. The raw data set consists of 5152 samples. Each sample consists of the failure occurrence time (

Among the 2005 samples, there are groups of failures that occurred within a minute. Failures within a group are considered as dependent and aggregated as one failed entity. Each group has a unique failure occurrence time 


The 463 samples are then randomly partitioned into a training set of 333 samples and a test set of 130 samples. The training set is used to learn parameters. The test set is used for validating the model and the parameters.
5.2. Empirical Failure Process
We now use the data set to study the empirical processes





5.2.1. Estimating Failure Rate
First, we use the training set to determine failure rate
simple moving average [51] :

estimate the failure rate and use the testing set to validate the estimation. Figure 3(a) shows the estimated failure rate and the estimation error with the 95% confidence interval. The failure rate increased and decreased in accordance with the evolution of the hurricanes. Before 7 p.m. September 12, when the hurricane was yet to arrive, the rate was less than 5 new failures per hour. Then the failure rate increased rapidly and reached the maximum value of 50 new failures per hour. The peak time of the failure rate coincided with the time of the landfall at 2:10 CDT September 13 [38] . After that, the failure rate reduced to a small value of less than 5 new failures per hour. As the failure rate was time varying, the failure process was non-stationary.
5.2.2. Non-Homogeneous Poisson Model
We now consider hypothesis 

Figure 3. Estimated failure rate 

Poisson process. If 
We perform Pearson’s test [52] on hypothesis 










However, not rejecting 

5.3. Empirical Recovery Process
Next we study empirical recovery-time distribution 
5.3.1. Data
The 463 samples in our data set consist of durations of the failures that occurred from 7 a.m. September 12 to 4 p.m. September 14. Figure 1(b) shows the joint empirical distribution 
5.3.2. Mixture Model
As the failure durations varied with the hurricane (Figure 1(b)), we choose a mixture model for the probability density function 


where 




We select a Weibull distribution as a mixture function because the parameters exhibit clear physical meaning [26] . Mathmatically the Weibull mixtures can be expressed as

where







5.3.3 Parameter Estimation
The parameters of the Weibull mixtures are estimated from the training samples. For simplicity, we divide the failure time into 5 intervals shown in Figure 1. Within an interval 







5.4. Resilience
We now study time evolution of resilience. First, we obtain an optimal threshold 




The network resilience is then obtained through Equation (15) using the failure rate 


Figure 4. Non-stationary (empirical) distribution of failure durations with respect to the failures occurred in the five intervals 


Figure 5. (a) Threshold
・ Prior to the hurricane, no failures occurred yet, and the resilience was close to 1.
・ A large number of failures then occurred and reduced the resilience to a lower level. How fast the resilience decreased was measured by
・ At 3 am September 14th, about 42.7 hours after the first observed failure (24.8 hours after the landfall, and 26.25 hours after the failure rate reached the maximum value) the resilience reached the minimum value. There, 46% (214 out of 463) of total failures were in aging recovery. The maximum reduction of resilience from that of
the normal operation was
・ After the minimum, the resilience increased when more failures were restored. The impact from the hurricane was fading gradually. It took about 10.7 days for the resilience to return to that of the normal operation from the minimum value.
The dynamic resilience metric 
6. Conclusions
We have derived a non-stationary random process to model large-scale failure and recovery of a power distribution network under external disruptions. The resulting model is a dynamic 
We had used real data from an operational network that was impacted by Hurricane Ike. The failure rate and non-stationary probability distribution of failure durations as well as resilience metric are estimated from the real data. The failure process has been shown to be an non-homogenous Poisson process at the time scale of minutes. The recovery-time distribution has been modeled as Weibull mixtures with time-varying parameters. A threshold value is obtained as 15.5 hours for this network, where 50.8% of the failures recovered rapidly. The network resilience reached its minimum value 24.8 hours after the landfall when the aging recoveries were 46% of all failures. The network experienced the most difficult time when the failure rate reached the peak value and the aging recovery dominated until the resilience decreased to the minimum. It then took about 10 days for the network to regain 100% resilience from the minimum value. These observations suggest that enhanced recovery, especially during the most difficult duration, can perhaps reduce the worst impact to the network and improve the overall resilience and the recovery time.
There are several directions for extensions of this work. The first is to utilize spatial and network variables in the non-stationary model. Temporal resilience can then be extended to measure spatiotemporal characteristics. Different time scales may need to be considered to account for the impacts from a system structure. Such extensions are natural as our model is derived from bottom-up starting with nodes at certain geo- and system- locations. Our preliminary work shows a step towards such an extension [39] . The second is to characterize the impacts of failures and recoveries to customers. This may involve more complex models beyond aggregated non-stationary random processes and Transient Little’s Law.
Acknowledgements
The authors would like to thank Chris Kung, Jae Won Choi, Daniel Burnham and Xinyu Dai for data processing, Kurt Belgum for helpful comments on the manuscript, Anthony Kuh, Vince Poor, and Nikil Jayant for helpful discussions. The support from the National Science Foundation (ECCS 0952785) is gratefully acknowledged.
Cite this paper
YunWei,FloydGalvan,ChuanyiJi,StephenCouvillon,GeorgeOrellana,JamesMomoh, (2016) Non-Stationary Random Process for Large-Scale Failure and Recovery of Power Distribution. Applied Mathematics,07,233-249. doi: 10.4236/am.2016.73022
References
- 1. Kaplan, S. M. (2009) Smart Grid: Electrical Power Transmission: Background and Policy Issues. Congressional Research Service, CRS Report for Congress, R40511.
- 2. Hoffman, P. and Bryan, W. (2009) Comparing the Impacts of the 2005 and 2008 Hurricanes on U.S. Energy Infrastructure. Office of Electricity Delivery and Energy Reliability of U.S. Department of Energy, OE/ISER Report.
- 3. Executive Office of the President (2013) Economic Benefits of Increasing Electric Grid Resilience to Weather Outages. President’s Council of Economic Advisers and the U.S. Department of Energy’s Office of Electricity Delivery and Energy Reliability, Washington DC, Technical Report.
- 4. Bruneau, M., Chang, S.E., Eguchi, R.T., Lee, G.C., O’Rourke, T.D., Reinhorn, A.M., Shinozuka, M., Tierney, K., Wallace, W.A. and von Winterfeldt, D. (2003) A Framework to Quantitatively Assess and Enhance the Seismic Resilience of Communities. Earthquake Spectra, 19, 733-752.
http://dx.doi.org/10.1193/1.1623497 - 5. Pagani, G.A. and Aiello, M. (2013) The Power Grid as a Complex Network: A Survey. Physica A: Statistical Mechanics and its Applications, 392, 2688-2700.
- 6. Hines, P., Apt, J. and Talukdar, S. (2009) Large Blackouts in North America: Historical Trends and Policy Implications. Energy Policy, 37, 5249-5259.
http://dx.doi.org/10.1016/j.enpol.2009.07.049 - 7. Office of the Manager National Communications System (2004) Supervisory Control and Data Acquisition (SCADA) Systems. Communication Technologies Inc., Technical Report.
- 8. Bienstock, D. (2011) Optimal Control of Cascading Power Grid Failures. 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC), Orlando, Florida, 12-15 December 2011, 2166-2173.
- 9. Dobson, I., Carreras, B.A. and Newman, D.E. (2004) A Branching Process Approximation to Cascading Load-Dependent System Failure. 37th Hawaii International Conference on System Sciences, Hawaii, 5-8 January 2004, 1-10.
- 10. Ilic, M.D., Allen, H., Chapman, W., King, C.A., Lang, J.H. and Litvinov, E. (2005) Preventing Future Blackouts by Means of Enhanced Electric Power Systems Control: From Complexity to Order. Proceedings of the IEEE, 93, 1920-1941.
- 11. Xiao, H. and Yeh, E. (2011) Cascading Link Failure in the Power Grid: A Percolation Based Analysis. Proceedings of the IEEE International Conference on Communications Workshops (ICC), Kyoto, 5-9 June 2011, 1-6.
http://dx.doi.org/10.1109/iccw.2011.5963573 - 12. Federal Energy Regulatory Commission and the North American Electric Reliability Corporation (2012) Transmission Facility Outages during the Northeast Snowstorm of October 29-30, 2011: Causes and Recommendations. FERC-NERC Staff Report.
- 13. Zhu, D. (2007) Electric Distribution Reliability Analysis Considering Time-Varying Load, Weather Conditions and Reconfiguration with Distributed Generation. PhD Dissertation, Virginia Polytechnic Institute and State University, Blacksburg, VA.
- 14. Liu, H., Davidson, R.A., David, R.V. and Stedinger, J.R. (2005) Negative Binomial Regression of Electric Power Outages in Hurricanes. Journal of Infrastructure Systems, 11, 258-267.
http://dx.doi.org/10.1061/(ASCE)1076-0342(2005)11:4(258) - 15. Wanik, D.W., Anagnostou, E.N., Hartman, B.M., Frediani, M.E.B. and Astitha, M. (2015) Storm Outage Modeling for an Electric Distribution Network in Northeastern USA. Natural Hazards, 79, 1359-1384.
http://dx.doi.org/10.1007/s11069-015-1908-2 - 16. Liu, Y. and Trivedi, K.S. (2004) A General Framework for Network Survivability Quantification. Proceedings of the 12th GI/ITG Conference on Measuring, Modeling, and Evaluation of Computer and Communication Systems, Dresden, 12-15 September 2004, 369-378.
- 17. Dui, H., Si, S., Cui, L., Cai, Z. and Sun, S. (2014) Component Importance for Multi-State System Lifetimes with Renewal Functions. IEEE Transactions on Reliability, 63, 105-117.
http://dx.doi.org/10.1109/TR.2014.2299132 - 18. Song, S., Coit, D.W., Feng, Q. and Peng, H. (2014) Reliability Analysis for Multi-Component Systems Subject to Multiple Dependent Competing Failure Processes. IEEE Transactions on Reliability, 63, 331-345.
http://dx.doi.org/10.1109/TR.2014.2299693 - 19. Chen, N., Qian, Z., Nabney, I. and Meng, X. (2014) Wind Power Forecasts Using Gaussian Processes and Numerical Weather Prediction. IEEE Transactions on Power Systems, 29, 656-665.
http://dx.doi.org/10.1109/TPWRS.2013.2282366 - 20. He, M., Yang, L., Zhang, J. and Vittal, V. (2014) A Spatio-Temporal Analysis Approach for Short-Term Forecast of Wind Farm Generation. IEEE Transactions on Power Systems, 29, 1611-1622.
http://dx.doi.org/10.1109/TPWRS.2014.2299767 - 21. Fanzeres, B., Street, A. and Barroso, L. (2015) Contracting Strategies for Renewable Generators: A Hybrid Stochastic and Robust Optimization Approach. IEEE Transactions on Power Systems, 30, 1825-1837.
http://dx.doi.org/10.1109/TPWRS.2014.2346988 - 22. Yuan, B., Zhou, M., Li, G. and Zhang, X.P. (2015) Stochastic Small-Signal Stability of Power Systems with Wind Power Generation. IEEE Transactions on Power Systems, 30, 1680-1689.
http://dx.doi.org/10.1109/TPWRS.2014.2353014 - 23. Bina, M. and Ahmadi, D. (2015) Stochastic Modeling for the Next Day Domestic Demand Response Applications. IEEE Transactions on Power Systems, 30, 2880-2893.
http://dx.doi.org/10.1109/TPWRS.2014.2379675 - 24. Tian, J. and Li, Y. (2014) System Dynamics Assessment of Mitigation Strategies for Power Distribution Poles Subjected to Hurricanes. Natural Hazards, 70, 1263-1285.
http://dx.doi.org/10.1007/s11069-013-0879-4 - 25. Lin, Y.H., Li, Y.F. and Zio, E. (2015) Integrating Random Shocks into Multi-State Physics Models of Degradation Processes for Component Reliability Assessment. IEEE Transactions on Reliability, 64, 154-166.
http://dx.doi.org/10.1109/TR.2014.2354874 - 26. Wei, Y., Ji, C., Galvan, F., Couvillon, S., Orellana, G. and Momoh, J. (2014) Learning Geo-Temporal Nonstationary Failure and Recovery of Power Distribution. IEEE Transactions on Neural Networks and Learning Systems, 25, 229-240.
http://dx.doi.org/10.1109/TNNLS.2013.2271853 - 27. IEEE Draft Guide for Electric Power Distribution Reliability Indices. IEEE P1366/D6, November 2011, 1-40.
- 28. Bloomberg, M.R. (2013) A Stronger, More Resilient New York. PlaNYC Report, City of New York, June 2013.
- 29. Reed, D.A., Kapur, K.C. and Christie, R.D. (2009) Methodology for Assessing the Resilience of Networked Infrastructure. IEEE Systems Journal, 3, 174-180.
http://dx.doi.org/10.1109/JSYST.2009.2017396 - 30. Cimellaro, G.P., Reinhorn, A.M. and Bruneau, M. (2006) Quantification of Seismic Resilience. Proceedings of the 8th US National Conference on Earthquake Engineering, San Francisco, 18-22 April 2006, Paper No. 1094.
- 31. O’Rourke, T.D. (2007) Critical Infrastructure, Interdependencies, and Resilience. The Bridge, 37, 22-29.
- 32. Pant, R., Barker, K. and Zobel, C.W. (2014) Static and Dynamic Metrics of Economic Resilience for Interdependent Infrastructure and Industry Sectors. Reliability Engineering & System Safety, 125, 92-102.
http://dx.doi.org/10.1016/j.ress.2013.09.007 - 33. Erjongmanee, S. and Ji, C. (2011) Large-Scale Network-Service Disruption: Dependencies and External Factors. IEEE Transactions on Network and Service Management, 8, 375-386.
http://dx.doi.org/10.1109/TNSM.2011.110311.110106 - 34. Davidson, R.A., Liu, H., Sarpong, I.K., Sparks, P. and Rosowsky, D.V. (2003) Electric Power Distribution System Performance in Carolina Hurricanes. Natural Hazards Review, 4, 36-45.
http://dx.doi.org/10.1061/(ASCE)1527-6988(2003)4:1(36) - 35. Guikema, S.D., Quiring, S.M. and Han, S.R. (2010) Pre-Storm Estimation of Hurricane Damage to Electric Power Distribution Systems. Risk Analysis, 30, 1744-1752.
http://dx.doi.org/10.1111/j.1539-6924.2010.01510.x - 36. Liu, Y. and Singh, C. (2011) A Methodology for Evaluation of Hurricane Impact on Composite Power System Reliability. IEEE Transactions on Power Systems, 26, 145-152.
http://dx.doi.org/10.1109/TPWRS.2010.2050219 - 37. Kant, K. (2007) Surviving Large Scale Internet Failures, Tutorial. Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, Edinburgh, 25-28 June 2007, Tutorial No. 1.
- 38. National Hurricane Center (2008) A Digital Record of the Complete Best Track Data, 2008.
ftp://ftp.nhc.noaa.gov/atcf/archive/2008/bal092008.dat.gz - 39. Wei, Y., Ji, C., Galvan, F., Couvillon, S. and Orellana, G. (2013) Dynamic Modeling and Resilience for Power Distribution. Proceedings of the IEEE SmartGridComm 2013 Symposium—Cyber-Physical Wide-Area Monitoring, Protection & Control, Vancouver, 21-24 October 2013, 85-90.
- 40. Bertsimas, D. and Mourtzinou, G. (1997) Transient Laws of Non-Stationary Queueing Systems and Their Applications. Queueing Systems: Theory and Applications, 25, 115-155.
http://dx.doi.org/10.1023/A:1019100301115 - 41. Amin, M. and Stringer, J. (2008) The Electric Power Grid: Today and Tomorrow. MRS Bulletin, 33, 399-407.
http://dx.doi.org/10.1557/mrs2008.80 - 42. Zhao, Y., Sevlian, R., Rajagopal, R., Goldsmith, A. and Poor, H.V. (2013) Outage Detection in Power Distribution Networks with Optimally-Deployed Power Flow Sensors. Proceedings of the 2013 IEEE Power and Energy Society General Meeting, Vancouver, 21-25 July 2013, 1-5.
http://dx.doi.org/10.1109/PESMG.2013.6672981 - 43. Bernstein, A., Bienstock, D., Hay, D., Uzunoglu, M. and Zussman, G. (2011) Power Grid Vulnerability to Geographically Correlated Failures Analysis and Control Implications. Technical Report, No. 2011-05-06, Columbia University, New York.
- 44. Nelson, B.L. and Taaffe, M.R. (2004) The Queueing System: Part I—The Single Node. INFORMS Journal on Computing, 16, 266-274.
http://dx.doi.org/10.1287/ijoc.1040.0070 - 45. Hosmer, D.W. and Lemeshow, S. (2008) Applied Survival Analysis: Regression Modeling of Time to Event Data. 2nd Edition, Wiley-Interscience, New York.
http://dx.doi.org/10.1287/ijoc.1040.0070 - 46. Kalbfleisch, J.D. and Prentice, R.L. (2002) The Statistical Analysis of Failure Time Data. 2nd Edition, John Wiley and Sons, New York.
http://dx.doi.org/10.1002/9781118032985 - 47. Ross, S.M. (2010) Introduction to Probability Models. 10th Edition, Academic Press, Burlington, MA.
- 48. Eick, S.G., Massey, W.A. and Whitt, W. (1993) Queues with Sinusoidal Arrival Rates. Management Science, 39, 241-252.
http://dx.doi.org/10.1287/mnsc.39.2.241 - 49. Blake, E.S., Landsea, C.W. and Gibney, E.J. (2011) The Deadliest, Costliest, and Most Intense United States Tropical Cyclones from 1851 to 2010. National Hurricane Center, NOAA Technical Memorandum, NWS NHC-6, Miami.
- 50. Colley, J. and De Blasio, S.M. (2008) Sr. Hurricane Ike Impact Report. Technical Report, US Department of Homeland Security, 2008.
- 51. Van Trees, H.L., Bell, K.L. and Tian, Z. (2013) Detection Estimation and Modulation Theory. 2nd Edition, Wiley, New York.
- 52. Plackett, R.L. (1983) Karl Pearson and the Chi-Squared Test. International Statistical Review/Revue Internationale de Statistique, 51, 59-72.
http://dx.doi.org/10.2307/1402731 - 53. Le Cam, L. (1990) Maximum Likelihood—An Introduction. ISI Review, 58, 153-171.
Appendix
1. Proof of Corollary 1
Proof: We begin with the Transient Little’s Theorem. Computing the derivative of both sides of Equation (7), we have

where 

The first term on the right-hand-side is





2. Pearson’s Hypothesis Test
Pearson’s Hypothesis Test: The hypothesis test is based on a chi-square statistic which compares the failure occurrence times with their sample mean. The details of testing 

1) Compute the estimated failure rate


2) Divide the failure occurrence times into m non-overlapping intervals



3) Count


4) Use the estimated 

occurrences.

5) Compute the sum

(number of independent parameter fitted) − 1. Since one parameter 

6) Given a confidence level, for instance 95%, we obtain a threshold value









