European Commission logo
polski polski
CORDIS - Wyniki badań wspieranych przez UE
CORDIS

Generally Accepted Reliability Principle with Uncertainty modelling and through probabilistic Risk assessment

Final Report Summary - GARPUR (Generally Accepted Reliability Principle with Uncertainty modelling and through probabilistic Risk assessment)

Executive Summary:
Historically in Europe, power system reliability management has been predominantly relying on the “N-1” criterion, whereby the system should be able to withstand at all times an unexpected failure or outage of a system component in such a way that the system is capable of accommodating the new operational situation without violating security limits.

Today, the increasing uncertainty of generation due to intermittent energy sources, and the growing complexity of the pan-European power system, increases the need for new reliability criteria with a better balance between reliability and costs.

In this perspective, the GARPUR EC-funded project was launched in September 2013. Coordinated by SINTEF Energy Research, the project unites 7 TSOs, 12 R&D providers and 1 innovation management expert. After 4 years of work, GARPUR has designed, developed and assessed new probabilistic reliability criteria for the pan-European power system and its evolution beyond 2020.

GARPUR has also evaluated the relevance of the criteria and their practical use, while seeking to maximize social welfare. In simple terms, this new methodology allows TSO’s to assess the probability and consequences of failures in their power system, expressed as the potential cost of power interruptions to consumers.

With this information, TSO’s can make decisions and investments that provide the best balance between power supply security and cost. This new approach has been designed to seamlessly fit in with existing TSO processes. Four TSOs have tested the methodology for planning or operating their grids and the results are very promising.

The new criteria are developed for system development, asset management and system operation in order to ensure a consistent treatment of reliability across all time horizons. This report summarizes the keys results of the project and their potential benefits to TSOs. For more details, please refer to the deliverables available on the website:

www.garpur-project.eu

Project Context and Objectives:
Power system reliability management aims to maintain power system performance at a desired level, while minimizing the socio-economic costs of keeping the power system at that performance level. Historically in Europe, network reliability management has been relying on the so-called “N-1” criterion: in case of fault of one relevant element (e.g. one transmission system element, one significant generation element or one significant distribution network element), the elements remaining in operation must be capable of accommodating the new operational situation without violating the network’s operational security limits. Today, the increasing uncertainty of generation due to intermittent energy sources, combined with the opportunities provided e.g. by demand-side management and energy storage, call for imagining new reliability criteria with a better balance between reliability and costs.

Within this context, the GARPUR project has designed, developed, assessed and evaluated new reliability criteria to be progressively implemented over the next decades at a pan-European level, while maximizing social welfare. The seven main objectives of GARPUR were:
1. To develop a consistent probabilistic framework for reliability management, covering the definition of reliability, the calculation of reliability criteria, and the formulation of optimization problems expressing the economic costs and the desired target reliability levels at the pan-European level and within each individual control zone.
2. To develop a consistent methodology for the quantitative evaluation of the economic impact on society of different reliability management strategies both at the pan-European level, and within each control zone.
3. To develop a quantification platform able to compare different reliability management strategies in terms of their impact on the social welfare.
4. To ensure the compliance of the developed methodologies with the technical requirements of system development, asset management and power system operation, and to demonstrate the practical exploitability of the new concepts at the pan-European level and in all decision-making contexts.
5. To validate the different reliability criteria with the help of pilot tests.
6. To ensure the general acceptance of the proposed methods and tools by all stakeholders affected by the reliability management of the pan-European electric power system.
7. To define an implementation roadmap towards the use of the new reliability management practices.

To ensure progress beyond the state-of-the-art, GARPUR has pursued five different aspects by which progress can be ensured and evaluated. These were defined as:
1. Model the spatiotemporal variation of the probabilities of exogenous threats and take into account the actual criticalities of service interruptions in the reliability management.
2. Take into account the increased possibilities of corrective control and its probability of failure in the reliability management.
3. Exploit the flexibility provided by demand-side management and energy storage, to achieve the reliability enhancement given the emergence of decentralized renewable generation.
4. Explicitly model the impact of system development and asset management decisions on the reliability management during operation.
5. Explicitly take into account the consideration of low-probability high-impact events, such as the ones originating from extreme weather conditions, possibly through climate change, or those originating from adverse behaviours of external entities.

Project Results:
1 Decision-making challenges in reliability management
In today's world, a reliable and affordable electricity supply is the prerequisite to most societal and economic activities. European Transmission System Operators (TSOs) have the mission to ensure electricity supply in Europe by building, operating and maintaining the electric power transmission system.

Power system reliability describes the level of confidence in ensuring a continuous electricity supply to consumers. Reliability management is about gathering all of the information that enables TSOs to make better short-term and long-term decisions, with the aim to achieve an optimal balance between the costs of ensuring a secure power supply and the socio-economic costs of an interruption.

Power system reliability management means making decisions under uncertainty. It aims at meeting a reliability criterion, while minimising the costs of doing so. A reliability criterion is a principle imposing a standard to determine whether the reliability level of a power system is acceptable. Reliability management is commonly broken down into reliability assessment and reliability control. Reliability assessment concerns quantifying the (anticipated) performance of a system, taking into account the uncertainties in its operational conditions over specified time periods. Reliability control is the process of selecting the most appropriate actions to solve the problems revealed by the reliability assessment process. Reliability management also covers a wide range of activities over several timescales, from long-term infrastructure planning to real-time system operations, see Figure 1.


Figure 1 The time-scales and contexts of reliability management

In system development, decisions are taken about whether and where to build new transmission lines and substations, in order to adapt the physical network capacities to meet the future needs of the power system.

In asset management, the maintenance and replacement policies determine the various activities that will be undertaken in the coming decades to ensure that the grid assets will be in reliable working condition. In GARPUR we have also tackled the question of the scheduling of the outages for such activities, which need to be planned at times where they have the least possible impact on the system reliability.

In system operations, decisions are taken for operational planning and for real-time operation. In operational planning, maintenance schedules and transmission system capacities provided to the market are fine-tuned, and generation reserves are purchased, to ensure reliable operation for the next hours and days. In real-time operation, system topology and generation schedules are adjusted to avoid that component failures lead to power supply interruptions.

Historically in Europe and also today, power system reliability management is based on the N-1 criterion, with some variations to fit particular characteristics of different control areas of the pan-European system. In general terms, the N-1 criterion means that in case of any fault of one relevant element in the power system, the TSO must be still capable of supplying electricity to all consumers.

The increasing penetration of intermittent and distributed energy sources, ageing infrastructure and the European objective of a single electricity market has rendered the current N-1 criterion inadequate. A new, more responsive approach to reliability management is needed that exploits more information and that can lead to more optimal decisions that are better for everyone.

2 GARPUR: a risk-based approach to reliability management
The GARPUR research project was established in 2013 under the European Commission’s 7th Framework Programme to investigate a new reliability management approach which would allow networks to operate much closer to their economic optimum, by taking explicitly into consideration:
- the probabilities of failures of the different system components, by carefully modelling their dependence on weather conditions, maintenance history and real-time conditions;
- the uncertainties in generation and load forecasts;
- the flexibility provided by the demand-side, energy storage and distributed electricity generation from renewables;
- the possible costs of the power supply interruptions that could occur, by modelling the socio-economic impact of power supply interruptions on end-users.

GARPUR has developed a comprehensive probabilistic risk-based reliability management framework and demonstrated its technical feasibility, scalability and practical interest of the proposed methods to support TSOs in their reliability management activities over all relevant time scales and system areas. This framework targets on optimized socio-economic benefits while maintaining the security of supply at an adequate level.

3 GARPUR methodology for risk-based reliability management
GARPUR has developed a new methodology that has the potential to transform reliability management. It moves from the current N-1 practice to a risk-based approach by allowing transmission system operators to consider the probabilities of grid component failures, how big an impact the failure will have and what will be the socio-economic costs associated with it. The methodology has been named Reliability Management Approach and Criterion (RMAC) which relies on four components, described below.

A socio-economic objective
One of the functions of reliability management is to optimize the socio-economic costs and benefits for a given activity, including its impact on the market surplus, on the capital (CAPEX) and operating expenditures (OPEX) of TSOs, and the potential cost of power supply interruptions for end-users.

The socio-economic objective function of the RMAC balances the costs and benefits of reliability management to all electricity system stakeholders, from TSOs to end-users.

In real-time operations, for example, the socio-economic objective would be to minimize the total sum of different types of costs – including the costs associated with preventive and corrective measures (both actual costs and potential costs) and the costs incurred by end-users in the case of a power supply interruption.

A reliability target
Taking decisions that optimize the above-mentioned socio-economic objective may lead to unacceptable situations in terms of system performance. In some situations, it might lead to a system operating with a too high probability of large-scale power supply interruptions. Or it may lead to increased probability of interruptions in some of the smaller, more vulnerable areas of the network.

Therefore, a range of acceptable situations (network states and level of service) are specified, together with the overall probability that they should hold with. As an example, the real-time reliability target could be expressed as “ensuring with 99.9 % probability that no severe power supply interruptions should occur”.

A discarding principle
In practice, strictly applying the previous two components in large-scale electric power systems is not feasible, as one could not consider all the possible events.

Therefore, the discarding principle allows one to ignore a number of possible events, provided that their risk (i.e. expected socio-economic impact in case of power supply interruptions) contribution is below a defined threshold. In other words, the discarding principle specifies in a dynamic manner which events can be ignored. As an example, in real-time operations the discarding principle may lead to replacing the current fixed set of N-1 events with a smaller set of events (under good weather conditions/low risk situations) and a larger set of events (under severe weather/high risk situations).

A relaxation principle
It is possible then that in the event of a decision to be made, none of the available options are compliant with the reliability target & discarding principle that has been adopted. In this situation, the threshold for discarding can be progressively increased until a satisfactory option can be found. In practice, this would mean ignoring even more events than specified in the discarding principle starting with those that have the lowest risk in relative terms.

The methodology described above lays the groundwork to perform reliability management based on a better knowledge and evaluation of reliability of supply and of the risk of power supply interruptions. The methodology has been designed to:
- Suit all time horizons and type of decisions covered by the reliability management activities
- Be compatible with the computational and data resources that most TSOs are in capacity to currently deploy
- Be sustainable, i.e. usable in the future, as more data and computational resources become available
- Be interpretable, in particular for comparison with the traditional N-1 approach.

4 Translating the risk-based methodology into practical algorithms
A key objective with GARPUR was to ensure that the proposed risk-based approach for reliability management could be practically implemented. To this end, reliability assessment and control algorithms were developed in order to demonstrate the feasibility and tractability of the proposed methodology. The algorithms were implemented to test the scalability of the methodology (its ability to be applied to any TSO control zone in Europe) and its flexibility (its ability to address all the required time scales).

The algorithms were incorporated into research-grade software for the contexts of real-time operation, short-term operational planning, and mid-term outage scheduling.

For real-time operation, the algorithms were designed as adaptations of existing methods currently used by TSOs in view of:
- Optimizing the combination of preventive and corrective control decisions - Probabilistic Security Constrained Optimal Power Flow (SCOPF).
- Evaluating the expected value of potential power supply interruptions according to real-time weather data (probabilistic contingency screening algorithm).

For short-term operational planning, the algorithms developed allow for the:
- Study of different operating scenarios for immediate timeframes (next hours) and the ability to evaluate their expected socio-economic impact and probability of meeting the defined reliability target in real-time operation (Monte-Carlo simulations)
- Optimization of operational planning decisions in the case of uncertain renewable generation and weather conditions that could impact the real-time operation (probabilistic SCOPF in look-ahead mode).

For mid-term outage scheduling, the algorithms developed use massive parallel simulations and stochastic optimization (Cross-Entropy method) that make it possible to compare planning scenarios and to determine an outage schedule over a period of several months that minimizes the impact of planned outages on system operation.

These developments demonstrate that the risk-based Reliability Management Approach and Criterion (RMAC) can indeed be translated into practical tools that can help TSOs better assess, compare, and optimize their decisions.

Proxies
When addressing longer-term decisions, reliability management algorithms are challenging to implement, as they need to model the shorter-term decision-making stages of TSOs over several scenarios spanning the longer-term time horizons. This is often not computationally feasible, so approximations (proxies) are used to quickly determine a realistic behaviour of the TSO for the shorter-term decision-making stages.

In the case of mid-term outage scheduling, two proxies were designed to model respectively day-ahead and real-time operation decisions, using machine-learning methods and high-performance computing infrastructures. The use of these proxies enables reduction of computation time by several orders of magnitude, which made possible the reliability assessment of larger systems.

Proxies have also been proposed for the operational planning context to speed up the evaluation and identification of scenarios over a few hours or days.

Figure 2 A proxy takes the place of an accurate simulation for shorter-term reliability management processes of TSOs, when considering longer-term reliability management problems.

For further reading
Deliverable D2.2 Guidelines for implementing the new reliability assessment and optimization methodology.

5 A socio-economic impact assessment methodology to compare reliability management approaches
For the proposed probabilistic reliability management approach to be widely adopted, convincing arguments must be formulated to establish how these methods can outperform the current practice.

At the heart of this new probabilistic approach to reliability management is determining the best trade-off between the costs of providing a secure electricity supply and the socio-economic costs of power interruptions. A key challenge therefore is how to calculate these socio-economic costs, which are incurred not just by TSOs and electricity producers but are also incurred by consumers and the environment. It is also necessary to calculate congestion costs.

To that end, a new methodology was developed that compares the socio-economic impacts of different reliability management approaches. The Socio-Economic Impact Assessment (SEIA) methodology analyses the social welfare of the electricity market and quantifies the costs, benefits, and surpluses of all market stakeholders - including electricity consumers, producers, TSOs, the government (taxes on electricity) and the environment (externalities). A general mathematical formulation is developed that allows a wide range of costs to be quantified. The formulation is applied to a range of inputs, including different nodes, generation technologies, consumer types, time of occurrence, duration of interruptions, and pollutants.

Guidelines are used to show how the SEIA methodology can be applied to test cases that cover a range of time horizons – from short-term to long-term. The methodology also analyses the possible responses of electricity market stakeholders to changes in reliability level, electricity prices and taxes. A multi-actor approach is adopted by analysing the interaction of TSOs, countries and consumer types, and how their decisions affect welfare. Three types of phenomena are studied:
- Interaction between TSOs, where there is cross-border cooperation on reserves, which can increase reliability of supply and decrease reliability costs.
- Interaction between countries, where changes in reliability criteria may impact the interconnector transmission capacity.
- Impact of reliability management decisions on different consumer types and locations, affecting the welfare distribution between consumers.


Figure 3 System costs and benefits, and transfer payments between stakeholder groups.

A set of recommendations was drafted to support the practical implementation of the socio-economic impact assessment methodology, especially in regard to data requirements. A range of data inputs are required, including data about energy not supplied and the value of lost load. In the case that such data is not available, alternative data types are proposed. And finally, a roadmap for further development of the methodology was designed, which includes an estimation of consumer responses to price and reliability.

This methodology makes possible a full comparison between the traditional reliability management approach based on the N-1 criterion and the new probabilistic approach. Its integration into TSO handbooks and guidelines will pave the way for a widespread adoption of the probabilistic approach to reliability management.

For further reading
Deliverable D3.2: Recommendations for implementing the socio-economic impact assessment methodology

6 Proof of concept on the system development process
The system development process is about making decisions related to changing transmission capacities, either within a TSO’s own system or with other TSOs systems. A system planners job is to ensure that the system’s capability to transfer power from producer to consumer is sufficient. Those decisions, on whether to build, upgrade or replace high voltage links and control devices, require accurate data and realistic assumptions.

The GARPUR probabilistic approach is able to provide system planners with better data and better assumptions, that are linked with a specific grid structure at a given future target year. It provides a clearer understanding and greater visibility of the risks related to the reliability of electric power supply and of the costs incurred in the operational context.

Such information allows the planner to make sound decisions; to compare different grid reinforcement variants and to identify if an investment is worth it. This is illustrated in the Figure below. This detailed assessment is based on the analysis of a large number of representative, possible operating states. It is also based upon the ability to identify reliability and operational constraints that take into account the need to provide market capacities and to operate and maintain the grid in a secure manner.


Figure 4 The triangular relation between grid investment, socio-economic welfare and reliability

The proposed system development methodology relies on three main components:
- A framework is developed that considers a specific target year - and simulations are made that can assess reliability and transmission capacities and their impact on the electricity markets. The goal is to identify the main transmission bottlenecks and to assess their impact on the energy price and wider socio-economic welfare. With this information, a clearer cost-benefit analysis can be made that helps with decisions related to the upgrade of the transmission system structure.

- Based on the Monte-Carlo simulation approach, a large number of zonal and nodal scenarios are developed to simulate how the system could be operated and maintained during the target year. In order to limit the number of grid simulations needed, and to ensure a good compromise between the number and the level of detail of each simulation, a two-stage approach is used. Firstly, a set of ‘representative snapshots’ are identified based on clustering techniques. In the next stage, these ‘representative snapshots’, each having a probability of occurrence, are analysed in greater details.

- Proxies (model approximations) are used to model and assess system operability and maintainability. Using “representative snapshots”, they can model real-time operations as well as modelling a large set of days by using a sequence of consecutive snapshots. These proxies allow to quantify the reliability-related OPEX as well as the local part of socio-economic welfare.

At the end of the process, the system development planner is provided with a set of identified system weaknesses, which provide valuable guidance when making decisions on how to mitigate system weakness. The different system development options proposed by the planner are analysed with quantified metrics, which support the investment decision-making.

The methodology was first tested on a 10-bus system using realistic demand and wind data to build a set of credible operating states, and a market model to determine the set points of thermal generation units whilst considering temporal constraints like ramp rates and minimum up and down times.

The test demonstrated how a traditional approach to system development planning based solely on a few generation and demand conditions and not taking into account the probability of occurrence of situations (i.e. market clearing, planned and unplanned outages) nor their impacts would have missed some impacting system situations or would have overestimated the impact of other system situations and would have led to a sub-optimal investment decision.

This proof of concept paves the way to improved appraisal mechanisms for future investments in the transmission system.

For further reading
Deliverable D4.2 Upgrading of the decision-making process for system development


7 Proof of concept on the asset management process
The decisions on asset management policies are of utmost importance to the TSOs given the financial amounts at stake, and because those decisions will condition the reliability of supply for several decades. Probabilistic methods are adapted to define how to make the best use of the resources available for this purpose. However, a clean implementation of such a probabilistic method is challenging due to the data requirements, as well as the overall complexity when one tries to model the various actions that are going to happen later once in operation.

Another problem faced by the TSOs is the question of scheduling the various asset management operations, especially for those which require the outage of the asset for the duration of the operation. Indeed, such outages make the grid more vulnerable, therefore the TSO has to wisely schedule them at times when they do not jeopardize the reliability of supply. This process usually starts one or a few years in advance, and is regularly updated in order to adjust to any unexpected events.

The GARPUR theoretical approach has been derived to respond to each of these two questions, thus enabling the assessment of asset management policies and outage schedules to be determined in a more accurate manner, which eventually will lead to better informed decision-taking.

In the case of asset management policy assessment, the method developed enables to simulate, over a period of 20 years, the impact of specified asset management policies. In the simulations, the ageing process of grid components is modelled, and its impact is then translated into component failure rates varying time-and space-wise across the grid. The future reliability of supply is evaluated probabilistically through a Monte-Carlo approach where the uncertainties on external factors are sampled in the form of yearly trajectories at the hourly time-step.

Regarding the grid simulation once in exploitation and the evaluation of the expected power supply interruption costs, the various decisions such as generation re-dispatching or the recourse to corrective measures are also taken into account in the simulation through proxies. Due to obvious tractability reasons when one considers real size systems, proxies based on OPF cannot be envisaged, which prompted us to consider building proxies based on machine-learning techniques. Such an approach, combined with parallel computing, would enable us to overcome the computational burden.


Figure 5 Schematic representations of the assessment proposals for maintenance policy and outage scheduling highlighting the methodology and the use of shorter-term proxies

In addition, we have constructed a full-fledged proxy for outage scheduling. This proxy is based on a greedy algorithm that takes as an input a list of outage requests and computes an efficient outage schedule. This method could also be adapted to investigate whether investments should be made in condition monitoring devices.

The above-mentioned proxies are also core elements of our methodology to probabilistically evaluate an outage schedule. This methodology can help to spot the weaknesses of a tentative outage schedule or to compare a few alternative ones. As a future step, it should be possible to build optimization schemes that would automatically compute good outage schedules in a probabilistic context.

For both the asset management policy assessment problem and the outage scheduling problem, the results of the simulation provide an evaluation of the OPEX, CAPEX, workforce requirements, and interruption costs, at the global level as well as at a more local resolution. Such outcome enables the TSO to use the GARPUR reliability management approach and criterion in the context of asset management.

Proof-of-concept examples have been published in academic papers. It is recommended to pursue the validation of these methods beyond the frame of the GARPUR project through pilot tests on real-life systems.

For further reading
Deliverable D5.2 Pathways for mid-term and long-term asset management

8 Practical implementation of GARPUR for system operation processes
A key focus for GARPUR was to develop the probabilistic reliability management methodology on real time operations and short-term operational planning. It was important that the methods developed:
- Produce results easily understandable by the operator
- Be implementable with the current data and models existing at TSOs
- Be scalable for a pan-European system and give outputs close to real-time
- Fit within the current TSO workflows
- Be transparent and easy to analyse.

It is expected that such methods would provide operators with information that would increase their situational awareness for decision-making.

For real-time operations, methods and software were developed to assess the system reliability in real-time, using operational data, models of the system response to contingencies, the weather conditions and other external factors impacting the failure rates of the components of the power system. It was also important to take into account the possible failure of corrective actions. If this was neglected the results may be optimistically biased towards corrective actions instead of preventive actions. The proposed real-time operation methods were tested and validated as part of the pilot tests in GARPUR on a live system.

Regarding short-term operational planning, methods and algorithms were developed that allowed TSOs to assess risk levels hours or days into the future by factoring in possible weather conditions, failure rates of components, and forecasting errors on load and renewable energy production. The algorithm generates and selects a number of scenarios to manage the forecast uncertainty. Uncertainty makes the short-term problem more complex to deal with than the real-time problem, as multiple time-steps leading up to real-time must be taken into account, including decisions made by the operator. The resulting short-term reliability evaluation would enable TSOs to have a more comprehensive assessment of reliability of supply in the context of operational planning.

Overall, these methods will provide a much higher resolution to reliability assessment, resulting in improved reliability management in operations, which can lead to more efficient system operation.

The development of a sound reliability assessment methodology can provide a basis for comparing control actions and thereby optimizing system operations from a socio-economic perspective.
More generally, quantifying the risk related to reliability of the system in socio-economic terms, rather than in technical terms, allows for easier communication to non-technical stakeholders and is valuable for direct cost-benefit analyses in reliability management.

Finally, the use of weather-dependent failure rates is a fundamental part of the methodology and impacts the result greatly compared to static failure rates.


Figure 6 An example of the impact of weather data consideration on reliability risk assessment

In Figure 6 the blue curve of the upper graph shows how the residual risk (risk implied by contingencies not covered by the N-1 criterion) changes over time because of changing weather conditions. The blue curve in the lower graph shows how the GARPUR approach adapts the number of contingencies considered when the weather conditions are taken into account. The two dotted curves show what would have been the result of neglecting the impact of weather conditions.

For further reading
Deliverable D6.2 How to upgrade reliability management for short-term decision-making


9 Prototype software platform to compare reliability management approaches and criteria
One of the challenges in implementing a new reliability management methodology is being able to clearly compare different approaches. To this end, a prototype software platform was developed that made it possible to perform numerical simulations of the different reliability management approaches - in particular allowing a comparison between the N-1 approach and the probabilistic approach developed by GARPUR. The key idea behind this tool was to allow any member of the power system community to test and grasp the benefits of a probabilistic reliability management approach.

The software platform aims to facilitate off-line studies on how to move away from the N-1 approach towards the probabilistic approach, by making it possible to compare how the same system would perform from a socio-economic point of view using each approach. This involves performing massive simulations at a realistic temporal and spatial resolution over several areas of the European interconnected power system.

The prototype that has been developed uses Matlab (Matpower package), Python and various other existing software packages. It includes in particular a CIM (Common Information Model) parser to convert input files, and implements ‘DC’ Optimal Power Flow modules to optimize day-ahead and real-time decisions, both according to the N-1 criterion and according to the RMAC.

The prototype was tested on small academic benchmarks and on a real medium-size power system - which allowed the evaluation of the quality and the scalability of its computations for real-world applicability. The key challenges encountered during the tests were to clarify the quality of the input data and how to handle possibly missing data, as well as dealing with the significant computational and human resources required to run the platform to meet the needs of TSOs.


Figure 7 Illustration of calculation core supporting comparison of reliability criteria

The analysis of the test results and the lessons learnt led to recommendations for the development and use of an industry-grade version of the platform. For industrial use, there is a need to upgrade the platform to accommodate AC optimal power flow, to implement it on massively parallel computing hardware, and to incorporate state-of-the-art computational techniques.

For further reading
Deliverable D7.3 - Report on a broader comparison of different reliability criteria including recommendations on how to evolve into an industry grade tool

10 Testing the methodology on real systems
A key factor in the successful adoption of the GARPUR methodology is to ensure it can be practically implemented in real-life. To this end, three pilot tests were designed and performed to evaluate the challenges and benefits of implementing and using the method and to evaluate its scalability. In addition, two case studies have been presented in the GARPUR project in order to show the benefits of a probabilistic approach.

Real-time risk information in the control room through probabilistic reliability assessment
This pilot test was carried out at the control centre of Landsnet, the Icelandic TSO. It implemented probabilistic reliability assessment in the context of real-time operation of the Icelandic system, assessing system snapshots collected every few minutes from their Energy Management System. In this study, rather detailed models for system response, value of lost load, and weather dependent failure rates were implemented for the Icelandic system using efficient parallel computing to ensure results were computed fast enough for on-line use. Several prototype visualization tools were developed, to display the computed indicators in a meaningful way to the operators sitting in the control room. Based on this test, the probabilistic approach was found to provide quite useful information, complementary to the result of classical N-1 security assessment and well in line with operators’ experience and intuitions. In addition, the study highlighted the operational value of system integrity protection schemes currently used at Landsnet. Recommendations fostering the development of real-time reliability control tools and look-ahead mode reliability assessment tools were also issued as a result of this pilot test.

Greater transparency in system development through clustering and optimal power flow techniques
The near real-life pilot test was realised in the planning department of ELIA, the Belgian TSO, in collaboration with the University of Strathclyde. It concerned an actual system development study focusing on a part of the Belgian transmission system. It assessed the GARPUR approach for system development, showing how clustering techniques can be applied to system snapshots to create a set of credible future operating states for a future year. A DC-OPF assesses these snapshots to estimate the operability and maintainability of the system, and to identify the main reliability problems to be expected in the future year. The conditions under which a given asset can be taken out for maintenance are therefore possible to know in advance. The test showed also the trade-off between the number of clusters and risk estimates. It is shown that the proposed framework can be implemented for real transmission networks. However, the need for a Security Constrained OPF model for the rapid and convenient assessment of risk of various operating conditions, and for simulating the preventive/curative actions, was identified. In addition, there is also room for further improvement of the clustering techniques.

A probabilistic approach can identify and enable savings for society
Two additional case studies were carried out on the Nordic power system to show the benefits of the probabilistic reliability assessment approach.

The system development study of the Norwegian TSO Statnett concerned the expansion of the grid in the south-western part of Norway. It was conducted by using an in-house tool of Statnett and demonstrates how the probabilistic approach can influence investment decisions. The probabilistic approach enabled Statnett to quantify the value of security of supply and allows therefore to properly rank system expansion alternatives regarding the highest socio-economic benefit. In this case, it showed that the alternative with a higher level of security of supply was not beneficial since the increase of investment costs significantly outweighs the reduction of the expected interruption cost. The study shows that large socio-economic savings of about 25 % can be reached mainly to lower investment costs compared to the alternative with a higher level of security of supply. This equals in this specific case to around 130 million EUR.

The case study of SINTEF assesses the long-term impact of the amount of transmission capacity given to the power market both on the market costs and on the interruption costs. This study used a probabilistic methodology developed at SINTEF that integrates a hydrothermal market analysis for the purpose of reliability assessment in the context of long-term planning studies. The study found that increasing the transmission limits for a specific region of the Norwegian power system did not significantly change expected interruption costs, although it would relax the constraints placed on the market and hence significantly reduce the power market costs. It was also shown that the impact of climatic variability and uncertainties can be substantial. In general, the study illustrates how a probabilistic approach and flexible transmission limits may allow for higher socio-economic surplus.

Comparison of the probabilistic approach against the classical N-1 criterion
Another test was performed by the French TSO RTE using the GARPUR Quantification Platform. The test focused on short-term operation and it involved the “Tavel-Realtor” corridor, which is in the south-eastern part of the French grid. Four reliability management strategies were simulated on the software prototype in order to compare their socio-economic impact. The four strategies tested were the N-0, the strictly preventive N-1, a first probabilistic approach that just considered N-1 contingencies, and a second one that considered in addition some N-2 contingencies. Some sensitivity analyses were performed on the main input parameters, to observe which ones were most influential for each of the 4 strategies tested. The results confirmed that implementing a probabilistic approach should lead to similar reliability levels as today, but at a lower cost of operating the system. This study showed the interest of socio-economic impact assessment studies, in order to better understand the main advantages of the probabilistic approach.

Altogether, these tests demonstrated that the probabilistic framework designed in GARPUR is possible to put into practice and that it can give valuable contribution in assessing the system reliability. There exist many opportunities for using the methodologies in order to improve decision-making and to optimally balance security of supply and socio-economic costs. The achievements of the tests can also pave the way for developing scalable tools that may help TSOs justify actions and investments in terms of socio-economic benefits.

For further reading
D8.1 Pilot testing methodologies, models, scenarios and validation approach
D8.2 Public summary - pilot testing using the quantification platform prototype
D8.3 Public summary - results from near-life pilot testing

11 GARPUR vision and roadmap towards the future of reliability management
Over its four years, the GARPUR project has both developed and tested a comprehensive probabilistic reliability management framework aiming at effectively guiding Transmission System Operators (TSOs) in their investment, asset management and operational decisions. The results and experiences obtained in the GARPUR project, especially the demonstration of the methodologies and tools in close to real-life pilot test and studies, show that moving towards probabilistic reliability management in the interconnected pan-European transmission system is possible, but challenging.

This lead to the formulation of the GARPUR vision regarding the future of power systems reliability management:

“An adoption of probabilistic reliability management by all stakeholders dealing with electric power systems reliability management, from experts in the TSO organizations who have the practical responsibility to ensure the security of electricity supply, to the persons in charge at regulators and governments whose responsibility it is to ensure the electric power system performs for the benefit of all parts of society”.

Migration towards a systematic use of the probabilistic reliability management approaches can only happen gradually. It will require significant investments and a gradual change of attitudes towards probabilistic methods to make this migration possible and sustainable. Furthermore, several challenges need to be progressively overcome to realise the GARPUR vision and to gain fully the benefits of probabilistic reliability management.

Building upon the project findings, as well as a wide consultation among European stakeholders, a roadmap for the migration towards GARPURs vision by 2030 has been designed. This roadmap addresses the main challenges and suggests actions that can thus serve as template for the different stakeholders. By using a common roadmap, stakeholders can communicate and co-ordinate their plans with their neighbour TSOs and countries.

In addition, it is of paramount importance that the national and historical contexts of each TSO are considered. Different countries and TSOs will possibly prefer different path-ways to progressively implement a probabilistic approach. Some TSO’s might want to start to focus on managing short-term risks and exogenous uncertainty, while others see value in refining methods and data for long-term development of the system, impacting cross-border transmission capacity.

The GARPUR consortium recommends that further experience sharing among the different departments of TSOs responsible for system development, asset management, and system operation, and between different TSOs in close cooperation with ENTSO-E, should be fostered in order to speed up the transition. GARPUR suggests also that ENTSO-E should play a major role and could initiate and coordinate these activities among different TSOs and Regional Security Coordinators, especially in the context of system operation. All these local, regional, and European initiatives can in principle proceed in parallel, but they need to be monitored at the European scale and if necessary, be coordinated to ensure the swiftest possible progress.

The recommended actions have been classified in different clusters in the roadmap:

Regulation and socio-economic considerations
Currently, the regulatory framework organising and incentivising the power sector is in general not fitted to probabilistic reliability management. Incentives, remunerations, roles and responsibilities are defined to ultimately ensure that the power system is N-1 secured. Therefore, a next step should be to expand the regulation to take into account the probability of failure and risk associated with the N-1 faults that TSO’s are already assessing and to encourage the use of the new reliability targets and socio-economic evaluation criteria. The remuneration mechanisms of TSOs should also be adapted so as to incentivize them to implement the new approach in the most efficient way. For this decision-making, the regulators need information and a better understanding of the benefits and the socio-economic consequences of the probabilistic reliability management approach compared to the existing reliability management.

Data and models of uncertainties
The accuracy of the probabilistic reliability management approach is dependent upon the availability and quality of data. As a first step, TSOs have to collect relevant reliability data, i.e. failure data, outage and restoration times and interruption cost data. Based on the data, improved models have to be developed. Gradually these actions will provide the TSO’s with more precise results from the use of the probabilistic reliability management approach and supports an iterative improvement of models. It is also recommended to put in place common guidelines to persistently ensure the collection of data, maintain the databases, and the inferred models, and share the relevant parts of these data and models among the different stakeholders concerned.

Methodologies, algorithms and software
When applying the probabilistic reliability management approach, the resulting stochastic simulation and optimization problems are significantly more complex from the computational point of view compared to those used for N-1 based reliability management. Even though GARPUR has developed several methods and software prototypes, few have been implemented in TSOs practices and in industrial grade software. Therefore, a next generation of industrial grade software and tools needs to be developed with the goal of reaching the necessary robustness, efficiency and availability to support a large community of both industrial and academic actors for the progressive implementation of probabilistic reliability management approaches.

Testing and implementation at the operational level
A transition towards reliability management based on probabilities and expected consequences requires a change towards a risk-aware mind-set of all the main actors in the power sector - and especially in TSOs. The operation of the grid must be based on balancing the costs of providing security of electricity supply with socio-economic costs of power supply interruptions and should not focus just on achieving high security of power supply and all of costs and measures that accompany that.

Pilot scale testing of the new methods and approaches are an important next step to change the mind-set and to increase the trust in probabilistic reliability management at the same time. The first practical step for a TSO to take is to calculate the probabilities and risks of the N-1 contingencies that they are already assessing. This brings into light which data and models are required to develop the probabilistic approaches further and an increased understanding of grid reliability and its variations over time and space. Even though GARPUR has performed several pilot tests with different TSOs, a main focus has to be on further testing of the new approach in parallel to the existing N-1 approach to convince more people in the TSOs and other stakeholders of the applicability and benefits of the risk-based approach. Such testing will also provide more insights into how the probabilistic reliability management approach works in specific use-cases and on challenges for the implementation.


Figure 8 The resulting roadmap illustrates this transition in three time horizons for each cluster of recommendation

For further reading
Deliverable D9.1: Results and recommendations towards stakeholders
Deliverable D9.2: A transition roadmap towards probabilistic reliability management

Potential Impact:
The set of GARPUR results constitutes a step forward in matters of science and methodologies, supported by a large set of scientific publications, and to be pursued in the coming years through research and education programs among the academic community.

The project also resulted in a real dynamic among TSOs towards the implementation of a probabilistic approach to reliability management, with several partner TSOs committed to collaborate further on these topics, test some tools, share their experiences, and promote this work at ENTSO-E level.

Finally, the project produced a free corpus of knowledge, accessible to the whole community, as a tool to explain the probabilistic approach, highlight its added-value and benefits, and motivate towards new uses and further developments.

The potential impact of the GARPUR results are explained in the following, sorted by the different natures of Foreground in the same way as in deliverable D10.6 where more information can also be found:

- Methodologies and approaches
- Application of the methodologies/tools
- Platform and tools
- Recommendations further to the application of the methodology.

1 Methodologies and approaches
Mathematical formulation of reliability assessment and optimization problems
The mathematical formulation of reliability assessment and optimization problems expresses the GARPUR probabilistic reliability management approaches and criteria (RMACs) across all TSO decision making horizons (from the several years of long-term system development to the several minutes of real-time system operation). Such “generic” formulations go well beyond current practice and state-of-the-art literature, notably in terms of explicitly acknowledging (i) the relevant uncertainties and their spatio-temporal correlation, and, (ii) the inherent coupling between shorter-term and longer-term decision contexts.

The methodology lays the groundwork to perform reliability management based on a better knowledge and evaluation of reliability of supply and of the risk of power supply interruptions. The methodology has been designed to:
- Suit all time horizons and type of decisions covered by the reliability management activities
- Be compatible with the computational and data resources that most TSOs are in capacity to currently deploy
- Be sustainable, i.e. usable in the future, as more data and computational resources become available
- Be interpretable, in particular for comparison with the traditional N-1 approach.

This result is described in more detail in deliverable D2.2: Guidelines for implementing the new reliability assessment and optimization methodology and summarized in sections 2.2.1 and 2.2.2 of the public deliverable D9.1: Results and recommendations towards stakeholders.

Data and proxies needed to implement the new RMAC
Reliability assessment and control algorithms were developed in order to demonstrate the feasibility and tractability of the proposed methods. The algorithms tested scalability (the ability of the methods to be applied to any TSO control zone in Europe) and their flexibility in being able to address all the required time scales. The algorithms were incorporated into research-grade software to enable real-time reliability-management testing, operation planning and outage scheduling.
For real-time operation, the algorithms were designed as adaptations of existing methods currently used by TSOs in view of optimizing the combination of preventive and corrective control decisions - Probabilistic Security Constrained Optimal Power Flow (SCOPF), and evaluating the expected value of potential power supply interruptions according to real-time weather data.
For short-term operation planning, the algorithms developed allow for the study of different operating scenarios for immediate timeframes (next hours) and the ability to evaluate their expected socio-economic impact and probability of meeting the defined reliability target in real-time operation; and for the optimization of operation planning decisions in the case of uncertain renewable generation and weather conditions that could impact the real-time operation.
For mid-term outage scheduling, the algorithm developed uses massive parallel simulations and stochastic optimization (Cross-Entropy method) that make it possible to compare planning scenarios and to determine an outage schedule over a period of several months that minimizes the impact of planned outages on system operation.

These developments demonstrate that the Reliability Management and Criterion (RMAC) approach can indeed be translated into practical tools that can help TSOs better assess, compare, and optimize their decisions.

These results are described in more detail in deliverable D2.2: Guidelines for implementing the new reliability assessment and optimization methodology and summarized in section 2.2.2 of deliverable D9.1: Results and recommendations towards stakeholders.

Socio-Economic Impact Assessment Methodology
The Socio-Economic Impact Assessment (SEIA) methodology developed provides a tool for analysing the social welfare of the electricity market and quantifies costs, benefits, and surpluses of all market stakeholders - including electricity consumers, producers, TSOs, the government (taxes on electricity) and the environment (externalities).

The methodology details how to calculate various costs, including power supply interruption costs, the costs incurred by TSOs and producers, environmental costs and congestion costs related to different time horizons. A general mathematical formulation of these surpluses is given for different nodes, generation technologies, consumer types, time of occurrence and duration of interruptions, and pollutants. Guidelines illustrate how to apply the SEIA methodology for the long-term, mid-term and short-term time horizons and illustrations are provided for numerical test cases. The methodology also analyses the possible responses of electricity market stakeholders to changes in reliability level, electricity prices and taxes.

The methodology makes possible a full comparison between the traditional N-1 criterion based reliability management approach and the new probabilistic approach to reliability management. Its integration into TSO handbooks and guidelines will pave the way for a widespread adoption of the probabilistic approach to reliability management.

This result is described in more detail in deliverable D3.2: Recommendations for implementing the socio-economic impact assessment methodology and summarized in section 2.2.3 of the deliverable D9.1: Results and recommendations towards stakeholders.

Functional description of the "ideal" quantification platform
The design and architecture of a prototype software platform to compare different reliability management approaches and criteria is described, for usage by both academics and TSOs, performed for power system engineers and translation into specifications understandable by software professionals. The functional description is complemented by lessons learned from the actual implementation of the GQP and the tests performed by RTE.

The functional description provides the basis for the development of a tool enabling any member of the power system community to actually test and grasp the benefits of a probabilistic reliability management approach.

GARPUR training and dissemination material
Two Training webinars were organised with ISGAN to introduce GARPUR results to a wide audience within the Power system community. The results also include dissemination material (website, brochures) to synthesize the final projects results, conference material (live streaming video, proceedings) of the project’s final conference, a video introducing the project and teasing towards the public deliverables, and the full set of public deliverable and scientific publications and proceedings in free access

This freely accessible material can support the further dissemination of the probabilistic approach developed in GARPUR towards the power system community at large.

2 Application of the methodologies/tools
Probabilistic method to support decision making in Reliability Management, applied to system development
The method enables to provide system planners with a better understanding and a better visibility of risks related to the reliability of the load supply and expected operational impact on the socio-economic welfare (i.e. risk of curtailment, market price divergence, re-dispatching cost, ...) and linked to a specific grid structure and at a given future target year. Such information allows the planner to make sound decisions; to compare different grid reinforcement variants and to identify if an investment is worth. Result from this detailed assessment is based on the analysis of a large number of representative, possible future operating states - and on the ability to identify reliability and operational constraints taking into account the need to operate in a secure manner, provide market capacities and maintain the grid.

The main benefit of the approach is to better align system development with operational constraints as well as the enhanced transparency on system development decisions. Furthermore, the approach, designed as an evolution of existing industrial practices, facilitates its implementation and enable a relatively fast roll-out.

This result is described in more detail in Deliverable D4.2 Upgrading of the decision-making process for system development and summarized in the popular summaries (“Proof of concept on the system development process”) and in section 2.3.1 of public deliverable D9.1: Results and recommendations towards stakeholders.


Probabilistic method to support decision making in Reliability Management, applied to asset management
For the long-term problem (choice of asset management policies) the method developed enables to simulate, over a period of 20 years, the impact of specified asset management policies. In the simulations, the ageing process of grid components is modelled, and together with the impact of maintenance activities, is then translated into component failure rates varying time- and space-wise across the grid.

For the mid-term problem (choice of outage schedules) the method developed can help spot the weaknesses of a tentative outage schedule, and to compare alternative outage schedules.

These methods are expected to be much more accurate that the present ones, thus enabling to take better reliability-centered decisions for asset management. The methods could also be derived in order to evaluate the impact on system reliability of introducing a novel asset condition monitoring devices.

A more complete description of this result is available in Deliverable D5.2 Pathways for mid-term and long-term asset management and summarized in the popular summaries (Proof of concept on the asset management process) and in section 2.3.2 of public deliverable D9.1: Results and recommendations towards stakeholders.


Probabilistic method to support decision making in Reliability Management, applied to real-time and short-term operations
For real-time operation, methods and algorithms were developed to assess the system reliability in real-time, using operational data, models of the system response to contingencies, the weather conditions and other external factors impacting the failure rates of the components of the power system. It was also important to take into account the possible failure of corrective actions. The proposed real-time operation methods were tested and validated as part of the pilot tests in GARPUR on a live system.

Regarding short-term operational planning, methods and algorithms were developed that allowed TSOs to assess risk levels hours or days into the future by factoring in possible weather conditions, failure rates of components, and forecasting errors on load and renewable energy production. The algorithm generates and selects a number of scenarios to manage the forecast uncertainty. The uncertainty makes the short-term method more complex than the real-time problem as it has to take into account multiple time-steps leading up to real-time, which may include decisions made by the operator. The resulting short-term reliability evaluation would enable TSOs to have a more comprehensive assessment of reliability of supply in the context of operational planning.

The application of the new methodology gives a much higher resolution to risk assessment which will result in improved risk management. This means taking further preventive action to avoid high risk at some times, but at other times running the grid in a more beyond N-1 constraints. Quantifying risk in socio-economic terms, rather than technical terms, allows for easier communication to non-technical stakeholders and is valuable for direct cost-benefit analyses in risk management. Dynamic failure rates are a fundamental part of the methodology and impact the result greatly compared to fixed failure rates.

A more complete description of this result is available in Deliverable D6.2 How to upgrade reliability management for short-term decision-making and summarized in the popular summaries (Proof of concept on the asset management process) and in section 2.3.3 of public deliverable D9.1: Results and recommendations towards stakeholders.

3 Platform and tools
First prototype quantification platform
The prototype software platform developed made possible to perform numerical simulations of the different reliability management approaches - in particular allowing a comparison between the N-1 approach and the probabilistic approach developed by GARPUR.

The software platform aims to facilitate off-line studies on how to move away from the N-1 approach towards the probabilistic approach, by making it possible to compare how the same system would perform from a socio-economic point of view using each approach. This involves performing massive simulations at a realistic temporal and spatial resolution over several areas of the European interconnected power system.

The key idea behind this tool was to allow any member of the power system community to actually test and grasp the benefits of a probabilistic reliability management approach.

This result is described in more details in the popular summaries .

4 Recommendations further to the application of the methodology
Guidelines for implementing the new Reliability Assessment and optimization methodology
A set of guidelines for the progressive implementation of GARPUR approach in the practice of electric power system reliability management is developed. A six-step approach is proposed:
- Stage 1: prototype tools for probabilistic reliability assessment
- Stage 2: prototype tools for probabilistic reliability control
- Stage 3: gather experience and tune parameters with the help of experts
- Stage 4: industrial use of probabilistic reliability assessment
- Stage 5: improve data quality
- Stage 6: industrial use of probabilistic reliability control

These guidelines also include an analysis of the algorithms developed (see next result) to put the RMAC in practice.

This will ensure that the Reliability Management and Criterion (RMAC) approach can indeed be translated into practical tools that can help TSOs better assess, compare, and optimize their decisions.

Recommendations on how to evolve in to and industry grade tool
The pilot tests using the prototype quantification platform led to a set of key recommendations to improve its functionality, accuracy, computational tractability, and user friendliness. Three main categories of improvement are detailed in deliverable D7.3:
- Modelling detail and accuracy
- Computational tractability
- Interfacing: operators, inter-TSO, CIM extensions, SCADA, EMS.

This is valuable for project partners, coordination centres and regulators to understand the steps necessary. It can also be used to develop training and education on reliability assessment and control.

GARPUR recommendations
A set of recommendations resulting from all the lessons learnt from the project, organised into general and specific recommendations:
- General recommendations consist in:
- Making progressively implement in TSO practice the developed probabilistic reliability management approaches and criteria;
- Implementing the probabilistic methods for reliability management in parallel with the current N-1 based practice.
- Specific recommendations are related to:
- The need for changing the regulation and for properly anticipating the socio-economic impact of moving towards a probabilistic reliability management approach;
- Data collection and models of uncertainties;
- Reliability management methodology, algorithms, and software;
- The testing and implementation of the methods and algorithms developed within the project.

The recommendations constitute as concise summary of the main results of GARPUR with reference for further reading makes it easier for interested experts to get the right information. The result details further work and challenges to progress with a transition to a probabilistic reliability management. It also includes a set of recommendations which constitutes a sound foundation on probabilistic reliability management approaches upon which stakeholders of the electricity value chain (TSOs and non-TSOs) can develop further on.

All recommendations are detailed in deliverable D9.1: Results and recommendations towards stakeholders.


Transition roadmap to evolve from the current N-1 practices
Building upon the project findings as well as a wide consultation among European stakeholders, GARPUR designed a roadmap for the migration towards probabilistic risk management approaches by 2030. The roadmap details four complementary lines of action, along a 10 to 15 years’ time scale, and targeting four key stakeholders groups that are regulators, TSOs/ENTSO-E, research organisations and technology providers. The four lines of actions focus on:
- Regulation and socio-economic considerations
- Data and models of uncertainties
- Methodologies, algorithms and software
- Testing and implementation at the operational level.

This roadmap addresses main barriers and the preferred timing to implement a probabilistic approach. It gives a good overview of the next steps to take for different stakeholders, and impact at European scale is possible through ACER, CEER, ENTSO-E and similar organizations.

The roadmap for further implementation of the GARPUR results is given in Deliverable D9.2: A transition roadmap towards probabilistic reliability management .

List of Websites:
Public website: www.garpur-project.eu

Contact: oddbjorn.gjerde@sintef.no
final1-d11-1d-garpur-final-report.pdf