Completing the Forecast Cycle

Jasper Slingsby




Today we’ll complete the forecast cycle (through data assimilation) and spend a little time discussing decision support.

Data assimilation

The iterative ecological forecasting cycle in the context of the scientific method, demonstrating how we stand to learn from making iterative forecasts. From lecture on data assimilation by Michael Dietze.

Recap:

The iterative ecological forecasting cycle and the scientific method are closely aligned cycles or loops.


Bayes Theorem provides an iterative probabilistic framework that makes it easier to update predictions as new data become available, mirroring the scientific method and completing the forecasting cycle.


Today we’ll unpack this in a bit more detail.

Operational data assimilation

Most modelling workflows are set up to:

(a) fit a model to the available data and estimate parameters etc. (we’ll call this analysis), which is then often used to

(b) make predictions (which, if made forward through time, are typically forecasts)


They usually stop there. Few workflows are set up to make forecasts repeatedly or iteratively, updating predictions as new observations are made.


When making iterative forecasts, one could just refit the model to the entire dataset with the new observations added, but there are a few reasons why this may not be ideal…

Operational data assimilation

Why not just refit the model?

  1. Complex models and/or large datasets can be very computationally taxing, e.g. Slingsby, Moncrieff, and Wilson (2020) ran the hierarchical postfire model for the Cape Peninsula (~4000 MODIS pixels) over 4 days on a computing cluster with 64 cores and 1 TB of RAM…
  2. Depending on the model, you may not be able to reuse older data in the new model fit, meaning you can’t take advantage of all available data…
  3. Refitting the model doesn’t make the most of learning from new observations and the forecast cycle…

The alternative is to assimilate the data sequentially, through forecast cycles, taking in observations a few at a time as they’re made.

Operational data assimilation

Sequential data assimilation has several advantages:

  1. It can handle larger datasets, because you don’t have to assimilate all the data at once.
  2. If you start the model in the past and update towards the present day, you have the opportunity to validate your predictions and see how well the model does.
  • Does it improve with each iteration (i.e. is it learning)?
  • Think of it as letting your model get a run-up like a sportsperson before jumping/bowling/throwing.

Assimilating data sequentially is known as the sequential or operational data assimilation problem and occurs through two steps (the main components of the forecast cycle):

  • the forecast step, where we project our estimates of the current state forward in time
  • the analysis step, where we update our estimate of the state based on new observations

Operational data assimilation

The two main components of the forecast cycle are the forecast step (stippled lines), where we project from the initial state at time 0 (\(t_0\)) to the next time step (\(t_{0+1}\)), and the analysis step, where we use the forecast and new observations to get an updated estimate of the current state at \(t_{0+1}\), which would be used for the next forecast to \(t_{0+2}\).

The Forecast Step

While the first step has to be analysis, because you have to fit your model before you can make your first forecast, the forecast step is probably easier to explain first.


The goals of the forecast step are to:

  1. predict the value of the state variable(s) at the next time step
  2. indicate the uncertainty in our forecast, based on the uncertainty we have propagated through the model from its various sources (data, priors, parameters, etc.)

In short, we want to propagate uncertainty in our variable(s) of interest forward through time (and sometimes through space, depending on the goals).
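To make this concrete, here’s a minimal Monte Carlo sketch in Python (my illustration, not from the lecture; the logistic growth model, the uncertainty sources, and all numbers are hypothetical) of propagating uncertainty forward through time: sample every uncertain quantity, run each draw through the model, and summarise the spread at the forecast horizon.

```python
import numpy as np

rng = np.random.default_rng(1)
n_draws, n_steps = 5000, 10

# Sample the sources of uncertainty we want to propagate (all values hypothetical):
r = rng.normal(0.3, 0.05, n_draws)   # parameter uncertainty: growth rate
K = rng.normal(100, 10, n_draws)     # parameter uncertainty: carrying capacity
N = rng.normal(10, 2, n_draws)       # initial condition uncertainty

# Run every draw forward through a toy logistic growth model,
# adding process error at each time step.
for _ in range(n_steps):
    N = N + r * N * (1 - N / K) + rng.normal(0, 1, n_draws)

# The spread of the draws at the final time step *is* the forecast uncertainty.
print(f"forecast mean: {N.mean():.1f}, "
      f"95% interval: ({np.quantile(N, 0.025):.1f}, {np.quantile(N, 0.975):.1f})")
```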

The Forecast Step

There are a number of methods for propagating uncertainty into a forecast, mostly the same methods one would use to propagate uncertainty through a model, as discussed in the previous lecture.


Methods for propagating uncertainty through models (and into forecasts):

| Approach   | Distribution                  | Moments                                                                     |
|------------|-------------------------------|-----------------------------------------------------------------------------|
| Analytical | Variable Transform            | Analytical Moments (Kalman Filter); Taylor Series (Extended Kalman Filter)  |
| Numerical  | Monte Carlo (Particle Filter) | Ensemble (Ensemble Kalman Filter)                                           |

The Forecast Step

Explaining the different methods is beyond the scope of this module, but just a reminder that there’s a trade-off between them:

  • the most efficient (the Kalman Filter) also comes with the most stringent assumptions (linear models and homogeneity of variance only) and only provides moments (means, standard deviations, etc.)
  • the most flexible (the particle filter, often called sequential Monte Carlo (SMC), a Bayesian approach) is the most computationally taxing, but provides full distributions

In short, if your model isn’t too taxing, or you have access to a large computer and time to kill, SMC is probably best (and often easiest if you’re already working in Bayes)…
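To see what “moments only” means in practice, here’s a minimal one-dimensional Kalman Filter sketch (my own toy example; the linear model and all numbers are made up): the entire state estimate is carried as just a mean and a variance through the forecast and analysis steps.

```python
# 1-D Kalman Filter: the whole state estimate is two moments (mean, variance).
mu, var = 10.0, 4.0   # current state estimate
a, q = 1.05, 0.5      # linear process model N[t+1] = a * N[t]; process error variance
y, r = 11.0, 1.0      # new observation and its error variance

# Forecast step: push the moments through the linear model.
mu_f = a * mu
var_f = a**2 * var + q

# Analysis step: the Kalman gain weights forecast vs observation by their precision.
gain = var_f / (var_f + r)
mu_a = mu_f + gain * (y - mu_f)   # updated mean, pulled towards the observation
var_a = (1 - gain) * var_f        # updated variance, smaller than the forecast's
print(mu_a, var_a)
```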

The Analysis Step

The two main components of the forecast cycle are the forecast step (stippled lines), where we project from the initial state at time 0 (\(t_0\)) to the next time step (\(t_{0+1}\)), and the analysis step, where we use the forecast and new observations to get an updated estimate of the current state at \(t_{0+1}\), which would be used for the next forecast to \(t_{0+2}\).

The Analysis Step

NOTE: I’m only explaining the general principles for Bayes. There are frequentist approaches, but I’m not going to go there.

This step involves using Bayes Theorem to combine our prior knowledge (our forecast) with new observations (at \(t_{0+1}\)) to generate an updated estimate of the state at \(t_{0+1}\), which seeds the next forecast (to \(t_{0+2}\)).

The forecast cycle chaining together applications of Bayes Theorem at each timestep (\(t_0, t_1, ...\)). The forecast from one timestep becomes the prior for the next. When using MCMC, the forecast can be sampled directly from the posterior distribution.

The Analysis Step

This is essentially sequential Monte Carlo (SMC), also known as a particle filter.

The procedure of particle filtering from Kim et al. (2018). Here, values are sampled from the posterior at time \(k-1\) (= forecast for time \(k\)) to be the prior in the analysis at time \(k\). These are then weighted by the likelihood (i.e. the new data) to update the initial conditions used to generate a new posterior (forecast for time \(k+1\)), which is then resampled to provide the prior in time \(k+1\), and so on.
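A minimal sketch of that procedure in Python (my own toy example, not Kim et al.’s code; the process model, observation error, and data values are all hypothetical):

```python
import numpy as np

rng = np.random.default_rng(7)

def process_model(particles):
    # Hypothetical process model with process error; substitute your own.
    return particles * 1.05 + rng.normal(0, 0.1, particles.size)

observations = [1.2, 1.3, 1.5]   # made-up new data arriving at successive timesteps
obs_sd = 0.2                     # assumed observation error

particles = rng.normal(1.0, 0.3, 5000)   # sample from the posterior at time k-1

for y in observations:
    # Forecast step: the posterior at k-1 becomes the prior (forecast) for time k.
    particles = process_model(particles)
    # Analysis step: weight each particle by the likelihood of the new observation...
    weights = np.exp(-0.5 * ((y - particles) / obs_sd) ** 2)
    weights /= weights.sum()
    # ...then resample to get the updated posterior that seeds the next cycle.
    particles = rng.choice(particles, size=particles.size, p=weights)
    print(f"updated state estimate: {particles.mean():.2f} ± {particles.std():.2f}")
```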

The Analysis Step


This is better than just using the new data as your updated state, because:

  • it uses our prior information and understanding
  • it allows our model to learn and (hopefully) improve with each iteration
  • there is likely error (noise) in the new data, so it can’t necessarily be trusted more than our prior understanding anyway

Fortunately, Bayes deals with this very nicely:

  • if the forecast (prior) is uncertain and the new data precise, then the data will prevail
  • if the forecast is precise and the new data uncertain, then the posterior will retain the legacy of previous observations
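For the simplest case (normal forecast, normal observation error) this weighting can be written down exactly (standard conjugate Bayes, not specific to any particular model): the posterior mean is a precision-weighted average of the forecast mean \(\mu_f\) and the observation \(y\),

\[
\mu_{post} = \frac{\tau_f \, \mu_f + \tau_y \, y}{\tau_f + \tau_y},
\qquad
\tau_{post} = \tau_f + \tau_y,
\qquad \text{where } \tau = 1/\sigma^2 \text{ (the precision)}.
\]

Whichever of the forecast or the observation has the higher precision gets the greater weight, and the posterior is always at least as precise as either on its own.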

The Analysis Step

Comparison of situations where there is (A) high forecast uncertainty (the prior) and low observation error (data), versus (B) low forecast uncertainty and high observation error on the posterior probability from the analysis step. Note that the data and prior have the same means in panels A and B, but the variances differ.

Ensemble forecasts

Lastly, just a note that I’ve mostly dealt with single forecasts and haven’t talked about how to deal with ensemble forecasts. There are data assimilation methods to deal with them, but we don’t have time to cover them.


The methods, and how you apply them, depend on the kind of ensemble. Usually, ensembles can be divided into three kinds, but you can have mixes of all three:

  1. Where you use the same model, but vary the inputs to explore different scenarios.
  2. Where you have a set of nested models of increasing complexity (e.g. like our postfire models with and without the seasonality term).
  3. A mix of models with completely different model structures (or even approaches: empirical vs mechanistic, frequentist vs Bayesian, etc) aimed at forecasting the same thing.

Decision Support


Probably the hardest part of the whole ecological forecasting business… people!


It is also a huge topic. Here I just touch on a few hints and difficulties.

Decision Support


First and foremost, the decision at hand may not be amenable to a quantitative approach.

  • Ecological forecasting requires a clearly defined information need with measurable (and modelable) state variables, framed within one or multiple decision alternatives (scenarios).

Decision Support


Secondly, there’s also the risk of external factors making the forecasts unreliable, especially if they are not controlled by the decision maker and/or their probability is unknown (e.g. fire, pandemics, etc).

Decision Support

One way to try to deal with external factors is by developing scenarios with different boundary conditions.


  • e.g. scenarios with and without a fire, or different future climate states under alternative development pathways, etc.
  • Scenarios are often “what if” statements designed to address major sources of uncertainty that make it near-impossible to make accurate predictions with a single forecast.

IPCC AR6 illustrative mitigation pathways (IMPs).

Decision Support


A reminder of the distinction between predictions versus projections:

  • predictions are statements about the probability of the occurrence of events or the state of variables in the future based on what we currently know
  • projections are statements about the probability of the occurrence of events or the state of variables in the future given specific scenarios with clear boundary conditions

Decision Support


In an ideal world…

You’ll be working with an organized team that is a well-oiled machine at implementing Adaptive Management and Structured Decision Making and you can naturally slot into their workflow.


The advantage of Adaptive Management and Structured Decision Making is that both are founded on the concept of iterative learning cycles, which they have in common with the ecological forecasting cycle and the scientific method.

Decision Support

Conceptual relationships between iterative ecological forecasting, adaptive decision-making, adaptive monitoring, and the scientific method cycles (Dietze et al. 2018).


The iterative ecological forecast cycle integrates nicely with Adaptive Management…

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


Focused on the process of coming to a decision, not the process of management, but very useful in the first iteration of the Adaptive Management Cycle.


Could easily be the topic of a whole course in itself, e.g. this online course by the US Fish and Wildlife Service.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It is valuable when there are many stakeholders with disparate interests.

  • decisions are ultimately about values and often require evaluating trade-offs among properties with incomparable units - e.g. people housed/fed/watered vs species saved from extinction…
  • this can be a highly emotive space, and greatly benefits from a structured facilitation process

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It tries to bring all issues and values to light to be considered in a transparent framework where trade-offs can be identified and considered.

  • You can’t make the right choice if it isn’t on the table…
  • Step 0 is just identifying the necessary stakeholders (and getting them to the table).
    • e.g. indigenous knowledge holders and other local experts - not just the authorities responsible for making the decisions.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It directly addresses the social, political or cognitive biases that marginalise some values or alternatives.

  • Many decisions pit people’s immediate needs (water, housing, etc) against the environment. We’d rather ignore that choosing one is choosing against the other, but if we’re not transparent about this we’re not going to learn from our decisions and improve them in the next iteration.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


But…

  • Very tricky to do well and easy to do badly…
  • Requires a good, well-trained facilitator who understands stakeholder and researcher needs
  • Needs trust and buy-in from participants
  • Can take a lot of time to get right…
  • Often simplifies a problem so that it is feasible to analyse

Ideal world? (see the “In an ideal world…” slide above)


The beauty for the forecaster in this scenario is that a lot of the work is already done.

  • The decision alternatives (scenarios) have been well framed.
  • The performance measures, state variables of interest and associated covariates have mostly been identified.
  • Iterations of the learning cycle may even have already begun (through the Adaptive Management Cycle), and all you need to do is develop the existing qualitative model into something more quantitative as more data and understanding accumulate.
    • Think of the Protea example in the second lecture, where the demography of these species is already used for decision making using semi-quantitative “rules of thumb”.

Ideal world?


The Structured Decision Making Cycle sensu Gregory et al. (2012).


Can focus on estimating (forecasting) consequences and evaluating trade-offs among alternatives (steps 4 and 5), rather than having to do the whole process from scratch.

Estimating consequences…


Often one has to forecast multiple state variables, which may or may not be related to each other.


Decision-makers may also have to consider trade-offs among qualitative as well as quantitative consequences under different decision scenarios.


What’s missing?


An example consequence table adapted from Gregory et al. (2012) by Environmental Science Associates.

Uncertainty?

Model

Quantify and propagate uncertainty!

“It is better to be honestly uncertain than confidently wrong.”

Sensitivity analysis:

  • How robust is the decision to uncertainty in the model and assumptions?

  • How wrong does your model have to be before the decision changes?
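A minimal sketch of that idea (the consequence models and numbers are entirely hypothetical, just to illustrate the mechanics): sweep the uncertain quantity across its plausible range and check whether the preferred alternative changes.

```python
import numpy as np

# Hypothetical consequence models: expected outcome score for two management
# alternatives as a function of an uncertain parameter s (e.g. seedling survival).
def outcome_A(s):   # e.g. intervene
    return 0.6 + 0.3 * s

def outcome_B(s):   # e.g. do nothing
    return 0.9 - 0.2 * s

# Sweep s across its plausible range and see whether the decision flips.
for s in np.linspace(0.0, 1.0, 11):
    best = "A" if outcome_A(s) > outcome_B(s) else "B"
    print(f"s = {s:.1f}: prefer alternative {best}")

# If the same alternative wins across the whole range, the decision is robust
# to this uncertainty; if it flips partway, that is how wrong the model can be
# before the decision changes, and reducing that uncertainty is worth the effort.
```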

Communication

  • Communicate uncertainty in the model and forecasts clearly to decision-makers.

  • Be transparent about the limitations of the model and the assumptions made.

  • Use uncertainty visualizations to help decision-makers understand the range of possible outcomes.

  • Frame uncertainty in multiple ways - e.g. 5% vs 1 in 20

References

Dietze, Michael C, Andrew Fox, Lindsay M Beck-Johnson, Julio L Betancourt, Mevin B Hooten, Catherine S Jarnevich, Timothy H Keitt, et al. 2018. “Iterative near-term ecological forecasting: Needs, opportunities, and challenges.” Proceedings of the National Academy of Sciences of the United States of America 115 (7): 1424–32. https://doi.org/10.1073/pnas.1710231115.
Gregory, Robin, Lee Failing, Michael Harstone, Graham Long, Tim McDaniels, and Dan Ohlson. 2012. Structured Decision Making: A Practical Guide to Environmental Management Choices. John Wiley & Sons.
Kim, Gibeom, Hyeonmin Kim, Enrico Zio, and Gyunyoung Heo. 2018. “Application of particle filtering for prognostics with measurement uncertainty in nuclear power plants.” Nuclear Engineering and Technology 50 (8): 1314–23. https://doi.org/10.1016/j.net.2018.08.002.
Slingsby, Jasper A, Glenn R Moncrieff, and Adam M Wilson. 2020. “Near-real time forecasting and change detection for an open ecosystem with complex natural dynamics.” ISPRS Journal of Photogrammetry and Remote Sensing 166 (August): 15–25. https://doi.org/10.1016/j.isprsjprs.2020.05.017.