Completing the Forecast Cycle

Jasper Slingsby




Today we’ll complete the forecast cycle (through data assimilation) and spend a little time discussing decision support.

Data assimilation

The iterative ecological forecasting cycle in the context of the scientific method, demonstrating how we stand to learn from making iterative forecasts. From lecture on data assimilation by Michael Dietze.

Recap:

The iterative ecological forecasting cycle and the scientific method are closely aligned cycles or loops.


Bayes Theorem provides an iterative probabilistic framework that makes it easier to update predictions as new data become available, mirroring the scientific method and completing the forecasting cycle.


Today we’ll unpack this in a bit more detail.

Operational data assimilation

Most modelling workflows are set up to:

(a) fit a model to the available data and estimate parameters etc. (we’ll call this analysis), which is then often used to

(b) make predictions (which, if made forward through time, are typically forecasts)


They usually stop there. Few workflows are set up to make forecasts repeatedly or iteratively, updating predictions as new observations are made.


When making iterative forecasts, one could just refit the model to the entire dataset with the new observations added, but there are a few reasons why this may not be ideal…

Operational data assimilation

Why not just refit the model?

  1. Complex models and/or large datasets can be very computationally taxing, e.g. Slingsby, Moncrieff, and Wilson (2020) ran the hierarchical postfire model for the Cape Peninsula (~4000 MODIS pixels) over 4 days on a computing cluster with 64 cores and 1 TB of RAM…
  2. Depending on the model, you may not be able to reuse older data in the new model fit, meaning you can’t take advantage of all available data…
  3. Refitting the model doesn’t make the most of learning from new observations and the forecast cycle…

The alternative is to assimilate the data sequentially, through forecast cycles, taking in observations a few at a time as they’re made.

Operational data assimilation

Sequential data assimilation has several advantages:

  1. It can handle larger datasets, because you don’t have to assimilate all the data at once.
  2. If you start the model in the past and update towards the present day, you have the opportunity to validate your predictions and see how well the model does.
  • Does it improve with each iteration (i.e. is it learning)?
  • Think of it as letting your model get a run-up like a sportsperson before jumping/bowling/throwing.

Assimilating data sequentially is known as the sequential or operational data assimilation problem and occurs through two steps (the main components of the forecast cycle):

  • the forecast step, where we project our estimates of the current state forward in time
  • the analysis step, where we update our estimate of the state based on new observations

Operational data assimilation

The two main components of the forecast cycle are the forecast step (stippled lines), where we project from the initial state at time 0 (\(t_0\)) to the next time step (\(t_{0+1}\)), and the analysis step, where we use the forecast and new observations to get an updated estimate of the current state at \(t_{0+1}\), which would be used for the next forecast to \(t_{0+2}\).

The Forecast Step

While the first step has to be analysis, because you have to fit your model before you can make your first forecast, the forecast step is probably easier to explain first.


The goals of the forecast step are to:

  1. predict the value of the state variable(s) at the next time step
  2. indicate the uncertainty in our forecast, based on the uncertainty we have propagated through the model from its various sources (data, priors, parameters, etc.)

In short, we want to propagate uncertainty in our variable(s) of interest forward through time (and sometimes through space, depending on the goals).
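To make this concrete, here’s a minimal Monte Carlo sketch in Python (my illustration, not from the lecture; the logistic growth model, the uncertainty sources, and all numbers are hypothetical) of propagating uncertainty forward through time: sample every uncertain quantity, run each draw through the model, and summarise the spread at the forecast horizon.

```python
import numpy as np

rng = np.random.default_rng(1)
n_draws, n_steps = 5000, 10

# Sample the sources of uncertainty we want to propagate (all values hypothetical):
r = rng.normal(0.3, 0.05, n_draws)   # parameter uncertainty: growth rate
K = rng.normal(100, 10, n_draws)     # parameter uncertainty: carrying capacity
N = rng.normal(10, 2, n_draws)       # initial condition uncertainty

# Run every draw forward through a toy logistic growth model,
# adding process error at each time step.
for _ in range(n_steps):
    N = N + r * N * (1 - N / K) + rng.normal(0, 1, n_draws)

# The spread of the draws at the final time step *is* the forecast uncertainty.
print(f"forecast mean: {N.mean():.1f}, "
      f"95% interval: ({np.quantile(N, 0.025):.1f}, {np.quantile(N, 0.975):.1f})")
```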

The Forecast Step

There are a number of methods for propagating uncertainty into a forecast, mostly the same methods one would use to propagate uncertainty through a model, as discussed in the previous lecture.


Methods for propagating uncertainty through models (and into forecasts):

| Approach   | Distribution                  | Moments                                                                     |
|------------|-------------------------------|-----------------------------------------------------------------------------|
| Analytical | Variable Transform            | Analytical Moments (Kalman Filter); Taylor Series (Extended Kalman Filter)  |
| Numerical  | Monte Carlo (Particle Filter) | Ensemble (Ensemble Kalman Filter)                                           |

The Forecast Step

Explaining the different methods is beyond the scope of this module, but just a reminder that there’s a trade-off between them:

  • the most efficient (the Kalman Filter) also comes with the most stringent assumptions (linear models and homogeneity of variance only) and only provides moments (means, standard deviations, etc.)
  • the most flexible (the particle filter, often called sequential Monte Carlo (SMC), a Bayesian approach) is the most computationally taxing, but provides full distributions

In short, if your model isn’t too taxing, or you have access to a large computer and time to kill, SMC is probably best (and often easiest if you’re already working in Bayes)…
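To see what “moments only” means in practice, here’s a minimal one-dimensional Kalman Filter sketch (my own toy example; the linear model and all numbers are made up): the entire state estimate is carried as just a mean and a variance through the forecast and analysis steps.

```python
# 1-D Kalman Filter: the whole state estimate is two moments (mean, variance).
mu, var = 10.0, 4.0   # current state estimate
a, q = 1.05, 0.5      # linear process model N[t+1] = a * N[t]; process error variance
y, r = 11.0, 1.0      # new observation and its error variance

# Forecast step: push the moments through the linear model.
mu_f = a * mu
var_f = a**2 * var + q

# Analysis step: the Kalman gain weights forecast vs observation by their precision.
gain = var_f / (var_f + r)
mu_a = mu_f + gain * (y - mu_f)   # updated mean, pulled towards the observation
var_a = (1 - gain) * var_f        # updated variance, smaller than the forecast's
print(mu_a, var_a)
```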

The Analysis Step

The two main components of the forecast cycle are the forecast step (stippled lines), where we project from the initial state at time 0 (\(t_0\)) to the next time step (\(t_{0+1}\)), and the analysis step, where we use the forecast and new observations to get an updated estimate of the current state at \(t_{0+1}\), which would be used for the next forecast to \(t_{0+2}\).

The Analysis Step

NOTE: I’m only explaining the general principles for Bayes. There are frequentist approaches, but I’m not going to go there.

This step involves using Bayes Theorem to combine our prior knowledge (our forecast) with new observations (at \(t_{0+1}\)) to generate an updated estimate of the state at \(t_{0+1}\), which seeds the next forecast (to \(t_{0+2}\)).

The forecast cycle chaining together applications of Bayes Theorem at each timestep (\(t_0, t_1, ...\)). The forecast from one timestep becomes the prior for the next. When using MCMC, the forecast can be sampled directly from the posterior distribution.

The Analysis Step

This is essentially sequential Monte Carlo (SMC), also known as a particle filter.

The procedure of particle filtering from Kim et al. (2018). Here, values are sampled from the posterior at time \(k-1\) (= forecast for time \(k\)) to be the prior in the analysis at time \(k\). These are then weighted by the likelihood (i.e. the new data) to update the initial conditions used to generate a new posterior (forecast for time \(k+1\)), which is then resampled to provide the prior in time \(k+1\), and so on.
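A minimal sketch of that procedure in Python (my own toy example, not Kim et al.’s code; the process model, observation error, and data values are all hypothetical):

```python
import numpy as np

rng = np.random.default_rng(7)

def process_model(particles):
    # Hypothetical process model with process error; substitute your own.
    return particles * 1.05 + rng.normal(0, 0.1, particles.size)

observations = [1.2, 1.3, 1.5]   # made-up new data arriving at successive timesteps
obs_sd = 0.2                     # assumed observation error

particles = rng.normal(1.0, 0.3, 5000)   # sample from the posterior at time k-1

for y in observations:
    # Forecast step: the posterior at k-1 becomes the prior (forecast) for time k.
    particles = process_model(particles)
    # Analysis step: weight each particle by the likelihood of the new observation...
    weights = np.exp(-0.5 * ((y - particles) / obs_sd) ** 2)
    weights /= weights.sum()
    # ...then resample to get the updated posterior that seeds the next cycle.
    particles = rng.choice(particles, size=particles.size, p=weights)
    print(f"updated state estimate: {particles.mean():.2f} ± {particles.std():.2f}")
```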

The Analysis Step


This is better than just using the new data as your updated state, because:

  • it uses our prior information and understanding
  • it allows our model to learn and (hopefully) improve with each iteration
  • there is likely error (noise) in the new data, so it can’t necessarily be trusted more than our prior understanding anyway

Fortunately, Bayes deals with this very nicely:

  • if the forecast (prior) is uncertain and the new data precise, then the data will prevail
  • if the forecast is precise and the new data uncertain, then the posterior will retain the legacy of previous observations
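For the simplest case (normal forecast, normal observation error) this weighting can be written down exactly (standard conjugate Bayes, not specific to any particular model): the posterior mean is a precision-weighted average of the forecast mean \(\mu_f\) and the observation \(y\),

\[
\mu_{post} = \frac{\tau_f \, \mu_f + \tau_y \, y}{\tau_f + \tau_y},
\qquad
\tau_{post} = \tau_f + \tau_y,
\qquad \text{where } \tau = 1/\sigma^2 \text{ (the precision)}.
\]

Whichever of the forecast or the observation has the higher precision gets the greater weight, and the posterior is always at least as precise as either on its own.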

The Analysis Step

Comparison of situations where there is (A) high forecast uncertainty (the prior) and low observation error (data), versus (B) low forecast uncertainty and high observation error on the posterior probability from the analysis step. Note that the data and prior have the same means in panels A and B, but the variances differ.

Ensemble forecasts

Lastly, just a note that I’ve mostly dealt with single forecasts and haven’t talked about how to deal with ensemble forecasts. There are data assimilation methods to deal with them, but we don’t have time to cover them.


The methods, and how you apply them, depend on the kind of ensemble. Usually, ensembles can be divided into three kinds, but you can have mixes of all three:

  1. Where you use the same model, but vary the inputs to explore different scenarios.
  2. Where you have a set of nested models of increasing complexity (e.g. like our postfire models with and without the seasonality term).
  3. A mix of models with completely different model structures (or even approaches: empirical vs mechanistic, frequentist vs Bayesian, etc) aimed at forecasting the same thing.

Decision Support


Probably the hardest part of the whole ecological forecasting business… people!


It is also a huge topic. Here I just touch on a few hints and difficulties.

Decision Support


First and foremost, the decision at hand may not be amenable to a quantitative approach.

  • Ecological forecasting requires a clearly defined information need with measurable (and modelable) state variables, framed within one or multiple decision alternatives (scenarios).

Decision Support


Secondly, there’s also the risk of external factors making the forecasts unreliable, especially if they are not controlled by the decision maker and/or their probability is unknown (e.g. fire, pandemics, etc).

Decision Support

One way to try to deal with external factors is by developing scenarios with different boundary conditions.


  • e.g. scenarios with and without a fire, or different future climate states under alternative development pathways, etc.
  • Scenarios are often “what if” statements designed to address major sources of uncertainty that make it near-impossible to make accurate predictions with a single forecast.

IPCC AR6 illustrative mitigation pathways (IMPs).

Decision Support


A reminder of the distinction between predictions versus projections:

  • predictions are statements about the probability of the occurrence of events or the state of variables in the future based on what we currently know
  • projections are statements about the probability of the occurrence of events or the state of variables in the future given specific scenarios with clear boundary conditions

Decision Support


In an ideal world…

You’ll be working with an organized team that is a well-oiled machine at implementing Adaptive Management and Structured Decision Making and you can naturally slot into their workflow.


The advantage of Adaptive Management and Structured Decision Making is that both are founded on the concept of iterative learning cycles, which they have in common with the ecological forecasting cycle and the scientific method.

Decision Support

Conceptual relationships between iterative ecological forecasting, adaptive decision-making, adaptive monitoring, and the scientific method cycles (Dietze et al. 2018).


The iterative ecological forecast cycle integrates nicely with Adaptive Management…

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


Focused on the process of coming to a decision, not the process of management, but very useful in the first iteration of the Adaptive Management Cycle.


Could easily be the topic of a whole course in itself, e.g. this online course by the US Fish and Wildlife Service.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It is valuable when there are many stakeholders with disparate interests.

  • decisions are ultimately about values and often require evaluating trade-offs among properties with incomparable units - e.g. people housed/fed/watered vs species saved from extinction…
  • this can be a highly emotive space, and greatly benefits from a structured facilitation process

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It tries to bring all issues and values to light to be considered in a transparent framework where trade-offs can be identified and considered.

  • You can’t make the right choice if it isn’t on the table…
  • Step 0 is just identifying the necessary stakeholders (and getting them to the table).
    • e.g. indigenous knowledge holders and other local experts - not just the authorities responsible for making the decisions.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


It directly addresses the social, political or cognitive biases that marginalise some values or alternatives.

  • Many decisions pit people’s immediate needs (water, housing, etc) against the environment. We’d rather ignore that choosing one is choosing against the other, but if we’re not transparent about this we’re not going to learn from our decisions and improve them in the next iteration.

Structured Decision Making


The Structured Decision Making Cycle sensu Gregory et al. (2012).


But…

  • Very tricky to do well and easy to do badly…
  • Requires a good, well-trained facilitator who understands stakeholder and researcher needs
  • Needs trust and buy-in from participants
  • Can take a lot of time to get right…
  • Often simplifies a problem so that it is feasible to analyse

Ideal world? (see the “In an ideal world…” slide above)


The beauty for the forecaster in this scenario is that a lot of the work is already done.

  • The decision alternatives (scenarios) have been well framed.
  • The performance measures, state variables of interest and associated covariates have mostly been identified.
  • Iterations of the learning cycle may even have already begun (through the Adaptive Management Cycle), and all you need to do is develop the existing qualitative model into something more quantitative as more data and understanding accumulate.
    • Think of the Protea example in the second lecture, where the demography of these species is already used for decision making using semi-quantitative “rules of thumb”.

Ideal world?


The Structured Decision Making Cycle sensu Gregory et al. (2012).


Can focus on estimating (forecasting) consequences and evaluating trade-offs among alternatives (steps 4 and 5), rather than having to do the whole process from scratch.

Estimating consequences…


Often one has to forecast multiple state variables, which may or may not be related to each other.


Decision-makers may also have to consider trade-offs among qualitative as well as quantitative consequences under different decision scenarios.


What’s missing?


An example consequence table adapted from Gregory et al. (2012) by Environmental Science Associates.

Uncertainty?

Model

Quantify and propagate uncertainty!

“It is better to be honestly uncertain than confidently wrong.”

Sensitivity analysis:

  • How robust is the decision to uncertainty in the model and assumptions?

  • How wrong does your model have to be before the decision changes?
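A minimal sketch of that idea (the consequence models and numbers are entirely hypothetical, just to illustrate the mechanics): sweep the uncertain quantity across its plausible range and check whether the preferred alternative changes.

```python
import numpy as np

# Hypothetical consequence models: expected outcome score for two management
# alternatives as a function of an uncertain parameter s (e.g. seedling survival).
def outcome_A(s):   # e.g. intervene
    return 0.6 + 0.3 * s

def outcome_B(s):   # e.g. do nothing
    return 0.9 - 0.2 * s

# Sweep s across its plausible range and see whether the decision flips.
for s in np.linspace(0.0, 1.0, 11):
    best = "A" if outcome_A(s) > outcome_B(s) else "B"
    print(f"s = {s:.1f}: prefer alternative {best}")

# If the same alternative wins across the whole range, the decision is robust
# to this uncertainty; if it flips partway, that is how wrong the model can be
# before the decision changes, and reducing that uncertainty is worth the effort.
```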

Communication

  • Communicate uncertainty in the model and forecasts clearly to decision-makers.

  • Be transparent about the limitations of the model and the assumptions made.

  • Use uncertainty visualizations to help decision-makers understand the range of possible outcomes.

  • Frame uncertainty in multiple ways - e.g. 5% vs 1 in 20

References

Dietze, Michael C, Andrew Fox, Lindsay M Beck-Johnson, Julio L Betancourt, Mevin B Hooten, Catherine S Jarnevich, Timothy H Keitt, et al. 2018. “Iterative near-term ecological forecasting: Needs, opportunities, and challenges.” Proceedings of the National Academy of Sciences of the United States of America 115 (7): 1424–32. https://doi.org/10.1073/pnas.1710231115.
Gregory, Robin, Lee Failing, Michael Harstone, Graham Long, Tim McDaniels, and Dan Ohlson. 2012. Structured Decision Making: A Practical Guide to Environmental Management Choices. John Wiley & Sons.
Kim, Gibeom, Hyeonmin Kim, Enrico Zio, and Gyunyoung Heo. 2018. “Application of particle filtering for prognostics with measurement uncertainty in nuclear power plants.” Nuclear Engineering and Technology 50 (8): 1314–23. https://doi.org/10.1016/j.net.2018.08.002.
Slingsby, Jasper A, Glenn R Moncrieff, and Adam M Wilson. 2020. “Near-real time forecasting and change detection for an open ecosystem with complex natural dynamics.” ISPRS Journal of Photogrammetry and Remote Sensing 166 (August): 15–25. https://doi.org/10.1016/j.isprsjprs.2020.05.017.