Running forecasts#

In order to fit the simulation model to observations and generate forecasts, we need to:

Choose prior distributions for \(x(t)\), \(y(t)\), and \(z(t)\);
Define an input file for each observation model; and
Record summary statistics such as predictive credible intervals for each observation model, and simulated observations for \(z(t)\).

These changes are indicated by the highlighted lines in the following scenario definition:

An example scenario for generating forecasts for the Lorenz-63 system.#

# NOTE: Save this file as 'lorenz63_forecast.toml'

[components]
model = "pypfilt.examples.lorenz.Lorenz63"
time = "pypfilt.Scalar"
sampler = "pypfilt.sampler.LatinHypercube"
summary = "pypfilt.summary.HDF5"

[time]
start = 0.0
until = 25.0
steps_per_unit = 10
summaries_per_unit = 10

[prior]
sigma = { name = "constant", args.value = 10 }
rho = { name = "constant", args.value = 28 }
beta = { name = "constant", args.value = 2.66667 }
x = { name = "uniform", args.loc = -5, args.scale = 10 }
y = { name = "uniform", args.loc = -5, args.scale = 10 }
z = { name = "uniform", args.loc = -5, args.scale = 10 }

[observations.x]
model = "pypfilt.examples.lorenz.ObsLorenz63"
file = "lorenz63-x.ssv"

[observations.y]
model = "pypfilt.examples.lorenz.ObsLorenz63"
file = "lorenz63-y.ssv"

[observations.z]
model = "pypfilt.examples.lorenz.ObsLorenz63"
file = "lorenz63-z.ssv"

[summary.tables]
forecasts.component = "pypfilt.summary.PredictiveCIs"
forecasts.credible_intervals = [50, 60, 70, 80, 90, 95]
sim_z.component = "pypfilt.summary.SimulatedObs"
sim_z.observation_unit = "z"

[filter]
particles = 500
prng_seed = 2001
history_window = -1
resample.threshold = 0.25

[scenario.forecast]

Note

Call save_lorenz63_scenario_files() to save this scenario file (and the others used in this tutorial) in the working directory.

Observations will be read from these files when generating forecasts with pypfilt.forecast():

def run_lorenz63_forecast(filename=None):
    scenario_file = 'lorenz63_forecast.toml'
    instances = list(pypfilt.load_instances(scenario_file))
    instance = instances[0]

    # Run a forecast from t = 20.
    forecast_time = 20
    context = instance.build_context()
    return pypfilt.forecast(context, [forecast_time], filename=filename)

If you pass a filename to pypfilt.forecast(), all of the summary tables for the estimation pass and each forecasting pass will be saved to that file, as HDF5 data sets.

Note

HDF5 is a file format that allows you to store lots of data tables and related metadata in a single file, and to load these data tables as if they were NumPy arrays. All of the summary tables recorded by pypfilt are NumPy structured arrays. You can explore HDF5 files with the h5py package, which makes it easy to load and store data tables.

You can also load tables as Pandas or Polars data frames; see Working with data frames.