Customizing your own model parameters
This page delves into some of the more powerful things you can do with pipelines. It is aimed at people who have already completed the How to write advanced pipelines tutorial.
Pipeline model parameters
Each pipeline step has parameters, but the pipeline model itself can also have its own parameters. Unlike step parameters, model parameters are entirely yours to define.
The model parameters are essentially custom variables that get replaced at run-time.
These variables are denoted by a $ in the pipeline file.
The model parameters can customize the behaviour of the model across all input data. Model parameters can encapsulate an assumption you are making about your model, which you may want to vary.
As a simple example, say we only wanted to run our model for one particular construction type at a time.
But we wanted to be able to vary that construction type easily, without editing the pipeline file each time.
We can turn the construction type we filter on into a construction parameter, e.g.
input('assets.csv', name: 'asset') -> filter(filter: asset.construction = $construction)
When we run our model, we can then specify what specific construction type we are interested in, e.g.
riskscape model run my_pipeline --param "construction='steel'"
You can define the default values for your parameters in your model’s INI definition.
Just use param.<name> = <default>, e.g.
[model my_pipeline]
framework = pipeline
location = my_pipeline.txt
param.construction = 'timber'
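Conceptually, the default from the INI file and any --param override feed into a text substitution on the pipeline. The following is a minimal Python sketch of that idea only. It uses the standard library's string.Template as a stand-in, and the substitute_params helper is hypothetical; it is not how RiskScape itself is implemented.

```python
from string import Template

# Pipeline text with one $-parameter, copied from the example above.
PIPELINE = (
    "input('assets.csv', name: 'asset') -> "
    "filter(filter: asset.construction = $construction)"
)


def substitute_params(pipeline_text, defaults, overrides=None):
    """Replace each $name placeholder, letting overrides win over defaults.

    Hypothetical helper for illustration; not part of RiskScape.
    """
    values = dict(defaults)
    values.update(overrides or {})
    # substitute() raises KeyError if a $name has no value at all
    return Template(pipeline_text).substitute(values)


# Default taken from the INI example: param.construction = 'timber'
defaults = {"construction": "'timber'"}

default_run = substitute_params(PIPELINE, defaults)
# Equivalent in spirit to: --param "construction='steel'"
steel_run = substitute_params(PIPELINE, defaults, {"construction": "'steel'"})

print(default_run)
print(steel_run)
```

Note that the single quotes are part of the parameter value, which is why they appear in both the INI default and the command-line override.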
Parameter properties
When you want to share a model with other users, you can restrict what sort of values the user can specify for certain parameters. For example, you could specify that a parameter called damage_ratio must always be numeric, or must always be within a certain range, such as 0.0 to 1.0.
You can specify what properties a parameter is expected to have in your project.ini file.
For example, the damage_ratio parameter would look like this:
param.damage_ratio.properties = numeric, min: 0.0, max: 1.0
If the user tries to enter anything that is not a numerical value between 0.0 and 1.0, then they will get an error when they try to run the model.
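To make the effect of these properties concrete, here is a rough Python sketch of the kind of check they imply. The parse_properties and validate_numeric helpers are hypothetical and only cover the numeric, min, and max properties; RiskScape's own validation and error messages will differ.

```python
SPEC = "numeric, min: 0.0, max: 1.0"


def parse_properties(spec):
    """Split a properties string into plain flags and key/value bounds."""
    flags, bounds = set(), {}
    for item in spec.split(","):
        item = item.strip()
        if ":" in item:
            key, value = item.split(":", 1)
            bounds[key.strip()] = float(value)
        elif item:
            flags.add(item)
    return flags, bounds


def validate_numeric(raw_value, spec):
    """Return the value as a float, or raise ValueError if it breaks the spec."""
    flags, bounds = parse_properties(spec)
    if "numeric" not in flags:
        raise ValueError("this sketch only handles numeric parameters")
    value = float(raw_value)  # raises ValueError for non-numeric text
    if "min" in bounds and value < bounds["min"]:
        raise ValueError(f"{value} is below the minimum {bounds['min']}")
    if "max" in bounds and value > bounds["max"]:
        raise ValueError(f"{value} is above the maximum {bounds['max']}")
    return value


print(validate_numeric("0.5", SPEC))
```

Passing a value such as "1.5" or "high" to validate_numeric raises a ValueError, which mirrors the error a user would see when running the model with an invalid damage_ratio.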
The parameter properties supported on the RiskScape CLI include: bookmark, file, integer, list, min, max, numeric, and text.
Tip
Specifying the text, file or bookmark properties means that the user no longer needs to use '' single-quotes when entering a new parameter value.
The file property behaves identically to the text property, but allows the RiskScape Platform to display a file chooser.
Parameter Templates
In your INI files, you can also define parameter templates. This saves you from having to define the same parameter
in several different models. A parameter template will automatically be used if it has the same name as the parameter,
but you can also provide the name of a template to use, e.g. param.buildings.template = city_buildings
For example:
[model one]
framework = pipeline
pipeline = input($people) -> sort(name)
param.people = ./data/people.csv
param.people.properties = bookmark
[model two]
framework = pipeline
pipeline = input($people) -> sort(age)
param.people = ./data/people.csv
param.people.properties = bookmark
could be replaced by:
[parameter people]
default = ./data/people.csv
properties = bookmark
[model one]
framework = pipeline
pipeline = input($people) -> sort(name)
[model two]
framework = pipeline
pipeline = input($people) -> sort(age)
This saves repetition if you have several different models that all use the same parameters over and over.
Parameter templates are also useful for configuring how model parameters are presented to the user in the RiskScape Platform web interface. More information about parameter templates can be found in the RiskScape Platform documentation.
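The template lookup described in this section can be sketched in Python with the standard configparser module. The resolve_default helper is hypothetical, and the precedence it uses (a model's own param.<name> value first, then a named template, then a template with the same name as the parameter) is an assumption for illustration rather than a statement of RiskScape's exact rules.

```python
import configparser

PROJECT_INI = """
[parameter people]
default = ./data/people.csv
properties = bookmark

[model one]
framework = pipeline
pipeline = input($people) -> sort(name)

[model two]
framework = pipeline
pipeline = input($people) -> sort(age)
"""


def resolve_default(config, model, param):
    """Find the default value for a model parameter (hypothetical helper)."""
    section = config[f"model {model}"]
    # Assumption: a value set directly on the model takes precedence
    if f"param.{param}" in section:
        return section[f"param.{param}"]
    # Otherwise use the named template, or one with the parameter's own name
    template_name = section.get(f"param.{param}.template", param)
    template_section = f"parameter {template_name}"
    if config.has_section(template_section):
        return config[template_section].get("default")
    return None


config = configparser.ConfigParser(interpolation=None)
config.read_string(PROJECT_INI)

for model in ("one", "two"):
    print(model, resolve_default(config, model, "people"))
```

Both models pick up the same default from the single [parameter people] section, which is the repetition the template removes.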
Function parameters
Model parameters also work if the assumption you are making is in your function. You simply need to pass the model parameter through to your function.
To do this you add an extra argument to your function.
The new argument will be a Struct and contain the attributes you want to vary.
For example, say our Kaijū function was making an assumption on how resilient timber buildings are to Kaijū attacks. We can modify our function to take an additional ‘options’ argument, which is a Struct with a timber_resilience attribute. The (abridged) function would look like this:
from nz.org.riskscape.engine.types import Types
from nz.org.riskscape.engine.types import Struct
ID = 'kaiju_stomp'
DESCRIPTION = 'Models damage from a Kaiju stomping a building'
ARGUMENT_TYPES = ['building', 'kaiju_attack',
                  Struct.of('timber_resilience', Types.INTEGER)]
RETURN_TYPE = 'building_attack_outcome'
def function(building, stomp, options):
    if building.get('construction') == 'timber':
        # the resilience for timber buildings can be passed
        # into our function, making it easier to vary it
        resilience = options.get('timber_resilience')
    elif building.get('construction') == 'concrete':
        # whereas the resilience for concrete buildings is
        # still hard-coded in the function itself
        resilience = 5
    # ...
Then to call our function from our pipeline code, we create a new Struct with a timber_resilience attribute. In this case, the value for timber_resilience is a model parameter that we can now change on the fly whenever we run our model.
select({*,
        kaiju_stomp(asset, hazard, { timber_resilience: $timber_resilience }) as damage
       }) as compute consequence
The ‘options’ Struct could hold many different attributes, if there are many assumptions in your function that you want to vary.
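As a rough illustration of an ‘options’ Struct with several attributes, the plain-Python sketch below uses ordinary dicts in place of RiskScape Structs so it can be run on its own. The pick_resilience helper, the steel_resilience attribute, and the numeric values are all hypothetical; only timber_resilience and the hard-coded fallback of 5 come from the function above.

```python
def pick_resilience(building, options, fallback=5):
    """Look up '<construction>_resilience' in options, else use the fallback.

    Plain dicts stand in for RiskScape Structs in this sketch.
    """
    key = building.get("construction", "") + "_resilience"
    return options.get(key, fallback)


# Made-up values: each attribute is one assumption we may want to vary
options = {"timber_resilience": 3, "steel_resilience": 8}

for construction in ("timber", "steel", "concrete"):
    building = {"construction": construction}
    print(construction, pick_resilience(building, options))
```

In a real pipeline, each attribute of the ‘options’ Struct would typically be fed by its own model parameter, in the same way as $timber_resilience.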
Resolving file paths
By default, RiskScape will try to resolve a model parameter file path relative to where the model is defined.
For example, say we have the following PopulationModels/project.ini file:
[model one]
framework = pipeline
pipeline = input($people) -> sort(name)
param.people = data/people.csv
param.people.properties = bookmark
The $people parameter will be resolved relative to the project.ini file, so the model will try to use the file path PopulationModels/data/people.csv.
Sometimes you may want to share the same parameters, types, or bookmarks across multiple projects.
For example, say you had a C:\RiskScape_Projects\Shared\project.ini file that defined a common dataset that you wanted to use across all your models:
[parameter people]
default = ./data/people.csv
properties = bookmark
You could then import that project.ini file, and all the definitions it contains, into your PopulationModels/project.ini file:
[project]
import = C:\RiskScape_Projects\Shared\project.ini
[model one]
framework = pipeline
pipeline = input($people) -> sort(name)
This imports the $people parameter template, so you do not have to define it again in your PopulationModels project.
Note
When properties = bookmark or file is used, and the default value is a file path that begins with the special sequence ./, the path is resolved relative to the file the parameter is defined in, not the model where it is used.
This means your models can use templates (and their default file values) even when the template is in a different folder to the main project file.
In the above example, because ./ is used, the $people parameter is resolved relative to the Shared directory project.ini file where it is defined (i.e. C:\RiskScape_Projects\Shared\data\people.csv).
If ./ were not used in the default value (i.e. data/people.csv instead of ./data/people.csv), then the file path would be resolved relative to the PopulationModels directory project.ini file where the parameter is used (i.e. PopulationModels\data\people.csv).
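The resolution rule in the note above can be sketched in a few lines of Python using pathlib. The resolve_default_path helper is hypothetical, and it assumes the parameter already has the bookmark or file property; it only illustrates which directory is chosen as the base.

```python
from pathlib import PureWindowsPath

# The two project files from the example above
SHARED_INI = r"C:\RiskScape_Projects\Shared\project.ini"  # defines the template
LOCAL_INI = r"PopulationModels\project.ini"               # uses the template


def resolve_default_path(default, defined_in, used_in):
    """Choose the base directory according to the ./ rule (hypothetical helper)."""
    if default.startswith("./"):
        # Relative to the file where the parameter is defined
        base = PureWindowsPath(defined_in).parent
    else:
        # Relative to the project file where the parameter is used
        base = PureWindowsPath(used_in).parent
    return base / default


print(resolve_default_path("./data/people.csv", SHARED_INI, LOCAL_INI))
print(resolve_default_path("data/people.csv", SHARED_INI, LOCAL_INI))
```

The first call resolves into the Shared directory and the second into the PopulationModels directory, matching the two cases described above.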
A working example
Click here to download a working example of a parameterized pipeline.
Open the pipeline.txt and project.ini files and familiarize yourself with them.
You can see that the model defines ‘resilience’ parameters for the three construction materials of interest.
These parameters then get passed directly to the kaiju_stomp function.
To run the model with default parameter values, use the command:
riskscape model run demo
Reproducible models
Model parameters can make it harder to tell later what actual parameter values were used to produce a certain set of results. Fortunately, RiskScape always saves the actual parameter values it used in the output directory.
Have a look at the output directory created by the last ‘model run’ command.
It should contain a pipeline.txt file. This contains the pipeline code with the $ parameters replaced with the actual values used by the ‘model run’ command.
Any piece of raw pipeline code (i.e. without parameters) can be executed using the riskscape pipeline eval command.
So you can use the pipeline.txt file in the output directory to re-run the exact same pipeline again in the future, e.g.
riskscape pipeline eval <pipeline.txt>
Try this now and check you get the same results.
Parameter INI files
When your model has several different parameters, rather than specifying each parameter on the command line, you can just specify one INI file that contains all the parameter values you want to use.
To pass the INI file to the ‘model run’ command, use the --parameters CLI option (note the ‘s’ on the end).
For example:
riskscape model run demo --parameters stronger-resilience.ini
Using INI files can be helpful if you want to vary several different assumptions in a related way.
For example, this project contains two different INI files that make different assumptions about a building’s resilience: stronger-resilience.ini and weaker-resilience.ini.
Try running the model with each of these INI files and see what difference it makes to the overall damage.