The characteristic method

implementation

Prerequisites

This page builds on The size-structured PDE. Read it first if the notation here is unfamiliar.

The idea first

Imagine trying to track how a whole population of plants of different sizes changes over time. You could try to solve for the entire size distribution at once, on a fixed grid of sizes, updating every point as time moves forward. That turns out to be hard, because growth constantly shuffles plants from one size to another, and the equation describing this shuffling is a partial differential equation in which size and time are coupled.

The method of characteristics takes a different route. Instead of watching the grid, you follow individual plants, or cohorts of plants that started life together, as they grow. Each one traces a path through time as its size changes, and along that path you can also track how many individuals remain, and how their local density evolves. Because you are moving with the plant rather than watching it pass through fixed points, the awkward partial differential equation collapses into an ordinary differential equation for each trajectory. These are far easier to solve, one step at a time, with a standard numerical stepper.

The rest of this page makes this precise.

Here we describe the overall numerical approach used to solve the dynamics in the plant model. The system of equations being solved is set out on the size-structured PDE page.

Our approach to solving for the size-density distribution is based on the characteristic method (O. Angulo & López-Marcos, 2004; O. Angulo, López-Marcos, & López-Marcos, 2014, 2016). We initially used the Escalator Boxcar Train technique (De Roos, 1988; De Roos, Diekmann, & Metz, 1992; De Roos, Taljapurkar, & Caswell, 1997), but later switched to the characteristic method.

When simulating an individual plant, or the development of a patch, we need to solve for the size, survival, and seed output of individual plants. When solving for the size-density distribution in a large patch, we also need to estimate the average abundance of individuals. Each of these problems is formulated as an initial-value ODE problem (IVP), which can be solved using an ODE stepper.

All of the IVPs outlined below must be stepped through time. For this, plant uses an embedded implementation of the Runge-Kutta Cash-Karp 4-5 algorithm (Cash & Karp, 1990), with adaptive time step. The code is built into the model, based on code ported directly from the GNU Scientific Library (Galassi, 2009). The accuracy of the solver is controlled by two control parameters for relative and absolute accuracy, ode_tol_rel and ode_tol_abs. See ODE stepping for details of the stepper.
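To make the role of the two tolerances concrete, here is a minimal sketch of an adaptive Cash-Karp stepper for a scalar ODE. The tableau coefficients are those of Cash & Karp (1990). The step-size rule (safety factor 0.9, growth limited to between 0.2 and 5 times the current step) is a common textbook choice assumed for illustration, not the exact GSL control law, and the function names are hypothetical.

```python
import math

# Cash-Karp tableau (Cash & Karp, 1990)
C = [0.0, 1/5, 3/10, 3/5, 1.0, 7/8]
A = [
    [],
    [1/5],
    [3/40, 9/40],
    [3/10, -9/10, 6/5],
    [-11/54, 5/2, -70/27, 35/27],
    [1631/55296, 175/512, 575/13824, 44275/110592, 253/4096],
]
B5 = [37/378, 0.0, 250/621, 125/594, 0.0, 512/1771]               # 5th-order weights
B4 = [2825/27648, 0.0, 18575/48384, 13525/55296, 277/14336, 1/4]  # 4th-order weights

def cash_karp_step(f, t, y, h):
    """One trial step: returns the 5th-order solution and an error estimate."""
    k = []
    for i in range(6):
        yi = y + h * sum(a * kj for a, kj in zip(A[i], k))
        k.append(f(t + C[i] * h, yi))
    y5 = y + h * sum(b * kj for b, kj in zip(B5, k))
    y4 = y + h * sum(b * kj for b, kj in zip(B4, k))
    return y5, abs(y5 - y4)

def integrate(f, t0, y0, t1, ode_tol_rel=1e-8, ode_tol_abs=1e-8, h=0.1):
    """Step y from t0 to t1, shrinking or growing h to meet the tolerances."""
    t, y = t0, y0
    while t < t1:
        h = min(h, t1 - t)
        y_new, err = cash_karp_step(f, t, y, h)
        tol = ode_tol_abs + ode_tol_rel * abs(y_new)   # assumed error target
        if err <= tol:                                  # accept the step
            t, y = t + h, y_new
        # Assumed step-size update: safety factor 0.9, bounded change.
        scale = 0.9 * (tol / err) ** 0.2 if err > 0 else 5.0
        h *= min(5.0, max(0.2, scale))
    return y

# Usage: exponential decay, whose exact solution at t = 1 is exp(-1).
y_end = integrate(lambda t, y: -y, 0.0, 1.0, 1.0)
```

Tightening either tolerance forces smaller steps where the solution changes quickly; the stepper otherwise takes the largest step that still meets the error target.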

Recast as a set of ODEs, the equations to be solved are as follows.

Approach for specific equations

Size

The size of an individual is obtained via the size equation (defined on the size-structured PDE page), which is solved via the IVP

\[ \frac{dy}{dt} = g(x, y, t) , \]

\[ y(0) = H_0(x). \]

Survival

The probability of an individual surviving from patch age \(a_0\) to patch age \(a\) is obtained via the individual-survival equation (see the size-structured PDE page), which is solved via the IVP

\[ \frac{dy}{dt} = d(x, H_i(t) , E_t), \]

\[ y(0) = - \ln\left(S_{\rm G} (x, H_0, E_{a_0})\right) . \]

Survival is then

\[ S_{\rm I} (x, a_0, a) = \exp\left( - y(a) \right). \]
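As a minimal sketch of the size and survival IVPs, the following integrates height and cumulative mortality together along one characteristic. It uses a fixed-step fourth-order Runge-Kutta scheme rather than the adaptive stepper plant uses. The growth function, the mortality rate, and all parameter values are hypothetical, chosen so that the result can be checked against an analytic solution.

```python
import math

H_MAX, R, D, S_G = 20.0, 1.0, 0.05, 0.8   # hypothetical parameters

def growth(h):
    """Hypothetical height growth rate g, slowing linearly towards H_MAX."""
    return R * (1.0 - h / H_MAX)

def mortality(h):
    """Hypothetical instantaneous mortality rate d (constant here)."""
    return D

def rates(state):
    h, y = state
    return (growth(h), mortality(h))       # (dH/dt, dy/dt)

def rk4_step(state, dt):
    def shifted(k, w):
        return tuple(s + w * ki for s, ki in zip(state, k))
    k1 = rates(state)
    k2 = rates(shifted(k1, dt / 2))
    k3 = rates(shifted(k2, dt / 2))
    k4 = rates(shifted(k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def follow_individual(h0, age, n_steps=200):
    state = (h0, -math.log(S_G))           # y(0) = -ln(S_G)
    dt = age / n_steps
    for _ in range(n_steps):
        state = rk4_step(state, dt)
    height, y = state
    return height, math.exp(-y)            # S_I = exp(-y(a))

height, survival = follow_individual(h0=0.5, age=30.0)
```

With these assumed rates, survival should equal `S_G * exp(-D * age)` and height should approach `H_MAX` exponentially, which gives a simple check on the stepping.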

Seed production

The lifetime seed production of individuals is obtained via the lifetime seed-production equation, which is solved via the IVP

\[ \frac{dy}{dt} = S_{\rm D} \, f(x, H_i(t), E_t) \, S_{\rm I} (x, a_0,t) \, S_{\rm P} (a_0, t), \]

\[ y(0) = 0, \]

where \(S_{\rm I}\) is calculated as described above and \(S_{\rm P}\) is calculated as in the patch-survival equation.

Size-density of individuals

By integrating along the characteristics of the governing PDE (defined on the size-structured PDE page), the size-density of individuals born with height \(H_0\) and traits \(x\) at patch age \(a_{0}\) is given by (De Roos et al., 1997; O. Angulo & López-Marcos, 2004)

\[ N(H | x, a) = N(H_0 | x, a_0) \exp \left( - \int _{a_0}^{a} \left[\frac{\partial g(x, H(x, a_0, a^\prime), E_{a^\prime})}{\partial H} + d(x, H(x, a_0, a^\prime), E_{a^\prime})\right] {\rm d} a^\prime \right). \tag{1}\]

Equation 1 states that the size-density \(N\) at patch age \(a\) equals the size-density at patch age \(a_{0}\), adjusted for changes through growth and mortality. Size-density decreases through time because of mortality, as in a typical survival equation, but also changes because of growth. If growth slows with size (i.e., \(\partial g / \partial H < 0\)), size-density will increase, since the characteristics compress. Conversely, size-density will decrease if \(\partial g / \partial H > 0\), since the characteristics spread apart.
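As a worked illustration, assume a hypothetical growth function \(g = r (1 - H / H_{\rm m})\) with constants \(r, H_{\rm m} > 0\), and a constant mortality rate \(d\). Neither is the form used in plant; they are chosen only so that the integral can be evaluated by hand. Then \(\partial g / \partial H = - r / H_{\rm m}\), and Equation 1 reduces to

\[ N(H | x, a) = N(H_0 | x, a_0) \exp \left( \left[ \frac{r}{H_{\rm m}} - d \right] (a - a_0) \right). \]

Under these assumptions, size-density rises along the characteristic when the compression rate \(r / H_{\rm m}\) exceeds the mortality rate \(d\), and falls otherwise.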

Denoting by \(\left[H_0, H_{ + } \right)\) the range of heights attainable by any individual, our algorithm for solving metacommunity dynamics proceeds by sub-dividing this interval into a series of nodes with heights \(H_0 < H_1 < \ldots < H_k\) at the initial points of the characteristic curves. These nodes are then transported along the characteristics of that PDE. The placement of nodes is controlled indirectly, via the schedule of patch ages at which new nodes are introduced into the metacommunity. We then track the demography of each such node.

The integral in Equation 1 is solved via the IVP

\[ \frac{dy}{dt} = \frac{\partial g(x, H_i(t), E_t)}{\partial H} + d(x, H_i(t), E_t), \tag{2}\]

\[ y(0) = - \ln\left(N(H_0 | x, a_0) \, S_{\rm G} (x, H_0, E_{a_0}) \right), \tag{3}\]

from which we obtain the size-density

\[ N(H | x, a) = \exp( - y(a)). \tag{4}\]
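The sketch below steps one node forward using Equations 2 and 3 and recovers its size-density from \(\exp(-y(a))\). It then checks the result against a conservation argument: the individuals between two neighbouring characteristics stay between them, so density must scale inversely with their spacing, times survival. The growth function, mortality rate, and parameter values are hypothetical, and a fixed-step Runge-Kutta scheme stands in for the adaptive stepper.

```python
import math

H_MAX, R, D = 20.0, 1.0, 0.05     # hypothetical growth and mortality parameters
N0, S_G = 10.0, 0.8               # hypothetical initial density, germination survival

def growth(h):
    """Hypothetical growth rate g(H), slowing with height."""
    return R * (1.0 - h / H_MAX) ** 2

def dgrowth_dh(h):
    """Analytic derivative of the hypothetical growth rate."""
    return -2.0 * R / H_MAX * (1.0 - h / H_MAX)

def rates(state):
    h, y = state
    return (growth(h), dgrowth_dh(h) + D)           # Equation 2

def rk4_step(state, dt):
    def shifted(k, w):
        return tuple(s + w * ki for s, ki in zip(state, k))
    k1 = rates(state)
    k2 = rates(shifted(k1, dt / 2))
    k3 = rates(shifted(k2, dt / 2))
    k4 = rates(shifted(k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def follow_node(h0, age, n_steps=300):
    state = (h0, -math.log(N0 * S_G))               # Equation 3
    dt = age / n_steps
    for _ in range(n_steps):
        state = rk4_step(state, dt)
    h, y = state
    return h, math.exp(-y)                          # density from exp(-y(a))

EPS, AGE = 1e-3, 30.0
h_a, density = follow_node(0.5, AGE)
h_b, _ = follow_node(0.5 + EPS, AGE)

# Conservation check: initial density, thinned by mortality, divided by
# the factor by which the gap between the two characteristics has changed.
conserved = N0 * S_G * math.exp(-D * AGE) * EPS / (h_b - h_a)
```

Because growth slows with height here, the two characteristics end closer together than they started, and the density obtained from the IVP exceeds what mortality alone would give.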

Controls on approximation error

We now outline how to control the error of the approximate solutions to the system of equations described above. In our algorithm, numerical solutions are required to address a variety of problems:

To estimate the amount of light at a given height in a patch requires numerically integrating over the size-density distribution within that patch.
To calculate the assimilation of a plant requires numerically integrating photosynthesis over this light profile.
To simulate patch dynamics requires numerically identifying a vector of patch ages at which new nodes are introduced, and then numerically stepping the equations for each node forward in time to estimate their size, survival, and fecundity at different subsequent patch ages.
To solve for the initial height of a plant given its seed mass, and for the equilibrium birth rates across the metacommunity, requires numerical root finding.

As with all numerical techniques, solutions to each of these problems are accurate only up to a specified level. These levels are controlled via parameters in the plant code. Below, we provide a brief overview of the different numerical techniques being applied and outline how error tolerance can be increased or decreased. We refer to various control parameters that can be found within the control object. For a worked example illustrating how to modify these control parameters, see the section parameters of Appendix S3.

Initial plant heights

When a seed germinates, it produces a seedling of given height. The height of these seedlings is assumed to vary with the seed mass. Because there is no analytical solution relating seedling height to seed mass – at least when using the default FF16 physiological model – we must solve for this height numerically. The calculation is performed by the function height_seed within the physiological model, using the Boost library’s one-dimensional bisect routine (Schäling, 2014; Eddelbuettel, Emerson, & Kane, 2015). The accuracy of the solution is controlled by the parameter offspring_production_tol.
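A bisection search of this kind can be sketched as follows. The allometric function `total_mass` and its coefficients are hypothetical stand-ins for the physiological model, and `tol` plays the role the text assigns to the tolerance parameter; this is not the Boost routine itself.

```python
def total_mass(height):
    """Hypothetical monotone allometry: plant mass (kg) at a given height (m)."""
    return 0.05 * height ** 1.8 + 0.002 * height ** 3.0

def height_from_seed_mass(seed_mass, lower=0.0, upper=1.0, tol=1e-10, max_iter=200):
    """Find the height at which total_mass(height) equals seed_mass."""
    def residual(h):
        return total_mass(h) - seed_mass
    if residual(lower) > 0 or residual(upper) < 0:
        raise ValueError("root is not bracketed by [lower, upper]")
    for _ in range(max_iter):
        mid = 0.5 * (lower + upper)
        if residual(mid) > 0:
            upper = mid          # root lies in the lower half
        else:
            lower = mid          # root lies in the upper half
        if upper - lower < tol:
            break
    return 0.5 * (lower + upper)

seed_mass = 3.8e-5               # hypothetical seed mass (kg)
h0 = height_from_seed_mass(seed_mass)
```

Bisection only needs the mass function to be monotone over the bracket, which is why it is a robust choice when no closed-form inverse exists.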

Approximation of size-density distribution via the characteristic method

Errors in the approximation of the size-density distribution arise from two sources: (i) coarse stepping of nodes through time and (ii) poor spacing of nodes across the attainable size range.

As described above, the stepping of the ODE solver is controlled by two control parameters for relative and absolute accuracy, ode_tol_rel and ode_tol_abs (see ODE stepping).

A second factor controlling the accuracy with which nodes are stepped through time is the accuracy of the derivative \(\partial g / \partial H\) appearing in Equations 1 and 2, which is calculated via standard finite differencing (Abramowitz & Stegun, 2012). When the parameter node_gradient_richardson is TRUE, a Richardson extrapolation (Stoer & Bulirsch, 2002) is used to refine the estimate, up to depth node_gradient_richardson_depth. The overall accuracy of the derivative is controlled by node_gradient_eps.
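The effect of Richardson extrapolation on a finite-difference gradient can be sketched as below. The use of central differences, the step-halving scheme, and the example growth function are assumptions made for illustration; the text does not specify which differencing scheme plant applies.

```python
import math

def central_difference(f, x, eps):
    """Second-order central finite difference."""
    return (f(x + eps) - f(x - eps)) / (2.0 * eps)

def richardson_gradient(f, x, eps=0.1, depth=4):
    """Combine estimates at eps, eps/2, eps/4, ... to cancel leading error terms."""
    table = [[central_difference(f, x, eps / 2 ** i)] for i in range(depth)]
    for i in range(1, depth):
        for j in range(1, i + 1):
            prev = table[i][j - 1]
            table[i].append(prev + (prev - table[i - 1][j - 1]) / (4 ** j - 1))
    return table[-1][-1]

def growth(h):
    """Hypothetical growth rate with a known analytic derivative."""
    return h * math.exp(-h / 5.0)

exact = math.exp(-2.0 / 5.0) * (1.0 - 2.0 / 5.0)      # dg/dH at H = 2
plain_error = abs(central_difference(growth, 2.0, 0.1) - exact)
refined_error = abs(richardson_gradient(growth, 2.0) - exact)
```

Each extra level of extrapolation costs two more evaluations of the growth function, so the depth trades accuracy against the cost of computing growth rates.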

The primary factor controlling the spacing of nodes is the schedule of node introduction times. Because the system of equations to be integrated is deterministic, the schedule of node introduction times determines the spacing of nodes throughout the entire development of a patch. Poor node spacing introduces error because various emergent properties, such as total leaf area, biomass, or seed output, are estimated by integrating over the size-density distribution. The accuracy of these integrations degrades when nodes are poorly spaced. Thus, our algorithm aims to build an appropriately refined schedule, which allows the required integrations to be performed with the desired accuracy at every time point. At the same time, for reasons of computational efficiency, we want as few nodes as possible. The general idea of adaptively refining node introduction times was first applied by Falster, Brännström, Dieckmann, & Westoby (2011), and was described further by Falster, Brännström, Westoby, & Dieckmann (2015).

For a worked example illustrating schedule refinement (run_scm(refine_schedule = TRUE), backed by SCM::refine_schedule()), see node spacing.

Calculation of light environment and influence on assimilation

To progress with solving the system of ODEs requires that we calculate the amount of shading on each of the nodes, from all other plants in the patch.

Calculating the canopy openness \(E_a(z)\) at a given height \(z\) in a patch of age \(a\) requires that we integrate over the size-density distribution (the light equation on the size-structured PDE page). This integration is performed using the trapezium rule, within the function area_leaf_above in species.h. The main factor controlling the accuracy of the integration is the spacing of nodes. The node introduction times determining the spacing of nodes are adaptively refined as described above. Consequently, the trapezium integration within area_leaf_above is also adaptively refined via SCM::refine_schedule() (exposed in R as run_scm(refine_schedule = TRUE)). See node spacing for detail.
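The dependence of this integration on node spacing can be sketched with a small trapezium-rule example. The function names, the leaf-area allometry, and the crown profile are hypothetical and are not taken from species.h; only the trapezium rule over nodes ordered by height follows the description above.

```python
def leaf_area_above(z, height):
    """Hypothetical leaf area (m^2) that a plant of given height holds above z."""
    if z >= height:
        return 0.0
    area = 0.5 * height ** 1.5                  # hypothetical total leaf area
    return area * (1.0 - (z / height) ** 2)     # hypothetical crown profile

def trapezium_leaf_area_above(z, node_heights, node_densities):
    """Trapezium rule over nodes sorted by increasing height."""
    integrand = [n * leaf_area_above(z, h)
                 for h, n in zip(node_heights, node_densities)]
    total = 0.0
    for i in range(len(node_heights) - 1):
        width = node_heights[i + 1] - node_heights[i]
        total += 0.5 * width * (integrand[i] + integrand[i + 1])
    return total

# Uniform density of 1 between heights 0 and 10: the exact integral at z = 0
# is 0.5 * 10**2.5 / 2.5.
exact = 0.2 * 10 ** 2.5
fine_heights = [10.0 * i / 1000 for i in range(1001)]
coarse_heights = [2.0 * i for i in range(6)]
fine = trapezium_leaf_area_above(0.0, fine_heights, [1.0] * 1001)
coarse = trapezium_leaf_area_above(0.0, coarse_heights, [1.0] * 6)
```

With only six nodes the estimate is visibly off, while the finely spaced set recovers the exact value closely, which is the error that schedule refinement is meant to control.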

The cost of calculating \(E_a(z)\) linearly increases with the number of nodes in the metacommunity. Since the same calculation must be repeated for every node, the overall computational cost of a step increases as \(O(k^2 )\), where \(k\) is the total number of nodes across all species. This disproportionate increase in computational cost with the number of nodes is highly undesirable.

We reduce the computational cost from \(O(k^2)\) to \(O(k)\) by approximating \(E_a(z)\) with a spline. The light equation describes a function that increases monotonically with height. This function is easily approximated using a piecewise continuous spline fitted to a limited number of points. Once fitted, the spline can be used to estimate any additional evaluations of competitive effect. Since spline evaluations are computationally cheaper than integrating over the size-density distribution, this approach reduces the overall cost of stepping the resident population. A new spline is constructed for each time step.

The accuracy of the spline interpolation depends on the number of points used in its construction and on their placement along the size axis. We select the number and locations of points via an adaptive algorithm. Starting with an initial set of light_availability_spline_nbase points (17 by default), we assess how much each point contributes to the accuracy of the spline fit at the location of each node, first via exact calculation, and second by linearly interpolating from adjacent nodes. The absolute difference in these values is compared to the spline tolerance light_availability_spline_tol (default 1e-4). Note that this is a property of the light spline configured on the environment (e.g. FF16_Environment), not a field of the Control object. If the error is greater than this tolerance, the interval is bisected and the process repeated, up to light_availability_spline_max_depth (16 by default) levels (see adaptive_interpolator.h for details).
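The refinement loop described above can be sketched as follows. This is a simplified illustration, not the code in adaptive_interpolator.h: it tests each interval at its midpoint only, evaluates the fitted curve by linear interpolation instead of a spline, and uses a hypothetical openness profile. The default arguments mirror the defaults quoted in the text.

```python
import math

def refine(f, a, fa, b, fb, tol, depth, points):
    """Bisect [a, b] while the midpoint is poorly predicted by its neighbours."""
    mid = 0.5 * (a + b)
    f_exact = f(mid)                     # exact calculation
    f_linear = 0.5 * (fa + fb)           # linear interpolation from neighbours
    if abs(f_exact - f_linear) > tol and depth > 0:
        refine(f, a, fa, mid, f_exact, tol, depth - 1, points)
        points.append((mid, f_exact))
        refine(f, mid, f_exact, b, fb, tol, depth - 1, points)

def adaptive_points(f, lower, upper, nbase=17, tol=1e-4, max_depth=16):
    """Return (x, f(x)) pairs, denser where f is harder to interpolate."""
    xs = [lower + (upper - lower) * i / (nbase - 1) for i in range(nbase)]
    points = [(xs[0], f(xs[0]))]
    for a, b in zip(xs[:-1], xs[1:]):
        fa, fb = points[-1][1], f(b)
        refine(f, a, fa, b, fb, tol, max_depth, points)
        points.append((b, fb))
    return points

def interpolate(points, x):
    """Piecewise-linear evaluation of the fitted points."""
    for (xa, fa), (xb, fb) in zip(points[:-1], points[1:]):
        if xa <= x <= xb:
            return fa + (fb - fa) * (x - xa) / (xb - xa)
    raise ValueError("x outside fitted range")

def openness(z):
    """Hypothetical canopy openness, increasing with height z in [0, 10]."""
    return math.exp(-2.0 * (1.0 - z / 10.0) ** 2)

points = adaptive_points(openness, 0.0, 10.0)
```

Once the points are fitted, each further evaluation costs a lookup rather than an integration over all nodes, which is the source of the saving described above.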

Integration over light environment

Plants have leaf area distributed over a range of heights. Estimating a plant’s assimilation at each time step thus requires integrating leaf-level rates over the plant. The integration is performed using Gauss-Kronrod quadrature. The choice of quadrature rule, that is, the number of points at which the integrand is evaluated, controls the level of detail in the integration.

Solving for offspring production

For a single species, solving for \(Y_x\) is a straightforward one-dimensional root-finding problem, which can be solved via a simple bisection algorithm.

Solving for offspring production in metacommunities with multiple species is significantly harder, because there is no generally applicable method for multi-dimensional root finding. In plant, we have therefore implemented several different approaches, which are available via the package {regnans}.

References

Abramowitz, M., & Stegun, I. A. (2012). Handbook of mathematical functions: With formulas, graphs, and mathematical tables. Courier Corporation.

Angulo, O., & López-Marcos, J. C. (2004). Numerical integration of fully nonlinear size-structured population models. Applied Numerical Mathematics, 50(3-4), 291–327. doi:10.1016/j.apnum.2004.01.007

Angulo, O., López-Marcos, J. C., & López-Marcos, M. A. (2014). Analysis of an efficient integrator for a size-structured population model with a dynamical resource. Computers & Mathematics with Applications, 68(9), 941–961. doi:10.1016/j.camwa.2014.04.009

Angulo, O., López-Marcos, J. C., & López-Marcos, M. A. (2016). Study on the efficiency in the numerical integration of size-structured population models: Error and computational cost. Journal of Computational and Applied Mathematics, 291, 391–401. doi:10.1016/j.cam.2015.03.022

Cash, J. R., & Karp, A. H. (1990). A variable order Runge-Kutta method for initial value problems with rapidly varying right-hand sides. ACM Transactions on Mathematical Software, 16(3), 201–222. doi:10.1145/79505.79507

De Roos, A. M. (1988). Numerical methods for structured population models: The Escalator Boxcar Train. Numerical Methods for Partial Differential Equations, 4(3), 173–195. doi:10.1002/num.1690040303

De Roos, A. M., Diekmann, O., & Metz, J. A. J. (1992). Studying the dynamics of structured population models: A versatile technique and its application to Daphnia. American Naturalist, 139(1), 123. doi:10.1086/285316

De Roos, A. M., Taljapurkar, S., & Caswell, H. (1997). A gentle introduction to physiologically structured population models. In Structured population models in marine, terrestrial and fresh-water systems (pp. 119–204). New York: Chapman & Hall.

Eddelbuettel, D., Emerson, J. W., & Kane, M. J. (2015). BH: Boost C++ header files. Retrieved from http://CRAN.R-project.org/package=BH

Falster, D. S., Brännström, Å., Dieckmann, U., & Westoby, M. (2011). Influence of four major plant traits on average height, leaf-area cover, net primary productivity, and biomass density in single-species forests: A theoretical investigation. Journal of Ecology, 99(1), 148–164. doi:10.1111/j.1365-2745.2010.01735.x

Falster, D. S., Brännström, Å., Westoby, M., & Dieckmann, U. (2015). Multi-trait eco-evolutionary dynamics explain niche diversity and evolved neutrality in forests. bioRxiv, 014605. doi:10.1101/014605

Galassi, M. (Ed.). (2009). GNU scientific library: Reference manual (3. ed., for GSL version 1.12). s.l.: Network Theory.

Schäling, B. (2014). The Boost C++ libraries (2nd ed.). XML Press.

Stoer, J., & Bulirsch, R. (2002). Introduction to numerical analysis. Springer Science & Business Media.