EGM$^n$: The Sequential Endogenous Grid Method - Alan Lujan

Abstract

Heterogeneous agent models with multiple decisions are often solved using inefficient grid search methods that require many function evaluations and are slow. This paper provides a novel method for solving such models using an extension of the Endogenous Grid Method (EGM) that uses Gaussian Process Regression (GPR) to interpolate functions on unstructured grids. First, I propose an intuitive and strategic procedure for decomposing a problem into subproblems, which allows the use of efficient solution methods. Second, using an exogenous grid of post-decision states and solving for an endogenous grid of pre-decision states that obey a first-order condition greatly speeds up the solution process. Third, since the resulting endogenous grid can often be non-rectangular at best and unstructured at worst, GPR provides an efficient and accurate method for interpolating the value, marginal value, and decision functions. Applied sequentially to each decision within the problem, the method is able to solve heterogeneous agent models with multiple decisions in a fraction of the time and with fewer computational resources than are required by standard methods. Software to reproduce these methods is available under the Econ-ARK/HARK project for the Python programming language.

Keywords: endogenous grid method, dynamic programming, machine learning

1 Introduction

1.1 Background

Macroeconomic modeling aims to describe a complex world of agents interacting with each other and making decisions in a dynamic setting. The models are often very complex, require strong underlying assumptions, and use a lot of computational power to solve. One of the most common ways to solve these models is grid search. The Endogenous Grid Method (EGM) developed by Carroll (2006) allows dynamic optimization problems to be solved faster and more efficiently than the previous approach of convex optimization using grid search. Many problems that previously took hours to compute became much easier to solve, which allowed macroeconomists and computational economists to focus on estimation and simulation. However, the Endogenous Grid Method is limited to a few specific classes of problems. Recently, the classes of problems to which EGM can be applied have been expanded^[1], but with every new method comes a new set of limitations. This paper introduces a new approach to EGM in a multivariate setting. The method is called Sequential EGM (or EGMⁿ) and introduces a novel way of breaking down complex problems into a sequence of simpler, smaller, and more tractable problems, along with an exploration of new multidimensional interpolation methods that can be used to solve these problems.

1.2 Literature Review

Carroll (2006) first introduced the Endogenous Grid Method as a way to speed up the solution of dynamic stochastic consumption-savings problems. The method consists of starting with an exogenous grid of post-decision states and using the inverse of the first-order condition to find the optimal consumption policy that rationalizes such post-decision states. Given the optimal policy and post-decision states, it is straightforward to calculate the initial pre-decision state that leads to the optimal policy. Although this method is certainly innovative, it only applied to a model with one control variable and one state variable. Barillas & Fernández-Villaverde (2007) further extend this method by including more than one control variable in the form of a labor-leisure choice, as well as a second state variable for stochastic persistence.

Hintermaier & Koeniger (2010) introduce a model with collateral constraints and non-separable utility and solve it using a variant of EGM that allows for occasionally binding constraints among endogenous variables. Jørgensen (2013) evaluates the performance of the Endogenous Grid Method against other methods for solving dynamic stochastic optimization problems and finds it to be fast and efficient. Maliar & Maliar (2013) develop the Envelope Condition Method based on a similar idea as the Endogenous Grid Method, avoiding the need for costly numerical optimization and grid search. However, their model is limited to infinite horizon problems as it is a forward solution method.

Further development into a multivariate Endogenous Grid Method expanded the ability of researchers to solve models efficiently. White (2015) formally characterized the conditions for the Endogenous Grid Method and developed an interpolation method for structured non-rectilinear, or curvilinear, grids. Iskhakov (2015) additionally establishes conditions for solving multivariate models with EGM, requiring the invertibility of a triangular system of first-order conditions. Ludwig & Schön (2018) also develop a novel interpolation method using Delaunay triangulation of the resulting unstructured endogenous grid. However, the authors show that the gains from avoiding the grid search method are often offset by the costly construction of the triangulation.

For the papers discussed above, continuity and smoothness of the value function and first-order conditions are strict requirements. Fella (2014) first introduced a method to solve non-convex problems using the Endogenous Grid Method. The idea is based on evaluating candidate solutions to the first-order condition, which is necessary but not sufficient, in overlapping regions of the state space. Arellano et al. (2016) use the Envelope Condition Method to solve a sovereign default risk model with similar efficiency gains to EGM. Iskhakov et al. (2017) further advance the methodology by using extreme value errors to solve discrete choice problems with the Endogenous Grid Method. These methods, however, were only applied to a single control variable and a single state variable. Druedahl & Jørgensen (2017) introduce G2EGM to handle non-convex problems with more than one control variable and more than one state variable. This method is also capable of handling occasionally binding constraints, which previous multivariate EGM methods were not.

Clausen & Strub (2020) formalize the applicability of the Endogenous Grid Method and its extensions to discrete choice models and discuss the nesting of problems to efficiently find accurate solutions. Druedahl (2021) similarly suggests the nesting of problems to efficiently use the Endogenous Grid Method within problems with multiple control variables. However, while these nested methods reduce the complexity of solving these models, they often still require grid search methods, as is the case with Druedahl (2021).

1.3 Research Question

The purpose of this paper is to describe a new method for solving dynamic optimization problems efficiently and accurately, using the Endogenous Grid Method and first-order conditions to avoid convex optimization and grid search wherever possible. Sequential EGM (EGMⁿ) breaks a complex problem down into a sequence of simpler, smaller, and more tractable problems and pairs this decomposition with multidimensional interpolation methods suited to the resulting grids. This paper also illustrates how Sequential EGM can be used to solve a dynamic optimization problem in a multivariate setting.

1.4 Methodology

The sequential Endogenous Grid Method consists of 3 major parts: First, the problem to be solved should be broken up into a sequence of smaller problems that themselves don’t add any additional state variables or introduce asynchronous dynamics with respect to the uncertainty. If the problem is broken up in such a way that uncertainty can be realized at more than one point in the sequence, then the solution to this sequence of problems might differ from that of the aggregate problem, because realizing some uncertainty early gives the agent additional information about the future. Second, I evaluate each of the smaller problems to see if it can be solved using the Endogenous Grid Method. This evaluation is of greater scope than the traditional Endogenous Grid Method, as it allows for the resulting endogenous grid to be non-regular. If the subproblem cannot be solved with EGM, then convex optimization is used. Third, if the endogenous grid generated by the EGM is non-regular, then I use a multidimensional interpolation method that takes advantage of machine learning tools to generate an interpolating function. Solving each subproblem in this way, the sequential Endogenous Grid Method is capable of solving complex problems that are not solvable with the traditional Endogenous Grid Method and are difficult and time-consuming to solve with convex optimization and grid search methods.

1.5 Contributions

The Sequential Endogenous Grid Method is capable of solving multivariate dynamic optimization problems in an efficient and fast manner by avoiding grid search. This should allow researchers and practitioners to solve more complex problems that were previously not easily accessible to them but that more accurately capture the dynamics of the macroeconomy. By using advancements in machine learning techniques such as Gaussian Process Regression, the Sequential Endogenous Grid Method is capable of solving problems that could not previously be solved using the traditional Endogenous Grid Method. In particular, the Sequential Endogenous Grid Method differs from the nested approach (NEGM) of Druedahl (2021) in that it allows for using more than one Endogenous Grid Method step to solve a problem, avoiding costly grid search methods to the extent that the problem allows.

Additionally, the Sequential Endogenous Grid Method often sheds light on the problem by breaking it down into a sequence of simpler problems that were not previously apparent. This is because intermediary steps in the solution process generate value and marginal value functions of different pre- and post-decision states that can be used to understand the problem better.

1.6 Outline

Section 2 presents a basic model that illustrates the sequential Endogenous Grid Method in one dimension. Then Section 3 introduces a more complex problem with two state variables to demonstrate the use of machine learning tools to generate an interpolating function. In Section 4 I present the unstructured interpolation methods using machine learning in more detail. Section 5 discusses the theoretical requirements to use the Sequential Endogenous Grid Method. Finally, Section 6 concludes with some limitations and future work.

2 The Sequential Endogenous Grid Method

2.1 A basic model

The baseline problem which I will use to demonstrate the Sequential Endogenous Grid Method (EGMⁿ) is a discrete time version of Bodie et al. (1992) where a consumer has the ability to adjust their labor as well as their consumption in response to financial risk. The objective consists of maximizing the present discounted lifetime utility of consumption and leisure.

\VFunc_t(\BLev_t, \tShkEmp_t) = \max \Ex_{t} \left[ \sum_{n = 0}^{T-t} \DiscFac^{n} \utilFunc(\CLev_{t+n}, \Leisure_{t+n}) \right]

(1)

In particular, this example makes use of a utility function that is based on Example 1 in that paper, which is that of additively separable utility of consumption and leisure as

\utilFunc(\CLev, \Leisure) = \util(\CLev) + \h(\Leisure) = \frac{\CLev^{1-\CRRA}}{1-\CRRA} + \labShare^{1-\CRRA} \frac{\Leisure^{1-\leiShare}}{1-\leiShare}

(2)

where the term $\labShare^{1-\CRRA}$ is introduced to allow for a balanced growth path as in Mertens & Ravn (2011). The use of additively separable utility is ad-hoc, as it will allow for the use of multiple EGM steps in the solution process, as we’ll see later.

This model represents a consumer who begins the period with a level of bank balances $\bRat_{t}$ and a given wage offer $\tShkEmp_{t}$ . Simultaneously, they are able to choose consumption, labor intensity, and a risky portfolio share with the objective of maximizing their utility of consumption and leisure, as well as their future wealth.

The problem can be written in normalized recursive form^[2] as

\begin{split} \vFunc_{t}(\bRat_{t}, \tShkEmp_{t}) & = \max_{\{\cRat_{t}, \leisure_{t}, \riskyshare_{t}\}} \utilFunc(\cRat_{t}, \leisure_{t}) + \DiscFac \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1} (\bRat_{t+1}, \tShkEmp_{t+1}) \right] \\ & \text{s.t.} \\ \labor_{t} & = 1 - \leisure_{t} \\ \mRat_{t} & = \bRat_{t} + \tShkEmp_{t}\labor_{t} \\ \aRat_{t} & = \mRat_{t} - \cRat_{t} \\ \Rport_{t+1} & = \Rfree + (\Risky_{t+1} - \Rfree) \riskyshare_{t} \\ \bRat_{t+1} & = \aRat_{t} \Rport_{t+1} / \PGro_{t+1} \end{split}

(3)

in which $\labor_{t}$ is the time supplied to labor net of leisure, $\mRat_{t}$ is the market resources totaling bank balances and labor income, $\aRat_{t}$ is the amount of saving assets held by the consumer, and $\riskyshare_{t}$ is the risky share of assets, which induces a return on portfolio $\Rport_{t+1}$ that results in next period’s bank balances $\bRat_{t+1}$ normalized by next period’s permanent income growth $\PGro_{t+1}$ .

2.2 Restating the problem sequentially

We can make a few choices to create a sequential problem which will allow us to use multiple EGM steps in succession. First, the agent decides their labor-leisure trade-off and receives a wage. Their wage plus their previous bank balance then becomes their market resources. Second, given market resources, the agent makes a pure consumption-saving decision. Finally, given an amount of savings, the consumer then decides their risky portfolio share.

Starting from the beginning of the period, we can define the labor-leisure problem as

\begin{split} \vFunc_{t}(\bRat_{t}, \tShkEmp_{t}) & = \max_{ \leisure_{t}} \h(\leisure_{t}) + \vOpt_{t} (\mRat_{t}) \\ & \text{s.t.} \\ 0 & \leq \leisure_{t} \leq 1 \\ \labor_{t} & = 1 - \leisure_{t} \\ \mRat_{t} & = \bRat_{t} + \tShkEmp_{t}\labor_{t}. \end{split}

(4)

The pure consumption-saving problem is then

\begin{split} \vOpt_{t}(\mRat_{t}) & = \max_{\cRat_{t}} \util(\cRat_{t}) + \DiscFac\vEnd_{t}(\aRat_{t}) \\ & \text{s.t.} \\ 0 & \leq \cRat_{t} \leq \mRat_{t} \\ \aRat_{t} & = \mRat_{t} - \cRat_{t}. \end{split}

(5)

Finally, the risky portfolio problem is

\begin{split} \vEnd_{t}(\aRat_{t}) & = \max_{\riskyshare_{t}} \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1}(\bRat_{t+1}, \tShkEmp_{t+1}) \right] \\ & \text{s.t.} \\ 0 & \leq \riskyshare_{t} \leq 1 \\ \Rport_{t+1} & = \Rfree + (\Risky_{t+1} - \Rfree) \riskyshare_{t} \\ \bRat_{t+1} & = \aRat_{t} \Rport_{t+1} / \PGro_{t+1}. \end{split}

(6)

This sequential approach is explicitly modeled after the nested approaches explored in Clausen & Strub (2020) and Druedahl (2021). However, I will offer additional insights that expand on these methods. An important observation is that now, every single choice is self-contained in a subproblem, and although the structure is specifically chosen to minimize the number of state variables at every stage, the problem does not change by this structural imposition. This is because there is no additional information or realization of uncertainty that happens between decisions, as can be seen by the expectation operator being in the last subproblem. From the perspective of the consumer, these decisions are essentially simultaneous, but a careful organization into sub-period problems enables us to solve the model more efficiently and can provide key economic insights. In this problem, as we will see, a key insight will be the ability to explicitly calculate the marginal value of wealth and the Frisch elasticity of labor.

2.3 The portfolio decision subproblem

As useful as it is to be able to use the EGM step more than once, there are clear problems where the EGM step is not applicable. This basic labor-portfolio choice problem demonstrates where we can use an additional EGM step, and where we cannot. First, we go over a subproblem where we cannot use the EGM step.

In reorganizing the labor-portfolio problem into subproblems, we assigned the utility of leisure to the leisure-labor subproblem and the utility of consumption to the consumption-savings subproblem. There are no more separable convex utility functions to assign to this problem, and even if we re-organized the problem in a way that moved one of the utility functions into this subproblem, they would not be useful in solving this subproblem via EGM as there is no direct relation between the risky share of portfolio and consumption or leisure. Therefore, the only way to solve this subproblem is through standard convex optimization and root-finding techniques.

Restating the problem in compact form gives

\vEnd_{t}(\aRat_{t}) = \max_{\riskyshare_{t}} \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1}\left(\aRat_{t}(\Rfree + (\Risky_{t+1} - \Rfree) \riskyshare_{t}) / \PGro_{t+1}, \tShkEmp_{t+1}\right) \right].

(7)

The first-order condition with respect to the risky portfolio share is then

\Ex_{t} \left[ \PGro_{t+1}^{-\CRRA} \vFunc_{t+1}^{\bRat}\left(\bRat_{t+1}, \tShkEmp_{t+1}\right) (\Risky_{t+1} - \Rfree) \right] = 0.

(8)

Finding the optimal risky share requires numerical optimization and root-solving of the first-order condition. To close out the problem, we can calculate the envelope condition as

\vEnd_{t}'(\aRat_{t}) = \Ex_{t} \left[ \PGro_{t+1}^{-\CRRA} \vFunc_{t+1}^{\bRat}\left(\bRat_{t+1}, \tShkEmp_{t+1}\right) \Rport_{t+1} \right].

(9)
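As a minimal sketch of this step, the following pure-Python example solves the first-order condition in Equation (8) by bisection and then evaluates the envelope condition in Equation (9). The two-point risky return, the parameter values, the absence of wage and permanent income risk ($\PGro_{t+1} = 1$), and the stand-in marginal value function `v_b_next` are all assumptions made for illustration; they are not taken from the paper or from HARK.

```python
CRRA = 5.0            # relative risk aversion (assumed value)
RFREE = 1.02          # risk-free return factor (assumed value)
RISKY = [0.90, 1.20]  # two-point risky return (assumed discretization)
PROBS = [0.5, 0.5]    # probabilities of the two return outcomes


def v_b_next(b):
    """Stand-in for next period's marginal value of bank balances (hypothetical)."""
    return b ** (-CRRA)


def foc(share, a):
    """Left-hand side of Equation (8), with PGro = 1 and no wage risk."""
    total = 0.0
    for risky, prob in zip(RISKY, PROBS):
        r_port = RFREE + (risky - RFREE) * share
        total += prob * v_b_next(a * r_port) * (risky - RFREE)
    return total


def optimal_share(a, tol=1e-12):
    """Root of the FOC on [0, 1] by bisection, with corner solutions checked first.

    The FOC is decreasing in the share when the value function is concave,
    so a positive value means the root lies to the right.
    """
    if foc(0.0, a) <= 0.0:
        return 0.0
    if foc(1.0, a) >= 0.0:
        return 1.0
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if foc(mid, a) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def v_end_prime(a):
    """Envelope condition, Equation (9), evaluated at the optimal share."""
    share = optimal_share(a)
    total = 0.0
    for risky, prob in zip(RISKY, PROBS):
        r_port = RFREE + (risky - RFREE) * share
        total += prob * v_b_next(a * r_port) * r_port
    return total
```

Because the stand-in marginal value is a power function, the optimal share in this sketch does not depend on $\aRat_{t}$; with a marginal value function interpolated from the next period's solution it generally would.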

2.3.1 A note on avoiding taking expectations more than once

We could instead define the portfolio choice subproblem as:

\vEnd_{t}(\aRat_{t}) = \max_{\riskyshare_{t}} \vOptAlt_{t}(\aRat_{t}, \riskyshare_{t})

(10)

where

\begin{split} \vOptAlt_{t}(\aRat_{t}, \riskyshare_{t}) & = \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1}\left(\bRat_{t+1}, \tShkEmp_{t+1}\right) \right] \\ \Rport_{t+1} & = \Rfree + (\Risky_{t+1} - \Rfree) \riskyshare_{t} \\ \bRat_{t+1} & = \aRat_{t} \Rport_{t+1} / \PGro_{t+1} \end{split}

(11)

In this case, the process is similar. The only difference is that we don’t have to take expectations more than once. Given the next period’s solution, we can calculate the marginal value functions as:

\begin{split} \vOptAlt_{t}^{\aRat}(\aRat_{t}, \riskyshare_{t}) & = \Ex_{t} \left[ \PGro_{t+1}^{-\CRRA} \vFunc_{t+1}^{\bRat}\left(\bRat_{t+1}, \tShkEmp_{t+1}\right) \Rport_{t+1} \right] \\ \vOptAlt_{t}^{\riskyshare}(\aRat_{t}, \riskyshare_{t}) & = \Ex_{t} \left[ \PGro_{t+1}^{-\CRRA} \vFunc_{t+1}^{\bRat}\left(\bRat_{t+1}, \tShkEmp_{t+1}\right) \aRat_{t} (\Risky_{t+1} - \Rfree) \right] \end{split}

(12)

If we are clever, we can calculate both of these in one step. Now, the optimal risky share can be found by the first-order condition and we can use it to evaluate the envelope condition.

\text{F.O.C.:} \qquad \vOptAlt_{t}^{\riskyshare}(\aRat_{t}, \riskyshare_{t}^{*}) = 0 \qquad \text{E.C.:} \qquad \vEnd_{t}^{\aRat}(\aRat_{t}) = \vOptAlt_{t}^{\aRat}(\aRat_{t}, \riskyshare_{t}^{*})

(13)
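One way to read "both of these in one step" is that a single pass over the shock realizations can accumulate both marginal values in Equation (12), after which the first-order and envelope conditions in Equation (13) are read off a grid of shares. The sketch below illustrates that reading under stated assumptions: a two-point risky return, $\PGro_{t+1} = 1$, no wage risk, a hypothetical marginal value $b^{-\CRRA}$, and linear interpolation between share grid points. None of these choices come from the paper.

```python
CRRA = 5.0            # assumed value
RFREE = 1.02          # assumed value
RISKY = [0.90, 1.20]  # assumed two-point risky return
PROBS = [0.5, 0.5]


def marginals(a, share):
    """Return (dv/da, dv/dshare) from Equation (12) in one pass over the shocks."""
    dv_da = 0.0
    dv_ds = 0.0
    for risky, prob in zip(RISKY, PROBS):
        excess = risky - RFREE
        r_port = RFREE + excess * share
        # Hypothetical next-period marginal value b**(-CRRA), with PGro = 1.
        weighted = prob * (a * r_port) ** (-CRRA)
        dv_da += weighted * r_port
        dv_ds += weighted * a * excess
    return dv_da, dv_ds


def solve_on_share_grid(a, n_share=201):
    """Locate the zero of dv/dshare on a share grid; return (share*, dv/da at share*)."""
    shares = [i / (n_share - 1) for i in range(n_share)]
    vals = [marginals(a, s) for s in shares]
    if vals[0][1] <= 0.0:      # corner solution at share = 0
        return 0.0, vals[0][0]
    if vals[-1][1] >= 0.0:     # corner solution at share = 1
        return 1.0, vals[-1][0]
    for i in range(n_share - 1):
        (da0, ds0), (da1, ds1) = vals[i], vals[i + 1]
        if ds0 > 0.0 >= ds1:
            w = ds0 / (ds0 - ds1)  # linear interpolation weight at the crossing
            share = shares[i] + w * (shares[i + 1] - shares[i])
            return share, da0 + w * (da1 - da0)
    raise RuntimeError("no sign change found on the share grid")
```

The second returned value is the envelope condition evaluated by interpolating the already-computed `dv_da` values, so no further expectation is taken.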

2.4 The consumption-saving subproblem

The consumption-saving EGM follows Carroll (2006) but I will cover it for exposition. We can begin the solution process by restating the consumption-savings subproblem in a more compact form, substituting the market resources constraint and ignoring the no-borrowing constraint for now. The problem is:

\vOpt_{t}(\mRat_{t}) = \max_{\cRat_{t}} \util(\cRat_{t}) + \DiscFac \vEnd_{t}(\mRat_{t}-\cRat_{t})

(14)

To solve, we derive the first-order condition with respect to $\cRat_{t}$ which gives the familiar Euler equation:

\util'(\cRat_{t}) = \DiscFac \vEnd_{t}'(\mRat_{t} - \cRat_{t}) = \DiscFac \vEnd_{t}'(\aRat_{t})

(15)

Inverting the above equation is the (first) EGM step.

\cEndFunc_{t}(\aRat_{t}) = \util'^{-1}\left( \DiscFac \vEnd_{t}'(\aRat_{t}) \right)

(16)

Given the utility function above, the marginal utility of consumption and its inverse are

\util'(\cRat) = \cRat^{-\CRRA} \qquad \util'^{-1}(\xRat) = \xRat^{-1/\CRRA}.

(17)

Carroll (2006) demonstrates that by using an exogenous grid of $\aMat$ points we can find the unique $\cEndFunc_{t}(\aMat)$ that optimizes the consumption-saving problem, since the first-order condition is necessary and sufficient. Further, using the market resources constraint, we can recover the exact amount of market resources that is consistent with this consumption-saving decision as

\mEndFunc_{t}(\aMat) = \cEndFunc_{t}(\aMat) + \aMat.

(18)

This $\mEndFunc_{t}(\aMat)$ is the “endogenous” grid that is consistent with the exogenous decision grid $\aMat$ . Now that we have a $(\mEndFunc_{t}(\aMat), \cEndFunc_{t}(\aMat))$ pair for each $\aRat \in \aMat$ , we can construct an interpolating consumption function for market resources points that are off-the-grid.

The envelope condition will be useful in the next section, but for completeness is defined here.

\vOpt_{t}'(\mRat_{t}) = \DiscFac \vEnd_{t}'(\aRat_{t}) = \util'(\cRat_{t})

(19)
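The EGM step in Equations (16) and (18) can be sketched in a few lines of pure Python. The end-of-period marginal value `v_end_prime` used here is a hypothetical stand-in (it assumes all wealth is consumed next period, with no risk and $\PGro_{t+1} = 1$), the parameter values are arbitrary, and prepending the point $(0, 0)$ is one common way to handle the no-borrowing constraint that the text sets aside; none of this is claimed to match the author's implementation.

```python
from bisect import bisect_right

CRRA = 2.0       # assumed value
DISC_FAC = 0.96  # assumed value
RFREE = 1.03     # assumed value


def v_end_prime(a):
    """Hypothetical end-of-period marginal value: everything is consumed next period."""
    return RFREE * (RFREE * a) ** (-CRRA)


def u_prime_inv(x):
    """Inverse marginal utility of consumption, Equation (17)."""
    return x ** (-1.0 / CRRA)


def egm_step(a_grid):
    """Map an exogenous grid of assets to endogenous (m, c) pairs."""
    c_grid = [u_prime_inv(DISC_FAC * v_end_prime(a)) for a in a_grid]  # Equation (16)
    m_grid = [c + a for c, a in zip(c_grid, a_grid)]                   # Equation (18)
    # Prepend (0, 0) so that interpolation covers the constrained region.
    return [0.0] + m_grid, [0.0] + c_grid


def interp(x, xs, ys):
    """Piecewise-linear interpolation on increasing xs, extrapolating at the ends."""
    i = min(max(bisect_right(xs, x) - 1, 0), len(xs) - 2)
    w = (x - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + w * (ys[i + 1] - ys[i])


# The exogenous grid must be strictly positive because v_end_prime diverges at zero.
a_grid = [0.1 * i for i in range(1, 51)]
m_grid, c_grid = egm_step(a_grid)
consumption_at_2 = interp(2.0, m_grid, c_grid)
```

No root-finding occurs anywhere in `egm_step`: each exogenous asset point yields its consumption and market resources by direct evaluation, which is the source of the speed gain.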

2.5 The labor-leisure subproblem

The labor-leisure subproblem can be restated more compactly as:

\vFunc_{t}(\bRat_{t}, \tShkEmp_{t}) = \max_{ \leisure_{t}} \h(\leisure_{t}) + \vOpt_{t}(\bRat_{t} + \tShkEmp_{t}(1-\leisure_{t}))

(20)

The first-order condition with respect to leisure implies the labor-leisure Euler equation

\h'(\leisure_{t}) = \vOpt_{t}'(\mRat_{t}) \tShkEmp_{t}

(21)

The marginal utility of leisure and its inverse are

\h'(\leisure) = \labShare^{1-\CRRA}\leisure^{-\leiShare} \qquad \h'^{-1}(\xRat) = (\xRat/\labShare^{1-\CRRA})^{-1/\leiShare}

(22)

Using an exogenous grid of $\mMat$ and $\tShkMat$ , we can find leisure as

\zEndFunc_{t}(\mMat, \tShkMat) = \h'^{-1}\left( \vOpt_{t}'(\mMat) \tShkMat \right)

(23)

In this case, it’s important to note that leisure itself is bounded. An agent with a small level of market resources $\mRat_{t}$ might want to work more than their available time endowment, especially at higher wage offers $\tShkEmp_{t}$ , if the utility of leisure is not enough to compensate for their low wealth. In these situations, the unconstrained solution for leisure might fall outside the feasible interval $[0, 1]$ , so we must impose a constraint on the optimal leisure function. This is similar to the treatment of an artificial borrowing constraint in the pure consumption subproblem. From now on, let’s call this constrained optimal function $\hat{\zEndFunc}_{t}(\mMat, \tShkMat)$ , where

\hat{\zEndFunc}_{t}(\mMat, \tShkMat) = \max \left[ \min \left[ \zEndFunc_{t}(\mMat, \tShkMat), 1 \right], 0 \right]

(24)

Then, we derive labor as $\lEndFunc_{t}(\mRat_{t}, \tShkEmp_{t}) = 1 - \hat{\zEndFunc}_{t}(\mRat_{t}, \tShkEmp_{t})$ . Finally, for each $\tShkEmp_{t}$ and $\mRat_{t}$ as an exogenous grid, we can find the endogenous grid of bank balances as $\bEndFunc_{t}(\mRat_{t}, \tShkEmp_{t}) = \mRat_{t} - \tShkEmp_{t}\lEndFunc_{t}(\mRat_{t}, \tShkEmp_{t})$ .

The envelope condition then provides a heterogeneous Frisch elasticity of labor as simply

\vFunc_{t}^{\bRat}(\bRat_{t}, \tShkEmp_{t}) = \vOpt_{t}'(\mRat_{t}) = \h'(\leisure_{t})/\tShkEmp_{t}.

(25)
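The second EGM step, Equations (23) and (24) together with the endogenous bank balances, might look as follows. This is a sketch under assumptions: `v_opt_prime` is a hypothetical stand-in for the marginal value of market resources from the consumption stage, `LEI_COEF` is an assumed value for the coefficient multiplying the power term in $\h'$, and all parameter values are arbitrary.

```python
CRRA = 2.0        # assumed value
LEI_SHARE = 1.5   # curvature of leisure utility (assumed value)
LEI_COEF = 0.5    # assumed coefficient multiplying z**(-LEI_SHARE) in h'(z)


def v_opt_prime(m):
    """Hypothetical marginal value of market resources from the consumption stage."""
    return m ** (-CRRA)


def h_prime(z):
    """Marginal utility of leisure."""
    return LEI_COEF * z ** (-LEI_SHARE)


def h_prime_inv(x):
    """Inverse of the marginal utility of leisure."""
    return (x / LEI_COEF) ** (-1.0 / LEI_SHARE)


def labor_egm_step(m_grid, theta_grid):
    """For each exogenous (m, theta) return endogenous b, leisure z, and labor."""
    rows = []
    for theta in theta_grid:
        row = []
        for m in m_grid:
            z_unc = h_prime_inv(v_opt_prime(m) * theta)  # Equation (23)
            z = max(min(z_unc, 1.0), 0.0)                # Equation (24)
            labor = 1.0 - z
            b = m - theta * labor                        # endogenous bank balances
            row.append((b, z, labor))
        rows.append(row)
    return rows


m_grid = [0.5, 1.0, 2.0, 4.0]
theta_grid = [0.8, 1.0, 1.2]
solution = labor_egm_step(m_grid, theta_grid)
```

Each wage offer produces a different set of endogenous `b` values from the same `m_grid`, so the collection of $(\bRat_{t}, \tShkEmp_{t})$ points is no longer rectilinear.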

2.6 Alternative Parametrization

An alternative formulation for the utility of leisure is to state it in terms of the disutility of labor as in (references)

\h(\labor) = - \leiShare \frac{\labor^{1+\labShare}}{1+\labShare}

(26)

In this case, substituting $\labor = 1 - \leisure$ , we can restate the utility of leisure as

\h(\leisure) = - \leiShare \frac{(1-\leisure)^{1+\labShare}}{1+\labShare}

(27)

The marginal utility of leisure and its inverse are

\h'(\leisure) = \leiShare(1-\leisure)^{\labShare} \qquad \h'^{-1}(\xRat) = 1 - (\xRat/\leiShare)^{1/\labShare}

(28)

2.7 Curvilinear Grids

Although EGMⁿ seems to be a simple approach, there is one important caveat that we have not discussed, which is the details of the interpolation. In the pure consumption-savings problem, a one-dimensional exogenous grid of post-decision liquid assets $\aMat$ results in a one-dimensional endogenous grid of total market resources $\mMat$ . However, as we know from standard EGM, the spacing in the $\mMat$ grid is different from the spacing in the $\aMat$ grid as the inverted Euler equation is non-linear. This is no problem in a one-dimensional problem as we can simply use non-uniform linear interpolation.

However, the same is true of higher dimensional problems, where the exogenous grid gets mapped to a warped endogenous grid. In this case, it is not possible to use standard multi-linear interpolation, as the resulting endogenous grid is not rectilinear. Instead, I introduce a novel approach to interpolation that I call Warped Grid Interpolation (WGI), which is similar to White (2015)’s approach but computationally more efficient and robust. The details of this interpolation method will be further explained in Section 4, but for now, we show the resulting warped endogenous grid for the labor-leisure problem.

Figure 1: Warped curvilinear grid that results from multivariate EGM. This grid can be interpolated by WGI.

2.8 Warped Grid Interpolation (WGI)

Assume we have a set of points indexed by $(i,j)$ in two-dimensional space for which we have corresponding functional values in a third dimension, such that $f(x_{ij},y_{ij}) = z_{ij}$ . In practice, we are interested in cases where the $z_{ij}$ are difficult to compute and $f$ is unknown, so we are unable to compute them at other values of $x$ and $y$ , which is why we want to interpolate^[3]. These $(x_{ij},y_{ij})$ points, however, are not evenly spaced and do not form a rectilinear grid, which would make it easy to interpolate the function off the grid. Nevertheless, these points do have a regular structure as we will see.

Figure 2: True function and curvilinear grid of points for which we know the value of the function.

In Figure 2, we can see the true function in three-dimensional space, along with the points for which we actually know the value of the function. The underlying regular structure comes from the points’ position in the matrix, the $(i,j)$ coordinates. If we join the points along every row and every column, we can see that the resulting grid is regular and piecewise affine (curvilinear).

In Figure 3 we see the values of the function at their index coordinate points in the matrix. We can see that there exists a mapping between the curvilinear grid and the index coordinates of the matrix.

Figure 3: Homotopy between the curvilinear grid and the index coordinates of the matrix.

The objective is to be able to interpolate the value of the function at any point off the grid, where presumably we are only interested in points internal to the curvilinear space and not outside the boundaries. For example, we can imagine that we want an approximation to the function at the point $(x,y) = (3, 5)$ pictured in Figure 4. If we could find the corresponding point in the coordinate grid, interpolation would be straightforward. We can find where the $x$ -coordinate of the point of interest intersects with the index-coordinates of the matrix. This is similar to assuming that we have 3 linear interpolators formed by connecting the points on the green lines in the $x$ -direction, and for each interpolator we can approximate the corresponding $y$ and $z$ values using the grid data. Now, for each circle in Figure 4, we have a corresponding pair $(y,z)$ , and we can interpolate in the $y$ -direction to find the corresponding $z$ -value for the point’s $y$ -coordinate^[4].

Figure 4: The method consists of extending the loci of points in the $x$ dimension to find the corresponding crossing points in the $y$ dimension.
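The two-pass idea described above can be sketched as follows. This is an illustrative reading of the procedure, not the author's WGI implementation: the function names are hypothetical, each row of the grid is assumed to have strictly increasing $x$ values, the crossing points are assumed to have distinct $y$ values, and rows whose $x$ range does not contain the query point are simply skipped.

```python
from bisect import bisect_right


def interp1d(x, xs, ys):
    """Linear interpolation on increasing xs; returns None outside [xs[0], xs[-1]]."""
    if not xs[0] <= x <= xs[-1]:
        return None
    i = min(bisect_right(xs, x) - 1, len(xs) - 2)
    w = (x - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + w * (ys[i + 1] - ys[i])


def warped_interp(x, y, x_grid, y_grid, z_grid):
    """Interpolate z at (x, y) on a curvilinear grid stored as lists of rows."""
    crossings = []
    # Pass 1: along each row, find the y and z values where the row crosses x.
    for xs, ys, zs in zip(x_grid, y_grid, z_grid):
        y_cross = interp1d(x, xs, ys)
        if y_cross is not None:
            crossings.append((y_cross, interp1d(x, xs, zs)))
    if len(crossings) < 2:
        return None
    # Pass 2: interpolate the crossing points in the y direction.
    crossings.sort()
    y_cross = [pair[0] for pair in crossings]
    z_cross = [pair[1] for pair in crossings]
    return interp1d(y, y_cross, z_cross)


# A sheared 4-by-5 grid (assumed example) with a linear test function z = 2x + 3y.
x_grid = [[j + 0.3 * i for j in range(5)] for i in range(4)]
y_grid = [[i + 0.2 * j for j in range(5)] for i in range(4)]
z_grid = [[2.0 * xv + 3.0 * yv for xv, yv in zip(xr, yr)]
          for xr, yr in zip(x_grid, y_grid)]
z_hat = warped_interp(2.0, 1.7, x_grid, y_grid, z_grid)
```

For a linear test function both passes are exact, so `z_hat` should agree with $2 \cdot 2 + 3 \cdot 1.7$ up to rounding; for nonlinear functions the error depends on how finely the grid resolves the curvature.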

3 The EGMⁿ in Higher Dimensions

The problem in Section 2 demonstrates the simplicity of solving problems sequentially. However, as constructed, the problem has only one state variable and one post-decision state variable per stage. Can EGMⁿ be used to solve higher dimensional problems? In short, yes, but it requires additional thought on interpolation.

3.1 A more complex problem

For a demonstration, we now turn to the problem of a worker saving up for retirement. This worker must consume, save, and deposit resources into a tax-advantaged account that cannot be liquidated until retirement. In the recursive problem, the worker begins a new period with a liquid account of market resources $\mRat_{t}$ and an illiquid account of retirement savings $\nRat_{t}$ . The worker maximizes their utility by choosing consumption $\cRat_{t}$ and pension deposit $\dRat_{t}$ . The pension deposit is set aside in a retirement account that is exposed to a risky return, while their post-consumption liquid assets accrue risk-free interest every period. The worker additionally receives an income that faces a permanent ($\PGro_{t+1}$) and a transitory ($\tShkEmp_{t+1}$) shock every period. At the age of 65, the worker retires and their assets are liquidated, at which point the state reduces to one liquid account of market resources. The worker’s recursive problem is:

\begin{split} \vFunc_{t}(\mRat_{t}, \nRat_{t}) & = \max_{\cRat_{t}, \dRat_{t}} \util(\cRat_{t}) + \DiscFac \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \quad \cRat_{t} \ge 0, \quad \dRat_{t} \ge 0 \\ \aRat_{t} & = \mRat_{t} - \cRat_{t} - \dRat_{t} \\ \bRat_{t} & = \nRat_{t} + \dRat_{t} + g(\dRat_{t}) \\ \mRat_{t+1} & = \aRat_{t} \Rfree / \PGro_{t+1} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \bRat_{t} \Risky_{t+1} / \PGro_{t+1} \end{split}

(31)

where

\gFunc(\dRat) = \xFer \log(1+\dRat).

(32)

This problem can subsequently be broken down into 3 stages: a pension deposit stage, a consumption stage, and an income shock stage.

3.2 Breaking down the problem

In the deposit stage, the worker begins with market resources and a retirement savings account. The worker must maximize their value of liquid wealth $\lRat_{t}$ and retirement balance $\bRat_{t}$ by choosing a pension deposit $\dRat_{t}$ , which must be non-negative. The retirement balance $\bRat_{t}$ is the cash value of their retirement account plus their pension deposit and an additional amount $g(\dRat_{t})$ that provides an incentive to save for retirement. As we’ll see, this additional term will allow us to use the Endogenous Grid Method to solve this subproblem.

\begin{split} \vFunc_{t}(\mRat_{t}, \nRat_{t}) & = \max_{\dRat_{t}} \vOpt_{t}(\lRat_{t}, \bRat_{t}) \\ & \text{s.t.} \quad \dRat_{t} \ge 0 \\ \lRat_{t} & = \mRat_{t} - \dRat_{t} \\ \bRat_{t} & = \nRat_{t} + \dRat_{t} + g(\dRat_{t}) \end{split}

(33)

After making their pension decision, the worker begins their consumption stage with liquid wealth $\lRat_{t}$ and retirement balance $\bRat_{t}$ . From their liquid wealth, the worker must choose a level of consumption to maximize utility and continuation value $\wFunc_{t}$ . After consumption, the worker is left with post-decision states that represent liquid assets $\aRat_{t}$ and retirement balance $\bRat_{t}$ , which passes through this problem unaffected because it can’t be liquidated until retirement.

\begin{split} \vOpt_{t}(\lRat_{t}, \bRat_{t}) & = \max_{\cRat_{t}} \util(\cRat_{t}) + \DiscFac \wFunc_{t}(\aRat_{t}, \bRat_{t}) \\ & \text{s.t.} \quad \cRat_{t} \ge 0 \\ \aRat_{t} & = \lRat_{t} - \cRat_{t} \end{split}

(34)

Finally, the post-decision value function $\wFunc_{t}$ represents the value of both liquid and illiquid account balances before the realization of uncertainty regarding the risky return and income shocks. Since we are dealing with a normalized problem, this stage handles the normalization of state variables and value functions into the next period.

\begin{split} \wFunc_{t}(\aRat_{t}, \bRat_{t}) & = \Ex_{t} \left[ \PGro_{t+1}^{1-\CRRA} \vFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \quad \aRat_{t} \ge 0, \quad \bRat_{t} \ge 0 \\ \mRat_{t+1} & = \aRat_{t} \Rfree / \PGro_{t+1} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \bRat_{t} \Risky_{t+1} / \PGro_{t+1} \end{split}

(35)

The advantage of conceptualizing this subproblem as a separate stage is that we can construct a function $\wFunc_{t}$ and use it in the prior optimization problems without having to worry about stochastic optimization and taking expectations repeatedly.
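A sketch of how such a function could be built from discretized shocks is given below. The shock values, their probabilities, the assumption that the three shocks are independent, and the stand-in next-period value function `v_next_example` are all illustrative choices, not taken from the paper.

```python
from itertools import product

CRRA = 2.0    # assumed value
RFREE = 1.02  # assumed value

# Assumed discretizations as (value, probability) pairs; independence is assumed.
PERM_SHOCKS = [(0.98, 0.5), (1.02, 0.5)]   # permanent growth PGro
TRAN_SHOCKS = [(0.80, 0.5), (1.20, 0.5)]   # transitory income shock
RISKY_RETURNS = [(0.95, 0.5), (1.15, 0.5)]  # risky return on the pension account


def v_next_example(m, n):
    """Hypothetical next-period value: CRRA utility of total wealth."""
    return (m + n) ** (1.0 - CRRA) / (1.0 - CRRA)


def w_func(a, b, v_next=v_next_example):
    """Post-decision value of liquid assets a and retirement balance b."""
    total = 0.0
    for (gro, p_g), (theta, p_t), (risky, p_r) in product(
        PERM_SHOCKS, TRAN_SHOCKS, RISKY_RETURNS
    ):
        m_next = a * RFREE / gro + theta   # next period's market resources
        n_next = b * risky / gro           # next period's retirement savings
        total += p_g * p_t * p_r * gro ** (1.0 - CRRA) * v_next(m_next, n_next)
    return total


w_at_one_one = w_func(1.0, 1.0)
```

Evaluating `w_func` once on a grid of $(\aRat_{t}, \bRat_{t})$ points and interpolating the result means the earlier stages never need to loop over shocks themselves.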

3.3The consumption-saving problem¶

As seen in the consumption stage above, the retirement balance $\bRat_{t}$ passes through the problem unaffected because it can’t be liquidated until retirement. In this sense, it is already a post-decision state variable. To solve this problem, we can use a fixed grid of $\bRat_{t}$ and, for each value on that grid, obtain endogenous consumption and pre-consumption liquid wealth $\lRat_{t}$ using the simple Endogenous Grid Method for the consumption problem.
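A minimal sketch of this step, assuming CRRA utility so that the inverse marginal utility is available in closed form. The names `egm_consumption` and `w_a` (a callable returning the marginal post-decision value with respect to liquid assets) and the default parameters are hypothetical.

```python
def egm_consumption(a_grid, b_grid, w_a, beta=0.96, rho=2.0):
    """For each fixed b, invert u'(c) = beta * w_a(a, b) under CRRA utility.

    Returns a dict mapping each b to a list of (l_endog, c) pairs, where
    l_endog = a + c is the endogenous liquid-wealth point implied by the budget.
    """
    solution = {}
    for b in b_grid:
        pairs = []
        for a in a_grid:
            c = (beta * w_a(a, b)) ** (-1.0 / rho)  # inverse marginal utility
            pairs.append((a + c, c))                # budget constraint: l = a + c
        solution[b] = pairs
    return solution
```

For a given value of $\bRat_{t}$, the pairs form a one-dimensional endogenous grid, so standard linear interpolation along the liquid-wealth dimension is enough at this stage.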

3.4The pension deposit problem¶

In the deposit stage, both the state variables and post-decision variables are different since both are affected by the pension deposit decision.

First, we can rewrite the pension deposit problem more compactly:

\vFunc_{t}(\mRat_{t}, \nRat_{t}) = \max_{\dRat_{t}} \vOpt_{t}(\mRat_{t} - \dRat_{t}, \nRat_{t} + \dRat_{t} + \gFunc(\dRat_{t}))

(36)

The first-order condition is

\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})(-1) + \vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})(1+\gFunc'(\dRat_{t})) = 0.

(37)

Rearranging this equation gives

\gFunc'(\dRat_{t}) = \frac{\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})}{\vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})} - 1

(38)

where

\gFunc'(\dRat) = \frac{\xFer}{1+\dRat} \qquad \gFunc'^{-1}(y) = \xFer/y - 1

(39)

Given that $\gFunc'(\dRat)$ exists and is invertible, we can find

\dEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \gFunc'^{-1}\left( \frac{\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})}{\vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})} - 1 \right)

(40)

Using this, we can back out $\nRat_{t}$ as

\nEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \bRat_{t} - \dEndFunc_{t}(\lRat_{t}, \bRat_{t}) - \gFunc(\dEndFunc_{t}(\lRat_{t}, \bRat_{t}))

(41)

and $\mRat_{t}$ as

\mEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \lRat_{t} + \dEndFunc_{t}(\lRat_{t}, \bRat_{t})

(42)

In sum, given an exogenous grid $(\lRat_{t}, \bRat_{t})$ we obtain the triple $\left(\mEndFunc_{t}(\lRat_{t}, \bRat_{t}), \nEndFunc_{t}(\lRat_{t}, \bRat_{t}), \dEndFunc_{t}(\lRat_{t}, \bRat_{t})\right)$ , which we can use to create an interpolator for the decision rule $\dRat_{t}$ .
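The mapping from an exogenous point to the endogenous triple can be sketched as follows. The functional form $\gFunc(\dRat) = \xFer \log(1+\dRat)$ is the one given in the appendix; the value `CHI = 0.1`, the function names, and the scalar inputs `v_l` and `v_b` (the two marginal values at the grid point) are assumptions for illustration. The sketch covers interior solutions only and does not handle the case where the constraint $\dRat_{t} \ge 0$ binds.

```python
import math

CHI = 0.1  # hypothetical value of the incentive parameter chi


def g(d):
    """Incentive term g(d) = chi * log(1 + d)."""
    return CHI * math.log(1.0 + d)


def g_prime(d):
    """Derivative g'(d) = chi / (1 + d)."""
    return CHI / (1.0 + d)


def g_prime_inv(y):
    """Inverse of g': solves chi / (1 + d) = y for d."""
    return CHI / y - 1.0


def egm_deposit(l, b, v_l, v_b):
    """Map an exogenous point (l, b) to the endogenous triple (m, n, d)."""
    d = g_prime_inv(v_l / v_b - 1.0)  # invert the first-order condition
    n = b - d - g(d)                  # back out the pre-deposit pension balance
    m = l + d                         # back out market resources
    return m, n, d
```

Applying `egm_deposit` to every point of a rectilinear $(\lRat_{t}, \bRat_{t})$ grid generally produces scattered $(\mRat_{t}, \nRat_{t})$ points, since the deposit varies across the grid.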

To close the solution method, the envelope conditions are

\begin{split} \vFunc_{t}^{\mRat}(\mRat_{t}, \nRat_{t}) & = \vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t}) \\ \vFunc_{t}^{\nRat}(\mRat_{t}, \nRat_{t}) & = \vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t}) \end{split}

(43)

3.5Unstructured Grid Interpolation¶

Figure 5: A regular, rectilinear exogenous grid of pension balances after deposit $\bRat_{t}$ and liquid wealth after deposit $\lRat_{t}$.

As in Section 2, the resulting endogenous grid is not rectilinear, and in this more complex problem it is not even a regular grid. Starting from the regular and rectilinear exogenous grid of liquid wealth post-deposit $\lRat_{t}$ and pension balances post-deposit $\bRat_{t}$ shown in Figure 5, we obtain the irregular and unstructured endogenous grid of market resources $\mRat_{t}$ and pension balances pre-deposit $\nRat_{t}$ shown in Figure 6.

Figure 6: An irregular, unstructured endogenous grid of market resources $\mRat_{t}$ and pension balances before deposit $\nRat_{t}$.

To interpolate a function defined on an unstructured grid, we use Gaussian Process Regression as in Scheidegger & Bilionis (2019).

4Multivariate Interpolation on Unstructured Grids¶

This section presents alternative interpolation methods for non-rectilinear grids. First, I present the relatively simple case of fast warped interpolation on a curvilinear grid, which improves upon the interpolation in White (2015). Then, I present a machine learning approach to interpolation on unstructured grids based on Gaussian Process Regression as presented in Scheidegger & Bilionis (2019).

4.1Unstructured Grids¶

Unstructured interpolation arises in many dynamic programming applications when using the Endogenous Grid Method because the first-order conditions might be highly non-linear and non-monotonic, or because boundary constraints induce kinks in the policy and value functions. In these cases, the grid points generated by the EGM step are not evenly spaced, leading to the need for curvilinear interpolation. We saw in Section 2 an approach to curvilinear interpolation based on White (2015) that is incapable of interpolation on unstructured grids. A similar approach was presented in Ludwig & Schön (2018), which used Delaunay interpolation. However, this approach is not well suited for our purposes because triangulation can be computationally intensive and slow, often offsetting the efficiency gains from the Endogenous Grid Method.

As an alternative to these methods, I introduce the use of Gaussian Process Regression (GPR) along with the Endogenous Grid Method. GPR is computationally efficient, and tools exist to easily parallelize and take advantage of hardware such as Graphics Processing Units (GPU)^[5].

4.1.1Gaussian Process Regression¶

A Gaussian Process is an infinite-dimensional random process for which every finite subset of random variables is jointly Gaussian, that is, has a multivariate normal distribution.

\begin{gathered} \mathbf{X} \sim \mathcal{N}(\mathbf{\mu}, \mathbf{\Sigma}) \quad \text{s.t.} \quad x_i \sim \mathcal{N}(\mu_i, \sigma_ {ii}) \\ \text{and} \quad \sigma_{ij} = \Ex[(x_i - \mu_i)(x_j - \mu_j)] \quad \forall i,j \in \{1, \ldots, n\}. \end{gathered}

(44)

where

\mathbf{X} = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix} \quad \mathbf{\mu} = \begin{bmatrix} \mu_1 \\ \mu_2 \\ \vdots \\ \mu_n \end{bmatrix} \quad \mathbf{\Sigma} = \begin{bmatrix} \sigma_{11} & \sigma_{12} & \cdots & \sigma_{1n} \\ \sigma_{21} & \sigma_{22} & \cdots & \sigma_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{n1} & \sigma_{n2} & \cdots & \sigma_{nn} \end{bmatrix}.

(45)

Being infinitely dimensional, a Gaussian Process can be used to represent a probability distribution over the space of functions in $n$ dimensions. Thus, a Gaussian Process Regression is used to find the best fit function to a set of data points.

\mathbb{P}(\mathbf{f} | \mathbf{X}) = \mathcal{N}(\mathbf{f} | \mathbf{m}, \mathbf{K})

(46)

where $\mathbf{f}$ is the vector of function values at the points $\mathbf{X}$, $\mathbf{m}$ is the mean of the function, and $\mathbf{K}$ is the covariance matrix generated by a kernel function that describes the covariance between the function values at different points.

A standard kernel function is the squared exponential kernel, or the radial basis function kernel, which is defined as

k(\mathbf{x}_i, \mathbf{x}_j) = \sigma^2_f \exp\left(-\frac{1}{2l^2} (\mathbf{x}_i - \mathbf{x}_j)' (\mathbf{x}_i - \mathbf{x}_j)\right).

(47)

Using GPR to interpolate a function $f$ , we can both predict the value of the function at a point $\mathbf{x}_*$ and the uncertainty in the prediction, which provides useful information as to the accuracy of the approximation.
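To make the prediction step concrete, here is a minimal NumPy sketch of the standard GPR posterior mean and variance under the squared exponential kernel above. It assumes a zero prior mean and adds a small `jitter` term to the diagonal for numerical stability; both are simplifying assumptions, and the function names are illustrative rather than taken from the paper's code. The test function $(xy)^{1/4}$ is the one used for illustration in the paper's footnotes.

```python
import numpy as np


def rbf_kernel(xa, xb, sigma_f=1.0, length=1.0):
    """Squared exponential kernel between points xa (n, d) and xb (m, d)."""
    sq_dist = np.sum((xa[:, None, :] - xb[None, :, :]) ** 2, axis=-1)
    return sigma_f**2 * np.exp(-0.5 * sq_dist / length**2)


def gpr_predict(x_train, y_train, x_test, sigma_f=1.0, length=1.0, jitter=1e-8):
    """Posterior mean and variance of a zero-mean GP conditioned on the data."""
    k_tt = rbf_kernel(x_train, x_train, sigma_f, length) + jitter * np.eye(len(x_train))
    k_ts = rbf_kernel(x_train, x_test, sigma_f, length)  # shape (n, m)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_ts.T @ alpha
    v = np.linalg.solve(chol, k_ts)
    var = sigma_f**2 - np.sum(v**2, axis=0)  # prior variance minus explained part
    return mean, var


# Scattered (unstructured) sample points and the illustrative function (x * y)^(1/4).
rng = np.random.default_rng(0)
x_obs = rng.uniform(0.5, 2.0, size=(30, 2))
y_obs = (x_obs[:, 0] * x_obs[:, 1]) ** 0.25
x_new = np.array([[1.0, 1.0], [1.5, 0.8]])
mean, var = gpr_predict(x_obs, y_obs, x_new, length=0.5, jitter=1e-6)
```

Nothing in the computation requires the sample points to lie on a grid, which is what makes the approach usable on the endogenous grids produced by EGMⁿ.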

4.1.2An example of the GPR¶

In Figure 7, we see the function we are trying to approximate along with a sample of data points for which we know the value of the function. In practice, the value of the function is unknown and/or expensive to compute, so we must use a limited amount of data to approximate it.

Figure 7:The true function that we are trying to approximate and a sample of data points.

As we discussed, a Gaussian Process is an infinite dimensional random process which can be used to represent a probability distribution over the space of functions. In Figure 8, we see a random sample of functions from the GPR posterior, which is a Gaussian Process conditioned on fitting the data. From this small sample of functions, we can see that the GP generates functions that fit the data well, and the goal of GPR is to find the one function that best fits the data given some hyperparameters by minimizing the negative log-likelihood of the data.

Figure 8:A random sample of functions from the GPR posterior that fit the data. The goal of GPR is to find the function that best fits the data.

In Figure 9, we see the result of GPR with a particular parametrization^[6] of the kernel function. The dotted line shows the true function, while the blue dots show the known data points. GPR provides the mean function which best fits the data, represented in the figure as an orange line. The shaded region represents a 95% confidence interval, which is the uncertainty of the predicted function. Along with finding the best fit of the function, GPR provides the uncertainty of the prediction, which is useful information as to the accuracy of the approximation.

Figure 9:GPR finds the function that best fits the data given some hyperparameters. GPR then optimizes over the parameter space to find the function that minimizes the negative log-likelihood of the data.
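The hyperparameter selection mentioned above can be sketched as follows, using the standard negative log marginal likelihood of a zero-mean GP. The crude search over a handful of candidate length scales stands in for the gradient-based optimizers that GPR libraries typically use; the data, candidate values, and function name are assumptions for illustration.

```python
import numpy as np


def neg_log_likelihood(x, y, length, sigma_f=1.0, noise=1e-6):
    """Negative log marginal likelihood of a zero-mean GP with an RBF kernel."""
    sq_dist = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=-1)
    k = sigma_f**2 * np.exp(-0.5 * sq_dist / length**2) + noise * np.eye(len(x))
    chol = np.linalg.cholesky(k)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y))
    # 0.5 * y' K^{-1} y + 0.5 * log|K| + (n / 2) * log(2 * pi)
    return 0.5 * y @ alpha + np.sum(np.log(np.diag(chol))) + 0.5 * len(x) * np.log(2 * np.pi)


# Hypothetical data: a handful of observations of a smooth function.
x_obs = np.linspace(0.0, 5.0, 12)[:, None]
y_obs = np.sin(x_obs[:, 0])

# Pick the candidate length scale with the smallest negative log-likelihood.
candidates = [0.1, 0.3, 1.0, 3.0]
best_length = min(candidates, key=lambda ell: neg_log_likelihood(x_obs, y_obs, ell))
```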

5Conditions for using the Sequential Endogenous Grid Method¶

5.1Splitting the problem into subproblems¶

The first step in using the Sequential Endogenous Grid Method is to split the problem into subproblems. This process of splitting up the problem has to be strategic to not insert additional complexity into the original problem. If one is not careful when doing this, the subproblems can become more complex and intractable than the original problem.

To split up the problem, we first count the number of control variables or decisions faced by the agent. Ideally, if the agent has $n$ control variables, then the problem should be split into $n$ subproblems, each handling a different control variable. When counting the control variables, it is important not to double count variables which are equivalent and have market clearing conditions. For example, the decision of how much to consume and how much to save may seem like two different choices, but because of the market clearing condition $\cRat + \aRat = \mRat$ they are resolved simultaneously and count as only one decision variable. Similarly, the choices of labor and leisure are made simultaneously and count as only one decision.

Having counted our control variables, we look for differentiable and invertible utility functions which are separable in the dynamic programming problem, such as in Section 2 of the paper, or differentiable and invertible functions in the transition, as in Section 3 of the paper.

5.1.1Separable utility functions¶

In Section 2, we have additively separable utility of consumption and leisure, which allows for each of these control variables to be handled by separate subproblems. So, it makes sense to split the utility between subproblems and attach one to the consumption subproblem and one to the leisure subproblem.

As mentioned in that section, however, there are only two separable utility functions in the problem which have been assigned to two subproblems already. This leaves one control variable without a separable utility function. In that case, there is not another Endogenous Grid Method step to exploit, and this subproblem has to be handled by standard convex optimization techniques such as maximization of the value function (VFI) or finding the root of the Euler equation (PFI).

Now that we have split the problem into conceptual subproblems, it is important to sequence them in such a way that they don’t become more complex than the original problem. The key here is to avoid adding unnecessary state variables. For example, in the consumption-leisure-portfolio problem, if we were to choose consumption first, we would have to track the wage rate into the following leisure subproblem. This would mean that our consumption problem would be two-dimensional as well as our labor decision problem. As presented, the choice of order in Section 2 ensures that the consumption problem is one-dimensional, as we can shed the information about the wage rate offer after the agent has made their labor-leisure decision. If we did this the other way, the problem would be more complex and require additional computational resources.

The consumption subproblem would be two-dimensional instead of one-dimensional, adding more complexity,

\begin{split} \vFunc(\bRat, \tShkEmp) & = \max_{\cRat} \uFunc(\cRat) + \vOpt(\bRat', \tShkEmp) \\ & \text{s.t.}\\ \bRat' & = \bRat - \cRat \ge - \tShkEmp \end{split}

(48)

while the labor-leisure subproblem would have an additional constraint

\begin{split} \vOpt(\bRat', \tShkEmp) & = \max_{\leisure} \h(\leisure) + \vEnd(\aRat) \\ & \text{s.t.} \\ 0 & \le \leisure \le 1 \\ \aRat & = \bRat' + \tShkEmp(1 - \leisure) \ge 0. \end{split}

(49)

Therefore, strategic ordering of subproblems can greatly simplify the solution process and reduce the computational burden.

Consider the utility function of the form

\UFunc( \yRat) = \uFunc_{-i}( \yRat^{-i}) + \uFunc_i(\yRat^i)

(50)

where $\yRat^{i}$ is the $i$-th control variable and $\yRat^{-i}$ is the vector of all control variables except the $i$-th one. This utility function is separable in the state and control variables that correspond to the index $i$. The general problem can then be written as

\begin{split} \VFunc_{t}(\xRat_t, \sRat_t) &= \max_{\yRat_t \in \Gamma_t(\xRat_t, \sRat_t)} \UFunc(\yRat_t) + \DiscFac \Ex_{t} \left[ \VFunc_{t+1}(\xRat_{t+1}, \sRat_{t+1}) | \tilde{\xRat}_t, \sRat_t \right] \\ & \text{s.t.} \\ \tilde{\xRat}_t &= \TFunc_t(\xRat_t, \yRat_t) \\ \xRat_{t+1} &= \GFunc_{t+1}(\tilde{\xRat}_t, \sRat_t) \\ \end{split}

(51)

For simplicity, define

\WFunc_t(\tilde{\xRat}_t, \sRat_t) = \DiscFac \Ex_{t} \left[ \VFunc_{t+1}(\GFunc_{t+1}(\tilde{\xRat}_t, \sRat_t), \sRat_{t+1}) | \tilde{\xRat}_t, \sRat_t \right]

(52)

then

\begin{split} \VFunc_{t}(\xRat_t, \sRat_t) &= \max_{\yRat_t \in \Gamma_t(\xRat_t, \sRat_t)} \UFunc( \yRat_t) + \WFunc_t(\tilde{\xRat}_t, \sRat_t) \\ & \text{s.t.} \\ \tilde{\xRat}_t &= \TFunc_t(\xRat_t, \yRat_t) \end{split}

(53)

The first-order condition with respect to $\yRat_t^i$ is

\frac{\partial \UFunc( \yRat_t)}{\partial \yRat_t^i} + \sum_{j=1}^{n} \frac{\partial \WFunc_t(\tilde{\xRat}_t, \sRat_t)}{\partial \tilde{\xRat}_{t}^j} \frac{\partial \TFunc_{t}^j(\xRat_t, \yRat_t)}{\partial \yRat_t^i} = 0

(54)

We require $\frac{\partial \TFunc_{t}^j(\xRat_t, \yRat_t)}{\partial \yRat_t^i} = 0$ for $j \neq i$ to be able to solve for $\yRat_t^i$, so that the first-order condition reduces to

\frac{\partial \UFunc( \yRat_t)}{\partial \yRat_t^i} + \frac{\partial \WFunc_t(\tilde{\xRat}_t, \sRat_t)}{\partial \tilde{\xRat}_{t}^i} \frac{\partial \TFunc_{t}^i(\xRat_t, \yRat_t)}{\partial \yRat_t^i} = 0

(55)

5.1.2Differentiable and invertible transition¶

In Section 3, we see that a problem with a differentiable and invertible transition can also be used to embed an additional Endogenous Grid Method step. Because the transition applies independently to a state variable that is not related to the other control variable, consumption, it can be handled separately from the consumption subproblem.

In this particular problem, however, it turns out to make no difference how we order the two subproblems. This is because the control variables, consumption and pension deposit, each affect a separate resource account, namely market resources and pension balance. Because of this, the two subproblems are independent of each other and can be solved in any order.

A good rule of thumb is that when splitting up a problem into subproblems, we should try to reduce the information set that is passed onto the next subproblem. In Section 2, choosing leisure-labor and realizing total market resources before consumption allows us to shed the wage rate offer state variable before the consumption problem, and we know that for the portfolio choice we only need to know liquid assets after expenditures (consumption). Thus, the order makes intuitive sense: the agent first chooses leisure-labor, realizing total market resources, then chooses consumption and savings, and finally chooses their risky portfolio share. In Section 3, there are two expenditures that are independent of each other, consumption and deposit, and making one decision or the other first does not reduce the information set for the agent, so the order of these subproblems does not matter.

5.2The Endogenous Grid Method for Subproblems¶

Once we have strategically split the problem into subproblems, we can use the Endogenous Grid Method in each applicable subproblem while iterating backwards from the terminal period. As we discussed in Sections 2 and 3, the EGM step can be applied when there is a separable, differentiable and invertible utility function in the subproblem or when there is a differentiable and invertible transition in the subproblem. We will discuss each of these cases in turn.

5.2.1Utility function¶

A generic subproblem with a differentiable and invertible utility function can be characterized as follows:

\begin{split} \VFunc(\xRat) & = \max_{\yRat \in \PGro(\xRat)} \UFunc(\xRat, \yRat) + \DiscFac \WFunc(\aRat) \\ & \text{s.t.} \\ \aRat & = \TFunc(\xRat,\yRat) \end{split}

(56)

For an interior solution, the first-order condition is thus

\UFunc'_{\yRat}(\xRat, \yRat) + \DiscFac \WFunc'(\aRat)\TFunc'_{\yRat}(\xRat,\yRat) = 0

(57)

If, as we assumed, the utility function is differentiable and invertible, then the Endogenous Grid Method consists of

\yRat = \left(\UFunc'_{\yRat}(\xRat, \yRat)\right)^{-1} \left[ -\DiscFac \WFunc'(\aRat)\TFunc'_{\yRat}(\xRat,\yRat)\right]

(58)

By using an exogenous grid of the post-decision state $\aRat$ , we can solve for the optimal decision rule $\yRat$ at each point on the grid. This is the Endogenous Grid Method step.
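As a worked instance of this step, assume CRRA utility that depends only on the control and a linear transition (both are illustrative assumptions, chosen to match the familiar consumption-saving case). Since $\TFunc'_{\yRat} = -1$, the inversion has a closed form, and the endogenous pre-decision state follows directly from the transition:

\UFunc(\xRat, \yRat) = \frac{\yRat^{1-\CRRA}}{1-\CRRA}, \qquad \TFunc(\xRat, \yRat) = \xRat - \yRat \quad \Longrightarrow \quad \yRat = \left( \DiscFac \WFunc'(\aRat) \right)^{-1/\CRRA}, \qquad \xRat = \aRat + \yRat

Each exogenous point $\aRat$ thus yields a pair $(\xRat, \yRat)$ without any root finding.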

5.2.2Transition¶

If the generic subproblem has no separable utility, but instead has a differentiable and invertible transition, then the Endogenous Grid Method can still be used.

\begin{split} \VFunc(\xRat) & = \max_{\yRat \in \PGro(\xRat)} \WFunc(\aRat) \\ & \text{s.t.} \\ \aRat & = \TFunc(\xRat,\yRat) \end{split}

(59)

Here, the first-order condition is

\WFunc'(\aRat)\TFunc'_{\yRat}(\xRat,\yRat) = 0

(60)

and, when $\TFunc'_{\yRat}$ is invertible in $\yRat$, the Endogenous Grid Method step can be written schematically as

\yRat = \left(\TFunc'_{\yRat}(\xRat,\yRat)\right)^{-1} \left[ 1 / \WFunc'(\aRat)\right]

(61)

6Conclusion¶

This paper introduces a novel method for solving dynamic stochastic optimization problems called the Sequential Endogenous Grid Method (EGMⁿ). Given a problem with multiple decisions (or control variables), the Sequential Endogenous Grid Method proposes separating the problem into a sequence of smaller subproblems that can be solved sequentially by using more than one EGM step. Then, depending on the resulting endogenous grid from each subproblem, this paper proposes different methods for interpolating functions on non-rectilinear grids, called the Warped Grid Interpolation (WGI) and the Gaussian Process Regression (GPR) method.

EGMⁿ is similar to the Nested Endogenous Grid Method (NEGM)^[7] and the Generalized Endogenous Grid Method (G2EGM)^[8] in that it can solve problems with multiple decisions, but it differs from these methods in that by choosing the subproblems strategically, we can take advantage of multiple sequential EGM steps to solve complex multidimensional models in a fast and efficient manner. Additionally, the use of machine learning tools such as the GPR overcomes bottlenecks seen in unstructured interpolation using Delaunay triangulation and other similar methods.

7Appendix: Solving the illustrative G2EGM model with EGMⁿ¶

7.1The problem for a retired household¶

I denote by $\wFunc_{t}(\mRat_{t})$ the value function of a retired household at time $t$ with total resources $\mRat_{t}$. The retired household solves a simple consumption-savings problem with no income uncertainty and a certain next-period pension of $\underline{\tShkEmp}$.

\begin{split} \wFunc_{t}(\mRat_{t}) & = \max_{\cRat_{t}} \util(\cRat_{t}) + \DiscFac \wFunc_{t+1}(\mRat_{t+1}) \\ & \text{s.t.} \\ \aRat_{t} & = \mRat_{t} - \cRat_{t} \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \underline{\tShkEmp} \end{split}

(62)

Notice that there is no uncertainty and the household receives a retirement income $\underline{\tShkEmp}$ every period until death.

7.2The problem for a worker household¶

The value function of a worker household is

\VFunc_{t}(\mRat_{t}, \nRat_{t}) = \Ex_\error \max \left\{ \vFunc_{t}(\mRat_{t}, \nRat_{t}, \Work) + \sigma_{\error} \error_{\Work} , \vFunc_{t}(\mRat_{t}, \nRat_{t}, \Retire) + \sigma_{\error} \error_{\Retire} \right\}

(63)

where the choice specific problem for a working household that decides to continue working is

\begin{split} \vFunc_{t}(\mRat_{t}, \nRat_{t}, \Work) & = \max_{\cRat_{t}, \dRat_{t}} \util(\cRat_{t}) - \kapShare + \DiscFac \Ex_{t} \left[ \VFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \\ \aRat_{t} & = \mRat_{t} - \cRat_{t} - \dRat_{t} \\ \bRat_{t} & = \nRat_{t} + \dRat_{t} + \gFunc(\dRat_{t}) \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \Rfree_{\bRat} \bRat_{t} \end{split}

(64)

and the choice specific problem for a working household that decides to retire is

\vFunc_{t}(\mRat_{t}, \nRat_{t}, \Retire) = \wFunc_{t}(\mRat_{t}+\nRat_{t})

(65)

7.3Applying the Sequential EGM¶

The first step is to define a post-decision value function. Once the household decides their level of consumption and pension deposits, they are left with liquid assets they are saving for the future and illiquid assets in their pension account which they can’t access again until retirement. The post-decision value function can be defined as

\begin{split} \vEnd_{t}(\aRat_{t}, \bRat_{t}) & = \DiscFac \Ex_{t} \left[ \VFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \Rfree_{\bRat} \bRat_{t} \end{split}

(66)

Then redefine the working agent’s problem as

\begin{split} \vFunc_{t}(\mRat_{t}, \nRat_{t}, \Work) & = \max_{\cRat_{t}, \dRat_{t}} \util(\cRat_{t}) - \kapShare + \vEnd_{t}(\aRat_{t}, \bRat_{t}) \\ \aRat_{t} & = \mRat_{t} - \cRat_{t} - \dRat_{t} \\ \bRat_{t} & = \nRat_{t} + \dRat_{t} + \gFunc(\dRat_{t}) \\ \end{split}

(67)

Clearly, the structure of the problem remains the same, and this is the problem that G2EGM solves. We’ve only moved some of the stochastic mechanics out of the problem. Now, we can apply the sequential EGMⁿ method. Let the agent first decide $\dRat_{t}$ , the deposit amount into their retirement; we will call this the deposit problem, or outer loop. Thereafter, the agent will have net liquid assets of $\lRat_{t}$ and pension assets of $\bRat_{t}$ .

\begin{split} \vFunc_{t}(\mRat_{t}, \nRat_{t}, \Work) & = \max_{\dRat_{t}} \vOpt_{t}(\lRat_{t}, \bRat_{t}) \\ & \text{s.t.} \\ \lRat_{t} & = \mRat_{t} - \dRat_{t} \\ \bRat_{t} & = \nRat_{t} + \dRat_{t} + \gFunc(\dRat_{t}) \end{split}

(68)

Now, the agent can move on to picking their consumption and savings; we can call this the pure consumption problem or inner loop.

\begin{split} \vOpt_{t}(\lRat_{t}, \bRat_{t}) & = \max_{\cRat_{t}} \util(\cRat_{t}) - \kapShare + \vEnd_{t}(\aRat_{t}, \bRat_{t}) \\ & \text{s.t.} \\ \aRat_{t} & = \lRat_{t} - \cRat_{t} \\ \end{split}

(69)

Because we’ve already made the pension decision, the amount of pension assets does not change in this loop and it just passes through to the post-decision value function.

7.4Solving the problem¶

7.4.1Solving the Inner Consumption Saving Problem¶

Let’s start with the pure consumption-saving problem, which we can summarize by substitution as

\vOpt_{t}(\lRat_{t}, \bRat_{t}) = \max_{\cRat_{t}} \util(\cRat_{t}) - \kapShare + \vEnd_{t}(\lRat_{t} - \cRat_{t}, \bRat_{t})

(70)

The first-order condition is

\util'(\cRat_{t}) = \vEnd_{t}^{\aRat}(\lRat_{t}-\cRat_{t}, \bRat_{t}) = \vEnd_{t}^{\aRat}(\aRat_{t}, \bRat_{t})

(71)

We can invert this Euler equation as in standard EGM to obtain the consumption function.

\cEndFunc_{t}(\aRat_{t}, \bRat_{t}) = \util'^{-1}\left(\vEnd_{t}^{\aRat}(\aRat_{t}, \bRat_{t})\right)

(72)

Again as before, $\lEndFunc_{t}(\aRat_{t}, \bRat_{t}) = \cEndFunc_{t}(\aRat_{t}, \bRat_{t}) + \aRat_{t}$ . To sum up, using an exogenous grid of $(\aRat_{t}, \bRat_{t})$ we obtain the trio $(\cEndFunc_{t}(\aRat_{t}, \bRat_{t}), \lEndFunc_{t}(\aRat_{t}, \bRat_{t}), \bRat_{t})$ which provides an interpolating function for our optimal consumption decision rule over the $(\lRat, \bRat)$ grid. Without loss of generality, assume $\lEndFunc_{t} = \lEndFunc_{t}(\aRat_{t}, \bRat_{t})$ and define the interpolating function as

\cTarg_{t}(\lEndFunc_{t}, \bRat_{t}) \equiv \cEndFunc_{t}(\aRat_{t}, \bRat_{t})

(73)

For completeness, we derive the envelope conditions as well, and as we will see, these will be useful when solving the next section.

\begin{split} \vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t}) & = \vEnd_{t}^{\aRat}(\aRat_{t}, \bRat_{t}) = \util'(\cRat_{t}) \\ \vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t}) & = \vEnd_{t}^{\bRat}(\aRat_{t}, \bRat_{t}) \end{split}

(74)

7.4.2Solving the Outer Pension Deposit Problem¶

Now, we can move on to solving the deposit problem, which we can also summarize as

\vFunc_{t}(\mRat_{t}, \nRat_{t}, \Work) = \max_{\dRat_{t}} \vOpt_{t}(\mRat_{t} - \dRat_{t}, \nRat_{t} + \dRat_{t} + \gFunc(\dRat_{t}))

(75)

The first-order condition is

\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})(-1) + \vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})(1+\gFunc'(\dRat_{t})) = 0

(76)

Rearranging this equation gives

\gFunc'(\dRat_{t}) = \frac{\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})}{\vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})} - 1

(77)

Assuming that $\gFunc'(\dRat)$ exists and is invertible, we can find

\dEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \gFunc'^{-1}\left( \frac{\vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t})}{\vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t})} - 1 \right)

(78)

Using this, we can back out $\nRat_{t}$ as

\nEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \bRat_{t} - \dEndFunc_{t}(\lRat_{t}, \bRat_{t}) - \gFunc(\dEndFunc_{t}(\lRat_{t}, \bRat_{t}))

(79)

and $\mRat_{t}$ as

\mEndFunc_{t}(\lRat_{t}, \bRat_{t}) = \lRat_{t} + \dEndFunc_{t}(\lRat_{t}, \bRat_{t})

(80)

To close the solution method, the envelope conditions are

\begin{split} \vFunc_{t}^{\mRat}(\mRat_{t}, \nRat_{t}, \Work) & = \vOpt_{t}^{\lRat}(\lRat_{t}, \bRat_{t}) \\ \vFunc_{t}^{\nRat}(\mRat_{t}, \nRat_{t}, \Work) & = \vOpt_{t}^{\bRat}(\lRat_{t}, \bRat_{t}) \end{split}

(81)

7.5Is g invertible?¶

We’ve already seen that $\util'(\cdot)$ is invertible, but is $\gFunc'(\cdot)$? For the functional form used here, it is:

\gFunc(\dRat) = \xFer \log(1+\dRat) \qquad \gFunc'(\dRat) = \frac{\xFer}{1+\dRat} \qquad \gFunc'^{-1}(y) = \xFer/y - 1

(82)
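A short check of this claim, assuming $\xFer > 0$ and $\dRat > -1$: the second derivative is strictly negative, so $\gFunc'$ is strictly decreasing and therefore one-to-one, and composing it with the stated inverse recovers the deposit.

\gFunc''(\dRat) = -\frac{\xFer}{(1+\dRat)^{2}} < 0, \qquad \gFunc'^{-1}\left(\gFunc'(\dRat)\right) = \frac{\xFer}{\xFer/(1+\dRat)} - 1 = \dRat

Note that a nonnegative deposit requires the argument of $\gFunc'^{-1}$ to lie in $(0, \xFer]$, since $\gFunc'(\dRat) \le \xFer$ for $\dRat \ge 0$.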

7.6The Post-Decision Value and Marginal Value Functions¶

\begin{split} \vEnd_{t}(\aRat_{t}, \bRat_{t}) & = \DiscFac \Ex_{t} \left[ \VFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \Rfree_{\bRat} \bRat_{t} \end{split}

(83)

and

\begin{split} \vEnd_{t}^{\aRat}(\aRat_{t}, \bRat_{t}) & = \DiscFac \Rfree_{\aRat} \Ex_{t} \left[ \VFunc^{\mRat}_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \Rfree_{\bRat} \bRat_{t} \end{split}

(84)

and

\begin{split} \vEnd_{t}^{\bRat}(\aRat_{t}, \bRat_{t}) & = \DiscFac \Rfree_{\bRat} \Ex_{t} \left[ \VFunc^{\nRat}_{t+1}(\mRat_{t+1}, \nRat_{t+1}) \right] \\ & \text{s.t.} \\ \mRat_{t+1} & = \Rfree_{\aRat} \aRat_{t} + \tShkEmp_{t+1} \\ \nRat_{t+1} & = \Rfree_{\bRat} \bRat_{t} \end{split}

(85)

7.7Taste Shocks¶

From discrete choice theory and from the DCEGM paper, we know that

\Ex_{t} \left[ \VFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}, \error_{t+1}) \right] = \sigma_\error \log \left[ \sum_{\Decision \in \{\Work, \Retire\}} \exp \left( \frac{\vFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}, \Decision)}{\sigma_\error} \right) \right]

(86)

and

\Prob_{t}(\Decision ~ \lvert ~ \mRat_{t+1}, \nRat_{t+1}) = \frac{\exp \left( \vFunc_{t + 1}(\mRat_{t+1}, \nRat_{t+1}, \Decision) / \sigma_\error \right) }{ \sum\limits_{\Decision \in \{\Work, \Retire\}} \exp \left( \frac{\vFunc_{t+1}(\mRat_{t+1}, \nRat_{t+1}, \Decision)}{\sigma_\error} \right)}

(87)

The marginal value with respect to market resources, which enters the first-order conditions, is therefore

\vOptAlt_{t}^{\mRat}(\mRat_{t+1}, \nRat_{t+1}) = \sum_{\Decision \in \{\Work, \Retire\}} \Prob_{t}(\Decision ~ \lvert ~ \mRat_{t+1}, \nRat_{t+1}) \vFunc_{t+1}^{\mRat}(\mRat_{t+1}, \nRat_{t+1}, \Decision)

(88)
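The log-sum and choice probability formulas above can be evaluated as in the sketch below. Subtracting the maximum value before exponentiating is a standard device to avoid overflow when $\sigma_\error$ is small; it leaves both formulas unchanged. The function names and the dictionary-based interface are assumptions for illustration.

```python
import math


def logsum_and_probs(values, sigma):
    """Log-sum of choice-specific values and the implied choice probabilities.

    `values` maps each discrete choice to its choice-specific value and
    `sigma` is the scale of the taste shock.
    """
    v_max = max(values.values())
    exps = {k: math.exp((v - v_max) / sigma) for k, v in values.items()}
    total = sum(exps.values())
    logsum = v_max + sigma * math.log(total)
    probs = {k: e / total for k, e in exps.items()}
    return logsum, probs


def marginal_value(marginals, probs):
    """Probability-weighted sum of choice-specific marginal values."""
    return sum(probs[k] * marginals[k] for k in probs)
```

As $\sigma_\error$ shrinks, the probabilities concentrate on the choice with the higher value and the log-sum approaches the maximum of the two choice-specific values.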

Acknowledgments¶

I would like to thank Chris Carroll, Matthew White, and Simon Scheidegger for their helpful comments and suggestions. The remaining errors are my own. All figures and other numerical results were produced using the Econ-ARK/HARK toolkit (Carroll et al., 2018). Additional libraries used in the production of this paper include but are not limited to: scipy (Virtanen et al., 2020), numpy (Harris et al., 2020), numba (Lam et al., 2015), cupy (Okuta et al., 2017), scikit-learn (Pedregosa et al., 2011), pytorch (Paszke et al., 2019), and gpytorch (Gardner et al., 2018).

Footnotes¶

[1] Barillas & Fernández-Villaverde (2007), Maliar & Maliar (2013), Fella (2014), White (2015), and Iskhakov et al. (2017), among others.
[2] As in Carroll (2009), where the utility of normalized consumption and leisure is defined as
$\utilFunc(\cRat_{t}, \leisure_{t}) = \PLev_{t}^{1-\CRRA} \frac{\cRat_{t}^{1-\CRRA}}{1-\CRRA} + (\labShare\PLev_{t}) ^{1-\CRRA} \frac{\leisure_{t}^{1-\leiShare}}{1-\leiShare}$
(29)
[3] For this illustration, we generate $z$’s arbitrarily using the function
$f(x,y) = (xy)^{1/4}.$
(30)
[4] For more examples of the Warped Grid Interpolation method in action, see the github project alanlujan91/multinterp.
[5] Gardner et al. (2018).
[6] For details see notebook.
[7] Druedahl (2021).
[8] Druedahl & Jørgensen (2017).

References¶

Carroll, C., Kaufman, A., Kazil, J., Palmer, N., & White, M. (2018). The econ-ARK and HARK: Open source tools for computational economics. In F. Akici, D. Lippa, D. Niederhut, & M. Pacer (Eds.), Proceedings of the Python in Science Conference (pp. 25–30). SciPy. 10.25080/majora-4af1f417-004
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., … SciPy 1.0 Contributors. (2020). SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods, 17(3), 261–272. 10.1038/s41592-019-0686-2
Harris, C. R., Millman, K. J., van der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., Wieser, E., Taylor, J., Berg, S., Smith, N. J., Kern, R., Picus, M., Hoyer, S., van Kerkwijk, M. H., Brett, M., Haldane, A., Del Río, J. F., Wiebe, M., Peterson, P., … Oliphant, T. E. (2020). Array programming with NumPy. Nature, 585(7825), 357–362. 10.1038/s41586-020-2649-2
Lam, S. K., Pitrou, A., & Seibert, S. (2015). Numba: a LLVM-based Python JIT compiler. Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, Article 7, 1–6. 10.1145/2833157.2833162
Okuta, R., Unno, Y., Nishino, D., Hido, S., & Loomis, C. (2017). Cupy: A numpy-compatible library for nvidia gpu calculations. Proceedings of Workshop on Machine Learning Systems (LearningSys) in the Thirty-First Annual Conference on Neural Information Processing Systems (NIPS), 5.
