## Abstract

Urban theory models cities as spatial equilibria to derive their aggregate properties as functions of extensive variables, such as population size. However, this assumption seems at odds with cities’ most interesting properties as engines of fast and variable processes of growth and change. Here, we build a general statistical dynamics of cities across scales, from single agents to entire urban systems. We include agents’ strategic behavior to produce predictable growth rates, which requires balancing relative incomes and costs over time. We implement these dynamics using stochastic differential equations and control theory to demonstrate a number of general emergent properties of cities deriving from limit theorems applied to growth rates. This framework establishes necessary conditions for scaling to be conserved by urban dynamics and shows how exponent corrections can be calculated. These ideas are tested using stochastic simulations and a long timeseries for 382 US Metropolitan Areas over nearly five decades.

## INTRODUCTION

Classical approaches to urban theory—in economic geography (*1*–*3*) and, more recently, in complex systems (*4*)—often treat cities as spatial equilibria, where a balance of benefits and costs is achieved out of a set of social and economic exchanges, including wages, land rents, and transportation costs (*1*, *3*–*5*). While these modeling approaches have proven powerful for generating quantitative predictions in agreement with many observed properties of cities (*3*–*5*), they leave unresolved two fundamental problems: the problem of statistics and the problem of growth.

Both growth and statistics denote a broad set of phenomena that must be unpacked so that we can fully appreciate what is at stake. By statistics, we mean that in dealing with real cities, we must appreciate the wide variation between individuals and places. This variation has positive manifestations in that cities are extremely diverse in terms of the types of the people and lifestyles they support, including a broad set of coexisting cultures, professions, languages, races, and ethnicities (*6*–*9*). This interdependent functional diversity is what J. Jacobs famously called organized complexity and is at the heart of the “kind of problem a city is” (*10*). Negative expressions of these same heterogeneities are also familiar, such as ethnic, racial, and economic segregation (*11*, *12*), inequality, and variable access to justice and opportunity. Moreover, it has been observed that these differences between places and people within each city are persistent over time (*9*, *12*) and do not have the fleeting character of noisy fluctuations in statistical physics. Instead, they can pile up over time and lead to patterns of cumulative advantage and disadvantage (*9*, *13*), which are at the root of most challenges of human development. Thus, the problem of statistics in cities deals not only with the existence of structural differences on how the same quantity is distributed across different people and places but also with the temporal persistence and amplification of these effects.

By growth, we mean that (modern) cities are characterized by fast, typically exponential change across many variables. On the one hand, modern cities tend to experience annual population growth rates between about a fraction of 1 and 3 to 4%, as we shall see below. Exceptions exist at either end at least over some periods of time, as different places experience contextually specific factors. However, the principal change in modern cities is the fast pace of their economic growth and technological transformations. Across the world today, we observe rates of urban economic growth that are typically larger than those in their corresponding populations, reaching in some cases 10% a year, with 2 to 4% being typical (*14*). These growth rates mean that the size of a city’s economy doubles every few decades, making it possible to transition from poverty to wealth in one or two generations, as has happened in many places over the last century. With such fast growth rates at play, how is it tenable to model cities as spatial equilibria? Even more importantly, how do different resource growth rates, experienced by different households, neighborhoods, or cities, shape the heterogeneity (inequality) of outcomes for different people? Why are cities not torn apart by differential growth more often?

It turns out that these two problems, of statistics and of growth, are intimately connected and must be tackled together. This is a twist on classical statistical mechanics linking the strength of fluctuations to dissipation around equilibrium (*15*), which, in the context of exponential growth processes in populations, takes a character that we may call fluctuation-amplification, typical of evolutionary processes (*16*). The literature of complex systems applied to urban growth, especially from the perspective of geography, has demonstrated the importance of this type of stochastic nonlinear dynamics with strong feedbacks for a number of decades (*17*, *18*).

To continue to make progress on these issues, an analytical approach is necessary that identifies and articulates the essential joint mechanics of scaling, growth, and statistics in cities. Here, we show how this synthesis can be constructed and illustrate its theoretical and empirical implications using stochastic simulations and a particularly long time series of wages and populations of U.S. metropolitan statistical areas (MSAs) covering nearly five decades. The central insight is the realization that the budget constraint used to define functional cities (*1*–*4*) is not static or homogeneous across agents but must be managed adaptively to promote growth and avoid instability. This is because what is being controlled are stochastic net resource flows over time, as incomes minus costs, not static quantities such as forces. It is precisely this fluctuating difference that accumulates to drive resource growth for each agent and for cities as populations of agents, with implications for both aggregate growth and inequality.

The manuscript is organized following the scheme of Fig. 1. We first show how scaling analysis isolates the parameters that control urban growth, providing a number of quantitative targets for explanation and prediction. We then introduce standard theory for stochastic geometric growth and outline its properties. This allows us to establish the connection between well-known models of cities as spatial equilibria and processes of stochastic growth for individual agents. This connection disaggregates a city-wide budget condition to the level of agents and requires that they act strategically in their own self-interest to control fluctuations in net incomes, which is naturally handled via adaptive temporal averaging of expenditures. This is the central assumption of this manuscript, which leads to the expectation that net income volatilities are kept finite and small and that temporal averages of incomes and expenditures become statistically dependent. A number of results follow from standard limit theorems: The statistics of resources, incomes, and costs at the agent and group level become asymptotically lognormal, even as these quantities grow exponentially. The self-similarity of growth processes across group sizes also emerges, defining running couplings characterizing the mean growth rate for populations and its associated volatility. Explicitly computing these quantities allows us to identify the circumstances when urban scaling is conserved by the system dynamics. Conversely, we show when these conditions are violated, creating corrections to mean-field scaling exponents when volatilities are scale dependent but small and the breakdown of scaling if they become large. These procedures are illustrated using data on wages for U.S. metropolitan areas. We finish with a discussion of the significance of the results toward a general statistical dynamics of cities and its relation to analogous dynamics of resource flows in other complex systems.

## RESULTS

### Scaling relations and statistical deviations

Scaling analysis provides a simple and straightforward way to characterize urban quantities (*19*) and extract average agglomeration effects at play across cities of different sizes in an urban system. This section also shows that scaling analysis provides an efficient parameterization of growth processes, different from growth accounting in economics (*20*).

The starting point is to write an extensive positive quantity, *Y _{i}*(

*t*), here the total wages paid in city

*i*over a time period

*t*, as

*Y*

_{0}(

*t*) is time dependent but independent of city population size

*N*(

_{i}*t*), which also changes over time due to standard demographic processes. The parameter β is the scaling exponent, and the quantity ξ

*(*

_{i}*t*) is the time-dependent deviation from the average scaling prediction for city

*i*. This expression is exact because any deviation from the average scaling relation,

*Y*(

_{i}*N*,

_{i}*t*) =

*Y*

_{0}(

*t*)

*N*(

_{i}*t*)

^{β}, is absorbed in the residuals ξ

*(*

_{i}*t*) (see section S1).

Averaging over cities can now be used to isolate a few quantities of interest. To do this, let us first take the logarithm of Eq. 1*Y*(*t*)〉 = ln *Y*_{0}(*t*) + β〈ln *N _{i}*〉. This ensemble average is defined explicitly by

*N*is the total number of cities in the urban system such as the United States. We will refer to the quantities 〈ln

_{c}*Y*(

*t*)〉 and 〈ln

*N*(

*t*)〉 as centers (

*21*), which are collective coordinates tracking the temporal motion of the entire system of cities (yellow symbols in Fig. 2, A and B) analogous to center-of-mass coordinates in many-body physics.

By definition, the ensemble average of the deviations is zero, 〈ξ(*t*)〉 = 0, so that the second (variance) and higher moments of the ξ become the leading quantities of interest. From these expressions, we can write the deviations ξ* _{i}*(

*t*) as

The first expression is the most common interpretation of the ξ* _{i}* as (multiplicative) residuals from the scaling relation, whereas the second makes their status as deviations from the collective coordinates (centers) explicit. For these reasons, the ξ

*(*

_{i}*t*) provides a city size-independent measure of city performance and are also known as scale-adjusted metropolitan indicators (SAMIs) (

*22*). Characterizing these three quantities, the two centers and the statistics of the ξ, gives a complete description of growth and deviations in a system of cities and separates collective effects from idiosyncratic events in each city.

Figure 2 illustrates the meaning of these quantities. Figure 2A shows the total wages, *Y _{i}*(

*t*), for U.S. MSAs between 1969 and 2016 (47 years), year by year (colors light blue to brown). The growth trajectory of some specific cities, such as New York City, Los Angeles, Chicago, or Silicon Valley (San Jose–Santa Clara MSA), is easily visualized in this way. The solid lines show the scaling relation for each year (see caption for details). We see how scaling is a good fit to the data each year, reproducing a slowly shifting spatial equilibrium in each instance (inset) (

*4*). We also see how the position of the center (yellow squares) moves from year to year, reflecting the overall growth in population (shifts to the right) and especially in wages (movement upward). These results include both real and nominal growth of wages due to inflation.

Figure 2B shows the result of removing collective growth by moving all data clouds so that their centers coincide at the origin (0,0) (*21*). We see that removing the center’s motion (inset) results in the reproduction of the same scaling pattern at each time, with additional small and slow moving deviations, ξ* _{i}*(

*t*), changing only slightly from year to year. Figure 2C shows the histogram of these deviations (gray) about the overall best fit scaling relation (Fig. 2B), pooled across all years. We observe that the statistical distribution is well localized and symmetrical about the origin, but that is not very well fit by a normal distribution (blue line). A better fit is provided by another model (dashed read line), which will be derived below. Last, in Fig. 2D, we see the change in ξ

*(*

_{i}*t*) over time for some of the most extreme trajectories. Specifically, some cities became substantially richer in relative terms over this period (Silicon Valley and Boulder), some experienced loss in economic status (Las Vegas, NV and Havasu, AZ), and a few others, such as New York, Los Angeles and McAllen, TX (one of the worst performers, by this measure), have not changed much. These trajectories also show how slow the change in the ξ’s is. Particular events that affect different cities at specific times are easily identifiable, such as the dot-com economic boom and bust around 2000, specifically for Silicon Valley and Boulder.

### Stochastic growth dynamics

We now seek to connect the macroscopic statistics of entire cities to a microscopic general model of single agent’s behavior. We note that the budget condition, which is taken as the starting point of an important set of urban theories (*1*–*4*), is a version of the fundamental law of energy conservation. Hence, it must apply to every single agent and to cities as collections of agents.

To see this, let us start by introducing a variable, *r*(*t*), denoting the accumulation of the net quantity of *y*(*t*) over time *t*. For example, if *y* stands for an agent’s income, then *r* becomes its monetary wealth, but we should think of *r* more generally as resources that can be grown over time and used in turn (reinvested) to generate more *y* and so on. An important noneconomic example, which applies to premodern human societies and other biological populations, is when *r* is stored energy and *y* is an energy income per unit time. We write the dynamics of *r*, given *y*, as* _{r}* is the stochastic growth rate of resources. The first equality in Eq. 4 is pure accounting, stating that resources grow by the difference between income and costs,

*c*, (i.e., net income) over some time interval,

*dt*. Costs in cities are local and include real estate rents, transportation, and consumption, as well as others, such as health care and losses resulting from crime or poor urban services. In this sense, costs and benefits are also affected by migration decisions about where to live and work. The centerpiece of this equation is the difference between income and costs

*y*(

*t*) −

*c*(

*t*), which must be balanced by all agents in their specific environments. For urban agents, this difference is the “budget condition” for the spatial equilibrium that defines a city according to the Alonso model of economic geography and urban scaling theory (

*1*,

*3*,

*4*). In the original version, this difference is typically set to zero, although the meaning of incomes and costs is rather flexible and can include savings (

*23*). In urban scaling theory, this difference is nonzero in general (

*4*) and becomes the target of maximization through the self-consistency of infrastructural and social network properties. This implies that a positive difference between incomes and transportation costs is necessary for cities to exist (Fig. 2A, inset) and to generate exponential resource growth. It also implies that the scaling of resources, incomes, and costs has the same population size dependence characterized by a single common exponent for all these quantities, β > 1.

The second equality in Eq. 4 is a definition of the growth rate η* _{r}* implying that

*24*), the standard model of financial mathematics and asset pricing (

*25*,

*26*), and the one good wealth accumulation model, which is the basic tool in economics to analyze dynamical issues of wealth inequality (

*23*). Stochastic proportional growth and resulting lognormal (and power law) distributions are associated with many forms of human behavior, including the statistics for the time to complete a task, epidemic dynamics, demography and even the statistics of marriage age [see (

*27*) for a review].

Let us see how this model works in practice. Because the equation is nonlinear (the stochastic term is multiplicative), we have to be careful and use the rules of stochastic (Itô) calculus to integrate its solution in time, leading to* _{r}*, respectively, in the usual sense of those obtained over the probability density of η

*. Now, let us define the average effective growth rate*

_{r}^{−1}. Thus, the standard deviation (SD) σ

*(known as the volatility) has dimensions of (time)*

_{r}^{−1/2}. The stochastic noise Θ(

*t*) is the sum over the integration time,

*t*, or more explicitly,

*t*) to express universal behavior as a consequence of the central limit theorem (

*25*). In the simplest case, where η

*is statistically independent across time with finite variance, we obtain that Θ = σ*

_{r}*(*

_{r}W*t*) is a Wiener process, which is a normal variable with zero mean and variance

A number of key general results follow from this solution and associated limiting theorems. First, (i) the central limit of Θ implies that * _{r}t* and variance

*r*(

*t*) is asymptotically distributed as a lognormal variable, a result that will become important later. In turn, (iii) the temporal mean growth rate

*t*

^{1/2}σ

*. Last, (iv) the characteristic time,*

_{r}These properties are illustrated in Fig. 3 (A to C), obtained from numerical simulations of Eq. 4, with η* _{r}* taken as Gaussian white noise. The asymptotic behavior of all quantities depends on whether the effective growth rate is positive γ

*> 0 or equivalently*

_{r}*r*becomes apparent on a time scale longer than

*t*

_{*}(Fig. 3A), which can be very short when the volatility is small. In this regime, the distribution narrows on the scale of the mean as

*t*becomes larger, and predictable exponential growth emerges. However, when

*r*(

*t*) decays toward zero while experiencing large fluctuations (Fig. 3A, red and purple trajectories). When

*≥ 0,*

_{r}*r*displays wide fluctuations (asymptotically distributed as a lognormal; Fig. 3B, inset), which appear wider and wider for larger

*t*(Fig. 3B). Multiplicative growth processes thus have the curious property that because drift is asymmetric, a positive mean growth rate

*t*

_{*}goes to infinity (Fig. 3C, inset) and will struggle to tell whether growth persists and estimate its time scale to plan. As a consequence, low volatility and positive average rates are necessary for sustained growth (Fig. 3C). Given these results for individual agents, it becomes critical to establish the conditions for these dynamics to apply also for populations such as for entire cities (Fig. 3D) and to determine how corresponding growth rates change across scales.

### Stable growth through budget control

From the general properties of stochastic growth processes, we can conclude that any agent seeking growth must aim at a positive mean growth rate and small volatility. The conundrum is that the volatility and the mean growth rate are, to a large extent, properties of the environment, outside the agent’s control. What is under the agent’s control, however, are his/her own actions, which we show next can adapt to extrinsic circumstances via processes local in time so as to produce low volatility and stable growth.

It is important to realize that, besides levels of population aggregation, there is also a hierarchy of time scales involved in the process of balancing costs and benefits and observing growth (Fig. 3D). Over the very short term, there will be moments when the agent is resource flow negative, e.g., while shopping. However, judicious choices over time should result in more even positive net flow over the longer term, integrating together periods when incomes are larger than costs (at work) and vice versa (at home, socializing, etc.). This process of balancing costs and benefits over time is necessary in dissipative complex systems because there are always resources lost in any activity or exchange. Balancing costs and benefits over time creates strong correlations between *y*, *c*, and *r* and results on ratios, *y*/*r* and *c*/*r*, that can become independent of the level of wealth, as we show next.

Consider the basic accounting (Eq. 4) for a single agent, *y*(*t*) − *c*(*t*) = η* _{r}*(

*t*)

*r*(

*t*). As we have seen, dividing by

*r*(

*t*) > 0 gives us the definition of the growth rate η

*(*

_{r}*t*). Defining the two resulting ratios as

*b*(

*t*) ≡

*y*(

*t*)/

*r*(

*t*) and

*a*(

*t*) ≡

*c*(

*t*)/

*r*(

*t*) and averaging over time leads to

This means, in general, that we can also define * _{r}*(

*t*) is the error (or fluctuations) away from the growth rate’s temporal mean such that

*t*) in the previous section.

What kind of process sets the statistical properties of these fluctuations? On a short-term basis, fluctuations will be large if *a*, *b* vary strongly and independently of each other. Then, the amplitude of ε* _{r}* will be large over some period of time and, if negative, may deplete stored resources (

*r*→ 0), placing the agent at risk of death or bankruptcy. Thus, it is in the vital self-interest of the agent to act so as to minimize, or at least control, fluctuations.

How is this to be achieved? The point is that the variations in expenditures, *a*(*t*), should not just be seen as passive costs but rather as strategic dynamical investments under the agent’s control. Conversely, the returns on this investment, *b*(*t*), are stochastic and will always fluctuate because of environmental factors (Fig. 4A). Thus, *a*(*t*) should be chosen to generate a target growth rate and reduce fluctuations, in other words, to achieve stable and predictable growth (Fig. 4, B and C).

To demonstrate how this can be achieved, we write the returns as *v*(*t*) are (stochastic) variations in returns, whereas *u*(*t*) will play a role of a control variable adjusted by the agent. This leads to

We must now specify how control is implemented to tame the errors. Most general practical controllers are in the Proportional-Integral-Derivative (PID) class (*28*), which specifies *u*(*t*) as a function of the error, ε* _{r}*(

*t*), as

*k*,

_{P}*k*, and

_{I}*k*are constants (in time) to be chosen by the agent. These three terms allow for different kinds of strategy to reduce fluctuations:

_{D}*k*is the magnitude of an instantaneous response against the fluctuation,

_{P}*k*refers to averaging of the error over time (known as smoothing, because averaged errors are smaller and converge to zero), and

_{I}*k*describes a corrective reaction in the direction of the temporal change in the error. Of these, only smoothing by time integration will prove essential. Note also that

_{D}*u*(

*t*) is a simple quantity that can be updated locally in time via the current observed error, ε

*(*

_{r}*t*), and its addition and subtraction to the integral and difference, which requires remembering only two numbers. The stochastic dynamics of the errors is best captured via the derivative of Eq. 8

This equation for the error describes a simple driven oscillator: It is familiar from stochastic calculus when we take ^{2}. The solution is provided in section S2, showing that ε* _{r}* converges to a normal distribution with zero mean and variance

*k*larger has the double effect of accelerating the temporal convergence to a time-independent distribution and narrowing the error variance. The other parameters,

_{I}*k*and

_{P}*k*, can be set to zero, leading to very simple control based on the temporal averaging of the fluctuations. The effect of the environmental variance Ω

_{D}^{2}is simply to increase the error variance proportionally. Thus, the control process effectively filters out environmental shocks and makes the net-income variance smaller as a function of parameters chosen by the agent. This is a very simple general mechanism that allows agents to cope with environmental uncertainty and generate stable growth by adjusting their expenditures over time. Much more sophisticated strategies are possible that can maximize growth rates if more of the structure of returns,

*b*, are known (

*28*).

We see how averaging expenditures over time (known as consumption smoothing in economics) gives a general mechanism whereby agents can make their average resource growth rate take on a target value, up to stochastic fluctuations with variances given by the balance between the unpredictability of the environment and the quality of their control. Effective control generates strong statistical correlations between income and costs over time, which constitute the basis for (a spatial) equilibrium. In this light, variations between agents may persist as the result of differences in their specific experienced environments and/or the quality of their control. Exposing these issues requires the consideration of averages over populations of agents as in Fig. 3D, to which we now turn.

### Population dynamics and emerging inequality

To compute the growth dynamics for a city, we now define the averages over a population of size *G*, *r _{j}* are individual

*j*’s resources and so on for growth rates, incomes, and costs (see section S6 for a summary of notation). To derive the corresponding dynamics, we take these averages over Eq. 4

*r*subscripts on the rate, for simplicity, so that in this section, η

*→ η*

_{rG}*. The average of the product is*

_{G}The quantity *r _{G}*. This quantity equals the simple arithmetic group average, η

*, plus a correction due to the fact that growth rate variations may not be statistically independent from variations in resources across individuals. The covariance term is familiar from evolutionary theory in the context of the Price equation (*

_{G}*16*) and signals selection. For example, if richer individuals experience higher growth rates across the group, then the average growth rate will be higher and vice versa. This flags the important issue that pursuing the highest possible group-level growth rates in a heterogeneous population will increase inequality. Conversely, pursuing growth such that poorer individuals enjoy higher rates leads to more equitable outcomes in distribution but subtracts from the average η

*because the covariance is negative.*

_{G}To complete the derivation, we now characterize the mean and stochastic components of * _{G}* is a stochastic noise term resulting from the group average of the errors for each individual. The properties of ε

*are inherited from those of each agent and their statistical correlations. The mean remains zero, while the variance is given by*

_{G}*and σ*

_{j}*are the volatilities for agents*

_{k}*j*and

*k*and ρ

*is the correlation matrix between them. (The correlation matrix is symmetric, with −1 ≤ ρ*

_{jk}*≤ 1 and with ones along the diagonal, corresponding to each agent’s squared volatilities).*

_{jk}In the simplest case, when errors are statistically independent across agents, ρ* _{jk}* = 0 for

*k*≠

*j*, and if all SD are the same,

*G*limit. Thus, if errors are independent across individuals, both long times and large-population pooling leads to a convergence to the behavior set by the temporal means. This, curiously, implies that the group average grows faster than the agent’s temporal average in general and provides a strong quantitative argument for pooling resources either via government action or risk management instruments, such as insurance (

*26*).

The case of nonindependent variables is interesting because the treatment of the last section suggests that it would follow from different agents either experiencing correlated fluctuations and/or generating coordinated “institutional” control responses, which is likely in many circumstances. When all variables are fully correlated ρ* _{jk}* = 1 and

*r*becomes independent of group size. In urban settings, we may expect some correlation between agents as they experience a common spatial and socioeconomic environment of the city. For U.S. MSAs,

_{G}*G*(see fig. S3).

The covariance term between individual growth rates and resources adds additional corrections

With these results in hand, we can now write the time evolution of average group resources as

We see that the statistical behavior of *r _{G}* is set by the dependencies of these quantities (see section S3 for discussion). When

*r*(but may depend on

_{G}*G*and

*t*) and

*r*will follow a multiplicative random growth process (Fig. 3D). This process, similar to Eq. 4, will then integrate to give

_{G}*W*(

*t*) converges to a normal variable as the result of the central limit theorem, then the statistics of

*r*(

_{G}*t*) become lognormal at long times (see section S3 for an example and further discussion of necessary conditions, exceptions, and related results) (

*29*). It is important to stress that growth rates and volatilities now “run” (i.e., change) with group sizes,

*G*, and time,

*t*, depending on the correlations captured by the several covariance terms (see Fig. 3D).

### From stochastic growth to statistical scaling relations

We are now ready to express the quantities in scaling relations as functions of stochastic growth rates. This will provide us with a statistical theory that derives urban scaling beyond mean-field calculations (*4*). To keep the notation simple, the index *i* denotes cities. We take each city to be a group with *G* = *N _{i}* and write the simplified notation

*〉 (see section S6 for a summary of notation).*

_{i}*Running scaling exponents and the emergence of scale invariance*. Let us see when a power law scaling relation is a conserved quantity of the stochastic growth dynamics. We start with the integral trajectory for total resources, *R _{i}*(

*t*),

*30*)

This property of long-time means specifies necessary conditions for scaling to hold over time. To see this, define *B _{i}*

^{(0)}is independent of time and scale.

*B*varies slowly with ln

_{i}*N*so that

_{i}*B*is analogous to a beta function expressing the change (“running”) of a coupling with scale in statistical physics (

_{i}*15*). Replacing it into Eq. 15 obtains

*N*and time. Under time averaging and control, it is natural for

_{i}To see this, consider that the volatility *N*. In this case, only at sufficiently large *4*). This is not an issue if *31*, *32*). It also implies correlations between the behavior of the prefactor, *Y*_{0}(*t*), the ξ variance, and the scaling exponent β, as noted recently in (*33*).

These results show that strict scaling invariance is predicated on *34*) applied to the population growth rate. Away from this fixed point, we have now shown how to compute corrections to scaling exponents, which are the result of the scale-dependent statistics of growth rates. Last, note that the scale independence of growth rates for cities is a standard assumption known as Gibrat’s law (or law of proportional growth) (*24*). This assumption is necessary to derive Zipf’s law for the statistics of city sizes. Figure S3 illustrates this general analysis with the growth rates and variances for wages in U.S. metropolitan areas since 1969, showing that the effective growth rates are city size independent to an excellent approximation, justifying the observed persistence of scaling with a time-invariant exponent.

*Equations of motion for prefactors and scaling residuals*. We now translate stochastic growth into equations of motion for both scaling prefactors and residuals; details of the derivations are given in section S4. For the prefactors, we obtain

For the residuals, we obtain* ^{r}*)

^{2}〉. The two other terms enforce the convergence to the population averages in terms of growth rates of resources and population and guarantee that 〈ξ

*〉 = 0 is preserved by the urban system’s growth dynamics.*

^{r}*The emergent statistics of urban indicators*. The statistics of resources follows from integrating Eq. 19, leading to the general expectation that the statistics of the ξ* ^{r}* become normal at long times (see section S4). This means, in turn, that cumulative urban indicators (stocks) are expected by the same argument to be lognormal, as we saw more directly above. Flow quantities, such as income or costs, are often more accessible empirically (Fig. 2). Their statistics follow from the analysis of the previous sections, where we wrote

*R*,

_{i}*Y*, this implies that

_{i}*Y*

_{0}− ln

*R*

_{0}= 〈ln

*b*〉, which allows us to write

This shows that the statistics of income are set by two different processes, the first resulting from the statistics of associated resources and the second due to stochastic returns. The first piece is characterized by the accumulation of variations over time, which entails time averaging and is expected to become approximately normal. The second term is instantaneous and consequently not subject to limit theorems. Hence, it can have more arbitrary statistics.

To see this, we return to the analysis of stochastic returns *b _{i}* under agents’ adaptive control to obtain the explicit time evolution equation

*dv*/

_{i}*dt*was taken here to be white noise

*dW*(the differential of the Wigner process

_{i}*W*) with variance

_{t}*dv*/

_{i}*dt*has nonrandom components, the expression is similar but more complicated. Here,

*dW*is the average stochastic force over cities, and we assumed that fluctuations are uncorrelated to population variations in Ω and

*t*, inset) to become negligible for long times.

Note that in the limit of strong control, at the individual level and/or as an emergent average within cities when the * _{i}*. In addition, this derivation leads to a set of quantitative expectations that can be checked against the data: Figure 5A shows that the quantity

*t*

^{2}〈Δ

^{2}〉 ∼ σ

^{2}

*t*behaves approximately like the displacement of a one-dimensional (1D) random walk. This is well described by the straight line in time with slope given by the variance, although empirically we also observe shorter periods of acceleration or deceleration relative to the main trend. Figure 5B shows an analogous picture depicting each SAMI, ξ

*, trajectory, starting all cities at ξ*

_{i}*= 0 in 1969. This demonstrates the spread of the SAMIs over time according to the behavior of a 1D random walk (red line, the same as straight line in Fig. 5A). Figure 5C shows the volatility and mean growth rates for all cities over the 47 years and corresponding estimates from measurements of dispersion over time (Fig. 5, A and B) and over the ensemble of cities: The observed statistical agreement of these two strategies for measuring the square volatility demonstrate the ergodicity of the statistics of ξ*

_{i}*(*

_{i}*t*) once drift has been removed. Last, Fig. 5 (A and D) shows that the income residuals’ variance is actually time dependent, spreading very slowly over time as predicted by the derived lognormal part of the distribution. The overall distribution is better described, however, by the sum of two Gaussians, one broad and one narrow, corresponding to the two terms in Eq. 21 (red dashed line in Fig. 2C). It is only because the annual volatilities are so small that this temporal pooling and a deduction of a pure lognormal behavior appeared reasonable for flow variables in earlier work (

*22*,

*35*).

## DISCUSSION

We showed how quantitative urban theory can be taken beyond a stationary approach based on an average budget constraint, characteristic of spatial equilibrium. In its place, we proposed the primacy of stochastic growth processes and agents’ strategic behavior as the dynamical statistical theory from which more particular results follow (Fig. 1). This provides a common foundation for nonequilibrium modeling of cities across scales (*17*, *18*) and shows how these processes are associated with urban scaling and agglomeration effects. From this point of view, we see how the budget condition of spatial equilibrium models becomes the emergent property of a much more fundamental process, whereby agents subject to stochastic resource flows (incomes and costs) must develop adaptive strategies to reduce potentially fatal volatility. This point of departure is both necessary for dissipative complex systems and is very general so that it offers a number of connections with the statistical dynamics of other natural and engineered systems (*28*, *36*, *37*).

The key advantage of this bottom-up stochastic approach is that it naturally unifies processes of resource flow management (“equilibrium”), growth, and statistics. Hence, the framework emphasizes the critical role played by growth rate variations in a number of important urban phenomena. Specifically, we showed how the properties of the growth rate volatility are implicated in the (non-)preservation of urban scale invariance and set the boundary between growth and decay regimes, including the time scale for exponential growth to become manifest as fluctuations average out. In particular, we demonstrated how the general property of ergodicity in population dynamics and formal demography (*30*) is intimately connected, together with a renormalization fixed point condition on the growth rates, to the emergence of “mean-field” scaling relations (*4*).

There are a number of important consequences for urban theory that these results clarify and unify. First, they show how spatial equilibrium is, after all, consistent with observed exponential growth in cities both economic and demographic, which has been an assumption in previous models. Second, they show how to derive macroscopic statistical behavior for cities and urban systems from microscopic strategic choices at the agents’ level and provide expressions for how to aggregate growth rates over time and populations. Third, this process exposes issues of inequality of wealth and income and how they are compounded over time (*23*). Specifically, the quantities discussed here show how policies aimed at maximizing aggregate economic growth may naturally deemphasize the relative growth of poorest sections of the population. Last, and in many ways the central motivation of the paper, the results derived here demonstrate that the statistics of most urban indicators are not universal in a simple sense. Rather, they are emergent as the consequence of limit theorems under stochastic (exponential) growth. In particular, the statistics of urban indicators that account for incomes and costs are the result of a mixture of a more universal component, inherited from their association to accumulating quantities and corresponding limit theorems, and a nonuniversal part, arising from the short term “hustle” (accidents and the quality of the agent’s control) in variable stochastic environments. Thus, statistical tests to evaluate the Gaussianity of urban (log-)quantities (*22*, *35*, *38*), to be meaningful, must be performed with care and explicitly acknowledge the distinct distributions of different urban indicators.

The models for the budget constraint, the growth of resources, and associated control strategies introduced here are standard starting points in demography, geography, economics, and finance (*23*–*25*). They can clearly be made more complex and, where necessary, also more realistic. The concept of resources and incomes is not 1D. Issues of energy, monetary wealth, knowledge, and social capital all contribute to resource growth in human populations. The extent to which these quantities, which can all be accumulated, interact with each other is critical for a general understanding of human development. Models for the dynamics of volatilities and more sophisticated control of fluctuations and maximization of growth rates may also become important. Some inspiration should be derived from population biology and mathematical finance (*16*, *25*), where such models are more developed. Last, data on detailed expenditures, wealth, and other financial and social characteristics are becoming increasingly available for households at finer temporal resolutions (*14*) and will be critical to test and improve the ideas introduced here and to identify systematic heterogeneities in agents’ behavior, e.g., associated with conditions of poverty and uncertainty.

The approach developed here can be applied to other contexts beyond the contemporary United States but requires appropriate contextualization. In all societies, household adaptive management of resources is likely to remain important. However, in more collectivist societies or in those with stronger top-down governance, the aggregate management of benefits and costs will replace, to a larger extent, bottom-up agency. As shown, managing aggregate costs at the societal level can achieve considerable benefits because this strategy minimizes some risk. Its success hinges, however, on the effective investment of resources that generate society-wide benefits and their redistribution and related inequality. In circumstances of low growth, such as in most preindustrial societies, adaptive control of resources and the associated dynamics of volatility provide us with an important window into their (in)stability. This allows us to connect more proximate explanations of collapse, e.g., related to environmental stresses or violence, to the broader collective and political dynamics of societies, expressed as the capacity to manage shocks or disintegrate instead.

Empirically, the U.S. urban system, at least in terms of changes in total wages in MSAs, turns out to be very well-behaved: Its growth volatilities are almost always very small, fluctuations converge to limiting statistics quickly, and scaling relations are conserved over time. However, our theoretical results show that these properties pertain only to quantities and systems of cities with small, population size–independent growth rate volatilities. In the United States, over the past nearly 50 years, despite a number of notable events, observed average square volatilities associated with wages and population growth are about one order of magnitude smaller than average growth rates, making their effects almost negligible. It will be interesting in the future to investigate other urban systems and quantities characterized by larger volatility, such as crime or innovation (*22*, *33*, *35*), for which the present framework makes a number of testable predictions.

The flip side of the observed constancy and stability of growth rates in American cities is that extant wage disparities become very slow to reverse. The typical square displacement in ξ over nearly five decades (Fig. 5A; ∼σ^{2}*t*) is just 0.054. Assuming a similar rate of change in the past means that the observed variance in deviations from scaling at the beginning of our dataset (in 1969, about 0.043) would have been the product of the previous 40 years, taking us back to the time of the roaring 1920s and the subsequent great depression. Thus, the answer to the question at the beginning of this paper about predicting the magnitude of deviations from scaling in any given year is now recast not so much in terms of parameters of stationary statistics (*33*). Rather, this variance is the result of accounting for the accumulation of much smaller accidents and variations that make up the stochastic history of cities, which compound short-term noisy growth under partial control of heterogeneous agents over entire urban areas and long-time periods of decades (*22*). This is the quantitative sense in which history matters for cities, and their development becomes path dependent (*18*, *39*, *40*).

## SUPPLEMENTARY MATERIALS

Supplementary material for this article is available at http://advances.sciencemag.org/cgi/content/full/6/34/eaat8812/DC1

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is **not** for commercial advantage and provided the original work is properly cited.

## REFERENCES AND NOTES

**Acknowledgments:**I thank J. Lobo and S. Ortman, A. Gomez-Lievano, and O. Peters for discussions.

**Funding:**This research was partially supported by the Mansueto Institute for Urban Innovation at the University of Chicago.

**Author contributions:**L.M.A.B. conceived the study, developed theory, analyzed data, performed numerical simulations, and wrote the manuscript.

**Competing interests:**The author declares that he has no competing interests.

**Data and materials availability:**All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. The data and python code used in the analyses of the paper and generation of each figure are available at https://github.com/mansueto-institute/Urban-Growth-Emergent-Statistics. A continuously updated version of the data is available from the U.S. Bureau of Economic Analysis website as Table CA30 Economic Profile: Wages and Salaries (www.bea.gov/regional/downloadzip.cfm). Additional data related to this paper may be requested from the author.

- Copyright © 2020 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).