Vladislav Morozov, Andrea Sy
1. Problem Statement and Motivation
The central economic puzzle addressed is the systematic productivity advantage of firms located in dense urban areas compared to those in less dense regions. Two primary mechanisms are hypothesized to drive this:
- Agglomeration Economies: Dense environments inherently make firms more productive (shifting the entire productivity distribution).
- Firm Selection: Intense competition in dense areas culls less productive firms, truncating the left tail of the productivity distribution.
The Combes et al. (2012) Framework:
To disentangle these mechanisms, Combes et al. (2012) (hereafter CDGPR12) proposed a decomposition method based on a critical assumption: Total Factor Productivity (TFP) distributions between dense (Above-Median Density, AMD) and less dense (Below-Median Density, BMD) areas are identical up to three parameters:
- Location (μ): Reflects agglomeration.
- Scale (σ): Reflects agglomeration.
- Left-tail truncation (ξ): Reflects selection.
The Gap:
While CDGPR12's framework is widely used for policy evaluation (e.g., tax incentives, infrastructure), the core assumption of distributional equality (up to these parameters) has never been directly empirically validated.
- Challenge 1: TFP is unobserved and must be estimated, introducing measurement noise.
- Challenge 2: Naive comparisons of noisy estimates can conflate true heterogeneity with estimation error, leading to invalid inference (Jochmans and Weidner, 2024).
- Consequence: If the assumption fails (e.g., if distributions differ in shape or higher moments), the CDGPR12 decomposition is biased, potentially misguiding policy regarding agglomeration vs. competition.
2. Data and Estimation Strategy
Data Source:
- Source: Spanish administrative firm-level data (Banco de España's CBI dataset).
- Period: 2000–2019.
- Coverage: ~80% of incorporated non-financial Spanish firms.
- Classification: Firms are categorized into AMD and BMD areas based on the "experienced density" measure (de la Roca and Puga, 2017).
TFP Estimation:
The authors follow the CDGPR12 two-step procedure:
- Production Function Estimation: For each sector s and area a, they estimate a Cobb-Douglas production function using the Ackerberg, Caves, and Frazer (2015) control function approach to handle simultaneity bias.
Vit=exp(θi)Kitβ1Litβ2exp(Uit+β0,t) - TFP Calculation: Firm-level TFP (θ^i) is calculated as the average residual of the estimated production function over time.
Sample Restriction (Crucial for Validity):
To ensure valid inference and control for estimation noise, the authors restrict the sample to firms observed for at least 15 periods.
- Economic Rationale: Aligns with the long-run theoretical framework where selection effects manifest; excludes recently entered, potentially unproductive firms not yet culled.
- Statistical Rationale: Reduces the variance of the TFP estimator (θ^), making the noise manageable for distributional testing.
3. Methodology: Testing Distribution Equality
The authors propose a novel nonparametric testing framework to validate the null hypothesis (H0) that TFP distributions in AMD and BMD areas differ only by location (μ) and scale (σ), excluding the need for a truncation parameter (ξ).
The Null Hypothesis:
H0:Fs,AMD(σs,AMDθ−μs,AMD)=Fs,BMD(σs,BMDθ−μs,BMD)∀θ∈R
Technical Innovations in Testing:
Standard Kolmogorov-Smirnov (KS) tests fail here because θ^i are noisy estimates. The authors employ a tailored two-sample KS test with two specific corrections:
- Half-Panel Jackknife (HPJ) Debiasing: Following Dhaene and Jochmans (2015) and Jochmans and Weidner (2024), this corrects the bias introduced by the noise in the TFP estimates.
- Firm-Level Bootstrap: Used to account for all sources of sampling variability and the uncertainty in estimating the mean and variance parameters.
This approach allows for a valid goodness-of-fit test even when the underlying variable (TFP) is measured with error.
4. Key Results
The study tests the hypothesis across 10 major sectors (Manufacturing, Construction, Wholesale/Retail, Transport, Hospitality, ICT, Real Estate, Professional Services, Admin Services, Arts/Entertainment).
Statistical Findings:
- For all sectors, the authors fail to reject the null hypothesis.
- The bootstrap p-values are uniformly high (ranging from 0.499 to 0.974), indicating no statistically significant difference in the shape of the distributions after standardizing for mean and variance.
- Visual inspection of the debiased Cumulative Distribution Functions (CDFs) confirms that the curves for AMD and BMD areas are nearly identical once location and scale are aligned.
Implication for Parameters:
- The distributions are fully aligned using only the mean (μ) and variance (σ).
- The left-tail truncation parameter (ξ) is not required. This contradicts the necessity of strong selection effects to explain urban productivity gaps.
5. Key Contributions
- Empirical Validation: Provides the first direct, nonparametric validation of the CDGPR12 distributional equality assumption using noisy data. It confirms that the assumption holds across diverse sectors in Spain.
- Methodological Advancement: Develops a robust framework for testing distributional equality with noisy estimates (combining HPJ debiasing and bootstrapping). This methodology is applicable to other fields involving unobserved heterogeneity with measurement error (e.g., worker skill distributions, mutual fund manager skills).
- Refinement of Mechanism: Demonstrates that the productivity advantage of cities is driven almost entirely by agglomeration economies (shifts in mean and variance) rather than selection (tail truncation).
6. Significance and Policy Implications
- Policy Focus: Since selection (culling weak firms) does not appear to be the primary driver of the productivity gap, policies aimed at increasing competition or reducing entry barriers may be less effective than previously thought for closing these gaps.
- Recommended Interventions: Policymakers should prioritize agglomeration-enhancing policies, such as:
- Improving local infrastructure.
- Fostering labor market thickness.
- Encouraging knowledge spillovers.
- Theoretical Confidence: The results validate the use of CDGPR12-style decompositions in future research and policy analysis, ensuring that interpretations of sorting patterns and place-based interventions are not confounded by invalid distributional assumptions.
In conclusion, the paper establishes that the "productivity advantage of cities" is a matter of scale and location shifts in the productivity distribution, not a matter of selection filtering out the bottom tail. This shifts the policy paradigm from competition-focused interventions to agglomeration-focused investments.
Drowning in papers in your field?
Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.
Get the best economics papers every week.
Trusted by researchers at Stanford, Cambridge, and the French Academy of Sciences.
Check your inbox to confirm your subscription.
Something went wrong. Try again?
No spam, unsubscribe anytime.
More like this
Root-n Asymptotically Normal Maximum Score Estimation
This paper proposes a smooth, strictly concave surrogate score function for binary choice models that overcomes the traditional maximum score method's limitations by achieving root-n convergence to a normal limiting distribution under specific primitive conditions, thereby enabling standard inference.
Waiting for Help: Timely Access to Psychological Support for Young Adults Exposed to Parental Substance Misuse
Using a randomized waitlist-controlled trial in Denmark, this study demonstrates that eliminating waiting times for psychological support yields significant and persistent short-term improvements in the mental health of young adults exposed to parental substance misuse, despite showing limited average effects on broader health or labor market outcomes.
Daycare Matching with Siblings: Social Implementation and Welfare Evaluation
Using Japanese daycare data, this paper develops an empirical framework that accounts for sibling complementarities to demonstrate that a 2024 reform prioritizing joint assignments significantly boosts welfare and reduces inequality, while revealing a distinct tradeoff where maximizing efficiency would reverse these equity gains.
On the Design of Stochastic Electricity Auctions
This paper proposes enhancing day-ahead electricity auctions by conditioning contracts on specific states of the world to address renewable energy uncertainty, utilizing microeconomic equilibrium theory to define optimal state partitions and demonstrating their application through a North Sea offshore wind case study.
Balanced Contributions in Networks and Games with Externalities
This paper introduces and characterizes the unique component-efficient Balanced Contributions (BCE) allocation rule for networks with externalities, overcoming existence challenges via a cycle-sum identity to bridge spanning-tree constructions with full network requirements, while distinguishing it from fairness-based rules and demonstrating its consistency with established values in specific game-theoretic contexts.