I am a PhD Candidate in the Economics Department at Northwestern University, and a job market candidate for the 2025/26 season. I am an econometrician also working on empirical development economics, with a focus on causal inference and machine learning methods. My research develops new econometric methods motivated by empirical challenges in development economics, and I use theoretical insights and contextual knowledge to identify and answer questions in my applied work.

Job Market Paper

A Central Limit Theorem for Split-Sample Estimators

As artificial intelligence grows in popularity, an increasing number of researchers, policymakers and industry practitioners are tasked with using the same dataset to both train a new model and evaluate some of its properties, such as accuracy and fairness. Conducting each task with separate random splits of the data facilitates inference, but comes at the cost of using fewer data at each step. Averaging over multiple splits, such as with cross-fitting, uses more data for each task and leads to better reproducibility properties, but inference is challenging due to the statistical dependence among splits. I address this challenge by proving a new Central Limit Theorem for a large class of split-sample estimators under arguably mild and general conditions. Importantly, I make no restriction on the model complexity, convergence rate or rate of algorithmic stability. I show that the dependence across splits is asymptotically negligible in many cases, document cases when it matters, and provide confidence intervals that explicitly account for the dependence structure. Moreover, I provide a measure of reproducibility for p-values obtained from split-sample estimators, and show its asymptotic validity. I apply my results to predicting poverty in Ghana and show that my confidence intervals are able to detect a subgroup at elevated risk of falling below the extreme poverty line, while previous methods do not. Finally, I revisit the problem of learning heterogeneous treatment effects in randomized experiments and develop an ensemble method that combines predictions across multiple predictors. In simulations and an empirical application to charitable giving, the ensemble method achieves greater power than previous approaches in detecting treatment effect heterogeneity.

Working Papers

  1. Predicting the Distribution of Treatment Effects via Covariate-Adjustment, with an Application to Microcredit [arxiv] [Presentation MEG 2024]
    Best Student Paper Award at the 32nd Midwest Econometrics Group Annual Conference (2024)

Publications

  1. Probabilistic Nearest Neighbors Classification [pdf] [R Package]
    (with Paulo C. Marques F. and Hedibert F. Lopes) Entropy, 2024, 26(1), 39.
  2. The Illusion of the Illusion of Sparsity: An exercise in prior sensitivity [pdf] [code]
    (with Hedibert F. Lopes) Brazilian Journal of Probability and Statistics, 2021, Vol. 35, No. 4, 699-720.

Work in Progress

  1. Algorithmic Bias in Microcredit: Consequences of Data-Driven Lending Practices
    (with Susan Athey, Dean Karlan, Adam Osman, and Jonathan Zinman)
  2. Is Participant Feedback Predictive of Impact?
    (with Gharad Bryan, Dean Karlan, Isabel Oñate, and Christopher Udry)
  3. What Can We Learn from Harmonizing and Analyzing RCTs of Grant and Training Programs to Promote Entrepreneurship?
    (with Florian de Bundel, Dean Karlan, William Parienté, and Christopher Udry)