PANEDA | High-Dimensional Inference for Panel and Network Data

Summary
Improved data availability in Economics provides access to richer datasets with increased complexity. There is regularly a network aspect to the data, whenever outcomes are observed for matches of different economic units (e.g. households, individuals, firms, products, markets). Such observations include, e.g., wages for workers in firms, academic achievement for students taught by teachers in schools, and purchasing decisions for consumers in stores. The underlying network structure is often sparse, because we only observe a small subset of all possible matches, say between workers and firms. In addition, we aim to estimate models with many parameters, for example to control for and to estimate unobserved heterogeneity of economic units by including (e.g. worker and firm specific) fixed effects.

The combination of sparsity of the underlying network structure and a large number of parameters in the model creates challenging Econometric problems. In particular, there is a serious gap between empirical practice, where applied researchers regularly use such sparse network datasets, and the theoretical justifications for those inference methods that are based on classic data structures (cross-sectional, time-series, and panel data) that do not account for the sparsity aspect of the data.

The goal of this research project is to develop robust inference methods for such sparse panel and network datasets. This requires to establish a mathematical representation of the network that allows to formalize asymptotic inference results for sequences of growing networks. Subsequently, new bias correction and robust standard error estimation methods will be developed that account for the sparsity structure of the data. I will also advance more parsimonious modeling and estimation approaches (e.g. grouped heterogeneity or empirical Bayes) for situations where the data are otherwise uninformative for the parameters of interest.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/819086
Start date: 01-08-2019
End date: 31-07-2025
Total budget - Public funding: 1 478 831,00 Euro - 1 478 831,00 Euro
Cordis data

Original description

Improved data availability in Economics provides access to richer datasets with increased complexity. There is regularly a network aspect to the data, whenever outcomes are observed for matches of different economic units (e.g. households, individuals, firms, products, markets). Such observations include, e.g., wages for workers in firms, academic achievement for students taught by teachers in schools, and purchasing decisions for consumers in stores. The underlying network structure is often sparse, because we only observe a small subset of all possible matches, say between workers and firms. In addition, we aim to estimate models with many parameters, for example to control for and to estimate unobserved heterogeneity of economic units by including (e.g. worker and firm specific) fixed effects.

The combination of sparsity of the underlying network structure and a large number of parameters in the model creates challenging Econometric problems. In particular, there is a serious gap between empirical practice, where applied researchers regularly use such sparse network datasets, and the theoretical justifications for those inference methods that are based on classic data structures (cross-sectional, time-series, and panel data) that do not account for the sparsity aspect of the data.

The goal of this research project is to develop robust inference methods for such sparse panel and network datasets. This requires to establish a mathematical representation of the network that allows to formalize asymptotic inference results for sequences of growing networks. Subsequently, new bias correction and robust standard error estimation methods will be developed that account for the sparsity structure of the data. I will also advance more parsimonious modeling and estimation approaches (e.g. grouped heterogeneity or empirical Bayes) for situations where the data are otherwise uninformative for the parameters of interest.

Status

SIGNED

Call topic

ERC-2018-COG

Update Date

27-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2018
ERC-2018-COG