Periodic Reporting for period 1 - NETWORK (A coherent approach to analysing heterogeneity in network data)
Période du rapport: 2023-01-01 au 2025-06-30
The aim of NETWORK is to provide a coherent approach to the formulation of models for network interaction in the presence of unobserved heterogeneity, to present identification results, and to propose computationally convenient estimators based on them.
In a second paper we have derived an identification result for general dyadic data under the assumption that they are jointly exchangeable and dissociated. In this case they admit a non-separable specification with two-way unobserved heterogeneity. We provide conditions under which both the distribution of the observed random variables conditional on the unit-specific heterogeneity and the distribution of the unit-specific heterogeneity itself are uniquely recoverable from knowledge of the joint marginal distribution of the observable random variables alone without imposing parametric restrictions.
In a third paper we look at a general class of problems with unobserved heterogeneity. A popular approach to perform inference in the presence of nuisance parameters is to construct estimating equations that are orthogonal to the nuisance parameters, in the sense that their expected first derivative is zero. Such first-order orthogonalization may, however, not suffice when the nuisance parameters are very imprecisely estimated. Leading examples where this is the case are models for panel and network data that feature fixed effects. We show how, in the conditional-likelihood setting, estimating equations can be constructed that are orthogonal to any chosen order. Combining these equations with sample splitting yields higher-order bias-corrected estimators. In an empirical application we apply our method to a fixed-effect model of team production and obtain estimates of complementarity in production and impacts of counterfactual re-allocations.
In a fourth paper we look at the judge-leniency design, which is very popular in causal inference. Evaluating whether conventional inference procedures apply to it is not immediate. We frame such a design as an inference problem from grouped data in a setting with a growing number of groups and limited variation between groups. Such an approximation is well suited for the data sets encountered in practice. The two-stage least-squares estimator should never be used. The jackknife instrumental-variable estimator presents a reliable tool for inference, provided that a non-standard variance estimator is used. Conventional decision rules to gauge instrument strength should not be used. An alternative such decision rule is provided in this context and evaluated.
In a fifth paper we explore the possibility of the bootstrap to provide correct inference in longitudinal data when dynamics and fixed effects are present. The paper shows that the bias arising from unrestricted feedback can be correctly replicated by a panel data version of the moving block bootstrap.
In a sixth paper we consider the problem of nonparametrically identifying models for worker/firm data in the presence of both unobserved worker and firm heterogeneity. This is a first-order concern. Indeed, this type of model is a workhorse tool in the earnings literature but identification results in the presence of two-sided unobserved heterogeneity are non-existent. We provide such results for several model specifications.
One interesting aspect of the derived results is that they are useful not only in settings where the data consists of a single large network, but also where one has access to data on many small networks. In such a case, alternative approaches based on say spectral clustering or brute-force fixed-effect estimation are not viable options. Given the prevalence of stochastic block models in applied work these results should be of much use.
We have also derived the first known results on the nonparametric identifiability of models for matched data (usually worker/firm) with two-sided random effects.
NETWORK has also produced a novel way to deal with estimation noise caused by many imprecisely estimated nuisance parameters in inference on a target parameter of interest. In the context of the project the chief application lies in models for network data with node-specific heterogeneity, but the approach is general and contributed to the large literature on inference in the presence of nuisance parameters. Indeed, the project has generalized the classic notion of Neyman orthogonality and, importantly, provides a way to achieve it in general likelihood-based models. Such a result is new and is expected to generate much additional research in the future.