Periodic Reporting for period 1 - FORECASTING (Forecasting with large datasets: A time varying covariance matrix)
Reporting period: 2016-09-06 to 2018-09-05
Time variation in economic relationships has been largely studied in economics. It can be seen either as abrupt shifts in the assumed generating mechanisms of the variables, or as smooth stochastic or deterministic changes in that. Either way, it can be considered as the result of altering forces such as institutional switching, economic transitions, preference fluctuations, policy transformations or technological changes, inter alia. All these can imply instabilities in the assumed economic relationships.
Large datasets are, nowadays, a key characteristic of human development (e.g. computers, being in the middle of most economic transactions generate huge amounts of data that can be analyzed to extract critical information). This is relevant for answering economic policy questions or a key to various scientific discoveries. In large datasets, conventional statistical and econometric techniques such as sample covariance estimation or regression coefficient estimation fail to work consistently due to the dimensionality of the estimation object. For instance, in a linear economic relationship we frequently obtain T observations of a dependent variable (y) as a function of many potential predictors (p predictors). When the number of predictors p is large or larger than the temporal dimension T, then a regression with all available covariates becomes extremely problematic if not impossible. Analogously, when our aim is to estimate the large covariance matrix of the p predictors, the sample estimate becomes heavily unreliable. It is also, particularly, computationally demanding since the dimension of the estimated object rises as a square of the dimension of the dataset under analysis. The current literature provides some novel answers but only when we assume a fixed, across time, covariance matrix, of the true data generating mechanism.
These two aspects of the observed datasets are important characteristics of the reality and failure to provide a framework that can accommodate these, simultaneously, will certainly result to unreliable scientific discoveries. In economics, this implies that the developed models will be insufficient to capture important characteristics of the economy, delivering false or unsuccessful policy suggestions.
We provide a unified framework that can accommodate these aspects of real datasets, with nice theoretical properties. To this end, the large dimensional econometrics literature, is combined with the non parametric estimation literature, in an innovative fashion, and novel methodologies on large covariance matrix and large dimensional regression, are proposed. As it is shown, our methods imply significant improvements, in a wide range of applications and metrics, over the relevant methodologies that currently dominate the literature.
All results were well disseminated across the academic community, in Europe and the USA. Presentations, accompanied with short research visits to relevant academic institutions, such as the University of Southern California (USA), King's College (UK), Queen Mary University of London (UK), Universitat Pompeu Fabra (Spain), Athens University of Economics and Business (Greece) and Bank of Greece members, aim to inform the interested scholars and accelerate further the research agenda, increase to the highest level the networking opportunities, which constitute significant dissemination activities to targeted groups of Econometricians. The results were also presented in the University of Cyprus, within the brown bag seminar series, where the widely established, local academic community had the chance to understand deeper the theoretical problems that arise within the context of large dimensional econometric theory and notice the strong theoretical and empirical contributions of the project. The proposed methodologies were accepted with increased enthusiasm, while their comments and discussions helped to better exploit this research area.