Project Details
Description
The project is motivated by problems such as geo-political event prediction, crime data analysis, and modeling transportation and trading networks. Many data from these applications share three common and salient features: (i) they can be represented as tensors (multi-dimensional arrays), (ii) they are generated over time and exhibit dynamic relationship, and (iii) the values of the data are binary, counts, proportions etc. Such diverse data types, which are referred to as the non-Gaussian tensor time series, call urgently for the development of more adaptable analytical tools. The investigators introduce a general, flexible and efficient framework for the modeling, interpretation and prediction for such non-Gaussian tensor time series through dynamic factor models. The dynamic factor models contain an observation layer specified for the generation of the non-Gaussian observations, and a latent layer to account for the dynamic and concurrent dependence. They are capable of extracting dynamic information, enhancing comprehension of underlying mechanisms, and generating reliable forecasts, and are therefore poised to assist organizations and policymakers in making well-informed decisions. The project advances education through the research training of both undergraduate and graduate students, as well as its incorporation into special topic courses. The project is also committed to promoting diversity and inclusion in STEM fields, and actively seeks to recruit students from groups that are historically under-represented in science and engineering.A novel dynamic matrix factor model for non-Gaussian data marks a significant advancement in modeling large and complex dependent data. These models effectively tackle challenges arising from the size, complexity, and discreteness of the data. More specifically, the dynamic is introduced in the hidden layer through the factor structure with tensor Tucker or CP decompositions, and a nonlinear or generalized linear model (Poisson, negative binomial, Gamma, zero-inflated, etc) is employed in the observation layer for the generation of the observations. An autocovariance-based approach is used for the estimation of the loading matrices and vectors, which takes advantage of the temporal dependence to reduce the bias. A tensor autoregressive model is imposed on the factors to enable the forecasting. For the estimation of the autoregressive model, again an autocovariance-based procedure is used to mitigate the impact of the estimation error of factors. This approach's computational efficiency is particularly well suited for handling big and complex data. The methodology and theoretical analysis lay the groundwork for applying the moment method in broader models and applications. Furthermore, the methodology underscores the significance of the "blessing of dependence" phenomenon, demonstrating how the temporal dependence can be utilized to enhance/reduce the signal/noise to achieve more accurate estimation, compared to the corresponding works under the IID setting. It is noteworthy that the framework accommodates extensions to more general distributions, and the developed methodology has broad applications.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Status | Active |
---|---|
Effective start/end date | 8/1/24 → 7/31/27 |
Funding
- National Science Foundation: $300,001.00
Fingerprint
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.