
Studying unified model averaging estimation for situations with complicated data structures, we propose a novel model averaging method based on cross-validation (MACV). MACV unifies a large class of new and existing model averaging estimators and covers a very general class of loss functions. Furthermore, to reduce the computational burden caused by the conventional leave-subject/one-out cross-validation, we propose a SEcond-order-Approximated Leave-one/subject-out (SEAL) cross-validation, which largely improves the computation efficiency. In the context of nonindependent and non-identically distributed random variables, we establish the unified theory for analyzing the asymptotic behaviors of the proposed MACV and SEAL methods, where the number of candidate models is allowed to diverge with sample size. To demonstrate the breadth of the proposed methodology, we exemplify four optimal model averaging estimators under four important situations, that is, longitudinal data with discrete responses, within-cluster correlation structure modeling, conditional prediction in spatial data, and quantile regression with a potential correlation structure. We conduct extensive simulation studies and analyze real-data examples to illustrate the advantages of the proposed methods. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.
Publication:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION2025, VOL. 00, NO. 0, 1–12: Theory and Methods
https://doi.org/10.1080/01621459.2025.2487215
Author:
Dalei Yua
Department of Statistics and Data Science, School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, China
Xinyu Zhang
IIF, School of Management,University of Science and Technology of China, Hefei, China
SKLMS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences,Beijing, China
Hua Liang
Department of Statistics, George Washington University, Washington, DC
附件下载: