TY - JOUR
T1 - iFusion
T2 - Individualized Fusion Learning
AU - Shen, Jieli
AU - Liu, Regina Y.
AU - Xie, Min ge
N1 - Funding Information:
The authors gratefully acknowledge the support from the National Science Foundation through grant #DMS151348, #DMS 1737857, #IIS-1741390, and #DMS-1812048. They also thank Professor Lingsong Xue for sharing data. The first author acknowledges the generous graduate support from Rutgers University. The views and opinions expressed in this article are those of the authors and do not necessarily reflect the views of Deustche Bank.
Publisher Copyright:
© 2019 American Statistical Association.
PY - 2020/7/2
Y1 - 2020/7/2
N2 - Inferences from different data sources can often be fused together, a process referred to as “fusion learning,” to yield more powerful findings than those from individual data sources alone. Effective fusion learning approaches are in growing demand as increasing number of data sources have become easily available in this big data era. This article proposes a new fusion learning approach, called “iFusion,” for drawing efficient individualized inference by fusing learnings from relevant data sources. Specifically, iFusion (i) summarizes inferences from individual data sources as individual confidence distributions (CDs); (ii) forms a clique of individuals that bear relevance to the target individual and then combines the CDs from those relevant individuals; and, finally, (iii) draws inference for the target individual from the combined CD. In essence, iFusion strategically “borrows strength” from relevant individuals to enhance the efficiency of the target individual inference while preserving its validity. This article focuses on the setting where each individual study has a number of observations but its inference can be further improved by incorporating additional information from similar studies that is referred to as its clique. Under the setting, iFusion is shown to achieve oracle property under suitable conditions. It is also shown to be flexible and robust in handling heterogeneity arising from diverse data sources. The development is ideally suited for goal-directed applications. Computationally, iFusion is parallel in nature and scales up easily for big data. An efficient scalable algorithm is provided for implementation. Simulation studies and a real application in financial forecasting are presented. In effect, this article covers methodology, theory, computation, and application for individualized inference by iFusion.
AB - Inferences from different data sources can often be fused together, a process referred to as “fusion learning,” to yield more powerful findings than those from individual data sources alone. Effective fusion learning approaches are in growing demand as increasing number of data sources have become easily available in this big data era. This article proposes a new fusion learning approach, called “iFusion,” for drawing efficient individualized inference by fusing learnings from relevant data sources. Specifically, iFusion (i) summarizes inferences from individual data sources as individual confidence distributions (CDs); (ii) forms a clique of individuals that bear relevance to the target individual and then combines the CDs from those relevant individuals; and, finally, (iii) draws inference for the target individual from the combined CD. In essence, iFusion strategically “borrows strength” from relevant individuals to enhance the efficiency of the target individual inference while preserving its validity. This article focuses on the setting where each individual study has a number of observations but its inference can be further improved by incorporating additional information from similar studies that is referred to as its clique. Under the setting, iFusion is shown to achieve oracle property under suitable conditions. It is also shown to be flexible and robust in handling heterogeneity arising from diverse data sources. The development is ideally suited for goal-directed applications. Computationally, iFusion is parallel in nature and scales up easily for big data. An efficient scalable algorithm is provided for implementation. Simulation studies and a real application in financial forecasting are presented. In effect, this article covers methodology, theory, computation, and application for individualized inference by iFusion.
KW - Combining inferences
KW - Combining information
KW - Confidence distribution
KW - Fusion learning
KW - Individualized inference
KW - iFusion
UR - http://www.scopus.com/inward/record.url?scp=85074918438&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074918438&partnerID=8YFLogxK
U2 - 10.1080/01621459.2019.1672557
DO - 10.1080/01621459.2019.1672557
M3 - Article
AN - SCOPUS:85074918438
SN - 0162-1459
VL - 115
SP - 1251
EP - 1267
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 531
ER -