TY - JOUR
T1 - Linear operator-based statistical analysis
T2 - A useful paradigm for big data
AU - Li, Bing
N1 - Funding Information:
I would like to thank two anonymous referees for their useful comments and suggestions, which helped me greatly in revising an earlier manuscript. This research is supported in part by a National Science Foundation grant.
Publisher Copyright:
© 2017 Statistical Society of Canada
PY - 2018/3
Y1 - 2018/3
N2 - In this article we lay out some basic structures, technical machineries, and key applications, of Linear Operator-Based Statistical Analysis, and organize them toward a unified paradigm. This paradigm can play an important role in analyzing big data due to the nature of linear operators: they process large number of functions in batches. The system accommodates at least four statistical settings: multivariate data analysis, functional data analysis, nonlinear multivariate data analysis via kernel learning, and nonlinear functional data analysis via kernel learning. We develop five linear operators within each statistical setting: the covariance operator, the correlation operator, the conditional covariance operator, the regression operator, and the partial correlation operator, which provide us with a powerful means to study the interconnections between random variables or random functions in a nonparametric and comprehensive way. We present a case study tracing the development of sufficient dimension reduction, and describe in detail how these linear operators play increasingly critical roles in its recent development. We also present a coordinate mapping method which can be systematically applied to implement these operators at the sample level.
AB - In this article we lay out some basic structures, technical machineries, and key applications, of Linear Operator-Based Statistical Analysis, and organize them toward a unified paradigm. This paradigm can play an important role in analyzing big data due to the nature of linear operators: they process large number of functions in batches. The system accommodates at least four statistical settings: multivariate data analysis, functional data analysis, nonlinear multivariate data analysis via kernel learning, and nonlinear functional data analysis via kernel learning. We develop five linear operators within each statistical setting: the covariance operator, the correlation operator, the conditional covariance operator, the regression operator, and the partial correlation operator, which provide us with a powerful means to study the interconnections between random variables or random functions in a nonparametric and comprehensive way. We present a case study tracing the development of sufficient dimension reduction, and describe in detail how these linear operators play increasingly critical roles in its recent development. We also present a coordinate mapping method which can be systematically applied to implement these operators at the sample level.
UR - http://www.scopus.com/inward/record.url?scp=85042164057&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85042164057&partnerID=8YFLogxK
U2 - 10.1002/cjs.11329
DO - 10.1002/cjs.11329
M3 - Article
AN - SCOPUS:85042164057
SN - 0319-5724
VL - 46
SP - 79
EP - 103
JO - Canadian Journal of Statistics
JF - Canadian Journal of Statistics
IS - 1
ER -