Xinghao QIAO
Prof. Xinghao Qiao
Innovation and Information Management
Associate Professor
MAIB Deputy Programme Director and Admissions Tutor

3910 3109

KK 1340

Academic & Professional Qualification
  • PhD in Business Statistics, Marshall School of Business, University of Southern California
  • MS in Statistics, University of Chicago
  • BS in Mathematics and Physics, Tsinghua University
Biography

Xinghao Qiao is an Associate Professor in the area of Innovation and Information Management at the HKU Business School. Prior to HKU, he was an Associate Professor in the Department of Statistics at London School of Economics.

Teaching
  • Forecasting and Predictive Analytics (HKU)
  • Probability and Statistics for Business (HKU)
  • Machine Learning and Data Mining (LSE)
  • Artificial Intelligence (LSE)
  • Regression and Generalized Linear Models (LSE)
  • Applied Business Statistics (USC)
Research Interest
  • Functional data analysis
  • Complex time series analysis
  • High-dimensional statistics/econometrics
  • AI and machine learning with business applications
Selected Publications
  • Li, D., Qiao, X. and Wang, Z. (2025). Factor-guided estimation of large covariance matrix function with conditional functional sparsity. Journal of Econometrics, 251, 106070.
  • Guo, S., Qiao, X., Wang, Q. and Wang, Z. (2025). Factor modelling for high-dimensional functional time series. Journal of Business & Economic Statistics, in press.
  • Guo, S., Li, D., Qiao, X. and Wang, Y. (2025). From sparse to dense functional data in high dimensions: Revisiting phase transitions from a non-asymptotic perspective. Journal of Machine Learning Research, 26(15), 1-40.
  • Chang, J., Fang, Q., Qiao, X., and Yao, Q. (2024). On the Modeling and Prediction of High-Dimensional Functional Time Series. Journal of the American Statistical Association, in press.
  • Liu, Y., Qiao, X., Wang, L. and Pei, Y. (2024). Deep functional factor models: Forecasting high-dimensional functional time series via Bayesian nonparametric factorization, the 41th International Conference on Machine Learning, PMLR, 235, 31709-31727.
  • Fang, Q., Guo, S. and Qiao, X. (2024). Adaptive functional thresholding for sparse covariance function estimation in high dimensions. Journal of the American Statistical Association,119, 1473-1485.
  • Chang, J., Chen, C., Qiao, X. and Yao, Q. (2024). An autocovariance-based learning framework for high-dimensional functional time series. Journal of Econometrics, 239, 105385.
  • Liu, Y., Qiao, X., Wang, L. and Lam, J. (2023). EEGNN: Edge enhanced graph neural networks with a Bayesian nonparametric graph model, the 26th International Conference on Artificial Intelligence and Statistics, PMLR, 206, 2132-2146.
  • Guo, S. and Qiao, X. (2023). On consistency and sparsity for high-dimensional functional time series with application to autoregressions. Bernoulli, 29, 451-472.
  • Liu, Y., Qiao, X. and Lam, J. (2022). CATVI: Conditional and adaptively truncated variational inference for hierarchical Bayesian nonparametric models, the 25th International Conference on Artificial Intelligence and Statistics, PMLR, 151, 3647-3662.
  • Fang, Q., Guo, S. and Qiao, X. (2022). Finite sample theory for high-dimensional functional time series with applications. Electronic Journal of Statistics, 16, 527-591.
  • Chen, C., Guo, S. and Qiao, X. (2022). Functional linear regression: dependence and error contamination. Journal of Business & Economic Statistics, 40, 444-457.
  • Lian, H., Qiao, X. and Zhang, W. (2021). Homogeneity pursuit in single index models-based panel data analysis. Journal of Business & Economic Statistics, 39, 386-401.
  • Qiao, X., Qian, C., James, G. and Guo, S. (2020). Doubly functional graphical models in high dimensions, Biometrika, 107, 415-431.
  • Qiao, X., Guo, S. and James, G. (2019). Functional graphical models. Journal of the American Statistical Association, 114, 211-222.
  • Radchenko, P., Qiao, X. and James, G. (2015). Index models for sparsely sampled functional data. Journal of the American Statistical Association, 110, 824-836.
Recent Publications
On the Modeling and Prediction of High-Dimensional Functional Time Series

We propose a two-step procedure to model and predict high-dimensional functional time series, where the number of function-valued time series p is large in relation to the length of time series n. Our first step performs an eigenanalysis of a positive definite matrix, which leads to a one-to-one linear transformation for the original high-dimensional functional time series, and the transformed curve series can be segmented into several groups such that any two subseries from any two different groups are uncorrelated both contemporaneously and serially. Consequently in our second step those groups are handled separately without the information loss on the overall linear dynamic structure. The second step is devoted to establishing a finite-dimensional dynamical structure for all the transformed functional time series within each group. Furthermore the finite-dimensional structure is represented by that of a vector time series. Modeling and forecasting for the original high-dimensional functional time series are realized via those for the vector time series in all the groups. We investigate the theoretical properties of our proposed methods, and illustrate the finite-sample performance through both extensive simulation and two real datasets. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Factor-guided Estimation of Large Covariance Matrix Function with Conditional Functional Sparsity

This paper addresses the fundamental task of estimating covariance matrix functions for high-dimensional functional data/functional time series. We consider two functional factor structures encompassing either functional factors with scalar loadings or scalar factors with functional loadings, and postulate functional sparsity on the covariance of idiosyncratic errors after taking out the common unobserved factors. To facilitate estimation, we rely on the spiked matrix model and its functional generalization, and derive some novel asymptotic identifiability results, based on which we develop DIGIT and FPOET estimators under two functional factor models, respectively. Both estimators involve performing associated eigenanalysis to estimate the covariance of common components, followed by adaptive functional thresholding applied to the residual covariance. We also develop functional information criteria for model selection with theoretical guarantees. The convergence rates of involved estimated quantities are respectively established for DIGIT and FPOET estimators. Numerical studies including extensive simulations and a real data application on functional portfolio allocation are conducted to examine the finite-sample performance of the proposed methodology.

From Sparse to Dense Functional Data in High Dimensions: Revisiting Phase Transitions from a Non-Asymptotic Perspective

Nonparametric estimation of the mean and covariance functions is ubiquitous in functional data analysis and local linear smoothing techniques are most frequently used. Zhang and Wang (2016) explored different types of asymptotic properties of the estimation, which reveal interesting phase transition phenomena based on the relative order of the average sampling frequency per subject TT to the number of subjects nn, partitioning the data into three categories: “sparse”, “semi-dense”, and “ultra-dense”. In an increasingly available high-dimensional scenario, where the number of functional variables pp is large in relation to nn, we revisit this open problem from a non-asymptotic perspective by deriving comprehensive concentration inequalities for the local linear smoothers. Besides being of interest by themselves, our non-asymptotic results lead to elementwise maximum rates of L2L2 convergence and uniform convergence serving as a fundamentally important tool for further convergence analysis when pp grows exponentially with nn and possibly TT. With the presence of extra logplog⁡p terms to account for the high-dimensional effect, we then investigate the scaled phase transitions and the corresponding elementwise maximum rates from sparse to semi-dense to ultra-dense functional data in high dimensions. We also discuss a couple of applications of our theoretical results. Finally, numerical studies are carried out to confirm the established theoretical properties.