Scalable Methods for Multivariate Normal Probability Estimation with Applications in Confidence Region Detection, Transport Phenomena, and Parallel Computing Using RCOMPSs
This thesis addresses computing high-dimensional multivariate normal (MVN) probabilities in environmental and geospatial applications by combining high-performance computing, numerical approximation, and transport-based covariance modeling to make several important spatial procedures usable at larger scales.
Overview
Computing high-dimensional multivariate normal (MVN) probabilities is a recurring bottleneck in spatial statistics. Classical methods such as the Separation-Of-Variables (SOV) algorithm are reliable, but their dependence on dense Cholesky factorization leads to O(n3) time and O(n2) memory costs, where n is the problem dimension. As a result, methods that are statistically well understood often become impractical at the scales now common in environmental and geospatial applications.
First, we develop a parallel implementation of the SOV algorithm based on task-based scheduling and tile-based linear algebra. We then incorporate Tile Low-Rank (TLR) approximation to reduce cost while retaining the accuracy needed for confidence region detection. On synthetic and real wind-speed datasets, the proposed framework achieves up to 20X speedups over dense implementations.
Second, we study confidence regions for geostatistical excursion sets, whose construction requires repeated large covariance operations and conditional simulations. We reformulate the confidence-region algorithm as a collection of tile-based operations executed through the PaRSEC runtime, with GPU acceleration and mixed-precision arithmetic in the dominant kernels. The resulting implementation is substantially faster than the R-based baseline, reaching up to 33X speedups while matching the original method's statistical output in our experiments.
Third, we extend the Lagrangian framework for spatio-temporal covariance modeling by introducing random acceleration in addition to random velocity. A quadratic-form completion identity yields closed-form covariance and cross-covariance expressions for several transported Gaussian models. Simulation studies and an application to GOES-19 satellite cloud imagery show that acceleration can improve short-term prediction when transport paths bend or decelerate over time.
Finally, we introduce RCOMPSs, a task-based runtime system that brings parallel and distributed execution to R with limited changes to user code. Built on the COMPSs framework, RCOMPSs lets users annotate sequential R functions as tasks while the runtime handles dependency analysis, scheduling, and data movement. Experiments with k-means clustering, k-nearest neighbors, and linear regression show consistent speedups on both multicore and distributed platforms.
Taken together, these contributions show how advances in parallel computing and statistical modeling can be combined to widen the practical range of modern spatial methods, from probability computation and uncertainty quantification to transport-aware covariance modeling and scalable statistical software.
Presenters
Brief Biography
Xiran Zhang is a Ph.D. candidate in Statistics at King Abdullah University of Science and Technology (KAUST). He received his B.S. in Mathematics and Applied Mathematics from the University of Science and Technology of China (USTC) in June 2021 and his M.S. in Statistics from KAUST in December 2022. His research lies at the intersection of statistics and high-performance computing, with a particular focus on scalable methods for large-scale geostatistical and spatio-temporal problems. Key words of his work include distributed CPU/GPU computing, parallel algorithms, uncertainty quantification for massive spatial data, and spatio-temporal cross-covariance modeling.
Xiran has his work published or presented at major international conferences, including IPDPS, JSM, and SC. In addition to his research, he has been actively involved in teaching and mentoring, serving as a teaching assistant for several STAT courses at KAUST and at King Fahad Security College for the Ministry of Interior. He has received several honors, including the Al-Kindi Statistics Top Quals Student Award in 2021 and the KAUST Dean’s List Award in 2024 and 2025.