Thursday, September 21, 2023, 12:00
- 13:00
Building 9, Level 2, Room 2325
Contact Person
Cross-validation is an algorithmic technique extensively used for estimating the prediction error, tuning the regularization parameter, and choosing between competing predictive rules.
Tuesday, April 04, 2023, 16:00
- 19:00
B4, L5, R5220
This Ph.D. research focuses on proposing new statistical methods for two types of time series data: integer-valued data and multivariate nonstationary extreme data. For the former, the researcher proposes a novel approach to building an integer-valued autoregressive (INAR) model that offers the flexibility to specify both marginal and innovation distributions, leading to several new INAR processes. For the latter, the researcher proposes new extreme value theory methods for analyzing multivariate nonstationary extreme data, specifically EEG recordings from patients with epilepsy. Two extreme-value methods, Conex-Connect and Club Exco, are proposed to study alterations in the brain network during extreme events such as epileptic seizures.
Prof. Mohamed El Machkouri, Associate professor, Applied Mathematics, University of Rouen Normandy
Sunday, February 12, 2023, 16:00
- 17:30
Building 1, Level 4, Room 4102
Contact Person
In this talk, we present theoretical asymptotic results for the nonparametric estimation of the conditional density of a scalar response variable Y given the explanatory X taking values in a finite-dimensional space when the sample of observations is considered as a sequence of dependent random variables.
Thursday, January 05, 2023, 17:00
- 19:00
Building 1, Room 4214
Contact Person
The standard approach to analyzing brain electrical activity is to examine the spectral density function (SDF) and identify frequency bands, defined apriori, that have the most substantial relative contributions to the overall variance of the signal. However, a limitation of this approach is that the precise frequency and bandwidth of oscillations are not uniform across cognitive demands. Thus, these bands should not be arbitrarily set in any analysis. To overcome this limitation, we propose three Bayesian Non-parametric models for time series decomposition, which are data-driven approaches that identify (i) the number of prominent spectral peaks, (ii) the frequency peak locations, and (iii) their corresponding bandwidths (or spread of power around the peaks).
Prof. Raquel Prado, Department of Statistics, University of California
Thursday, April 28, 2022, 16:30
- 17:30
Auditorium 0215 (BW Building 2 and 3)
Contact Person
During the first part of the talk we present an approach that allows for flexible analysis of multivariate non-stationary time series via dynamic models on the partial autocorrelation domain. We discuss various aspects of these models, including the use of shrinkage priors to deal with overfitting issues, as well as hierarchical extensions.
Prof. Raquel Prado, Department of Statistics, University of California
Wednesday, April 27, 2022, 16:00
- 17:30
Building 1, Level 4, Room 4102
Contact Person
In the first part of this lecture we present a review dynamic linear models for multivariate time series and hierarchical dynamic linear models for multiple time series. Topics related to model building as well as closed form, approximate and simulation-based methods for Bayesian filtering, smoothing and forecasting within these classes of models will be discussed.
Prof. Bruno Sanso, Department of Statistics, University of California
Tuesday, April 26, 2022, 16:30
- 17:30
Auditorium 0215 (BW Building 2 and 3)
Contact Person
We present a framework for non-Gaussian spatial processes that encompasses large distribution families. Spatial dependence for a set of irregularly scattered locations is described with a mixture of pairwise kernels. Focusing on the nearest neighbors of a given location, within a reference set, we obtain a valid spatial process:
Prof. Raquel Prado, Department of Statistics, University of California
Tuesday, April 26, 2022, 14:00
- 15:30
Building 1, Level 4, Room 3119
Contact Person
We discuss conditionally Gaussian dynamic linear models for analysis and forecasting of univariate time series and present simulation-based methods for Bayesian filtering and smoothing within this class of models, including Markov chain Monte Carlo (MCMC) and sequential Monte Carlo (SMC) methods.
Prof. Bruno Sanso, Department of Statistics, University of California
Monday, April 25, 2022, 17:45
- 19:00
Building 1, Level 4, Room 4102
Contact Person
We will start by presenting the general framework of Bayesian hierarchical dynamic models (BHDM) for space-time data. Within this framework, we will consider in some detail the special case of linear dynamics. We will review MCMC estimation for conditionally linear dynamic models. We will introduce integro-differential models and give a SPDE justification that provides insights into the connections between the dynamics of the process and the properties of the kernel defining the IDE.
Prof. Raquel Prado, Department of Statistics, University of California
Monday, April 25, 2022, 16:00
- 17:30
Building 1, Level 4, Room 4102
Contact Person
In this lecture we present an overview of dynamic linear models for analysis and forecasting of univariate time series. We will discuss principles for model building and methods for Bayesian filtering, smoothing and forecasting. We will illustrate the use of these models in several case studies arising in different applied areas including environmental sciences and neuroscience.
Yi Li, Professor, Biostatistics, University of Michigan
Sunday, April 24, 2022, 16:00
- 18:00
Building 1, Level 4, Room 4102
Contact Person
Continuing on with Lecture 1, this short course introduces various cutting-edge methods that handle survival outcome data with ultrahigh dimensional predictors, that is, when the dimension of predictors is much higher than the sample size. We will also discuss several new methods for quantifying the uncertainty of estimates in a high dimensional survival setting, a very active area in machine learning.
Yi Li, Professor, Biostatistics, University of Michigan
Thursday, April 21, 2022, 16:30
- 17:30
Auditorium 0215 (BW Building 2 and 3)
Contact Person
Though Gaussian graphical  models have been widely used in many scientific fields, relatively limited progress has been made to link graph structures to external covariates. We propose a Gaussian graphical regression model, which regresses both the mean and the precision matrix of a Gaussian graphical model on covariates. In the context of co-expression quantitative trait locus (QTL) studies, our method can determine how genetic variants and clinical conditions modulate the subject-level network structures, and recover both the population-level and subject-level gene networks.
Yi Li, Professor, Biostatistics, University of Michigan
Tuesday, April 19, 2022, 16:00
- 18:00
Building 1, Level 4, Room 4102
Contact Person
In the era of biomedical big data, survival outcome data with high-throughput predictors are routinely collected. These high dimensional data defy classical survival regression models, which are either infeasible to fit or likely to incur low predictability because of overfitting. This short course will introduce the basic concepts of survival analysis and various cutting-edge methods that handle survival outcome data with high dimensional predictors. I will cover statistical principles and concepts behind the methods, and will also discuss their applications to the real medical examples.
Jinyuan Liu, PhD Student, Biostatistics, University of California, USA
Sunday, November 28, 2021, 08:30
- 09:30
KAUST
Contact Person
Breakthroughs such as high-throughput sequencing are generating flourishing high-dimensional data that provoke challenges in both statistical analyses and interpretations.
Prof. John Kornak, Biostatistics, University of California, San Francisco
Thursday, October 14, 2021, 16:30
- 17:45
Auditorium, between buildings 4&5
Contact Person
Tuesday, November 24, 2020, 09:00
- 10:00
KAUST
Contact Person
Intrinsic connectivity networks (ICNs) refer to brain functional networks that are consistently found under various conditions, during tasks or at rest. Some studies demonstrated that while some stimuli do not impact intrinsic connectivity, other stimuli actually activate intrinsic connectivity through suppression, excitation, moderation or modification. Most analyses of fMRI data use ad-hoc methods to estimate the latent structure of ICNs. Modeling the effects on ICNs has also not been fully investigated. We propose a Bayesian Intrinsic Connectivity Network (BICNet) model, an extended Bayesian dynamic sparse latent factor model, to identify the ICNs and quantify task-related effects on the ICNs. BICNet has the following advantages: (1) It simultaneously identifies the individual and group-level ICNs; (2) It robustly identifies ICNs by jointly modeling rfMRI and tfMRI; (3) Compared to ICA-based methods, it can quantify the difference of ICN amplitudes across different states; (4) The sparsity of ICNs automatically performs feature selection, instead of ad-hoc thresholding. We apply BICNet to the rfMRI and language tfMRI data from the HCP and identify several ICNs related to distinct language processing functions.
Thursday, April 02, 2020, 12:00
- 13:00
KAUST
Contact Person
This talk presents a new classification method for functional data. We consider the case where different groups of functions have similar means so that it is difficult to classify them based on only the mean function. To overcome this limitation, we propose the second moment based functional classifier (SMFC). Here, we demonstrate that the new method is sensitive to divergence in the second moment structure and thus produces lower rate of misclassification compared to other competitor methods. Our method uses the Hilbert-Schmidt norm to measure the divergence of second moment structure. One important innovation of our classification procedure lies in the dimension reduction step. The method data-adaptively discovers the basis functions that best capture the discrepancy between the second moment structures of the groups, rather than uses the functional principal component of each individual group, and good performance can be achieved as unnecessary variability is removed so that the classification accuracy is improved. Consistency properties of the classification procedure and the relevant estimators are established. Simulation study and real data analysis on phoneme and rat brain activity trajectories empirically validate the superiority of the proposed method.
Paula Moraga, Lecturer, Department of Mathematical Sciences, University of Bath, UK
Wednesday, February 05, 2020, 12:00
- 13:00
Building 9, Level 2, Hall 2
Contact Person
In this talk, I will give an overview of my research which focuses on the development of innovative statistical methods and interactive visualization applications for geospatial data analysis and health surveillance. I will illustrate some of my projects in the following areas: 1. Development of new statistical methodology; 2. Development of open-source statistical software such as the R packages; 3. Health surveillance projects. Finally, I will describe my future research on innovation in data acquisition and visualization, precision disease mapping, and digital health surveillance, and how it can inform policymaking and improve population health globally.
Monday, November 18, 2019, 00:00
- 23:45
Auditorium 0215, between building 2 and 3
2019 Statistics and Data Science Workshop confirmed speakers include Prof. Alexander Aue, University of California Davis, USA, Prof. Francois Bachoc, University Toulouse 3, France, Prof. Rosa M. Crujeiras Casais, University of Santiago de Compostela, Spain, Prof. Emanuele Giorgi, Lancaster University, UK, Prof. Jeremy Heng, ESSEC Asia-Pacific, Singapore, Prof. Birgir Hrafnkelsson, University of Iceland, Iceland, Prof. Ajay Jasra, KAUST, Saudi Arabia, Prof. Emtiyaz Khan, RIKEN Center for Advanced Intelligence Project, Japan, Prof. Robert Krafty, University of Pittsburgh, USA, Prof. Guido Kuersteiner, University of Maryland, USA, Prof. Paula Moraga, University of Bath, UK, Prof. Tadeusz Patzek, KAUST, Saudi Arabia, Prof. Brian Reich, North Carolina State University, USA, Prof. Dag Tjostheim, University Bergen, Norway, Prof. Xiangliang Zhang, KAUST, Saudi Arabia, Sylvia Rose Esterby, University of British Colombia, Canada, Prof. Abdel El-Shaarawi, Retired Professor at the National Water Research Institute, Canada. View Workshop schedule and abstracts here.
Prof, David Stoffer, University of Pittsburgh, Pennsylvania, USA
Friday, April 26, 2019, 15:00
- 18:00
B1 L4 Room 4102
Contact Person
Ever wonder why, when you fly to Jeddah you don't end up in Riyadh?  The tracking devices use a nonlinear state space model to make sure your plane is on course. While inference for the linear Gaussian model is fairly simple, inference for nonlinear models can be difficult and often relies on derivative free numerical optimization techniques.  A promising method that I will discuss is based on particle approximations of the conditional distribution of the hidden process given the data. This distribution is needed for both classical inference (e.g., Monte Carlo EM type algorithms) and Bayesian inference (e.g., Gibbs sampler). 
Prof. Daniel Peña Sánchez de Rivera, Department of Statistics, Universidad Carlos III de Madrid
Thursday, April 25, 2019, 16:00
- 17:00
B1 L4 Room 4102
Contact Person
Generalized Dynamic principal components are presented and it is shown how to define one side inear combinations of the present and past values of the series that minimize the reconstruction mean squared error (ODPC). It is shown that the ODPC introduced in this paper can be successfully used for forecasting high-dimensional multiple time series.
Professor Ngai Hang Chan, Professor of Statistics, Chinese University of Hong Kong
Tuesday, April 23, 2019, 16:00
- 17:00
B1 L4 room 4102
Contact Person
Non-stationary spatial models are widely applicable in diverse disciplines, ranging from bio-medical sciences to geophysical studies. In many of theses applications, testing for structural changes in the trend and testing the specific form of the trend are highly relevant. A novel statistics based on a discrepancy measure over small regions is proposed in this paper. Such a measure can be used to construct tests for structural trends and to identify change boundaries of the trends. By virtue of the m-dependence approximation of a stationary random eld, asymptotic properties and limit distributions of these tests are established. The method is illustrated by simulations and data analysis.