Wednesday, April 18, 2018, 10:00
- 11:30
Building 3, Room 5208
Contact Person
In silico prioritization of undiscovered associations can help find causal genes of newly discovered diseases. Some existing methods are based on known associations and side information of diseases and genes. We exploit the possibility of using a neural network model, Neural Inductive Matrix Completion (NIMC) in disease-gene prediction.
Monday, March 19, 2018, 08:00
- 17:00
Building 9, Level 2, Hall 2
Contact Person
We are now in the fourth paradigm of science: Data Science. The massive amount of structured and unstructured data has posed new challenges and opportunities to the fields of computer science and statistics. Traditional computational and statistical methods for data storage, curation, sharing, querying, updating, visualization, analysis, and privacy have been shown to fail in the big data scenario due to the unprecedented volume, velocity, variety, veracity and value of the big data. This conference will bring together a number of prominent researchers in Computer Science and Statistics with common interests and active research in big data, as well as the researchers at KAUST who regularly generate or face big data, such as those in bioscience and red sea research.
Monday, February 05, 2018, 08:00
- 05:00
Conference Center Hall, B19 L3
Contact Person
The age of "big data" is here: data of unprecedented sizes is becoming ubiquitous, which brings new challenges and new opportunities. With this comes the need to solve optimization problems of unprecedented sizes.
Tuesday, June 13, 2017, 09:00
- 10:00
B1 L4 Room 4102
Contact Person
In this thesis defense, I will talk about two topics—computational methods for large spatial datasets and functional data ranking. Both are tackling the challenges of big and high-dimensional data.
Wednesday, May 17, 2017, 15:00
- 17:00
Building 3, Level 5, Room 5209​
Contact Person
Growth phenotype profiling of genome-wide gene-deletion strains overstresses conditions can offer a clear picture that the essentiality of genes depends on environmental conditions. In this dissertation, we first demonstrate that detecting such "co-fit" gene groups can be cast as a less well-studied problem in biclustering, i.e., constant-column biclustering. Despite significant advances in biclustering techniques, very few were designed for mining in growth phenotype data.
Dorit Hammerling, National Center for Atmospheric Research (NCAR)
Wednesday, February 10, 2016, 15:30
- 16:30
B1 L4 Room 4102
Contact Person
With data of rapidly increasing sizes in the environmental and geosciences such as satellite observations and high-resolution climate model runs, the spatial statistics community has recently focused on methods that are applicable to very large data. One such state-of-the-art method is the multi-resolution approximation (MRA), which was specifically developed with high performance computer architecture in mind.
Xiaohui Chang, Assistant Professor, College of Business at Oregon State University
Monday, November 09, 2015, 15:30
- 16:00
B1 L4 Room 4102
Contact Person
We propose a novel statistical framework by supplementing case–control data with summary statistics on the population at risk for a subset of risk factors. Our approach is to first form two unbiased estimating equations, one based on the case–control data and the other on both the case data and the summary statistics, and then optimally combine them to derive another estimating equation to be used for the estimation.