Automated and Scalable Algorithms and Systems for Unsupervised ML
- Yue Zhao, Computer Engineering, Carnegie Mellon University
B9 L2 H2
Many real-world events do not have outcome labels. For example, the fraudulence of a transaction remains unknown until it is discovered.
Overview
Abstract
Many real-world events do not have outcome labels. For example, the fraudulence of a transaction remains unknown until it is discovered. This is where unsupervised machine learning (ML) becomes crucial in real-world scenarios as it can make decisions based solely on observations. In this talk, I will address two key challenges in unsupervised ML: (i) automating the selection of the best ML model without evaluation and (ii) developing scalable learning systems that can handle large amounts of data. The first part of the talk will describe an automated algorithm called MetaOD, which can select top ML models for various applications without relying on labels or evaluations. The second part will cover an ML system called TOD, which can "transpile" a diverse group of unsupervised ML algorithms for GPU acceleration. Lastly, I will discuss my future plans, including the ML+X initiative, which aims to bring the advantages of ML automation and systems to other CS/ECE areas (e.g., power systems, mobility, and hardware), and the creation of a fully automated ML pipeline that chooses hardware, systems, and models seamlessly.
Brief Biography
Yue Zhao is a Ph.D. candidate at CMU. He focuses on creating automated and scalable ML algorithms and systems, and has published over 30 papers in top venues such as VLDB, MLSys, NeurIPS, TKDE, and JMLR. His open-source systems (https://github.com/yzhao062) have been widely deployed in firms and industries such as Morgan Stanley and Tesla, and have received over 15,000 GitHub stars and 11 million downloads. Yue has received the CMU Presidential Fellowship and Norton Graduate Fellowship. More information about him can be found at https://www.andrew.cmu.edu/user/yuezhao2/