Divide and conquer pattern searching

2016-12-27 - 10:58

CEMSE CS New Data Mining-Strategy — A new data mining strategy that offers unprecedented pattern search speed could lead to new insights from massive data.

© Mopic / Alamy Stock Photo DTFTEM

Searching for recurring patterns in network systems has become a fundamental part of research and discovery in fields as diverse as biology and social media. KAUST researchers have developed a pattern or graph-mining framework that promises to significantly speed up searches on massive network data sets.

“A graph is a data structure that models complex relationships among objects,” explained Panagiotis Kalnis, leader of the research team from the KAUST Extreme Computing Research Center. “Graphs are widely used in many modern applications, including social networks, biological networks like protein-to-protein interactions, and communication networks like the internet.”

In these applications, one of the most important operations is the process of finding recurring graphs that reveal how objects tend to connect to each other. The process, which is called frequent subgraph mining (FSM), is an essential building block of many knowledge extraction techniques in social studies, bioinformatics and image processing, as well as in security and fraud detection. However, graphs may contain hundreds of millions of objects and billions of relationships, which means that extracting recurring patterns places huge demands on time and computing resources.

“In essence, if we can provide a better algorithm, all the applications that depend on FSM will be able to perform deeper analysis on larger data in less time,” Kalnis noted.

Kalnis and his colleagues developed a system called ScaleMine that offers a ten-fold acceleration compared with existing methods.

“FSM involves a vast number of graph operations, each of which is computationally expensive, so the only practical way to support FSM in large graphs is through massively parallel computation,” he said.

Read the full article

Related Persons

Panagiotis Kalnis

Professor, Computer Science

Professors

Welcome to Extreme Computing Research Center

Related Persons

Panagiotis Kalnis

Events

Latest News

Matteo Parsani finishes hand-cycle from east to west coast

Mixed Feelings About Mixed Precisions: Birds of a Feather at SC23!

Balancing renewable energy systems in Saudi buildings

CEMSE - Computer, Electrical and Mathematical Sciences and Engineering Division

Biological and Environmental Sciences Engineering Division

Physical Science and Engineering Division

Study

Expanding Knowledge

Student Affairs

Living in KAUST

About KAUST

Latest from KAUST

Extreme Computing Research Center

Welcome to ​Extreme Computing Research Center

Divide and conquer pattern searching

Related Persons

Study

Expanding Knowledge

Student Affairs

Living in KAUST

About KAUST

Latest from KAUST

Welcome to Extreme Computing Research Center