Leveraging Double Simulation to Efficiently Evaluate Hybrid Patterns on Data Graphs
Document Type
Conference Proceeding
Publication Date
1-1-2020
Abstract
Labeled graphs are used to represent entities and their relationships in a plethora of Web applications. Graph pattern matching is a fundamental operation for the analysis and exploration of data graphs. In this paper, we address the problem of efficiently finding homomorphic matches for hybrid patterns, where each edge may be mapped either to an edge or to a path, thus allowing for higher expressiveness and flexibility in query formulation. We design a novel holistic graph simulation-based algorithm, called GraphMatch-Sim, which leverages simulation to precisely identify, in advance, all the graph nodes that participate in the pattern matches returned. GraphMatch-Sim can flexibly employ any reachability index as a plug-in component. Unlike existing methods, it produces no redundant intermediate results, thus achieving worst-case optimality. An extensive experimental evaluation on both real and synthetic datasets shows that our method evaluates hybrid patterns orders of magnitude faster than existing algorithms and has much better scalability.
Identifier
85096531947 (Scopus)
ISBN
[9783030620042]
Publication Title
Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics
External Full Text Location
https://doi.org/10.1007/978-3-030-62005-9_19
e-ISSN
16113349
ISSN
03029743
First Page
255
Last Page
269
Volume
12342 LNCS
Grant
61872276
Fund Ref
National Natural Science Foundation of China
Recommended Citation
Wu, Xiaoying; Theodoratos, Dimitri; Skoutas, Dimitrios; and Lan, Michael, "Leveraging Double Simulation to Efficiently Evaluate Hybrid Patterns on Data Graphs" (2020). Faculty Publications. 5714.
https://digitalcommons.njit.edu/fac_pubs/5714
