Faculty Publications

Finding patterns in three-dimensional graphs: Algorithms and applications to scientific data mining

Xiong Wang, IEEE
Jason T.L. Wang, IEEE
Dennis Shasha, Courant Institute of Mathematical Sciences
Bruce A. Shapiro, National Cancer Institute at Frederick
Isidore Rigoutsos, IEEE
Kaizhong Zhang, Western University

Document Type

Article

Publication Date

7-1-2002

Abstract

This paper presents a method for finding patterns in 3D graphs. Each node in a graph is an undecomposable or atomic unit and has a label. Edges are links between the atomic units. Patterns are rigid substructures that may occur in a graph after allowing for an arbitrary number of whole-structure rotations and translations as well as a small number (specified by the user) of edit operations in the patterns or in the graph. (When a pattern appears in a graph only after the graph has been modified, we call that appearance "approximate occurrence"). The edit operations include relabeling a node, deleting a node and inserting a node. The proposed method is based on the geometric hashing technique, which hashes node-triplets of the graphs into a 3D table and compresses the labeltriplets in the table. To demonstrate the utility of our algorithms, we discuss two applications of them in scientific data mining. First, we apply the method to locating frequently occurring motifs in two families of proteins pertaining to RNA-directed DNA Polymerase and Thymidylate Synthase and use the motifs to classify the proteins. Then, we apply the method to clustering chemical compounds pertaining to aromatic, bicyclicalkanes, and photosynthesis. Experimental results indicate the good performance of our algorithms and high recall and precision rates for both classification and clustering.

Identifier

0036650077 (Scopus)

Publication Title

IEEE Transactions on Knowledge and Data Engineering

External Full Text Location

https://doi.org/10.1109/TKDE.2002.1019211

ISSN

10414347

First Page

731

Last Page

749

Issue

Volume

Grant

IIS-9988345

Fund Ref

Natural Sciences and Engineering Research Council of Canada

Recommended Citation

Wang, Xiong; Wang, Jason T.L.; Shasha, Dennis; Shapiro, Bruce A.; Rigoutsos, Isidore; and Zhang, Kaizhong, "Finding patterns in three-dimensional graphs: Algorithms and applications to scientific data mining" (2002). Faculty Publications. 14663.
https://digitalcommons.njit.edu/fac_pubs/14663

This document is currently not available here.

COinS

DOI

10.1109/TKDE.2002.1019211

Faculty Publications

Finding patterns in three-dimensional graphs: Algorithms and applications to scientific data mining

Document Type

Publication Date

Abstract

Identifier

Publication Title

External Full Text Location

ISSN

First Page

Last Page

Issue

Volume

Grant

Fund Ref

Recommended Citation

DOI

Search

Browse

Author Corner

Links

Faculty Publications

Finding patterns in three-dimensional graphs: Algorithms and applications to scientific data mining

Authors

Document Type

Publication Date

Abstract

Identifier

Publication Title

External Full Text Location

ISSN

First Page

Last Page

Issue

Volume

Grant

Fund Ref

Recommended Citation

Share

DOI

Search

Browse

Author Corner

Links