Containment of partially specified tree-pattern queries in the presence of dimension graphs

Document Type

Article

Publication Date

1-1-2009

Abstract

Nowadays, huge volumes of data are organized or exported in tree-structured form. Querying capabilities are provided through tree-pattern queries. The need for querying tree-structured data sources when their structure is not fully known, and the need to integrate multiple data sources with different tree structures have driven, recently, the suggestion of query languages that relax the complete specification of a tree pattern. In this paper, we consider a query language that allows the partial specification of a tree pattern. Queries in this language range from structureless keyword-based queries to completely specified tree patterns. To support the evaluation of partially specified queries, we use semantically rich constructs, called dimension graphs, which abstract structural information of the tree-structured data. We address the problem of query containment in the presence of dimension graphs and we provide necessary and sufficient conditions for query containment. As checking query containment can be expensive, we suggest two heuristic approaches for query containment in the presence of dimension graphs. Our approaches are based on extracting structural information from the dimension graph that can be added to the queries while preserving equivalence with respect to the dimension graph. We considered both cases: extracting and storing different types of structural information in advance, and extracting information on-the-fly (at query time). Both approaches are implemented, validated, and compared through experimental evaluation. © 2008 Springer-Verlag.

Identifier

58149477078 (Scopus)

Publication Title

VLDB Journal

External Full Text Location

https://doi.org/10.1007/s00778-008-0097-y

e-ISSN

0949877X

ISSN

10668888

First Page

233

Last Page

254

Issue

1

Volume

18

This document is currently not available here.

Share

COinS