Containment of partially specified tree-pattern queries in the presence of dimension graphs
Document Type
Article
Publication Date
1-1-2009
Abstract
Huge volumes of data are now organized or exported in tree-structured form, and querying capabilities are provided through tree-pattern queries. The need to query tree-structured data sources whose structure is not fully known, and the need to integrate multiple data sources with different tree structures, have recently motivated query languages that relax the complete specification of a tree pattern. In this paper, we consider a query language that allows the partial specification of a tree pattern. Queries in this language range from structureless keyword-based queries to completely specified tree patterns. To support the evaluation of partially specified queries, we use semantically rich constructs, called dimension graphs, which abstract structural information of the tree-structured data. We address the problem of query containment in the presence of dimension graphs and provide necessary and sufficient conditions for query containment. As checking query containment can be expensive, we suggest two heuristic approaches for query containment in the presence of dimension graphs. Both approaches are based on extracting structural information from the dimension graph that can be added to the queries while preserving equivalence with respect to the dimension graph. We consider two cases: extracting and storing different types of structural information in advance, and extracting information on the fly (at query time). Both approaches are implemented, validated, and compared through experimental evaluation. © 2008 Springer-Verlag.
Identifier
58149477078 (Scopus)
Publication Title
VLDB Journal
External Full Text Location
https://doi.org/10.1007/s00778-008-0097-y
e-ISSN
0949-877X
ISSN
1066-8888
First Page
233
Last Page
254
Issue
1
Volume
18
Recommended Citation
Theodoratos, Dimitri; Placek, Pawel; Dalamagas, Theodore; Souldatos, Stefanos; and Sellis, Timos, "Containment of partially specified tree-pattern queries in the presence of dimension graphs" (2009). Faculty Publications. 12240.
https://digitalcommons.njit.edu/fac_pubs/12240
