Cohesive keyword search on tree data
Document Type
Conference Proceeding
Publication Date
1-1-2016
Abstract
Keyword search is the most popular querying technique on semistructured data. Keyword queries are simple and convenient. However, as a consequence of their imprecision, there is usually a huge number of candidate results of which only very few match the user's intent. Unfortunately, the existing semantics for keyword queries are ad-hoc and they generally fail to "guess" the user intent. Therefore, the quality of their answers is poor and the existing algorithms do not scale satisfactorily. In this paper, we introduce the novel concept of cohesive keyword queries for tree data. Intuitively, a cohesiveness relationship on keywords indicates that they should form a cohesive whole in a query result. Cohesive keyword queries allow term nesting and keyword repetition. Cohesive keyword queries bridge the gap between flat keyword queries and structured queries. Although more expressive, they are as simple as flat keyword queries and not require any schema knowledge. We provide formal semantics for cohesive keyword queries and rank query results on the proximity of the keyword instances. We design a stack based algorithm which efficiently evaluates cohesive keyword queries. Our experiments demonstrate that our approach outperforms in quality previous filtering semantics and our algorithm scales smoothly on queries of even 20 keywords on large datasets.
Identifier
85046689352 (Scopus)
ISBN
[9783893180707]
Publication Title
Advances in Database Technology Edbt
External Full Text Location
https://doi.org/10.5441/002/edbt.2016.15
e-ISSN
23672005
First Page
137
Last Page
148
Volume
2016-March
Recommended Citation
Dimitriou, Aggeliki; Theodoratos, Dimitri; Dass, Ananya; and Vassiliou, Yannis, "Cohesive keyword search on tree data" (2016). Faculty Publications. 10852.
https://digitalcommons.njit.edu/fac_pubs/10852
