Supporting Complex Query Time Enrichment For Analytics
Document Type
Conference Proceeding
Publication Date
1-1-2023
Abstract
Several application domains require data to be enriched prior to its use. Data enrichment is often performed using expensive machine learning models to interpret low-level data (e.g., models for face detection) into semantically meaningful observation. Collecting and enriching data offline before loading it to a database is infeasible if one desires online analysis on data as it arrives. Enriching data on the fly at insertion could result in redundant work (if applications require only a fraction of the data to be enriched) and could result in a bottleneck (if enrichment functions are expensive). Any scalable solution requires enrichment during query processing. This paper explores two different architectures for integrating enrichment into query processing - a loosely coupled approach wherein enrichment is performed outside of the DBMS and a tightly coupled approach wherein it is performed within the DBMS. The paper addresses the challenges of increased query latency due to query time enrichment.
Identifier
85137603808 (Scopus)
ISBN
[9783893180882]
Publication Title
Advances in Database Technology Edbt
External Full Text Location
https://doi.org/10.48786/edbt.2023.08
e-ISSN
23672005
First Page
92
Last Page
104
Issue
1
Volume
26
Grant
1952247
Fund Ref
National Science Foundation
Recommended Citation
    Ghosh, Dhrubajyoti; Gupta, Peeyush; Mehrotra, Sharad; and Sharma, Shantanu, "Supporting Complex Query Time Enrichment For Analytics" (2023). Faculty Publications.  2112.
    
    
    
        https://digitalcommons.njit.edu/fac_pubs/2112
    
 
				 
					