Building a lung and ovarian cancer data warehouse
Document Type
Article
Publication Date
1-1-2020
Abstract
Objectives: Despite the collection of vast amounts of data by the healthcare sector, effective decision-making in medical practice is still challenging. Data warehousing technology can be applied for the collection and management of clinical data from various sources to provide meaningful insights for physicians and administrators. Cancer data are extremely compli-cated and massive; hence, a clinical data warehouse system can provide insights into prevention, diagnosis and treatment processes through the use of online analytical processing tools for the analysis of multi-dimensional data at different granu-larity levels. Methods: In this study, a clinical data warehouse was developed for lung cancer data, which were kindly pro-vided by the United States National Cancer Institute. Lung and ovarian cancer data were imported in specific formats and cleaned to remove errors and redundancies. SQL server integration services (SSIS) were used for the extract-transform-load (ETL) process. Results: The design of the clinical data warehouse responds efficiently to all types of queries by adopting the fact constellation schema model. Various online analytical processing queries can be expressed using the proposed approach. Conclusions: This model succeeded in responding to complex queries, and the analysis of data is facilitated by using online analytical processing cubes and viewing multilevel data details.
Identifier
85097157297 (Scopus)
Publication Title
Healthcare Informatics Research
External Full Text Location
https://doi.org/10.4258/hir.2020.26.4.303
e-ISSN
2093369X
ISSN
20933681
First Page
303
Last Page
310
Issue
4
Volume
26
Fund Ref
National Cancer Institute
Recommended Citation
Atay, Canan Eren and Garani, Georgia, "Building a lung and ovarian cancer data warehouse" (2020). Faculty Publications. 5677.
https://digitalcommons.njit.edu/fac_pubs/5677
