DI++: A deep learning system for patient condition identification in clinical notes
Document Type
Article
Publication Date
1-1-2022
Abstract
Accurately recording a patient's medical conditions in an EHR system is the basis of effectively documenting patient health status, coding for billing, and supporting data-driven clinical decision making. However, patient conditions are often not fully captured in structured EHR systems, but may be documented in unstructured clinical notes. The challenge is that not all disease mentions in clinical notes actually refer to a patient's conditions. We developed a two-step workflow for identifying patient's conditions from clinical notes: disease mention extraction and disease mention classification. We implemented this workflow in a prototype system, DI++, for Disease Identification. An advanced deep learning model, CLSTM-Attention model, is developed for disease mention classification in DI++. Extensive empirical evaluation on about one million pages of de-identified clinical notes demonstrates that DI++ has significant performance advantage over existing systems on F1 Score, Area Under the Curve metrics, and efficiency. The proposed CLSTM-Attention model outperforms the existing deep learning models for disease mention classification.
Identifier
85121759610 (Scopus)
Publication Title
Artificial Intelligence in Medicine
External Full Text Location
https://doi.org/10.1016/j.artmed.2021.102224
e-ISSN
18732860
ISSN
09333657
PubMed ID
34998515
Volume
123
Grant
UL1TR003017
Fund Ref
National Institutes of Health
Recommended Citation
Shi, Jinhe; Gao, Xiangyu; Kinsman, William C.; Ha, Chenyu; Gao, Guodong Gordon; and Chen, Yi, "DI++: A deep learning system for patient condition identification in clinical notes" (2022). Faculty Publications. 3371.
https://digitalcommons.njit.edu/fac_pubs/3371