Feature selection and prediction of small-for-gestational-age infants

Document Type

Article

Publication Date

3-1-2024

Abstract

The small-for-gestational-age (SGA) condition often causes serious problems. Therefore, identifying the risk factors for SGA is important. Traditional statistical methods such as stepwise logistic regression (LR) have been widely utilized to discover possible risk factors. However, other feature selection methods from machine learning field have rarely been employed for the task. In this paper, a comparison of five feature selection methods from both fields for SGA risk factors analysis is conducted for the first time. To evaluate their performance, four classification algorithms are used to construct SGA prediction models. The evaluation criteria are precision and the area under the receiver operator characteristic curve. Stepwise LR achieves the best performance among the five feature selection methods, because it conducts both a univariate significance test and a model significance test, which make it more suitable for handling the complex relations among features. The top 20 features selected by each feature selection method and the 27 features selected by four or five of them could assist physicians to revise traditional SGA evaluation models. Ensemble method is also exploited to build effective SGA prediction models based on the feature subsets, which is indeed superior compared with the individual ones shown in the results.

Identifier

85049552920 (Scopus)

Publication Title

Journal of Ambient Intelligence and Humanized Computing

External Full Text Location

https://doi.org/10.1007/s12652-018-0892-2

e-ISSN

18685145

ISSN

18685137

First Page

1881

Last Page

1895

Issue

3

Volume

15

Grant

2017YFB1400803

Fund Ref

National Key Research and Development Program of China

This document is currently not available here.

Share

COinS