TY - JOUR
T1 - Prediction of low Apgar score at five minutes following labor induction intervention in vaginal deliveries
T2 - machine learning approach for imbalanced data at a tertiary hospital in North Tanzania
AU - Tarimo, Clifford Silver
AU - Bhuyan, Soumitra S.
AU - Zhao, Yizhen
AU - Ren, Weicun
AU - Mohammed, Akram
AU - Li, Quanman
AU - Gardner, Marilyn
AU - Mahande, Michael Johnson
AU - Wang, Yuhui
AU - Wu, Jian
N1 - Funding Information:
We would like to thank the staff of the Birth Registry, Department of Obstetrics & Gynecology of the Kilimanjaro Christian Medical Centre and the Department of Epidemiology and Applied Biostatistics of the Kilimanjaro Christian Medical University College for their substantial support during this study. Special thanks to women who participated in the KCMC birth registry study and the Norwegian birth registry for partnering with us in providing the limited dataset used for this study. Sincere thanks to Urmi Basu of the University of Nebraska Medical Center (UNMC) for providing a distinguished motivation and experience in professional writing.
Funding Information:
Research on 2021 Postgraduate Education Reform and Quality Improvement Project of Henan Province (YJS2021KC07), Performance Evaluation of New Basic Public Health Service Projects in Henan Province(2020130B), CDC-Hospital-Community Trinity Coordinated Prevention and Control System for Major Infectious Diseases (XKZDQY202007). We declare that the funder had no influence on the study design, collection, analysis, and interpretation of data and on writing the manuscript.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Background: Prediction of low Apgar score for vaginal deliveries following labor induction intervention is critical for improving neonatal health outcomes. We set out to investigate important attributes and train popular machine learning (ML) algorithms to correctly classify neonates with a low Apgar scores from an imbalanced learning perspective. Methods: We analyzed 7716 induced vaginal deliveries from the electronic birth registry of the Kilimanjaro Christian Medical Centre (KCMC). 733 (9.5%) of which constituted of low (< 7) Apgar score neonates. The ‘extra-tree classifier’ was used to assess features’ importance. We used Area Under Curve (AUC), recall, precision, F-score, Matthews Correlation Coefficient (MCC), balanced accuracy (BA), bookmaker informedness (BM), and markedness (MK) to evaluate the performance of the selected six (6) machine learning classifiers. To address class imbalances, we examined three widely used resampling techniques: the Synthetic Minority Oversampling Technique (SMOTE) and Random Oversampling Examples (ROS) and Random undersampling techniques (RUS). We applied Decision Curve Analysis (DCA) to evaluate the net benefit of the selected classifiers. Results: Birth weight, maternal age, and gestational age were found to be important predictors for the low Apgar score following induced vaginal delivery. SMOTE, ROS and and RUS techniques were more effective at improving “recalls” among other metrics in all the models under investigation. A slight improvement was observed in the F1 score, BA, and BM. DCA revealed potential benefits of applying Boosting method for predicting low Apgar scores among the tested models. Conclusion: There is an opportunity for more algorithms to be tested to come up with theoretical guidance on more effective rebalancing techniques suitable for this particular imbalanced ratio. Future research should prioritize a debate on which performance indicators to look up to when dealing with imbalanced or skewed data.
AB - Background: Prediction of low Apgar score for vaginal deliveries following labor induction intervention is critical for improving neonatal health outcomes. We set out to investigate important attributes and train popular machine learning (ML) algorithms to correctly classify neonates with a low Apgar scores from an imbalanced learning perspective. Methods: We analyzed 7716 induced vaginal deliveries from the electronic birth registry of the Kilimanjaro Christian Medical Centre (KCMC). 733 (9.5%) of which constituted of low (< 7) Apgar score neonates. The ‘extra-tree classifier’ was used to assess features’ importance. We used Area Under Curve (AUC), recall, precision, F-score, Matthews Correlation Coefficient (MCC), balanced accuracy (BA), bookmaker informedness (BM), and markedness (MK) to evaluate the performance of the selected six (6) machine learning classifiers. To address class imbalances, we examined three widely used resampling techniques: the Synthetic Minority Oversampling Technique (SMOTE) and Random Oversampling Examples (ROS) and Random undersampling techniques (RUS). We applied Decision Curve Analysis (DCA) to evaluate the net benefit of the selected classifiers. Results: Birth weight, maternal age, and gestational age were found to be important predictors for the low Apgar score following induced vaginal delivery. SMOTE, ROS and and RUS techniques were more effective at improving “recalls” among other metrics in all the models under investigation. A slight improvement was observed in the F1 score, BA, and BM. DCA revealed potential benefits of applying Boosting method for predicting low Apgar scores among the tested models. Conclusion: There is an opportunity for more algorithms to be tested to come up with theoretical guidance on more effective rebalancing techniques suitable for this particular imbalanced ratio. Future research should prioritize a debate on which performance indicators to look up to when dealing with imbalanced or skewed data.
KW - Imbalanced data
KW - Low five-minute Apgar score
KW - Machine learning
KW - North-Tanzania
KW - Successful labor induction
UR - http://www.scopus.com/inward/record.url?scp=85127451600&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127451600&partnerID=8YFLogxK
U2 - 10.1186/s12884-022-04534-0
DO - 10.1186/s12884-022-04534-0
M3 - Article
C2 - 35365129
AN - SCOPUS:85127451600
SN - 1471-2393
VL - 22
JO - BMC Pregnancy and Childbirth
JF - BMC Pregnancy and Childbirth
IS - 1
M1 - 275
ER -