TY - JOUR
T1 - Quantifying the Impacts of Pre- and Post-Conception TSH Levels on Birth Outcomes
T2 - An Examination of Different Machine Learning Models
AU - Sun, Yuantong
AU - Zheng, Weiwei
AU - Zhang, Ling
AU - Zhao, Huijuan
AU - Li, Xun
AU - Zhang, Chao
AU - Ma, Wuren
AU - Tian, Dajun
AU - Yu, Kun Hsing
AU - Xiao, Shuo
AU - Jin, Liping
AU - Hua, Jing
N1 - Funding Information:
This work was supported by the National Natural Science Foundation of China (Nos. 81773379, 82122058, and 81202165), the Shanghai Municipal Commission of Health and the Family Foundation for Fifth Round of the Three-Year Public Health Action Plan of Shanghai (GWV-1.1 and GWV-10.1-XK08 and 202040186), and for Young Talents (2017YQ023). K-HY was supported by the Harvard Data Science Fellowship. Natural Science Foundation of Shanghai (21 ZR 1438400).
Publisher Copyright:
Copyright © 2021 Sun, Zheng, Zhang, Zhao, Li, Zhang, Ma, Tian, Yu, Xiao, Jin and Hua.
PY - 2021/10/29
Y1 - 2021/10/29
N2 - Background: While previous studies identified risk factors for diverse pregnancy outcomes, traditional statistical methods had limited ability to quantify their impacts on birth outcomes precisely. We aimed to use a novel approach that applied different machine learning models to not only predict birth outcomes but systematically quantify the impacts of pre- and post-conception serum thyroid-stimulating hormone (TSH) levels and other predictive characteristics on birth outcomes. Methods: We used data from women who gave birth in Shanghai First Maternal and Infant Hospital from 2014 to 2015. We included 14,110 women with the measurement of preconception TSH in the first analysis and 3,428 out of 14,110 women with both pre- and post-conception TSH measurement in the second analysis. Synthetic Minority Over-sampling Technique (SMOTE) was applied to adjust the imbalance of outcomes. We randomly split (7:3) the data into a training set and a test set in both analyses. We compared Area Under Curve (AUC) for dichotomous outcomes and macro F1 score for categorical outcomes among four machine learning models, including logistic model, random forest model, XGBoost model, and multilayer neural network models to assess model performance. The model with the highest AUC or macro F1 score was used to quantify the importance of predictive features for adverse birth outcomes with the loss function algorithm. Results: The XGBoost model provided prominent advantages in terms of improved performance and prediction of polytomous variables. Predictive models with abnormal preconception TSH or not-well-controlled TSH, a novel indicator with pre- and post-conception TSH levels combined, provided the similar robust prediction for birth outcomes. The highest AUC of 98.7% happened in XGBoost model for predicting low Apgar score with not-well-controlled TSH adjusted. By loss function algorithm, we found that not-well-controlled TSH ranked 4th, 6th, and 7th among 14 features, respectively, in predicting birthweight, induction, and preterm birth, and 3rd among 19 features in predicting low Apgar score. Conclusions: Our four machine learning models offered valid predictions of birth outcomes in women during pre- and post-conception. The predictive features panel suggested the combined TSH indicator (not-well-controlled TSH) could be a potentially competitive biomarker to predict adverse birth outcomes.
AB - Background: While previous studies identified risk factors for diverse pregnancy outcomes, traditional statistical methods had limited ability to quantify their impacts on birth outcomes precisely. We aimed to use a novel approach that applied different machine learning models to not only predict birth outcomes but systematically quantify the impacts of pre- and post-conception serum thyroid-stimulating hormone (TSH) levels and other predictive characteristics on birth outcomes. Methods: We used data from women who gave birth in Shanghai First Maternal and Infant Hospital from 2014 to 2015. We included 14,110 women with the measurement of preconception TSH in the first analysis and 3,428 out of 14,110 women with both pre- and post-conception TSH measurement in the second analysis. Synthetic Minority Over-sampling Technique (SMOTE) was applied to adjust the imbalance of outcomes. We randomly split (7:3) the data into a training set and a test set in both analyses. We compared Area Under Curve (AUC) for dichotomous outcomes and macro F1 score for categorical outcomes among four machine learning models, including logistic model, random forest model, XGBoost model, and multilayer neural network models to assess model performance. The model with the highest AUC or macro F1 score was used to quantify the importance of predictive features for adverse birth outcomes with the loss function algorithm. Results: The XGBoost model provided prominent advantages in terms of improved performance and prediction of polytomous variables. Predictive models with abnormal preconception TSH or not-well-controlled TSH, a novel indicator with pre- and post-conception TSH levels combined, provided the similar robust prediction for birth outcomes. The highest AUC of 98.7% happened in XGBoost model for predicting low Apgar score with not-well-controlled TSH adjusted. By loss function algorithm, we found that not-well-controlled TSH ranked 4th, 6th, and 7th among 14 features, respectively, in predicting birthweight, induction, and preterm birth, and 3rd among 19 features in predicting low Apgar score. Conclusions: Our four machine learning models offered valid predictions of birth outcomes in women during pre- and post-conception. The predictive features panel suggested the combined TSH indicator (not-well-controlled TSH) could be a potentially competitive biomarker to predict adverse birth outcomes.
KW - birth outcomes
KW - machine learning
KW - post-conception
KW - preconception
KW - thyroid-stimulating hormone (TSH)
UR - http://www.scopus.com/inward/record.url?scp=85119045214&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119045214&partnerID=8YFLogxK
U2 - 10.3389/fendo.2021.755364
DO - 10.3389/fendo.2021.755364
M3 - Article
AN - SCOPUS:85119045214
SN - 1664-2392
VL - 12
JO - Frontiers in Endocrinology
JF - Frontiers in Endocrinology
M1 - 755364
ER -