TY - JOUR
T1 - Critical evaluation of human oral bioavailability for pharmaceutical drugs by using various cheminformatics approaches
AU - Kim, Marlene T.
AU - Sedykh, Alexander
AU - Chakravarti, Suman K.
AU - Saiakhov, Roustem D.
AU - Zhu, Hao
N1 - Funding Information:
Research reported in this publication was supported, in part, by the National Institute of Environmental Health Sciences of the National Institutes of Health under Award Number R15ES023148 and the Colgate-Palmolive Grant for Alternative Research. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
PY - 2014/4
Y1 - 2014/4
N2 - Purpose: Oral bioavailability (%F) is a key factor that determines the fate of a new drug in clinical trials. Traditionally, %F is measured using costly and time-consuming experimental tests. Developing computational models to evaluate the %F of new drugs before they are synthesized would be beneficial in the drug discovery process. Methods: We employed Combinatorial Quantitative Structure-Activity Relationship approach to develop several computational %F models. We compiled a %F dataset of 995 drugs from public sources. After generating chemical descriptors for each compound, we used random forest, support vector machine, k nearest neighbor, and CASE Ultra to develop the relevant QSAR models. The resulting models were validated using five-fold cross-validation. Results: The external predictivity of %F values was poor (R2 = 0.28, n = 995, MAE = 24), but was improved (R2 = 0.40, n = 362, MAE = 21) by filtering unreliable predictions that had a high probability of interacting with MDR1 and MRP2 transporters. Furthermore, classifying the compounds according to the %F values (%F < 50% as "low", %F ≥ 50% as 'high") and developing category QSAR models resulted in an external accuracy of 76%. Conclusions: In this study, we developed predictive %F QSAR models that could be used to evaluate new drug compounds, and integrating drug-transporter interactions data greatly benefits the resulting models.
AB - Purpose: Oral bioavailability (%F) is a key factor that determines the fate of a new drug in clinical trials. Traditionally, %F is measured using costly and time-consuming experimental tests. Developing computational models to evaluate the %F of new drugs before they are synthesized would be beneficial in the drug discovery process. Methods: We employed Combinatorial Quantitative Structure-Activity Relationship approach to develop several computational %F models. We compiled a %F dataset of 995 drugs from public sources. After generating chemical descriptors for each compound, we used random forest, support vector machine, k nearest neighbor, and CASE Ultra to develop the relevant QSAR models. The resulting models were validated using five-fold cross-validation. Results: The external predictivity of %F values was poor (R2 = 0.28, n = 995, MAE = 24), but was improved (R2 = 0.40, n = 362, MAE = 21) by filtering unreliable predictions that had a high probability of interacting with MDR1 and MRP2 transporters. Furthermore, classifying the compounds according to the %F values (%F < 50% as "low", %F ≥ 50% as 'high") and developing category QSAR models resulted in an external accuracy of 76%. Conclusions: In this study, we developed predictive %F QSAR models that could be used to evaluate new drug compounds, and integrating drug-transporter interactions data greatly benefits the resulting models.
KW - QSAR
KW - drugs
KW - intestinal membrane transporter
KW - oral bioavailability
UR - http://www.scopus.com/inward/record.url?scp=84897106909&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84897106909&partnerID=8YFLogxK
U2 - 10.1007/s11095-013-1222-1
DO - 10.1007/s11095-013-1222-1
M3 - Article
C2 - 24306326
AN - SCOPUS:84897106909
SN - 0724-8741
VL - 31
SP - 1002
EP - 1014
JO - Pharmaceutical Research
JF - Pharmaceutical Research
IS - 4
ER -