TY - JOUR
T1 - Non-replication of association studies
T2 - "Pseudo-failures" to replicate?
AU - Gorroochurn, Prakash
AU - Hodge, Susan E.
AU - Heiman, Gary A.
AU - Durner, Martina
AU - Greenberg, David A.
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2007/6
Y1 - 2007/6
N2 - Recently, serious doubts have been cast on the usefulness of association studies as a means to genetically dissect complex diseases because most initial findings fail to replicate in subsequent studies. The reasons usually invoked are population stratification, genetic heterogeneity, and inflated Type I errors. In this article, we argue that, even when these problems are addressed, the scientific community usually has unreasonably high expectations on replication success, based on initial low P values, a phenomenon known as the replication fallacy. We present a modified formula that gives the replication power of a second association study based on the P value of an initial study. When both studies have similar sample sizes, this formula shows that: (1) a P value only slightly lower than the nominal α results in only approximately 50% replication power; (2) very low P values are required to achieve a replication power of at least 80% (e.g., at α = 0.05, a P value of <0.005 is required). Because many initially significant findings result in low replication power, replication failure should not be surprising or be interpreted as necessarily refuting the initial findings. We refer to replication failures for which the replication power is low as "pseudo-failures."
AB - Recently, serious doubts have been cast on the usefulness of association studies as a means to genetically dissect complex diseases because most initial findings fail to replicate in subsequent studies. The reasons usually invoked are population stratification, genetic heterogeneity, and inflated Type I errors. In this article, we argue that, even when these problems are addressed, the scientific community usually has unreasonably high expectations on replication success, based on initial low P values, a phenomenon known as the replication fallacy. We present a modified formula that gives the replication power of a second association study based on the P value of an initial study. When both studies have similar sample sizes, this formula shows that: (1) a P value only slightly lower than the nominal α results in only approximately 50% replication power; (2) very low P values are required to achieve a replication power of at least 80% (e.g., at α = 0.05, a P value of <0.005 is required). Because many initially significant findings result in low replication power, replication failure should not be surprising or be interpreted as necessarily refuting the initial findings. We refer to replication failures for which the replication power is low as "pseudo-failures."
KW - Complex diseases
KW - Effect size
KW - Replication fallacy
KW - Replication probability
UR - http://www.scopus.com/inward/record.url?scp=34250789692&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34250789692&partnerID=8YFLogxK
U2 - 10.1097/GIM.0b013e3180676d79
DO - 10.1097/GIM.0b013e3180676d79
M3 - Review article
C2 - 17575498
AN - SCOPUS:34250789692
VL - 9
SP - 325
EP - 331
JO - Genetics in Medicine
JF - Genetics in Medicine
SN - 1098-3600
IS - 6
ER -