This work has two purposes: (i) empirically selecting levels of significance that maximize the fraction of markers close to a gene (hit rate) when performing linkage analyses of simulated data and (ii) evaluating the utility of a previously reported scan statistic on the same data. Genotype data were simulated from a trait model of seven susceptibility genes. For purpose (i), five statistics were evaluated on all marker loci in fifty replicates; two-point lod and heterogeneity lod scores maximized over dominance (mlod, mhlod), a multi-allelic TDT test, an affected sib-pair test (ASP), and a model-free test on all sib-pairs (ALL_SIBS). Within each replicate the fraction of markers (hit rate) significant at specified levels of significance and also (a) within fifty markers of, or (b) on the same chromosome as a major gene was calculated. For purpose (ii), scan statistics of length 15 were calculated for each chromosome and their empirical significance levels estimated on the basis of 500 replicates generated under no linkage. The scan statistic was applied to the mhlod scores from one replicate (Replicate 5). Empirical p-values for the scan statistic were determined by computing mhlod scores on 500 replicates of simulated null data. For purpose (i), significance levels between 0.001 and 0.01 had the greatest hit rate for all five methods and both criteria. For criterion (a) at the 0.001 level of significance, both mlod and mhlod displayed the highest hit rates, approximately 0.4 for each. For criterion (b), all methods but ALL_SIBS and ASP had hit rates ranging between 0.4 and 0.5. For purpose (ii), the scan statistic proved equally or more powerful than the single-locus statistic for two of the seven susceptibility genes while the remaining five genes were not detected.
All Science Journal Classification (ASJC) codes
- Linkage analysis
- Scan statistics
- Selection methods