Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes

Jiaqiang Dong, Min Tu, Yaping Feng, Anna Zdepski, Fei Ge, Dibyendu Kumar, Janet P. Slovin, Joachim Messing

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next-generation (nextGen) paired-end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next-generation sequencing.

Original languageEnglish (US)
Pages (from-to)673-682
Number of pages10
JournalPlant Journal
Volume97
Issue number4
DOIs
StatePublished - Feb 1 2019

Fingerprint

Genetic Association Studies
mutagenesis
Genome
Zea mays
mutation
Mutation
genome
seeds
Phenotype
phenotype
mutants
corn
genes
Fragaria
geneticists
seed development
DNA Sequence Analysis
Arabidopsis
Haplotypes
inbred lines

All Science Journal Classification (ASJC) codes

  • Genetics
  • Plant Science
  • Cell Biology

Keywords

  • bulked segregant analysis
  • introgressions
  • maize DEK mutations
  • natural or EMS-induced mutations
  • pipelines

Cite this

Dong, Jiaqiang ; Tu, Min ; Feng, Yaping ; Zdepski, Anna ; Ge, Fei ; Kumar, Dibyendu ; Slovin, Janet P. ; Messing, Joachim. / Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes. In: Plant Journal. 2019 ; Vol. 97, No. 4. pp. 673-682.
@article{622d06b78d6d40a1a037666bf965f68c,
title = "Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes",
abstract = "Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next-generation (nextGen) paired-end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next-generation sequencing.",
keywords = "bulked segregant analysis, introgressions, maize DEK mutations, natural or EMS-induced mutations, pipelines",
author = "Jiaqiang Dong and Min Tu and Yaping Feng and Anna Zdepski and Fei Ge and Dibyendu Kumar and Slovin, {Janet P.} and Joachim Messing",
year = "2019",
month = "2",
day = "1",
doi = "10.1111/tpj.14153",
language = "English (US)",
volume = "97",
pages = "673--682",
journal = "Plant Journal",
issn = "0960-7412",
publisher = "Wiley-Blackwell",
number = "4",

}

Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes. / Dong, Jiaqiang; Tu, Min; Feng, Yaping; Zdepski, Anna; Ge, Fei; Kumar, Dibyendu; Slovin, Janet P.; Messing, Joachim.

In: Plant Journal, Vol. 97, No. 4, 01.02.2019, p. 673-682.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes

AU - Dong, Jiaqiang

AU - Tu, Min

AU - Feng, Yaping

AU - Zdepski, Anna

AU - Ge, Fei

AU - Kumar, Dibyendu

AU - Slovin, Janet P.

AU - Messing, Joachim

PY - 2019/2/1

Y1 - 2019/2/1

N2 - Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next-generation (nextGen) paired-end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next-generation sequencing.

AB - Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next-generation (nextGen) paired-end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next-generation sequencing.

KW - bulked segregant analysis

KW - introgressions

KW - maize DEK mutations

KW - natural or EMS-induced mutations

KW - pipelines

UR - http://www.scopus.com/inward/record.url?scp=85058363259&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058363259&partnerID=8YFLogxK

U2 - 10.1111/tpj.14153

DO - 10.1111/tpj.14153

M3 - Article

C2 - 30417446

AN - SCOPUS:85058363259

VL - 97

SP - 673

EP - 682

JO - Plant Journal

JF - Plant Journal

SN - 0960-7412

IS - 4

ER -