DOI: 10.1111/pbi.12183 ISSN:
Characterization of polyploid wheat genomic diversity using a high‐density 90 000 single nucleotide polymorphism array
Shichen Wang, Debbie Wong, Kerrie Forrest, Alexandra Allen, Shiaoman Chao, Bevan E. Huang, Marco Maccaferri, Silvio Salvi, Sara G. Milner, Luigi Cattivelli, Anna M. Mastrangelo, Alex Whan, Stuart Stephen, Gary Barker, Ralf Wieseke, Joerg Plieske, Morten Lillemo, Diane Mather, Rudi Appels, Rudy Dolferus, Gina Brown‐Guedira, Abraham Korol, Alina R. Akhunova, Catherine Feuillet, Jerome Salse, Michele Morgante, Curtis Pozniak, Ming‐Cheng Luo, Jan Dvorak, Matthew Morell, Jorge Dubcovsky, Martin Ganal, Roberto Tuberosa, Cindy Lawley, Ivan Mikoulitch, Colin Cavanagh, Keith J. Edwards, Matthew Hayden, Eduard Akhunov, Summary
High‐density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene‐associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome‐wide distributed SNPs that are represented in populations of diverse geographical origin. We used density‐based spatial clustering algorithms to enable high‐throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model‐free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low‐intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat.