Abstract
Background: Automated genotype calling in tetraploid species was until recently not possible, which hampered
genetic analysis. Modern genotyping assays often produce two signals, one for each allele of a bi-allelic marker.
While ample software is available to obtain genotypes (homozygous for either allele, or heterozygous) for diploid
species from these signals, such software is not available for tetraploid species which may be scored as five
alternative genotypes (aaaa, baaa, bbaa, bbba and bbbb; nulliplex to quadruplex).
Results: We present a novel algorithm, implemented in the R package fitTetra, to assign genotypes for bi-allelic
markers to tetraploid samples from genotyping assays that produce intensity signals for both alleles. The algorithm
is based on the fitting of several mixture models with five components, one for each of the five possible
genotypes. The models have different numbers of parameters specifying the relation between the five component
means, and some of them impose a constraint on the mixing proportions to conform to Hardy-Weinberg
equilibrium (HWE) ratios. The software rejects markers that do not allow a reliable genotyping for the majority of
the samples, and it assigns a missing score to samples that cannot be scored into one of the five possible
genotypes with sufficient confidence.
Conclusions: We have validated the software with data of a collection of 224 potato varieties assayed with an
Illumina GoldenGate™ 384 SNP array and shown that all SNPs with informative ratio distributions are fitted. Almost
all fitted models appear to be correct based on visual inspection and comparison with diploid samples. When the
collection of potato varieties is analyzed as if it were a population, almost all markers seem to be in Hardy-
Weinberg equilibrium. The R package fitTetra is freely available under the GNU Public License from http://www.
plantbreeding.wur.nl/UK/software_fitTetra.html and as Additional files with this article.
Original language | English |
---|---|
Article number | 172 |
Number of pages | 11 |
Journal | BMC Bioinformatics |
Volume | 12 |
DOIs | |
Publication status | Published - 2011 |
Keywords
- microsatellite dna
- sect. caninae
- genome
- potato
- configuration
- construction
- map