TY - JOUR
T1 - Measurements of soil protist richness and community composition are influenced by primer pair, annealing temperature, and bioinformatics choices
AU - Mau, Rebecca
AU - Hayer, Michaela
AU - Purcell, Alicia
AU - Geisen, S.A.
AU - Hungate, Bruce A.
AU - Schwartz, Egbert
PY - 2024
Y1 - 2024
N2 - Protists are a diverse and understudied group of microbial eukaryotic organisms especially in terrestrial environments. Advances in molecular methods are increasing our understanding of the distribution and functions of these creatures; however, there is a vast array of choices researchers make including barcoding genes, primer pairs, PCR settings, and bioinformatic options that can impact the outcome of protist community surveys. Here, we tested four commonly used primer pairs targeting the V4 and V9 regions of the 18S rRNA gene using different PCR annealing temperatures and processed the sequences with different bioinformatic parameters in 10 diverse soils to evaluate how primer pair, amplification parameters, and bioinformatic choices influence the composition and richness of protist and non-protist taxa using Illumina sequencing. Our results showed that annealing temperature influenced sequencing depth and protist taxon richness for most primer pairs, and that merging forward and reverse sequencing reads for the V4 primer pairs dramatically reduced the number of sequences and taxon richness of protists. The data sets of primers that targeted the same 18S rRNA gene region (e.g., V4 or V9) had similar protist community compositions; however, data sets from primers targeting the V4 18S rRNA gene region detected a greater number of protist taxa compared to those prepared with primers targeting the V9 18S rRNA region. There was limited overlap of protist taxa between data sets targeting the two different gene regions (80/549 taxa). Together, we show that laboratory and bioinformatic choices can substantially affect the results and conclusions about protist diversity and community composition using metabarcoding.
AB - Protists are a diverse and understudied group of microbial eukaryotic organisms especially in terrestrial environments. Advances in molecular methods are increasing our understanding of the distribution and functions of these creatures; however, there is a vast array of choices researchers make including barcoding genes, primer pairs, PCR settings, and bioinformatic options that can impact the outcome of protist community surveys. Here, we tested four commonly used primer pairs targeting the V4 and V9 regions of the 18S rRNA gene using different PCR annealing temperatures and processed the sequences with different bioinformatic parameters in 10 diverse soils to evaluate how primer pair, amplification parameters, and bioinformatic choices influence the composition and richness of protist and non-protist taxa using Illumina sequencing. Our results showed that annealing temperature influenced sequencing depth and protist taxon richness for most primer pairs, and that merging forward and reverse sequencing reads for the V4 primer pairs dramatically reduced the number of sequences and taxon richness of protists. The data sets of primers that targeted the same 18S rRNA gene region (e.g., V4 or V9) had similar protist community compositions; however, data sets from primers targeting the V4 18S rRNA gene region detected a greater number of protist taxa compared to those prepared with primers targeting the V9 18S rRNA region. There was limited overlap of protist taxa between data sets targeting the two different gene regions (80/549 taxa). Together, we show that laboratory and bioinformatic choices can substantially affect the results and conclusions about protist diversity and community composition using metabarcoding.
U2 - 10.1128/aem.00800-24
DO - 10.1128/aem.00800-24
M3 - Article
SN - 0099-2240
VL - 90
JO - Applied and Environmental Microbiology
JF - Applied and Environmental Microbiology
IS - 7
ER -