TY - JOUR
T1 - The contribution of mitochondrial metagenomics to large-scale data mining and phylogenetic analysis of Coleoptera
AU - Linard, Benjamin
AU - Crampton-Platt, Alex
AU - Moriniere, Jerome
AU - Timmermans, Martijn J.T.N.
AU - Andújar, Carmelo
AU - Arribas, Paula
AU - Miller, Kirsten E.
AU - Lipecki, Julia
AU - Favreau, Emeline
AU - Hunter, Amie
AU - Gómez-Rodríguez, Carola
AU - Barton, Christopher
AU - Nie, Ruie
AU - Gillett, Conrad P.D.T.
AU - Breeschoten, Thijmen
AU - Bocak, Ladislav
AU - Vogler, Alfried P.
PY - 2018/11/1
Y1 - 2018/11/1
N2 - A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16,000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
AB - A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16,000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
KW - Biodiversity discovery
KW - Coleoptera
KW - Mass-trapped samples
KW - Metagenome skimming
KW - Mitochondrial metagenomics
U2 - 10.1016/j.ympev.2018.07.008
DO - 10.1016/j.ympev.2018.07.008
M3 - Article
AN - SCOPUS:85050543241
SN - 1055-7903
VL - 128
SP - 1
EP - 11
JO - Molecular Phylogenetics and Evolution
JF - Molecular Phylogenetics and Evolution
ER -