Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tan G, Muffato M, Ledergerber C, Herrero J, Goldman N, Gil M, Dessimoz C. Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference. Syst Biol 2015;64:778-91. [PMID: 26031838 PMCID: PMC4538881 DOI: 10.1093/sysbio/syv033] [Citation(s) in RCA: 143] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 05/26/2015] [Indexed: 01/09/2023] Open

For:	Tan G, Muffato M, Ledergerber C, Herrero J, Goldman N, Gil M, Dessimoz C. Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference. Syst Biol 2015;64:778-91. [PMID: 26031838 PMCID: PMC4538881 DOI: 10.1093/sysbio/syv033] [Citation(s) in RCA: 143] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 05/26/2015] [Indexed: 01/09/2023] Open

Number

Cited by Other Article(s)

101

A Robust Phylogenomic Time Tree for Biotechnologically and Medically Important Fungi in the Genera Aspergillus and Penicillium. mBio 2019;10:mBio.00925-19. [PMID: 31289177 PMCID: PMC6747717 DOI: 10.1128/mbio.00925-19] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Abstract

Understanding the evolution of traits across technologically and medically significant fungi requires a robust phylogeny. Even though species in the Aspergillus and Penicillium genera (family Aspergillaceae, class Eurotiomycetes) are some of the most significant technologically and medically relevant fungi, we still lack a genome-scale phylogeny of the lineage or knowledge of the parts of the phylogeny that exhibit conflict among analyses. Here, we used a phylogenomic approach to infer evolutionary relationships among 81 genomes that span the diversity of Aspergillus and Penicillium species, to identify conflicts in the phylogeny, and to determine the likely underlying factors of the observed conflicts. Using a data matrix comprised of 1,668 genes, we found that while most branches of the phylogeny of the Aspergillaceae are robustly supported and recovered irrespective of method of analysis, a few exhibit various degrees of conflict among our analyses. Further examination of the observed conflict revealed that it largely stems from incomplete lineage sorting and hybridization or introgression. Our analyses provide a robust and comprehensive evolutionary genomic roadmap for this important lineage, which will facilitate the examination of the diverse technologically and medically relevant traits of these fungi in an evolutionary context.

The filamentous fungal family Aspergillaceae contains >1,000 known species, mostly in the genera Aspergillus and Penicillium. Several species are used in the food, biotechnology, and drug industries (e.g., Aspergillus oryzae and Penicillium camemberti), while others are dangerous human and plant pathogens (e.g., Aspergillus fumigatus and Penicillium digitatum). To infer a robust phylogeny and pinpoint poorly resolved branches and their likely underlying contributors, we used 81 genomes spanning the diversity of Aspergillus and Penicillium to construct a 1,668-gene data matrix. Phylogenies of the nucleotide and amino acid versions of this full data matrix as well as of several additional data matrices were generated using three different maximum likelihood schemes (i.e., gene-partitioned, unpartitioned, and coalescence) and using both site-homogenous and site-heterogeneous models (total of 64 species-level phylogenies). Examination of the topological agreement among these phylogenies and measures of internode certainty identified 11/78 (14.1%) bipartitions that were incongruent and pinpointed the likely underlying contributing factors, which included incomplete lineage sorting, hidden paralogy, hybridization or introgression, and reconstruction artifacts associated with poor taxon sampling. Relaxed molecular clock analyses suggest that Aspergillaceae likely originated in the lower Cretaceous and that the Aspergillus and Penicillium genera originated in the upper Cretaceous. Our results shed light on the ongoing debate on Aspergillus systematics and taxonomy and provide a robust evolutionary and temporal framework for comparative genomic analyses in Aspergillaceae. More broadly, our approach provides a general template for phylogenomic identification of resolved and contentious branches in densely genome-sequenced lineages across the tree of life.

Collapse

102

Vasilikopoulos A, Balke M, Beutel RG, Donath A, Podsiadlowski L, Pflug JM, Waterhouse RM, Meusemann K, Peters RS, Escalona HE, Mayer C, Liu S, Hendrich L, Alarie Y, Bilton DT, Jia F, Zhou X, Maddison DR, Niehuis O, Misof B. Phylogenomics of the superfamily Dytiscoidea (Coleoptera: Adephaga) with an evaluation of phylogenetic conflict and systematic error. Mol Phylogenet Evol 2019;135:270-285. [DOI: 10.1016/j.ympev.2019.02.022] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 02/22/2019] [Accepted: 02/25/2019] [Indexed: 02/07/2023]

103

Barrett K, Lange L. Peptide-based functional annotation of carbohydrate-active enzymes by conserved unique peptide patterns (CUPP). BIOTECHNOLOGY FOR BIOFUELS 2019;12:102. [PMID: 31168320 PMCID: PMC6489277 DOI: 10.1186/s13068-019-1436-5] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 04/13/2019] [Indexed: 05/24/2023]

Abstract

BACKGROUND

Insight into the function of carbohydrate-active enzymes is required to understand their biological role and industrial potential. There is a need for better use of the ample genomic data in order to enable selection of the most interesting proteins for further studies. The basis for elaborating a new approach to sequence analysis is the hypothesis that when using conserved peptide patterns to determine the similarities between proteins, the exact spacing between conserved adjacent amino acids in the proteins plays a prominent functional role. Thus, the objective of developing the method of conserved unique peptide patterns (CUPP) is to construct a peptide-based grouping and validate the method to provide evidence that CUPP captures function-related features of the individual carbohydrate-active enzymes (as defined by CAZy families). This approach facilitates grouping of enzymes at a level lower than protein families and/or subfamilies. A standardized, efficient, and robust approach to functional annotation of carbohydrate-active enzymes would support improved molecular insight into enzyme-substrate interaction.

RESULTS

A new nonalignment-based clustering and functional annotation tool was developed that uses conserved unique peptides patterns to perform automated clustering of proteins and formation of protein groups. A peptide-based model was constructed for each of these protein CUPP groups to be used to automatically annotate protein family, subfamily, and EC function of carbohydrate-active enzymes. CUPP prediction can annotate proteins (from any CAZy family) with high F-score to existing family (0.966), subfamily (0.961), and EC-function (0.843). The speed of the CUPP program was estimated and exemplified by prediction of the 504,017 nonredundant proteins of CAZy in less than four CPU hours.

CONCLUSION

It was possible to construct an automated system for clustering proteins within families and use the resulting CUPP groups to directly build peptide-based models for genome annotation. The CUPP runtime, F-score, sensitivity, and precisions of family and subfamily annotations match or represent an improvement compared to state-of-the-art tools. The speed of the CUPP annotation is similar to the rapid DIAMOND annotation tool. CUPP facilitates automated annotation of full genome assemblies to any CAZy family.

Collapse

104

Laumer CE. Inferring Ancient Relationships with Genomic Data: A Commentary on Current Practices. Integr Comp Biol 2019;58:623-639. [PMID: 29982611 DOI: 10.1093/icb/icy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

105

Kelnarova I, Jendek E, Grebennikov VV, Bocak L. First molecular phylogeny of Agrilus (Coleoptera: Buprestidae), the largest genus on Earth, with DNA barcode database for forestry pest diagnostics. BULLETIN OF ENTOMOLOGICAL RESEARCH 2019;109:200-211. [PMID: 29784069 DOI: 10.1017/s0007485318000330] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

106

Six Impossible Things before Breakfast: Assumptions, Models, and Belief in Molecular Dating. Trends Ecol Evol 2019;34:474-486. [PMID: 30904189 DOI: 10.1016/j.tree.2019.01.017] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 01/29/2019] [Accepted: 01/31/2019] [Indexed: 01/16/2023]

107

Küppers GC, da Silva Paiva T, do Nascimento Borges B, Alfaro ER, Claps MC. A new oligotrich (Ciliophora, Oligotrichia) from Argentina, with redefinition of Novistrombidium Song and Bradbury. Eur J Protistol 2019;69:20-36. [PMID: 30870724 DOI: 10.1016/j.ejop.2019.02.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Revised: 02/05/2019] [Accepted: 02/11/2019] [Indexed: 11/27/2022]

108

Chang JM, Floden EW, Herrero J, Gascuel O, Di Tommaso P, Notredame C. Incorporating alignment uncertainty into Felsenstein's phylogenetic bootstrap to improve its reliability. Bioinformatics 2019;37:1506-1514. [PMID: 30726875 PMCID: PMC8275982 DOI: 10.1093/bioinformatics/btz082] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 12/12/2018] [Accepted: 02/05/2019] [Indexed: 12/30/2022] Open

109

Di Franco A, Poujol R, Baurain D, Philippe H. Evaluating the usefulness of alignment filtering methods to reduce the impact of errors on evolutionary inferences. BMC Evol Biol 2019;19:21. [PMID: 30634908 PMCID: PMC6330419 DOI: 10.1186/s12862-019-1350-2] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Accepted: 01/02/2019] [Indexed: 11/10/2022] Open

Abstract

Background

Multiple Sequence Alignments (MSAs) are the starting point of molecular evolutionary analyses. Errors in MSAs generate a non-historical signal that can lead to incorrect inferences. Therefore, numerous efforts have been made to reduce the impact of alignment errors, by improving alignment algorithms and by developing methods to filter out poorly aligned regions. However, MSAs do not only contain alignment errors, but also primary sequence errors. Such errors may originate from sequencing errors, from assembly errors, or from erroneous structural annotations (such as incorrect intron/exon boundaries). Even though their existence is acknowledged, the impact of primary sequence errors on evolutionary inference is poorly characterized.

Results

In a first step to fill this gap, we have developed a program called HmmCleaner, which detects and eliminates these errors from MSAs. It uses profile hidden Markov models (pHMM) to identify sequence segments that poorly fit their MSA and selectively removes them. We assessed its performances using > 700 amino-acid MSAs from prokaryotes and eukaryotes, in which we introduced several types of simulated primary sequence errors. The sensitivity of HmmCleaner towards simulated primary sequence errors was > 95%. In a second step, we compared the impact of segment filtering software (HmmCleaner and PREQUAL) relative to commonly used block-filtering software (BMGE and TrimAI) on evolutionary analyses. Using real data from vertebrates, we observed that segment-filtering methods improve the quality of evolutionary inference more than the currently used block-filtering methods. The formers were especially effective at improving branch length inferences, and at reducing false positive rate during detection of positive selection.

Conclusions

Segment filtering methods such as HmmCleaner accurately detect simulated primary sequence errors. Our results suggest that these errors are more detrimental than alignment errors. However, they also show that stochastic (sampling) error is predominant in single-gene evolutionary inferences. Therefore, we argue that MSA filtering should focus on segment instead of block removal and that more studies are required to find the optimal balance between accuracy improvement and stochastic error increase brought by data removal.

Electronic supplementary material

The online version of this article (10.1186/s12862-019-1350-2) contains supplementary material, which is available to authorized users.

Collapse

110

Borowiec ML. Convergent Evolution of the Army Ant Syndrome and Congruence in Big-Data Phylogenetics. Syst Biol 2019;68:642-656. [DOI: 10.1093/sysbio/syy088] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Revised: 11/09/2018] [Accepted: 12/15/2018] [Indexed: 11/12/2022] Open

111

Ashkenazy H, Sela I, Levy Karin E, Landan G, Pupko T. Multiple Sequence Alignment Averaging Improves Phylogeny Reconstruction. Syst Biol 2019;68:117-130. [PMID: 29771363 PMCID: PMC6657586 DOI: 10.1093/sysbio/syy036] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Revised: 05/07/2018] [Accepted: 05/09/2018] [Indexed: 01/11/2023] Open

112

Rojas-Cruz A, Reyes-Bermúdez A. Phylogenetic analysis of Alphapapillomavirus based on L1, E6 and E7 regions suggests that carcinogenicity and tissue tropism have appeared multiple times during viral evolution. INFECTION GENETICS AND EVOLUTION 2018;67:210-221. [PMID: 30458293 DOI: 10.1016/j.meegid.2018.11.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 11/07/2018] [Accepted: 11/08/2018] [Indexed: 11/18/2022]

113

Penzar D, Krivozubov M, Spirin S. PQ, a new program for phylogeny reconstruction. BMC Bioinformatics 2018;19:374. [PMID: 30314446 PMCID: PMC6186109 DOI: 10.1186/s12859-018-2399-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 09/25/2018] [Indexed: 12/04/2022] Open

114

Masrati G, Dwivedi M, Rimon A, Gluck-Margolin Y, Kessel A, Ashkenazy H, Mayrose I, Padan E, Ben-Tal N. Broad phylogenetic analysis of cation/proton antiporters reveals transport determinants. Nat Commun 2018;9:4205. [PMID: 30310075 PMCID: PMC6181914 DOI: 10.1038/s41467-018-06770-5] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Accepted: 09/24/2018] [Indexed: 11/08/2022] Open

115

Villaverde T, Pokorny L, Olsson S, Rincón-Barrado M, Johnson MG, Gardner EM, Wickett NJ, Molero J, Riina R, Sanmartín I. Bridging the micro- and macroevolutionary levels in phylogenomics: Hyb-Seq solves relationships from populations to species and above. THE NEW PHYTOLOGIST 2018;220:636-650. [PMID: 30016546 DOI: 10.1111/nph.15312] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 06/04/2018] [Indexed: 05/20/2023]

116

Kobayashi G, Goto R, Takano T, Kojima S. Molecular phylogeny of Maldanidae (Annelida): Multiple losses of tube-capping plates and evolutionary shifts in habitat depth. Mol Phylogenet Evol 2018;127:332-344. [DOI: 10.1016/j.ympev.2018.04.036] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2017] [Revised: 04/16/2018] [Accepted: 04/23/2018] [Indexed: 11/27/2022]

117

Krah FS, Bässler C, Heibl C, Soghigian J, Schaefer H, Hibbett DS. Evolutionary dynamics of host specialization in wood-decay fungi. BMC Evol Biol 2018;18:119. [PMID: 30075699 PMCID: PMC6091043 DOI: 10.1186/s12862-018-1229-7] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 07/03/2018] [Indexed: 11/23/2022] Open

Abstract

Background

The majority of wood decomposing fungi are mushroom-forming Agaricomycetes, which exhibit two main modes of plant cell wall decomposition: white rot, in which all plant cell wall components are degraded, including lignin, and brown rot, in which lignin is modified but not appreciably removed. Previous studies suggested that brown rot fungi tend to be specialists of gymnosperm hosts and that brown rot promotes gymnosperm specialization. However, these hypotheses were based on analyses of limited datasets of Agaricomycetes. Overcoming this limitation, we used a phylogeny with 1157 species integrating available sequences, assembled decay mode characters from the literature, and coded host specialization using the newly developed R package, rusda.

Results

We found that most brown rot fungi are generalists or gymnosperm specialists, whereas most white rot fungi are angiosperm specialists. A six-state model of the evolution of host specialization revealed high transition rates between generalism and specialization in both decay modes. However, while white rot lineages switched most frequently to angiosperm specialists, brown rot lineages switched most frequently to generalism. A time-calibrated phylogeny revealed that Agaricomycetes is older than the flowering plants but many of the large clades originated after the diversification of the angiosperms in the Cretaceous.

Conclusions

Our results challenge the current view that brown rot fungi are primarily gymnosperm specialists and reveal intensive white rot specialization to angiosperm hosts. We thus suggest that brown rot associated convergent loss of lignocellulose degrading enzymes was correlated with host generalism, rather than gymnosperm specialism. A likelihood model of host specialization evolution together with a time-calibrated phylogeny further suggests that the rise of the angiosperms opened a new mega-niche for wood-decay fungi, which was exploited particularly well by white rot lineages.

Electronic supplementary material

The online version of this article (10.1186/s12862-018-1229-7) contains supplementary material, which is available to authorized users.

Collapse

118

Griesmann M, Chang Y, Liu X, Song Y, Haberer G, Crook MB, Billault-Penneteau B, Lauressergues D, Keller J, Imanishi L, Roswanjaya YP, Kohlen W, Pujic P, Battenberg K, Alloisio N, Liang Y, Hilhorst H, Salgado MG, Hocher V, Gherbi H, Svistoonoff S, Doyle JJ, He S, Xu Y, Xu S, Qu J, Gao Q, Fang X, Fu Y, Normand P, Berry AM, Wall LG, Ané JM, Pawlowski K, Xu X, Yang H, Spannagl M, Mayer KFX, Wong GKS, Parniske M, Delaux PM, Cheng S. Phylogenomics reveals multiple losses of nitrogen-fixing root nodule symbiosis. Science 2018;361:science.aat1743. [DOI: 10.1126/science.aat1743] [Citation(s) in RCA: 198] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Accepted: 05/16/2018] [Indexed: 12/20/2022]

119

Mai U, Mirarab S. TreeShrink: fast and accurate detection of outlier long branches in collections of phylogenetic trees. BMC Genomics 2018;19:272. [PMID: 29745847 PMCID: PMC5998883 DOI: 10.1186/s12864-018-4620-2] [Citation(s) in RCA: 154] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

120

Horizontal gene transfer constrains the timing of methanogen evolution. Nat Ecol Evol 2018;2:897-903. [DOI: 10.1038/s41559-018-0513-7] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 02/20/2018] [Indexed: 11/08/2022]

121

Kim S, de Medeiros BAS, Byun BK, Lee S, Kang JH, Lee B, Farrell BD. West meets East: How do rainforest beetles become circum-Pacific? Evolutionary origin of Callipogon relictus and allied species (Cerambycidae: Prioninae) in the New and Old Worlds. Mol Phylogenet Evol 2018. [PMID: 29524651 DOI: 10.1016/j.ympev.2018.02.019] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Abstract

The longhorn beetle genus Callipogon Audinet-Serville represents a small group of large wood-boring beetles whose distribution pattern exhibits a unique trans-Pacific disjunction between the East Asian temperate rainforest and the tropical rainforest of the Neotropics. To understand the biogeographic history underlying this circum-Pacific disjunct distribution, we reconstructed a molecular phylogeny of the subfamily Prioninae with extensive sampling of Callipogon using multilocus sequence data of 99 prionine and four parandrine samples (ingroups), together with two distant outgroup species. Our sampling of Callipogon includes 18 of the 24 currently accepted species, with complete representation of all species in our focal subgenera. Our phylogenetic analyses confirmed the purported affinity between the Palearctic Callipogon relictus and its Neotropical congeners. Furthermore, based on molecular dating under the fossilized birth-death (FBD) model with comprehensive fossil records and probabilistic ancestral range reconstructions, we estimated the crown group Callipogon to have originated in the Paleocene circa 60 million years ago (Ma) across the Neotropics and Eastern Palearctics. The divergence between the Palearctic C. relictus and its Neotropical congeners is explained as the result of a vicariance event following the demise of boreotropical forest across Beringia at the Eocene-Oligocene boundary. As C. relictus represents the unique relictual species that evidentiates the lineage's expansive ancient distribution, we evaluated its conservation importance through species distribution modelling. Though we estimated a range expansion for C. relictus by 2050, we emphasize a careful implementation of conservation programs towards the protection of primary forest across its current habitats, as the species remains highly vulnerable to habitat disturbance.

Collapse

122

Vieira WAS, Lima WG, Nascimento ES, Michereff SJ, Câmara MPS, Doyle VP. The impact of phenotypic and molecular data on the inference of Colletotrichum diversity associated with Musa. Mycologia 2018;109:912-934. [PMID: 29494311 DOI: 10.1080/00275514.2017.1418577] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

123

Bogusz M, Whelan S. Phylogenetic Tree Estimation With and Without Alignment: New Distance Methods and Benchmarking. Syst Biol 2018;66:218-231. [PMID: 27633353 DOI: 10.1093/sysbio/syw074] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 08/23/2016] [Indexed: 12/20/2022] Open

Abstract

Phylogenetic tree inference is a critical component of many systematic and evolutionary studies. The majority of these studies are based on the two-step process of multiple sequence alignment followed by tree inference, despite persistent evidence that the alignment step can lead to biased results. Here we present a two-part study that first presents PaHMM-Tree, a novel neighbor joining-based method that estimates pairwise distances without assuming a single alignment. We then use simulations to benchmark its performance against a wide-range of other phylogenetic tree inference methods, including the first comparison of alignment-free distance-based methods against more conventional tree estimation methods. Our new method for calculating pairwise distances based on statistical alignment provides distance estimates that are as accurate as those obtained using standard methods based on the true alignment. Pairwise distance estimates based on the two-step process tend to be substantially less accurate. This improved performance carries through to tree inference, where PaHMM-Tree provides more accurate tree estimates than all of the pairwise distance methods assessed. For close to moderately divergent sequence data we find that the two-step methods using statistical inference, where information from all sequences is included in the estimation procedure, tend to perform better than PaHMM-Tree, particularly full statistical alignment, which simultaneously estimates both the tree and the alignment. For deep divergences we find the alignment step becomes so prone to error that our distance-based PaHMM-Tree outperforms all other methods of tree inference. Finally, we find that the accuracy of alignment-free methods tends to decline faster than standard two-step methods in the presence of alignment uncertainty, and identify no conditions where alignment-free methods are equal to or more accurate than standard phylogenetic methods even in the presence of substantial alignment error. [Alignment-free; distance-based phylogenetics; pair Hidden Markov Models; phylogenetic inference; statistical alignment.].

Collapse

124

Zou Q, Wan S, Zeng X, Ma ZS. Reconstructing evolutionary trees in parallel for massive sequences. BMC SYSTEMS BIOLOGY 2017;11:100. [PMID: 29297337 PMCID: PMC5751538 DOI: 10.1186/s12918-017-0476-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

125

Quattrini AM, Faircloth BC, Dueñas LF, Bridge TCL, Brugler MR, Calixto‐Botía IF, DeLeo DM, Forêt S, Herrera S, Lee SMY, Miller DJ, Prada C, Rádis‐Baptista G, Ramírez‐Portilla C, Sánchez JA, Rodríguez E, McFadden CS. Universal target‐enrichment baits for anthozoan (Cnidaria) phylogenomics: New approaches to long‐standing problems. Mol Ecol Resour 2017;18:281-295. [DOI: 10.1111/1755-0998.12736] [Citation(s) in RCA: 85] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2017] [Revised: 10/28/2017] [Accepted: 11/06/2017] [Indexed: 12/31/2022]

Affiliation(s)

Andrea M. Quattrini Department of Biology Harvey Mudd College Claremont CA USA
Brant C. Faircloth Department of Biological Sciences and Museum of Natural Science Louisiana State University Baton Rouge LA USA
Luisa F. Dueñas Departamento de Ciencias Biológicas‐Facultad de Ciencias Laboratorio de Biología Molecular Marina (BIOMMAR) Universidad de los Andes Bogotá Colombia
Tom C. L. Bridge Queensland Museum Network Townsville QLD Australia Australian Research Council Centre of Excellence for Coral Reef Studies James Cook University Townsville QLD Australia
Mercer R. Brugler Division of Invertebrate Zoology American Museum of Natural History New York NY USA Biological Sciences Department NYC College of Technology City University of New York Brooklyn NY USA
Iván F. Calixto‐Botía Departamento de Ciencias Biológicas‐Facultad de Ciencias Laboratorio de Biología Molecular Marina (BIOMMAR) Universidad de los Andes Bogotá Colombia Department of Animal Ecology and Systematics Justus Liebig Universität Giessen Germany
Danielle M. DeLeo Department of Biological Sciences Florida International University North Miami FL USA Biology Department Temple University Philadelphia PA USA
Sylvain Forêt Research School of Biology Australian National University Canberra ACT Australia
Santiago Herrera Department of Biological Sciences Lehigh University Bethlehem PA USA
Simon M. Y. Lee State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences University of Macau Macao China
David J. Miller Australian Research Council Centre of Excellence for Coral Reef Studies James Cook University Townsville QLD Australia
Carlos Prada Department of Biological Sciences University of Rhode Island Kingston RI USA
Gandhi Rádis‐Baptista Institute for Marine Sciences Federal University of Ceara Fortaleza CE Brazil
Catalina Ramírez‐Portilla Departamento de Ciencias Biológicas‐Facultad de Ciencias Laboratorio de Biología Molecular Marina (BIOMMAR) Universidad de los Andes Bogotá Colombia Department of Animal Ecology and Systematics Justus Liebig Universität Giessen Germany
Juan A. Sánchez Departamento de Ciencias Biológicas‐Facultad de Ciencias Laboratorio de Biología Molecular Marina (BIOMMAR) Universidad de los Andes Bogotá Colombia
Estefanía Rodríguez Division of Invertebrate Zoology American Museum of Natural History New York NY USA
Catherine S. McFadden Department of Biology Harvey Mudd College Claremont CA USA

Collapse

126

Revision of Podocotyloides Yamaguti, 1934 (Digenea: Opecoelidae), resurrection of Pedunculacetabulum Yamaguti, 1934 and the naming of a cryptic opecoelid species. Syst Parasitol 2017;95:1-31. [DOI: 10.1007/s11230-017-9761-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 10/29/2017] [Indexed: 10/18/2022]

127

Mishra B, Choi YJ, Thines M. Phylogenomics of Bartheletia paradoxa reveals its basal position in Agaricomycotina and that the early evolutionary history of basidiomycetes was rapid and probably not strictly bifurcating. Mycol Prog 2017. [DOI: 10.1007/s11557-017-1349-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

128

Ratmann O, Wymant C, Colijn C, Danaviah S, Essex M, Frost S, Gall A, Gaseitsiwe S, Grabowski MK, Gray R, Guindon S, von Haeseler A, Kaleebu P, Kendall M, Kozlov A, Manasa J, Minh BQ, Moyo S, Novitsky V, Nsubuga R, Pillay S, Quinn TC, Serwadda D, Ssemwanga D, Stamatakis A, Trifinopoulos J, Wawer M, Brown AL, de Oliveira T, Kellam P, Pillay D, Fraser C, on behalf of the PANGEA-HIV Consort. HIV-1 full-genome phylogenetics of generalized epidemics in sub-Saharan Africa: impact of missing nucleotide characters in next-generation sequences. AIDS Res Hum Retroviruses 2017;33:1083-1098. [PMID: 28540766 PMCID: PMC5597042 DOI: 10.1089/aid.2017.0061] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Abstract

To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the “Phylogenetics and Networks for Generalised HIV Epidemics in Africa” consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n = 2,833; MRC/UVRI Uganda, n = 701; Mochudi Prevention Project, n = 359; Africa Health Research Institute Resistance Cohort, n = 92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3′ end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences (NGS) has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected.

Collapse

Affiliation(s)

Oliver Ratmann MRC Centre for Outbreak Analyses and Modelling, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom
Chris Wymant Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
Caroline Colijn Department of Mathematics, Imperial College London, London, United Kingdom
Siva Danaviah Africa Health Research Institute, KwaZulu-Natal, South Africa
Max Essex Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
Simon Frost Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
Astrid Gall Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
Simani Gaseitsiwe Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
Mary K. Grabowski Department of Epidemiology Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland Rakai Health Sciences Program, Entebbe, Uganda
Ronald Gray Department of Epidemiology Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland Rakai Health Sciences Program, Entebbe, Uganda
Stephane Guindon Department of Statistics, University of Auckland, Auckland, New Zealand Laboratoire d'Informatique, de Robotique et de Microelectronique de Montpellier–UMR 5506, CNRS & UM, Montpellier, France
Arndt von Haeseler Centre for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, Austria
Pontiano Kaleebu MRC/UVRI Uganda Research Unit on AIDS, Entebbe, Uganda
Michelle Kendall Department of Mathematics, Imperial College London, London, United Kingdom
Alexey Kozlov Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
Justen Manasa Africa Health Research Institute, KwaZulu-Natal, South Africa
Bui Quang Minh Centre for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria
Sikhulile Moyo Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
Vlad Novitsky Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts Botswana Harvard AIDS Institute Partnership, Gaborone, Botswana
Rebecca Nsubuga MRC/UVRI Uganda Research Unit on AIDS, Entebbe, Uganda
Sureshnee Pillay Africa Health Research Institute, KwaZulu-Natal, South Africa
Thomas C. Quinn Rakai Health Sciences Program, Entebbe, Uganda Division of Intramural Research, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, Maryland Department of Medicine Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
David Serwadda Rakai Health Sciences Program, Entebbe, Uganda Makerere University School of Public Health, Makerere University College of Health Sciences, Kampala, Uganda
Deogratius Ssemwanga MRC/UVRI Uganda Research Unit on AIDS, Entebbe, Uganda
Alexandros Stamatakis Heidelberg Institute for Theoretical Studies, Heidelberg, Germany Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
Jana Trifinopoulos Centre for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria
Maria Wawer Department of Epidemiology Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland Rakai Health Sciences Program, Entebbe, Uganda
Andy Leigh Brown School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
Tulio de Oliveira Nelson R. Mandela School of Medicine, School of Laboratory Medicine and Medical Sciences, College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
Paul Kellam Department of Infectious Diseases and Immunity, Imperial College London, United Kingdom
Deenan Pillay Africa Health Research Institute, KwaZulu-Natal, South Africa Division of Infection & Immunity, Faculty of Medical Sciences, University College London, London, United Kingdom
Christophe Fraser Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
on behalf of the PANGEA-HIV Consort

Collapse

129

Edwards SV, Cloutier A, Baker AJ. Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics. Syst Biol 2017;66:1028-1044. [PMID: 28637293 PMCID: PMC5790140 DOI: 10.1093/sysbio/syx058] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2016] [Revised: 06/03/2017] [Accepted: 06/06/2017] [Indexed: 01/12/2023] Open

Abstract

Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis.

Collapse

130

Hallas JM, Chichvarkhin A, Gosliner TM. Aligning evidence: concerns regarding multiple sequence alignments in estimating the phylogeny of the Nudibranchia suborder Doridina. ROYAL SOCIETY OPEN SCIENCE 2017;4:171095. [PMID: 29134101 PMCID: PMC5666284 DOI: 10.1098/rsos.171095] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 09/20/2017] [Indexed: 06/07/2023]

131

Huston DC, Cutmore SC, Cribb TH. Molecular phylogeny of the Haplosplanchnata Olson, Cribb, Tkach, Bray and Littlewood, 2003, with a description of Schikhobalotrema huffmani n. sp. Acta Parasitol 2017;62:502-512. [PMID: 28682775 DOI: 10.1515/ap-2017-0060] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Accepted: 03/27/2017] [Indexed: 11/15/2022]

Abstract

We describe Schikhobalotrema huffmani n. sp. from Tylosurus crocodilus (Péron and Leseur) (Belonidae) collected off Lizard Island, Great Barrier Reef, Queensland, Australia and Tylosurus gavialoides (Castelnau) collected from Moreton Bay, Queensland. Schikhobalotrema huffmani n. sp., along with Schikhobalotrema ablennis (Abdul-Salam and Khalil, 1987) Madhavi, 2005, Schikhobalotrema acutum (Linton, 1910) Skrjabin and Guschanskaja, 1955 and Schikhobalotrema adacutum (Manter, 1937) Skrjabin and Guschanskaja, 1955 are distinguished from all other species of Schikhobalotrema Skrjabin and Guschanskaja, 1955 in having ventral suckers which bear lateral lobes and have longitudinal apertures. Schikhobalotrema huffmani n. sp. differs from S. ablennis in having an obvious post-vitelline region and a longer forebody. From S. acutum, S. huffmani n. sp. differs in having a prostatic bulb smaller than the pharynx and more anterior testis. From S. adacutum, S. huffmani n. sp. differs in having more prominent ventral sucker lobes, a conspicuous prostatic bulb and a longer forebody. We also report the first Australian record of Haplosplanchnus pachysomus (Eysenhardt, 1829) Looss, 1902, from Mugil cephalus Linnaeus (Mugilidae) collected in Moreton Bay. Molecular sequence data (ITS2, 18S and 28S rDNA) were generated for Schikhobalotrema huffmani n. sp., H. pachysomus and archived specimens of Hymenocotta mulli Manter, 1961. The new 18S and 28S molecular data were combined with published data of five other haplosplanchnid taxa to expand the phylogeny for the Haplosplanchnata. Bayesian inference and Maximum Likelihood analyses recovered identical tree topology and demonstrated the Haplosplanchnata as a well-supported monophyletic group. However, relationships at and below the subfamily level remain poorly resolved.

Collapse

132

Basso A, Babbucci M, Pauletto M, Riginella E, Patarnello T, Negrisolo E. The highly rearranged mitochondrial genomes of the crabs Maja crispata and Maja squinado (Majidae) and gene order evolution in Brachyura. Sci Rep 2017;7:4096. [PMID: 28642542 PMCID: PMC5481413 DOI: 10.1038/s41598-017-04168-9] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 05/11/2017] [Indexed: 11/09/2022] Open

133

Le VS, Dang CC, Le QS. Improved mitochondrial amino acid substitution models for metazoan evolutionary studies. BMC Evol Biol 2017;17:136. [PMID: 28606055 PMCID: PMC5469158 DOI: 10.1186/s12862-017-0987-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Accepted: 06/03/2017] [Indexed: 11/10/2022] Open

134

Anderson FE, Williams BW, Horn KM, Erséus C, Halanych KM, Santos SR, James SW. Phylogenomic analyses of Crassiclitellata support major Northern and Southern Hemisphere clades and a Pangaean origin for earthworms. BMC Evol Biol 2017;17:123. [PMID: 28558722 PMCID: PMC5450073 DOI: 10.1186/s12862-017-0973-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 05/18/2017] [Indexed: 11/26/2022] Open

135

James AM, Jayasena AS, Zhang J, Berkowitz O, Secco D, Knott GJ, Whelan J, Bond CS, Mylne JS. Evidence for Ancient Origins of Bowman-Birk Inhibitors from Selaginella moellendorffii. THE PLANT CELL 2017;29:461-473. [PMID: 28298518 PMCID: PMC5385957 DOI: 10.1105/tpc.16.00831] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 02/27/2017] [Accepted: 03/14/2017] [Indexed: 05/16/2023]

136

Martin SB, Cutmore SC, Cribb TH. Revision of Neolebouria Gibson, 1976 (Digenea: Opecoelidae), with Trilobovarium n. g., for species infecting tropical and subtropical shallow-water fishes. Syst Parasitol 2017;94:307-338. [DOI: 10.1007/s11230-017-9707-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Accepted: 02/02/2017] [Indexed: 11/24/2022]

137

Ayad LAK, Pissis SP. MARS: improving multiple circular sequence alignment using refined sequences. BMC Genomics 2017;18:86. [PMID: 28088189 PMCID: PMC5237495 DOI: 10.1186/s12864-016-3477-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 12/26/2016] [Indexed: 12/04/2022] Open

Abstract

Background

A fundamental assumption of all widely-used multiple sequence alignment techniques is that the left- and right-most positions of the input sequences are relevant to the alignment. However, the position where a sequence starts or ends can be totally arbitrary due to a number of reasons: arbitrariness in the linearisation (sequencing) of a circular molecular structure; or inconsistencies introduced into sequence databases due to different linearisation standards. These scenarios are relevant, for instance, in the process of multiple sequence alignment of mitochondrial DNA, viroid, viral or other genomes, which have a circular molecular structure. A solution for these inconsistencies would be to identify a suitable rotation (cyclic shift) for each sequence; these refined sequences may in turn lead to improved multiple sequence alignments using the preferred multiple sequence alignment program.

Results

We present MARS, a new heuristic method for improving Multiple circular sequence Alignment using Refined Sequences. MARS was implemented in the C++ programming language as a program to compute the rotations (cyclic shifts) required to best align a set of input sequences. Experimental results, using real and synthetic data, show that MARS improves the alignments, with respect to standard genetic measures and the inferred maximum-likelihood-based phylogenies, and outperforms state-of-the-art methods both in terms of accuracy and efficiency. Our results show, among others, that the average pairwise distance in the multiple sequence alignment of a dataset of widely-studied mitochondrial DNA sequences is reduced by around 5% when MARS is applied before a multiple sequence alignment is performed.

Conclusions

Analysing multiple sequences simultaneously is fundamental in biological research and multiple sequence alignment has been found to be a popular method for this task. Conventional alignment techniques cannot be used effectively when the position where sequences start is arbitrary. We present here a method, which can be used in conjunction with any multiple sequence alignment program, to address this problem effectively and efficiently.

Collapse

138

TreeShrink: Efficient Detection of Outlier Tree Leaves. COMPARATIVE GENOMICS 2017. [DOI: 10.1007/978-3-319-67979-2_7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

139

Phylogenetics and Phylogenomics of Rust Fungi. FUNGAL PHYLOGENETICS AND PHYLOGENOMICS 2017;100:267-307. [DOI: 10.1016/bs.adgen.2017.09.011] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

140

Ngoc PCT, Greenhalgh R, Dermauw W, Rombauts S, Bajda S, Zhurov V, Grbić M, Van de Peer Y, Van Leeuwen T, Rouzé P, Clark RM. Complex Evolutionary Dynamics of Massively Expanded Chemosensory Receptor Families in an Extreme Generalist Chelicerate Herbivore. Genome Biol Evol 2016;8:3323-3339. [PMID: 27797949 PMCID: PMC5203786 DOI: 10.1093/gbe/evw249] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

141

Chiner-Oms A, González-Candelas F. EvalMSA: A Program to Evaluate Multiple Sequence Alignments and Detect Outliers. Evol Bioinform Online 2016;12:277-284. [PMID: 27920488 PMCID: PMC5127606 DOI: 10.4137/ebo.s40583] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 10/02/2016] [Accepted: 10/05/2016] [Indexed: 12/01/2022] Open

142

Nute M, Warnow T. Scaling statistical multiple sequence alignment to large datasets. BMC Genomics 2016;17:764. [PMID: 28185555 PMCID: PMC5123300 DOI: 10.1186/s12864-016-3101-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open

143

Iwai S, Weinmaier T, Schmidt BL, Albertson DG, Poloso NJ, Dabbagh K, DeSantis TZ. Piphillin: Improved Prediction of Metagenomic Content by Direct Inference from Human Microbiomes. PLoS One 2016;11:e0166104. [PMID: 27820856 PMCID: PMC5098786 DOI: 10.1371/journal.pone.0166104] [Citation(s) in RCA: 198] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 10/07/2016] [Indexed: 01/30/2023] Open

144

Nunez JCB, Oleksiak MF. A Cost-Effective Approach to Sequence Hundreds of Complete Mitochondrial Genomes. PLoS One 2016;11:e0160958. [PMID: 27505419 PMCID: PMC4978415 DOI: 10.1371/journal.pone.0160958] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2016] [Accepted: 07/27/2016] [Indexed: 12/11/2022] Open

145

Vanhoutreve R, Kress A, Legrand B, Gass H, Poch O, Thompson JD. LEON-BIS: multiple alignment evaluation of sequence neighbours using a Bayesian inference system. BMC Bioinformatics 2016;17:271. [PMID: 27387560 PMCID: PMC4936259 DOI: 10.1186/s12859-016-1146-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Accepted: 07/01/2016] [Indexed: 11/13/2022] Open

146

Simmons MP, Gatesy J. Biases of tree-independent-character-subsampling methods. Mol Phylogenet Evol 2016;100:424-443. [DOI: 10.1016/j.ympev.2016.04.022] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 03/16/2016] [Accepted: 04/15/2016] [Indexed: 12/21/2022]

147

Jaiteh M, Taly A, Hénin J. Evolution of Pentameric Ligand-Gated Ion Channels: Pro-Loop Receptors. PLoS One 2016;11:e0151934. [PMID: 26986966 PMCID: PMC4795631 DOI: 10.1371/journal.pone.0151934] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Accepted: 03/07/2016] [Indexed: 01/27/2023] Open

Abstract

Pentameric ligand-gated ion channels (pLGICs) are ubiquitous neurotransmitter receptors in Bilateria, with a small number of known prokaryotic homologues. Here we describe a new inventory and phylogenetic analysis of pLGIC genes across all kingdoms of life. Our main finding is a set of pLGIC genes in unicellular eukaryotes, some of which are metazoan-like Cys-loop receptors, and others devoid of Cys-loop cysteines, like their prokaryotic relatives. A number of such “Cys-less” receptors also appears in invertebrate metazoans. Together, those findings draw a new distribution of pLGICs in eukaryotes. A broader distribution of prokaryotic channels also emerges, including a major new archaeal taxon, Thaumarchaeota. More generally, pLGICs now appear nearly ubiquitous in major taxonomic groups except multicellular plants and fungi. However, pLGICs are sparsely present in unicellular taxa, suggesting a high rate of gene loss and a non-essential character, contrasting with their essential role as synaptic receptors of the bilaterian nervous system. Multiple alignments of these highly divergent sequences reveal a small number of conserved residues clustered at the interface between the extracellular and transmembrane domains. Only the “Cys-loop” proline is absolutely conserved, suggesting the more fitting name “Pro loop” for that motif, and “Pro-loop receptors” for the superfamily. The infered molecular phylogeny shows a Cys-loop and a Cys-less clade in eukaryotes, both containing metazoans and unicellular members. This suggests new hypotheses on the evolutionary history of the superfamily, such as a possible origin of the Cys-loop cysteines in an ancient unicellular eukaryote. Deeper phylogenetic relationships remain uncertain, particularly around the split between bacteria, archaea, and eukaryotes.

Collapse

148

Alors D, Lumbsch HT, Divakar PK, Leavitt SD, Crespo A. An Integrative Approach for Understanding Diversity in the Punctelia rudecta Species Complex (Parmeliaceae, Ascomycota). PLoS One 2016;11:e0146537. [PMID: 26863231 PMCID: PMC4749632 DOI: 10.1371/journal.pone.0146537] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Accepted: 12/18/2015] [Indexed: 11/23/2022] Open

149

Cannon JT, Kocot KM. Phylogenomics Using Transcriptome Data. Methods Mol Biol 2016;1452:65-80. [PMID: 27460370 DOI: 10.1007/978-1-4939-3774-5_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]