Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dylus D, Altenhoff A, Majidian S, Sedlazeck FJ, Dessimoz C. Inference of phylogenetic trees directly from raw sequencing reads using Read2Tree. Nat Biotechnol 2024;42:139-147. [PMID: 37081138 PMCID: PMC10791578 DOI: 10.1038/s41587-023-01753-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 03/16/2023] [Indexed: 04/22/2023]

For:	Dylus D, Altenhoff A, Majidian S, Sedlazeck FJ, Dessimoz C. Inference of phylogenetic trees directly from raw sequencing reads using Read2Tree. Nat Biotechnol 2024;42:139-147. [PMID: 37081138 PMCID: PMC10791578 DOI: 10.1038/s41587-023-01753-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 03/16/2023] [Indexed: 04/22/2023]

Number

Cited by Other Article(s)

Jackson DJ, Cerveau N, Posnien N. De novo assembly of transcriptomes and differential gene expression analysis using short-read data from emerging model organisms - a brief guide. Front Zool 2024;21:17. [PMID: 38902827 PMCID: PMC11188175 DOI: 10.1186/s12983-024-00538-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open

Abstract

Many questions in biology benefit greatly from the use of a variety of model systems. High-throughput sequencing methods have been a triumph in the democratization of diverse model systems. They allow for the economical sequencing of an entire genome or transcriptome of interest, and with technical variations can even provide insight into genome organization and the expression and regulation of genes. The analysis and biological interpretation of such large datasets can present significant challenges that depend on the 'scientific status' of the model system. While high-quality genome and transcriptome references are readily available for well-established model systems, the establishment of such references for an emerging model system often requires extensive resources such as finances, expertise and computation capabilities. The de novo assembly of a transcriptome represents an excellent entry point for genetic and molecular studies in emerging model systems as it can efficiently assess gene content while also serving as a reference for differential gene expression studies. However, the process of de novo transcriptome assembly is non-trivial, and as a rule must be empirically optimized for every dataset. For the researcher working with an emerging model system, and with little to no experience with assembling and quantifying short-read data from the Illumina platform, these processes can be daunting. In this guide we outline the major challenges faced when establishing a reference transcriptome de novo and we provide advice on how to approach such an endeavor. We describe the major experimental and bioinformatic steps, provide some broad recommendations and cautions for the newcomer to de novo transcriptome assembly and differential gene expression analyses. Moreover, we provide an initial selection of tools that can assist in the journey from raw short-read data to assembled transcriptome and lists of differentially expressed genes.

Collapse

Agustinho DP, Fu Y, Menon VK, Metcalf GA, Treangen TJ, Sedlazeck FJ. Unveiling microbial diversity: harnessing long-read sequencing technology. Nat Methods 2024;21:954-966. [PMID: 38689099 DOI: 10.1038/s41592-024-02262-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/29/2024] [Indexed: 05/02/2024]

Roestel JA, Wiersema JH, Jansen RK, Borsch T, Gruenstaeudl M. On the importance of sequence alignment inspections in plastid phylogenomics - an example from revisiting the relationships of the water-lilies. Cladistics 2024. [PMID: 38761095 DOI: 10.1111/cla.12584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 04/27/2024] [Accepted: 04/29/2024] [Indexed: 05/20/2024] Open

Abstract

The water-lily clade represents the second earliest-diverging branch of angiosperms. Most of its species belong to Nymphaeaceae, of which the "core Nymphaeaceae"-comprising the genera Euryale, Nymphaea and Victoria-is the most diverse clade. Despite previous molecular phylogenetic studies on the core Nymphaeaceae, various aspects of their evolutionary relationships have remained unresolved. The length-variable introns and intergenic spacers are known to contain most of the sequence variability within the water-lily plastomes. Despite the challenges with multiple sequence alignment, any new molecular phylogenetic investigation on the core Nymphaeaceae should focus on these noncoding plastome regions. For example, a new plastid phylogenomic study on the core Nymphaeaceae should generate DNA sequence alignments of all plastid introns and intergenic spacers based on the principle of conserved sequence motifs. In this investigation, we revisit the phylogenetic history of the core Nymphaeaceae by employing such an approach. Specifically, we use a plastid phylogenomic analysis strategy in which all coding and noncoding partitions are separated and then undergo software-driven DNA sequence alignment, followed by a motif-based alignment inspection and adjustment. This approach allows us to increase the reliability of the character base compared to the default practice of aligning complete plastomes through software algorithms alone. Our approach produces significantly different phylogenetic tree reconstructions for several of the plastome regions under study. The results of these reconstructions underscore that Nymphaea is paraphyletic in its current circumscription, that each of the five subgenera of Nymphaea is monophyletic, and that the subgenus Nymphaea is sister to all other subgenera of Nymphaea. Our results also clarify many evolutionary relationships within the Nymphaea subgenera Brachyceras, Hydrocallis and Nymphaea. In closing, we discuss whether the phylogenetic reconstructions obtained through our motif-based alignment adjustments are in line with morphological evidence on water-lily evolution.

Collapse

Kille B, Nute MG, Huang V, Kim E, Phillippy AM, Treangen TJ. Parsnp 2.0: scalable core-genome alignment for massive microbial datasets. Bioinformatics 2024;40:btae311. [PMID: 38724243 PMCID: PMC11128092 DOI: 10.1093/bioinformatics/btae311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/12/2024] [Accepted: 05/07/2024] [Indexed: 05/21/2024] Open

Park S, Kwak M, Park S. Complete organelle genomes of Korean fir, Abies koreana and phylogenomics of the gymnosperm genus Abies using nuclear and cytoplasmic DNA sequence data. Sci Rep 2024;14:7636. [PMID: 38561351 PMCID: PMC10985005 DOI: 10.1038/s41598-024-58253-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 03/27/2024] [Indexed: 04/04/2024] Open

Abstract

Abies koreana E.H.Wilson is an endangered evergreen coniferous tree that is native to high altitudes in South Korea and susceptible to the effects of climate change. Hybridization and reticulate evolution have been reported in the genus; therefore, multigene datasets from nuclear and cytoplasmic genomes are needed to better understand its evolutionary history. Using the Illumina NovaSeq 6000 and Oxford Nanopore Technologies (ONT) PromethION platforms, we generated complete mitochondrial (1,174,803 bp) and plastid (121,341 bp) genomes from A. koreana. The mitochondrial genome is highly dynamic, transitioning from cis- to trans-splicing and breaking conserved gene clusters. In the plastome, the ONT reads revealed two structural conformations of A. koreana. The short inverted repeats (1186 bp) of the A. koreana plastome are associated with different structural types. Transcriptomic sequencing revealed 1356 sites of C-to-U RNA editing in the 41 mitochondrial genes. Using A. koreana as a reference, we additionally produced nuclear and organelle genomic sequences from eight Abies species and generated multiple datasets for maximum likelihood and network analyses. Three sections (Balsamea, Momi, and Pseudopicea) were well grouped in the nuclear phylogeny, but the phylogenomic relationships showed conflicting signals in the mitochondrial and plastid genomes, indicating a complicated evolutionary history that may have included introgressive hybridization. The obtained data illustrate that phylogenomic analyses based on sequences from differently inherited organelle genomes have resulted in conflicting trees. Organelle capture, organelle genome recombination, and incomplete lineage sorting in an ancestral heteroplasmic individual can contribute to phylogenomic discordance. We provide strong support for the relationships within Abies and new insights into the phylogenomic complexity of this genus.

Collapse

Wang F, Wang Y, Zeng X, Zhang S, Yu J, Li D, Zhang X. MIKE: an ultrafast, assembly-, and alignment-free approach for phylogenetic tree construction. Bioinformatics 2024;40:btae154. [PMID: 38547397 PMCID: PMC10990684 DOI: 10.1093/bioinformatics/btae154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 02/06/2024] [Indexed: 04/05/2024] Open

Affiliation(s)

Fang Wang College of Computer Science and Technology, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
Yibin Wang National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
Xiaofei Zeng Department of Human Cell Biology and Genetics, Joint Laboratory of Guangdong-Hong Kong Universities for Vascular Homeostasis and Diseases, School of Medicine, Southern University of Science and Technology, Shenzhen, Guangdong 508055, China
Shengcheng Zhang National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
Jiaxin Yu National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
Dongxi Li College of Computer Science and Technology, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China
Xingtan Zhang National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China

Collapse

Kille B, Nute MG, Huang V, Kim E, Phillippy AM, Treangen TJ. Parsnp 2.0: Scalable Core-Genome Alignment for Massive Microbial Datasets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.30.577458. [PMID: 38352342 PMCID: PMC10862825 DOI: 10.1101/2024.01.30.577458] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2024]

Altenhoff AM, Warwick Vesztrocy A, Bernard C, Train CM, Nicheperovich A, Prieto Baños S, Julca I, Moi D, Nevers Y, Majidian S, Dessimoz C, Glover NM. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA Ecosystem. Nucleic Acids Res 2024;52:D513-D521. [PMID: 37962356 PMCID: PMC10767875 DOI: 10.1093/nar/gkad1020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 11/15/2023] Open

Affiliation(s)

Adrian M Altenhoff SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
Alex Warwick Vesztrocy SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Charles Bernard SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Clement-Marie Train Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Alina Nicheperovich Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Silvia Prieto Baños SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Irene Julca SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
David Moi SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Yannis Nevers SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Sina Majidian SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Christophe Dessimoz SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Natasha M Glover SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland

Collapse

Thalén F, Köhne CG, Bleidorn C. Patchwork: Alignment-Based Retrieval and Concatenation of Phylogenetic Markers from Genomic Data. Genome Biol Evol 2023;15:evad227. [PMID: 38085033 PMCID: PMC10735302 DOI: 10.1093/gbe/evad227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2023] [Indexed: 12/23/2023] Open

Kim BY, Gellert HR, Church SH, Suvorov A, Anderson SS, Barmina O, Beskid SG, Comeault AA, Crown KN, Diamond SE, Dorus S, Fujichika T, Hemker JA, Hrcek J, Kankare M, Katoh T, Magnacca KN, Martin RA, Matsunaga T, Medeiros MJ, Miller DE, Pitnick S, Simoni S, Steenwinkel TE, Schiffer M, Syed ZA, Takahashi A, Wei KHC, Yokoyama T, Eisen MB, Kopp A, Matute D, Obbard DJ, O'Grady PM, Price DK, Toda MJ, Werner T, Petrov DA. Single-fly assemblies fill major phylogenomic gaps across the Drosophilidae Tree of Life. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.02.560517. [PMID: 37873137 PMCID: PMC10592941 DOI: 10.1101/2023.10.02.560517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Abstract

Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.

Collapse

Affiliation(s)

Bernard Y Kim Department of Biology, Stanford University, USA
Hannah R Gellert Department of Biology, Stanford University, USA
Samuel H Church Department of Ecology and Evolutionary Biology, Yale University, USA
Anton Suvorov Department of Biological Sciences, Virginia Tech, USA
Sean S Anderson Department of Biology, University of North Carolina Chapel Hill, USA
Olga Barmina Department of Evolution and Ecology, University of California Davis, USA
Sofia G Beskid Department of Biology, Stanford University, USA
Aaron A Comeault School of Environmental and Natural Sciences, Bangor University, UK
K Nicole Crown Department of Biology, Case Western Reserve University, USA
Sarah E Diamond Department of Biology, Case Western Reserve University, USA
Steve Dorus Center for Reproductive Evolution, Department of Biology, Syracuse University, USA
Takako Fujichika Department of Biological Sciences, Tokyo Metropolitan University, Japan
James A Hemker Department of Developmental Biology, Stanford University, USA
Jan Hrcek Institute of Entomology, Biology Centre, Czech Academy of Sciences, Czechia
Maaria Kankare Department of Biological and Environmental Science, University of Jyväskylä, Finland
Toru Katoh Department of Biological Sciences, Hokkaido University, Japan
Karl N Magnacca Hawaii Invertebrate Program, Division of Forestry & Wildlife, State of Hawaii, USA
Ryan A Martin Department of Biology, Case Western Reserve University, USA
Teruyuki Matsunaga Department of Complexity Science and Engineering, The University of Tokyo, Japan
Matthew J Medeiros Pacific Biosciences Research Center, University of Hawai'i, Mānoa, USA
Danny E Miller Division of Genetic Medicine, Department of Pediatrics; Department of Laboratory Medicine and Pathology, University of Washington, USA
Scott Pitnick Center for Reproductive Evolution, Department of Biology, Syracuse University, USA
Sara Simoni Department of Biology, Stanford University, USA
Tessa E Steenwinkel Baylor College of Medicine, USA
Michele Schiffer Daintree Rainforest Observatory, James Cook University, Australia
Zeeshan A Syed Center for Reproductive Evolution, Department of Biology, Syracuse University, USA
Aya Takahashi Department of Biological Sciences, Tokyo Metropolitan University, Japan
Kevin H-C Wei Department of Zoology, The University of British Columbia
Tsuya Yokoyama Department of Biology, Stanford University, USA
Michael B Eisen Department of Cell and Molecular Biology, University of California Berkeley, United States Howard Hughes Medical Institute,University of California Berkeley, United States
Artyom Kopp Department of Evolution and Ecology, University of California Davis, USA
Daniel Matute Department of Biology, University of North Carolina Chapel Hill, USA
Darren J Obbard Institute of Ecology and Evolution, University of Edinburgh, UK
Patrick M O'Grady Department of Entomology, Cornell University, USA
Donald K Price School of Life Sciences, University of Nevada Las Vegas, USA
Masanori J Toda Hokkaido University Museum, Hokkaido University, Japan
Thomas Werner Department of Biological Sciences, Michigan Technological University, USA
Dmitri A Petrov Department of Biology, Stanford University, USA CZ Biohub, Investigator

Collapse