1
|
Jain K, Panigrahi M, Nayak SS, Rajawat D, Sharma A, Sahoo SP, Bhushan B, Dutt T. The evolution of contemporary livestock species: Insights from mitochondrial genome. Gene 2024; 927:148728. [PMID: 38944163 DOI: 10.1016/j.gene.2024.148728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/05/2024] [Accepted: 06/24/2024] [Indexed: 07/01/2024]
Abstract
The domestication of animals marks a pivotal moment in human history, profoundly influencing our demographic and cultural progress. This process has led to significant genetic, behavioral, and physical changes in livestock species compared to their wild ancestors. Understanding the evolutionary history and genetic diversity of livestock species is crucial, and mitochondrial DNA (mtDNA) has emerged as a robust marker for investigating molecular diversity in animals. Its highly conserved gene content across animal species, minimal duplications, absence of introns, and short intergenic regions make mtDNA analysis ideal for such studies. Mitochondrial DNA analysis has uncovered distinct cattle domestication events dating back to 8000 years BC in Southwestern Asia. The sequencing of water buffalo mtDNA in 2004 provided important insights into their domestication history. Caprine mtDNA analysis identified three haplogroups, indicating varied maternal origins. Sheep, domesticated 12,000 years ago, exhibit diverse mtDNA lineages, suggesting multiple domestication events. Ovine mtDNA studies revealed clades A, B, C, and a fourth lineage, group D. The origins of domestic pigs were traced to separate European and Asian events followed by interbreeding. In camels, mtDNA elucidated the phylogeographic structure and genetic differentiation between wild and domesticated species. Horses, domesticated around 3500 BC, show significant mtDNA variability, highlighting their diverse origins. Yaks exhibit unique adaptations for high-altitude environments, with mtDNA analysis providing insights into their adaptation. Chicken mtDNA studies supported a monophyletic origin from Southeast Asia's red jungle fowl, with evidence of multiple origins. This review explores livestock evolution and diversity through mtDNA studies, focusing on cattle, water buffalo, goat, sheep, pig, camel, horse, yak and chicken. It highlights mtDNA's significance in unraveling maternal lineages, genetic diversity, and domestication histories, concluding with insights into its potential application in improving livestock production and reproduction dynamics.
Collapse
Affiliation(s)
- Karan Jain
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| | - Sonali Sonejita Nayak
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Divya Rajawat
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Anurodh Sharma
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | | | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- Livestock Production and Management Section, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| |
Collapse
|
2
|
Hannon-Hatfield JA, Chen J, Bergman CM, Garfinkel DJ. Evolution of a Restriction Factor by Domestication of a Yeast Retrotransposon. Mol Biol Evol 2024; 41:msae050. [PMID: 38442736 PMCID: PMC10951436 DOI: 10.1093/molbev/msae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 02/13/2024] [Accepted: 02/23/2024] [Indexed: 03/07/2024] Open
Abstract
Transposable elements drive genome evolution in all branches of life. Transposable element insertions are often deleterious to their hosts and necessitate evolution of control mechanisms to limit their spread. The long terminal repeat retrotransposon Ty1 prime (Ty1'), a subfamily of the Ty1 family, is present in many Saccharomyces cerevisiae strains, but little is known about what controls its copy number. Here, we provide evidence that a novel gene from an exapted Ty1' sequence, domesticated restriction of Ty1' relic 2 (DRT2), encodes a restriction factor that inhibits Ty1' movement. DRT2 arose through domestication of a Ty1' GAG gene and contains the C-terminal domain of capsid, which in the related Ty1 canonical subfamily functions as a self-encoded restriction factor. Bioinformatic analysis reveals the widespread nature of DRT2, its evolutionary history, and pronounced structural variation at the Ty1' relic 2 locus. Ty1' retromobility analyses demonstrate DRT2 restriction factor functionality, and northern blot and RNA-seq analysis indicate that DRT2 is transcribed in multiple strains. Velocity cosedimentation profiles indicate an association between Drt2 and Ty1' virus-like particles or assembly complexes. Chimeric Ty1' elements containing DRT2 retain retromobility, suggesting an ancestral role of productive Gag C-terminal domain of capsid functionality is present in the sequence. Unlike Ty1 canonical, Ty1' retromobility increases with copy number, suggesting that C-terminal domain of capsid-based restriction is not limited to the Ty1 canonical subfamily self-encoded restriction factor and drove the endogenization of DRT2. The discovery of an exapted Ty1' restriction factor provides insight into the evolution of the Ty1 family, evolutionary hot-spots, and host-transposable element interactions.
Collapse
Affiliation(s)
- J Adam Hannon-Hatfield
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
| | - Jingxuan Chen
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Casey M Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - David J Garfinkel
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
| |
Collapse
|
3
|
Bao Y, He M, Zhang C, Jiang S, Zhao L, Ye Z, Sun Q, Xia Z, Zou M. Advancing understanding of Ficus carica: a comprehensive genomic analysis reveals evolutionary patterns and metabolic pathway insights. FRONTIERS IN PLANT SCIENCE 2023; 14:1298417. [PMID: 38155853 PMCID: PMC10754049 DOI: 10.3389/fpls.2023.1298417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/20/2023] [Indexed: 12/30/2023]
Abstract
Ficus carica L. (dioecious), the most significant commercial species in the genus Ficus, which has been cultivated for more than 11,000 years and was one of the first species to be domesticated. Herein, we reported the most comprehensive F. carica genome currently. The contig N50 of the Orphan fig was 9.78 Mb, and genome size was 366.34 Mb with 13 chromosomes. Based on the high-quality genome, we discovered that F. carica diverged from Ficus microcarpa ~34 MYA, and a WGD event took place about 2─3 MYA. Throughout the evolutionary history of F. carica, chromosomes 2, 8, and 10 had experienced chromosome recombination, while chromosome 3 saw a fusion and fission. It is worth proposing that the chromosome 9 experienced both inversion and translocation, which facilitated the emergence of the F. carica as a new species. And the selections of F. carica for the genes of recombination chromosomal fragment are compatible with their goal of domestication. In addition, we found that the F. carica has the FhAG2 gene, but there are structural deletions and positional jumps. This gene is thought to replace the one needed for female common type F. carica to be pollinated. Subsequently, we conducted genomic, transcriptomic, and metabolomic analysis to demonstrate significant differences in the expression of CHS among different varieties of F. carica. The CHS playing an important role in the anthocyanin metabolism pathway of F. carica. Moreover, the CHS gene of F. carica has a different evolutionary trend compared to other Ficus species. These high-quality genome assembly, transcriptomic, and metabolomic resources further enrich F. carica genomics and provide insights for studying the chromosomes evolution, sexual system, and color characteristics of Ficus.
Collapse
Affiliation(s)
- Yuting Bao
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
| | - Miaohua He
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
| | - Chenji Zhang
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
- College of Agriculture, China Agricultural University, Beijing, China
| | - Sirong Jiang
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
| | - Long Zhao
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
- Academy of Agriculture and Forestry Sciences, Qinghai University, Xining, Qinghai, China
| | - Zhengwen Ye
- Forestry and Fruit Research Institute, Shanghai Academy of Agricultural Sciences, Shanghai, China
| | - Qian Sun
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
- College of Life Science and Technology, Guangxi University, Guangxi, China
| | - Zhiqiang Xia
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
| | - Meiling Zou
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya, China
| |
Collapse
|
4
|
Hikida H, Okazaki Y, Zhang R, Nguyen TT, Ogata H. A rapid genome-wide analysis of isolated giant viruses using MinION sequencing. Environ Microbiol 2023; 25:2621-2635. [PMID: 37543720 DOI: 10.1111/1462-2920.16476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 07/20/2023] [Indexed: 08/07/2023]
Abstract
Following the discovery of Acanthamoeba polyphaga mimivirus, diverse giant viruses have been isolated. However, only a small fraction of these isolates have been completely sequenced, limiting our understanding of the genomic diversity of giant viruses. MinION is a portable and low-cost long-read sequencer that can be readily used in a laboratory. Although MinION provides highly error-prone reads that require correction through additional short-read sequencing, recent studies assembled high-quality microbial genomes only using MinION sequencing. Here, we evaluated the accuracy of MinION-only genome assemblies for giant viruses by re-sequencing a prototype marseillevirus. Assembled genomes presented over 99.98% identity to the reference genome with a few gaps, demonstrating a high accuracy of the MinION-only assembly. As a proof of concept, we de novo assembled five newly isolated viruses. Average nucleotide identities to their closest known relatives suggest that the isolates represent new species of marseillevirus, pithovirus and mimivirus. The assembly of subsampled reads demonstrated that their taxonomy and genomic composition could be analysed at the 50× sequencing coverage. We also identified a pithovirus gene whose homologues were detected only in metagenome-derived relatives. Collectively, we propose that MinION-only assembly is an effective approach to rapidly perform a genome-wide analysis of isolated giant viruses.
Collapse
Affiliation(s)
- Hiroyuki Hikida
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Yusuke Okazaki
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Ruixuan Zhang
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Thi Tuyen Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| | - Hiroyuki Ogata
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
| |
Collapse
|
5
|
Huang YY, Li QS, Li ZD, Sun AH, Hu SP. Rapid diagnosis of Mycobacterium marinum infection using targeted nanopore sequencing: a case report. Front Cell Infect Microbiol 2023; 13:1238872. [PMID: 37965260 PMCID: PMC10642934 DOI: 10.3389/fcimb.2023.1238872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 10/16/2023] [Indexed: 11/16/2023] Open
Abstract
Mycobacterium marinum (M. marinum) is a non-tuberculous mycobacterium (NTM) that can cause infectious diseases in aquatic animals and humans. Culture-based pathogen detection is the gold standard for diagnosing NTM infection. However, this method is time-consuming and has low positivity rates for fastidious organisms. Oxford Nanopore MinION sequencing is an emerging third-generation sequencing technology that can sequence DNA or RNA directly in a culture-independent manner and offers rapid microbial identification. Further benefits include low cost, short turnaround time, long read lengths, and small equipment size. Nanopore sequencing plays a crucial role in assessing drug resistance, clinical identification of microbes, and monitoring infectious diseases. Some reports on Mycobacterium tuberculosis (MTB) using nanopore sequencing have been published, however, there are few reports on NTM, such as M. marinum. Here, we report the use of nanopore sequencing for the diagnosis of M. marinum.
Collapse
Affiliation(s)
- Yan-Ying Huang
- Department of Pathology, Hangzhou Red Cross Hospital, Hangzhou, China
| | - Qiu-Shi Li
- Department of Ophthalmology, Hangzhou Red Cross Hospital, Hangzhou, China
| | - Zhao-Dong Li
- Department of Clinical laboratory, Hangzhou Red Cross Hospital, Hangzhou, China
| | - Ai-Hua Sun
- School of Basic Medical Sciences and Forensic Medicine, Hangzhou Medical College, Hangzhou, China
| | - Sheng-Ping Hu
- Department of Orthopaedic, Hangzhou Red Cross Hospital, Hangzhou, China
| |
Collapse
|
6
|
Yu PL, Fulton JC, Hudson OH, Huguet-Tapia JC, Brawner JT. Next-generation fungal identification using target enrichment and Nanopore sequencing. BMC Genomics 2023; 24:581. [PMID: 37784013 PMCID: PMC10544392 DOI: 10.1186/s12864-023-09691-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 09/21/2023] [Indexed: 10/04/2023] Open
Abstract
BACKGROUND Rapid and accurate pathogen identification is required for disease management. Compared to sequencing entire genomes, targeted sequencing may be used to direct sequencing resources to genes of interest for microbe identification and mitigate the low resolution that single-locus molecular identification provides. This work describes a broad-spectrum fungal identification tool developed to focus high-throughput Nanopore sequencing on genes commonly employed for disease diagnostics and phylogenetic inference. RESULTS Orthologs of targeted genes were extracted from 386 reference genomes of fungal species spanning six phyla to identify homologous regions that were used to design the baits used for enrichment. To reduce the cost of producing probes without diminishing the phylogenetic power, DNA sequences were first clustered, and then consensus sequences within each cluster were identified to produce 26,000 probes that targeted 114 genes. To test the efficacy of our probes, we applied the technique to three species representing Ascomycota and Basidiomycota fungi. The efficiency of enrichment, quantified as mean target coverage over the mean genome-wide coverage, ranged from 200 to 300. Furthermore, enrichment of long reads increased the depth of coverage across the targeted genes and into non-coding flanking sequence. The assemblies generated from enriched samples provided well-resolved phylogenetic trees for taxonomic assignment and molecular identification. CONCLUSIONS Our work provides data to support the utility of targeted Nanopore sequencing for fungal identification and provides a platform that may be extended for use with other phytopathogens.
Collapse
Affiliation(s)
- Pei-Ling Yu
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - James C Fulton
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Owen H Hudson
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - Jose C Huguet-Tapia
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - Jeremy T Brawner
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA.
| |
Collapse
|
7
|
Wang W, Zhang X, Garcia S, Leitch AR, Kovařík A. Intragenomic rDNA variation - the product of concerted evolution, mutation, or something in between? Heredity (Edinb) 2023; 131:179-188. [PMID: 37402824 PMCID: PMC10462631 DOI: 10.1038/s41437-023-00634-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 06/12/2023] [Accepted: 06/12/2023] [Indexed: 07/06/2023] Open
Abstract
The classical model of concerted evolution states that hundreds to thousands of ribosomal DNA (rDNA) units undergo homogenization, making the multiple copies of the individual units more uniform across the genome than would be expected given mutation frequencies and gene redundancy. While the universality of this over 50-year-old model has been confirmed in a range of organisms, advanced high throughput sequencing techniques have also revealed that rDNA homogenization in many organisms is partial and, in rare cases, even apparently failing. The potential underpinning processes leading to unexpected intragenomic variation have been discussed in a number of studies, but a comprehensive understanding remains to be determined. In this work, we summarize information on variation or polymorphisms in rDNAs across a wide range of taxa amongst animals, fungi, plants, and protists. We discuss the definition and description of concerted evolution and describe whether incomplete concerted evolution of rDNAs predominantly affects coding or non-coding regions of rDNA units and if it leads to the formation of pseudogenes or not. We also discuss the factors contributing to rDNA variation, such as interspecific hybridization, meiotic cycles, rDNA expression status, genome size, and the activity of effector genes involved in genetic recombination, epigenetic modifications, and DNA editing. Finally, we argue that a combination of approaches is needed to target genetic and epigenetic phenomena influencing incomplete concerted evolution, to give a comprehensive understanding of the evolution and functional consequences of intragenomic variation in rDNA.
Collapse
Affiliation(s)
- Wencai Wang
- Science and Technology Innovation Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Xianzhi Zhang
- Department of Horticulture, College of Horticulture and Landscape Architecture, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Sònia Garcia
- Institut Botànic de Barcelona, IBB (CSIC - Ajuntament de Barcelona), Barcelona, Spain
| | - Andrew R Leitch
- School of Biological and Behavioral Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - Aleš Kovařík
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, CZ-61200, Czech Republic.
| |
Collapse
|
8
|
Guerrini V, Conte A, Grossi R, Liti G, Rosone G, Tattini L. phyBWT2: phylogeny reconstruction via eBWT positional clustering. Algorithms Mol Biol 2023; 18:11. [PMID: 37537624 PMCID: PMC10399073 DOI: 10.1186/s13015-023-00232-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 06/10/2023] [Indexed: 08/05/2023] Open
Abstract
BACKGROUND Molecular phylogenetics studies the evolutionary relationships among the individuals of a population through their biological sequences. It may provide insights about the origin and the evolution of viral diseases, or highlight complex evolutionary trajectories. A key task is inferring phylogenetic trees from any type of sequencing data, including raw short reads. Yet, several tools require pre-processed input data e.g. from complex computational pipelines based on de novo assembly or from mappings against a reference genome. As sequencing technologies keep becoming cheaper, this puts increasing pressure on designing methods that perform analysis directly on their outputs. From this viewpoint, there is a growing interest in alignment-, assembly-, and reference-free methods that could work on several data including raw reads data. RESULTS We present phyBWT2, a newly improved version of phyBWT (Guerrini et al. in 22nd International Workshop on Algorithms in Bioinformatics (WABI) 242:23-12319, 2022). Both of them directly reconstruct phylogenetic trees bypassing both the alignment against a reference genome and de novo assembly. They exploit the combinatorial properties of the extended Burrows-Wheeler Transform (eBWT) and the corresponding eBWT positional clustering framework to detect relevant blocks of the longest shared substrings of varying length (unlike the k-mer-based approaches that need to fix the length k a priori). As a result, they provide novel alignment-, assembly-, and reference-free methods that build partition trees without relying on the pairwise comparison of sequences, thus avoiding to use a distance matrix to infer phylogeny. In addition, phyBWT2 outperforms phyBWT in terms of running time, as the former reconstructs phylogenetic trees step-by-step by considering multiple partitions, instead of just one partition at a time, as previously done by the latter. CONCLUSIONS Based on the results of the experiments on sequencing data, we conclude that our method can produce trees of quality comparable to the benchmark phylogeny by handling datasets of different types (short reads, contigs, or entire genomes). Overall, the experiments confirm the effectiveness of phyBWT2 that improves the performance of its previous version phyBWT, while preserving the accuracy of the results.
Collapse
Affiliation(s)
| | - Alessio Conte
- Dipartimento di Informatica, University of Pisa, Pisa, Italy.
| | - Roberto Grossi
- Dipartimento di Informatica, University of Pisa, Pisa, Italy.
| | - Gianni Liti
- CNRS UMR 7284, INSERM U1081 Université Côte d'Azu, Nice, France
| | - Giovanna Rosone
- Dipartimento di Informatica, University of Pisa, Pisa, Italy.
| | - Lorenzo Tattini
- CNRS UMR 7284, INSERM U1081 Université Côte d'Azu, Nice, France
| |
Collapse
|
9
|
O'Donnell S, Yue JX, Saada OA, Agier N, Caradec C, Cokelaer T, De Chiara M, Delmas S, Dutreux F, Fournier T, Friedrich A, Kornobis E, Li J, Miao Z, Tattini L, Schacherer J, Liti G, Fischer G. Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae. Nat Genet 2023; 55:1390-1399. [PMID: 37524789 PMCID: PMC10412453 DOI: 10.1038/s41588-023-01459-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 06/26/2023] [Indexed: 08/02/2023]
Abstract
Pangenomes provide access to an accurate representation of the genetic diversity of species, both in terms of sequence polymorphisms and structural variants (SVs). Here we generated the Saccharomyces cerevisiae Reference Assembly Panel (ScRAP) comprising reference-quality genomes for 142 strains representing the species' phylogenetic and ecological diversity. The ScRAP includes phased haplotype assemblies for several heterozygous diploid and polyploid isolates. We identified circa (ca.) 4,800 nonredundant SVs that provide a broad view of the genomic diversity, including the dynamics of telomere length and transposable elements. We uncovered frequent cases of complex aneuploidies where large chromosomes underwent large deletions and translocations. We found that SVs can impact gene expression near the breakpoints and substantially contribute to gene repertoire evolution. We also discovered that horizontally acquired regions insert at chromosome ends and can generate new telomeres. Overall, the ScRAP demonstrates the benefit of a pangenome in understanding genome evolution at population scale.
Collapse
Affiliation(s)
- Samuel O'Donnell
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Jia-Xing Yue
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Nicolas Agier
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Claudia Caradec
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Thomas Cokelaer
- Biomics Technological Platform, Center for Technological Resources and Research (C2RT), Institut Pasteur, Paris, France
- Bioinformatics and Biostatistics Hub, Computational Biology Department, Institut Pasteur, Paris, France
| | | | - Stéphane Delmas
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France
| | - Fabien Dutreux
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Téo Fournier
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Etienne Kornobis
- Biomics Technological Platform, Center for Technological Resources and Research (C2RT), Institut Pasteur, Paris, France
- Bioinformatics and Biostatistics Hub, Computational Biology Department, Institut Pasteur, Paris, France
| | - Jing Li
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Zepu Miao
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
| | | | | | - Gianni Liti
- Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France.
| | - Gilles Fischer
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France.
| |
Collapse
|
10
|
Weller CA, Andreev I, Chambers MJ, Park M, Bloom JS, Sadhu MJ. Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity. Genome Res 2023; 33:729-740. [PMID: 37127330 PMCID: PMC10317115 DOI: 10.1101/gr.277515.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 04/26/2023] [Indexed: 05/03/2023]
Abstract
Understanding the genetic causes of trait variation is a primary goal of genetic research. One way that individuals can vary genetically is through variable pangenomic genes: genes that are only present in some individuals in a population. The presence or absence of entire genes could have large effects on trait variation. However, variable pangenomic genes can be missed in standard genotyping workflows, owing to reliance on aligning short-read sequencing to reference genomes. A popular method for studying the genetic basis of trait variation is linkage mapping, which identifies quantitative trait loci (QTLs), regions of the genome that harbor causative genetic variants. Large-scale linkage mapping in the budding yeast Saccharomyces cerevisiae has found thousands of QTLs affecting myriad yeast phenotypes. To enable the resolution of QTLs caused by variable pangenomic genes, we used long-read sequencing to generate highly complete de novo genome assemblies of 16 diverse yeast isolates. With these assemblies, we resolved QTLs for growth on maltose, sucrose, raffinose, and oxidative stress to specific genes that are absent from the reference genome but present in the broader yeast population at appreciable frequency. Copies of genes also duplicate onto chromosomes where they are absent in the reference genome, and we found that these copies generate additional QTLs whose resolution requires pangenome characterization. Our findings show the need for highly complete genome assemblies to identify the genetic basis of trait variation.
Collapse
Affiliation(s)
- Cory A Weller
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Ilya Andreev
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Michael J Chambers
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Morgan Park
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Joshua S Bloom
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, California 90095, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, California 90095, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, California 90095, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, California 90095, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, California 90095, USA
| | - Meru J Sadhu
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA;
| |
Collapse
|
11
|
Mierke F, Brink DP, Norbeck J, Siewers V, Andlid T. Functional genome annotation and transcriptome analysis of Pseudozyma hubeiensis BOT-O, an oleaginous yeast that utilizes glucose and xylose at equal rates. Fungal Genet Biol 2023; 166:103783. [PMID: 36870442 DOI: 10.1016/j.fgb.2023.103783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 02/10/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023]
Abstract
Pseudozyma hubeiensis is a basidiomycete yeast that has the highly desirable traits for lignocellulose valorisation of being equally efficient at utilization of glucose and xylose, and capable of their co-utilization. The species has previously mainly been studied for its capacity to produce secreted biosurfactants in the form of mannosylerythritol lipids, but it is also an oleaginous species capable of accumulating high levels of triacylglycerol storage lipids during nutrient starvation. In this study, we aimed to further characterize the oleaginous nature of P. hubeiensis by evaluating metabolism and gene expression responses during storage lipid formation conditions with glucose or xylose as a carbon source. The genome of the recently isolated P. hubeiensis BOT-O strain was sequenced using MinION long-read sequencing and resulted in the most contiguous P. hubeiensis assembly to date with 18.95 Mb in 31 contigs. Using transcriptome data as experimental support, we generated the first mRNA-supported P. hubeiensis genome annotation and identified 6540 genes. 80% of the predicted genes were assigned functional annotations based on protein homology to other yeasts. Based on the annotation, key metabolic pathways in BOT-O were reconstructed, including pathways for storage lipids, mannosylerythritol lipids and xylose assimilation. BOT-O was confirmed to consume glucose and xylose at equal rates, but during mixed glucose-xylose cultivation glucose was found to be taken up faster. Differential expression analysis revealed that only a total of 122 genes were significantly differentially expressed at a cut-off of |log2 fold change| ≥ 2 when comparing cultivation on xylose with glucose, during exponential growth and during nitrogen-starvation. Of these 122 genes, a core-set of 24 genes was identified that were differentially expressed at all time points. Nitrogen-starvation resulted in a larger transcriptional effect, with a total of 1179 genes with significant expression changes at the designated fold change cut-off compared with exponential growth on either glucose or xylose.
Collapse
Affiliation(s)
- Friederike Mierke
- Food and Nutrition Science, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden; Systems and Synthetic Biology, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Daniel P Brink
- Systems and Synthetic Biology, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden; Applied Microbiology, Department of Chemistry, Lund University, Lund, Sweden
| | - Joakim Norbeck
- Systems and Synthetic Biology, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Verena Siewers
- Systems and Synthetic Biology, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
| | - Thomas Andlid
- Food and Nutrition Science, Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
12
|
Song Y, Li C, Liu L, Hu P, Li G, Zhao X, Zhou H. The population genomic analyses of chloroplast genomes shed new insights on the complicated ploidy and evolutionary history in Fragaria. FRONTIERS IN PLANT SCIENCE 2023; 13:1065218. [PMID: 36874917 PMCID: PMC9975502 DOI: 10.3389/fpls.2022.1065218] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 12/30/2022] [Indexed: 06/18/2023]
Abstract
The genus Fragaria consists of a rich diversity of ploidy levels with diploid (2x), tetraploid (4x), pentaploid (5x), hexaploidy (6x), octoploid (8x) and decaploid (10x) species. Only a few studies have explored the origin of diploid and octoploid strawberry, and little is known about the roles of tetraploidy and hexaploidy during the evolution of octoploid strawberry. The chloroplast genome is usually a stable circular genome and is widely used in investigating the evolution and matrilineal identification. Here, we assembled the chloroplast genomes of F. x ananassa cv. 'Benihoppe' (8x) using Illumina and HiFi data seperately. The genome alignment results showed that more InDels were located in the chloroplast genomes based on the PacBio HiFi data than Illumina data. We obtain highly accurate chloroplast genomes assembled through GetOrganelle using Illumina reads. We assembled 200 chloroplast genomes including 198 Fragaria (21 species) and 2 Potentilla samples. Analyses of sequence variation, phylogenetic and PCA analyses showed that Fragaria was divided into five groups. F. iinumae, F. nilgerrensis and all octoploid accessions formed Group A, C and E separately. Species native to western China were clustered into Group B. Group D consisted of F. virdis, F. orientalis, F. moschata, and F. vesca. STRUCTURE and haplotype network confirmed that the diploid F. vesca subsp. bracteata was the last maternal donator of octoploid strawberry. The dN/dS ratio estimated for the protein-coding genes revealed that genes involved in ATP synthase and photosystem function were under positive selection. These findings demonstrate the phylogeny of totally 21 Fragaria species and the origin of octoploid species. F. vesca was the last female donator of octoploid, which confirms the hypothesis that the hexaploid species F. moschata may be an evolutionary intermediate between the diploids and wild octoploid species.
Collapse
Affiliation(s)
- Yanhong Song
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Chaochao Li
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), Huazhong Agricultural University, Wuhan, China
| | - Lifeng Liu
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Panpan Hu
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Gang Li
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Xia Zhao
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| | - Houcheng Zhou
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, China
| |
Collapse
|
13
|
Li J, Guan D, Halstead MM, Islas-Trejo AD, Goszczynski DE, Ernst CW, Cheng H, Ross P, Zhou H. Transcriptome annotation of 17 porcine tissues using nanopore sequencing technology. Anim Genet 2023; 54:35-44. [PMID: 36385508 DOI: 10.1111/age.13274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 10/20/2022] [Accepted: 11/01/2022] [Indexed: 11/18/2022]
Abstract
The annotation of animal genomes plays an important role in elucidating molecular mechanisms behind the genetic control of economically important traits. Here, we employed long-read sequencing technology, Oxford Nanopore Technology, to annotate the pig transcriptome across 17 tissues from two Yorkshire littermate pigs. More than 9.8 million reads were obtained from a single flow cell, and 69 781 unique transcripts at 50 108 loci were identified. Of these transcripts, 16 255 were found to be novel isoforms, and 22 344 were found at loci that were novel and unannotated in the Ensembl (release 102) and NCBI (release 106) annotations. Novel transcripts were mostly expressed in cerebellum, followed by lung, liver, spleen, and hypothalamus. By comparing the unannotated transcripts to existing databases, there were 21 285 (95.3%) transcripts matched to the NT database (v5) and 13 676 (61.2%) matched to the NR database (v5). Moreover, there were 4324 (19.4%) transcripts matched to the SwissProt database (v5), corresponding to 11 356 proteins. Tissue-specific gene expression analyses showed that 9749 transcripts were highly tissue-specific, and cerebellum contained the most tissue-specific transcripts. As the same samples were used for the annotation of cis-regulatory elements in the pig genome, the transcriptome annotation generated by this study provides an additional and complementary annotation resource for the Functional Annotation of Animal Genomes effort to comprehensively annotate the pig genome.
Collapse
Affiliation(s)
- Jinghui Li
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Dailu Guan
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Michelle M Halstead
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Alma D Islas-Trejo
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Daniel E Goszczynski
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Catherine W Ernst
- Department of Animal Science, Michigan State University, East Lansing, Michigan, USA
| | - Hao Cheng
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Pablo Ross
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, California, USA
| |
Collapse
|
14
|
Vasquez I, Retamales J, Parra B, Machimbirike V, Robeson J, Santander J. Comparative Genomics of a Polyvalent Escherichia-Salmonella Phage fp01 and In Silico Analysis of Its Receptor Binding Protein and Conserved Enterobacteriaceae Phage Receptor. Viruses 2023; 15:v15020379. [PMID: 36851593 PMCID: PMC9961651 DOI: 10.3390/v15020379] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 01/31/2023] Open
Abstract
The polyvalent bacteriophage fp01, isolated from wastewater in Valparaiso, Chile, was described to have lytic activity across bacterial species, including Escherichia coli and Salmonella enterica serovars. Due to its polyvalent nature, the bacteriophage fp01 has potential applications in the biomedical, food and agricultural industries. Also, fundamental aspects of polyvalent bacteriophage biology are unknown. In this study, we sequenced and described the complete genome of the polyvalent phage fp01 (MH745368.2) using long- (MinION, Nanopore) and short-reads (MiSeq, Illumina) sequencing. The bacteriophage fp01 genome has 109,515 bp, double-stranded DNA with an average G+C content of 39%, and 158 coding sequences (CDSs). Phage fp01 has genes with high similarity to Escherichia coli, Salmonella enterica, and Shigella sp. phages. Phylogenetic analyses indicated that the phage fp01 is a new Tequintavirus fp01 specie. Receptor binding protein gp108 was identified as potentially responsible for fp01 polyvalent characteristics, which binds to conserved amino acid regions of the FhuA receptor of Enterobacteriaceae.
Collapse
Affiliation(s)
- Ignacio Vasquez
- Marine Microbial Pathogenesis and Vaccinology Laboratory, Department of Ocean Science, Memorial University, St. John’s, NL A1C 5S7, Canada
| | - Julio Retamales
- Instituto de Ciencias Naturales, Universidad de las Américas, Viña del Mar 2520000, Chile
| | - Barbara Parra
- Subdepartment of Molecular Genetics, Public Health Institute of Chile, Santiago 9140000, Chile
| | - Vimbai Machimbirike
- Marine Microbial Pathogenesis and Vaccinology Laboratory, Department of Ocean Science, Memorial University, St. John’s, NL A1C 5S7, Canada
| | - James Robeson
- Laboratory of Microbiology, Institute of Biology, Pontifical Catholic University of Valparaíso, Valparaiso 2370000, Chile
- Correspondence: (J.R.); (J.S.)
| | - Javier Santander
- Marine Microbial Pathogenesis and Vaccinology Laboratory, Department of Ocean Science, Memorial University, St. John’s, NL A1C 5S7, Canada
- Correspondence: (J.R.); (J.S.)
| |
Collapse
|
15
|
Miguel GA, Carlsen S, Arneborg N, Saerens SM, Laulund S, Knudsen GM. Non-Saccharomyces yeasts for beer production: Insights into safety aspects and considerations. Int J Food Microbiol 2022; 383:109951. [DOI: 10.1016/j.ijfoodmicro.2022.109951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 09/14/2022] [Accepted: 09/22/2022] [Indexed: 11/05/2022]
|
16
|
Salzberg LI, Martos AAR, Lombardi L, Jermiin LS, Blanco A, Byrne KP, Wolfe KH. A widespread inversion polymorphism conserved among Saccharomyces species is caused by recurrent homogenization of a sporulation gene family. PLoS Genet 2022; 18:e1010525. [PMID: 36441813 PMCID: PMC9731477 DOI: 10.1371/journal.pgen.1010525] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 12/08/2022] [Accepted: 11/12/2022] [Indexed: 11/29/2022] Open
Abstract
Saccharomyces genomes are highly collinear and show relatively little structural variation, both within and between species of this yeast genus. We investigated the only common inversion polymorphism known in S. cerevisiae, which affects a 24-kb 'flip/flop' region containing 15 genes near the centromere of chromosome XIV. The region exists in two orientations, called reference (REF) and inverted (INV). Meiotic recombination in this region is suppressed in crosses between REF and INV orientation strains such as the BY x RM cross. We find that the inversion polymorphism is at least 17 million years old because it is conserved across the genus Saccharomyces. However, the REF and INV isomers are not ancient alleles but are continually being re-created by re-inversion of the region within each species. Inversion occurs due to continual homogenization of two almost identical 4-kb sequences that form an inverted repeat (IR) at the ends of the flip/flop region. The IR consists of two pairs of genes that are specifically and strongly expressed during the late stages of sporulation. We show that one of these gene pairs, YNL018C/YNL034W, codes for a protein that is essential for spore formation. YNL018C and YNL034W are the founder members of a gene family, Centroid, whose members in other Saccharomycetaceae species evolve fast, duplicate frequently, and are preferentially located close to centromeres. We tested the hypothesis that Centroid genes are a meiotic drive system, but found no support for this idea.
Collapse
Affiliation(s)
- Letal I. Salzberg
- Conway Institute, University College Dublin, Dublin, Ireland
- School of Medicine, University College Dublin, Dublin, Ireland
| | - Alexandre A. R. Martos
- Conway Institute, University College Dublin, Dublin, Ireland
- School of Medicine, University College Dublin, Dublin, Ireland
| | - Lisa Lombardi
- Conway Institute, University College Dublin, Dublin, Ireland
- School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland
| | - Lars S. Jermiin
- School of Medicine, University College Dublin, Dublin, Ireland
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
- Earth Institute, University College Dublin, Dublin, Ireland
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Alfonso Blanco
- Conway Institute, University College Dublin, Dublin, Ireland
| | - Kevin P. Byrne
- Conway Institute, University College Dublin, Dublin, Ireland
- School of Medicine, University College Dublin, Dublin, Ireland
| | - Kenneth H. Wolfe
- Conway Institute, University College Dublin, Dublin, Ireland
- School of Medicine, University College Dublin, Dublin, Ireland
- * E-mail:
| |
Collapse
|
17
|
Liu D, Xie X, Tong B, Zhou C, Qu K, Guo H, Zhao Z, El-Kassaby YA, Li W, Li W. A high -quality genome assembly and annotation of Quercus acutissima Carruth. FRONTIERS IN PLANT SCIENCE 2022; 13:1068802. [PMID: 36507419 PMCID: PMC9729791 DOI: 10.3389/fpls.2022.1068802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 11/02/2022] [Indexed: 06/17/2023]
Abstract
INTRODUCTION Quercus acutissima is an economic and ecological tree species often used for afforestation of arid and semi-arid lands and is considered as an excellent tree for soil and water conservation. METHODS Here, we combined PacBio long reads, Hi-C, and Illumina short reads to assemble Q. acutissima genome. RESULTS We generated a 957.1 Mb genome with a contig N50 of 1.2 Mb and scaffold N50 of 77.0 Mb. The repetitive sequences constituted 55.63% of the genome, among which long terminal repeats were the majority and accounted for 23.07% of the genome. Ab initio, homology-based and RNA sequence-based gene prediction identified 29,889 protein-coding genes, of which 82.6% could be functionally annotated. Phylogenetic analysis showed that Q. acutissima and Q. variabilis were differentiated around 3.6 million years ago, and showed no evidence of species-specific whole genome duplication. CONCLUSION The assembled and annotated high-quality Q. acutissima genome not only promises to accelerate the species molecular biology studies and breeding, but also promotes genome level evolutionary studies.
Collapse
Affiliation(s)
- Dan Liu
- National Engineering Research Center of Tree Breeding and Ecological Restoration, State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
- Shandong Provincial Center of Forest and Grass Germplasm Resources, Jinan, China
| | - Xiaoman Xie
- Shandong Provincial Center of Forest and Grass Germplasm Resources, Jinan, China
| | - Boqiang Tong
- Shandong Provincial Center of Forest and Grass Germplasm Resources, Jinan, China
| | - Chengcheng Zhou
- National Engineering Research Center of Tree Breeding and Ecological Restoration, State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Kai Qu
- National Engineering Research Center of Tree Breeding and Ecological Restoration, State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Haili Guo
- Shandong Provincial Center of Forest and Grass Germplasm Resources, Jinan, China
| | - Zhiheng Zhao
- National Engineering Research Center of Tree Breeding and Ecological Restoration, State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Yousry A. El-Kassaby
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Wei Li
- National Engineering Research Center of Tree Breeding and Ecological Restoration, State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Wenqing Li
- Shandong Provincial Center of Forest and Grass Germplasm Resources, Jinan, China
| |
Collapse
|
18
|
MinION Whole-Genome Sequencing in Resource-Limited Settings: Challenges and Opportunities. CURRENT CLINICAL MICROBIOLOGY REPORTS 2022; 9:52-59. [DOI: 10.1007/s40588-022-00183-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2022] [Indexed: 11/18/2022]
Abstract
Abstract
Purpose of Review
The introduction of MinION whole-genome sequencing technology greatly increased and simplified complete genome sequencing in various fields of science across the globe. Sequences have been generated from complex organisms to microorganisms and are stored in genome databases that are readily accessible by researchers. Various new software for genome analysis, along with upgrades to older software packages, are being generated. New protocols are also being validated that enable WGS technology to be rapidly and increasingly used for sequencing in field settings.
Recent Findings
MinION WGS technology has been implemented in developed countries due to its advantages: portability, real-time analysis, and lower cost compared to other sequencing technologies. While these same advantages are critical in developing countries, MinION WGS technology is still under-utilized in resource-limited settings.
Summary
In this review, we look at the applications, advantages, challenges, and opportunities of using MinION WGS in resource-limited settings.
Collapse
|
19
|
Wang YX, Chen HF, Yin ZY, Chen WL, Lu LT. The genetic adaptations of Toxoptera aurantii facilitated its rapid multiple plant hosts dispersal and invasion. Genomics 2022; 114:110472. [PMID: 36055573 DOI: 10.1016/j.ygeno.2022.110472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 08/24/2022] [Accepted: 08/27/2022] [Indexed: 01/14/2023]
Abstract
Toxoptera aurantii Boyer de Fonscolombe (Hemiptera: Aphididae) can attack many plant hosts, including tea (Camellia sinensis L.), citrus (Citrus spp.), lychee (Litchi chinensis Sonn.), banana (Musa spp.), and pineapple (Ananas comasus L.) among others. It is a widely distributed hexapod and one of the most destructive pests in tea plantations, causing enormous economic losses in tea production each year. A high-quality reference genome is important to study the phylogenetics and evolution of T. aurantii because its genome is highly heterozygous and repetitive. We obtained a de novo genome assembly of T. aurantii at the chromosome level using a combination of long Nanopore reads from sequencing with high-throughput chromosome conformation capture technology. When finally assembled, the genome was 318.95 Mb on four chromosomes with a 15.19 Mb scaffold N50. A total of 12,162 genes encoded proteins, while there were 22.01% repetitive sequences that totaled 67.73 Mb. Phylogenetic analyses revealed that T. aurantii and Aphis gossypii parted ways approximately 7.6 million years ago (Mya). We used a combination of long-read single-molecule sequencing with Hi-C-based chromatin interaction maps that resulted in a reference chromosomal level reference genome of T. aurantii that was high quality. Our results will enable the exploration of the genetics behind the special biological features of T. aurantii and also provide a source of data that should be useful to compare the compare genome among the Hemiptera.
Collapse
Affiliation(s)
- Yan-Xia Wang
- Provincial Key Laboratory for Agricultural Pest Management of Mountainous Region, Institute of Entomology, Guizhou University, Guiyang 550025, China; College of Tea Science, Guizhou University, Guiyang 550025, China
| | - Hu-Fang Chen
- College of Tea Science, Guizhou University, Guiyang 550025, China
| | - Zheng-Yan Yin
- Provincial Key Laboratory for Agricultural Pest Management of Mountainous Region, Institute of Entomology, Guizhou University, Guiyang 550025, China
| | - Wen-Long Chen
- Provincial Key Laboratory for Agricultural Pest Management of Mountainous Region, Institute of Entomology, Guizhou University, Guiyang 550025, China.
| | - Li-Tang Lu
- College of Tea Science, Guizhou University, Guiyang 550025, China.
| |
Collapse
|
20
|
Yang J, Ling C, Zhang H, Hussain Q, Lyu S, Zheng G, Liu Y. A Comparative Genomics Approach for Analysis of Complete Mitogenomes of Five Actinidiaceae Plants. Genes (Basel) 2022; 13:genes13101827. [PMID: 36292711 PMCID: PMC9601400 DOI: 10.3390/genes13101827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 10/01/2022] [Accepted: 10/01/2022] [Indexed: 11/04/2022] Open
Abstract
Actinidiaceae, an economically important plant family, includes the Actinidia, Clematoclethra and Saurauia genus. Kiwifruit, with remarkably high vitamin C content, is an endemic species widely distributed in China with high economic value. Although many Actinidiaceae chloroplast genomes have been reported, few complete mitogenomes of Actinidiaceae have been studied. Here, complete circular mitogenomes of the four kiwifruit species and Saurauia tristyla were assembled. Codon usage, sequence repeats, RNA editing, gene transfers, selective pressure, and phylogenetic relationships in the four kiwifruit species and S. tristyla were comparatively analyzed. This research will contribute to the study of phylogenetic relationships within Actiniaceae and molecular barcoding in kiwifruit.
Collapse
Affiliation(s)
- Jun Yang
- College of Horticulture, Anhui Agriculture University, Hefei 350002, China
| | - Chengcheng Ling
- College of Horticulture, Anhui Agriculture University, Hefei 350002, China
| | - Huamin Zhang
- College of Horticulture, Anhui Agriculture University, Hefei 350002, China
| | - Quaid Hussain
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Hangzhou 311300, China
| | - Shiheng Lyu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Hangzhou 311300, China
| | - Guohua Zheng
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Correspondence: (G.Z.); (Y.L.)
| | - Yongsheng Liu
- College of Horticulture, Anhui Agriculture University, Hefei 350002, China
- Correspondence: (G.Z.); (Y.L.)
| |
Collapse
|
21
|
Lang J, Sun J, Yang Z, He L, He Y, Chen Y, Huang L, Li P, Li J, Qin L. Nano2NGS-Muta: a framework for converting nanopore sequencing data to NGS-liked sequencing data for hotspot mutation detection. NAR Genom Bioinform 2022; 4:lqac033. [PMID: 35464239 PMCID: PMC9022462 DOI: 10.1093/nargab/lqac033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Revised: 03/30/2022] [Accepted: 04/13/2022] [Indexed: 12/12/2022] Open
Abstract
Nanopore sequencing, also known as single-molecule real-time sequencing, is a third/fourth generation sequencing technology that enables deciphering single DNA/RNA molecules without the polymerase chain reaction. Although nanopore sequencing has made significant progress in scientific research and clinical practice, its application has been limited compared with next-generation sequencing (NGS) due to specific design principle and data characteristics, especially in hotspot mutation detection. Therefore, we developed Nano2NGS-Muta as a data analysis framework for hotspot mutation detection based on long reads from nanopore sequencing. Nano2NGS-Muta is characterized by applying nanopore sequencing data to NGS-liked data analysis pipelines. Long reads can be converted into short reads and then processed through existing NGS analysis pipelines in combination with statistical methods for hotspot mutation detection. Nano2NGS-Muta not only effectively avoids false positive/negative results caused by non-random errors and unexpected insertions-deletions (indels) of nanopore sequencing data, improves the detection accuracy of hotspot mutations compared to conventional nanopore sequencing data analysis algorithms but also breaks the barriers of data analysis methods between short-read sequencing and long-read sequencing. We hope Nano2NGS-Muta can serves as a reference method for nanopore sequencing data and promotes higher application scope of nanopore sequencing technology in scientific research and clinical practice.
Collapse
Affiliation(s)
- Jidong Lang
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Jiguo Sun
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Zhi Yang
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Lei He
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Yu He
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Yanmei Chen
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Lei Huang
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Ping Li
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Jialin Li
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| | - Liu Qin
- Bioinformatics and Product Development Department, Qitan Technology (Beijing) Co., Ltd, Beijing 100192, China
| |
Collapse
|
22
|
Segal-Kischinevzky C, Romero-Aguilar L, Alcaraz LD, López-Ortiz G, Martínez-Castillo B, Torres-Ramírez N, Sandoval G, González J. Yeasts Inhabiting Extreme Environments and Their Biotechnological Applications. Microorganisms 2022; 10:794. [PMID: 35456844 PMCID: PMC9028089 DOI: 10.3390/microorganisms10040794] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 04/06/2022] [Accepted: 04/06/2022] [Indexed: 11/17/2022] Open
Abstract
Yeasts are microscopic fungi inhabiting all Earth environments, including those inhospitable for most life forms, considered extreme environments. According to their habitats, yeasts could be extremotolerant or extremophiles. Some are polyextremophiles, depending on their growth capacity, tolerance, and survival in the face of their habitat's physical and chemical constitution. The extreme yeasts are relevant for the industrial production of value-added compounds, such as biofuels, lipids, carotenoids, recombinant proteins, enzymes, among others. This review calls attention to the importance of yeasts inhabiting extreme environments, including metabolic and adaptive aspects to tolerate conditions of cold, heat, water availability, pH, salinity, osmolarity, UV radiation, and metal toxicity, which are relevant for biotechnological applications. We explore the habitats of extreme yeasts, highlighting key species, physiology, adaptations, and molecular identification. Finally, we summarize several findings related to the industrially-important extremophilic yeasts and describe current trends in biotechnological applications that will impact the bioeconomy.
Collapse
Affiliation(s)
- Claudia Segal-Kischinevzky
- Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico; (C.S.-K.); (L.D.A.); (B.M.-C.); (N.T.-R.)
| | - Lucero Romero-Aguilar
- Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico;
| | - Luis D. Alcaraz
- Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico; (C.S.-K.); (L.D.A.); (B.M.-C.); (N.T.-R.)
| | - Geovani López-Ortiz
- Subdivisión de Medicina Familiar, Facultad de Medicina, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico;
| | - Blanca Martínez-Castillo
- Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico; (C.S.-K.); (L.D.A.); (B.M.-C.); (N.T.-R.)
| | - Nayeli Torres-Ramírez
- Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico; (C.S.-K.); (L.D.A.); (B.M.-C.); (N.T.-R.)
| | - Georgina Sandoval
- Laboratorio de Innovación en Bioenergéticos y Bioprocesos Avanzados (LIBBA), Unidad de Biotecnología Industrial, Centro de Investigación y Asistencia en Tecnología y Diseño del Estado de Jalisco AC (CIATEJ), Av. Normalistas No. 800 Col. Colinas de la Normal, Guadalajara 44270, Mexico;
| | - James González
- Departamento de Biología Celular, Facultad de Ciencias, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacán, Mexico City 04510, Mexico; (C.S.-K.); (L.D.A.); (B.M.-C.); (N.T.-R.)
| |
Collapse
|
23
|
Yu J, Xia M, Wang Y, Chi X, Xu H, Chen S, Zhang F. Short and long reads chloroplast genome assemblies and phylogenomics of Artemisia tangutica (Asteraceae). Biologia (Bratisl) 2022. [DOI: 10.1007/s11756-021-00951-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
24
|
Saada OA, Tsouris A, Large C, Friedrich A, Dunham MJ, Schacherer J. Phased polyploid genomes provide deeper insight into the multiple origins of domesticated Saccharomyces cerevisiae beer yeasts. Curr Biol 2022; 32:1350-1361.e3. [PMID: 35180385 DOI: 10.1016/j.cub.2022.01.068] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 12/02/2021] [Accepted: 01/21/2022] [Indexed: 10/19/2022]
Abstract
Yeasts, and in particular Saccharomyces cerevisiae, have been used for brewing beer for thousands of years. Population genomic surveys highlighted that beer yeasts are polyphyletic, with the emergence of different domesticated subpopulations characterized by high genetic diversity and ploidy level. However, the different origins of these subpopulations are still unclear as reconstruction of polyploid genomes is required. To gain better insight into the differential evolutionary trajectories, we sequenced the genomes of 35 Saccharomyces cerevisiae isolates coming from different beer-brewing clades, using a long-read sequencing strategy. By phasing the genomes and using a windowed approach, we identified three main beer subpopulations based on allelic content (European dominant, Asian dominant, and African beer). They were derived from different admixtures between populations and are characterized by distinctive genomic patterns. By comparing the fully phased genes, the most diverse in our dataset are enriched for functions relevant to the brewing environment such as carbon metabolism, oxidoreduction, and cell wall organization activity. Finally, independent domestication, evolution, and adaptation events across subpopulations were also highlighted by investigating specific genes previously linked to the brewing process. Altogether, our analysis based on phased polyploid genomes has led to new insight into the contrasting evolutionary history of beer isolates.
Collapse
Affiliation(s)
- Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Andreas Tsouris
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Chris Large
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Maitreya J Dunham
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France; Institut Universitaire de France (IUF), Paris, France.
| |
Collapse
|
25
|
Eberlein C, Abou Saada O, Friedrich A, Albertin W, Schacherer J. Different trajectories of polyploidization shape the genomic landscape of the Brettanomyces bruxellensis yeast species. Genome Res 2021; 31:2316-2326. [PMID: 34815309 PMCID: PMC8647821 DOI: 10.1101/gr.275380.121] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 10/25/2021] [Indexed: 01/01/2023]
Abstract
Polyploidization events are observed across the tree of life and occur in many fungi, plant, and animal species. During evolution, polyploidy is thought to be an important source of speciation and tumorigenesis. However, the origin of polyploid populations is not always clear, and little is known about the precise nature and structure of their complex genome. Using a long-read sequencing strategy, we sequenced 71 strains from the Brettanomyces bruxellensis yeast species, which is found in anthropized environments (e.g., beer, contaminant of wine, kombucha, and ethanol production) and characterized by several polyploid subpopulations. To reconstruct the polyploid genomes, we phased them by using different strategies and found that each subpopulation had a unique polyploidization history with distinct trajectories. The polyploid genomes contain either genetically closely related (with a genetic divergence <1%) or diverged copies (>3%), indicating auto- as well as allopolyploidization events. These latest events have occurred independently with a specific and unique donor in each of the polyploid subpopulations and exclude the known Brettanomyces sister species as possible donors. Finally, loss of heterozygosity events has shaped the structure of these polyploid genomes and underline their dynamics. Overall, our study highlights the multiplicity of the trajectories leading to polyploid genomes within the same species.
Collapse
Affiliation(s)
- Chris Eberlein
- Université de Strasbourg, CNRS, GMGM UMR 7156, 67000 Strasbourg, France
| | - Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR 7156, 67000 Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, 67000 Strasbourg, France
| | - Warren Albertin
- Université de Bordeaux, ISVV, Unité de Recherche Œnologie EA 4577, USC 1366 INRA, Bordeaux INP, F-33140 Villenave d'Ornon, France
- ENSCBP, Bordeaux INP, 33600 Pessac, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, 67000 Strasbourg, France
- Institut Universitaire de France (IUF), 75231 Paris, France
| |
Collapse
|
26
|
Zhang S, Gao X, Wang L, Jiang W, Su H, Jing T, Cui J, Zhang L, Yang Y. Chromosome-level genome assemblies of two cotton-melon aphid Aphis gossypii biotypes unveil mechanisms of host adaption. Mol Ecol Resour 2021; 22:1120-1134. [PMID: 34601821 DOI: 10.1111/1755-0998.13521] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 09/22/2021] [Accepted: 09/24/2021] [Indexed: 11/30/2022]
Abstract
The cotton-melon aphid Aphis gossypii is a sap-sucking insect that is considered a serious global pest. The species is distributed over a large geographical range and uses a wide variety of hosts, with some populations being specialized to attack different plant species. Here, we provide de novo chromosome-level genome assemblies of a cotton specialist population (Hap1) and a cucurbit specialist population (Hap3). We achieved this by using a combination of third-generation sequencing platforms, namely Illumina and Hi-C sequencing technologies. We were able to anchor a total of 334.89 Mb (scaffold N50 of 89.13 Mb) and 359.95 Mb (scaffold N50 of 68.88 Mb) to four chromosomes for Hap1 and Hap3, respectively. Moreover, our results showed that the X-chromosome of Hap3 (113.01 Mb) was significantly longer than that of Hap1 (100.26 Mb), with a high level of sequence conservation between the aphid species. We also report variation in the number of protein-coding genes and repeat sequences between Hap1 and Hap3. In particular, olfactory and gustatory receptor genes underwent a high level of gene duplication and expansion events in A. gossypii, including between Hap1 and Hap3. Moreover, we identified two glutathione S-transferase genes which underwent single gene duplications in Hap3, and tandem duplication and inversion events affecting the cytochrome P450 monooxygenase between Hap1 and Hap3, all of which include the CYP3 family. Our results illustrate the variance in the genomic composition of two specialized A. gossypii populations and provide a helpful resource for the study of aphid population evolution, host adaption and insecticide resistance.
Collapse
Affiliation(s)
- Shuai Zhang
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, China
| | - Xueke Gao
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Li Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Weili Jiang
- Basic Experimental Teaching Center of Life Sciences, Yangzhou University, Yangzhou, China
| | - Honghua Su
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, China
| | - Tianxing Jing
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, China
| | - Jinjie Cui
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Lijuan Zhang
- Department of Entomology, College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Yizhong Yang
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, China
| |
Collapse
|
27
|
Wang B, Yang X, Jia Y, Xu Y, Jia P, Dang N, Wang S, Xu T, Zhao X, Gao S, Dong Q, Ye K. High-quality Arabidopsis thaliana Genome Assembly with Nanopore and HiFi Long Reads. GENOMICS, PROTEOMICS & BIOINFORMATICS 2021; 20:4-13. [PMID: 34487862 PMCID: PMC9510872 DOI: 10.1016/j.gpb.2021.08.003] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 02/08/2023]
Abstract
Arabidopsis thaliana is an important and long-established model species for plant molecular biology, genetics, epigenetics, and genomics. However, the latest version of reference genome still contains significant number of missing segments. Here, we report a high-quality and almost complete Col-0 genome assembly with two gaps (Col-XJTU) using combination of Oxford Nanopore Technology ultra-long reads, PacBio high-fidelity long reads, and Hi-C data. The total genome assembly size is 133,725,193 bp, introducing 14.6 Mb of novel sequences compared to the TAIR10.1 reference genome. All five chromosomes of Col-XJTU assembly are highly accurate with consensus quality (QV) scores > 60 (ranging from 62 to 68), which are higher than those of TAIR10.1 reference (QV scores ranging from 45 to 52). We have completely resolved chromosome (Chr) 3 and Chr5 in a telomere-to-telomere manner. Chr4 has been completely resolved except the nucleolar organizing regions, which comprise long repetitive DNA fragments. The Chr1 centromere (CEN1), reportedly around 9 Mb in length, is particularly challenging to assemble due to the presence of tens of thousands of CEN180 satellite repeats. Using the cutting-edge sequencing data and novel computational approaches, we assembled about 4 Mb of sequence for CEN1 and a 3.5-Mb-long CEN2. We investigated the structure and epigenetics of centromeres. We detected four clusters of CEN180 monomers, and found that the centromere-specific histone H3-like protein (CENH3) exhibits a strong preference for CEN180 cluster 3. Moreover, we observed hypomethylation patterns in CENH3-enriched regions. We believe that this high-quality genome assembly, Col-XJTU, would serve as a valuable reference to better understand the global pattern of centromeric polymorphisms, as well as genetic and epigenetic features in plants.
Collapse
Affiliation(s)
- Bo Wang
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China
| | - Xiaofei Yang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China.
| | - Yanyan Jia
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China
| | - Yu Xu
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, China
| | - Peng Jia
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Ningxin Dang
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China
| | - Songbo Wang
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Tun Xu
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Xixi Zhao
- Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China
| | - Shenghan Gao
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
| | - Quanbin Dong
- Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China
| | - Kai Ye
- MOE Key Laboratory for Intelligent Networks & Network Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, 710049, China; School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, China; School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China; Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China.
| |
Collapse
|
28
|
Wan H, Wei Q, Ji Q, Lan H, Dai X, Chen W, Dong Y, Zeng C. The karyotype, genome survey, and assembly of Mud artemisia (Artemisia selengensis). Mol Biol Rep 2021; 48:5897-5904. [PMID: 34297325 DOI: 10.1007/s11033-021-06584-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 07/20/2021] [Indexed: 11/30/2022]
Abstract
BACKGROUND Artemisia selengensis is traditional Chinese medicine and phytochemical analysis indicated that A. selengensis contains essential oils, fatty acids and phenolic acids. The lack of reference genomic information may lead to tardiness in molecular biology research of A. selengensis. METHOD AND RESULTS Karyotype analysis, genome survey, and genome assembly was employed to acquire information on the genome structure of A. selengensis. The chromosome number is 2n = 2x = 36, karyotype formula is 28 m + 8Sm, karyotype asymmetry coefficient is 58.8%, and karyotypes were symmetric to Stebbins' type 2A. Besides, the flow cytometry findings reported that the mean peak value of fluorescent intensity is 1,170,677, 2C DNA content is 12 pg and the genome size was estimated to be approximately 5.87 Gb. Furthermore, the genome survey generates 341,478,078 clean reads, unfortunately, after K-mer analysis, no significant peak can be observed, the heterozygosity, repetitive rate and genome size was unable to estimated. It is speculated that this phenomenon might be due to the complexity of genome structure. 37,266 contigs are preliminary assembled with Oxford Nanopore Technology (ONT) sequencing, totaling 804 Mb and GC content was 34.08%. The total length is 804,475,881 bp, N50 is 29,624 bp, and the largest contig length is 239,792 bp. CONCLUSION This study reveals the preliminary information of genome size of A. selengensis. These findings may provide supportive information for sequencing and assembly of whole-genome sequencing and encourage the progress of functional gene discovery, genetic improvement, evolutionary study, and structural studies of A. selengensis.
Collapse
Affiliation(s)
- Heping Wan
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Qingying Wei
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Qiangqiang Ji
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Hong Lan
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Xigang Dai
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Weida Chen
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Yuanhuo Dong
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China
| | - Changli Zeng
- Hubei Engineering Research Center for Protection and Utilization of Special Biological Resources in the Hanjiang River BasinSchool of Life Science, Jianghan University, Wuhan, 430056, China.
| |
Collapse
|
29
|
Saccharomyces cerevisiae strains used industrially for bioethanol production. Essays Biochem 2021; 65:147-161. [PMID: 34156078 DOI: 10.1042/ebc20200160] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 05/25/2021] [Accepted: 05/28/2021] [Indexed: 01/21/2023]
Abstract
Fuel ethanol is produced by the yeast Saccharomyces cerevisiae mainly from corn starch in the United States and from sugarcane sucrose in Brazil, which together manufacture ∼85% of a global yearly production of 109.8 million m3 (in 2019). While in North America genetically engineered (GE) strains account for ∼80% of the ethanol produced, including strains that express amylases and are engineered to produce higher ethanol yields; in South America, mostly (>90%) non-GE strains are used in ethanol production, primarily as starters in non-aseptic fermentation systems with cell recycling. In spite of intensive research exploring lignocellulosic ethanol (or second generation ethanol), this option still accounts for <1% of global ethanol production. In this mini-review, we describe the main aspects of fuel ethanol production, emphasizing bioprocesses operating in North America and Brazil. We list and describe the main properties of several commercial yeast products (i.e., yeast strains) that are available worldwide to bioethanol producers, including GE strains with their respective genetic modifications. We also discuss recent studies that have started to shed light on the genes and traits that are important for the persistence and dominance of yeast strains in the non-aseptic process in Brazil. While Brazilian bioethanol yeast strains originated from a historical process of domestication for sugarcane fermentation, leading to a unique group with significant economic applications, in U.S.A., guided selection, breeding and genetic engineering approaches have driven the generation of new yeast products for the market.
Collapse
|
30
|
Massaiu I, Songia P, Chiesa M, Valerio V, Moschetta D, Alfieri V, Myasoedova VA, Schmid M, Cassetta L, Colombo GI, D’Alessandra Y, Poggio P. Evaluation of Oxford Nanopore MinION RNA-Seq Performance for Human Primary Cells. Int J Mol Sci 2021; 22:ijms22126317. [PMID: 34204756 PMCID: PMC8231517 DOI: 10.3390/ijms22126317] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 05/17/2021] [Accepted: 06/09/2021] [Indexed: 12/12/2022] Open
Abstract
Transcript sequencing is a crucial tool for gaining a deep understanding of biological processes in diagnostic and clinical medicine. Given their potential to study novel complex eukaryotic transcriptomes, long-read sequencing technologies are able to overcome some limitations of short-read RNA-Seq approaches. Oxford Nanopore Technologies (ONT) offers the ability to generate long-read sequencing data in real time via portable protein nanopore USB devices. This work aimed to provide the user with the number of reads that should be sequenced, through the ONT MinION platform, to reach the desired accuracy level for a human cell RNA study. We sequenced three cDNA libraries prepared from poly-adenosine RNA of human primary cardiac fibroblasts. Since the runs were comparable, they were combined in a total dataset of 48 million reads. Synthetic datasets with different sizes were generated starting from the total and analyzed in terms of the number of identified genes and their expression levels. As expected, an improved sensitivity was obtained, increasing the sequencing depth, particularly for the non-coding genes. The reliability of expression levels was assayed by (i) comparison with PCR quantifications of selected genes and (ii) by the implementation of a user-friendly multiplexing method in a single run.
Collapse
Affiliation(s)
- Ilaria Massaiu
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Paola Songia
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Mattia Chiesa
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Vincenza Valerio
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
- Dipartimento di Medicina Clinica e Chirurgia, Università degli Studi di Napoli Federico II, 80131 Napoli, Italy
| | - Donato Moschetta
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
- Dipartimento di Scienze Farmacologiche e Biomolecolari, Università degli Studi di Milano, 20133 Milano, Italy
| | - Valentina Alfieri
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Veronika A. Myasoedova
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Michael Schmid
- Genexa AG, Dienerstrasse 7, CH-8004 Zürich, Switzerland;
| | - Luca Cassetta
- The Queen’s Medical Research Council Centre for Reproductive Health, University of Edinburgh, Edinburgh EH16 4TJ, UK;
| | - Gualtiero I. Colombo
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Yuri D’Alessandra
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
| | - Paolo Poggio
- Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy; (I.M.); (P.S.); (M.C.); (V.V.); (D.M.); (V.A.); (V.A.M.); (G.I.C.); (Y.D.)
- Correspondence:
| |
Collapse
|
31
|
Bleykasten-Grosshans C, Fabrizio R, Friedrich A, Schacherer J. Species-wide transposable element repertoires retrace the evolutionary history of the Saccharomyces cerevisiae host. Mol Biol Evol 2021; 38:4334-4345. [PMID: 34115140 PMCID: PMC8476168 DOI: 10.1093/molbev/msab171] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Transposable elements (TE) are an important source of genetic variation with a dynamic and content that greatly differ in a wide range of species. The origin of the intraspecific content variation is not always clear and little is known about the precise nature of it. Here, we surveyed the species-wide content of the Ty LTR-retrotransposons in a broad collection of 1,011 Saccharomyces cerevisiae natural isolates to understand what can stand behind the variation of the repertoire that is the type and number of Ty elements. We have compiled an exhaustive catalog of all the TE sequence variants present in the S. cerevisiae species by identifying a large set of new sequence variants. The characterization of the TE content in each isolate clearly highlighted that each subpopulation exhibits a unique and specific repertoire, retracing the evolutionary history of the species. Most interestingly, we have shown that ancient interspecific hybridization events had a major impact in the birth of new sequence variants and therefore in the shaping of the TE repertoires. We also investigated the transpositional activity of these elements in a large set of natural isolates, and we found a broad variability related to the level of ploidy as well as the genetic background. Overall, our results pointed out that the evolution of the Ty content is deeply impacted by clade-specific events such as introgressions and therefore follows the population structure. In addition, our study lays the foundation for future investigations to better understand the transpositional regulation and more broadly the TE–host interactions.
Collapse
Affiliation(s)
| | - Romeo Fabrizio
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France.,Institut Universitaire de France (IUF)
| |
Collapse
|
32
|
Li Y, Leveau A, Zhao Q, Feng Q, Lu H, Miao J, Xue Z, Martin AC, Wegel E, Wang J, Orme A, Rey MD, Karafiátová M, Vrána J, Steuernagel B, Joynson R, Owen C, Reed J, Louveau T, Stephenson MJ, Zhang L, Huang X, Huang T, Fan D, Zhou C, Tian Q, Li W, Lu Y, Chen J, Zhao Y, Lu Y, Zhu C, Liu Z, Polturak G, Casson R, Hill L, Moore G, Melton R, Hall N, Wulff BBH, Doležel J, Langdon T, Han B, Osbourn A. Subtelomeric assembly of a multi-gene pathway for antimicrobial defense compounds in cereals. Nat Commun 2021; 12:2563. [PMID: 33963185 PMCID: PMC8105312 DOI: 10.1038/s41467-021-22920-8] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 04/07/2021] [Indexed: 02/06/2023] Open
Abstract
Non-random gene organization in eukaryotes plays a significant role in genome evolution. Here, we investigate the origin of a biosynthetic gene cluster for production of defence compounds in oat-the avenacin cluster. We elucidate the structure and organisation of this 12-gene cluster, characterise the last two missing pathway steps, and reconstitute the entire pathway in tobacco by transient expression. We show that the cluster has formed de novo since the divergence of oats in a subtelomeric region of the genome that lacks homology with other grasses, and that gene order is approximately colinear with the biosynthetic pathway. We speculate that the positioning of the late pathway genes furthest away from the telomere may mitigate against a 'self-poisoning' scenario in which toxic intermediates accumulate as a result of telomeric gene deletions. Our investigations reveal a striking example of adaptive evolution underpinned by remarkable genome plasticity.
Collapse
Affiliation(s)
- Yan Li
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | | | - Qiang Zhao
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Qi Feng
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Hengyun Lu
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Jiashun Miao
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Zheyong Xue
- John Innes Centre, Norwich Research Park, Norwich, UK
| | | | - Eva Wegel
- John Innes Centre, Norwich Research Park, Norwich, UK
| | - Jing Wang
- John Innes Centre, Norwich Research Park, Norwich, UK
| | | | | | - Miroslava Karafiátová
- Institute of Experimental Botany of the Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Jan Vrána
- Institute of Experimental Botany of the Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | | | - Ryan Joynson
- Earlham Institute, Norwich Research Park, Norwich, UK
| | | | - James Reed
- John Innes Centre, Norwich Research Park, Norwich, UK
| | | | | | - Lei Zhang
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Xuehui Huang
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Tao Huang
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Danling Fan
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Congcong Zhou
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Qilin Tian
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Wenjun Li
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Yiqi Lu
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Jiaying Chen
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Yan Zhao
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Ying Lu
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Chuanrang Zhu
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China
| | - Zhenhua Liu
- Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Guy Polturak
- John Innes Centre, Norwich Research Park, Norwich, UK
| | | | - Lionel Hill
- John Innes Centre, Norwich Research Park, Norwich, UK
| | - Graham Moore
- John Innes Centre, Norwich Research Park, Norwich, UK
| | - Rachel Melton
- John Innes Centre, Norwich Research Park, Norwich, UK
| | - Neil Hall
- Earlham Institute, Norwich Research Park, Norwich, UK
| | | | - Jaroslav Doležel
- Institute of Experimental Botany of the Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Tim Langdon
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Gogerddan, Aberystwyth, Ceredigion, SY23 3EE, UK
| | - Bin Han
- National Centre for Gene Research, CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Centre of Excellence for Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences (CAS), Shanghai, China.
| | - Anne Osbourn
- John Innes Centre, Norwich Research Park, Norwich, UK.
| |
Collapse
|
33
|
Saxenborn P, Baxter J, Tilevik A, Fagerlind M, Dyrkell F, Pernestig AK, Enroth H, Tilevik D. Genotypic Characterization of Clinical Klebsiella spp. Isolates Collected From Patients With Suspected Community-Onset Sepsis, Sweden. Front Microbiol 2021; 12:640408. [PMID: 33995300 PMCID: PMC8120268 DOI: 10.3389/fmicb.2021.640408] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 04/13/2021] [Indexed: 02/02/2023] Open
Abstract
Klebsiella is a genus of Gram-negative bacteria known to be opportunistic pathogens that may cause a variety of infections in humans. Highly drug-resistant Klebsiella species, especially K. pneumoniae, have emerged rapidly and are becoming a major concern in clinical management. Although K. pneumoniae is considered the most important pathogen within the genus, the true clinical significance of the other species is likely underrecognized due to the inability of conventional microbiological methods to distinguish between the species leading to high rates of misidentification. Bacterial whole-genome sequencing (WGS) enables precise species identification and characterization that other technologies do not allow. Herein, we have characterized the diversity and traits of Klebsiella spp. in community-onset infections by WGS of clinical isolates (n = 105) collected during a prospective sepsis study in Sweden. The sequencing revealed that 32 of the 82 isolates (39.0%) initially identified as K. pneumoniae with routine microbiological methods based on cultures followed by matrix-assisted laser desorption-time of flight mass spectrometry (MALDI-TOF MS) had been misidentified. Of these, 23 were identified as Klebsiella variicola and nine as other members of the K. pneumoniae complex. Comparisons of the number of resistance genes showed that significantly fewer resistance genes were detected in Klebsiella oxytoca compared to K. pneumoniae and K. variicola (both values of p < 0.001). Moreover, a high proportion of the isolates within the K. pneumoniae complex were predicted to be genotypically multidrug-resistant (MDR; 79/84, 94.0%) in contrast to K. oxytoca (3/16, 18.8%) and Klebsiella michiganensis (0/4, 0.0%). All isolates predicted as genotypically MDR were found to harbor the combination of β-lactam, fosfomycin, and quinolone resistance markers. Multi-locus sequence typing (MLST) revealed a high diversity of sequence types among the Klebsiella spp. with ST14 (10.0%) and ST5429 (10.0%) as the most prevalent ones for K. pneumoniae, ST146 for K. variicola (12.0%), and ST176 for K. oxytoca (25.0%). In conclusion, the results from this study highlight the importance of using high-resolution genotypic methods for identification and characterization of clinical Klebsiella spp. isolates. Our findings indicate that infections caused by other members of the K. pneumoniae complex than K. pneumoniae are a more common clinical problem than previously described, mainly due to high rates of misidentifications.
Collapse
Affiliation(s)
- Patricia Saxenborn
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| | - John Baxter
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| | - Andreas Tilevik
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| | - Magnus Fagerlind
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| | | | - Anna-Karin Pernestig
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| | - Helena Enroth
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden.,Molecular Microbiology, Laboratory Medicine, Unilabs AB, Skövde, Sweden
| | - Diana Tilevik
- Systems Biology Research Centre, School of Bioscience, University of Skövde, Skövde, Sweden
| |
Collapse
|
34
|
Engelbrecht J, Duong TA, Prabhu SA, Seedat M, van den Berg N. Genome of the destructive oomycete Phytophthora cinnamomi provides insights into its pathogenicity and adaptive potential. BMC Genomics 2021; 22:302. [PMID: 33902447 PMCID: PMC8074420 DOI: 10.1186/s12864-021-07552-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 03/24/2021] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Phytophthora cinnamomi is an oomycete pathogen of global relevance. It is considered as one of the most invasive species, which has caused irreversible damage to natural ecosystems and horticultural crops. There is currently a lack of a high-quality reference genome for this species despite several attempts that have been made towards sequencing its genome. The lack of a good quality genome sequence has been a setback for various genetic and genomic research to be done on this species. As a consequence, little is known regarding its genome characteristics and how these contribute to its pathogenicity and invasiveness. RESULTS In this work we generated a high-quality genome sequence and annotation for P. cinnamomi using a combination of Oxford Nanopore and Illumina sequencing technologies. The annotation was done using RNA-Seq data as supporting gene evidence. The final assembly consisted of 133 scaffolds, with an estimated genome size of 109.7 Mb, N50 of 1.18 Mb, and BUSCO completeness score of 97.5%. Genome partitioning analysis revealed that P. cinnamomi has a two-speed genome characteristic, similar to that of other oomycetes and fungal plant pathogens. In planta gene expression analysis revealed up-regulation of pathogenicity-related genes, suggesting their important roles during infection and host degradation. CONCLUSION This study has provided a high-quality reference genome and annotation for P. cinnamomi. This is among the best assembled genomes for any Phytophthora species assembled to date and thus resulted in improved identification and characterization of pathogenicity-related genes, some of which were undetected in previous versions of genome assemblies. Phytophthora cinnamomi harbours a large number of effector genes which are located in the gene-poor regions of the genome. This unique genomic partitioning provides P. cinnamomi with a high level of adaptability and could contribute to its success as a highly invasive species. Finally, the genome sequence, its annotation and the pathogenicity effectors identified in this study will serve as an important resource that will enable future studies to better understand and mitigate the impact of this important pathogen.
Collapse
Affiliation(s)
- Juanita Engelbrecht
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Pretoria, South Africa.
| | - Tuan A Duong
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Pretoria, South Africa
| | - S Ashok Prabhu
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Pretoria, South Africa
| | - Mohamed Seedat
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Pretoria, South Africa
| | - Noëlani van den Berg
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute, University of Pretoria, Pretoria, South Africa
| |
Collapse
|
35
|
Barbitoff YA, Matveenko AG, Matiiv AB, Maksiutenko EM, Moskalenko SE, Drozdova PB, Polev DE, Beliavskaia AY, Danilov LG, Predeus AV, Zhouravleva GA. Chromosome-level genome assembly and structural variant analysis of two laboratory yeast strains from the Peterhof Genetic Collection lineage. G3-GENES GENOMES GENETICS 2021; 11:6129118. [PMID: 33677552 PMCID: PMC8759820 DOI: 10.1093/g3journal/jkab029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 01/22/2021] [Indexed: 01/23/2023]
Abstract
Thousands of yeast genomes have been sequenced with both traditional and long-read technologies, and multiple observations about modes of genome evolution for both wild and laboratory strains have been drawn from these sequences. In our study, we applied Oxford Nanopore and Illumina technologies to assemble complete genomes of two widely used members of a distinct laboratory yeast lineage, the Peterhof Genetic Collection (PGC), and investigate the structural features of these genomes including transposable element content, copy number alterations, and structural rearrangements. We identified numerous notable structural differences between genomes of PGC strains and the reference S288C strain. We discovered a substantial enrichment of mid-length insertions and deletions within repetitive coding sequences, such as in the SCH9 gene or the NUP100 gene, with possible impact of these variants on protein amyloidogenicity. High contiguity of the final assemblies allowed us to trace back the history of reciprocal unbalanced translocations between chromosomes I, VIII, IX, XI, and XVI of the PGC strains. We show that formation of hybrid alleles of the FLO genes during such chromosomal rearrangements is likely responsible for the lack of invasive growth of yeast strains. Taken together, our results highlight important features of laboratory yeast strain evolution using the power of long-read sequencing.
Collapse
Affiliation(s)
- Yury A Barbitoff
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia.,Bioinformatics Institute, St. Petersburg 197342, Russia
| | - Andrew G Matveenko
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia.,Bioinformatics Institute, St. Petersburg 197342, Russia
| | - Anton B Matiiv
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia.,Bioinformatics Institute, St. Petersburg 197342, Russia
| | - Evgeniia M Maksiutenko
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia.,St. Petersburg Branch, Vavilov Institute of General Genetics of the Russian Academy of Sciences, St. Petersburg 199034, Russia
| | - Svetlana E Moskalenko
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia.,St. Petersburg Branch, Vavilov Institute of General Genetics of the Russian Academy of Sciences, St. Petersburg 199034, Russia
| | | | | | - Alexandra Y Beliavskaia
- Department of Invertebrate Zoology, St. Petersburg State University, 199034 St. Petersburg, Russia
| | - Lavrentii G Danilov
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia
| | - Alexander V Predeus
- Bioinformatics Institute, St. Petersburg 197342, Russia.,University of Liverpool, Liverpool, UK, L7 3EA
| | - Galina A Zhouravleva
- Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034, Russia
| |
Collapse
|
36
|
Jacobus AP, Stephens TG, Youssef P, González-Pech R, Ciccotosto-Camp MM, Dougan KE, Chen Y, Basso LC, Frazzon J, Chan CX, Gross J. Comparative Genomics Supports That Brazilian Bioethanol Saccharomyces cerevisiae Comprise a Unified Group of Domesticated Strains Related to Cachaça Spirit Yeasts. Front Microbiol 2021; 12:644089. [PMID: 33936002 PMCID: PMC8082247 DOI: 10.3389/fmicb.2021.644089] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 03/08/2021] [Indexed: 01/05/2023] Open
Abstract
Ethanol production from sugarcane is a key renewable fuel industry in Brazil. Major drivers of this alcoholic fermentation are Saccharomyces cerevisiae strains that originally were contaminants to the system and yet prevail in the industrial process. Here we present newly sequenced genomes (using Illumina short-read and PacBio long-read data) of two monosporic isolates (H3 and H4) of the S. cerevisiae PE-2, a predominant bioethanol strain in Brazil. The assembled genomes of H3 and H4, together with 42 draft genomes of sugarcane-fermenting (fuel ethanol plus cachaça) strains, were compared against those of the reference S288C and diverse S. cerevisiae. All genomes of bioethanol yeasts have amplified SNO2(3)/SNZ2(3) gene clusters for vitamin B1/B6 biosynthesis, and display ubiquitous presence of a particular family of SAM-dependent methyl transferases, rare in S. cerevisiae. Widespread amplifications of quinone oxidoreductases YCR102C/YLR460C/YNL134C, and the structural or punctual variations among aquaporins and components of the iron homeostasis system, likely represent adaptations to industrial fermentation. Interesting is the pervasive presence among the bioethanol/cachaça strains of a five-gene cluster (Region B) that is a known phylogenetic signature of European wine yeasts. Combining genomes of H3, H4, and 195 yeast strains, we comprehensively assessed whole-genome phylogeny of these taxa using an alignment-free approach. The 197-genome phylogeny substantiates that bioethanol yeasts are monophyletic and closely related to the cachaça and wine strains. Our results support the hypothesis that biofuel-producing yeasts in Brazil may have been co-opted from a pool of yeasts that were pre-adapted to alcoholic fermentation of sugarcane for the distillation of cachaça spirit, which historically is a much older industry than the large-scale fuel ethanol production.
Collapse
Affiliation(s)
- Ana Paula Jacobus
- Laboratory for Genomics and Experimental Evolution of Yeasts, Institute for Bioenergy Research, São Paulo State University, Rio Claro, Brazil
| | - Timothy G Stephens
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Pierre Youssef
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Raul González-Pech
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Michael M Ciccotosto-Camp
- Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Katherine E Dougan
- Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Yibi Chen
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.,Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Luiz Carlos Basso
- Biological Science Department, Escola Superior de Agricultura Luiz de Queiroz, University of São Paulo (USP), Piracicaba, Brazil
| | - Jeverson Frazzon
- Institute of Food Science and Technology, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
| | - Cheong Xin Chan
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.,Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Jeferson Gross
- Laboratory for Genomics and Experimental Evolution of Yeasts, Institute for Bioenergy Research, São Paulo State University, Rio Claro, Brazil
| |
Collapse
|
37
|
Que T, Wang H, Yang W, Wu J, Hou C, Pei S, Wu Q, Li LM, Wei S, Xie X, Huang H, Chen P, Huang Y, Wu A, He M, Nong D, Wei X, Wu J, Nong R, Huang N, Zhou Q, Lin Y, Lu T, Wei Y, Li S, Yao J, Zhong Y, Qin H, Tan L, Li Y, Li W, Liu T, Liu S, Yu Y, Qiu H, Jiang Y, Li Y, Liu Z, Huang CM, Hu Y. The reference genome and transcriptome of the limestone langur, Trachypithecus leucocephalus, reveal expansion of genes related to alkali tolerance. BMC Biol 2021; 19:67. [PMID: 33832502 PMCID: PMC8034193 DOI: 10.1186/s12915-021-00998-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 03/05/2021] [Indexed: 01/13/2023] Open
Abstract
Background Trachypithecus leucocephalus, the white-headed langur, is a critically endangered primate that is endemic to the karst mountains in the southern Guangxi province of China. Studying the genomic and transcriptomic mechanisms underlying its local adaptation could help explain its persistence within a highly specialized ecological niche. Results In this study, we used PacBio sequencing and optical assembly and Hi-C analysis to create a high-quality de novo assembly of the T. leucocephalus genome. Annotation and functional enrichment revealed many genes involved in metabolism, transport, and homeostasis, and almost all of the positively selected genes were related to mineral ion binding. The transcriptomes of 12 tissues from three T. leucocephalus individuals showed that the great majority of genes involved in mineral absorption and calcium signaling were expressed, and their gene families were significantly expanded. For example, FTH1 primarily functions in iron storage and had 20 expanded copies. Conclusions These results increase our understanding of the evolution of alkali tolerance and other traits necessary for the persistence of T. leucocephalus within an ecologically unique limestone karst environment.
Collapse
Affiliation(s)
- Tengcheng Que
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Huifeng Wang
- Department of Biochemistry and Molecular Biology, School of Pre-Clinical Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Weifei Yang
- Annoroad Gene Technology, Beijing, 100176, China
| | - Jianbao Wu
- Guangxi Chongzuo white headed langur national nature reserve, Chongzuo, Guangxi, 532200, China
| | - Chenyang Hou
- School of Information and Management, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Surui Pei
- Annoroad Gene Technology, Beijing, 100176, China
| | - Qunying Wu
- Department of Biochemistry and Molecular Biology, School of Pre-Clinical Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Liu Ming Li
- Guangxi Reproductive Medical Research Center, First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Shilu Wei
- Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Xing Xie
- Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Hongli Huang
- Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Panyu Chen
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Yiming Huang
- Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Aiqiong Wu
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Meihong He
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Dengpan Nong
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Xiao Wei
- Guangxi Chongzuo white headed langur national nature reserve, Chongzuo, Guangxi, 532200, China
| | - Junyi Wu
- Nanning Animal Zoo, Nanning, Guangxi, 530021, China
| | - Ru Nong
- Nanning Animal Zoo, Nanning, Guangxi, 530021, China
| | - Ning Huang
- Nanning Animal Zoo, Nanning, Guangxi, 530021, China
| | - Qingniao Zhou
- Department of Biochemistry and Molecular Biology, School of Pre-Clinical Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Yaowang Lin
- Department of Biochemistry and Molecular Biology, School of Pre-Clinical Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Tingxi Lu
- School of Information and Management, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Yongjie Wei
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Shousheng Li
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Jianglong Yao
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Yanli Zhong
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Huayong Qin
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Luohao Tan
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Yingjiao Li
- Terrestrial Wildlife Rescue and Epidemic Diseases Surveillance Center of Guangxi, Nanning, Guangxi, 530003, China
| | - Weidong Li
- Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Tao Liu
- Annoroad Gene Technology, Beijing, 100176, China
| | - Sanyang Liu
- Annoroad Gene Technology, Beijing, 100176, China
| | - Yongyi Yu
- Annoroad Gene Technology, Beijing, 100176, China
| | - Hong Qiu
- Annoroad Gene Technology, Beijing, 100176, China
| | - Yonghua Jiang
- Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Youcheng Li
- Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China
| | - Zhijin Liu
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
| | - Cheng Ming Huang
- College of Life Sciences, Capital Normal University, Beijing, 100048, China.
| | - Yanling Hu
- Department of Biochemistry and Molecular Biology, School of Pre-Clinical Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China. .,Life Sciences Institute, Guangxi Medical University, Nanning, Guangxi, 530021, China. .,Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi, 530021, China.
| |
Collapse
|
38
|
Pan Y, Fang G, Wang Z, Cao Y, Liu Y, Li G, Liu X, Xiao Q, Zhan S. Chromosome-level genome reference and genome editing of the tea geometrid. Mol Ecol Resour 2021; 21:2034-2049. [PMID: 33738922 DOI: 10.1111/1755-0998.13385] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 02/25/2021] [Accepted: 03/10/2021] [Indexed: 12/16/2022]
Abstract
The tea geometrid is a destructive insect pest on tea plants, which seriously affects tea production in terms of both yield and quality and causes severe economic losses. The tea geometrid also provides an important study system to address the ecological adaptive mechanisms underlying its unique host plant adaptation and protective resemblance. In this study, we fully sequenced and de novo assembled the reference genome of the tea geometrid, Ectropis grisescens, using long sequencing reads. We presented a highly continuous, near-complete genome reference (787.4 Mb; scaffold N50: 26.9 Mb), along with the annotation of 18,746 protein-coding genes and 53.3% repeat contents. Importantly, we successfully placed 97.8% of the assembly in 31 chromosomes based on Hi-C interactions and characterized the sex chromosome based on sex-biased sequencing coverage. Multiple quality-control assays and chromosome-scale synteny with the model species all supported the high quality of the presented genome reference. We focused biological annotations on gene families related to the host plant adaptation and camouflage in the tea geometrid and performed comparisons with other representative lepidopteran species. Important findings include the E. grisescens-specific expansion of CYP6 P450 genes that might be involved in metabolism of tea defensive chemicals and unexpected massive expansion of gustatory receptor gene families that suggests potential polyphagy for this tea pest. Furthermore, we developed an efficient genome editing system based on CRISPR/Cas9 technology and successfully implement mutagenesis of a Hox gene in the tea geometrid. Our study provides key genomic resources both for exploring unique mechanisms underlying the ecological adaptation of tea geometrids and for developing environment-friendly strategies for tea pest management.
Collapse
Affiliation(s)
- Yunjie Pan
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China.,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Gangqi Fang
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China.,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Zhibo Wang
- Key Laboratory of Tea Quality and Safety Control, Tea Research Institute, Ministry of Agriculture, Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Yanghui Cao
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yongjian Liu
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China.,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Guiyun Li
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Xiaojing Liu
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Qiang Xiao
- Key Laboratory of Tea Quality and Safety Control, Tea Research Institute, Ministry of Agriculture, Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Shuai Zhan
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China.,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
39
|
Rao G, Zhang J, Liu X, Lin C, Xin H, Xue L, Wang C. De novo assembly of a new Olea europaea genome accession using nanopore sequencing. HORTICULTURE RESEARCH 2021; 8:64. [PMID: 33790235 PMCID: PMC8012569 DOI: 10.1038/s41438-021-00498-y] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Revised: 01/04/2021] [Accepted: 01/11/2021] [Indexed: 05/17/2023]
Abstract
Olive (Olea europaea L.) is internationally renowned for its high-end product, extra virgin olive oil. An incomplete genome of O. europaea was previously obtained using shotgun sequencing in 2016. To further explore the genetic and breeding utilization of olive, an updated draft genome of olive was obtained using Oxford Nanopore third-generation sequencing and Hi-C technology. Seven different assembly strategies were used to assemble the final genome of 1.30 Gb, with contig and scaffold N50 sizes of 4.67 Mb and 42.60 Mb, respectively. This greatly increased the quality of the olive genome. We assembled 1.1 Gb of sequences of the total olive genome to 23 pseudochromosomes by Hi-C, and 53,518 protein-coding genes were predicted in the current assembly. Comparative genomics analyses, including gene family expansion and contraction, whole-genome replication, phylogenetic analysis, and positive selection, were performed. Based on the obtained high-quality olive genome, a total of nine gene families with 202 genes were identified in the oleuropein biosynthesis pathway, which is twice the number of genes identified from the previous data. This new accession of the olive genome is of sufficient quality for genome-wide studies on gene function in olive and has provided a foundation for the molecular breeding of olive species.
Collapse
Affiliation(s)
- Guodong Rao
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
- Collaborative Innovation Center of Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China.
- Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
| | - Jianguo Zhang
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
- Collaborative Innovation Center of Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China.
- Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
| | - Xiaoxia Liu
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Chunfu Lin
- MIANNING Yuansheng Agricultural Science and Technology Co., Ltd., Liangshan Yi Autonomous Prefecture Mianning County, Sichuan, 615600, China
| | - Huaigen Xin
- Biomarker Technologies Corporation, Beijing, 101300, China
| | - Li Xue
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Chenhe Wang
- State Key Laboratory of Tree Genetics and Breeding, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| |
Collapse
|
40
|
He M, Chi X, Ren J. Applications of Oxford Nanopore Sequencing in Schizosaccharomyces pombe. Methods Mol Biol 2021; 2196:97-116. [PMID: 32889716 DOI: 10.1007/978-1-0716-0868-5_9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Recent years have seen great progresses in third-generation sequencing. New commercial platforms from Oxford Nanopore Technologies (ONT) can generate ultra-long reads from single-molecule nucleic acid fragments of kilobases up to megabases, exceeding the limitation of short reads and dependency on template amplification suffered by the previous generation of sequencing technologies. Moreover, it can detect epigenetic modifications directly, as well as providing all-around field usage, being pocket-sized and low cost. It has already been applied to yeast research in many aspects, such as complete de novo genome assemblies, the phylogeny of large-brewing yeasts, gene isoform identification, and base modification detection. These applications have delivered novel insights into yeast genomic and transcriptomic analysis.
Collapse
Affiliation(s)
- Ming He
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China National Center for Bioinformation, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xu Chi
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China National Center for Bioinformation, Beijing, China
| | - Jie Ren
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China National Center for Bioinformation, Beijing, China. .,University of Chinese Academy of Sciences, Beijing, China. .,Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China. .,Sino-Danish College, School of Future Technology, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
41
|
Farshad M, Rasaiah JC. Light-Nucleotide versus Ion-Nucleotide Interactions for Single-Nucleotide Resolution. J Phys Chem B 2021; 125:2863-2870. [PMID: 33688740 DOI: 10.1021/acs.jpcb.0c10759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Several parallel reads of ionic currents through multiple CsgG nanopores provide information about ion-nucleotide interactions for sequencing single-stranded DNA (ss-DNA) using base-calling algorithms. However, the information in ion-nucleotide interactions seems insufficient for single-read nanopore DNA sequencing. Here we report discriminative light-nucleotide interactions calculated from density functional theory (DFT), which are compared with ionic currents obtained from molecular dynamics (MD) simulations. The MD simulations were performed on a system containing a transverse nanochannel and a longitudinal solid state nanopore. We show that both of the transverse and longitudinal ionic currents during the translocation of A16, G16, T16, and C16 through the nanopore, overlapped widely. On the other hand, the UV-vis and Raman spectra of different types of single nucleotides, nucleosides, and nucleobases show relatively higher resolution than the ionic currents. Light-nucleotide interactions provide better information for characterizing the nucleotides in comparison to ion-nucleotide interactions for nanopore DNA sequencing. This can be realized by using optical techniques including surface-enhanced Raman spectroscopy (SERS) or tip-enhanced Raman spectroscopy (TERS), while plasmon excitation can be used to localize light and control the rate of nucleotide flow.
Collapse
Affiliation(s)
- Mohsen Farshad
- Department of Chemistry, University of Maine, Orono, Maine 04469, United States
| | - Jayendran C Rasaiah
- Department of Chemistry, University of Maine, Orono, Maine 04469, United States
| |
Collapse
|
42
|
Liu H, Wu S, Li A, Ruan J. SMARTdenovo: a de novo assembler using long noisy reads. GIGABYTE 2021; 2021:gigabyte15. [PMID: 36824332 PMCID: PMC9632051 DOI: 10.46471/gigabyte.15] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 03/05/2021] [Indexed: 12/11/2022] Open
Abstract
Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. It has also been widely used to study structural variants, phase haplotypes and more. Here, we introduce the assembler SMARTdenovo, a single-molecule sequencing (SMS) assembler that follows the overlap-layout-consensus (OLC) paradigm. SMARTdenovo (RRID: SCR_017622) was designed to be a rapid assembler, which, unlike contemporaneous SMS assemblers, does not require highly accurate raw reads for error correction. It has performed well in the evaluation of congeneric assemblers and has been successfully users for various assembly projects. It is compatible with Canu for assembling high-quality genomes, and several of the assembly strategies in this program have been incorporated into subsequent popular assemblers. The assembler has been in use since 2015; here we provide information on the development of SMARTdenovo and how to implement its algorithms into current projects.
Collapse
Affiliation(s)
- Hailin Liu
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Shigang Wu
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Alun Li
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Jue Ruan
- Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| |
Collapse
|
43
|
Lin W, Shi Y, Jia G, Sun H, Sun T, Hou D. Genome sequencing and annotation and phylogenomic analysis of the medicinal mushroom Amauroderma rugosum, a traditional medicinal species in the family Ganodermataceae. Mycologia 2021; 113:268-277. [PMID: 33555992 DOI: 10.1080/00275514.2020.1851135] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Amauroderma rugosum is one of the traditional Chinese medicinal mushrooms and is used to reduce inflammation, treat diuretic and upset stomach, and prevent cancer. Here, we present a genomic resource of Amauroderma rugosum (ACCC 51706) for further understanding its biology and exploration of the synthesis pathway of bioactive compounds. Genomic DNA was extracted and then subjected to Illumina HiSeq X Ten and PacBio Sequel I sequencing. The final genome is 40.66 Mb in size, with an N50 scaffold size of 36.6 Mb, and encodes 10 181 putative predicted genes. Among them, 6931 genes were functionally annotated. Phylogenomic analysis suggested that A. rugosum and Ganoderma sinense were not clustered together into a group and the latter was grouped with the Polyporaceae. Further, we also identified 377 carbohydrate-active enzymes (CAZymes) and 15 secondary metabolite biosynthetic gene clusters. This is the first genome-scale assembly and annotation for an Amauroderma species. The identification of novel secondary metabolite biosynthetic gene clusters would promote pharmacological research and development of novel bioactive compounds in the future.
Collapse
Affiliation(s)
- Weiping Lin
- School of Bioscience and Technology, Weifang Medical University, Weifang 261053, China.,Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| | - Yanhua Shi
- School of Bioscience and Technology, Weifang Medical University, Weifang 261053, China.,Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| | - Guangtao Jia
- School of Bioscience and Technology, Weifang Medical University, Weifang 261053, China.,Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| | - Hengyi Sun
- Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| | - Tongyi Sun
- School of Bioscience and Technology, Weifang Medical University, Weifang 261053, China.,Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| | - Dianhai Hou
- School of Bioscience and Technology, Weifang Medical University, Weifang 261053, China.,Key Laboratory of Biological Medicines in Universities of Shandong Province, Weifang Medical University, Weifang 261053, China
| |
Collapse
|
44
|
Giraldo PA, Shinozuka H, Spangenberg GC, Smith KF, Cogan NOI. Rapid and Detailed Characterization of Transgene Insertion Sites in Genetically Modified Plants via Nanopore Sequencing. FRONTIERS IN PLANT SCIENCE 2021; 11:602313. [PMID: 33613582 PMCID: PMC7889508 DOI: 10.3389/fpls.2020.602313] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 12/24/2020] [Indexed: 05/15/2023]
Abstract
Molecular characterization of genetically modified plants can provide crucial information for the development of detection and identification methods, to comply with traceability, and labeling requirements prior to commercialization. Detailed description of the genetic modification was previously a challenging step in the safety assessment, since it required the use of laborious and time-consuming techniques. In this study an accurate, simple, and fast method was developed for molecular characterization of genetically modified (GM) plants, following a user-friendly workflow for researchers with limited bioinformatic capabilities. Three GM events from a diverse array of crop species-perennial ryegrass, white clover, and canola-were used to test the approach that exploits long-read sequencing by the MinION device, from Oxford Nanopore Technologies. The method delivered a higher degree of resolution of the transgenic events within the host genome than has previously been possible with the standard Illumina short-range sequencing strategies. The flanking sequences, copy number, and presence of backbone sequences, and overall transgene insertion structure were determined for each of the plant genomes, with the additional identification of moderate-sized secondary insertions that would have previously been missed. The proposed workflow takes only about 1 week from DNA extraction to analyzed result, and the method will complement the existing approaches for molecular characterization of GM plants, since it makes the process faster, simpler, and more cost-effective.
Collapse
Affiliation(s)
- Paula A. Giraldo
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, Australia
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Hiroshi Shinozuka
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
| | - German C. Spangenberg
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
- AgriBio, The Centre for AgriBioscience, School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Kevin F. Smith
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, Australia
- Agriculture Victoria Research, Hamilton, VIC, Australia
| | - Noel O. I. Cogan
- Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience, Bundoora, VIC, Australia
- AgriBio, The Centre for AgriBioscience, School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| |
Collapse
|
45
|
Boocock J, Sadhu MJ, Durvasula A, Bloom JS, Kruglyak L. Ancient balancing selection maintains incompatible versions of the galactose pathway in yeast. Science 2021; 371:415-419. [PMID: 33479156 PMCID: PMC8384573 DOI: 10.1126/science.aba0542] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 05/08/2020] [Accepted: 12/17/2020] [Indexed: 02/02/2023]
Abstract
Metabolic pathways differ across species but are expected to be similar within a species. We discovered two functional, incompatible versions of the galactose pathway in Saccharomyces cerevisiae We identified a three-locus genetic interaction for growth in galactose, and used precisely engineered alleles to show that it arises from variation in the galactose utilization genes GAL2, GAL1/10/7, and phosphoglucomutase (PGM1), and that the reference allele of PGM1 is incompatible with the alternative alleles of the other genes. Multiloci balancing selection has maintained the two incompatible versions of the pathway for millions of years. Strains with alternative alleles are found primarily in galactose-rich dairy environments, and they grow faster in galactose but slower in glucose, revealing a trade-off on which balancing selection may have acted.
Collapse
Affiliation(s)
- James Boocock
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Meru J Sadhu
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Arun Durvasula
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Joshua S Bloom
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA.
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Leonid Kruglyak
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA.
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
46
|
Farhat S, Le P, Kayal E, Noel B, Bigeard E, Corre E, Maumus F, Florent I, Alberti A, Aury JM, Barbeyron T, Cai R, Da Silva C, Istace B, Labadie K, Marie D, Mercier J, Rukwavu T, Szymczak J, Tonon T, Alves-de-Souza C, Rouzé P, Van de Peer Y, Wincker P, Rombauts S, Porcel BM, Guillou L. Rapid protein evolution, organellar reductions, and invasive intronic elements in the marine aerobic parasite dinoflagellate Amoebophrya spp. BMC Biol 2021; 19:1. [PMID: 33407428 PMCID: PMC7789003 DOI: 10.1186/s12915-020-00927-9] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Accepted: 11/12/2020] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Dinoflagellates are aquatic protists particularly widespread in the oceans worldwide. Some are responsible for toxic blooms while others live in symbiotic relationships, either as mutualistic symbionts in corals or as parasites infecting other protists and animals. Dinoflagellates harbor atypically large genomes (~ 3 to 250 Gb), with gene organization and gene expression patterns very different from closely related apicomplexan parasites. Here we sequenced and analyzed the genomes of two early-diverging and co-occurring parasitic dinoflagellate Amoebophrya strains, to shed light on the emergence of such atypical genomic features, dinoflagellate evolution, and host specialization. RESULTS We sequenced, assembled, and annotated high-quality genomes for two Amoebophrya strains (A25 and A120), using a combination of Illumina paired-end short-read and Oxford Nanopore Technology (ONT) MinION long-read sequencing approaches. We found a small number of transposable elements, along with short introns and intergenic regions, and a limited number of gene families, together contribute to the compactness of the Amoebophrya genomes, a feature potentially linked with parasitism. While the majority of Amoebophrya proteins (63.7% of A25 and 59.3% of A120) had no functional assignment, we found many orthologs shared with Dinophyceae. Our analyses revealed a strong tendency for genes encoded by unidirectional clusters and high levels of synteny conservation between the two genomes despite low interspecific protein sequence similarity, suggesting rapid protein evolution. Most strikingly, we identified a large portion of non-canonical introns, including repeated introns, displaying a broad variability of associated splicing motifs never observed among eukaryotes. Those introner elements appear to have the capacity to spread over their respective genomes in a manner similar to transposable elements. Finally, we confirmed the reduction of organelles observed in Amoebophrya spp., i.e., loss of the plastid, potential loss of a mitochondrial genome and functions. CONCLUSION These results expand the range of atypical genome features found in basal dinoflagellates and raise questions regarding speciation and the evolutionary mechanisms at play while parastitism was selected for in this particular unicellular lineage.
Collapse
Affiliation(s)
- Sarah Farhat
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
- School of Marine and Atmospheric Sciences, Stony Brook University, Stony Brook, New York, 11794, USA
| | - Phuong Le
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Ehsan Kayal
- Sorbonne Université, CNRS, FR2424, Station Biologique de Roscoff, Place Georges Teissier, 29680, Roscoff, France
| | - Benjamin Noel
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Estelle Bigeard
- Sorbonne Université, CNRS, UMR7144 Adaptation et Diversité en Milieu Marin, Ecology of Marine Plankton (ECOMAP), Station Biologique de Roscoff SBR, 29680, Roscoff, France
| | - Erwan Corre
- Sorbonne Université, CNRS, FR2424, Station Biologique de Roscoff, Place Georges Teissier, 29680, Roscoff, France
| | - Florian Maumus
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Isabelle Florent
- Unité Molécules de Communication et Adaptation des Microorganismes (MCAM, UMR7245), Muséum national d'Histoire naturelle, CNRS, CP 52, 57 rue Cuvier, 75005, Paris, France
| | - Adriana Alberti
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Tristan Barbeyron
- Sorbonne Université, CNRS, UMR 8227, Station Biologique de Roscoff, Place Georges Teissier, 29680, Roscoff, France
| | - Ruibo Cai
- Sorbonne Université, CNRS, UMR7144 Adaptation et Diversité en Milieu Marin, Ecology of Marine Plankton (ECOMAP), Station Biologique de Roscoff SBR, 29680, Roscoff, France
| | - Corinne Da Silva
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Benjamin Istace
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Karine Labadie
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Dominique Marie
- Sorbonne Université, CNRS, UMR7144 Adaptation et Diversité en Milieu Marin, Ecology of Marine Plankton (ECOMAP), Station Biologique de Roscoff SBR, 29680, Roscoff, France
| | - Jonathan Mercier
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Tsinda Rukwavu
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Jeremy Szymczak
- Sorbonne Université, CNRS, FR2424, Station Biologique de Roscoff, Place Georges Teissier, 29680, Roscoff, France
- Sorbonne Université, CNRS, UMR7144 Adaptation et Diversité en Milieu Marin, Ecology of Marine Plankton (ECOMAP), Station Biologique de Roscoff SBR, 29680, Roscoff, France
| | - Thierry Tonon
- Centre for Novel Agricultural Products, Department of Biology, University of York, Heslington, York, YO10 5DD, UK
| | - Catharina Alves-de-Souza
- Algal Resources Collection, MARBIONC, Center for Marine Sciences, University of North Carolina Wilmington, 5600 Marvin K. Moss Lane, Wilmington, NC, 28409, USA
| | - Pierre Rouzé
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France
| | - Stephane Rombauts
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Betina M Porcel
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ. Evry, Université Paris-Saclay, 91057, Evry, France.
| | - Laure Guillou
- Sorbonne Université, CNRS, UMR7144 Adaptation et Diversité en Milieu Marin, Ecology of Marine Plankton (ECOMAP), Station Biologique de Roscoff SBR, 29680, Roscoff, France.
| |
Collapse
|
47
|
Rousseau-Gueutin M, Belser C, Da Silva C, Richard G, Istace B, Cruaud C, Falentin C, Boideau F, Boutte J, Delourme R, Deniot G, Engelen S, de Carvalho JF, Lemainque A, Maillet L, Morice J, Wincker P, Denoeud F, Chèvre AM, Aury JM. Long-read assembly of the Brassica napus reference genome Darmor-bzh. Gigascience 2020; 9:giaa137. [PMID: 33319912 PMCID: PMC7736779 DOI: 10.1093/gigascience/giaa137] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/18/2020] [Accepted: 11/09/2020] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The combination of long reads and long-range information to produce genome assemblies is now accepted as a common standard. This strategy not only allows access to the gene catalogue of a given species but also reveals the architecture and organization of chromosomes, including complex regions such as telomeres and centromeres. The Brassica genus is not exempt, and many assemblies based on long reads are now available. The reference genome for Brassica napus, Darmor-bzh, which was published in 2014, was produced using short reads and its contiguity was extremely low compared with current assemblies of the Brassica genus. FINDINGS Herein, we report the new long-read assembly of Darmor-bzh genome (Brassica napus) generated by combining long-read sequencing data and optical and genetic maps. Using the PromethION device and 6 flowcells, we generated ∼16 million long reads representing 93× coverage and, more importantly, 6× with reads longer than 100 kb. This ultralong-read dataset allows us to generate one of the most contiguous and complete assemblies of a Brassica genome to date (contig N50 > 10 Mb). In addition, we exploited all the advantages of the nanopore technology to detect modified bases and sequence transcriptomic data using direct RNA to annotate the genome and focus on resistance genes. CONCLUSION Using these cutting-edge technologies, and in particular by relying on all the advantages of the nanopore technology, we provide the most contiguous Brassica napus assembly, a resource that will be valuable to the Brassica community for crop improvement and will facilitate the rapid selection of agronomically important traits.
Collapse
Affiliation(s)
| | - Caroline Belser
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Corinne Da Silva
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Gautier Richard
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Benjamin Istace
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Corinne Cruaud
- Genoscope, Institut François Jacob, Commissariat à l'Energie Atomique (CEA), Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Cyril Falentin
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Franz Boideau
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Julien Boutte
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Regine Delourme
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Gwenaëlle Deniot
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Stefan Engelen
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | | | - Arnaud Lemainque
- Genoscope, Institut François Jacob, Commissariat à l'Energie Atomique (CEA), Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Loeiz Maillet
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Jérôme Morice
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - France Denoeud
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Anne-Marie Chèvre
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057 Evry, France
| |
Collapse
|
48
|
Jin F, Zhou Z, Guo Q, Liang Z, Yang R, Jiang J, He Y, Zhao Q, Zhao Q. High-quality genome assembly of Metaphire vulgaris. PeerJ 2020; 8:e10313. [PMID: 33240640 PMCID: PMC7666815 DOI: 10.7717/peerj.10313] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 10/15/2020] [Indexed: 12/22/2022] Open
Abstract
Earthworms enrich the soil and protect the health of their ecological environment. Previous studies on these invertebrates determined their protein content, hormone secretions, medicinal value, and ecological habits, but their whole genomic sequence remains incomplete. We performed whole genome sequencing of Metaphire vulgaris (Chen, 1930), which belongs to the genus Metaphire of the family Megascolecidae. The genome assembly was 729 Mb, with a N50 contig size of 4.2 Mb. In total, 559 contigs were anchored to 41 chromosomes according to the results of Hi-C (High-throughput Chromosome Conformation Capture) technology, which was confirmed by karyological analysis. A comparison of the genomic sequences and genes indicated that there was a whole-genome duplication in M. vulgaris followed by several chromosome fusion events. Hox genes and lumbrokinase genes were identified as partial clusters surrounding the genome. Our high-quality genome assembly of M. vulgaris will provide valuable information for gene function and evolutionary studies in earthworms.
Collapse
Affiliation(s)
- Feng Jin
- College of Rehabilitation Sciences, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Zhaoli Zhou
- Shanghai Key Laboratory of Molecular Imaging, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Qi Guo
- College of Rehabilitation Sciences, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Zhenwen Liang
- College of Rehabilitation Sciences, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Ruoyu Yang
- College of Rehabilitation Sciences, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Jibao Jiang
- Department of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Yanlin He
- College of Rehabilitation Sciences, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Qi Zhao
- Department of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qiang Zhao
- Shanghai Key Laboratory of Molecular Imaging, Shanghai University of Medicine and Health Sciences, Shanghai, China
| |
Collapse
|
49
|
Ma W, Xu L, Hua H, Chen M, Guo M, He K, Zhao J, Li F. Chromosomal-level genomes of three rice planthoppers provide new insights into sex chromosome evolution. Mol Ecol Resour 2020; 21:226-237. [PMID: 32780934 DOI: 10.1111/1755-0998.13242] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 07/27/2020] [Accepted: 08/04/2020] [Indexed: 10/23/2022]
Abstract
The brown planthopper Nilaparvata lugens, white-backed planthopper Sogatella furcifera, and small brown planthopper Laodelphax striatellus are three major insect pests of rice. They are genetically close; however, they differ in several ecological traits such as host range, migration capacity, and in their sex chromosomes. Though the draft genome of these three planthoppers have been previously released, the quality of genome assemblies need to be improved. The absence of chromosome-level genome resources has hindered in-depth research of these three species. Here, we performed a de novo genome assembly for N. lugens to increase its genome assembly quality with PacBio and Illumina platforms, increasing the contig N50 to 589.46 Kb. Then, with the new N. lugens genome and previously reported S. furcifera and L. striatellus genome assemblies, we generated chromosome-level scaffold assemblies of these three planthopper species using HiC scaffolding technique. The scaffold N50s significantly increased to 77.63 Mb, 43.36 Mb and 29.24 Mb for N. lugens, S. furcifera and L. striatellus, respectively. To identify sex chromosomes of these three planthopper species, we carried out genome re-sequencing of males and females and successfully determined the X and Y chromosomes for N. lugens, and X chromosome for S. furcifera and L. striatellus. The gene content of the sex chromosomes showed high diversity among these three planthoppers suggesting the rapid evolution of sex-linked genes, and all chromosomes showed high synteny. The chromosome-level genome assemblies of three planthoppers would provide a valuable resource for a broad range of future research in molecular ecology, and subsequently benefits development of modern pest control strategies.
Collapse
Affiliation(s)
- Weihua Ma
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Le Xu
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insect Pests, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Hongxia Hua
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Mengyao Chen
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insect Pests, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Mengjian Guo
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Kang He
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insect Pests, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Jing Zhao
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Fei Li
- State Key Laboratory of Rice Biology & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insect Pests, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
50
|
Dhorne-Pollet S, Barrey E, Pollet N. A new method for long-read sequencing of animal mitochondrial genomes: application to the identification of equine mitochondrial DNA variants. BMC Genomics 2020; 21:785. [PMID: 33176683 PMCID: PMC7661214 DOI: 10.1186/s12864-020-07183-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 10/26/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mitochondrial DNA is remarkably polymorphic. This is why animal geneticists survey mitochondrial genomes variations for fundamental and applied purposes. We present here an approach to sequence whole mitochondrial genomes using nanopore long-read sequencing. Our method relies on the selective elimination of nuclear DNA using an exonuclease treatment and on the amplification of circular mitochondrial DNA using a multiple displacement amplification step. RESULTS We optimized each preparative step to obtain a 100 million-fold enrichment of horse mitochondrial DNA relative to nuclear DNA. We sequenced these amplified mitochondrial DNA using nanopore sequencing technology and obtained mitochondrial DNA reads that represented up to half of the sequencing output. The sequence reads were 2.3 kb of mean length and provided an even coverage of the mitochondrial genome. Long-reads spanning half or more of the whole mtDNA provided a coverage that varied between 118X and 488X. We evaluated SNPs identified using these long-reads by Sanger sequencing as ground truth and found a precision of 100.0%; a recall of 93.1% and a F1-score of 0.964 using the Twilight horse mtDNA reference. The choice of the mtDNA reference impacted variant calling efficiency with F1-scores varying between 0.947 and 0.964. CONCLUSIONS Our method to amplify mtDNA and to sequence it using the nanopore technology is usable for mitochondrial DNA variant analysis. With minor modifications, this approach could easily be applied to other large circular DNA molecules.
Collapse
Affiliation(s)
- Sophie Dhorne-Pollet
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Eric Barrey
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Nicolas Pollet
- Université Paris-Saclay, CNRS, IRD, UMR Évolution, Génomes, Comportement et Écologie, 91198, Gif-sur-Yvette, France.
| |
Collapse
|