1
|
Qin T, Han J, Fan C, Sun H, Rauf N, Wang T, Yin Z, Chen X. Unveiling axolotl transcriptome for tissue regeneration with high-resolution annotation via long-read sequencing. Comput Struct Biotechnol J 2024; 23:3186-3198. [PMID: 39263210 PMCID: PMC11388199 DOI: 10.1016/j.csbj.2024.08.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 08/08/2024] [Accepted: 08/14/2024] [Indexed: 09/13/2024] Open
Abstract
Axolotls are known for their remarkable regeneration ability. Exploring their transcriptome provides insight into regenerative mechanisms. However, the current annotation of the axolotl transcriptome is limited, leaving the role of unannotated transcripts in regeneration unknown. To discourse this challenge, we exploited long-read sequencing technology, which enables direct observation of full-length RNA transcripts, greatly enhancing the coverage and accuracy of axolotl transcriptome annotation. By utilizing this method, we identified 222 novel gene loci and 4775 novel transcripts, which were quantified using short-read sequencing data. Through the inclusive analysis, we discovered novel homologs, potential functional proteins, noncoding RNAs, and alternative splicing events in key regeneration pathways. In particular, we identified novel transcripts with high protein-coding potential implicated in cell cycle regulation and musculoskeletal development, and regeneration were identified. Interestingly, alternative splice variants were also detected across diverse pathways critical to regeneration. This specifies that these novel transcripts potentially play vital roles underpinning the robust regenerative capacities of axolotls. Single-cell transcriptomic analysis further revealed these isoforms to predominantly exist in axolotl limb chondrocytes and mature tissue cell populations. Overall, the findings significantly advanced consideration of the axolotl transcriptome and provided a new perspective for understanding the mechanisms of regenerative abilities of axolotls.
Collapse
Affiliation(s)
- Tian Qin
- Department of Orthopedic Surgery of Sir Run Run Shaw Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
- China Orthopedic Regenerative Medicine Group (CORMed), Hangzhou, China
| | - Jie Han
- Department of Sports Medicine & Orthopedic Surgery, The Second Affiliated Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
| | - Chunmei Fan
- Department of Sports Medicine & Orthopedic Surgery, The Second Affiliated Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
- China Orthopedic Regenerative Medicine Group (CORMed), Hangzhou, China
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, Department of Clinical Medicine, School of Medicine, Hangzhou City University, Hangzhou, Zhejiang, China
| | - Heng Sun
- Medical 3D Printing Center, Orthopedic Institute, Department of Orthopedic Surgery, The First Affiliated Hospital, School of Basic Medical Sciences, Suzhou Medical College, Soochow University, Suzhou, Jiangsu 215000, China
| | - Naveed Rauf
- Department of Sports Medicine & Orthopedic Surgery, The Second Affiliated Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
- China Orthopedic Regenerative Medicine Group (CORMed), Hangzhou, China
- Key Laboratory of Tissue Engineering and Regenerative Medicine of Zhejiang Province, Zhejiang University School of Medicine, Hangzhou, China
| | - Tingzhang Wang
- Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Hangzhou, China
| | - Zi Yin
- Department of Orthopedic Surgery of Sir Run Run Shaw Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
- China Orthopedic Regenerative Medicine Group (CORMed), Hangzhou, China
| | - Xiao Chen
- Department of Sports Medicine & Orthopedic Surgery, The Second Affiliated Hospital, and Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cells and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, China
- China Orthopedic Regenerative Medicine Group (CORMed), Hangzhou, China
| |
Collapse
|
2
|
Yang Y, Xiong D, Zhao D, Huang H, Tian C. Genome sequencing of Elaeocarpus spp. stem blight pathogen Pseudocryphonectria elaeocarpicola reveals potential adaptations to colonize woody bark. BMC Genomics 2024; 25:714. [PMID: 39048950 PMCID: PMC11267912 DOI: 10.1186/s12864-024-10615-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Accepted: 07/12/2024] [Indexed: 07/27/2024] Open
Abstract
BACKGROUND Elaeocarpus spp. stem blight, caused by Pseudocryphonectria elaeocarpicola, is a destructive disease, which will significantly reduce the productivity and longevity of Elaeocarpus spp. plants, especially in the Guangdong Province of China. However, few information is available for P. elaeocarpicola. To unravel the potential adaptation mechanism of stem adaptation, the whole genome of P. elaeocarpicola was sequenced by using the DNBSEQ and PacBio platforms. RESULTS P. elaeocarpicola harbors 44.49 Mb genome with 10,894 predicted coding genes. Genome analysis revealed that the P. elaeocarpicola genome encodes a plethora of pathogenicity-related genes. Analysis of carbohydrate-active enzymes (CAZymes) revealed a rich variety of enzymes participated in plant cell wall degradation, which could effectively degrade cellulose, hemicellulose and xyloglucans in the plant cell wall and promote the invasion of the host plant. There are 213 CAZyme families found in P. elaeocarpicola, among which glycoside hydrolase (GH) family has the largest number, far exceeding other tested fungi by 53%. Besides, P. elaeocarpicola has twice as many genes encoding chitin and cellulose degradation as Cryphonectria parasitica, which belong to the same family. The predicted typical secreted proteins of P. elaeocarpicola are numerous and functional, including many known virulence effector factors, indicating that P. elaeocarpicola has great potential to secrete virulence effectors to promote pathogenicity on host plants. AntiSMASH revealed that the genome encoded 61 secondary metabolic gene clusters including 86 secondary metabolic core genes which was much higher than C. parasitica (49). Among them, two gene cluster of P. elaeocarpicola, cluster12 and cluster52 showed 100% similarity with the mycotoxins synthesis clusters from Aspergillus steynii and Alternaria alternata, respectively. In addition, we annotated cytochrome P450 related enzymes, transporters, and transcription factors in P. elaeocarpicola, which are important virulence determinants of pathogenic fungi. CONCLUSIONS Taken together, our study represents the first genome assembly for P. elaeocarpicola and reveals the key virulence factors in the pathogenic process of P. elaeocarpicola, which will promote our understanding of its pathogenic mechanism. The acquired knowledge lays a foundation for further exploration of molecular interactions with the host and provide target for management strategies in future research.
Collapse
Affiliation(s)
- Yuchen Yang
- State Key laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, China
| | - Dianguang Xiong
- State Key laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, China.
| | - Danyang Zhao
- Guangdong Provincial Key Laboratory of Silviculture, Protection and Utilization, Guangdong Academy of Forestry, Guangzhou, 510520, Guangdong, China
| | - Huayi Huang
- Guangdong Provincial Key Laboratory of Silviculture, Protection and Utilization, Guangdong Academy of Forestry, Guangzhou, 510520, Guangdong, China.
| | - Chengming Tian
- State Key laboratory of Efficient Production of Forest Resources, Beijing Forestry University, Beijing, 100083, China
| |
Collapse
|
3
|
Gao J, Jing Y, Cheng Z, Huang K, Zhang H, Liu Y, Yang L, Liu S. Whole-genome sequence of Pseudomonas benzopyrenica MLY92: isolation from diseased leaves of tobacco in China. Microbiol Resour Announc 2024; 13:e0017624. [PMID: 38888325 PMCID: PMC11256799 DOI: 10.1128/mra.00176-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 04/30/2024] [Indexed: 06/20/2024] Open
Abstract
Here we present a sketch of the whole-genome sequence of Pseudomonas benzopyrenica. The strain comes from the leaf veins of a diseased tobacco plant. This study has significant research implications for gaining insights into the characteristics of microorganisms belonging to the genus Pseudomonas.
Collapse
Affiliation(s)
- Juntao Gao
- School of Biological & Chemical Engineering, Zhejiang University of Science & Technology, Hangzhou, China
| | - Yongfeng Jing
- China Tobacco Hunan Industrial Co. Ltd, Changsha, China
| | - Zhijun Cheng
- China Tobacco Hunan Industrial Co. Ltd, Changsha, China
| | - Ke Huang
- China Tobacco Hunan Industrial Co. Ltd, Changsha, China
| | - Huilin Zhang
- China Tobacco Hunan Industrial Co. Ltd, Changsha, China
| | - Yong Liu
- School of Biological & Chemical Engineering, Zhejiang University of Science & Technology, Hangzhou, China
| | - Lei Yang
- China Tobacco Hunan Industrial Co. Ltd, Changsha, China
| | - Shiwang Liu
- School of Biological & Chemical Engineering, Zhejiang University of Science & Technology, Hangzhou, China
| |
Collapse
|
4
|
Shukla HG, Chakraborty M, Emerson J. Genetic variation in recalcitrant repetitive regions of the Drosophila melanogaster genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.11.598575. [PMID: 38915508 PMCID: PMC11195212 DOI: 10.1101/2024.06.11.598575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Many essential functions of organisms are encoded in highly repetitive genomic regions, including histones involved in DNA packaging, centromeres that are core components of chromosome segregation, ribosomal RNA comprising the protein translation machinery, telomeres that ensure chromosome integrity, piRNA clusters encoding host defenses against selfish elements, and virtually the entire Y chromosome. These regions, formed by highly similar tandem arrays, pose significant challenges for experimental and informatic study, impeding sequence-level descriptions essential for understanding genetic variation. Here, we report the assembly and variation analysis of such repetitive regions in Drosophila melanogaster, offering significant improvements to the existing community reference assembly. Our work successfully recovers previously elusive segments, including complete reconstructions of the histone locus and the pericentric heterochromatin of the X chromosome, spanning the Stellate locus to the distal flank of the rDNA cluster. To infer structural changes in these regions where alignments are often not practicable, we introduce landmark anchors based on unique variants that are putatively orthologous. These regions display considerable structural variation between different D. melanogaster strains, exhibiting differences in copy number and organization of homologous repeat units between haplotypes. In the histone cluster, although we observe minimal genetic exchange indicative of crossing over, the variation patterns suggest mechanisms such as unequal sister chromatid exchange. We also examine the prevalence and scale of concerted evolution in the histone and Stellate clusters and discuss the mechanisms underlying these observed patterns.
Collapse
Affiliation(s)
- Harsh G. Shukla
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Graduate Program in Mathematical, Computational and Systems Biology, University of California Irvine, Irvine, California 92697, USA
| | - Mahul Chakraborty
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
| | - J.J. Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
5
|
Sun S, Wang S, Yin Y, Yang Y, Wang Y, Zhang J, Wang W. Competitive mechanism of salt-tolerance/degradation-performance of organic pollutant in bacteria: Na +/H + antiporters contribute to salt-stress resistance but impact phenol degradation. WATER RESEARCH 2024; 255:121448. [PMID: 38503180 DOI: 10.1016/j.watres.2024.121448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 03/05/2024] [Accepted: 03/10/2024] [Indexed: 03/21/2024]
Abstract
Phenolic-laden wastewater is typically characterized by its high toxicity and high salinity, imposing serious limits on the application of bioremediation. Although a few halotolerant microorganisms have been reported to degrade phenol, their removal efficiency on high concentrations of phenol remains unsatisfactory. What's more, the deep interaction molecular mechanism of salt-tolerance/phenol-degradation performance has not been clearly revealed. Here, a halotolerant strain Aeribacillus pallidus W-12 employed a meta-pathway to efficiently degrade high concentration of phenol even under high salinity conditions. Investigation of salt-tolerance strategy indicated that four Na+/H+ antiporters, which are widely distributed in bacteria, synergistically endowed the strain with excellent salt adaptability. All these antiporters differentially but positively responded to salinity changes and induction of phenol, forming a synergistic transport effect on salt ions and phenol. In-depth analysis revealed a competitive relationship between salt tolerance and degradation performance, which significantly impaired the degradation efficiency at relatively high salinity. The efficient degradation performance of W-12 under different phenol concentrations and salinity conditions indicated its bioremediation potential for multiple types of phenolic wastewater. Collectively, the competitive mechanism of salt tolerance and degradation performance enlightens a new strategy of introducing or re-constructing Na+/H+ antiporters to further improve bioremediation efficiency of hypersaline organic wastewater.
Collapse
Affiliation(s)
- Shenmei Sun
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China
| | - Shuo Wang
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China
| | - Yalin Yin
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China
| | - Yue Yang
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China
| | - Yijia Wang
- Laboratory of Oncologic Molecular Medicine, Tianjin Union Medical Center, Nankai University, Tianjin 300121, PR China
| | - Jingjing Zhang
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China
| | - Wei Wang
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, TEDA Institute of Biological Sciences and Biotechnology, Nankai University, TEDA, Tianjin 300457, PR China; Tianjin Key Laboratory of Microbial Functional Genomics, Tianjin 300457, PR China.
| |
Collapse
|
6
|
Zhimulev I, Vatolina T, Levitsky V, Tsukanov A. Developmental and Housekeeping Genes: Two Types of Genetic Organization in the Drosophila Genome. Int J Mol Sci 2024; 25:4068. [PMID: 38612878 PMCID: PMC11012173 DOI: 10.3390/ijms25074068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 04/01/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
We developed a procedure for locating genes on Drosophila melanogaster polytene chromosomes and described three types of chromosome structures (gray bands, black bands, and interbands), which differed markedly in morphological and genetic properties. This was reached through the use of our original methods of molecular and genetic analysis, electron microscopy, and bioinformatics data processing. Analysis of the genome-wide distribution of these properties led us to a bioinformatics model of the Drosophila genome organization, in which the genome was divided into two groups of genes. One was constituted by 65, in which the genome was divided into two groups, 62 genes that are expressed in most cell types during life cycle and perform basic cellular functions (the so-called "housekeeping genes"). The other one was made up of 3162 genes that are expressed only at particular stages of development ("developmental genes"). These two groups of genes are so different that we may state that the genome has two types of genetic organization. Different are the timings of their expression, chromatin packaging levels, the composition of activating and deactivating proteins, the sizes of these genes, the lengths of their introns, the organization of the promoter regions of the genes, the locations of origin recognition complexes (ORCs), and DNA replication timings.
Collapse
Affiliation(s)
- Igor Zhimulev
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Tatyana Vatolina
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Victor Levitsky
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| | - Anton Tsukanov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| |
Collapse
|
7
|
Zheng J, Guo N, Huang Y, Guo X, Wagner A. High temperature delays and low temperature accelerates evolution of a new protein phenotype. Nat Commun 2024; 15:2495. [PMID: 38553445 PMCID: PMC10980763 DOI: 10.1038/s41467-024-46332-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 02/19/2024] [Indexed: 04/02/2024] Open
Abstract
Since the origin of life, temperatures on earth have fluctuated both on short and long time scales. How such changes affect the rate at which Darwinian evolution can bring forth new phenotypes remains unclear. On the one hand, high temperature may accelerate phenotypic evolution because it accelerates most biological processes. On the other hand, it may slow phenotypic evolution, because proteins are usually less stable at high temperatures and therefore less evolvable. Here, to test these hypotheses experimentally, we evolved a green fluorescent protein in E. coli towards the new phenotype of yellow fluorescence at different temperatures. Yellow fluorescence evolved most slowly at high temperature and most rapidly at low temperature, in contradiction to the first hypothesis. Using high-throughput population sequencing, protein engineering, and biochemical assays, we determined that this is due to the protein-destabilizing effect of neofunctionalizing mutations. Destabilization is highly detrimental at high temperature, where neofunctionalizing mutations cannot be tolerated. Their detrimental effects can be mitigated through excess stability at low temperature, leading to accelerated adaptive evolution. By modifying protein folding stability, temperature alters the accessibility of mutational paths towards high-fitness genotypes. Our observations have broad implications for our understanding of how temperature changes affect evolutionary adaptations and innovations.
Collapse
Affiliation(s)
- Jia Zheng
- Zhejiang Key Laboratory of Structural Biology, School of Life Sciences, Westlake University, Hangzhou, China.
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China.
- Institute of Biology, Westlake Institute for Advanced Study, Hangzhou, China.
| | - Ning Guo
- Zhejiang Key Laboratory of Structural Biology, School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
- Institute of Biology, Westlake Institute for Advanced Study, Hangzhou, China
| | - Yuxiang Huang
- Zhejiang Key Laboratory of Structural Biology, School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
- Institute of Biology, Westlake Institute for Advanced Study, Hangzhou, China
| | - Xiang Guo
- Zhejiang Key Laboratory of Structural Biology, School of Life Sciences, Westlake University, Hangzhou, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China
- Institute of Biology, Westlake Institute for Advanced Study, Hangzhou, China
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
- The Santa Fe Institute, Santa Fe, USA.
| |
Collapse
|
8
|
Zhou X, Yao Q, Li N, Xia M, Deng Y. Multi-Omics Strategies to Investigate the Biodegradation of Hexahydro-1,3,5-trinitro-1,3,5-triazine in Rhodococcus sp. Strain DN22. Microorganisms 2023; 12:76. [PMID: 38257903 PMCID: PMC10820124 DOI: 10.3390/microorganisms12010076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 12/25/2023] [Accepted: 12/26/2023] [Indexed: 01/24/2024] Open
Abstract
Hexahydro-1,3,5-trinitro-1,3,5-triazine (RDX) is an energetic and persistent explosive with long-lasting properties. Rhodococcus sp. strain DN22 has been discovered to be a microbe capable of degrading RDX. Herein, the complete genome of Rhodococcus sp. strain DN22 was sequenced and analyzed. The entire sequences of genes that encoded the two proteins participating in RDX degradation in Rhodococcus sp. strain DN22 were obtained, and were validated through proteomic data. In addition, few studies have investigated the physiological changes and metabolic pathways occurring within Rhodococcus sp. cells when treated with RDX, particularly through mass spectrometry-based omics. Hence, proteomic and metabolomic analyses were carried out on Rhodococcus sp. strain DN22 with the existence or lack of RDX in the medium. A total of 3186 proteins were identified between the two groups, with 115 proteins being significantly differentially expressed proteins. There were 1056 metabolites identified in total, among which 130 metabolites were significantly different. Through the combined analysis of differential proteomics and metabolomics, KEGG pathways including two-component system, ABC transporters, alanine, aspartate and glutamate metabolism, arginine biosynthesis, purine metabolism, nitrogen metabolism, and phosphotransferase system (PTS), were observed to be significantly enriched. These findings provided ponderable perspectives on the physiological alterations and metabolic pathways in Rhodococcus sp. strain DN22, responding to the existence or lack of RDX. This study is anticipated to expand the knowledge of Rhodococcus sp. strain DN22, as well as advancing understanding of microbial degradation.
Collapse
Affiliation(s)
- Xiangzhe Zhou
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China; (X.Z.)
| | - Qifa Yao
- School of Materials Science and Engineering, Beijing Institute of Technology, Beijing 100081, China
| | - Nuomin Li
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China; (X.Z.)
| | - Min Xia
- School of Materials Science and Engineering, Beijing Institute of Technology, Beijing 100081, China
| | - Yulin Deng
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China; (X.Z.)
| |
Collapse
|
9
|
Jagtap SS, Liu JJ, Walukiewicz HE, Riley R, Ahrendt S, Koriabine M, Cobaugh K, Salamov A, Yoshinaga Y, Ng V, Daum C, Grigoriev IV, Slininger PJ, Dien BS, Jin YS, Rao CV. Draft genome sequence of Yarrowia lipolytica NRRL Y-64008, an oleaginous yeast capable of growing on lignocellulosic hydrolysates. Microbiol Resour Announc 2023; 12:e0043523. [PMID: 37982613 PMCID: PMC10720525 DOI: 10.1128/mra.00435-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 10/10/2023] [Indexed: 11/21/2023] Open
Abstract
Yarrowia lipolytica is an oleaginous yeast that produces high titers of fatty acid-derived biofuels and biochemicals. It can grow on hydrophobic carbon sources and lignocellulosic hydrolysates. The genome sequence of Y. lipolytica NRRL Y-64008 is reported to aid in its development as a biotechnological chassis for producing biofuels and bioproducts.
Collapse
Affiliation(s)
- Sujit Sadashiv Jagtap
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Jing-Jing Liu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Hanna E. Walukiewicz
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Robert Riley
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Steven Ahrendt
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Maxim Koriabine
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Kelly Cobaugh
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Asaf Salamov
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Yuko Yoshinaga
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Vivian Ng
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Chris Daum
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Igor V. Grigoriev
- Lawrence Berkeley National Laboratory, US Department of Energy Joint Genome Institute, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| | - Patricia J. Slininger
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Bruce S. Dien
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Yong-Su Jin
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- Department of Food Science and Nutrition, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| | - Christopher V. Rao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana–Champaign, Urbana, Illinois, USA
| |
Collapse
|
10
|
Jagtap SS, Liu JJ, Walukiewicz HE, Pangilinan J, Lipzen A, Ahrendt S, Koriabine M, Cobaugh K, Salamov A, Yoshinaga Y, Ng V, Daum C, Grigoriev IV, Slininger PJ, Dien BS, Jin YS, Rao CV. Near-complete genome sequence of Lipomyces tetrasporous NRRL Y-64009, an oleaginous yeast capable of growing on lignocellulosic hydrolysates. Microbiol Resour Announc 2023; 12:e0042623. [PMID: 37906027 PMCID: PMC10652991 DOI: 10.1128/mra.00426-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 09/11/2023] [Indexed: 11/02/2023] Open
Abstract
Lipomyces tetrasporous is an oleaginous yeast that can utilize a variety of plant-based sugars. It accumulates lipids during growth on lignocellulosic biomass hydrolysates. We present the annotated genome sequence of L. tetrasporous NRRL Y-64009 to aid in its development as a platform organism for producing lipids and lipid-based bioproducts.
Collapse
Affiliation(s)
- Sujit Sadashiv Jagtap
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jing-Jing Liu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Hanna E. Walukiewicz
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jasmyn Pangilinan
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Anna Lipzen
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Steven Ahrendt
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Maxim Koriabine
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Kelly Cobaugh
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Asaf Salamov
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Yuko Yoshinaga
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Vivian Ng
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Chris Daum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Igor V. Grigoriev
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, USA
| | - Patricia J. Slininger
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Bruce S. Dien
- Bioenergy Research Unit, National Center for Agricultural Utilization Research, USDA-ARS, Peoria, Illinois, USA
| | - Yong-Su Jin
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- Department of Food Science and Nutrition, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Christopher V. Rao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
11
|
A X, Yang Y, Chen X, Tang C, Zhang F, Dong C, Wang B, Liu P, Dai L. Complete Genome Resource of a Hypervirulent Xanthomonas oryzae pv. oryzae Strain YNCX Isolated from Yunnan Plateau Japonica Rice. PLANT DISEASE 2023; 107:3623-3626. [PMID: 37189043 DOI: 10.1094/pdis-04-23-0674-a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Xanthomonas oryzae pv. oryzae (Xoo), the causal agent of bacterial leaf blight (BLB), is one of the most destructive bacterial pathogens in rice production worldwide. Although several complete genome sequences of Xoo strains have been released in public databases, they are mainly isolated from low-altitude indica rice cultivating areas. Here, a hypervirulent strain, YNCX, isolated from the high-altitude japonica rice-growing region in Yunnan Plateau, was used to extract genomic DNA for PacBio sequencing and Illumina sequencing. After assembly, a high-quality complete genome consisting of a circular chromosome and six plasmids was generated. The genome sequence of YNCX provides a valuable resource for high-altitude races and enables the identification of new virulence TALE effectors, contributing to a better understanding of rice-Xoo interactions.
Collapse
Affiliation(s)
- Xinxiang A
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Yayun Yang
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Xifeng Chen
- College of Life Sciences, Zhejiang Normal University, Jinhua 321004, Zhejiang, China
| | - Cuifeng Tang
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Feifei Zhang
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Chao Dong
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Bin Wang
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| | - Pengcheng Liu
- College of Life Sciences, Zhejiang Normal University, Jinhua 321004, Zhejiang, China
| | - Luyuan Dai
- Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences/Yunnan Provincial Key Lab of Agricultural Biotechnology/Key Laboratory of Southwestern Crop Gene Resources and Germplasm Innovation, Ministry of Agriculture and Rural Affairs/Scientific Observation Station for Rice Germplasm Resources of Yunnan, Kunming 650223, Yunnan, China
| |
Collapse
|
12
|
Hu J, Tang YX, Zhou Y, Li Z, Rao B, Zhang GJ. Improving DNA 6mA Site Prediction via Integrating Bidirectional Long Short-Term Memory, Convolutional Neural Network, and Self-Attention Mechanism. J Chem Inf Model 2023; 63:5689-5700. [PMID: 37603823 DOI: 10.1021/acs.jcim.3c00698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
Identifying DNA N6-methyladenine (6mA) sites is significantly important to understanding the function of DNA. Many deep learning-based methods have been developed to improve the performance of 6mA site prediction. In this study, to further improve the performance of 6mA site prediction, we propose a new meta method, called Co6mA, to integrate bidirectional long short-term memory (BiLSTM), convolutional neural networks (CNNs), and self-attention mechanisms (SAM) via assembling two different deep learning-based models. The first model developed in this study is called CBi6mA, which is composed of CNN, BiLSTM, and fully connected modules. The second model is borrowed from LA6mA, which is an existing 6mA prediction method based on BiLSTM and SAM modules. Experimental results on two independent testing sets of different model organisms, i.e., Arabidopsis thaliana and Drosophila melanogaster, demonstrate that Co6mA can achieve an average accuracy of 91.8%, covering 89% of all 6mA samples while achieving an average Matthews correlation coefficient value (0.839), which is higher than the second-best method DeepM6A.
Collapse
Affiliation(s)
- Jun Hu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Xuan Tang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu Zhou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Zhe Li
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Bing Rao
- School of Information and Electrical Engineering, Hangzhou City University, Hangzhou City University, Hangzhou 310015, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
13
|
Bao C, Li M, Zhao X, Shi J, Liu Y, Zhang N, Zhou Y, Ma J, Chen G, Zhang S, Chen H. Mining of key genes for cold adaptation from Pseudomonas fragi D12 and analysis of its cold-adaptation mechanism. Front Microbiol 2023; 14:1215837. [PMID: 37485517 PMCID: PMC10358777 DOI: 10.3389/fmicb.2023.1215837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 06/21/2023] [Indexed: 07/25/2023] Open
Abstract
The psychrotroph Pseudomonas fragi D12, which grew strongly under low temperatures, was screened from tundra soil collected from the permanent alpine zone on Changbai Mountain. To mine the genes critical for cold tolerance and to investigate the cold-adaptation mechanism, whole-genome sequencing, comparative genomic analysis, and transcriptome analysis were performed with P. fragi. A total of 124 potential cold adaptation genes were identified, including nineteen unique cold-adaptive genes were detected in the genome of P. fragi D12. Three unique genes associated with pili protein were significantly upregulated at different degrees of low temperature, which may be the key to the strong low-temperature adaptability of P. fragi D12. Meanwhile, we were pleasantly surprised to find that Pseudomonas fragi D12 exhibited different cold-adaptation mechanisms under different temperature changes. When the temperature declined from 30°C to 15°C, the response included maintenance of the fluidity of cell membranes, increased production of extracellular polymers, elevation in the content of compatibility solutes, and reduction in the content of reactive oxygen species, thereby providing a stable metabolic environment. When the temperature decreased from 15°C to 4°C, the response mainly included increases in the expression of molecular chaperones and transcription factors, enabling the bacteria to restore normal transcription and translation. The response mechanism of P. fragi D12 to low-temperature exposure is discussed. The results provide new ideas for the cold-adaptation mechanism of cold-tolerant microorganisms.
Collapse
Affiliation(s)
- Changjie Bao
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Muzi Li
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Xuhui Zhao
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Jia Shi
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Yehui Liu
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Na Zhang
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Yuqi Zhou
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Jie Ma
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Guang Chen
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| | - Sitong Zhang
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
- Key Laboratory of Mollisols Agroecology, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, China
| | - Huan Chen
- Key Laboratory of Straw Comprehensive Utilization and Black Soil Conservation, Ministry of Education, Changchun, China
- College of Life Science, Jilin Agricultural University, Changchun, China
| |
Collapse
|
14
|
ONT-Based Alternative Assemblies Impact on the Annotations of Unique versus Repetitive Features in the Genome of a Romanian Strain of Drosophila melanogaster. Int J Mol Sci 2022; 23:ijms232314892. [PMID: 36499217 PMCID: PMC9741293 DOI: 10.3390/ijms232314892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 11/21/2022] [Accepted: 11/24/2022] [Indexed: 11/29/2022] Open
Abstract
To date, different strategies of whole-genome sequencing (WGS) have been developed in order to understand the genome structure and functions. However, the analysis of genomic sequences obtained from natural populations is challenging and the biological interpretation of sequencing data remains the main issue. The MinION device developed by Oxford Nanopore Technologies (ONT) is able to generate long reads with minimal costs and time requirements. These valuable assets qualify it as a suitable method for performing WGS, especially in small laboratories. The long reads resulted using this sequencing approach can cover large structural variants and repetitive sequences commonly present in the genomes of eukaryotes. Using MinION, we performed two WGS assessments of a Romanian local strain of Drosophila melanogaster, referred to as Horezu_LaPeri (Horezu). In total, 1,317,857 reads with a size of 8.9 gigabytes (Gb) were generated. Canu and Flye de novo assembly tools were employed to obtain four distinct assemblies with both unfiltered and filtered reads, achieving maximum reference genome coverages of 94.8% (Canu) and 91.4% (Flye). In order to test the quality of these assemblies, we performed a two-step evaluation. Firstly, we considered the BUSCO scores and inquired for a supplemental set of genes using BLAST. Subsequently, we appraised the total content of natural transposons (NTs) relative to the reference genome (ISO1 strain) and mapped the mdg1 retroelement as a resolution assayer. Our results reveal that filtered data provide only slightly enhanced results when considering genes identification, but the use of unfiltered data had a consistent positive impact on the global evaluation of the NTs content. Our comparative studies also revealed differences between Flye and Canu assemblies regarding the annotation of unique versus repetitive genomic features. In our hands, Flye proved to be moderately better for gene identification, while Canu clearly outperformed Flye for NTs analysis. Data concerning the NTs content were compared to those obtained with ONT for the D. melanogaster ISO1 strain, revealing that our strategy conducted to better results. Additionally, the parameters of our ONT reads and assemblies are similar to those reported for ONT experiments performed on various model organisms, revealing that our assembly data are appropriate for a proficient annotation of the Horezu genome.
Collapse
|
15
|
Rabanal FA, Gräff M, Lanz C, Fritschi K, Llaca V, Lang M, Carbonell-Bejerano P, Henderson I, Weigel D. Pushing the limits of HiFi assemblies reveals centromere diversity between two Arabidopsis thaliana genomes. Nucleic Acids Res 2022; 50:12309-12327. [PMID: 36453992 PMCID: PMC9757041 DOI: 10.1093/nar/gkac1115] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 09/13/2022] [Accepted: 11/10/2022] [Indexed: 12/05/2022] Open
Abstract
Although long-read sequencing can often enable chromosome-level reconstruction of genomes, it is still unclear how one can routinely obtain gapless assemblies. In the model plant Arabidopsis thaliana, other than the reference accession Col-0, all other accessions de novo assembled with long-reads until now have used PacBio continuous long reads (CLR). Although these assemblies sometimes achieved chromosome-arm level contigs, they inevitably broke near the centromeres, excluding megabases of DNA from analysis in pan-genome projects. Since PacBio high-fidelity (HiFi) reads circumvent the high error rate of CLR technologies, albeit at the expense of read length, we compared a CLR assembly of accession Eyach15-2 to HiFi assemblies of the same sample. The use of five different assemblers starting from subsampled data allowed us to evaluate the impact of coverage and read length. We found that centromeres and rDNA clusters are responsible for 71% of contig breaks in the CLR scaffolds, while relatively short stretches of GA/TC repeats are at the core of >85% of the unfilled gaps in our best HiFi assemblies. Since the HiFi technology consistently enabled us to reconstruct gapless centromeres and 5S rDNA clusters, we demonstrate the value of the approach by comparing these previously inaccessible regions of the genome between the Eyach15-2 accession and the reference accession Col-0.
Collapse
Affiliation(s)
- Fernando A Rabanal
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Maike Gräff
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Christa Lanz
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Katrin Fritschi
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Victor Llaca
- Genomics Technologies, Corteva Agriscience, Johnston, IA 50131, USA
| | - Michelle Lang
- Genomics Technologies, Corteva Agriscience, Johnston, IA 50131, USA
| | - Pablo Carbonell-Bejerano
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Ian Henderson
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
16
|
Srinivas M, O’Sullivan O, Cotter PD, van Sinderen D, Kenny JG. The Application of Metagenomics to Study Microbial Communities and Develop Desirable Traits in Fermented Foods. Foods 2022; 11:3297. [PMID: 37431045 PMCID: PMC9601669 DOI: 10.3390/foods11203297] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Revised: 10/11/2022] [Accepted: 10/19/2022] [Indexed: 11/18/2022] Open
Abstract
The microbial communities present within fermented foods are diverse and dynamic, producing a variety of metabolites responsible for the fermentation processes, imparting characteristic organoleptic qualities and health-promoting traits, and maintaining microbiological safety of fermented foods. In this context, it is crucial to study these microbial communities to characterise fermented foods and the production processes involved. High Throughput Sequencing (HTS)-based methods such as metagenomics enable microbial community studies through amplicon and shotgun sequencing approaches. As the field constantly develops, sequencing technologies are becoming more accessible, affordable and accurate with a further shift from short read to long read sequencing being observed. Metagenomics is enjoying wide-spread application in fermented food studies and in recent years is also being employed in concert with synthetic biology techniques to help tackle problems with the large amounts of waste generated in the food sector. This review presents an introduction to current sequencing technologies and the benefits of their application in fermented foods.
Collapse
Affiliation(s)
- Meghana Srinivas
- Food Biosciences Department, Teagasc Food Research Centre, Moorepark, P61 C996 Cork, Ireland
- APC Microbiome Ireland, University College Cork, T12 CY82 Cork, Ireland
- School of Microbiology, University College Cork, T12 CY82 Cork, Ireland
| | - Orla O’Sullivan
- Food Biosciences Department, Teagasc Food Research Centre, Moorepark, P61 C996 Cork, Ireland
- APC Microbiome Ireland, University College Cork, T12 CY82 Cork, Ireland
- VistaMilk SFI Research Centre, Fermoy, P61 C996 Cork, Ireland
| | - Paul D. Cotter
- Food Biosciences Department, Teagasc Food Research Centre, Moorepark, P61 C996 Cork, Ireland
- APC Microbiome Ireland, University College Cork, T12 CY82 Cork, Ireland
- VistaMilk SFI Research Centre, Fermoy, P61 C996 Cork, Ireland
| | - Douwe van Sinderen
- APC Microbiome Ireland, University College Cork, T12 CY82 Cork, Ireland
- School of Microbiology, University College Cork, T12 CY82 Cork, Ireland
| | - John G. Kenny
- Food Biosciences Department, Teagasc Food Research Centre, Moorepark, P61 C996 Cork, Ireland
- APC Microbiome Ireland, University College Cork, T12 CY82 Cork, Ireland
- VistaMilk SFI Research Centre, Fermoy, P61 C996 Cork, Ireland
| |
Collapse
|
17
|
Near-Complete Genome Sequence of Zygosaccharomyces rouxii NRRL Y-64007, a Yeast Capable of Growing on Lignocellulosic Hydrolysates. Microbiol Resour Announc 2022; 11:e0005022. [PMID: 35442079 PMCID: PMC9119105 DOI: 10.1128/mra.00050-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The halotolerant and osmotolerant yeast Zygosaccharomyces rouxii can produce multiple volatile compounds and has the ability to grow on lignocellulosic hydrolysates. We report the annotated genome sequence of Z. rouxii NRRL Y-64007 to support its development as a platform organism for biofuel and bioproduct production.
Collapse
|
18
|
Patchigolla VS, Mellone BG. Enrichment of Non-B-Form DNA at D. melanogaster Centromeres. Genome Biol Evol 2022; 14:evac054. [PMID: 35441684 PMCID: PMC9070824 DOI: 10.1093/gbe/evac054] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/14/2022] [Indexed: 11/17/2022] Open
Abstract
Centromeres are essential chromosomal regions that mediate the accurate inheritance of genetic information during eukaryotic cell division. Despite their conserved function, centromeres do not contain conserved DNA sequences and are instead epigenetically marked by the presence of the centromere-specific histone H3 variant centromeric protein A. The functional contribution of centromeric DNA sequences to centromere identity remains elusive. Previous work found that dyad symmetries with a propensity to adopt noncanonical secondary DNA structures are enriched at the centromeres of several species. These findings lead to the proposal that noncanonical DNA structures may contribute to centromere specification. Here, we analyze the predicted secondary structures of the recently identified centromere DNA sequences of Drosophila melanogaster. Although dyad symmetries are only enriched on the Y centromere, we find that other types of noncanonical DNA structures, including melted DNA and G-quadruplexes, are common features of all D. melanogaster centromeres. Our work is consistent with previous models suggesting that noncanonical DNA secondary structures may be conserved features of centromeres with possible implications for centromere specification.
Collapse
Affiliation(s)
| | - Barbara G. Mellone
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
19
|
Zhao D, Wang Q, Meng F, Lu F, Bie X, Lu Z, Lu Y. TetR-Type Regulator Lp_2642 Positively Regulates Plantaricin EF Production Based on Genome-Wide Transcriptome Sequencing of Lactiplantibacillus plantarum 163. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:4362-4372. [PMID: 35311254 DOI: 10.1021/acs.jafc.2c00206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Whole-genome and transcriptome sequences of Lactiplantibacillus plantarum 163 are provided. There was one circular chromosome and four circular plasmids, with sizes of 3,131,367; 56,674; 49,140; 43,628; and 36,387 bp, respectively, in L. plantarum 163. The regulator Lp_2642 was selected from the genome data, the overexpression of which increased the transcriptional levels of related genes in plantaricin EF biosynthesis and enhanced plantaricin EF production. Its production was 17.30 mg/L in 163 (Lp_2642), which was 1.29-fold higher than that of the original strain. The regulation mechanism demonstrated that Lp_2642 can bind to three sites of plnA promoter, which enhances its transcription and expression, thereby increasing plantaricin EF production. Amino acids Asn-100, Asn-64, and Thr-69 may play a key role in the binding of Lp_2642. These results provide a novel strategy for mass production of plantaricin EF, which facilitates its large-scale production and application in the agriculture and food industries as a preservative.
Collapse
Affiliation(s)
- Deyin Zhao
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Qian Wang
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Fanqiang Meng
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Fengxia Lu
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiaomei Bie
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Zhaoxin Lu
- College of Food Science and Technology, Nanjing Agricultural University, Nanjing 210095, China
| | - Yingjian Lu
- College of Food Science & Engineering, Nanjing University of Finance and Economics, Nanjing 210023, China
| |
Collapse
|
20
|
Abstract
The cost of maintaining exabytes of data produced by sequencing experiments every year has become a major issue in today's genomic research. In spite of the increasing popularity of third-generation sequencing, the existing algorithms for compressing long reads exhibit a minor advantage over the general-purpose gzip. We present CoLoRd, an algorithm able to reduce the size of third-generation sequencing data by an order of magnitude without affecting the accuracy of downstream analyses.
Collapse
|
21
|
Yu L, Ma X, Wadlington W, Ming R. Identification of structural variation and polymorphisms of a sex co-segregating scaffold in spinach. PLANT REPRODUCTION 2022; 35:19-30. [PMID: 34319458 DOI: 10.1007/s00497-021-00424-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 06/28/2021] [Indexed: 06/13/2023]
Abstract
Spinach is a common vegetable, and dioecy is maintained by a pair of XY sex chromosomes. Due to limited genomic resources and its highly repetitive genome, limited studies were conducted to investigate the genomic landscape of the region near sex-determining loci. In this study, we screened the structure variations (SVs) between Y-linked contigs and a 1.78-Mb X scaffold (Super_scaffold 66), which enabled the development of 12 sex co-segregating DNA markers. These markers were tested in one F1 mapping population and 40 spinach accessions, which comprised 692 individual plants with the strong sex linkage pattern. In addition, we found that Super_scaffold 66 was highly repetitive along with the enriched LTR-RTs insertions and decreased microsatellite distribution compared with the rest genome, which matches extremely low gene density featured by only nine annotated genes. Synteny analysis between Y contigs and Superscaffold_66 revealed a 340-Kb accumulative Y contig (non-continuous) and a 500-Kb X counterpart along with SVs and wide-spread tandem duplications. Among the nine genes, one ABC transporter gene revealed noticeable SVs between Y contig and X counterpart, as an approximate 5-Kb recent Gypsy LTR-RT insertion in the Y-linked allele, but not the X allele. The gene paucity, SVs, and sex-linked polymorphisms attributed to the recombination suppression. We proposed that Super_scaffold 66 is part of the non-recombining region containing the sex determination genes. The spread of 12 sex co-segregating markers from this 1.78 Mb genomic region indicated the existence and expansion of sex determination region during progression of the Y chromosome.
Collapse
Affiliation(s)
- Li'ang Yu
- Department of Plant Biology, University of Illinois At Urbana-Champaign, 1201 West Gregory Drive, Urbana, IL, 61801-3838, USA
| | - Xiaokai Ma
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - William Wadlington
- Department of Plant Biology, University of Illinois At Urbana-Champaign, 1201 West Gregory Drive, Urbana, IL, 61801-3838, USA
| | - Ray Ming
- Department of Plant Biology, University of Illinois At Urbana-Champaign, 1201 West Gregory Drive, Urbana, IL, 61801-3838, USA.
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China.
| |
Collapse
|
22
|
Chang CH, Gregory LE, Gordon KE, Meiklejohn CD, Larracuente AM. Unique structure and positive selection promote the rapid divergence of Drosophila Y chromosomes. eLife 2022; 11:e75795. [PMID: 34989337 PMCID: PMC8794474 DOI: 10.7554/elife.75795] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 12/18/2021] [Indexed: 02/06/2023] Open
Abstract
Y chromosomes across diverse species convergently evolve a gene-poor, heterochromatic organization enriched for duplicated genes, LTR retrotransposons, and satellite DNA. Sexual antagonism and a loss of recombination play major roles in the degeneration of young Y chromosomes. However, the processes shaping the evolution of mature, already degenerated Y chromosomes are less well-understood. Because Y chromosomes evolve rapidly, comparisons between closely related species are particularly useful. We generated de novo long-read assemblies complemented with cytological validation to reveal Y chromosome organization in three closely related species of the Drosophila simulans complex, which diverged only 250,000 years ago and share >98% sequence identity. We find these Y chromosomes are divergent in their organization and repetitive DNA composition and discover new Y-linked gene families whose evolution is driven by both positive selection and gene conversion. These Y chromosomes are also enriched for large deletions, suggesting that the repair of double-strand breaks on Y chromosomes may be biased toward microhomology-mediated end joining over canonical non-homologous end-joining. We propose that this repair mechanism contributes to the convergent evolution of Y chromosome organization across organisms.
Collapse
Affiliation(s)
- Ching-Ho Chang
- Department of Biology, University of RochesterRochesterUnited States
| | - Lauren E Gregory
- Department of Biology, University of RochesterRochesterUnited States
| | - Kathleen E Gordon
- School of Biological Sciences, University of Nebraska-LincolnLincolnUnited States
| | - Colin D Meiklejohn
- School of Biological Sciences, University of Nebraska-LincolnLincolnUnited States
| | | |
Collapse
|
23
|
Marques A, Hufnagel B, Soriano A, Péret B. The Highly Repeat-Diverse (Peri) Centromeres of White Lupin ( Lupinus albus L.). FRONTIERS IN PLANT SCIENCE 2022; 13:862079. [PMID: 35449890 PMCID: PMC9016224 DOI: 10.3389/fpls.2022.862079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 03/03/2022] [Indexed: 05/03/2023]
Abstract
Plant genomes are known to be mainly composed of repetitive DNA sequences. Regardless of the non-genic function of these sequences, they are important for chromosome structure and stability during cell-cycle. Based on the recent available whole-genome assembly of white lupin (Lupinus albus L.; WL), we have in silico annotated and in situ mapped the main classes of DNA repeats identified with RepeatExplorer. A highly diverse and an abundance of satellite DNAs were found representing more than 10 families, where three of them were highly associated with CENH3-immunoprecipitated chromatin. Applying a strategy of several re-hybridization steps with different combinations of satDNA, rDNA, and LTR-RTs probes, we were able to construct a repeat-based chromosome map for the identification of most chromosome pairs. Two families of LTR retrotransposons, Ty1/copia SIRE and Ty3/gypsy Tekay, were highly abundant at pericentromeric regions, while the centromeric retrotransposon of WL (CRWL) from the CRM clade showed strong centromere-specific localization in most chromosomes and was also highly enriched with CENH3-immunoprecipitated chromatin. FISH mapping of repeat DNA showed some incongruences with the reference genome, which can be further used for improving the current version of the genome. Our results demonstrate that despite the relatively small genome of WL, a high diversity of pericentromeric repeats was found, emphasizing the rapid evolution of repeat sequences in plant genomes.
Collapse
|
24
|
Genome-Mediated Methods to Unravel the Native Biogeographical Diversity and Biosynthetic Potential of Trichoderma for Plant Health. Fungal Biol 2022. [DOI: 10.1007/978-981-16-8877-5_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
25
|
Complete Genome Sequences and Genome-Wide Characterization of Trichoderma Biocontrol Agents Provide New Insights into their Evolution and Variation in Genome Organization, Sexual Development, and Fungal-Plant Interactions. Microbiol Spectr 2021; 9:e0066321. [PMID: 34908505 PMCID: PMC8672877 DOI: 10.1128/spectrum.00663-21] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Trichoderma spp. represent one of the most important fungal genera to mankind and in natural environments. The genus harbors prolific producers of wood-decaying enzymes, biocontrol agents against plant pathogens, plant-growth-promoting biofertilizers, as well as model organisms for studying fungal-plant-plant pathogen interactions. Pursuing highly accurate, contiguous, and chromosome-level reference genomes has become a primary goal of fungal research communities. Here, we report the chromosome-level genomic sequences and whole-genome annotation data sets of four strains used as biocontrol agents or biofertilizers (Trichoderma virens Gv29-8, Trichoderma virens FT-333, Trichoderma asperellum FT-101, and Trichoderma atroviride P1). Our results provide comprehensive categorization, correct positioning, and evolutionary detail of both nuclear and mitochondrial genomes, including telomeres, AT-rich blocks, centromeres, transposons, mating-type loci, nuclear-encoded mitochondrial sequences, as well as many new secondary metabolic and carbohydrate-active enzyme gene clusters. We have also identified evolutionarily conserved core genes contributing to plant-fungal interactions, as well as variations potentially linked to key behavioral traits such as sex, genome defense, secondary metabolism, and mycoparasitism. The genomic resources we provide herein significantly extend our knowledge not only of this economically important fungal genus, but also fungal evolution and basic biology in general. IMPORTANCE Telomere-to-telomere and gapless reference genome assemblies are necessary to ensure that all genomic variants are studied and discovered, including centromeres, telomeres, AT-rich blocks, mating type loci, biosynthetic, and metabolic gene clusters. Here, we applied long-range sequencing technologies to determine the near-completed genome sequences of four widely used biocontrol agents or biofertilizers: Trichoderma virens Gv29-8 and FT-333, Trichoderma asperellum FT-101, and Trichoderma atroviride P1. Like those of three Trichoderma reesei wild isolates [QM6a, CBS999.97(MAT1-1) and CBS999.97(MAT1-2)] we reported previously, these four biocontrol agent genomes each contain seven nuclear chromosomes and a circular mitochondrial genome. Substantial intraspecies and intragenus diversities are also discovered, including single nucleotide polymorphisms, chromosome shuffling, as well as genomic relics derived from historical transposition events and repeat-induced point (RIP) mutations.
Collapse
|
26
|
Zhang Y, Liu Y, Xu J, Wang X, Peng X, Song J, Yu DJ. Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites. Brief Bioinform 2021; 22:bbab351. [PMID: 34459479 PMCID: PMC8575024 DOI: 10.1093/bib/bbab351] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/02/2021] [Accepted: 08/09/2021] [Indexed: 11/12/2022] Open
Abstract
DNA N6-methyladenine is an important type of DNA modification that plays important roles in multiple biological processes. Despite the recent progress in developing DNA 6mA site prediction methods, several challenges remain to be addressed. For example, although the hand-crafted features are interpretable, they contain redundant information that may bias the model training and have a negative impact on the trained model. Furthermore, although deep learning (DL)-based models can perform feature extraction and classification automatically, they lack the interpretability of the crucial features learned by those models. As such, considerable research efforts have been focused on achieving the trade-off between the interpretability and straightforwardness of DL neural networks. In this study, we develop two new DL-based models for improving the prediction of N6-methyladenine sites, termed LA6mA and AL6mA, which use bidirectional long short-term memory to respectively capture the long-range information and self-attention mechanism to extract the key position information from DNA sequences. The performance of the two proposed methods is benchmarked and evaluated on the two model organisms Arabidopsis thaliana and Drosophila melanogaster. On the two benchmark datasets, LA6mA achieves an area under the receiver operating characteristic curve (AUROC) value of 0.962 and 0.966, whereas AL6mA achieves an AUROC value of 0.945 and 0.941, respectively. Moreover, an in-depth analysis of the attention matrix is conducted to interpret the important information, which is hidden in the sequence and relevant for 6mA site prediction. The two novel pipelines developed for DNA 6mA site prediction in this work will facilitate a better understanding of the underlying principle of DL-based DNA methylation site prediction and its future applications.
Collapse
Affiliation(s)
- Ying Zhang
- School of Computer Science and Engineering at Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Yan Liu
- School of Computer Science and Engineering at Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Jian Xu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Xiaoyu Wang
- Monash Biomedicine Discovery Institute and the Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Xinxin Peng
- Monash Biomedicine Discovery Institute and the Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and the Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| |
Collapse
|
27
|
Tedersoo L, Albertsen M, Anslan S, Callahan B. Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology. Appl Environ Microbiol 2021; 87:e0062621. [PMID: 34132589 PMCID: PMC8357291 DOI: 10.1128/aem.00626-21] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Short-read, high-throughput sequencing (HTS) methods have yielded numerous important insights into microbial ecology and function. Yet, in many instances short-read HTS techniques are suboptimal, for example, by providing insufficient phylogenetic resolution or low integrity of assembled genomes. Single-molecule and synthetic long-read (SLR) HTS methods have successfully ameliorated these limitations. In addition, nanopore sequencing has generated a number of unique analysis opportunities, such as rapid molecular diagnostics and direct RNA sequencing, and both Pacific Biosciences (PacBio) and nanopore sequencing support detection of epigenetic modifications. Although initially suffering from relatively low sequence quality, recent advances have greatly improved the accuracy of long-read sequencing technologies. In spite of great technological progress in recent years, the long-read HTS methods (PacBio and nanopore sequencing) are still relatively costly, require large amounts of high-quality starting material, and commonly need specific solutions in various analysis steps. Despite these challenges, long-read sequencing technologies offer high-quality, cutting-edge alternatives for testing hypotheses about microbiome structure and functioning as well as assembly of eukaryote genomes from complex environmental DNA samples.
Collapse
Affiliation(s)
- Leho Tedersoo
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
| | - Mads Albertsen
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Sten Anslan
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
- Braunschweig University of Technology, Zoological Institute, Braunschweig, Germany
| | - Benjamin Callahan
- Department of Population Health and Pathobiology, College of Veterinary Medicine and Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
28
|
Kim BY, Wang JR, Miller DE, Barmina O, Delaney E, Thompson A, Comeault AA, Peede D, D'Agostino ERR, Pelaez J, Aguilar JM, Haji D, Matsunaga T, Armstrong EE, Zych M, Ogawa Y, Stamenković-Radak M, Jelić M, Veselinović MS, Tanasković M, Erić P, Gao JJ, Katoh TK, Toda MJ, Watabe H, Watada M, Davis JS, Moyle LC, Manoli G, Bertolini E, Košťál V, Hawley RS, Takahashi A, Jones CD, Price DK, Whiteman N, Kopp A, Matute DR, Petrov DA. Highly contiguous assemblies of 101 drosophilid genomes. eLife 2021; 10:e66405. [PMID: 34279216 PMCID: PMC8337076 DOI: 10.7554/elife.66405] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 07/16/2021] [Indexed: 12/13/2022] Open
Abstract
Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.
Collapse
Affiliation(s)
- Bernard Y Kim
- Department of Biology, Stanford UniversityStanfordUnited States
| | - Jeremy R Wang
- Department of Genetics, University of North CarolinaChapel HillUnited States
| | - Danny E Miller
- Department of Pediatrics, Division of Genetic Medicine, University of Washington and Seattle Children’s HospitalSeattleUnited States
| | - Olga Barmina
- Department of Evolution and Ecology, University of California DavisDavisUnited States
| | - Emily Delaney
- Department of Evolution and Ecology, University of California DavisDavisUnited States
| | - Ammon Thompson
- Department of Evolution and Ecology, University of California DavisDavisUnited States
| | - Aaron A Comeault
- School of Natural Sciences, Bangor UniversityBangorUnited Kingdom
| | - David Peede
- Biology Department, University of North CarolinaChapel HillUnited States
| | | | - Julianne Pelaez
- Department of Integrative Biology, University of California, BerkeleyBerkeleyUnited States
| | - Jessica M Aguilar
- Department of Integrative Biology, University of California, BerkeleyBerkeleyUnited States
| | - Diler Haji
- Department of Integrative Biology, University of California, BerkeleyBerkeleyUnited States
| | - Teruyuki Matsunaga
- Department of Integrative Biology, University of California, BerkeleyBerkeleyUnited States
| | | | - Molly Zych
- Molecular and Cellular Biology Program, University of WashingtonSeattleUnited States
| | - Yoshitaka Ogawa
- Department of Biological Sciences, Tokyo Metropolitan UniversityHachiojiJapan
| | | | - Mihailo Jelić
- Faculty of Biology, University of BelgradeBelgradeSerbia
| | | | - Marija Tanasković
- University of Belgrade, Institute for Biological Research "Siniša Stanković", National Institute of Republic of SerbiaBelgradeSerbia
| | - Pavle Erić
- University of Belgrade, Institute for Biological Research "Siniša Stanković", National Institute of Republic of SerbiaBelgradeSerbia
| | - Jian-Jun Gao
- School of Ecology and Environmental Science, Yunnan UniversityKunmingChina
| | - Takehiro K Katoh
- School of Ecology and Environmental Science, Yunnan UniversityKunmingChina
| | | | - Hideaki Watabe
- Biological Laboratory, Sapporo College, Hokkaido University of EducationSapporoJapan
| | - Masayoshi Watada
- Graduate School of Science and Engineering, Ehime UniversityMatsuyamaJapan
| | - Jeremy S Davis
- Department of Biology, University of KentuckyLexingtonUnited States
| | - Leonie C Moyle
- Department of Biology, Indiana UniversityBloomingtonUnited States
| | - Giulia Manoli
- Neurobiology and Genetics, Theodor Boveri Institute, Biocentre, University of WürzburgWürzburgGermany
| | - Enrico Bertolini
- Neurobiology and Genetics, Theodor Boveri Institute, Biocentre, University of WürzburgWürzburgGermany
| | - Vladimír Košťál
- Institute of Entomology, Biology Centre, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - R Scott Hawley
- Department of Molecular and Integrative Physiology, University of Kansas Medical Center, Stowers Institute for Medical ResearchKansas CityUnited States
| | - Aya Takahashi
- Department of Biological Sciences, Tokyo Metropolitan UniversityHachiojiJapan
| | - Corbin D Jones
- Biology Department, University of North CarolinaChapel HillUnited States
| | - Donald K Price
- School of Life Science, University of NevadaLas VegasUnited States
| | - Noah Whiteman
- Department of Integrative Biology, University of California, BerkeleyBerkeleyUnited States
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California DavisDavisUnited States
| | - Daniel R Matute
- Biology Department, University of North CarolinaChapel HillUnited States
| | - Dmitri A Petrov
- Department of Biology, Stanford UniversityStanfordUnited States
| |
Collapse
|
29
|
Zheng J, Bratulic S, Lischer HEL, Wagner A. Mistranslation can promote the exploration of alternative evolutionary trajectories in enzyme evolution. J Evol Biol 2021; 34:1302-1315. [PMID: 34145657 PMCID: PMC8457080 DOI: 10.1111/jeb.13892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 05/19/2021] [Accepted: 06/16/2021] [Indexed: 12/01/2022]
Abstract
Darwinian evolution preferentially follows mutational pathways whose individual steps increase fitness. Alternative pathways with mutational steps that do not increase fitness are less accessible. Here, we show that mistranslation, the erroneous incorporation of amino acids into nascent proteins, can increase the accessibility of such alternative pathways and, ultimately, of high fitness genotypes. We subject populations of the beta‐lactamase TEM‐1 to directed evolution in Escherichia coli under both low‐ and high‐mistranslation rates, selecting for high activity on the antibiotic cefotaxime. Under low mistranslation rates, different evolving TEM‐1 populations ascend the same high cefotaxime‐resistance peak, which requires three canonical DNA mutations. In contrast, under high mistranslation rates they ascend three different high cefotaxime‐resistance genotypes, which leads to higher genotypic diversity among populations. We experimentally reconstruct the adaptive DNA mutations and the potential evolutionary paths to these high cefotaxime‐resistance genotypes. This reconstruction shows that some of the DNA mutations do not change fitness under low mistranslation, but cause a significant increase in fitness under high‐mistranslation, which helps increase the accessibility of different high cefotaxime‐resistance genotypes. In addition, these mutations form a network of pairwise epistatic interactions that leads to mutually exclusive evolutionary trajectories towards different high cefotaxime‐resistance genotypes. Our observations demonstrate that protein mistranslation and the phenotypic mutations it causes can alter the evolutionary exploration of fitness landscapes and reduce the predictability of evolution.
Collapse
Affiliation(s)
- Jia Zheng
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | | | - Heidi E L Lischer
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland.,The Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
30
|
Bulankova P, Sekulić M, Jallet D, Nef C, van Oosterhout C, Delmont TO, Vercauteren I, Osuna-Cruz CM, Vancaester E, Mock T, Sabbe K, Daboussi F, Bowler C, Vyverman W, Vandepoele K, De Veylder L. Mitotic recombination between homologous chromosomes drives genomic diversity in diatoms. Curr Biol 2021; 31:3221-3232.e9. [PMID: 34102110 DOI: 10.1016/j.cub.2021.05.013] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Revised: 03/17/2021] [Accepted: 05/06/2021] [Indexed: 01/31/2023]
Abstract
Diatoms, an evolutionarily successful group of microalgae, display high levels of intraspecific genetic variability in natural populations. However, the contribution of various mechanisms generating such diversity is unknown. Here we estimated the genetic micro-diversity within a natural diatom population and mapped the genomic changes arising within clonally propagated diatom cell cultures. Through quantification of haplotype diversity by next-generation sequencing and amplicon re-sequencing of selected loci, we documented a rapid accumulation of multiple haplotypes accompanied by the appearance of novel protein variants in cell cultures initiated from a single founder cell. Comparison of the genomic changes between mother and daughter cells revealed copy number variation and copy-neutral loss of heterozygosity leading to the fixation of alleles within individual daughter cells. The loss of heterozygosity can be accomplished by recombination between homologous chromosomes. To test this hypothesis, we established an endogenous readout system and estimated that the frequency of interhomolog mitotic recombination was under standard growth conditions 4.2 events per 100 cell divisions. This frequency is increased under environmental stress conditions, including treatment with hydrogen peroxide and cadmium. These data demonstrate that copy number variation and mitotic recombination between homologous chromosomes underlie clonal variability in diatom populations. We discuss the potential adaptive evolutionary benefits of the plastic response in the interhomolog mitotic recombination rate, and we propose that this may have contributed to the ecological success of diatoms.
Collapse
Affiliation(s)
- Petra Bulankova
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium.
| | - Mirna Sekulić
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Denis Jallet
- TBI, Université de Toulouse, CNRS, INRAE, INSA, 135 Avenue de Rangueil, 31077 Toulouse, France
| | - Charlotte Nef
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, PSL Université Paris, 75005 Paris, France
| | - Cock van Oosterhout
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Tom O Delmont
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91000 Evry, France
| | - Ilse Vercauteren
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Cristina Maria Osuna-Cruz
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Emmelien Vancaester
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Thomas Mock
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Koen Sabbe
- Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Fayza Daboussi
- TBI, Université de Toulouse, CNRS, INRAE, INSA, 135 Avenue de Rangueil, 31077 Toulouse, France
| | - Chris Bowler
- Institut de Biologie de l'École Normale Supérieure (IBENS), École Normale Supérieure, CNRS, INSERM, PSL Université Paris, 75005 Paris, France
| | - Wim Vyverman
- Protistology and Aquatic Ecology, Department of Biology, Ghent University, 9000 Ghent, Belgium
| | - Klaas Vandepoele
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| | - Lieven De Veylder
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium.
| |
Collapse
|
31
|
High-Quality Reference Genome Sequence for the Oomycete Vegetable Pathogen Phytophthora capsici Strain LT1534. Microbiol Resour Announc 2021; 10:e0029521. [PMID: 34042486 PMCID: PMC8201633 DOI: 10.1128/mra.00295-21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The oomycete Phytophthora capsici is a destructive pathogen of a wide range of vegetable hosts, especially peppers and cucurbits. A 94.17-Mb genome assembly was constructed using PacBio and Illumina data and annotated with support from transcriptome sequencing (RNA-Seq) reads.
Collapse
|
32
|
Maltose effective improving production and regulatory biosynthesis of plantaricin EF in Lactobacillus plantarum 163. Appl Microbiol Biotechnol 2021; 105:2713-2723. [PMID: 33710357 DOI: 10.1007/s00253-021-11218-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 02/22/2021] [Accepted: 03/02/2021] [Indexed: 10/21/2022]
Abstract
Plantaricin EF, a kind of natural antibacterial substance, has shown inhibitory effect on most pathogen and spoilage microorganisms, which possessed great potential in food preservation. However, the lower production of plantaricin EF has limited its large-scale production and application. In this study, the effect of maltose on plantaricin EF production and its regulation mechanism in Lactobacillus plantarum 163 were investigated. Maltose significantly improved the biomass and plantaricin EF production, which increased by 3.35 and 3.99 times comparing to the control without maltose, respectively. The maximum production of plantaricin E and F in fed-batch fermentation were 10.55 mg/L and 22.94 mg/L, respectively. Besides, qPCR results showed that maltose remarkably improved transcription of plnA, plnB, plnD, plnE, plnF, plnG1 and plnH, and heighten transcription of lamR, lamK, hpk6 and rrp6. These results provided an effective method to enhance plantaricin EF production and revealed a possible regulatory mechanism from transcriptome results that hpk6, rrp6, lamK and lamR were relative to plantaricin EF production. Genes, hpk6 and rrp6, promote transcription of plnG1, whereas lamK and lamR enhance transcription of plnA, plnB and plnD, which increased plantaricin EF production. KEYPOINTS: • Maltose was proved to be effective in promoting the biosynthesis of plantaricin EF. • Maltose promoted the transcription of biosynthesis and secretion genes of plantaricin EF. • Up-regulation of genes lamR, lamK, hpk6 and rrp6 heightened the plantaricin EF production.
Collapse
|
33
|
Cheng QQ, Ouyang Y, Tang ZY, Lao CC, Zhang YY, Cheng CS, Zhou H. Review on the Development and Applications of Medicinal Plant Genomes. FRONTIERS IN PLANT SCIENCE 2021; 12:791219. [PMID: 35003182 PMCID: PMC8732986 DOI: 10.3389/fpls.2021.791219] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 11/23/2021] [Indexed: 05/04/2023]
Abstract
With the development of sequencing technology, the research on medicinal plants is no longer limited to the aspects of chemistry, pharmacology, and pharmacodynamics, but reveals them from the genetic level. As the price of next-generation sequencing technology becomes affordable, and the long-read sequencing technology is established, the medicinal plant genomes with large sizes have been sequenced and assembled more easily. Although the review of plant genomes has been reported several times, there is no review giving a systematic and comprehensive introduction about the development and application of medicinal plant genomes that have been reported until now. Here, we provide a historical perspective on the current situation of genomes in medicinal plant biology, highlight the use of the rapidly developing sequencing technologies, and conduct a comprehensive summary on how the genomes apply to solve the practical problems in medicinal plants, like genomics-assisted herb breeding, evolution history revelation, herbal synthetic biology study, and geoherbal research, which are important for effective utilization, rational use and sustainable protection of medicinal plants.
Collapse
Affiliation(s)
- Qi-Qing Cheng
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Yue Ouyang
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Zi-Yu Tang
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Chi-Chou Lao
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Yan-Yu Zhang
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Chun-Song Cheng
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
- Lushan Botanical Garden, Chinese Academy of Sciences, Jiujiang, China
| | - Hua Zhou
- State Key Laboratory of Quality Research in Chinese Medicine, Faculty of Chinese Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
- Joint Laboratory for Translational Cancer Research of Chinese Medicine, The Ministry of Education of the People’s Republic of China, Macau University of Science and Technology, Taipa, Macao SAR, China
- *Correspondence: Hua Zhou,
| |
Collapse
|
34
|
Zheng J, Guo N, Wagner A. Selection enhances protein evolvability by increasing mutational robustness and foldability. Science 2020; 370:370/6521/eabb5962. [DOI: 10.1126/science.abb5962] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 09/25/2020] [Indexed: 01/14/2023]
Abstract
Natural selection can promote or hinder a population’s evolvability—the ability to evolve new and adaptive phenotypes—but the underlying mechanisms are poorly understood. To examine how the strength of selection affects evolvability, we subjected populations of yellow fluorescent protein to directed evolution under different selection regimes and then evolved them toward the new phenotype of green fluorescence. Populations under strong selection for the yellow phenotype evolved the green phenotype most rapidly. They did so by accumulating mutations that increase both robustness to mutations and foldability. Under weak selection, neofunctionalizing mutations rose to higher frequency at first, but more frequent deleterious mutations undermined their eventual success. Our experiments show how selection can enhance evolvability by enhancing robustness and create the conditions necessary for evolutionary success.
Collapse
Affiliation(s)
- Jia Zheng
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | - Ning Guo
- Zwirnereistrasse 11, Wallisellen, Zurich, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
35
|
Peng Z, Bredeson JV, Wu GA, Shu S, Rawat N, Du D, Parajuli S, Yu Q, You Q, Rokhsar DS, Gmitter FG, Deng Z. A chromosome-scale reference genome of trifoliate orange (Poncirus trifoliata) provides insights into disease resistance, cold tolerance and genome evolution in Citrus. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 104:1215-1232. [PMID: 32985030 PMCID: PMC7756384 DOI: 10.1111/tpj.14993] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 09/17/2020] [Indexed: 05/19/2023]
Abstract
Trifoliate orange (Poncirus trifoliata), a deciduous close relative of evergreen Citrus, has important traits for citrus production, including tolerance/resistance to citrus greening disease (Huanglongbing, HLB) and other major diseases, and cold tolerance. It has been one of the most important rootstocks, and one of the most valuable sources of resistance and tolerance genes for citrus. Here we present a high-quality, chromosome-scale genome assembly of P. trifoliata. The 264.9-Mb assembly contains nine chromosomal pseudomolecules with 25 538 protein-coding genes, covering 97.2% of the estimated gene space. Comparative analyses of P. trifoliata and nine Citrus genomes revealed 605 species-specific genes and six rapidly evolving gene families in the P. trifoliata genome. Poncirus trifoliata has evolved specific adaptation in the C-repeat/DREB binding factor (CBF)-dependent and CBF-independent cold signaling pathways to tolerate cold. We identified candidate genes within quantitative trait loci for HLB tolerance, and at the loci for resistance to citrus tristeza virus and citrus nematode. Genetic diversity analysis of Poncirus accessions and Poncirus/Citrus hybrids shows a narrow genetic base in the US germplasm collection, and points to the importance of collecting and preserving more natural genetic variation. Two phenotypically divergent Poncirus accessions are found to be clonally related, supporting a previous conjecture that dwarf Flying Dragon originated as a mutant of a non-dwarfing type. The high-quality genome reveals features and evolutionary insights of Poncirus, and it will serve as a valuable resource for genetic, genomic and molecular research and manipulation in citrus.
Collapse
Affiliation(s)
- Ze Peng
- Department of Environmental HorticultureGulf Coast Research and Education CenterUniversity of FloridaIFAS14625 County Road 672WimaumaFL33598USA
| | - Jessen V. Bredeson
- Molecular and Cell Biology DepartmentUniversity of California, BerkeleyBerkeleyCA94720USA
| | - Guohong A. Wu
- US Department of Energy Joint Genome InstituteLawrence Berkeley National Lab1 Cyclotron RoadBerkeleyCA94720USA
| | - Shengqiang Shu
- US Department of Energy Joint Genome InstituteLawrence Berkeley National Lab1 Cyclotron RoadBerkeleyCA94720USA
| | - Nidhi Rawat
- Department of Environmental HorticultureGulf Coast Research and Education CenterUniversity of FloridaIFAS14625 County Road 672WimaumaFL33598USA
| | - Dongliang Du
- Citrus Research and Education CenterUniversity of Florida, IFAS700 Experiment Station RdLake AlfredFL33850USA
| | - Saroj Parajuli
- Department of Environmental HorticultureGulf Coast Research and Education CenterUniversity of FloridaIFAS14625 County Road 672WimaumaFL33598USA
| | - Qibin Yu
- Citrus Research and Education CenterUniversity of Florida, IFAS700 Experiment Station RdLake AlfredFL33850USA
| | - Qian You
- Department of Environmental HorticultureGulf Coast Research and Education CenterUniversity of FloridaIFAS14625 County Road 672WimaumaFL33598USA
| | - Daniel S. Rokhsar
- Molecular and Cell Biology DepartmentUniversity of California, BerkeleyBerkeleyCA94720USA
- US Department of Energy Joint Genome InstituteLawrence Berkeley National Lab1 Cyclotron RoadBerkeleyCA94720USA
| | - Frederick G. Gmitter
- Citrus Research and Education CenterUniversity of Florida, IFAS700 Experiment Station RdLake AlfredFL33850USA
| | - Zhanao Deng
- Department of Environmental HorticultureGulf Coast Research and Education CenterUniversity of FloridaIFAS14625 County Road 672WimaumaFL33598USA
| |
Collapse
|
36
|
Jaworski CC, Allan CW, Matzkin LM. Chromosome‐level hybrid de novo genome assemblies as an attainable option for nonmodel insects. Mol Ecol Resour 2020; 20:1277-1293. [DOI: 10.1111/1755-0998.13176] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 03/31/2020] [Accepted: 04/16/2020] [Indexed: 11/27/2022]
Affiliation(s)
- Coline C. Jaworski
- Department of Entomology The University of Arizona Tucson AZ USA
- Univ Avignon CNRS IRD IMBE Aix Marseille Université Marseille France
- Department of Zoology University of Oxford Oxford UK
| | - Carson W. Allan
- Department of Entomology The University of Arizona Tucson AZ USA
| | - Luciano M. Matzkin
- Department of Entomology The University of Arizona Tucson AZ USA
- BIO5 Institute The University of Arizona Tucson AZ USA
- Department of Ecology and Evolutionary Biology The University of Arizona Tucson AZ USA
| |
Collapse
|
37
|
Complete Genome Sequence of Lactobacillus nenjiangensis SH-Y15, Isolated from Sauerkraut. Microbiol Resour Announc 2020; 9:9/23/e01473-19. [PMID: 32499359 PMCID: PMC7272568 DOI: 10.1128/mra.01473-19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We isolated a strain of Lactobacillus nenjiangensis named SH-Y15 from traditional suan-cai used in northeastern China because it has a high capacity for degrading nitrites at low temperatures. The complete genome of SH-Y15 contains a single circular chromosome and a plasmid. The complete length is 2,249,893 bp, and the G+C content is 39.68%. We isolated a strain of Lactobacillus nenjiangensis named SH-Y15 from traditional suan-cai used in northeastern China because it has a high capacity for degrading nitrites at low temperatures. The complete genome of SH-Y15 contains a single circular chromosome and a plasmid. The complete length is 2,249,893 bp, and the G+C content is 39.68%.
Collapse
|
38
|
Liu J, Peng L, Su H, Tang H, Chen D, Xu Z, Wu A. Chromosome and Plasmid Features of Two ST37 Clostridioides difficile Strains Isolated in China Reveal Distinct Multidrug Resistance and Virulence Determinants. Microb Drug Resist 2020; 26:1503-1508. [PMID: 32407161 DOI: 10.1089/mdr.2019.0461] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Clostridioides difficile ST37 is an emerging and prevalent multilocus sequence type and represents a lineage of clinical significance. This study aimed to characterize two epidemic C. difficile ST37 strains, CD161 and CDT4. CD161 acquires a chromosome and two distinct plasmids, pCD161-L, sharing high similarity with Clostridium phage, and pCD161-S, while CDT4 has a chromosome and a plasmid pCDT4 identical to pCD161-S. In the chromosome of both strains, three CdISt1-like elements and a skinCd element, which might influence sporulation, were identified. The multidrug resistance of the strains was due to the mutation in 23S rDNA, gyrA, and gyrB genes and the acquisition of ermB, ant6-Ia, aac6'-aph2'', and tetM genes. In addition, a distinct pathogenicity locus (PaLoc) with truncated tcdA gene represents the genetic feature of ST37 strains. To our knowledge, this is the first complete genome, both chromosomes and plasmids, of epidemic C. difficile ST37 strains in China.
Collapse
Affiliation(s)
- Junyan Liu
- School of Food Science and Engineering, South China University of Technology, Guangzhou, China
| | - Liang Peng
- Department of Clinical Laboratory, the Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Heng Su
- KingMed School of Laboratory Medicine, Guangzhou Medical University, Guangzhou, China
| | - Haixian Tang
- KingMed School of Laboratory Medicine, Guangzhou Medical University, Guangzhou, China
| | - Dingqiang Chen
- Department of Laboratory Medicine, Southern Medical University, Zhujiang Hospital, Guangzhou, China
| | - Zhenbo Xu
- School of Food Science and Engineering, South China University of Technology, Guangzhou, China.,Department of Microbial Pathogenesis, University of Maryland, Baltimore, Maryland, USA.,Guangdong Province Key Laboratory for Green Processing of Natural Products and Product Safety, Guangzhou, China.,Overseas Expertise Introduction Center for Discipline Innovation of Food Nutrition and Human Health (111 Center), Guangzhou, China
| | - Aiwu Wu
- KingMed School of Laboratory Medicine, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
39
|
Zhao N, Li D, Guo BJ, Tao X, Lin X, Yan SZ, Chen SL. Genome Sequencing and Analysis of the Hypocrellin-Producing Fungus Shiraia bambusicola S4201. Front Microbiol 2020; 11:643. [PMID: 32373091 PMCID: PMC7179677 DOI: 10.3389/fmicb.2020.00643] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Accepted: 03/20/2020] [Indexed: 12/29/2022] Open
Abstract
Shiraia bambusicola has long been used as a traditional Chinese medicine and its major medicinal active metabolite is hypocrellin, which exhibits outstanding antiviral and antitumor properties. Here we report the 32 Mb draft genome sequence of S. bambusicola S4201, encoding 11,332 predicted genes. The genome of S. bambusicola is enriched in carbohydrate-active enzymes (CAZy) and pathogenesis-related genes. The phylogenetic tree of S. bambusicola S4201 and nine other sequenced species was constructed and its taxonomic status was supported (Pleosporales, Dothideomycetes). The genome contains a rich set of secondary metabolite biosynthetic gene clusters, suggesting that strain S4201 has a remarkable capacity to produce secondary metabolites. Overexpression of the zinc finger transcription factor zftf, which is involved in hypocrellin A (HA) biosynthesis, increases HA production when compared with wild type. In addition, a new putative HA biosynthetic pathway is proposed. These results provide a framework to study the mechanisms of infection in bamboo and to understand the phylogenetic relationships of S. bambusicola S4201. At the same time, knowledge of the genome sequence may potentially solve the puzzle of HA biosynthesis and lead to the discovery of novel genes and secondary metabolites of importance in medicine and agriculture.
Collapse
Affiliation(s)
- Ning Zhao
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Dan Li
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Bing-Jing Guo
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xin Tao
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xi Lin
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Shu-Zhen Yan
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Shuang-Lin Chen
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| |
Collapse
|
40
|
Wang C, Feng J, Chen Y, Li D, Liu L, Wu Y, Zhang S, Du S, Zhang Y. Revealing mitogenome-wide DNA methylation and RNA editing of three Ascomycotina fungi using SMRT sequencing. Mitochondrion 2020; 51:88-96. [PMID: 31923469 DOI: 10.1016/j.mito.2020.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 12/25/2019] [Accepted: 01/03/2020] [Indexed: 10/25/2022]
Abstract
Beauveria bassiana, Cordyceps militaris and Ophiocordyceps sinensis (Ascomycotina) are traditional Chinese medicines. Here, mitogenomes of these three Ascomycotina fungi were sequenced and de-novo assembled using single-molecule real-time sequencing. The results showed that their complete mitogenomes were 31,258, 31,854 and 157,584 bp, respectively, with sequencing depth approximately 278,760×, 326,283× and 69,385×. Types of repeat sequences were mainly (AA)n, (AAT)n, (TA)n and (TATT)n. DNA methylation motifs were revealed in DNA modifications of these three fungi. We discovered new models of RNA editing through analysis of transcriptomes from B. bassiana and C. militaris. These data lay a solid foundation for further genetic and biological studies about these three fungi, especially for elucidating the mitogenome evolution and exploring the regulatory mechanism of adapting environment.
Collapse
Affiliation(s)
- Chaoxia Wang
- Management Center of Tianjin Modern Agricultural Science and Technology Innovation Base, Tianjin Academy of Agricultural Sciences, Tianjin 300192, China; Tianjin Lakeside Powergene Science Development Co. Ltd, Tianjin 300309, China.
| | - Jianhua Feng
- Tianjin Lakeside Powergene Science Development Co. Ltd, Tianjin 300309, China
| | - Yujiao Chen
- Tuke (Tianjing) Pharmaceutical Technology Co. Ltd, Tianjin 300457, China
| | - Dongmei Li
- Tianjin Lakeside Powergene Science Development Co. Ltd, Tianjin 300309, China
| | - Li Liu
- Tuke (Tianjing) Pharmaceutical Technology Co. Ltd, Tianjin 300457, China
| | - Yuqian Wu
- Gui'an Precision Medicine Academy Co. Ltd, GuiZhou 550029, China
| | - Shujun Zhang
- Research Center of Human Genome, Tianjin University, Tianjin 300309, China
| | - Simiao Du
- Zheng Yuan Tang (Tianjin) Biotechnology Co. Ltd, Tianjin 300457, China
| | - Yaozhou Zhang
- Research Center of Human Genome, Tianjin University, Tianjin 300309, China; Tianjin International Joint Academy of Biomedical, Tianjin 300457, China.
| |
Collapse
|
41
|
Lou F, Song N, Han Z, Gao T. Single-molecule real-time (SMRT) sequencing facilitates Tachypleus tridentatus genome annotation. Int J Biol Macromol 2020; 147:89-97. [PMID: 31923512 DOI: 10.1016/j.ijbiomac.2020.01.029] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 01/04/2020] [Accepted: 01/04/2020] [Indexed: 12/19/2022]
Abstract
Tachypleus tridentatus is a keystone species in marine ecosystems. Its hemolymph also provides the limulus amebocyte lysate (LAL) for detection of bacterial endotoxin in human medical service. Here we combined SMRT sequencing and Illumina RNA-seq to characterize the novel isoforms, novel genetic loci, fusion isoforms formation and transcriptome structure and further to unveil the transcriptome complexity of T. tridentatus. We identified 26,705 non-redundancy isoforms form 10,919 genetic loci, including 25,713 novel isoforms, 2403 novel genes and 170 fusion isoforms. In addition, 1578 novel genes and 23,172 novel isoforms were annotated in the NR, Pfam, KOG, COG, eggNOG, Swiss-Prot, KEGG and GO databases. Meanwhile, we have obtained 4671 gene family clustering based on genetic loci. Furthermore, there are 17,296, 4887, 1054, and 1435 APAs, AS events, lncRNAs, and TFs were identified in the T. tridentatus long-read transcriptome and the target genes of 1054 lncRNA sequences were also predicted. Overall, our work firstly provided the long-read transcriptome and these data are very necessary to improve the annotation information of T. tridentatus genome and optimize the boundaries of 12,342 original reference annotated genes. Furthermore, these information are a potential resource to study LAL secretion mechanisms in T. tridentatus.
Collapse
Affiliation(s)
- Fangrui Lou
- Fishery College, Ocean University of China, Qingdao, Shandong 266003, China; Fishery College, Zhejiang Ocean University, Zhoushan, Zhejiang 316022, China
| | - Na Song
- Fishery College, Ocean University of China, Qingdao, Shandong 266003, China
| | - Zhiqiang Han
- Fishery College, Zhejiang Ocean University, Zhoushan, Zhejiang 316022, China.
| | - Tianxiang Gao
- Fishery College, Zhejiang Ocean University, Zhoushan, Zhejiang 316022, China.
| |
Collapse
|
42
|
High-Quality Draft Genome Sequence and Annotation of the Basidiomycete Yeast Sporisorium graminicola CBS10092, a Producer of Mannosylerythritol Lipids. Microbiol Resour Announc 2019; 8:8/42/e00479-19. [PMID: 31624158 PMCID: PMC6797523 DOI: 10.1128/mra.00479-19] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The basidiomycete Sporisorium graminicola (formally Pseudozyma graminicola) strain CBS10092 was originally isolated from an herbaceous plant in Russia. It is a known producer of mannosylerythritol lipids (MELs), the main component being MEL-C. Here, we present the 19.9-Mb draft genome sequence, which comprises 6,602 genes, including those encoding the MEL biosynthetic pathway. The basidiomycete Sporisorium graminicola (formally Pseudozyma graminicola) strain CBS10092 was originally isolated from an herbaceous plant in Russia. It is a known producer of mannosylerythritol lipids (MELs), the main component being MEL-C. Here, we present the 19.9-Mb draft genome sequence, which comprises 6,602 genes, including those encoding the MEL biosynthetic pathway.
Collapse
|
43
|
Kingan SB, Urban J, Lambert CC, Baybayan P, Childers AK, Coates B, Scheffler B, Hackett K, Korlach J, Geib SM. A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system. Gigascience 2019; 8:giz122. [PMID: 31609423 PMCID: PMC6791401 DOI: 10.1093/gigascience/giz122] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 08/08/2019] [Accepted: 09/17/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female spotted lanternfly (Lycorma delicatula) using a single Pacific Biosciences SMRT Cell. The spotted lanternfly is an invasive species recently discovered in the northeastern United States that threatens to damage economically important crop plants in the region. RESULTS The DNA from 1 individual was used to make 1 standard, size-selected library with an average DNA fragment size of ∼20 kb. The library was run on 1 Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing ∼36× coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Furthermore, it was possible to segregate more than half of the diploid genome into the 2 separate haplotypes. The assembly also recovered 2 microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. CONCLUSIONS We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.
Collapse
Affiliation(s)
- Sarah B Kingan
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | - Julie Urban
- Department of Entomology, 501 ASI Building, The Pennsylvania State University, University Park, PA 16802, USA
| | | | - Primo Baybayan
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | - Anna K Childers
- USDA-ARS, Bee Research Laboratory, 10300 Baltimore Avenue, Building 306, Room 315, BARC-East, Beltsville, MD 20705, USA
| | - Brad Coates
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, 2333 Genetics Laboratory, 819 Wallace Road, Ames, IA 50011, USA
| | - Brian Scheffler
- USDA-ARS, Genomics and Bioinformatics Research, 141 Experiment Station Road, Stoneville, MS 38776, USA
| | - Kevin Hackett
- USDA-ARS, Office of National Programs, George Washington Carver Center, 5601 Sunnyside Avenue, Beltsville, MD 20705, USA
| | - Jonas Korlach
- Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA
| | - Scott M Geib
- USDA-ARS, Daniel K Inouye U.S. Pacific Basin Agricultural Research Center, 64 Nowelo St., Hilo, HI 96720, USA
| |
Collapse
|
44
|
Bongartz P, Schloissnig S. Deep repeat resolution-the assembly of the Drosophila Histone Complex. Nucleic Acids Res 2019; 47:e18. [PMID: 30476267 PMCID: PMC6380962 DOI: 10.1093/nar/gky1194] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Revised: 09/07/2018] [Accepted: 11/12/2018] [Indexed: 12/30/2022] Open
Abstract
Though the advent of long-read sequencing technologies has led to a leap in contiguity of de novo genome assemblies, current reference genomes of higher organisms still do not provide unbroken sequences of complete chromosomes. Despite reads in excess of 30 000 base pairs, there are still repetitive structures that cannot be resolved by current state-of-the-art assemblers. The most challenging of these structures are tandemly arrayed repeats, which occur in the genomes of all eukaryotes. Untangling tandem repeat clusters is exceptionally difficult, since the rare differences between repeat copies are obscured by the high error rate of long reads. Solving this problem would constitute a major step towards computing fully assembled genomes. Here, we demonstrate by example of the Drosophila Histone Complex that via machine learning algorithms, it is possible to exploit the underlying distinguishing patterns of single nucleotide variants of repeats from very noisy data to resolve a large and highly conserved repeat cluster. The ideas explored in this paper are a first step towards the automated assembly of complex repeat structures and promise to be applicable to a wide range of eukaryotic genomes.
Collapse
Affiliation(s)
- Philipp Bongartz
- Heidelberg Institut für Theoretische Studien, Schloß-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany
| | - Siegfried Schloissnig
- Heidelberg Institut für Theoretische Studien, Schloß-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany
| |
Collapse
|
45
|
Zhai Y, Li Y, Zhang J, Zhang Y, Ren F, Zhang X, Liu G, Liu X, Che Y. Identification of the gene cluster for bistropolone-humulene meroterpenoid biosynthesis in Phoma sp. Fungal Genet Biol 2019; 129:7-15. [DOI: 10.1016/j.fgb.2019.04.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 03/15/2019] [Accepted: 04/04/2019] [Indexed: 01/27/2023]
|
46
|
Zheng J, Payne JL, Wagner A. Cryptic genetic variation accelerates evolution by opening access to diverse adaptive peaks. Science 2019; 365:347-353. [DOI: 10.1126/science.aax1837] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Accepted: 06/06/2019] [Indexed: 12/13/2022]
Abstract
Cryptic genetic variation can facilitate adaptation in evolving populations. To elucidate the underlying genetic mechanisms, we used directed evolution in Escherichia coli to accumulate variation in populations of yellow fluorescent proteins and then evolved these proteins toward the new phenotype of green fluorescence. Populations with cryptic variation evolved adaptive genotypes with greater diversity and higher fitness than populations without cryptic variation, which converged on similar genotypes. Populations with cryptic variation accumulated neutral or deleterious mutations that break the constraints on the order in which adaptive mutations arise. In doing so, cryptic variation opens paths to adaptive genotypes, creates historical contingency, and reduces the predictability of evolution by allowing different replicate populations to climb different adaptive peaks and explore otherwise-inaccessible regions of an adaptive landscape.
Collapse
Affiliation(s)
- Jia Zheng
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | - Joshua L. Payne
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
47
|
Hu W, Luo H, Yang Y, Wang Q, Hong N, Wang G, Wang A, Wang L. Comprehensive analysis of full genome sequence and Bd-milRNA/target mRNAs to discover the mechanism of hypovirulence in Botryosphaeria dothidea strains on pear infection with BdCV1 and BdPV1. IMA Fungus 2019; 10:3. [PMID: 32647612 PMCID: PMC7325678 DOI: 10.1186/s43008-019-0008-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 05/06/2019] [Indexed: 11/23/2022] Open
Abstract
Pear ring rot disease, mainly caused by Botryosphaeria dothidea, is widespread in most pear and apple-growing regions. Mycoviruses are used for biocontrol, especially in fruit tree disease. BdCV1 (Botryosphaeria dothidea chrysovirus 1) and BdPV1 (Botryosphaeria dothidea partitivirus 1) influence the biological characteristics of B. dothidea strains. BdCV1 is a potential candidate for the control of fungal disease. Therefore, it is vital to explore interactions between B. dothidea and mycovirus to clarify the pathogenic mechanisms of B. dothidea and hypovirulence of B. dothidea in pear. A high-quality full-length genome sequence of the B. dothidea LW-Hubei isolate was obtained using Single Molecule Real-Time sequencing. It has high repeat sequence with 9.3% and DNA methylation existence in the genome. The 46.34 Mb genomes contained 14,091 predicted genes, which of 13,135 were annotated. B. dothidea was predicted to express 3833 secreted proteins. In bioinformatics analysis, 351 CAZy members, 552 transporters, 128 kinases, and 1096 proteins associated with plant-host interaction (PHI) were identified. RNA-silencing components including two endoribonuclease Dicer, four argonaute (Ago) and three RNA-dependent RNA polymerase (RdRp) molecules were identified and expressed in response to mycovirus infection. Horizontal transfer of the LW-C and LW-P strains indicated that BdCV1 induced host gene silencing in LW-C to suppress BdPV1 transmission. To investigate the role of RNA-silencing in B. dothidea defense, we constructed four small RNA libraries and sequenced B. dothidea micro-like RNAs (Bd-milRNAs) produced in response to BdCV1 and BdPV1 infection. Among these, 167 conserved and 68 candidate novel Bd-milRNAs were identified, of which 161 conserved and 20 novel Bd-milRNA were differentially expressed. WEGO analysis revealed involvement of the differentially expressed Bd-milRNA-targeted genes in metabolic process, catalytic activity, cell process and response to stress or stimulus. BdCV1 had a greater effect on the phenotype, virulence, conidiomata, vertical and horizontal transmission ability, and mycelia cellular structure biological characteristics of B. dothidea strains than BdPV1 and virus-free strains. The results obtained in this study indicate that mycovirus regulates biological processes in B. dothidea through the combined interaction of antiviral defense mediated by RNA-silencing and milRNA-mediated regulation of target gene mRNA expression.
Collapse
Affiliation(s)
- Wangcheng Hu
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Hui Luo
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Yuekun Yang
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Qiong Wang
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Ni Hong
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Guoping Wang
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| | - Aiming Wang
- London Research and Development Centre, Agriculture and Agri-Food Canada, London, ON N5V 4T3 Canada
| | - Liping Wang
- State Key Laboratory of Agricultural Microbiology, Wuhan, Hubei 430070 People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070 People's Republic of China.,Key Lab of Plant Pathology of Hubei Province, Wuhan, Hubei 430070 People's Republic of China
| |
Collapse
|
48
|
Heydari M, Miclotte G, Van de Peer Y, Fostier J. Illumina error correction near highly repetitive DNA regions improves de novo genome assembly. BMC Bioinformatics 2019; 20:298. [PMID: 31159722 PMCID: PMC6545690 DOI: 10.1186/s12859-019-2906-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 05/17/2019] [Indexed: 11/10/2022] Open
Abstract
Background Several standalone error correction tools have been proposed to correct sequencing errors in Illumina data in order to facilitate de novo genome assembly. However, in a recent survey, we showed that state-of-the-art assemblers often did not benefit from this pre-correction step. We found that many error correction tools introduce new errors in reads that overlap highly repetitive DNA regions such as low-complexity patterns or short homopolymers, ultimately leading to a more fragmented assembly. Results We propose BrownieCorrector, an error correction tool for Illumina sequencing data that focuses on the correction of only those reads that overlap short DNA patterns that are highly repetitive in the genome. BrownieCorrector extracts all reads that contain such a pattern and clusters them into different groups using a community detection algorithm that takes into account both the sequence similarity between overlapping reads and their respective paired-end reads. Each cluster holds reads that originate from the same genomic region and hence each cluster can be corrected individually, thus providing a consistent correction for all reads within that cluster. Conclusions BrownieCorrector is benchmarked using six real Illumina datasets for different eukaryotic genomes. The prior use of BrownieCorrector improves assembly results over the use of uncorrected reads in all cases. In comparison with other error correction tools, BrownieCorrector leads to the best assembly results in most cases even though less than 2% of the reads within a dataset are corrected. Additionally, we investigate the impact of error correction on hybrid assembly where the corrected Illumina reads are supplemented with PacBio data. Our results confirm that BrownieCorrector improves the quality of hybrid genome assembly as well. BrownieCorrector is written in standard C++11 and released under GPL license. BrownieCorrector relies on multithreading to take advantage of multi-core/multi-CPU systems. The source code is available at https://github.com/biointec/browniecorrector. Electronic supplementary material The online version of this article (10.1186/s12859-019-2906-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mahdi Heydari
- Department of Information Technology, Ghent University-imec, IDLab, Ghent, B-9052, Belgium.,Bioinformatics Institute Ghent, Ghent, B-9052, Belgium
| | - Giles Miclotte
- Department of Information Technology, Ghent University-imec, IDLab, Ghent, B-9052, Belgium.,Bioinformatics Institute Ghent, Ghent, B-9052, Belgium
| | - Yves Van de Peer
- Bioinformatics Institute Ghent, Ghent, B-9052, Belgium.,Center for Plant Systems Biology, VIB, Ghent, B-9052, Belgium.,Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, B-9052, Belgium.,Department of Genetics, Genome Research Institute, University of Pretoria, Pretoria, South Africa
| | - Jan Fostier
- Department of Information Technology, Ghent University-imec, IDLab, Ghent, B-9052, Belgium. .,Bioinformatics Institute Ghent, Ghent, B-9052, Belgium.
| |
Collapse
|
49
|
Ghurye J, Koren S, Small ST, Redmond S, Howell P, Phillippy AM, Besansky NJ. A chromosome-scale assembly of the major African malaria vector Anopheles funestus. Gigascience 2019; 8:giz063. [PMID: 31157884 PMCID: PMC6545970 DOI: 10.1093/gigascience/giz063] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Revised: 03/28/2019] [Accepted: 05/06/2019] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Anopheles funestus is one of the 3 most consequential and widespread vectors of human malaria in tropical Africa. However, the lack of a high-quality reference genome has hindered the association of phenotypic traits with their genetic basis in this important mosquito. FINDINGS Here we present a new high-quality A. funestus reference genome (AfunF3) assembled using 240× coverage of long-read single-molecule sequencing for contigging, combined with 100× coverage of short-read Hi-C data for chromosome scaffolding. The assembled contigs total 446 Mbp of sequence and contain substantial duplication due to alternative alleles present in the sequenced pool of mosquitos from the FUMOZ colony. Using alignment and depth-of-coverage information, these contigs were deduplicated to a 211 Mbp primary assembly, which is closer to the expected haploid genome size of 250 Mbp. This primary assembly consists of 1,053 contigs organized into 3 chromosome-scale scaffolds with an N50 contig size of 632 kbp and an N50 scaffold size of 93.811 Mbp, representing a 100-fold improvement in continuity versus the current reference assembly, AfunF1. CONCLUSION This highly contiguous and complete A. funestus reference genome assembly will serve as an improved basis for future studies of genomic variation and organization in this important disease vector.
Collapse
Affiliation(s)
- Jay Ghurye
- Department of Computer Science, University of Maryland, 8125 Paint Branch Drive, College Park, MD 20742, USA
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Scott T Small
- Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, 317 Galvin Life Science Center, Notre Dame, IN 46556, USA
| | - Seth Redmond
- Infectious Disease and Microbiome Program, Broad Institute, 415 Main Street, Cambridge, MA 02142, USA
- Department of Immunology and Infectious Disease, Harvard T.H. Chan School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA
| | - Paul Howell
- Centers for Disease Control and Prevention, 1600 Clifton Road, Atlanta, GA 30329, USA
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
| | - Nora J Besansky
- Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, 317 Galvin Life Science Center, Notre Dame, IN 46556, USA
| |
Collapse
|
50
|
Saint-Leandre B, Nguyen SC, Levine MT. Diversification and collapse of a telomere elongation mechanism. Genome Res 2019; 29:920-931. [PMID: 31138619 PMCID: PMC6581046 DOI: 10.1101/gr.245001.118] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 05/14/2019] [Indexed: 12/18/2022]
Abstract
In most eukaryotes, telomerase counteracts chromosome erosion by adding repetitive sequence to terminal ends. Drosophila melanogaster instead relies on specialized retrotransposons that insert exclusively at telomeres. This exchange of goods between host and mobile element-wherein the mobile element provides an essential genome service and the host provides a hospitable niche for mobile element propagation-has been called a "genomic symbiosis." However, these telomere-specialized, jockey family retrotransposons may actually evolve to "selfishly" overreplicate in the genomes that they ostensibly serve. Under this model, we expect rapid diversification of telomere-specialized retrotransposon lineages and, possibly, the breakdown of this ostensibly symbiotic relationship. Here we report data consistent with both predictions. Searching the raw reads of the 15-Myr-old melanogaster species group, we generated de novo jockey retrotransposon consensus sequences and used phylogenetic tree-building to delineate four distinct telomere-associated lineages. Recurrent gains, losses, and replacements account for this retrotransposon lineage diversity. In Drosophila biarmipes, telomere-specialized elements have disappeared completely. De novo assembly of long reads and cytogenetics confirmed this species-specific collapse of retrotransposon-dependent telomere elongation. Instead, telomere-restricted satellite DNA and DNA transposon fragments occupy its terminal ends. We infer that D. biarmipes relies instead on a recombination-based mechanism conserved from yeast to flies to humans. Telomeric retrotransposon diversification and disappearance suggest that persistently "selfish" machinery shapes telomere elongation across Drosophila rather than completely domesticated, symbiotic mobile elements.
Collapse
Affiliation(s)
- Bastien Saint-Leandre
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.,Epigenetics Institute, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Son C Nguyen
- Epigenetics Institute, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.,Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mia T Levine
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.,Epigenetics Institute, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| |
Collapse
|