1
|
Zhou Y, Mabrouk I, Ma J, Liu Q, Song Y, Xue G, Li X, Wang S, Liu C, Hu J, Sun Y. Chromosome-level genome sequencing and multi-omics of the Hungarian White Goose (Anser anser domesticus) reveals novel miRNA-mRNA regulation mechanism of waterfowl feather follicle development. Poult Sci 2024; 103:103933. [PMID: 38943801 PMCID: PMC11261457 DOI: 10.1016/j.psj.2024.103933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 05/07/2024] [Accepted: 05/29/2024] [Indexed: 07/01/2024] Open
Abstract
The Hungarian White Goose (Anser anser domesticus) is an excellent European goose breed, with high feather and meat production. Despite its importance in the poultry industry, no available genome assembly information has been published. This study aimed to present Chromosome-level and functional genome sequencing of the Hungarian White Goose. The results showed that the genome assembly has a total length of 1115.82 Mb, 39 pairs of chromosomes, 92.98% of the BUSCO index, and contig N50 and scaffold N50 were up to 2.32 Mb and 60.69 Mb, respectively. Annotation of the genome assembly revealed 19550 genes, 286 miRNAs, etc. We identified 235 expanded and 1,167 contracted gene families in this breed compared with the other 16 species. We performed a positive selection analysis between this breed and four species of Anatidae to uncover the genetic information underlying feather follicle development. Further, we detected the function of miR-199-x, miR-143-y, and miR-23-z on goose embryonic skin fibroblast. In summary, we have successfully generated a highly complete genome sequence of the Hungarian white goose, which will provide a great resource to improve our understanding of gene functions and enhance the studies on feather follicle development at the genomic level.
Collapse
Affiliation(s)
- Yuxuan Zhou
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Ichraf Mabrouk
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Jingyun Ma
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Qiuyuan Liu
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Yupu Song
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Guizhen Xue
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Xinyue Li
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Sihui Wang
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Chang Liu
- Changchun Municipal People's Government, Changchun Animal Husbandry Service, Changchun, 130062, China
| | - Jingtao Hu
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China
| | - Yongfeng Sun
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, 130118, China; Key Laboratory of Animal Production, Product Quality and Security, Jilin Agricultural University, Ministry of Education, Changchun, 130118, China..
| |
Collapse
|
2
|
Mokhtar MM, El Allali A. MegaLTR: a web server and standalone pipeline for detecting and annotating LTR-retrotransposons in plant genomes. FRONTIERS IN PLANT SCIENCE 2023; 14:1237426. [PMID: 37810401 PMCID: PMC10552921 DOI: 10.3389/fpls.2023.1237426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 08/21/2023] [Indexed: 10/10/2023]
Abstract
LTR-retrotransposons (LTR-RTs) are a class of RNA-replicating transposon elements (TEs) that can alter genome structure and function by moving positions, repositioning genes, shifting exons, and causing chromosomal rearrangements. LTR-RTs are widespread in many plant genomes and constitute a significant portion of the genome. Their movement and activity in eukaryotic genomes can provide insight into genome evolution and gene function, especially when LTR-RTs are located near or within genes. Building the redundant and non-redundant LTR-RTs libraries and their annotations for species lacking this resource requires extensive bioinformatics pipelines and expensive computing power to analyze large amounts of genomic data. This increases the need for online services that provide computational resources with minimal overhead and maximum efficiency. Here, we present MegaLTR as a web server and standalone pipeline that detects intact LTR-RTs at the whole-genome level and integrates multiple tools for structure-based, homologybased, and de novo identification, classification, annotation, insertion time determination, and LTR-RT gene chimera analysis. MegaLTR also provides statistical analysis and visualization with multiple tools and can be used to accelerate plant species discovery and assist breeding programs in their efforts to improve genomic resources. We hope that the development of online services such as MegaLTR, which can analyze large amounts of genomic data, will become increasingly important for the automated detection and annotation of LTR-RT elements.
Collapse
Affiliation(s)
- Morad M. Mokhtar
- African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
| | - Achraf El Allali
- African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
| |
Collapse
|
3
|
Zhong S, Li B, Chen W, Wang L, Guan J, Wang Q, Yang Z, Yang H, Wang X, Yu X, Fu P, Liu H, Chen C, Tan F, Ren T, Shen J, Luo P. The chromosome-level genome of Akebia trifoliata as an important resource to study plant evolution and environmental adaptation in the Cretaceous. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 112:1316-1330. [PMID: 36305286 DOI: 10.1111/tpj.16011] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Accepted: 10/21/2022] [Indexed: 06/16/2023]
Abstract
The environmental adaptation of eudicots is the most reasonable explanation for why they compose the largest clade of modern plants (>70% of angiosperms), which indicates that the basal eudicots would be valuable and helpful to study their survival and ability to thrive throughout evolutionary processes. Here, we detected two whole-genome duplication (WGD) events in the high-quality assembled Akebia trifoliata genome (652.73 Mb) with 24 138 protein-coding genes based on the evidence of intragenomic and intergenomic collinearity, synonymous substitution rate (KS ) values and polyploidization and diploidization traces; these events putatively occurred at 85.15 and 146.43 million years ago (Mya). The integrated analysis of 16 species consisting of eight basal and eight core eudicots further revealed that there was a putative ancient WGD at the early stage of eudicots (temporarily designated θ) at 142.72 Mya, similar to the older WGD of Akebia trifoliata, and a putative core eudicot-specific WGD (temporarily designated ω). Functional enrichment analysis of retained duplicate genes following the θ event is suggestive of adaptation to the extreme environment change in both the carbon dioxide concentration and desiccation around the Jurassic-Cretaceous boundary, while the retained duplicate genes following the ω event is suggestive of adaptation to the extreme droughts, possibly leading to the rapid spread of eudicots in the mid-Cretaceous. Collectively, the A. trifoliata genome experienced two WGD events, and the older event may have occurred at the early stage of eudicots, which likely increased plant environmental adaptability and helped them survive in ancient extreme environments.
Collapse
Affiliation(s)
- Shengfu Zhong
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, 2 Yuanmingyuan West Road in Haidian District, 100193, Beijing, China
- Institute of Ecological Agriculture, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Bin Li
- State Key Laboratory of Tree Breeding and Forest Genetics, Research Institute of Forestry, Chinese Academy of Forestry, 1 Dongxiaofu Xiangshan Road in Haidian District, 100091, Beijing, China
| | - Wei Chen
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Lili Wang
- Biomarker Technologies Co., Ltd, 12 Fuqian Street in Shunyi District, 101300, Beijing, China
| | - Ju Guan
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Qiang Wang
- Institute of Ecological Agriculture, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Zujun Yang
- Center for Information in Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, 2006 Xiyuan Avenue in West Hi-Tech Zone, 611731, Chengdu, Sichuan Province, China
| | - Hao Yang
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Xianshu Wang
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Xiaojiao Yu
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Peng Fu
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Hongchang Liu
- Guizhou Key Laboratory for Propagation and Cultivation of Medicinal Plants, Guizhou University, 2708 Huaxi South Avenue in Huaxi District, 550025, Guiyang, Guizhou province, China
| | - Chen Chen
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Feiquan Tan
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
- Institute of Ecological Agriculture, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Tianheng Ren
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Jinliang Shen
- College of Forestry, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
| | - Peigao Luo
- Key Laboratory of Plant Genetics and Breeding at Sichuan Agricutural University of Sichuan Province, Sichuan Agricultural University, 211 Huimin Road in Wenjiang District, 611130, Chengdu, Sichuan Province, China
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, 2 Yuanmingyuan West Road in Haidian District, 100193, Beijing, China
| |
Collapse
|
4
|
Wang L, Li Z, Liu Y, Chen S, Li L, Duan P, Wang X, Li W, Wang Q, Zhai J, Tian Y. A chromosome-level genome assembly of the potato grouper (Epinephelus tukula). Genomics 2022; 114:110473. [PMID: 36049667 DOI: 10.1016/j.ygeno.2022.110473] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 08/04/2022] [Accepted: 08/29/2022] [Indexed: 12/29/2022]
Abstract
The potato grouper, Epinephelus tukula, is one of the largest coral reef teleost, and it is an important germplasm resource for selection and cross breeding. Here we report a potato grouper genome assembly generated using PacBio long-read sequencing, Illumina sequencing and high-throughput chromatin conformation capture (Hi-C) technology. The genome size was 1.13 Gb, with a total of 508 contigs anchored into 24 chromosomes. The scaffold N50 was 42.65 Mb. For the genome models, our assembled genome contained 98.11% complete BUSCO with the vertebrata_odb9 database. One more copies of Gh and Hsp90b1 were identified in the E. tukula genome, which might contribute to its fast growth and high resistance to stress. In addition, 435 putative antimicrobial peptide (AMP) genes were identified in the potato grouper. This study provides a good reference for whole genome selective breeding of the potato grouper and for future development of novel marine drugs.
Collapse
Affiliation(s)
- Linna Wang
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266000, China
| | - Zhentong Li
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266000, China
| | - Yang Liu
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266000, China
| | - Shuai Chen
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China
| | - Linlin Li
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; Graduate School of Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Pengfei Duan
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China
| | - Xinyi Wang
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China
| | - Wensheng Li
- Mingbo Aquatic Co. Ltd., Laizhou 261400, China
| | | | | | - Yongsheng Tian
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266000, China.
| |
Collapse
|
5
|
Tian F, Liu S, Zhou B, Tang Y, Zhang Y, Zhang C, Zhao K. Chromosome-level genome of Tibetan naked carp ( Gymnocypris przewalskii) provides insights into Tibetan highland adaptation. DNA Res 2022; 29:6647840. [PMID: 35861387 PMCID: PMC9326183 DOI: 10.1093/dnares/dsac025] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
Gymnocypris przewalskii, a cyprinid fish endemic to the Qinghai-Tibetan Plateau, has evolved unique morphological, physiological and genetic characteristics to adapt to the highland environment. Herein, we assembled a high-quality G. przewalskii tetraploid genome with a size of 2.03 Gb and scaffold N50 of 44.93 Mb, which was anchored onto 46 chromosomes. The comparative analysis suggested that gene families related to highland adaptation were significantly expanded in G. przewalskii. According to the G. przewalskii genome, we evaluated the phylogenetic relationship of 13 schizothoracine fishes, and inferred that the demographic history of G. przewalskii was strongly associated with geographic and eco-environmental alterations. We noticed that G. przewalskii experienced whole-genome duplication, and genes preserved post duplication were functionally associated with adaptation to high salinity and alkalinity. In conclusion, a chromosome-scale G. przewalskii genome provides an important genomic resource for teleost fish, and will particularly promote our understanding of the molecular evolution and speciation of fish in the highland environment.
Collapse
Affiliation(s)
- Fei Tian
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
- University of Chinese Academy of Sciences , Beijing, China
| | - Sijia Liu
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
| | - Bingzheng Zhou
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
- University of Chinese Academy of Sciences , Beijing, China
| | - Yongtao Tang
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
- Henan Normal University , Xinxiang, China
| | - Yu Zhang
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
- University of Chinese Academy of Sciences , Beijing, China
| | - Cunfang Zhang
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
- State Key Laboratory of Plateau Ecology and Agriculture, Qinghai University , Xining, Qinghai, China
| | - Kai Zhao
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences , Xining, Qinghai, China
| |
Collapse
|
6
|
Riehl K, Riccio C, Miska EA, Hemberg M. TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022] Open
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085).
Collapse
Affiliation(s)
- Kevin Riehl
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
| | - Cristian Riccio
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Eric A Miska
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Martin Hemberg
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, 75 Francis Street, Boston, MA 02215, USA
| |
Collapse
|
7
|
Storer JM, Hubley R, Rosen J, Smit AFA. Methodologies for the De novo Discovery of Transposable Element Families. Genes (Basel) 2022; 13:709. [PMID: 35456515 PMCID: PMC9025800 DOI: 10.3390/genes13040709] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The discovery and characterization of transposable element (TE) families are crucial tasks in the process of genome annotation. Careful curation of TE libraries for each organism is necessary as each has been exposed to a unique and often complex set of TE families. De novo methods have been developed; however, a fully automated and accurate approach to the development of complete libraries remains elusive. In this review, we cover established methods and recent developments in de novo TE analysis. We also present various methodologies used to assess these tools and discuss opportunities for further advancement of the field.
Collapse
Affiliation(s)
| | | | | | - Arian F. A. Smit
- Institute for Systems Biology, Seattle, WA 98109, USA; (J.M.S.); (R.H.); (J.R.)
| |
Collapse
|
8
|
Xiao L, Lu L, Zeng W, Shang X, Cao S, Yan H. DNA Methylome and LncRNAome Analysis Provide Insights Into Mechanisms of Genome-Dosage Effects in Autotetraploid Cassava. FRONTIERS IN PLANT SCIENCE 2022; 13:915056. [PMID: 35860527 PMCID: PMC9289687 DOI: 10.3389/fpls.2022.915056] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 06/02/2022] [Indexed: 05/16/2023]
Abstract
Whole genome duplication (WGD) increases the dosage of all coding and non-coding genes, yet the molecular implications of genome-dosage effects remain elusive. In this study, we generated integrated maps of the methylomes and lncRNAomes for diploid and artificially generated autotetraploid cassava (Manihot esculenta Crantz). We found that transposable elements (TEs) suppressed adjacent protein coding gene (PCG)-expression levels, while TEs activated the expression of nearby long non-coding RNAs (lncRNAs) in the cassava genome. The hypermethylation of DNA transposons in mCG and mCHH sites may be an effective way to suppress the expression of nearby PCGs in autotetraploid cassava, resulting in similar expression levels for most of PCGs between autotetraploid and diploid cassava. In the autotetraploid, decreased methylation levels of retrotransposons at mCHG and mCHH sites contributed to reduced methylation of Gypsy-neighboring long intergenic non-coding RNAs, potentially preserving diploid-like expression patterns in the major of lncRNAs. Collectively, our study highlighted that WGD-induced DNA methylation variation in DNA transposons and retrotransposons may be as direct adaptive responses to dosage of all coding-genes and lncRNAs, respectively.
Collapse
Affiliation(s)
- Liang Xiao
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Liuying Lu
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Wendan Zeng
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Xiaohong Shang
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Sheng Cao
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Huabing Yan
- Cash Crops Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, China
- *Correspondence: Huabing Yan,
| |
Collapse
|
9
|
DARTS: An Algorithm for Domain-Associated Retrotransposon Search in Genome Assemblies. Genes (Basel) 2021; 13:genes13010009. [PMID: 35052350 PMCID: PMC8775202 DOI: 10.3390/genes13010009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/16/2021] [Accepted: 12/17/2021] [Indexed: 01/08/2023] Open
Abstract
Retrotransposons comprise a substantial fraction of eukaryotic genomes, reaching the highest proportions in plants. Therefore, identification and annotation of retrotransposons is an important task in studying the regulation and evolution of plant genomes. The majority of computational tools for mining transposable elements (TEs) are designed for subsequent genome repeat masking, often leaving aside the element lineage classification and its protein domain composition. Additionally, studies focused on the diversity and evolution of a particular group of retrotransposons often require substantial customization efforts from researchers to adapt existing software to their needs. Here, we developed a computational pipeline to mine sequences of protein-coding retrotransposons based on the sequences of their conserved protein domains—DARTS (Domain-Associated Retrotransposon Search). Using the most abundant group of TEs in plants—long terminal repeat (LTR) retrotransposons (LTR-RTs)—we show that DARTS has radically higher sensitivity for LTR-RT identification compared to the widely accepted tool LTRharvest. DARTS can be easily customized for specific user needs. As a result, DARTS returns a set of structurally annotated nucleotide and amino acid sequences which can be readily used in subsequent comparative and phylogenetic analyses. DARTS may facilitate researchers interested in the discovery and detailed analysis of the diversity and evolution of retrotransposons, LTR-RTs, and other protein-coding TEs.
Collapse
|
10
|
Impact of Repetitive DNA Elements on Snake Genome Biology and Evolution. Cells 2021; 10:cells10071707. [PMID: 34359877 PMCID: PMC8303610 DOI: 10.3390/cells10071707] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 06/29/2021] [Accepted: 07/01/2021] [Indexed: 12/11/2022] Open
Abstract
The distinctive biology and unique evolutionary features of snakes make them fascinating model systems to elucidate how genomes evolve and how variation at the genomic level is interlinked with phenotypic-level evolution. Similar to other eukaryotic genomes, large proportions of snake genomes contain repetitive DNA, including transposable elements (TEs) and satellite repeats. The importance of repetitive DNA and its structural and functional role in the snake genome, remain unclear. This review highlights the major types of repeats and their proportions in snake genomes, reflecting the high diversity and composition of snake repeats. We present snakes as an emerging and important model system for the study of repetitive DNA under the impact of sex and microchromosome evolution. We assemble evidence to show that certain repetitive elements in snakes are transcriptionally active and demonstrate highly dynamic lineage-specific patterns as repeat sequences. We hypothesize that particular TEs can trigger different genomic mechanisms that might contribute to driving adaptive evolution in snakes. Finally, we review emerging approaches that may be used to study the expression of repetitive elements in complex genomes, such as snakes. The specific aspects presented here will stimulate further discussion on the role of genomic repeats in shaping snake evolution.
Collapse
|
11
|
Li W, Zhang Q, Zhu T, Tong Y, Li K, Shi C, Zhang Y, Liu Y, Jiang J, Liu Y, Xia E, Huang H, Zhang L, Zhang D, Shi C, Jiang W, Zhao Y, Mao S, Jiao J, Xu P, Yang L, Gao L. Draft genomes of two outcrossing wild rice, Oryza rufipogon and O. longistaminata, reveal genomic features associated with mating-system evolution. PLANT DIRECT 2020; 4:e00232. [PMID: 32537559 PMCID: PMC7287411 DOI: 10.1002/pld3.232] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 05/07/2020] [Accepted: 05/15/2020] [Indexed: 05/04/2023]
Abstract
Oryza rufipogon and O. longistaminata are important wild relatives of cultivated rice, harboring a promising source of novel genes for rice breeding programs. Here, we present de novo assembled draft genomes and annotation of O. rufipogon and O. longistaminata. Our analysis reveals a considerable number of lineage-specific gene families associated with the self-incompatibility (SI) and formation of reproductive separation. We show how lineage-specific expansion or contraction of gene families with functional enrichment of the recognition of pollen, thus enlightening their reproductive diversification. We documented a large number of lineage-specific gene families enriched in salt stress, antifungal response, and disease resistance. Our comparative analysis further shows a genome-wide expansion of genes encoding NBS-LRR proteins in these two outcrossing wild species in contrast to six other selfing rice species. Conserved noncoding sequences (CNSs) in the two wild rice genomes rapidly evolve relative to selfing rice species, resulting in the reduction of genomic variation owing to shifts of mating systems. We find that numerous genes related to these rapidly evolving CNSs are enriched in reproductive structure development, flower development, and postembryonic development, which may associate with SI in O. rufipogon and O. longistaminata.
Collapse
Affiliation(s)
- Wei Li
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Qun‐Jie Zhang
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Ting Zhu
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- College of Life ScienceLiaoning Normal UniversityDalianChina
| | - Yan Tong
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Kui Li
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Cong Shi
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
- University of the Chinese Academy of SciencesBeijingChina
| | - Yun Zhang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Yun‐Long Liu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Jian‐Jun Jiang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Yuan Liu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - En‐Hua Xia
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Hui Huang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Ping Zhang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Dan Zhang
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
| | - Chao Shi
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Wen‐Kai Jiang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - You‐Jie Zhao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Shu‐Yan Mao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Jun‐ying Jiao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Ping‐Zhen Xu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Li Yang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Zhi Gao
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| |
Collapse
|
12
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019. [PMID: 31843001 DOI: 10.1101/657890v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, 92697, USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Jireh R A Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Adam J Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | | - Tyler A Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY, 14853, USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA.
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN, 55108, USA.
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
13
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019; 20:275. [PMID: 31843001 PMCID: PMC6913007 DOI: 10.1186/s13059-019-1905-y] [Citation(s) in RCA: 491] [Impact Index Per Article: 98.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 11/28/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697 USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
| | - Jireh R. A. Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Adam J. Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | | | - Tyler A. Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY 14853 USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824 USA
| | - Candice N. Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108 USA
| | - Matthew B. Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| |
Collapse
|
14
|
Peng Z, Oliveira-Garcia E, Lin G, Hu Y, Dalby M, Migeon P, Tang H, Farman M, Cook D, White FF, Valent B, Liu S. Effector gene reshuffling involves dispensable mini-chromosomes in the wheat blast fungus. PLoS Genet 2019; 15:e1008272. [PMID: 31513573 PMCID: PMC6741851 DOI: 10.1371/journal.pgen.1008272] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 06/24/2019] [Indexed: 11/28/2022] Open
Abstract
Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.
Collapse
Affiliation(s)
- Zhao Peng
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States of America
| | - Ely Oliveira-Garcia
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Guifang Lin
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Ying Hu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Melinda Dalby
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Pierre Migeon
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Haibao Tang
- Center for Genomics and Biotechnology and Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fujian, China
| | - Mark Farman
- Department of Plant Pathology, University of Kentucky, Lexington, KY, United States of America
| | - David Cook
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Frank F. White
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States of America
| | - Barbara Valent
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Sanzhen Liu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| |
Collapse
|
15
|
Peng Z, Oliveira-Garcia E, Lin G, Hu Y, Dalby M, Migeon P, Tang H, Farman M, Cook D, White FF, Valent B, Liu S. Effector gene reshuffling involves dispensable mini-chromosomes in the wheat blast fungus. PLoS Genet 2019; 15:e1008272. [PMID: 31513573 DOI: 10.1101/359455] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 06/24/2019] [Indexed: 05/26/2023] Open
Abstract
Newly emerged wheat blast disease is a serious threat to global wheat production. Wheat blast is caused by a distinct, exceptionally diverse lineage of the fungus causing rice blast disease. Through sequencing a recent field isolate, we report a reference genome that includes seven core chromosomes and mini-chromosome sequences that harbor effector genes normally found on ends of core chromosomes in other strains. No mini-chromosomes were observed in an early field strain, and at least two from another isolate each contain different effector genes and core chromosome end sequences. The mini-chromosome is enriched in transposons occurring most frequently at core chromosome ends. Additionally, transposons in mini-chromosomes lack the characteristic signature for inactivation by repeat-induced point (RIP) mutation genome defenses. Our results, collectively, indicate that dispensable mini-chromosomes and core chromosomes undergo divergent evolutionary trajectories, and mini-chromosomes and core chromosome ends are coupled as a mobile, fast-evolving effector compartment in the wheat pathogen genome.
Collapse
Affiliation(s)
- Zhao Peng
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States of America
| | - Ely Oliveira-Garcia
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Guifang Lin
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Ying Hu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Melinda Dalby
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Pierre Migeon
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Haibao Tang
- Center for Genomics and Biotechnology and Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fujian, China
| | - Mark Farman
- Department of Plant Pathology, University of Kentucky, Lexington, KY, United States of America
| | - David Cook
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Frank F White
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States of America
| | - Barbara Valent
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| | - Sanzhen Liu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States of America
| |
Collapse
|
16
|
Guliaev AS, Semyenova SK. MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies. Mob DNA 2019; 10:21. [PMID: 31114637 PMCID: PMC6515669 DOI: 10.1186/s13100-019-0163-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 04/26/2019] [Indexed: 12/17/2022] Open
Abstract
Background Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) – transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using de novo or homology-based approaches. But there has been no pipeline enabling users to get coding and flanking sequences of MGEs suitable for a downstream analysis from genome assemblies. Results We developed a new pipeline, MGERT (Mobile Genetic Elements Retrieving Tool), that automates all the steps necessary to obtain protein-coding sequences of mobile genetic elements from genomic assemblies even if no previous knowledge on MGE content of a particular genome is available. Conclusions Using MGERT, researchers can easily find MGEs, their coding and flanking sequences in the genome of interest. Thus, this pipeline helps researchers to focus on the biological analysis of MGEs rather than excessive scripting and pipelining. Electronic supplementary material The online version of this article (10.1186/s13100-019-0163-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andrei S Guliaev
- Laboratory of Genome Organization, Institute of Gene Biology of the Russian Academy of Sciences, Vavilov Str., 34/5, Moscow, 119334 Russia
| | - Seraphima K Semyenova
- Laboratory of Genome Organization, Institute of Gene Biology of the Russian Academy of Sciences, Vavilov Str., 34/5, Moscow, 119334 Russia
| |
Collapse
|
17
|
Sävilammi T, Primmer CR, Varadharajan S, Guyomard R, Guiguen Y, Sandve SR, Vøllestad LA, Papakostas S, Lien S. The Chromosome-Level Genome Assembly of European Grayling Reveals Aspects of a Unique Genome Evolution Process Within Salmonids. G3 (BETHESDA, MD.) 2019; 9:1283-1294. [PMID: 30833292 PMCID: PMC6505133 DOI: 10.1534/g3.118.200919] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 03/04/2019] [Indexed: 12/13/2022]
Abstract
Salmonids represent an intriguing taxonomical group for investigating genome evolution in vertebrates due to their relatively recent last common whole genome duplication event, which occurred between 80 and 100 million years ago. Here, we report on the chromosome-level genome assembly of European grayling (Thymallus thymallus), which represents one of the earliest diverged salmonid subfamilies. To achieve this, we first generated relatively long genomic scaffolds by using a previously published draft genome assembly along with long-read sequencing data and a linkage map. We then merged those scaffolds by applying synteny evidence from the Atlantic salmon (Salmo salar) genome. Comparisons of the European grayling genome assembly to the genomes of Atlantic salmon and Northern pike (Esox lucius), the latter used as a nonduplicated outgroup, detailed aspects of the characteristic chromosome evolution process that has taken place in European grayling. While Atlantic salmon and other salmonid genomes are portrayed by the typical occurrence of numerous chromosomal fusions, European grayling chromosomes were confirmed to be fusion-free and were characterized by a relatively large proportion of paracentric and pericentric inversions. We further reported on transposable elements specific to either the European grayling or Atlantic salmon genome, on the male-specific sdY gene in the European grayling chromosome 11A, and on regions under residual tetrasomy in the homeologous European grayling chromosome pairs 9A-9B and 25A-25B. The same chromosome pairs have been observed under residual tetrasomy in Atlantic salmon and in other salmonids, suggesting that this feature has been conserved since the subfamily split.
Collapse
Affiliation(s)
- Tiina Sävilammi
- Department of Biology, University of Turku, 20014 Turku, Finland
| | - Craig R Primmer
- Organismal & Evolutionary Biology Research Program, Faculty of Biological & Environmental Sciences
- Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland
| | | | - René Guyomard
- INRA, UMR1313 GABI Génétique Animale et Biologie Intégrative, Domaine de Vilvert, 78352, Jouy-en-Josas Cedex, France
| | - Yann Guiguen
- INRA, UR1037 Fish Physiology and Genomics, F-35000, Rennes, France
| | - Simen R Sandve
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life Sciences, 1430 Ås, Norway
| | | | | | - Sigbjørn Lien
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life Sciences, 1430 Ås, Norway
| |
Collapse
|
18
|
|
19
|
Kanzaki N, Tsai IJ, Tanaka R, Hunt VL, Liu D, Tsuyama K, Maeda Y, Namai S, Kumagai R, Tracey A, Holroyd N, Doyle SR, Woodruff GC, Murase K, Kitazume H, Chai C, Akagi A, Panda O, Ke HM, Schroeder FC, Wang J, Berriman M, Sternberg PW, Sugimoto A, Kikuchi T. Biology and genome of a newly discovered sibling species of Caenorhabditis elegans. Nat Commun 2018; 9:3216. [PMID: 30097582 PMCID: PMC6086898 DOI: 10.1038/s41467-018-05712-5] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 07/19/2018] [Indexed: 01/19/2023] Open
Abstract
A 'sibling' species of the model organism Caenorhabditis elegans has long been sought for use in comparative analyses that would enable deep evolutionary interpretations of biological phenomena. Here, we describe the first sibling species of C. elegans, C. inopinata n. sp., isolated from fig syconia in Okinawa, Japan. We investigate the morphology, developmental processes and behaviour of C. inopinata, which differ significantly from those of C. elegans. The 123-Mb C. inopinata genome was sequenced and assembled into six nuclear chromosomes, allowing delineation of Caenorhabditis genome evolution and revealing unique characteristics, such as highly expanded transposable elements that might have contributed to the genome evolution of C. inopinata. In addition, C. inopinata exhibits massive gene losses in chemoreceptor gene families, which could be correlated with its limited habitat area. We have developed genetic and molecular techniques for C. inopinata; thus C. inopinata provides an exciting new platform for comparative evolutionary studies.
Collapse
Affiliation(s)
- Natsumi Kanzaki
- Forestry and Forest Products Research Institute, Tsukuba, 305-8687, Japan
- Kansai Research Center, Forestry and Forest Products Research Institute, 68 Nagaikyutaroh, Momoyama, Fushimi, Kyoto, 612-0855, Japan
| | - Isheng J Tsai
- Biodiversity Research Center, Academia Sinica, Taipei city, 11529, Taiwan
| | - Ryusei Tanaka
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Vicky L Hunt
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Dang Liu
- Biodiversity Research Center, Academia Sinica, Taipei city, 11529, Taiwan
| | - Kenji Tsuyama
- Laboratory of Developmental Dynamics, Graduate School of Life Sciences, Tohoku University, Sendai, 980-8577, Japan
| | - Yasunobu Maeda
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Satoshi Namai
- Laboratory of Developmental Dynamics, Graduate School of Life Sciences, Tohoku University, Sendai, 980-8577, Japan
| | - Ryohei Kumagai
- Laboratory of Developmental Dynamics, Graduate School of Life Sciences, Tohoku University, Sendai, 980-8577, Japan
| | - Alan Tracey
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Nancy Holroyd
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Stephen R Doyle
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Gavin C Woodruff
- Forestry and Forest Products Research Institute, Tsukuba, 305-8687, Japan
- Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, OR, 97403-5289, USA
| | - Kazunori Murase
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Hiromi Kitazume
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Cynthia Chai
- HHMI and Division of Biology and Biological Engineering, Caltech, Pasadena, CA, 91125, USA
| | - Allison Akagi
- HHMI and Division of Biology and Biological Engineering, Caltech, Pasadena, CA, 91125, USA
| | - Oishika Panda
- Boyce Thompson Institute, and Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Huei-Mien Ke
- Biodiversity Research Center, Academia Sinica, Taipei city, 11529, Taiwan
| | - Frank C Schroeder
- Boyce Thompson Institute, and Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, 14853, USA
| | - John Wang
- Biodiversity Research Center, Academia Sinica, Taipei city, 11529, Taiwan
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK
| | - Paul W Sternberg
- HHMI and Division of Biology and Biological Engineering, Caltech, Pasadena, CA, 91125, USA
| | - Asako Sugimoto
- Laboratory of Developmental Dynamics, Graduate School of Life Sciences, Tohoku University, Sendai, 980-8577, Japan.
| | - Taisei Kikuchi
- Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan.
| |
Collapse
|
20
|
Xie Z, Tang H. ISEScan: automated identification of insertion sequence elements in prokaryotic genomes. Bioinformatics 2018; 33:3340-3347. [PMID: 29077810 DOI: 10.1093/bioinformatics/btx433] [Citation(s) in RCA: 161] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Accepted: 07/04/2017] [Indexed: 11/13/2022] Open
Abstract
Motivation The insertion sequence (IS) elements are the smallest but most abundant autonomous transposable elements in prokaryotic genomes, which play a key role in prokaryotic genome organization and evolution. With the fast growing genomic data, it is becoming increasingly critical for biology researchers to be able to accurately and automatically annotate ISs in prokaryotic genome sequences. The available automatic IS annotation systems are either providing only incomplete IS annotation or relying on the availability of existing genome annotations. Here, we present a new IS elements annotation pipeline to address these issues. Results ISEScan is a highly sensitive software pipeline based on profile hidden Markov models constructed from manually curated IS elements. ISEScan performs better than existing IS annotation systems when tested on prokaryotic genomes with curated annotations of IS elements. Applying it to 2784 prokaryotic genomes, we report the global distribution of IS families across taxonomic clades in Archaea and Bacteria. Availability and implementation ISEScan is implemented in Python and released as an open source software at https://github.com/xiezhq/ISEScan. Contact hatang@indiana.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhiqun Xie
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA
| | - Haixu Tang
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
21
|
Serrato-Capuchina A, Matute DR. The Role of Transposable Elements in Speciation. Genes (Basel) 2018; 9:E254. [PMID: 29762547 PMCID: PMC5977194 DOI: 10.3390/genes9050254] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Revised: 04/26/2018] [Accepted: 04/26/2018] [Indexed: 01/20/2023] Open
Abstract
Understanding the phenotypic and molecular mechanisms that contribute to genetic diversity between and within species is fundamental in studying the evolution of species. In particular, identifying the interspecific differences that lead to the reduction or even cessation of gene flow between nascent species is one of the main goals of speciation genetic research. Transposable elements (TEs) are DNA sequences with the ability to move within genomes. TEs are ubiquitous throughout eukaryotic genomes and have been shown to alter regulatory networks, gene expression, and to rearrange genomes as a result of their transposition. However, no systematic effort has evaluated the role of TEs in speciation. We compiled the evidence for TEs as potential causes of reproductive isolation across a diversity of taxa. We find that TEs are often associated with hybrid defects that might preclude the fusion between species, but that the involvement of TEs in other barriers to gene flow different from postzygotic isolation is still relatively unknown. Finally, we list a series of guides and research avenues to disentangle the effects of TEs on the origin of new species.
Collapse
Affiliation(s)
- Antonio Serrato-Capuchina
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| | - Daniel R Matute
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| |
Collapse
|
22
|
Ferrareze PAG, Streit RSA, Dos Santos FM, Schrank A, Kmetzsch L, Vainstein MH, Staats CC. sRNAs as possible regulators of retrotransposon activity in Cryptococcus gattii VGII. BMC Genomics 2017; 18:294. [PMID: 28403818 PMCID: PMC5389150 DOI: 10.1186/s12864-017-3688-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 04/06/2017] [Indexed: 01/31/2023] Open
Abstract
Background The absence of Argonaute genes in the fungal pathogen Cryptococcus gattii R265 and other VGII strains indicates that yeasts of this genotype cannot have a functional RNAi pathway, an evolutionarily conserved gene silencing mechanism performed by small RNAs. The success of the R265 strain as a pathogen that caused the Pacific Northwest and Vancouver Island outbreaks may imply that RNAi machinery loss could be beneficial under certain circumstances during evolution. As a result, a hypermutant phenotype would be created with high rates of genome retrotransposition, for instance. This study therefore aimed to evaluate in silicio the effect of retrotransposons and their control mechanisms by small RNAs on genomic stability and synteny loss of C. gattii R265 through retrotransposons sequence comparison and orthology analysis with other 16 C. gattii genomic sequences available. Results Retrotransposon mining identified a higher sequence count to VGI genotype compared to VGII, VGIII, and VGIV. However, despite the lower retrotransposon number, VGII exhibited increased synteny loss and genome rearrangement events. RNA-Seq analysis indicated highly expressed retrotransposons as well as sRNA production. Conclusions Genome rearrangement and synteny loss may suggest a greater retrotransposon mobilization caused by RNAi pathway absence, but the effective presence of sRNAs that matches retrotransposon sequences means that an alternative retrotransposon silencing mechanism could be active in genomic integrity maintenance of C. gattii VGII strains. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3688-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Patrícia Aline Gröhs Ferrareze
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil
| | - Rodrigo Silva Araujo Streit
- Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Francine Melise Dos Santos
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil
| | - Augusto Schrank
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil.,Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Livia Kmetzsch
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil.,Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Marilene Henning Vainstein
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil.,Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Charley Christian Staats
- Programa de Pós-Graduação em Biologia Celular e Molecular, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul (UFRGS), 91501-970, Porto Alegre, RS, Brazil. .,Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil.
| |
Collapse
|