1
|
Zhao Y, Huang S, Zhang Y, Tan C, Feng H. Role of Brassica orphan gene BrLFM on leafy head formation in Chinese cabbage (Brassica rapa). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:170. [PMID: 37420138 DOI: 10.1007/s00122-023-04411-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 06/22/2023] [Indexed: 07/09/2023]
Abstract
Brassica orphan gene BrFLM, identified by two allelic mutants, was involved in leafy head formation in Chinese cabbage. Leafy head formation is a unique agronomic trait of Chinese cabbage that determines its yield and quality. In our previous study, an EMS mutagenesis Chinese cabbage mutant library was constructed using the heading Chinese cabbage double haploid (DH) line FT as the wild-type. Here, we screened two extremely similar leafy head deficiency mutants lfm-1 and lfm-2 with geotropic growth leaves from the library to investigate the gene(s) related to leafy head formation. Reciprocal crossing results showed that these two mutants were allelic. We utilized lfm-1 to identify the mutant gene(s). Genetic analysis showed that the mutated trait was controlled by a single nuclear gene Brlfm. Mutmap analysis showed that Brlfm was located on chromosome A05, and BraA05g012440.3C or BraA05g021450.3C were the candidate gene. Kompetitive allele-specific PCR analysis eliminated BraA05g012440.3C from the candidates. Sanger sequencing identified an SNP from G to A at the 271st nucleotide on BraA05g021450.3C. The sequencing of lfm-2 detected another non-synonymous SNP (G to A) located at the 266st nucleotide on BraA05g021450.3C, which verified its function on leafy head formation. We blasted BraA05g021450.3C on database and found that it belongs to a Brassica orphan gene encoding an unknown 13.74 kDa protein, named BrLFM. Subcellular localization showed that BrLFM was located in the nucleus. These findings reveal that BrLFM is involved in leafy head formation in Chinese cabbage.
Collapse
Affiliation(s)
- Yonghui Zhao
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Shengnan Huang
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Yun Zhang
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Chong Tan
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China
| | - Hui Feng
- College of Horticulture, Shenyang Agricultural University, 120 Dongling Road, Shenhe District, Shenyang, 110866, People's Republic of China.
| |
Collapse
|
2
|
Jiang M, Li X, Dong X, Zu Y, Zhan Z, Piao Z, Lang H. Research Advances and Prospects of Orphan Genes in Plants. FRONTIERS IN PLANT SCIENCE 2022; 13:947129. [PMID: 35874010 PMCID: PMC9305701 DOI: 10.3389/fpls.2022.947129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 06/23/2022] [Indexed: 06/15/2023]
Abstract
Orphan genes (OGs) are defined as genes having no sequence similarity with genes present in other lineages. OGs have been regarded to play a key role in the development of lineage-specific adaptations and can also serve as a constant source of evolutionary novelty. These genes have often been found related to various stress responses, species-specific traits, special expression regulation, and also participate in primary substance metabolism. The advancement in sequencing tools and genome analysis methods has made the identification and characterization of OGs comparatively easier. In the study of OG functions in plants, significant progress has been made. We review recent advances in the fast evolving characteristics, expression modulation, and functional analysis of OGs with a focus on their role in plant biology. We also emphasize current challenges, adoptable strategies and discuss possible future directions of functional study of OGs.
Collapse
Affiliation(s)
- Mingliang Jiang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| | - Xiaonan Li
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Xiangshu Dong
- School of Agriculture, Yunnan University, Kunming, China
| | - Ye Zu
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Zongxiang Zhan
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Zhongyun Piao
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Hong Lang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| |
Collapse
|
3
|
Ma D, Ding Q, Guo Z, Xu C, Liang P, Zhao Z, Song S, Zheng HL. The genome of a mangrove plant, Avicennia marina, provides insights into adaptation to coastal intertidal habitats. PLANTA 2022; 256:6. [PMID: 35678934 DOI: 10.1007/s00425-022-03916-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 05/17/2022] [Indexed: 05/26/2023]
Abstract
Whole-genome duplication, gene family and lineage-specific genes analysis based on high-quality genome reveal the adaptation mechanisms of Avicennia marina to coastal intertidal habitats. Mangrove plants grow in a complex habitat of coastal intertidal zones with high salinity, hypoxia, etc. Therefore, it is an interesting question how mangroves adapt to the unique intertidal environment. Here, we present a chromosome-level genome of the Avicennia marina, a typical true mangrove with a size of 480.43 Mb, contig N50 of 11.33 Mb and 30,956 annotated protein-coding genes. We identified 621 Avicennia-specific genes that are mainly related to flavonoid and lignin biosynthesis, auxin homeostasis and response to abiotic stimulus. We found that A. marina underwent a novel specific whole-genome duplication, which is in line with a brief era of global warming that occurred during the paleocene-eocene maximum. Comparative genomic and transcriptomic analyses outline the distinct evolution and sophisticated regulations of A. marina adaptation to the intertidal environments, including expansion of photosynthesis and oxidative phosphorylation gene families, unique genes and pathways for antibacterial, detoxifying antioxidant and reactive oxygen species scavenging. In addition, we also analyzed salt gland secretion-related genes, and those involved in the red bark-related flavonoid biosynthesis, while significant expansions of key genes such as NHX, 4CL, CHS and CHI. High-quality genomes in future investigations will facilitate the understand of evolution of mangrove and improve breeding.
Collapse
Affiliation(s)
- Dongna Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Qiansu Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Zejun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Pingping Liang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Zhizhu Zhao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Shiwei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361102, Fujian, China.
| |
Collapse
|
4
|
Ma D, Lai Z, Ding Q, Zhang K, Chang K, Li S, Zhao Z, Zhong F. Identification, Characterization and Function of Orphan Genes Among the Current Cucurbitaceae Genomes. FRONTIERS IN PLANT SCIENCE 2022; 13:872137. [PMID: 35599909 PMCID: PMC9114813 DOI: 10.3389/fpls.2022.872137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 03/28/2022] [Indexed: 06/15/2023]
Abstract
Orphan genes (OGs) that are missing identifiable homologs in other lineages may potentially make contributions to a variety of biological functions. The Cucurbitaceae family consists of a wide range of fruit crops of worldwide or local economic significance. To date, very few functional mechanisms of OGs in Cucurbitaceae are known. In this study, we systematically identified the OGs of eight Cucurbitaceae species using a comparative genomics approach. The content of OGs varied widely among the eight Cucurbitaceae species, ranging from 1.63% in chayote to 16.55% in wax gourd. Genetic structure analysis showed that OGs have significantly shorter protein lengths and fewer exons in Cucurbitaceae. The subcellular localizations of OGs were basically the same, with only subtle differences. Except for aggregation in some chromosomal regions, the distribution density of OGs was higher near the telomeres and relatively evenly distributed on the chromosomes. Gene expression analysis revealed that OGs had less abundantly and highly tissue-specific expression. Interestingly, the largest proportion of these OGs was significantly more tissue-specific expressed in the flower than in other tissues, and more detectable expression was found in the male flower. Functional prediction of OGs showed that (1) 18 OGs associated with male sterility in watermelon; (2) 182 OGs associated with flower development in cucumber; (3) 51 OGs associated with environmental adaptation in watermelon; (4) 520 OGs may help with the large fruit size in wax gourd. Our results provide the molecular basis and research direction for some important mechanisms in Cucurbitaceae species and domesticated crops.
Collapse
Affiliation(s)
- Dongna Ma
- College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
- College of the Environment and Ecology, Xiamen University, Fujian, China
| | - Zhengfeng Lai
- Subtropical Agricultural Research Institute, Fujian Academy of Agriculture Sciences, Fujian, China
| | - Qiansu Ding
- College of the Environment and Ecology, Xiamen University, Fujian, China
| | - Kun Zhang
- College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
| | - Kaizhen Chang
- College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
| | - Shuhao Li
- College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
| | - Zhizhu Zhao
- College of the Environment and Ecology, Xiamen University, Fujian, China
| | - Fenglin Zhong
- College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
| |
Collapse
|
5
|
Xu T, Yang X, Jia Y, Li Z, Tang G, Li X, Wang B, Wang T, Lin J, Guo L, Ye K. A global survey of the transcriptome of the opium poppy (Papaver somniferum) based on single-molecule long-read isoform sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:607-620. [PMID: 35092713 DOI: 10.1111/tpj.15689] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 01/13/2022] [Accepted: 01/19/2022] [Indexed: 06/14/2023]
Affiliation(s)
- Tun Xu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Xiaofei Yang
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Genome Institute, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Yanyan Jia
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Zihang Li
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Guangbo Tang
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Xiujuan Li
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Bo Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Tingjie Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Faculty of Science, Leiden University, Leiden, The Netherlands
| | - Li Guo
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Genome Institute, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
- Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
6
|
Dolatabadian A, Fernando WGD. Genomic Variations and Mutational Events Associated with Plant-Pathogen Interactions. BIOLOGY 2022; 11:421. [PMID: 35336795 PMCID: PMC8945218 DOI: 10.3390/biology11030421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 12/23/2022]
Abstract
Phytopathologists are actively researching the molecular basis of plant-pathogen interactions. The mechanisms of responses to pathogens have been studied extensively in model crop plant species and natural populations. Today, with the rapid expansion of genomic technologies such as DNA sequencing, transcriptomics, proteomics, and metabolomics, as well as the development of new methods and protocols, data analysis, and bioinformatics, it is now possible to assess the role of genetic variation in plant-microbe interactions and to understand the underlying molecular mechanisms of plant defense and microbe pathogenicity with ever-greater resolution and accuracy. Genetic variation is an important force in evolution that enables organisms to survive in stressful environments. Moreover, understanding the role of genetic variation and mutational events is essential for crop breeders to produce improved cultivars. This review focuses on genetic variations and mutational events associated with plant-pathogen interactions and discusses how these genome compartments enhance plants' and pathogens' evolutionary processes.
Collapse
Affiliation(s)
- Aria Dolatabadian
- Department of Plant Science, Faculty of Agricultural and Food Sciences, University of Manitoba, Winnipeg, MB R3T 2N2, Canada;
| | | |
Collapse
|
7
|
Zhao Z, Ma D. Genome-Wide Identification, Characterization and Function Analysis of Lineage-Specific Genes in the Tea Plant Camellia sinensis. Front Genet 2021; 12:770570. [PMID: 34858483 PMCID: PMC8631334 DOI: 10.3389/fgene.2021.770570] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 10/14/2021] [Indexed: 11/22/2022] Open
Abstract
Genes that have no homologous sequences with other species are called lineage-specific genes (LSGs), are common in living organisms, and have an important role in the generation of new functions, adaptive evolution and phenotypic alteration of species. Camellia sinensis var. sinensis (CSS) is one of the most widely distributed cultivars for quality green tea production. The rich catechins in tea have antioxidant, free radical elimination, fat loss and cancer prevention potential. To further understand the evolution and utilize the function of LSGs in tea, we performed a comparative genomics approach to identify Camellia-specific genes (CSGs). Our result reveals that 1701 CSGs were identified specific to CSS, accounting for 3.37% of all protein-coding genes. The majority of CSGs (57.08%) were generated by gene duplication, and the time of duplication occurrence coincide with the time of two genome-wide replication (WGD) events that happened in CSS genome. Gene structure analysis revealed that CSGs have shorter gene lengths, fewer exons, higher GC content and higher isoelectric point. Gene expression analysis showed that CSG had more tissue-specific expression compared to evolutionary conserved genes (ECs). Weighted gene co-expression network analysis (WGCNA) showed that 18 CSGs are mainly associated with catechin synthesis-related pathways, including phenylalanine biosynthesis, biosynthesis of amino acids, pentose phosphate pathway, photosynthesis and carbon metabolism. Besides, we found that the expression of three CSGs (CSS0030246, CSS0002298, and CSS0030939) was significantly down-regulated in response to both types of stresses (salt and drought). Our study first systematically identified LSGs in CSS, and comprehensively analyzed the features and potential functions of CSGs. We also identified key candidate genes, which will provide valuable assistance for further studies on catechin synthesis and provide a molecular basis for the excavation of excellent germplasm resources.
Collapse
Affiliation(s)
- Zhizhu Zhao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Dongna Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
| |
Collapse
|
8
|
Omidvar R, Vosseler N, Abbas A, Gutmann B, Grünwald-Gruber C, Altmann F, Siddique S, Bohlmann H. Analysis of a gene family for PDF-like peptides from Arabidopsis. Sci Rep 2021; 11:18948. [PMID: 34556705 PMCID: PMC8460643 DOI: 10.1038/s41598-021-98175-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 08/31/2021] [Indexed: 11/09/2022] Open
Abstract
Plant defensins are small, basic peptides that have a characteristic three-dimensional folding pattern which is stabilized by four disulfide bridges. We show here that Arabidopsis contains in addition to the proper plant defensins a group of 9 plant defensin-like (PdfL) genes. They are all expressed at low levels while GUS fusions of the promoters showed expression in most tissues with only minor differences. We produced two of the encoded peptides in E. coli and tested the antimicrobial activity in vitro. Both were highly active against fungi but had lower activity against bacteria. At higher concentrations hyperbranching and swollen tips, which are indicative of antimicrobial activity, were induced in Fusarium graminearum by both peptides. Overexpression lines for most PdfL genes were produced using the 35S CaMV promoter to study their possible in planta function. With the exception of PdfL4.1 these lines had enhanced resistance against F. oxysporum. All PDFL peptides were also transiently expressed in Nicotiana benthamiana leaves with agroinfiltration using the pPZP3425 vector. In case of PDFL1.4 this resulted in complete death of the infiltrated tissues after 7 days. All other PDFLs resulted only in various degrees of small necrotic lesions. In conclusion, our results show that at least some of the PdfL genes could function in plant resistance.
Collapse
Affiliation(s)
- Reza Omidvar
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria
- Institute of Biotechnology in Plant Production, Department of Agrobiotechnology, University of Natural Resources and Life Sciences, Vienna (BOKU), Tulln, Austria
| | - Nadine Vosseler
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria
| | - Amjad Abbas
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria
- Department of Plant Pathology, University of Agriculture, Faisalabad, 38040, Pakistan
| | - Birgit Gutmann
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria
- RIVIERA Pharma and Cosmetics GmbH, Holzhackerstraße 1, Tulln, Austria
| | - Clemens Grünwald-Gruber
- Department of Chemistry, University of Natural Resources and Life Sciences, Muthgasse 18, 1190, Vienna, Austria
| | - Friedrich Altmann
- Department of Chemistry, University of Natural Resources and Life Sciences, Muthgasse 18, 1190, Vienna, Austria
| | - Shahid Siddique
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria
- Department of Entomology and Nematology, University of California Davis, Davis, CA, 95616, USA
| | - Holger Bohlmann
- Division of Plant Protection, Department of Crop Sciences, Institute of Plant Protection, University of Natural Resources and Life Sciences Vienna, UFT Tulln, Konrad Lorenz Str. 24, 3430, Tulln, Austria.
| |
Collapse
|
9
|
Ma D, Ding Q, Guo Z, Zhao Z, Wei L, Li Y, Song S, Zheng HL. Identification, characterization and expression analysis of lineage-specific genes within mangrove species Aegiceras corniculatum. Mol Genet Genomics 2021; 296:1235-1247. [PMID: 34363105 DOI: 10.1007/s00438-021-01810-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Accepted: 07/22/2021] [Indexed: 11/25/2022]
Abstract
Lineage-specific genes (LSGs) are the genes that have no recognizable homology to any sequences in other species, which are important drivers for the generation of new functions, phenotypic changes, and facilitating species adaptation to environment. Aegiceras corniculatum is one of major mangrove plant species adapted to waterlogging and saline conditions, and the exploration of aegiceras-specific genes (ASGs) is important to reveal its adaptation to the harsh environment. Here, we performed a systematic analysis on ASGs, focusing on their sequence characterization, origination and expression patterns. Our results reveal that there are 4823 ASGs in the genome, approximately 11.84% of all protein-coding genes. High proportion (45.78%) of ASGs originate from gene duplication, and the time of gene duplication of ASGs is consistent with the timing of two genome-wide replication (WGD) events that occurred in A. corniculatum, and also coincides with a short period of global warming during the Paleocene-Eocene Maximum (PETM, 55.5 million years ago). Gene structure analysis showed that ASGs have shorter protein lengths, fewer exons, and higher isoelectric point. Expression patterns analysis showed that ASGs had low levels of expression and more tissue-specific expression. Weighted gene co-expression network analysis (WGCNA) revealed that 86 ASGs co-expressed gene modules were primarily involved in pathways related to adversity stress, including plant hormone signal transduction, phenylpropanoid biosynthesis, photosynthesis, peroxisome and pentose phosphate pathway. This study provides a comprehensive analysis of the characteristics and potential functions of ASGs and identifies key candidate genes, which will contribute to the subsequent further investigation of the adaptation of A. corniculatum to intertidal coastal wetland habitats.
Collapse
Affiliation(s)
- Dongna Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Qiansu Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Zejun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Zhizhu Zhao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Liufeng Wei
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Yiying Li
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Shiwei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China.
| |
Collapse
|
10
|
Gao Q, Jin X, Xia E, Wu X, Gu L, Yan H, Xia Y, Li S. Identification of Orphan Genes in Unbalanced Datasets Based on Ensemble Learning. Front Genet 2020; 11:820. [PMID: 33133122 PMCID: PMC7567012 DOI: 10.3389/fgene.2020.00820] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 07/08/2020] [Indexed: 11/13/2022] Open
Abstract
Orphan genes are associated with regulatory patterns, but experimental methods for identifying orphan genes are both time-consuming and expensive. Designing an accurate and robust classification model to detect orphan and non-orphan genes in unbalanced distribution datasets poses a particularly huge challenge. Synthetic minority over-sampling algorithms (SMOTE) are selected in a preliminary step to deal with unbalanced gene datasets. To identify orphan genes in balanced and unbalanced Arabidopsis thaliana gene datasets, SMOTE algorithms were then combined with traditional and advanced ensemble classified algorithms respectively, using Support Vector Machine, Random Forest (RF), AdaBoost (adaptive boosting), GBDT (gradient boosting decision tree), and XGBoost (extreme gradient boosting). After comparing the performance of these ensemble models, SMOTE algorithms with XGBoost achieved an F1 score of 0.94 with the balanced A. thaliana gene datasets, but a lower score with the unbalanced datasets. The proposed ensemble method combines different balanced data algorithms including Borderline SMOTE (BSMOTE), Adaptive Synthetic Sampling (ADSYN), SMOTE-Tomek, and SMOTE-ENN with the XGBoost model separately. The performances of the SMOTE-ENN-XGBoost model, which combined over-sampling and under-sampling algorithms with XGBoost, achieved higher predictive accuracy than the other balanced algorithms with XGBoost models. Thus, SMOTE-ENN-XGBoost provides a theoretical basis for developing evaluation criteria for identifying orphan genes in unbalanced and biological datasets.
Collapse
Affiliation(s)
- Qijuan Gao
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agriculture University, Hefei, China
| | - Xiu Jin
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agriculture University, Hefei, China
| | - Enhua Xia
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Xiangwei Wu
- School of Resources and Environment, Anhui Agricultural University, Hefei, China
| | - Lichuan Gu
- School of Information and Computer Science, Anhui Agricultural University, Hefei, China
| | - Hanwei Yan
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei, China
| | - Yingchun Xia
- School of Information and Computer Science, Anhui Agricultural University, Hefei, China
| | - Shaowen Li
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agriculture University, Hefei, China
| |
Collapse
|
11
|
Sierocka I, Alaba S, Jarmolowski A, Karlowski WM, Szweykowska-Kulinska Z. The identification of differentially expressed genes in male and female gametophytes of simple thalloid liverwort Pellia endiviifolia sp. B using an RNA-seq approach. PLANTA 2020; 252:21. [PMID: 32671488 PMCID: PMC7363739 DOI: 10.1007/s00425-020-03424-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 07/08/2020] [Indexed: 05/03/2023]
Abstract
MAIN CONCLUSION This study shows differences in gene expression between male and female gametophytes of the simple thalloid liverwort with a distinction between the vegetative and reproductive phases of growth. Pellia endiviifolia is a simple thalloid liverwort that, together with hornworts and mosses, represents the oldest living land plants. The limited taxon sampling for genomic and functional studies hampers our understanding of processes governing evolution of these plants. RNA sequencing represents an attractive way to elucidate the molecular mechanisms of non-model species development. In the present study, RNA-seq was used to profile the differences in gene expression between P. endiviifolia male and female gametophytes, with a distinction between the vegetative and reproductive phases of growth. By comparison of the gene expression profiles from individuals producing sex organs with the remaining thalli types, we have determined a set of genes whose expression might be important for the development of P. endiviifolia reproductive organs. The selected differentially expressed genes (DEGs) were categorized into five main pathways: metabolism, genetic information processing, environmental information processing, cellular processes, and organismal systems. A comparison of the obtained data with the Marchantia polymorpha transcriptome resulted in the identification of genes exhibiting a similar expression pattern during the reproductive phase of growth between members of the two distinct liverwort classes. The common expression profile of 87 selected genes suggests a common mechanism governing sex organ development in both liverwort species. The obtained RNA-seq results were confirmed by RT-qPCR for the DEGs with the highest differences in expression level. Five Pellia-female-specific and two Pellia-male-specific DEGs showed enriched expression in archegonia and antheridia, respectively. The identified genes are promising candidates for functional studies of their involvement in liverwort sexual reproduction.
Collapse
Affiliation(s)
- Izabela Sierocka
- Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland.
| | - Sylwia Alaba
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland
| | - Artur Jarmolowski
- Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland
| | - Wojciech M Karlowski
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland
| | - Zofia Szweykowska-Kulinska
- Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614, Poznan, Poland
| |
Collapse
|
12
|
Chen K, Tian Z, Chen P, He H, Jiang F, Long CA. Genome-wide identification, characterization and expression analysis of lineage-specific genes within Hanseniaspora yeasts. FEMS Microbiol Lett 2020; 367:5837084. [PMID: 32407480 DOI: 10.1093/femsle/fnaa077] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 05/12/2020] [Indexed: 12/13/2022] Open
Abstract
Lineage-specific genes (LSGs) are defined as genes with sequences that are not significantly similar to those in any other lineage. LSGs have been proposed, and sometimes shown, to have significant effects in the evolution of biological function. In this study, two sets of Hanseniaspora spp. LSGs were identified by comparing the sequences of the Kloeckera apiculata genome and of 80 other yeast genomes. This study identified 344 Hanseniaspora-specific genes (HSGs) and 109 genes ('orphan genes') specific to K. apiculata. Three thousand three hundred thirty-one K. apiculata genes that showed significant similarity to at least one sequence outside the Hanseniaspora were classified into evolutionarily conserved genes. We analyzed their sequence features, functional categories, gene origin, gene structure and gene expression. We also investigated the predicted cellular roles and Gene Ontology categories of the LSGs using functional inference. The patterns of the functions of LSGs do not deviate significantly from genome-wide average. The results showed that a few LSGs were formed by gene duplication, followed by rapid sequence divergence. Many of the HSGs and orphan genes exhibited altered expression in response to abiotic stress. Studying these LSGs might be helpful for understanding the molecular mechanism of yeast adaption.
Collapse
Affiliation(s)
- Kai Chen
- School of Biological Engineering and Food, Hubei University of Technology, Wuhan 430068, China
| | - Zhonghuan Tian
- Key Laboratory of Horticultural Plant Biology of the Ministry of Education, National Centre of Citrus Breeding, Huazhong Agricultural University, Wuhan 430070, China
| | - Ping Chen
- Department of Pediatric Hematology, Tongji Hospital Affiliated to Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430000, China
| | - Hua He
- School of Landscape Architecture and Horticulture, Wuhan Institute of Bioengineering, Wuhan 430415, China
| | - Fatang Jiang
- School of Biological Engineering and Food, Hubei University of Technology, Wuhan 430068, China
| | - Chao-An Long
- Key Laboratory of Horticultural Plant Biology of the Ministry of Education, National Centre of Citrus Breeding, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
13
|
Brennan CJ, Zhou B, Benbow HR, Ajaz S, Karki SJ, Hehir JG, O’Driscoll A, Feechan A, Mullins E, Doohan FM. Taxonomically Restricted Wheat Genes Interact With Small Secreted Fungal Proteins and Enhance Resistance to Septoria Tritici Blotch Disease. FRONTIERS IN PLANT SCIENCE 2020; 11:433. [PMID: 32477375 PMCID: PMC7236048 DOI: 10.3389/fpls.2020.00433] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 03/24/2020] [Indexed: 05/12/2023]
Abstract
Understanding the nuances of host/pathogen interactions are paramount if we wish to effectively control cereal diseases. In the case of the wheat/Zymoseptoria tritici interaction that leads to Septoria tritici blotch (STB) disease, a 10,000-year-old conflict has led to considerable armaments being developed on both sides which are not reflected in conventional model systems. Taxonomically restricted genes (TRGs) have evolved in wheat to better allow it to cope with stress caused by fungal pathogens, and Z. tritici has evolved specialized effectors which allow it to manipulate its' host. A microarray focused on the latent phase response of a resistant wheat cultivar (cv. Stigg) and susceptible wheat cultivar (cv. Gallant) to Z. tritici infection was mined for TRGs within the Poaceae. From this analysis, we identified two TRGs that were significantly upregulated in response to Z. tritici infection, Septoria-responsive TRG6 and 7 (TaSRTRG6 and TaSRTRG7). Virus induced silencing of these genes resulted in an increased susceptibility to STB disease in cvs. Gallant and Stigg, and significantly so in the latter (2.5-fold increase in STB disease). In silico and localization studies categorized TaSRTRG6 as a secreted protein and TaSRTRG7 as an intracellular protein. Yeast two-hybrid analysis and biofluorescent complementation studies demonstrated that both TaSRTRG6 and TaSRTRG7 can interact with small proteins secreted by Z. tritici (potential effector candidates). Thus we conclude that TRGs are an important part of the wheat-Z. tritici co-evolution story and potential candidates for modulating STB resistance.
Collapse
Affiliation(s)
- Ciarán J. Brennan
- UCD School of Biology and Environmental Science and UCD Earth Institute, UCD O’Brien Centre for Science (East), University College Dublin, Belfield, Ireland
| | - Binbin Zhou
- UCD School of Biology and Environmental Science and UCD Earth Institute, UCD O’Brien Centre for Science (East), University College Dublin, Belfield, Ireland
| | - Harriet R. Benbow
- UCD School of Biology and Environmental Science and UCD Earth Institute, UCD O’Brien Centre for Science (East), University College Dublin, Belfield, Ireland
| | - Sobia Ajaz
- UCD School of Biology and Environmental Science and UCD Earth Institute, UCD O’Brien Centre for Science (East), University College Dublin, Belfield, Ireland
| | - Sujit J. Karki
- School of Agriculture and Food Science, University College Dublin, Belfield, Ireland
| | | | | | - Angela Feechan
- School of Agriculture and Food Science, University College Dublin, Belfield, Ireland
| | - Ewen Mullins
- Department of Crop Science, Teagasc, Carlow, Ireland
| | - Fiona M. Doohan
- UCD School of Biology and Environmental Science and UCD Earth Institute, UCD O’Brien Centre for Science (East), University College Dublin, Belfield, Ireland
- *Correspondence: Fiona M. Doohan,
| |
Collapse
|
14
|
Identification, characterization and expression analysis of lineage-specific genes within Triticeae. Genomics 2019; 112:1343-1350. [PMID: 31401233 DOI: 10.1016/j.ygeno.2019.08.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Revised: 08/04/2019] [Accepted: 08/07/2019] [Indexed: 12/11/2022]
Abstract
Lineage-specific genes (LSGs) are a set of genes in a given taxon without significant sequence similarity to genes and intergenic sequences of other taxa and are functional. The tribe Triticeae mainly includes species of different ploidy levels, such as staple food crops wheat (Triticum aestivum L.) and barley (Hordeum vulgare L.). This study is aimed at mining and characterizing the Triticeae-specific genes (TSGs) using expressed sequence data of wheat. A total of 3812 TSGs was identified and they were generally characterized by smaller size, fewer exons, shorter open reading frames and lower expression levels. Most TSGs were expressed with tissue preference and many of them were predominantly expressed in reproduction related tissues, especially in young stamen. Nearly one third of the TSGs were stress-responsive and inducible under abiotic and/or biotic stresses. A co-expression-based annotation supported the relevance of some TSGs with reproduction and stress responses, indicating their potential economic importance.
Collapse
|
15
|
Trujillo DI, Silverstein KAT, Young ND. Nodule-specific PLAT domain proteins are expanded in the Medicago lineage and required for nodulation. THE NEW PHYTOLOGIST 2019; 222:1538-1550. [PMID: 30664233 DOI: 10.1111/nph.15697] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Accepted: 01/03/2019] [Indexed: 06/09/2023]
Abstract
Symbiotic nitrogen fixation in legumes is mediated by an interplay of signaling processes between plant hosts and rhizobial symbionts. In legumes, several secreted protein families have undergone expansions and play key roles in nodulation. Thus, identifying lineage-specific expansions (LSEs) of nodulation-associated genes can be a strategy to discover candidate gene families. Using bioinformatic tools, we identified 13 LSEs of nodulation-related secreted protein families, each unique to either Glycine, Arachis or Medicago lineages. In the Medicago lineage, nodule-specific Polycystin-1, Lipoxygenase, Alpha Toxin (PLAT) domain proteins (NPDs) expanded to five members. We examined NPD function using CRISPR/Cas9 multiplex genome editing to create Medicago truncatula NPD knockout lines, targeting one to five NPD genes. Mutant lines with differing combinations of NPD gene inactivations had progressively smaller nodules, earlier onset of nodule senescence, or ineffective nodules compared to the wild-type control. Double- and triple-knockout lines showed dissimilar nodulation phenotypes but coincided in upregulation of a DHHC-type zinc finger and an aspartyl protease gene, possible candidates for the observed disturbance of proper nodule function. By postulating that gene family expansions can be used to detect candidate genes, we identified a family of nodule-specific PLAT domain proteins and confirmed that they play a role in successful nodule formation.
Collapse
Affiliation(s)
- Diana I Trujillo
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, 55108, USA
| | - Kevin A T Silverstein
- Supercomputing Institute for Advanced Computational Research, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Nevin D Young
- Department of Plant Pathology, University of Minnesota, Saint Paul, MN, 55108, USA
| |
Collapse
|
16
|
Jiang M, Dong X, Lang H, Pang W, Zhan Z, Li X, Piao Z. Mining of Brassica-Specific Genes (BSGs) and Their Induction in Different Developmental Stages and under Plasmodiophora brassicae Stress in Brassica rapa. Int J Mol Sci 2018; 19:ijms19072064. [PMID: 30012965 PMCID: PMC6073354 DOI: 10.3390/ijms19072064] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 06/29/2018] [Accepted: 07/13/2018] [Indexed: 11/16/2022] Open
Abstract
Orphan genes, also called lineage-specific genes (LSGs), are important for responses to biotic and abiotic stresses, and are associated with lineage-specific structures and biological functions. To date, there have been no studies investigating gene number, gene features, or gene expression patterns of orphan genes in Brassica rapa. In this study, 1540 Brassica-specific genes (BSGs) and 1824 Cruciferae-specific genes (CSGs) were identified based on the genome of Brassica rapa. The genic features analysis indicated that BSGs and CSGs possessed a lower percentage of multi-exon genes, higher GC content, and shorter gene length than evolutionary-conserved genes (ECGs). In addition, five types of BSGs were obtained and 145 out of 529 real A subgenome-specific BSGs were verified by PCR in 51 species. In silico and semi-qPCR, gene expression analysis of BSGs suggested that BSGs are expressed in various tissue and can be induced by Plasmodiophora brassicae. Moreover, an A/C subgenome-specific BSG, BSGs1, was specifically expressed during the heading stage, indicating that the gene might be associated with leafy head formation. Our results provide valuable biological information for studying the molecular function of BSGs for Brassica-specific phenotypes and biotic stress in B. rapa.
Collapse
Affiliation(s)
- Mingliang Jiang
- College of Horticulture, Shenyang Agricultural University, #120 Dongling Road, Shenyang 110866, China.
| | - Xiangshu Dong
- School of Agriculture, Yunnan University, Kunming 650504, China.
| | - Hong Lang
- Key Laboratory of Northeast Rice Biology and Breeding, Ministry of Agriculture, Rice Research Institute, Shenyang Agricultural University, Shenyang 110866, China.
| | - Wenxing Pang
- College of Horticulture, Shenyang Agricultural University, #120 Dongling Road, Shenyang 110866, China.
| | - Zongxiang Zhan
- College of Horticulture, Shenyang Agricultural University, #120 Dongling Road, Shenyang 110866, China.
| | - Xiaonan Li
- College of Horticulture, Shenyang Agricultural University, #120 Dongling Road, Shenyang 110866, China.
| | - Zhongyun Piao
- College of Horticulture, Shenyang Agricultural University, #120 Dongling Road, Shenyang 110866, China.
| |
Collapse
|
17
|
Feau N, Beauseigle S, Bergeron MJ, Bilodeau GJ, Birol I, Cervantes-Arango S, Dhillon B, Dale AL, Herath P, Jones SJ, Lamarche J, Ojeda DI, Sakalidis ML, Taylor G, Tsui CK, Uzunovic A, Yueh H, Tanguay P, Hamelin RC. Genome-Enhanced Detection and Identification (GEDI) of plant pathogens. PeerJ 2018; 6:e4392. [PMID: 29492338 PMCID: PMC5825881 DOI: 10.7717/peerj.4392] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 01/29/2018] [Indexed: 12/17/2022] Open
Abstract
Plant diseases caused by fungi and Oomycetes represent worldwide threats to crops and forest ecosystems. Effective prevention and appropriate management of emerging diseases rely on rapid detection and identification of the causal pathogens. The increase in genomic resources makes it possible to generate novel genome-enhanced DNA detection assays that can exploit whole genomes to discover candidate genes for pathogen detection. A pipeline was developed to identify genome regions that discriminate taxa or groups of taxa and can be converted into PCR assays. The modular pipeline is comprised of four components: (1) selection and genome sequencing of phylogenetically related taxa, (2) identification of clusters of orthologous genes, (3) elimination of false positives by filtering, and (4) assay design. This pipeline was applied to some of the most important plant pathogens across three broad taxonomic groups: Phytophthoras (Stramenopiles, Oomycota), Dothideomycetes (Fungi, Ascomycota) and Pucciniales (Fungi, Basidiomycota). Comparison of 73 fungal and Oomycete genomes led the discovery of 5,939 gene clusters that were unique to the targeted taxa and an additional 535 that were common at higher taxonomic levels. Approximately 28% of the 299 tested were converted into qPCR assays that met our set of specificity criteria. This work demonstrates that a genome-wide approach can efficiently identify multiple taxon-specific genome regions that can be converted into highly specific PCR assays. The possibility to easily obtain multiple alternative regions to design highly specific qPCR assays should be of great help in tackling challenging cases for which higher taxon-resolution is needed.
Collapse
Affiliation(s)
- Nicolas Feau
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
| | | | | | | | - Inanc Birol
- BC Cancer agency, Genome Sciences Centre, Vancouver, BC, Canada
| | - Sandra Cervantes-Arango
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
| | - Braham Dhillon
- Department of Plant Pathology, University of Arkansas at Fayetteville, Fayetteville, AR, United States of America
| | - Angela L. Dale
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
- FPInnovations, Vancouver, BC, Canada
| | - Padmini Herath
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
| | - Steven J.M. Jones
- BC Cancer agency, Genome Sciences Centre, Vancouver, BC, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Josyanne Lamarche
- Canadian Forest Service, Natural Resources Canada, Quebec city, Quebec, Canada
| | - Dario I. Ojeda
- Department of Biology Unit of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Monique L. Sakalidis
- Department of Plant, Soil & Microbial Sciences and Department of Forestry, Michigan State University, East Lansing, MI, United States of America
| | - Greg Taylor
- BC Cancer agency, Genome Sciences Centre, Vancouver, BC, Canada
| | - Clement K.M. Tsui
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | | | - Hesther Yueh
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
| | - Philippe Tanguay
- Canadian Forest Service, Natural Resources Canada, Quebec city, Quebec, Canada
| | - Richard C. Hamelin
- Department of Forest and Conservation Sciences, Forest Sciences Centre, University of British Columbia, Vancouver, BC, Canada
- Foresterie et géomatique, Institut de Biologie Intégrative des Systèmes, Laval University, Quebec city, Quebec, Canada
| |
Collapse
|
18
|
Yao C, Yan H, Zhang X, Wang R. A database for orphan genes in Poaceae. Exp Ther Med 2017; 14:2917-2924. [PMID: 28966675 PMCID: PMC5615222 DOI: 10.3892/etm.2017.4918] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 06/08/2017] [Indexed: 11/19/2022] Open
Abstract
Orphan genes refer to a group of protein-coding genes lacking recognizable homologs in the other organisms. Extensive studies have demonstrated that numerous newly sequenced genomes contain a significant number of orphan genes, which have important roles in plant's responses to the environment. Due to a lack of phylogenetic conservation, the origin of orphan genes and their functions are currently not well defined. In the present study, a Poaceae orphan genes database (POGD; http://bioinfo.ahau.edu.cn/pogd) was established to serve as a user-friendly web interface for entry browsing, searching and downloading orphan genes from various plants. Four Poaceae species, including Brachypodium distachyon, Oryza sativa, Sorghum bicolor and Zea mays, are included in the current version of POGD. The database provides gene descriptions (chromosome strands, physical location), gene product records (protein length, isoelectric point, molecular weight as well as gene and protein sequences) and functional annotations (cellular role, gene ontology category, subcellular localization prediction). Basic Local Alignment Search Tool and comparative analyses were also provided on the website. POGD will serve as a comprehensive and reliable repository, which will help uncover regulatory mechanisms of orphan genes and may assist in the development of comparative genomics in plant biology.
Collapse
Affiliation(s)
- Chensong Yao
- Graduate School, Anhui Agricultural University, Hefei, Anhui 230036, P.R. China
| | - Hanwei Yan
- Laboratory of Modern Biotechnology, Anhui Agricultural University, Hefei, Anhui 230036, P.R. China
| | - Xiaodan Zhang
- School of Information and Computer Science, Anhui Agricultural University, Hefei, Anhui 230036, P.R. China
| | - Rongfu Wang
- Department of Life Sciences, Anhui Agricultural University, Hefei, Anhui 230036, P.R. China
| |
Collapse
|
19
|
Kaur P, Bayer PE, Milec Z, Vrána J, Yuan Y, Appels R, Edwards D, Batley J, Nichols P, Erskine W, Doležel J. An advanced reference genome of Trifolium subterraneum L. reveals genes related to agronomic performance. PLANT BIOTECHNOLOGY JOURNAL 2017; 15:1034-1046. [PMID: 28111887 PMCID: PMC5506647 DOI: 10.1111/pbi.12697] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Revised: 01/05/2017] [Accepted: 01/11/2017] [Indexed: 05/08/2023]
Abstract
Subterranean clover is an important annual forage legume, whose diploidy and inbreeding nature make it an ideal model for genomic analysis in Trifolium. We reported a draft genome assembly of the subterranean clover TSUd_r1.1. Here we evaluate genome mapping on nanochannel arrays and generation of a transcriptome atlas across tissues to advance the assembly and gene annotation. Using a BioNano-based assembly spanning 512 Mb (93% genome coverage), we validated the draft assembly, anchored unplaced contigs and resolved misassemblies. Multiple contigs (264) from the draft assembly coalesced into 97 super-scaffolds (43% of genome). Sequences longer than >1 Mb increased from 40 to 189 Mb giving 1.4-fold increase in N50 with total genome in pseudomolecules improved from 73 to 80%. The advanced assembly was re-annotated using transcriptome atlas data to contain 31 272 protein-coding genes capturing >96% of the gene content. Functional characterization and GO enrichment confirmed gene expression for response to water deprivation, flavonoid biosynthesis and embryo development ending in seed dormancy, reflecting adaptation to the harsh Mediterranean environment. Comparative analyses across Papilionoideae identified 24 893 Trifolium-specific and 6325 subterranean-clover-specific genes that could be mined further for traits such as geocarpy and grazing tolerance. Eight key traits, including persistence, improved livestock health by isoflavonoid production in addition to important agro-morphological traits, were fine-mapped on the high-density SNP linkage map anchored to the assembly. This new genomic information is crucial to identify loci governing traits allowing marker-assisted breeding, comparative mapping and identification of tissue-specific gene promoters for biotechnological improvement of forage legumes.
Collapse
Affiliation(s)
- Parwinder Kaur
- Centre for Plant Genetics and Breeding and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | - Philipp E. Bayer
- School of Plant Biology and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | - Zbyněk Milec
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Jan Vrána
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Yuxuan Yuan
- School of Plant Biology and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | | | - David Edwards
- School of Plant Biology and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | - Jacqueline Batley
- School of Plant Biology and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | - Phillip Nichols
- School of Plant Biology and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
- Department of Agriculture and Food Western AustraliaSouth PerthWAAustralia
| | - William Erskine
- Centre for Plant Genetics and Breeding and Institute of AgricultureThe University of Western AustraliaCrawleyWAAustralia
| | - Jaroslav Doležel
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| |
Collapse
|
20
|
Withers S, Gongora-Castillo E, Gent D, Thomas A, Ojiambo PS, Quesada-Ocampo LM. Using Next-Generation Sequencing to Develop Molecular Diagnostics for Pseudoperonospora cubensis, the Cucurbit Downy Mildew Pathogen. PHYTOPATHOLOGY 2016; 106:1105-1116. [PMID: 27314624 DOI: 10.1094/phyto-10-15-0260-fi] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Advances in next-generation sequencing (NGS) allow for rapid development of genomics resources needed to generate molecular diagnostics assays for infectious agents. NGS approaches are particularly helpful for organisms that cannot be cultured, such as the downy mildew pathogens, a group of biotrophic obligate oomycetes that infect crops of economic importance. Unlike most downy mildew pathogens that are highly host-specific, Pseudoperonospora cubensis causes disease on a broad range of crops belonging to the family Cucurbitaceae. In this study, we identified candidate diagnostic markers for P. cubensis by comparing NGS data from a diverse panel of P. cubensis and P. humuli isolates, two very closely related oomycete species. P. cubensis isolates from diverse hosts and geographical regions in the United States were selected for sequencing to ensure that candidates were conserved in P. cubensis isolates infecting different cucurbit hosts. Genomic regions unique to and conserved in P. cubensis isolates were identified through bioinformatics. These candidate regions were then validated using PCR against a larger collection of isolates from P. cubensis, P. humuli, and other oomycetes. Overall seven diagnostic markers were found to be specific to P. cubensis. These markers could be used for pathogen diagnostics on infected tissue, or adapted for monitoring airborne inoculum with real-time PCR and spore traps.
Collapse
Affiliation(s)
- S Withers
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| | - E Gongora-Castillo
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| | - D Gent
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| | - A Thomas
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| | - P S Ojiambo
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| | - L M Quesada-Ocampo
- First, second, fourth, fifth, and sixth authors: Department of Plant Pathology, North Carolina State University, Raleigh 27695-7616; third author: U.S. Department of Agriculture-Agricultural Research Service, Forage Seed and Cereal Research Unit, and Oregon State University, Corvallis 97331; and fourth and fifth authors: Center for Integrated Fungal Research, North Carolina State University, Raleigh 27695-7567
| |
Collapse
|
21
|
Bedre R, Mangu VR, Srivastava S, Sanchez LE, Baisakh N. Transcriptome analysis of smooth cordgrass (Spartina alterniflora Loisel), a monocot halophyte, reveals candidate genes involved in its adaptation to salinity. BMC Genomics 2016; 17:657. [PMID: 27542721 PMCID: PMC4992267 DOI: 10.1186/s12864-016-3017-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 08/13/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Soil salinity affects growth and yield of crop plants. Plants respond to salinity by physiological and biochemical adjustments through a coordinated regulation and expression of a cascade of genes. Recently, halophytes have attracted attention of the biologists to understand their salt adaptation mechanisms. Spartina alterniflora (smooth cordgrass) is a Louisiana native monocot halophyte that can withstand salinity up to double the strength of sea water. To dissect the molecular mechanisms underlying its salinity adaptation, leaf and root transcriptome of S. alterniflora was sequenced using 454/GS-FLX. RESULTS Altogether, 770,690 high quality reads with an average length 324-bp were assembled de novo into 73,131 contigs (average 577-bp long) with 5.9X sequence coverage. Most unigenes (95 %) annotated to proteins with known functions, and had more than 90 % similarity to rice genes. About 28 % unigenes were considered specific to S. alterniflora. Digital expression profiles revealed significant enrichment (P < 0.01) of transporters, vacuolar proton pump members and transcription factors under salt stress, which suggested the role of ion homeostasis and transcriptional regulation in the salinity adaptation of this grass. Also, 10,805 SSRs markers from 9457 unigenes were generated and validated through genetic diversity analysis among 13 accessions of S. alterniflora. CONCLUSIONS The present study explores the transcriptome of S. alterniflora to understand the gene regulation under salt stress in halophytes. The sequenced transcriptome (control and salt-regulated) of S. alterniflora provides a platform for further gene finding studies in grasses. This study and our previously published studies suggested that S. alterniflora is a rich reservoir of salt tolerance genes that can be used to develop salt tolerant cereal crops, especially rice, a major food crop of global importance.
Collapse
Affiliation(s)
- Renesh Bedre
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
| | - Venkata Ramanarao Mangu
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
| | - Subodh Srivastava
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, 29634, USA
| | - Luis Eduardo Sanchez
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
- Current address: Centro de Investigaciones Biotecnológicas del Ecuador (CIBE), Km 30.5 Via Perimetral, Guayaquil, Ecuador
| | - Niranjan Baisakh
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA.
| |
Collapse
|
22
|
Li ZW, Chen X, Wu Q, Hagmann J, Han TS, Zou YP, Ge S, Guo YL. On the Origin of De Novo Genes in Arabidopsis thaliana Populations. Genome Biol Evol 2016; 8:2190-202. [PMID: 27401176 PMCID: PMC4987118 DOI: 10.1093/gbe/evw164] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
De novo genes, which originate from ancestral nongenic sequences, are one of the most important sources of protein-coding genes. This origination process is crucial for the adaptation of organisms. However, how de novo genes arise and become fixed in a population or species remains largely unknown. Here, we identified 782 de novo genes from the model plant Arabidopsis thaliana and divided them into three types based on the availability of translational evidence, transcriptional evidence, and neither transcriptional nor translational evidence for their origin. Importantly, by integrating multiple types of omics data, including data from genomes, epigenomes, transcriptomes, and translatomes, we found that epigenetic modifications (DNA methylation and histone modification) play an important role in the origination process of de novo genes. Intriguingly, using the transcriptomes and methylomes from the same population of 84 accessions, we found that de novo genes that are transcribed in approximately half of the total accessions within the population are highly methylated, with lower levels of transcription than those transcribed at other frequencies within the population. We hypothesized that, during the origin of de novo gene alleles, those neutralized to low expression states via DNA methylation have relatively high probabilities of spreading and becoming fixed in a population. Our results highlight the process underlying the origin of de novo genes at the population level, as well as the importance of DNA methylation in this process.
Collapse
Affiliation(s)
- Zi-Wen Li
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| | - Xi Chen
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| | - Qiong Wu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| | - Jörg Hagmann
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Ting-Shen Han
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| | - Yu-Pan Zou
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
| | - Song Ge
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| | - Ya-Long Guo
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
23
|
Ekstrom A, Yin Y. ORFanFinder: automated identification of taxonomically restricted orphan genes. Bioinformatics 2016; 32:2053-5. [PMID: 27153690 DOI: 10.1093/bioinformatics/btw122] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Accepted: 02/26/2016] [Indexed: 01/01/2023] Open
Abstract
MOTIVATION Orphan genes, also known as ORFans, are newly evolved genes in a genome that enable the organism to adapt to specific living environment. The gene content of every sequenced genome can be classified into different age groups, based on how widely/narrowly a gene's homologs are distributed in the context of species taxonomy. Those having homologs restricted to organisms of particular taxonomic ranks are classified as taxonomically restricted ORFans. RESULTS Implementing this idea, we have developed an open source program named ORFanFinder and a free web server to allow automated classification of a genome's gene content and identification of ORFans at different taxonomic ranks. ORFanFinder and its web server will contribute to the comparative genomics field by facilitating the study of the origin of new genes and the emergence of lineage-specific traits in both prokaryotes and eukaryotes. AVAILABILITY AND IMPLEMENTATION http://cys.bios.niu.edu/orfanfinder CONTACT yyin@niu.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Yanbin Yin
- Department of Biological Sciences, Montgomery Hall 325A, Northern Illinois University, DeKalb, IL, USA
| |
Collapse
|
24
|
Gu Y, Xing S, He C. Genome-Wide Analysis Indicates Lineage-Specific Gene Loss during Papilionoideae Evolution. Genome Biol Evol 2016; 8:635-48. [PMID: 26868598 PMCID: PMC4824202 DOI: 10.1093/gbe/evw021] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2016] [Indexed: 02/07/2023] Open
Abstract
Gene loss is the driving force for changes in genome and morphology; however, this particular evolutionary event has been poorly investigated in leguminous plants. Legumes (Fabaceae) have some lineage-specific and diagnostic characteristics that are distinct from other angiosperms. To understand the potential role of gene loss in the evolution of legumes, we compared six genome-sequenced legume species of Papilionoideae, the largest representative clade of Fabaceae, such as Glycine max, with 34 nonlegume plant species, such as Arabidopsis thaliana. The results showed that the putative orthologs of the 34 Arabidopsis genes belonging to 29 gene families were absent in these legume species but these were conserved in the sequenced nonlegume angiosperm lineages. Further evolutionary analyses indicated that the orthologs of these genes were almost completely lost in the Papillionoideae ancestors, thus designated as the legume lost genes (LLGs), and these underwent purifying selection in nonlegume plants. Most LLGs were functionally unknown. In Arabidopsis, two LLGs were well-known genes that played a role in plant immunity such as HARMLESS TO OZONE LAYER 1 and HOPZ-ACTIVATED RESISTANCE 1, and 16 additional LLGs were predicted to participate in plant-pathogen interactions in in silico expression and protein-protein interaction network analyses. Most of these LLGs' orthologs in various plants were also found to be associated with biotic stress response, indicating the conserved role of these genes in plant defense. The evolutionary implication of LLGs during the development of the ability of symbiotic nitrogen fixation involving plant and bacterial interactions, which is a well-known characteristic of most legumes, is also discussed. Our work sheds light on the evolutionary implication of gene loss events in Papilionoideae evolution, as well as provides new insights into crop design to improve nitrogen fixation capacity.
Collapse
Affiliation(s)
- Yongzhe Gu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing 100093, China Graduate University, Chinese Academy of Sciences, Yuquan Road 19, Beijing 100049, China
| | - Shilai Xing
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing 100093, China Graduate University, Chinese Academy of Sciences, Yuquan Road 19, Beijing 100049, China
| | - Chaoying He
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing 100093, China
| |
Collapse
|
25
|
Xu Y, Wu G, Hao B, Chen L, Deng X, Xu Q. Identification, characterization and expression analysis of lineage-specific genes within sweet orange (Citrus sinensis). BMC Genomics 2015; 16:995. [PMID: 26597278 PMCID: PMC4657247 DOI: 10.1186/s12864-015-2211-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 11/13/2015] [Indexed: 11/23/2022] Open
Abstract
Background With the availability of rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Results Two set of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with those evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. Particularly, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. Besides, part of the CSGs and orphan genes expressed responsive to abiotic stress, indicating their potential functions during interaction with environment. Conclusion This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2211-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuantao Xu
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), Huazhong Agricultural University, Wuhan, 430070, China.
| | - Guizhi Wu
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), Huazhong Agricultural University, Wuhan, 430070, China.
| | - Baohai Hao
- Agricultural Bioinformatics Key laboratory of Hubei Province, College of Information, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Lingling Chen
- Agricultural Bioinformatics Key laboratory of Hubei Province, College of Information, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Xiuxin Deng
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), Huazhong Agricultural University, Wuhan, 430070, China.
| | - Qiang Xu
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), Huazhong Agricultural University, Wuhan, 430070, China.
| |
Collapse
|
26
|
Zhou K, Huang B, Zou M, Lu D, He S, Wang G. Data in support of genome-wide identification of lineage-specific genes within Caenorhabditis elegans. Data Brief 2015; 4:595-601. [PMID: 26442285 PMCID: PMC4552949 DOI: 10.1016/j.dib.2015.07.032] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Revised: 07/20/2015] [Accepted: 07/26/2015] [Indexed: 11/22/2022] Open
Abstract
Two sets of LSGs were identified using BLAST: Caenorhabditis elegans species-specific genes (SSGs, 1423), and Caenorhabditis genus-specific genes (GSGs, 4539). The data contained in this article show SSGs and GSGs have significant differences in evolution and that most of them were formed by gene duplication and integration of transposable elements (TEs). Subsequent observation of temporal expression and protein function presents that many SSGs and GSGs are expressed and that genes involved with sex determination, specific stress, immune response, and morphogenesis are most represented. The data are related to research article "Genome-wide identification of lineage-specific genes within Caenorhabditis elegans" in Journal of Genomics [1].
Collapse
Affiliation(s)
- Kun Zhou
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China
| | - Beibei Huang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China
| | - Ming Zou
- Huazhong Agriculture University, Wuhan 430070, China
| | - Dandan Lu
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China
| | - Shunping He
- The Key Laboratory of Aquatic Biodiversity and Conservation of the Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Guoxiu Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China
| |
Collapse
|
27
|
Ocaña S, Seoane P, Bautista R, Palomino C, Claros GM, Torres AM, Madrid E. Large-Scale Transcriptome Analysis in Faba Bean (Vicia faba L.) under Ascochyta fabae Infection. PLoS One 2015; 10:e0135143. [PMID: 26267359 PMCID: PMC4534337 DOI: 10.1371/journal.pone.0135143] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2015] [Accepted: 07/17/2015] [Indexed: 12/21/2022] Open
Abstract
Faba bean is an important food crop worldwide. However, progress in faba bean genomics lags far behind that of model systems due to limited availability of genetic and genomic information. Using the Illumina platform the faba bean transcriptome from leaves of two lines (29H and Vf136) subjected to Ascochyta fabae infection have been characterized. De novo transcriptome assembly provided a total of 39,185 different transcripts that were functionally annotated, and among these, 13,266 were assigned to gene ontology against Arabidopsis. Quality of the assembly was validated by RT-qPCR amplification of selected transcripts differentially expressed. Comparison of faba bean transcripts with those of better-characterized plant genomes such as Arabidopsis thaliana, Medicago truncatula and Cicer arietinum revealed a sequence similarity of 68.3%, 72.8% and 81.27%, respectively. Moreover, 39,060 single nucleotide polymorphism (SNP) and 3,669 InDels were identified for genotyping applications. Mapping of the sequence reads generated onto the assembled transcripts showed that 393 and 457 transcripts were overexpressed in the resistant (29H) and susceptible genotype (Vf136), respectively. Transcripts involved in plant-pathogen interactions such as leucine rich proteins (LRR) or plant growth regulators involved in plant adaptation to abiotic and biotic stresses were found to be differently expressed in the resistant line. The results reported here represent the most comprehensive transcript database developed so far in faba bean, providing valuable information that could be used to gain insight into the pathways involved in the resistance mechanism against A. fabae and to identify potential resistance genes to be further used in marker assisted selection.
Collapse
Affiliation(s)
- Sara Ocaña
- Área de Mejora y Biotecnología, IFAPA Centro Alameda del Obispo, Apdo 3092, E-14080, Córdoba, Spain
| | - Pedro Seoane
- Departamento de Biología Molecular y Bioquímica, Universidad de Málaga, E-29071, Málaga, Spain
| | - Rocio Bautista
- Plataforma Andaluza de Bioinformática, Universidad de Málaga, E-29071, Málaga, Spain
| | - Carmen Palomino
- Área de Mejora y Biotecnología, IFAPA Centro Alameda del Obispo, Apdo 3092, E-14080, Córdoba, Spain
| | - Gonzalo M. Claros
- Departamento de Biología Molecular y Bioquímica, Universidad de Málaga, E-29071, Málaga, Spain
- Plataforma Andaluza de Bioinformática, Universidad de Málaga, E-29071, Málaga, Spain
| | - Ana M. Torres
- Área de Mejora y Biotecnología, IFAPA Centro Alameda del Obispo, Apdo 3092, E-14080, Córdoba, Spain
| | - Eva Madrid
- Institute for Sustainable Agriculture, CSIC, Apdo 4084, E-14080, Córdoba, Spain
| |
Collapse
|
28
|
Krasnov GS, Dmitriev AA, Kudryavtseva AV, Shargunov AV, Karpov DS, Uroshlev LA, Melnikova NV, Blinov VM, Poverennaya EV, Archakov AI, Lisitsa AV, Ponomarenko EA. PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics. J Proteome Res 2015; 14:3729-37. [DOI: 10.1021/acs.jproteome.5b00490] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- George Sergeevich Krasnov
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | | | - Anna Viktorovna Kudryavtseva
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Herzen
Moscow Cancer Research Institute, Ministry of Healthcare of the Russian Federation, Moscow, 125284 Russia
| | - Alexander Valerievich Shargunov
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | - Dmitry Sergeevich Karpov
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
| | | | | | - Vladimir Mikhailovich Blinov
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | | | | | - Andrey Valerievich Lisitsa
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
| | | |
Collapse
|
29
|
Zhou K, Huang B, Zou M, Lu D, He S, Wang G. Genome-wide identification of lineage-specific genes within Caenorhabditis elegans. Genomics 2015; 106:242-8. [PMID: 26188256 DOI: 10.1016/j.ygeno.2015.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Revised: 07/08/2015] [Accepted: 07/09/2015] [Indexed: 11/19/2022]
Abstract
With the rapid growth of sequencing technology, a number of genomes and transcriptomes of various species have been sequenced, contributing to the study of lineage-specific genes (LSGs). We identified two sets of LSGs using BLAST: one included Caenorhabditis elegans species-specific genes (1423, SSGs), and the other consisted of Caenorhabditis genus-specific genes (4539, GSGs). The subsequent characterization and analysis of the SSGs and GSGs showed that they have significant differences in evolution and that most LSGs were generated by gene duplication and integration of transposable elements (TEs). We then performed temporal expression profiling and protein function prediction and observed that many SSGs and GSGs are expressed and that genes involved with sex determination, specific stress, immune response, and morphogenesis are over-represented, suggesting that these specific genes may be related to the Caenorhabditis nematodes' special ability to survive in severe and extreme environments.
Collapse
Affiliation(s)
- Kun Zhou
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China.
| | - Beibei Huang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China.
| | - Ming Zou
- Huazhong Agriculture University, Wuhan 430070, China.
| | - Dandan Lu
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China.
| | - Shunping He
- The Key Laboratory of Aquatic Biodiversity and Conservation of the Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
| | - Guoxiu Wang
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, China.
| |
Collapse
|
30
|
Mukherjee S, Panda A, Ghosh TC. Elucidating evolutionary features and functional implications of orphan genes in Leishmania major. INFECTION GENETICS AND EVOLUTION 2015; 32:330-7. [PMID: 25843649 DOI: 10.1016/j.meegid.2015.03.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Revised: 03/25/2015] [Accepted: 03/26/2015] [Indexed: 11/28/2022]
Abstract
Orphan genes are protein coding genes that lack recognizable homologs in other organisms. These genes were reported to comprise a considerable fraction of coding regions in all sequenced genomes and thought to be allied with organism's lineage-specific traits. However, their evolutionary persistence and functional significance still remain elusive. Due to lack of homologs with the host genome and for their probable lineage-specific functional roles, orphan gene product of pathogenic protozoan might be considered as the possible therapeutic targets. Leishmania major is an important parasitic protozoan of the genus Leishmania that is associated with the disease cutaneous leishmaniasis. Therefore, evolutionary and functional characterization of orphan genes in this organism may help in understanding the factors prevailing pathogen evolution and parasitic adaptation. In this study, we systematically identified orphan genes of L. major and employed several in silico analyses for understanding their evolutionary and functional attributes. To trace the signatures of molecular evolution, we compared their evolutionary rate with non-orphan genes. In agreement with prior observations, here we noticed that orphan genes evolve at a higher rate as compared to non-orphan genes. Lower sequence conservation of orphan genes was previously attributed solely due to their younger gene age. However, here we observed that together with gene age, a number of genomic (like expression level, GC content, variation in codon usage) and proteomic factors (like protein length, intrinsic disorder content, hydropathicity) could independently modulate their evolutionary rate. We considered the interplay of all these factors and analyzed their relative contribution on protein evolutionary rate by regression analysis. On the functional level, we observed that orphan genes are associated with regulatory, growth factor and transport related processes. Moreover, these genes were found to be enriched with various types of interaction and trafficking motifs, implying their possible involvement in host-parasite interactions. Thus, our comprehensive analysis of L. major orphan genes provided evidence for their extensive roles in host-pathogen interactions and virulence.
Collapse
Affiliation(s)
- Sumit Mukherjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India; Department of Physical Sciences, Indian Institute of Science Education and Research-Kolkata, Mohanpur 741246, Nadia, West Bengal, India
| | - Arup Panda
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India
| | - Tapash Chandra Ghosh
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India.
| |
Collapse
|
31
|
Arendsee ZW, Li L, Wurtele ES. Coming of age: orphan genes in plants. TRENDS IN PLANT SCIENCE 2014; 19:698-708. [PMID: 25151064 DOI: 10.1016/j.tplants.2014.07.003] [Citation(s) in RCA: 94] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Revised: 06/27/2014] [Accepted: 07/17/2014] [Indexed: 05/19/2023]
Abstract
Sizable minorities of protein-coding genes from every sequenced eukaryotic and prokaryotic genome are unique to the species. These so-called ‘orphan genes’ may evolve de novo from non-coding sequence or be derived from older coding material. They are often associated with environmental stress responses and species-specific traits or regulatory patterns. However, difficulties in studying genes where comparative analysis is impossible, and a bias towards broadly conserved genes, have resulted in underappreciation of their importance. We review here the identification, possible origins, evolutionary trends, and functions of orphans with an emphasis on their role in plant biology. We exemplify several evolutionary trends with an analysis of Arabidopsis thaliana and present QQS as a model orphan gene.
Collapse
|
32
|
Sharma N, Jung CH, Bhalla PL, Singh MB. RNA sequencing analysis of the gametophyte transcriptome from the liverwort, Marchantia polymorpha. PLoS One 2014; 9:e97497. [PMID: 24841988 PMCID: PMC4026138 DOI: 10.1371/journal.pone.0097497] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 04/17/2014] [Indexed: 01/08/2023] Open
Abstract
The liverwort Marchantia polymorpha is a member of the most basal lineage of land plants (embryophytes) and likely retains many ancestral morphological, physiological and molecular characteristics. Despite its phylogenetic importance and the availability of previous EST studies, M. polymorpha's lack of economic importance limits accessible genomic resources for this species. We employed Illumina RNA-Seq technology to sequence the gametophyte transcriptome of M. polymorpha. cDNA libraries from 6 different male and female developmental tissues were sequenced to delineate a global view of the M. polymorpha transcriptome. Approximately 80 million short reads were obtained and assembled into a non-redundant set of 46,533 transcripts (> = 200 bp) from 46,070 loci. The average length and the N50 length of the transcripts were 757 bp and 471 bp, respectively. Sequence comparison of assembled transcripts with non-redundant proteins from embryophytes resulted in the annotation of 43% of the transcripts. The transcripts were also compared with M. polymorpha expressed sequence tags (ESTs), and approximately 69.5% of the transcripts appeared to be novel. Twenty-one percent of the transcripts were assigned GO terms to improve annotation. In addition, 6,112 simple sequence repeats (SSRs) were identified as potential molecular markers, which may be useful in studies of genetic diversity. A comparative genomics approach revealed that a substantial proportion of the genes (35.5%) expressed in M. polymorpha were conserved across phylogenetically related species, such as Selaginella and Physcomitrella, and identified 580 genes that are potentially unique to liverworts. Our study presents an extensive amount of novel sequence information for M. polymorpha. This information will serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the isolation and characterization of functional genes that are involved in sex differentiation and sexual reproduction in this liverwort.
Collapse
Affiliation(s)
- Niharika Sharma
- Plant Molecular Biology and Biotechnology Laboratory, Melbourne School of Land and Environment, The University of Melbourne, Parkville, Victoria, Australia
| | - Chol-Hee Jung
- Victorian Life Sciences Computation Initiative, The University of Melbourne, Carlton, Victoria, Australia
| | - Prem L. Bhalla
- Plant Molecular Biology and Biotechnology Laboratory, Melbourne School of Land and Environment, The University of Melbourne, Parkville, Victoria, Australia
| | - Mohan B. Singh
- Plant Molecular Biology and Biotechnology Laboratory, Melbourne School of Land and Environment, The University of Melbourne, Parkville, Victoria, Australia
- * E-mail:
| |
Collapse
|
33
|
Istvánek J, Jaros M, Krenek A, Řepková J. Genome assembly and annotation for red clover (Trifolium pratense; Fabaceae). AMERICAN JOURNAL OF BOTANY 2014; 101:327-37. [PMID: 24500806 DOI: 10.3732/ajb.1300340] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
PREMISE OF THE STUDY Red clover (Trifolium pratense) is an important forage plant from the legume family with great importance in agronomy and livestock nourishment. Nevertheless, assembling its medium-sized genome presents a challenge, given current hardware and software possibilities. Next-generation sequencing technologies enable us to generate large amounts of sequence data at low cost. In this study, the genome assembly and red clover genome features are presented. METHODS First, assembly software was assessed using data sets from a closely related species to find the best possible combination of assembler plus error correction program to assemble the red clover genome. The newly sequenced genome was characterized by repetitive content, number of protein-coding and nonprotein-coding genes, and gene families and functions. Genome features were also compared with those of other sequenced plant species. KEY RESULTS Abyss with Echo correction was used for de novo assembly of the red clover genome. The presented assembly comprises ∼314.6 Mbp. In contrast to leguminous species with comparable genome sizes, the genome of T. pratense contains a larger repetitive portion and more abundant retrotransposons and DNA transposons. Overall, 47 398 protein-coding genes were annotated from 64 761 predicted genes. Comparative analysis revealed several gene families that are characteristic for T. pratense. Resistance genes, leghemoglobins, and nodule-specific cystein-rich peptides were identified and compared with other sequenced species. CONCLUSIONS The presented red clover genomic data constitute a resource for improvement through molecular breeding and for comparison to other sequenced plant species.
Collapse
Affiliation(s)
- Jan Istvánek
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | | | | | | |
Collapse
|
34
|
Campbell MS, Law M, Holt C, Stein JC, Moghe GD, Hufnagel DE, Lei J, Achawanantakun R, Jiao D, Lawrence CJ, Ware D, Shiu SH, Childs KL, Sun Y, Jiang N, Yandell M. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. PLANT PHYSIOLOGY 2014; 164:513-24. [PMID: 24306534 PMCID: PMC3912085 DOI: 10.1104/pp.113.230144] [Citation(s) in RCA: 286] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Accepted: 11/26/2013] [Indexed: 05/18/2023]
Abstract
We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds.
Collapse
|
35
|
Ferguson AA, Zhao D, Jiang N. Selective acquisition and retention of genomic sequences by Pack-Mutator-like elements based on guanine-cytosine content and the breadth of expression. PLANT PHYSIOLOGY 2013; 163:1419-32. [PMID: 24028844 PMCID: PMC3813661 DOI: 10.1104/pp.113.223271] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
The process of gene duplication followed by sequence and functional divergence is important for the generation of new genes. Pack-MULEs, nonautonomous Mutator-like elements (MULEs) that carry genic sequence(s), are potentially involved in generating new open reading frames and regulating parental gene expression. These elements are identified in many plant genomes and are most abundant in rice (Oryza sativa). Despite the abundance of Pack-MULEs, the mechanism by which parental genes are captured by Pack-MULEs remains largely unknown. In this study, we identified all MULEs in rice and examined factors likely important for sequence acquisition. Terminal inverted repeat MULEs are the predominant MULE type and account for the majority of the Pack-MULEs. In addition to genic sequences, rice MULEs capture guanine-cytosine (GC)-rich intergenic sequences, albeit at a much lower frequency. MULEs carrying nontransposon sequences have longer terminal inverted repeats and higher GC content in terminal and subterminal regions. An overrepresentation of genes with known functions and genes with orthologs among parental genes of Pack-MULEs is observed in rice, maize (Zea mays), and Arabidopsis (Arabidopsis thaliana), suggesting preferential acquisition for bona fide genes by these elements. Pack-MULEs selectively acquire/retain parental sequences through a combined effect of GC content and breadth of expression, with GC content playing a stronger role. Increased GC content and number of tissues with detectable expression result in higher chances of a gene being acquired by Pack-MULEs. Such selective acquisition/retention provides these elements greater chances of carrying functional sequences that may provide new genetic resources for the evolution of new genes or the modification of existing genes.
Collapse
|
36
|
Zalewski CS, Floyd SK, Furumizu C, Sakakibara K, Stevenson DW, Bowman JL. Evolution of the class IV HD-zip gene family in streptophytes. Mol Biol Evol 2013; 30:2347-65. [PMID: 23894141 PMCID: PMC3773374 DOI: 10.1093/molbev/mst132] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Class IV homeodomain leucine zipper (C4HDZ) genes are plant-specific transcription factors that, based on phenotypes in Arabidopsis thaliana, play an important role in epidermal development. In this study, we sampled all major extant lineages and their closest algal relatives for C4HDZ homologs and phylogenetic analyses result in a gene tree that mirrors land plant evolution with evidence for gene duplications in many lineages, but minimal evidence for gene losses. Our analysis suggests an ancestral C4HDZ gene originated in an algal ancestor of land plants and a single ancestral gene was present in the last common ancestor of land plants. Independent gene duplications are evident within several lineages including mosses, lycophytes, euphyllophytes, seed plants, and, most notably, angiosperms. In recently evolved angiosperm paralogs, we find evidence of pseudogenization via mutations in both coding and regulatory sequences. The increasing complexity of the C4HDZ gene family through the diversification of land plants correlates to increasing complexity in epidermal characters.
Collapse
Affiliation(s)
| | - Sandra K. Floyd
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
| | - Chihiro Furumizu
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
| | - Keiko Sakakibara
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
- Graduate School of Science, University of Tokyo, Hongo, Tokyo, Japan
| | | | - John L. Bowman
- Section of Plant Biology, University of California, Davis
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
| |
Collapse
|
37
|
Divergent evolutionary and expression patterns between lineage specific new duplicate genes and their parental paralogs in Arabidopsis thaliana. PLoS One 2013; 8:e72362. [PMID: 24009676 PMCID: PMC3756979 DOI: 10.1371/journal.pone.0072362] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 07/11/2013] [Indexed: 12/14/2022] Open
Abstract
Gene duplication is an important mechanism for the origination of functional novelties in organisms. We performed a comparative genome analysis to systematically estimate recent lineage specific gene duplication events in Arabidopsis thaliana and further investigate whether and how these new duplicate genes (NDGs) play a functional role in the evolution and adaption of A. thaliana. We accomplished this using syntenic relationship among four closely related species, A. thaliana, A. lyrata, Capsella rubella and Brassica rapa. We identified 100 NDGs, showing clear origination patterns, whose parental genes are located in syntenic regions and/or have clear orthologs in at least one of three outgroup species. All 100 NDGs were transcribed and under functional constraints, while 24% of the NDGs have differential expression patterns compared to their parental genes. We explored the underlying evolutionary forces of these paralogous pairs through conducting neutrality tests with sequence divergence and polymorphism data. Evolution of about 15% of NDGs appeared to be driven by natural selection. Moreover, we found that 3 NDGs not only altered their expression patterns when compared with parental genes, but also evolved under positive selection. We investigated the underlying mechanisms driving the differential expression of NDGs and their parents, and found a number of NDGs had different cis-elements and methylation patterns from their parental genes. Overall, we demonstrated that NDGs acquired divergent cis-elements and methylation patterns and may experience sub-functionalization or neo-functionalization influencing the evolution and adaption of A. thaliana.
Collapse
|
38
|
Singh VK, Garg R, Jain M. A global view of transcriptome dynamics during flower development in chickpea by deep sequencing. PLANT BIOTECHNOLOGY JOURNAL 2013; 11:691-701. [PMID: 23551980 DOI: 10.1111/pbi.12059] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2012] [Revised: 01/22/2013] [Accepted: 01/29/2013] [Indexed: 05/06/2023]
Abstract
Measurement of gene expression can provide important clues about gene function and molecular basis of developmental processes. Here, we have analysed the chickpea transcriptome in vegetative and flower tissues by exploiting the potential of high-throughput sequencing to measure gene expression. We mapped more than 295 million reads to quantify the transcript abundance during flower development. We detected the expression of more than 90% genes in at least one tissue analysed. We found quite a large number of genes were differentially expressed during flower development as compared to vegetative tissues. Further, we identified several genes expressed in a stage-specific manner. Various transcription factor families and metabolic pathways involved in flower development were elucidated. The members of MADS-box family were most represented among the transcription factor genes up-regulated during various stages of flower development. The abundant expression of several well-known genes implicated in flower development in chickpea flower development stages confirmed our results. In addition, we detected the expression specificities of lineage-specific genes during flower development. The expression data presented in this study is the most comprehensive dataset available for chickpea as of now and will serve as resource for unraveling the functions of many specific genes involved in flower development in chickpea and other legumes.
Collapse
Affiliation(s)
- Vikash K Singh
- National Institute of Plant Genome Research-NIPGR, Aruna Asaf Ali Marg, New Delhi 110067, India
| | | | | |
Collapse
|
39
|
Jain M, Misra G, Patel RK, Priya P, Jhanwar S, Khan AW, Shah N, Singh VK, Garg R, Jeena G, Yadav M, Kant C, Sharma P, Yadav G, Bhatia S, Tyagi AK, Chattopadhyay D. A draft genome sequence of the pulse crop chickpea (Cicer arietinum L.). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 74:715-29. [PMID: 23489434 DOI: 10.1111/tpj.12173] [Citation(s) in RCA: 231] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Revised: 02/27/2013] [Accepted: 03/04/2013] [Indexed: 05/17/2023]
Abstract
Cicer arietinum L. (chickpea) is the third most important food legume crop. We have generated the draft sequence of a desi-type chickpea genome using next-generation sequencing platforms, bacterial artificial chromosome end sequences and a genetic map. The 520-Mb assembly covers 70% of the predicted 740-Mb genome length, and more than 80% of the gene space. Genome analysis predicts the presence of 27,571 genes and 210 Mb as repeat elements. The gene expression analysis performed using 274 million RNA-Seq reads identified several tissue-specific and stress-responsive genes. Although segmental duplicated blocks are observed, the chickpea genome does not exhibit any indication of recent whole-genome duplication. Nucleotide diversity analysis provides an assessment of a narrow genetic base within the chickpea cultivars. We have developed a resource for genetic markers by comparing the genome sequences of one wild and three cultivated chickpea genotypes. The draft genome sequence is expected to facilitate genetic enhancement and breeding to develop improved chickpea varieties.
Collapse
Affiliation(s)
- Mukesh Jain
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Yang L, Zou M, Fu B, He S. Genome-wide identification, characterization, and expression analysis of lineage-specific genes within zebrafish. BMC Genomics 2013; 14:65. [PMID: 23368736 PMCID: PMC3599513 DOI: 10.1186/1471-2164-14-65] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2012] [Accepted: 01/29/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The genomic basis of teleost phenotypic complexity remains obscure, despite increasing availability of genome and transcriptome sequence data. Fish-specific genome duplication cannot provide sufficient explanation for the morphological complexity of teleosts, considering the relatively large number of extinct basal ray-finned fishes. RESULTS In this study, we performed comparative genomic analysis to discover the Conserved Teleost-Specific Genes (CTSGs) and orphan genes within zebrafish and found that these two sets of lineage-specific genes may have played important roles during zebrafish embryogenesis. Lineage-specific genes within zebrafish share many of the characteristics of their counterparts in other species: shorter length, fewer exon numbers, higher GC content, and fewer of them have transcript support. Chromosomal location analysis indicated that neither the CTSGs nor the orphan genes were distributed evenly in the chromosomes of zebrafish. The significant enrichment of immunity proteins in CTSGs annotated by gene ontology (GO) or predicted ab initio may imply that defense against pathogens may be an important reason for the diversification of teleosts. The evolutionary origin of the lineage-specific genes was determined and a very high percentage of lineage-specific genes were generated via gene duplications. The temporal and spatial expression profile of lineage-specific genes obtained by expressed sequence tags (EST) and RNA-seq data revealed two novel properties: in addition to being highly tissue-preferred expression, lineage-specific genes are also highly temporally restricted, namely they are expressed in narrower time windows than evolutionarily conserved genes and are specifically enriched in later-stage embryos and early larval stages. CONCLUSIONS Our study provides the first systematic identification of two different sets of lineage-specific genes within zebrafish and provides valuable information leading towards a better understanding of the molecular mechanisms of the genomic basis of teleost phenotypic complexity for future studies.
Collapse
Affiliation(s)
- Liandong Yang
- The Key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, Hubei 430072, People's Republic of China
| | | | | | | |
Collapse
|
41
|
Oh DH, Dassanayake M, Bohnert HJ, Cheeseman JM. Life at the extreme: lessons from the genome. Genome Biol 2012; 13:241. [PMID: 22390828 DOI: 10.1186/gb-2012-13-3-241] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Extremophile plants thrive in places where most plant species cannot survive. Recent developments in high-throughput technologies and comparative genomics are shedding light on the evolutionary mechanisms leading to their adaptation.
Collapse
Affiliation(s)
- Dong-Ha Oh
- Department of Plant Biology, University of Illinois at Urbana-Champaign, 61801, USA
| | | | | | | |
Collapse
|
42
|
Sasidharan R, Nepusz T, Swarbreck D, Huala E, Paccanaro A. GFam: a platform for automatic annotation of gene families. Nucleic Acids Res 2012; 40:e152. [PMID: 22790981 PMCID: PMC3479161 DOI: 10.1093/nar/gks631] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families, derive meaningful functional labels and offers a seamless approach to propagate functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources followed by a sequence-based connected component analysis of un-annotated sequence regions to derive consensus domain architecture for each sequence and subsequently generate families based on common architectures. Our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points higher than the coverage relative to the best single-constituent database within InterPro for the proteome of Arabidopsis. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam’s capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is a general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/.
Collapse
Affiliation(s)
- Rajkumar Sasidharan
- Department of Molecular, Cell and Developmental Biology, University of California at Los Angeles, Los Angeles, CA 90095, USA.
| | | | | | | | | |
Collapse
|
43
|
Rutter MT, Cross KV, Van Woert PA. Birth, death and subfunctionalization in the Arabidopsis genome. TRENDS IN PLANT SCIENCE 2012; 17:204-12. [PMID: 22326563 DOI: 10.1016/j.tplants.2012.01.006] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 01/12/2012] [Accepted: 01/16/2012] [Indexed: 05/08/2023]
Abstract
Arabidopsis thaliana is now a model system, not just for plant biology but also for comparative genomics. The completion of the sequences of two closely related species, Arabidopsis lyrata and Brassica rapa, is complemented by genomic comparisons among A. thaliana accessions and mutation accumulation lines. Together these genomic data document the birth of new genes via gene duplication, transposon exaptation and de novo formation of new genes from noncoding sequence. Most novel loci exhibit low expression, and are undergoing pseudogenization or subfunctionalization. Comparatively, A. thaliana has lost large amounts of sequence through deletion, particularly of transposable elements. Intraspecific genomic variation indicates high rates of deletion mutations and deletion polymorphisms across accessions, shedding light on the history of Arabidopsis genome architecture.
Collapse
Affiliation(s)
- Matthew T Rutter
- Department of Biology, College of Charleston, Charleston, SC 29401, USA.
| | | | | |
Collapse
|
44
|
Oh DH, Dassanayake M, Bohnert HJ, Cheeseman JM. Life at the extreme: lessons from the genome. Genome Biol 2012. [DOI: 10.1186/gb4003] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
|
45
|
Garg R, Jain M. Pyrosequencing data reveals tissue-specific expression of lineage-specific transcripts in chickpea. PLANT SIGNALING & BEHAVIOR 2011; 6:1868-70. [PMID: 22057340 PMCID: PMC3343734 DOI: 10.4161/psb.6.11.17879] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Chickpea is a very important crop legume plant, which provides a protein-rich supplement to cereal-based diets and has the ability to fix atmospheric nitrogen. Despite its economic importance, the functional genomic resources for chickpea are very limited. Recently, we reported the complete transcriptome of chickpea using next generation sequencing technologies. We analyzed the tissue-specific expression of chickpea transcripts based on RNA-seq data. In addition, we identified two sets of lineage-specific genes, including a total of 3,632 chickpea-specific and 741 as legume-specific transcripts based on sequence comparison with other species within plant kingdom. The study of lineage-specific genes provides insights into the species-/lineage-specific functions and evolutionary processes. In this study, we further analyze the expression profiles of legume- and chickpea-specific transcripts in various tissue samples. Several legume- and chickpea-specific transcripts showed preferential and/or specific expression in the tissue samples analyzed. Our analysis provides evidence for the role of legume- and chickpea-specific transcripts in various tissues and opens an important area of future research to elucidate the exact role of these genes.
Collapse
Affiliation(s)
- Rohini Garg
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, India
| | | |
Collapse
|
46
|
Garg R, Patel RK, Jhanwar S, Priya P, Bhattacharjee A, Yadav G, Bhatia S, Chattopadhyay D, Tyagi AK, Jain M. Gene discovery and tissue-specific transcriptome analysis in chickpea with massively parallel pyrosequencing and web resource development. PLANT PHYSIOLOGY 2011; 156:1661-78. [PMID: 21653784 PMCID: PMC3149962 DOI: 10.1104/pp.111.178616] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 06/07/2011] [Indexed: 05/17/2023]
Abstract
Chickpea (Cicer arietinum) is an important food legume crop but lags in the availability of genomic resources. In this study, we have generated about 2 million high-quality sequences of average length of 372 bp using pyrosequencing technology. The optimization of de novo assembly clearly indicated that hybrid assembly of long-read and short-read primary assemblies gave better results. The hybrid assembly generated a set of 34,760 transcripts with an average length of 1,020 bp representing about 4.8% (35.5 Mb) of the total chickpea genome. We identified more than 4,000 simple sequence repeats, which can be developed as functional molecular markers in chickpea. Putative function and Gene Ontology terms were assigned to at least 73.2% and 71.0% of chickpea transcripts, respectively. We have also identified several chickpea transcripts that showed tissue-specific expression and validated the results using real-time polymerase chain reaction analysis. Based on sequence comparison with other species within the plant kingdom, we identified two sets of lineage-specific genes, including those conserved in the Fabaceae family (legume specific) and those lacking significant similarity with any non chickpea species (chickpea specific). Finally, we have developed a Web resource, Chickpea Transcriptome Database, which provides public access to the data and results reported in this study. The strategy for optimization of de novo assembly presented here may further facilitate the transcriptome sequencing and characterization in other organisms. Most importantly, the data and results reported in this study will help to accelerate research in various areas of genomics and implementing breeding programs in chickpea.
Collapse
|
47
|
Carvallo MA, Pino MT, Jeknić Z, Zou C, Doherty CJ, Shiu SH, Chen THH, Thomashow MF. A comparison of the low temperature transcriptomes and CBF regulons of three plant species that differ in freezing tolerance: Solanum commersonii, Solanum tuberosum, and Arabidopsis thaliana. JOURNAL OF EXPERIMENTAL BOTANY 2011; 62:3807-19. [PMID: 21511909 PMCID: PMC3134341 DOI: 10.1093/jxb/err066] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2010] [Revised: 02/03/2011] [Accepted: 02/11/2011] [Indexed: 05/18/2023]
Abstract
Solanum commersonii and Solanum tuberosum are closely related plant species that differ in their abilities to cold acclimate; whereas S. commersonii increases in freezing tolerance in response to low temperature, S. tuberosum does not. In Arabidopsis thaliana, cold-regulated genes have been shown to contribute to freezing tolerance, including those that comprise the CBF regulon, genes that are controlled by the CBF transcription factors. The low temperature transcriptomes and CBF regulons of S. commersonii and S. tuberosum were therefore compared to determine whether there might be differences that contribute to their differences in ability to cold acclimate. The results indicated that both plants alter gene expression in response to low temperature to similar degrees with similar kinetics and that both plants have CBF regulons composed of hundreds of genes. However, there were considerable differences in the sets of genes that comprised the low temperature transcriptomes and CBF regulons of the two species. Thus differences in cold regulatory programmes may contribute to the differences in freezing tolerance of these two species. However, 53 groups of putative orthologous genes that are cold-regulated in S. commersonii, S. tuberosum, and A. thaliana were identified. Given that the evolutionary distance between the two Solanum species and A. thaliana is 112-156 million years, it seems likely that these conserved cold-regulated genes-many of which encode transcription factors and proteins of unknown function-have fundamental roles in plant growth and development at low temperature.
Collapse
Affiliation(s)
- Marcela A. Carvallo
- MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA
| | - María-Teresa Pino
- Department of Horticulture, Oregon State University, Corvallis, OR 97331, USA
| | - Zoran Jeknić
- Department of Horticulture, Oregon State University, Corvallis, OR 97331, USA
| | - Cheng Zou
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Colleen J. Doherty
- MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA
| | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Tony H. H. Chen
- Department of Horticulture, Oregon State University, Corvallis, OR 97331, USA
| | - Michael F. Thomashow
- MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA
- Department of Crop and Soil Sciences, Michigan State University, East Lansing, MI 48824, USA
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
48
|
Johnson BR, Tsutsui ND. Taxonomically restricted genes are associated with the evolution of sociality in the honey bee. BMC Genomics 2011; 12:164. [PMID: 21447185 PMCID: PMC3072959 DOI: 10.1186/1471-2164-12-164] [Citation(s) in RCA: 92] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2010] [Accepted: 03/29/2011] [Indexed: 11/29/2022] Open
Abstract
Background Studies have shown that taxonomically restricted genes are significant in number and important for the evolution of lineage specific traits. Social insects have gained many novel morphological and behavioral traits relative to their solitary ancestors. The task repertoire of an advanced social insect, for example, can be 40-50 tasks, about twice that of a solitary wasp or bee. The genetic basis of this expansion in behavioral repertoire is still poorly understood, and a role for taxonomically restricted genes has not been explored at the whole genome level. Results Here we present comparative genomics results suggesting that taxonomically restricted genes may have played an important role in generating the expansion of behavioral repertoire associated with the evolution of eusociality. First, we show that the current honey bee official gene set contains about 700 taxonomically restricted genes. These are split between orphans, genes found only in the Hymenoptera, and genes found only in insects. Few of the orphans or genes restricted to the Hymenoptera have been the focus of experimental work, but several of those that have are associated with novel eusocial traits or traits thought to have changed radically as a consequence of eusociality. Second, we predicted that if taxonomically restricted genes are important for generating novel eusocial traits, then they should be expressed with greater frequency in workers relative to the queen, as the workers exhibit most of the novel behavior of the honey bee relative to their solitary ancestors. We found support for this prediction. Twice as many taxonomically restricted genes were found amongst the genes with higher expression in workers compared to those with higher expression in queens. Finally, we compiled an extensive list of candidate taxonomically restricted genes involved in eusocial evolution by analyzing several caste specific gene expression data sets. Conclusions This work identifies a large number of candidate taxonomically restricted genes that may have played a role in eusocial evolution. This work thus lays the foundation for future functional genomics work on the evolution of novelty in the context of social behavior. We also present preliminary evidence, based on biased patterns of gene expression, that taxonomically restricted genes may have played a role in the evolution of caste systems, a characteristic lineage specific social trait.
Collapse
Affiliation(s)
- Brian R Johnson
- Department of Environmental Science, Policy & Management University of California, Berkeley 137 Mulford Hall, MC3114 Berkeley, CA 94720-3114 USA.
| | | |
Collapse
|
49
|
Donoghue MT, Keshavaiah C, Swamidatta SH, Spillane C. Evolutionary origins of Brassicaceae specific genes in Arabidopsis thaliana. BMC Evol Biol 2011; 11:47. [PMID: 21332978 PMCID: PMC3049755 DOI: 10.1186/1471-2148-11-47] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2010] [Accepted: 02/18/2011] [Indexed: 11/21/2022] Open
Abstract
Background All sequenced genomes contain a proportion of lineage-specific genes, which exhibit no sequence similarity to any genes outside the lineage. Despite their prevalence, the origins and functions of most lineage-specific genes remain largely unknown. As more genomes are sequenced opportunities for understanding evolutionary origins and functions of lineage-specific genes are increasing. Results This study provides a comprehensive analysis of the origins of lineage-specific genes (LSGs) in Arabidopsis thaliana that are restricted to the Brassicaceae family. In this study, lineage-specific genes within the nuclear (1761 genes) and mitochondrial (28 genes) genomes are identified. The evolutionary origins of two thirds of the lineage-specific genes within the Arabidopsis thaliana genome are also identified. Almost a quarter of lineage-specific genes originate from non-lineage-specific paralogs, while the origins of ~10% of lineage-specific genes are partly derived from DNA exapted from transposable elements (twice the proportion observed for non-lineage-specific genes). Lineage-specific genes are also enriched in genes that have overlapping CDS, which is consistent with such novel genes arising from overprinting. Over half of the subset of the 958 lineage-specific genes found only in Arabidopsis thaliana have alignments to intergenic regions in Arabidopsis lyrata, consistent with either de novo origination or differential gene loss and retention, with both evolutionary scenarios explaining the lineage-specific status of these genes. A smaller number of lineage-specific genes with an incomplete open reading frame across different Arabidopsis thaliana accessions are further identified as accession-specific genes, most likely of recent origin in Arabidopsis thaliana. Putative de novo origination for two of the Arabidopsis thaliana-only genes is identified via additional sequencing across accessions of Arabidopsis thaliana and closely related sister species lineages. We demonstrate that lineage-specific genes have high tissue specificity and low expression levels across multiple tissues and developmental stages. Finally, stress responsiveness is identified as a distinct feature of Brassicaceae-specific genes; where these LSGs are enriched for genes responsive to a wide range of abiotic stresses. Conclusion Improving our understanding of the origins of lineage-specific genes is key to gaining insights regarding how novel genes can arise and acquire functionality in different lineages. This study comprehensively identifies all of the Brassicaceae-specific genes in Arabidopsis thaliana and identifies how the majority of such lineage-specific genes have arisen. The analysis allows the relative importance (and prevalence) of different evolutionary routes to the genesis of novel ORFs within lineages to be assessed. Insights regarding the functional roles of lineage-specific genes are further advanced through identification of enrichment for stress responsiveness in lineage-specific genes, highlighting their likely importance for environmental adaptation strategies.
Collapse
Affiliation(s)
- Mark Ta Donoghue
- Department of Biochemistry, University College Cork, Cork, Ireland
| | | | | | | |
Collapse
|
50
|
Abstract
An internal time-keeping mechanism has been observed in almost every organism studied from archaea to humans. This circadian clock provides a competitive advantage in fitness and survival ( 18, 30, 95, 129, 137 ). Researchers have uncovered the molecular composition of this internal clock by combining enzymology, molecular biology, genetics, and modeling approaches. However, understanding the mechanistic link between the clock and output responses has been elusive. In three model organisms, Arabidopsis thaliana, Drosophila melanogaster, and Mus musculus, whole-genome expression arrays have enabled researchers to investigate how maintaining a time-keeping mechanism connects to an adaptive advantage. Here, we review the impacts transcriptomics have had on our understanding of the clock and how this molecular clock connects with system-level circadian responses. We explore the discoveries made possible by high-throughput RNA assays, the network approaches used to investigate these large transcript datasets, and potential future directions.
Collapse
Affiliation(s)
- Colleen J Doherty
- Section of Cell and Developmental Biology, Division of Biological Sciences, University of California, San Diego, La Jolla, California 92093, USA.
| | | |
Collapse
|