1
|
Wilson AE, Liberles DA. Expectations of duplicate gene retention under the gene duplicability hypothesis. BMC Ecol Evol 2023; 23:76. [PMID: 38097959 PMCID: PMC10720195 DOI: 10.1186/s12862-023-02174-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 11/02/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND Gene duplication is an important process in evolution. What causes some genes to be retained after duplication and others to be lost is a process not well understood. The most prevalent theory is the gene duplicability hypothesis, that something about the function and number of interacting partners (number of subunits of protein complex, etc.), determines whether copies have more opportunity to be retained for long evolutionary periods. Some genes are also more susceptible to dosage balance effects following WGD events, making them more likely to be retained for longer periods of time. One would expect these processes that affect the retention of duplicate copies to affect the conditional probability ratio after consecutive whole genome duplication events. The probability that a gene will be retained after a second whole genome duplication event (WGD2), given that it was retained after the first whole genome duplication event (WGD1) versus the probability a gene will be retained after WGD2, given it was lost after WGD1 defines the probability ratio that is calculated. RESULTS Since duplicate gene retention is a time heterogeneous process, the time between the events (t1) and the time since the most recent event (t2) are relevant factors in calculating the expectation for observation in any genome. Here, we use a survival analysis framework to predict the probability ratio for genomes with different values of t1 and t2 under the gene duplicability hypothesis, that some genes are more susceptible to selectable functional shifts, some more susceptible to dosage compensation, and others only drifting. We also predict the probability ratio with different values of t1 and t2 under the mutational opportunity hypothesis, that probability of retention for certain genes changes in subsequent events depending upon how they were previously retained. These models are nested such that the mutational opportunity model encompasses the gene duplicability model with shifting duplicability over time. Here we present a formalization of the gene duplicability and mutational opportunity hypotheses to characterize evolutionary dynamics and explanatory power in a recently developed statistical framework. CONCLUSIONS This work presents expectations of the gene duplicability and mutational opportunity hypotheses over time under different sets of assumptions. This expectation will enable formal testing of processes leading to duplicate gene retention.
Collapse
Affiliation(s)
- Amanda E Wilson
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, 1900 N. 12th Street, Philadelphia, PA, 19122, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, 1900 N. 12th Street, Philadelphia, PA, 19122, USA.
| |
Collapse
|
2
|
Ahmadi H, Sheikh-Assadi M, Fatahi R, Zamani Z, Shokrpour M. Optimizing an efficient ensemble approach for high-quality de novo transcriptome assembly of Thymus daenensis. Sci Rep 2023; 13:12415. [PMID: 37524806 PMCID: PMC10390528 DOI: 10.1038/s41598-023-39620-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 07/27/2023] [Indexed: 08/02/2023] Open
Abstract
Non-erroneous and well-optimized transcriptome assembly is a crucial prerequisite for authentic downstream analyses. Each de novo assembler has its own algorithm-dependent pros and cons to handle the assembly issues and should be specifically tested for each dataset. Here, we examined efficiency of seven state-of-art assemblers on ~ 30 Gb data obtained from mRNA-sequencing of Thymus daenensis. In an ensemble workflow, combining the outputs of different assemblers associated with an additional redundancy-reducing step could generate an optimized outcome in terms of completeness, annotatability, and ORF richness. Based on the normalized scores of 16 benchmarking metrics, EvidentialGene, BinPacker, Trinity, rnaSPAdes, CAP3, IDBA-trans, and Velvet-Oases performed better, respectively. EvidentialGene, as the best assembler, totally produced 316,786 transcripts, of which 235,730 (74%) were predicted to have a unique protein hit (on uniref100), and also half of its transcripts contained an ORF. The total number of unique BLAST hits for EvidentialGene was approximately three times greater than that of the worst assembler (Velvet-Oases). EvidentialGene could even capture 17% and 7% more average BLAST hits than BinPacker and Trinity. Although BinPacker and CAP3 produced longer transcripts, the EvidentialGene showed a higher collinearity between transcript size and ORF length. Compared with the other programs, EvidentialGene yielded a higher number of optimal transcript sets, further full-length transcripts, and lower possible misassemblies. Our finding corroborates that in non-model species, relying on a single assembler may not give an entirely satisfactory result. Therefore, this study proposes an ensemble approach of accompanying EvidentialGene pipelines to acquire a superior assembly for T. daenensis.
Collapse
Affiliation(s)
- Hosein Ahmadi
- Department of Horticulture Science, Faculty of Agriculture and Natural Sciences, University of Tehran, Karaj, Iran
| | - Morteza Sheikh-Assadi
- Department of Horticulture Science, Faculty of Agriculture and Natural Sciences, University of Tehran, Karaj, Iran
| | - Reza Fatahi
- Department of Horticulture Science, Faculty of Agriculture and Natural Sciences, University of Tehran, Karaj, Iran.
| | - Zabihollah Zamani
- Department of Horticulture Science, Faculty of Agriculture and Natural Sciences, University of Tehran, Karaj, Iran
| | - Majid Shokrpour
- Department of Horticulture Science, Faculty of Agriculture and Natural Sciences, University of Tehran, Karaj, Iran
| |
Collapse
|
3
|
Walden N, Schranz ME. Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae. Genome Biol Evol 2023; 15:7059155. [PMID: 36848527 PMCID: PMC10016055 DOI: 10.1093/gbe/evad034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 01/27/2023] [Accepted: 02/17/2023] [Indexed: 03/01/2023] Open
Abstract
Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortholog detection using OrthoFinder with ortholog detection through genomic synteny in a data set of 11 representative diploid Brassicaceae whole-genome sequences spanning the entire phylogenetic space. Then, we evaluated the resulting gene sets regarding gene number, functional annotation, and gene and species tree resolution. Finally, we used the syntenic gene sets for comparative genomics and ancestral genome analysis. The use of synteny resulted in considerably more orthologs and also allowed us to reliably identify paralogs. Surprisingly, we did not detect notable differences between species trees reconstructed from syntenic orthologs when compared with other gene sets, including the Angiosperms353 set and a Brassicaceae-specific target enrichment gene set. However, the synteny data set comprised a multitude of gene functions, strongly suggesting that this method of marker selection for phylogenomics is suitable for studies that value downstream gene function analysis, gene interaction, and network studies. Finally, we present the first ancestral genome reconstruction for the Core Brassicaceae which predating the Brassicaceae lineage diversification ∼25 million years ago.
Collapse
Affiliation(s)
- Nora Walden
- Biosystematics Group, Wageningen University, Wageningen, The Netherlands.,Centre for Organismal Studies, Heidelberg University, Heidelberg, Germany
| | | |
Collapse
|
4
|
Saddhe AA, Potocký M. Comparative phylogenomic and structural analysis of canonical secretory PLA2 and novel PLA2-like family in plants. FRONTIERS IN PLANT SCIENCE 2023; 14:1118670. [PMID: 36909415 PMCID: PMC9995887 DOI: 10.3389/fpls.2023.1118670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 02/14/2023] [Indexed: 06/18/2023]
Abstract
Plant secretory phospholipase A2 (sPLA2) is a family of lipolytic enzymes involved in the sn-2 hydrolysis of phospholipid carboxyester bonds, characterized by the presence of a conserved PA2c domain. PLA2 produces free fatty acids and lysophospholipids, which regulate several physiological functions, including lipid metabolism, plant growth and development, signal transduction, and response to various environmental stresses. In the present work, we have performed a comparative analysis of PA2c domain-containing genes across plants, focusing on gene distribution, phylogenetic analysis, tissue-specific expression, and homology modeling. Our data revealed the widespread occurrence of multiple sPLA2 in most land plants and documented single sPLA2 in multiple algal groups, indicating an ancestral origin of sPLA2. We described a novel PA2c-containing gene family present in all plant lineages and lacking secretory peptide, which we termed PLA2-like. Phylogenetic analysis revealed two independent clades in canonical sPLA2 genes referred to as α and β clades, whereas PLA2-like genes clustered independently as a third clade. Further, we have explored clade-specific gene expressions showing that while all three clades were expressed in vegetative and reproductive tissues, only sPLA2-β and PLA2-like members were expressed in the pollen and pollen tube. To get insight into the conservation of the gene regulatory network of sPLA2 and PLA2-like genes, we have analyzed the occurrence of various cis-acting promoter elements across the plant kingdom. The comparative 3D structure analysis revealed conserved and unique features within the PA2c domain for the three clades. Overall, this study will help to understand the evolutionary significance of the PA2c family and lay the foundation for future sPLA2 and PLA2-like characterization in plants.
Collapse
Affiliation(s)
- Ankush Ashok Saddhe
- Institute of Experimental Botany of the Czech Academy of Sciences, Prague, Czechia
| | - Martin Potocký
- Institute of Experimental Botany of the Czech Academy of Sciences, Prague, Czechia
- Department of Experimental Plant Biology, Faculty of Science, Charles University, Prague, Czechia
| |
Collapse
|
5
|
Comparative genomic analysis of Echinococcus multilocularis with other tapeworms. Biologia (Bratisl) 2022. [DOI: 10.1007/s11756-022-01120-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
6
|
Comparative transcriptomics provides a strategy for phylogenetic analysis and SSR marker development in Chaenomeles. Sci Rep 2021; 11:16441. [PMID: 34385515 PMCID: PMC8361139 DOI: 10.1038/s41598-021-95776-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 07/29/2021] [Indexed: 02/07/2023] Open
Abstract
The genus Chaenomeles has long been considered an important ornamental, herbal and cash crop and is widely cultivated in East Asia. Traditional studies of Chaenomeles mainly focus on evolutionary relationships at the phenotypic level. In this study, we conducted RNA-seq on 10 Chaenomeles germplasms supplemented with one outgroup species, Docynia delavayi (D. delavayi), on the Illumina HiSeq2500 platform. After de novo assemblies, we generated from 40,084 to 49,571 unigenes for each germplasm. After pairwise comparison of the orthologous sequences, 9,659 orthologues within the 11 germplasms were obtained, with 6,154 orthologous genes identified as single-copy genes. The phylogenetic tree was visualized to reveal evolutionary relationships for these 11 germplasms. GO and KEGG analyses were performed for these common single-copy genes to compare their functional similarities and differences. Selective pressure analysis based on 6,154 common single-copy genes revealed that 45 genes were under positive selection. Most of these genes are involved in building the plant disease defence system. A total of 292 genes containing simple sequence repeats (SSRs) were used to develop SSR markers and compare their functions in secondary metabolism pathways. Finally, 10 primers were chosen as SSR marker candidates for Chaenomeles germplasms by comprehensive standards. Our research provides a new methodology and reference for future related research in Chaenomeles and is also useful for improvement, breeding and selection projects in other related species.
Collapse
|
7
|
Kokate PP, Techtmann SM, Werner T. Codon usage bias and dinucleotide preference in 29 Drosophila species. G3 GENES|GENOMES|GENETICS 2021; 11:6291245. [PMID: 34849812 PMCID: PMC8496323 DOI: 10.1093/g3journal/jkab191] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 05/13/2021] [Indexed: 12/30/2022]
Abstract
Abstract
Codon usage bias, where certain codons are used more frequently than their synonymous counterparts, is an interesting phenomenon influenced by three evolutionary forces: mutation, selection, and genetic drift. To better understand how these evolutionary forces affect codon usage bias, an extensive study to detect how codon usage patterns change across species is required. This study investigated 668 single-copy orthologous genes independently in 29 Drosophila species to determine how the codon usage patterns change with phylogenetic distance. We found a strong correlation between phylogenetic distance and codon usage bias and observed striking differences in codon preferences between the two subgenera Drosophila and Sophophora. As compared to the subgenus Sophophora, species of the subgenus Drosophila showed reduced codon usage bias and a reduced preference specifically for codons ending with C, except for codons with G in the second position. We found that codon usage patterns in all species were influenced by the nucleotides in the codon’s 2nd and 3rd positions rather than the biochemical properties of the amino acids encoded. We detected a concordance between preferred codons and preferred dinucleotides (at positions 2 and 3 of codons). Furthermore, we observed an association between speciation, codon preferences, and dinucleotide preferences. Our study provides the foundation to understand how selection acts on dinucleotides to influence codon usage bias.
Collapse
Affiliation(s)
- Prajakta P Kokate
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | - Stephen M Techtmann
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | - Thomas Werner
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| |
Collapse
|
8
|
Thiedig K, Weisshaar B, Stracke R. Functional and evolutionary analysis of the Arabidopsis 4R-MYB protein SNAPc4 as part of the SNAP complex. PLANT PHYSIOLOGY 2021; 185:1002-1020. [PMID: 33693812 PMCID: PMC8133616 DOI: 10.1093/plphys/kiaa067] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 11/15/2020] [Indexed: 06/12/2023]
Abstract
Transcription initiation of the genes coding for small nuclear RNA (snRNA) has been extensively analyzed in humans and fruit fly, but only a single ortholog of a snRNA-activating protein complex (SNAPc) subunit has so far been characterized in plants. The genome of the model plant Arabidopsis thaliana encodes orthologs of all three core SNAPc subunits, including A. thaliana SNAP complex 4 (AtSNAPc4)-a 4R-MYB-type protein with four-and-a-half adjacent MYB repeat units. We report the conserved role of AtSNAPc4 as subunit of a protein complex involved in snRNA gene transcription and present genetic evidence that AtSNAPc4 is an essential gene in gametophyte and zygote development. We present experimental evidence that the three A. thaliana SNAPc subunits assemble into a SNAP complex and demonstrate the binding of AtSNAPc4 to snRNA promoters. In addition, co-localization studies show a link between AtSNAPc4 accumulation and Cajal bodies, known to aggregate at snRNA gene loci in humans. Moreover, we show the strong evolutionary conservation of single-copy 4R-MYB/SNAPc4 genes in a broad range of eukaryotes and present additional shared protein features besides the MYB domain, suggesting a conservation of the snRNA transcription initiation machinery along the course of the eukaryotic evolution.
Collapse
Affiliation(s)
- Katharina Thiedig
- Faculty of Biology, Genetics and Genomics of Plants, Bielefeld University, Sequenz 1, Bielefeld 33615, Germany
| | - Bernd Weisshaar
- Faculty of Biology, Genetics and Genomics of Plants, Bielefeld University, Sequenz 1, Bielefeld 33615, Germany
| | - Ralf Stracke
- Faculty of Biology, Genetics and Genomics of Plants, Bielefeld University, Sequenz 1, Bielefeld 33615, Germany
| |
Collapse
|
9
|
Shanmughanandhan J, Shanmughanandhan D, Ragupathy S, Henry TA, Newmaster SG. Quantification of Actaea racemosa L. (black cohosh) from some of its potential adulterants using qPCR and dPCR methods. Sci Rep 2021; 11:4331. [PMID: 33619286 PMCID: PMC7900226 DOI: 10.1038/s41598-020-80465-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 12/21/2020] [Indexed: 12/04/2022] Open
Abstract
The demand for popular natural health products (NHPs) such as Black Cohosh is increasing considerably, which in turn challenges quality assurance (QA) throughout the supply chain. To detect and quantify the target species present in a given NHP, DNA-based molecular techniques such as Real-time quantitative PCR (qPCR) and digital PCR (dPCR) are standard tools in the food and pathogen testing industries. There is a gap in the literature concerning validated quantitative PCR methods for botanicals that can be utilized for QA and good manufacturing practices. The objective of this study is to develop an efficient quantification method using qPCR and dPCR techniques for the detection and quantification of Actaea racemosa (Black cohosh) NHPs from its potential adulterants. These developed methods are validated for applicability on commercial NHPs. Species-specific hydrolysis probe assays were designed to analyze the black cohosh NHPs using qPCR and dPCR techniques. The results confirmed that the developed qPCR and dPCR methods are highly precise for identifying and quantifying black cohosh NHPs, indicating their potential applicability in future routine industrial and laboratory testing. This enables a single qPCR test to determine not only the presence of a specific botanical, but also the amount when mixed with an adulterant.
Collapse
Affiliation(s)
- Jeevitha Shanmughanandhan
- NHP Research Alliance, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - Dhivya Shanmughanandhan
- NHP Research Alliance, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.
| | - Subramanyam Ragupathy
- NHP Research Alliance, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - Thomas A Henry
- NHP Research Alliance, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - Steven G Newmaster
- NHP Research Alliance, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| |
Collapse
|
10
|
Wen J, Yu Y, Xie DF, Peng C, Liu Q, Zhou SD, He XJ. A transcriptome-based study on the phylogeny and evolution of the taxonomically controversial subfamily Apioideae (Apiaceae). ANNALS OF BOTANY 2020; 125:937-953. [PMID: 32016402 PMCID: PMC7218814 DOI: 10.1093/aob/mcaa011] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 01/28/2020] [Indexed: 05/26/2023]
Abstract
BACKGROUND AND AIMS A long-standing controversy in the subfamily Apioideae concerns relationships among the major lineages, which has prevented a comprehensive study of their fruits and evolutionary history. Here we use single copy genes (SCGs) generated from transcriptome datasets to generate a reliable species tree and explore the evolutionary history of Apioideae. METHODS In total, 3351 SCGs were generated from 27 transcriptome datasets and one genome, and further used for phylogenetic analysis using coalescent-based methods. Fruit morphology and anatomy were studied in combination with the species tree. Eleven SCGs were screened out for dating analysis with two fossils selected for calibration. KEY RESULTS A well-supported species tree was generated with a topology [Chamaesieae, (Bupleureae, (Pleurospermeae, (Physospermopsis Clade, (Group C, (Group A, Group B)))))] that differed from previous trees. Daucinae and Torilidinae were not in the tribe Scandiceae and existed as sister groups to the Acronema Clade. Five branches (I-V) of the species tree showed low quartet support but strong local posterior probabilities. Dating analysis suggested that Apioideae originated around 56.64 Mya (95 % highest posterior density interval, 45.18-73.53 Mya). CONCLUSIONS This study resolves a controversial phylogenetic relationship in Apioideae based on 3351 SCGs and coalescent-based species tree estimation methods. Gene trees that contributed to the species tree may undergoing rapid evolutionary divergence and incomplete lineage sorting. Fruits of Apioideae might have evolved in two directions, anemochorous and hydrochorous, with epizoochorous as a derived mode. Molecular and morphological evidence suggests that Daucinae and Torilidinae should be restored to the tribe level. Our results provide new insights into the morphological evolution of this subfamily, which may contribute to a better understanding of species diversification in Apioideae. Molecular dating analysis suggests that uplift of the Qinghai-Tibetan Plateau (QTP) and climate changes probably drove rapid speciation and diversification of Apioideae in the QTP region.
Collapse
Affiliation(s)
- Jun Wen
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
- Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, P.R. China
| | - Yan Yu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
| | - Deng-Feng Xie
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
| | - Chang Peng
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
| | - Qing Liu
- Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, P.R. China
| | - Song-Dong Zhou
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
| | - Xing-Jin He
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, P.R. China
| |
Collapse
|
11
|
Bortoloto TM, Fuchs-Ferraz MCP, Kettener K, Martins Rubio L, González ER, de Souza ICG, Oda S, Rossini BC, Marino CL. Identification of a molecular marker associated with lignotuber in Eucalyptus ssp. Sci Rep 2020; 10:3608. [PMID: 32107409 PMCID: PMC7046637 DOI: 10.1038/s41598-020-60308-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 12/16/2019] [Indexed: 11/09/2022] Open
Abstract
About 95% of Eucalyptus species present an organ known as a lignotuber, a basal woody swelling that holds a large number of dormant buds in a protected position along with carbohydrates and other nutrients. The importance of this trait in Eucalyptus species relates to its regenerative capacity, particularly in the context of coppicing practices and survival in regions of high abiotic stress, especially fire. In this study, we identified and characterized a genomic region associated with the lignotuber trait in commercially important Eucalyptus species by developing a polymorphic marker that co-segregates with lignotuber presence. The marker was then converted into a SCAR (Sequence Characterized Amplified Region) marker, validated in four other Eucalyptus species and hybrids and analyzed in silico. Our investigation presents a marker (ELig) that is effective in identifying individuals with lignotuber. In silico and Southern blot analyses show that the marker is present in a single copy region and is related to auxilin/cyclin-G associated kinase, containing a DnaJ domain. The ELig marker is an important tool that can be used to manage crosses in Eucalyptus breeding programs and inform studies involving lignotuber development and genetics.
Collapse
Affiliation(s)
- Tânia M Bortoloto
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil
| | - Maria C P Fuchs-Ferraz
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil
| | - Karine Kettener
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil
| | - Lígia Martins Rubio
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil
| | - Esteban R González
- Suzano Papel e Celulose SA, Av. Dr. José Lembo 1010, Itapetininga, SP CEP 18207-780, Brazil
| | - Izabel C G de Souza
- Suzano Papel e Celulose SA, Av. Dr. José Lembo 1010, Itapetininga, SP CEP 18207-780, Brazil
| | - Shinitiro Oda
- Suzano Papel e Celulose SA, Av. Dr. José Lembo 1010, Itapetininga, SP CEP 18207-780, Brazil
| | - Bruno C Rossini
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil.
- Instituto de Biotecnologia (IBTEC), UNESP - Univ Estadual Paulista, Alameda das Tecomarias s/n, Botucatu, SP CEP 18607-440, Brazil.
| | - Celso L Marino
- Departamento de Genética, Instituto de Biociências, UNESP - Univ Estadual Paulista, R. Prof. Dr. Antônio Celso Wagner Zanin s/n, Botucatu, SP CEP 18618-689, Brazil
- Instituto de Biotecnologia (IBTEC), UNESP - Univ Estadual Paulista, Alameda das Tecomarias s/n, Botucatu, SP CEP 18607-440, Brazil
| |
Collapse
|
12
|
Souza GM, Van Sluys MA, Lembke CG, Lee H, Margarido GRA, Hotta CT, Gaiarsa JW, Diniz AL, Oliveira MDM, Ferreira SDS, Nishiyama MY, ten-Caten F, Ragagnin GT, Andrade PDM, de Souza RF, Nicastro GG, Pandya R, Kim C, Guo H, Durham AM, Carneiro MS, Zhang J, Zhang X, Zhang Q, Ming R, Schatz MC, Davidson B, Paterson AH, Heckerman D. Assembly of the 373k gene space of the polyploid sugarcane genome reveals reservoirs of functional diversity in the world's leading biomass crop. Gigascience 2019; 8:giz129. [PMID: 31782791 PMCID: PMC6884061 DOI: 10.1093/gigascience/giz129] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 05/23/2019] [Accepted: 10/08/2019] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Sugarcane cultivars are polyploid interspecific hybrids of giant genomes, typically with 10-13 sets of chromosomes from 2 Saccharum species. The ploidy, hybridity, and size of the genome, estimated to have >10 Gb, pose a challenge for sequencing. RESULTS Here we present a gene space assembly of SP80-3280, including 373,869 putative genes and their potential regulatory regions. The alignment of single-copy genes in diploid grasses to the putative genes indicates that we could resolve 2-6 (up to 15) putative homo(eo)logs that are 99.1% identical within their coding sequences. Dissimilarities increase in their regulatory regions, and gene promoter analysis shows differences in regulatory elements within gene families that are expressed in a species-specific manner. We exemplify these differences for sucrose synthase (SuSy) and phenylalanine ammonia-lyase (PAL), 2 gene families central to carbon partitioning. SP80-3280 has particular regulatory elements involved in sucrose synthesis not found in the ancestor Saccharum spontaneum. PAL regulatory elements are found in co-expressed genes related to fiber synthesis within gene networks defined during plant growth and maturation. Comparison with sorghum reveals predominantly bi-allelic variations in sugarcane, consistent with the formation of 2 "subgenomes" after their divergence ∼3.8-4.6 million years ago and reveals single-nucleotide variants that may underlie their differences. CONCLUSIONS This assembly represents a large step towards a whole-genome assembly of a commercial sugarcane cultivar. It includes a rich diversity of genes and homo(eo)logous resolution for a representative fraction of the gene space, relevant to improve biomass and food production.
Collapse
Affiliation(s)
- Glaucia Mendes Souza
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Marie-Anne Van Sluys
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
| | - Carolina Gimiliani Lembke
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Hayan Lee
- Cold Spring Harbor Laboratory, One Bungtown Road, Koch Building #1119, Cold Spring Harbor, NY11724, United States of America
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CACA94598, United States of America
| | - Gabriel Rodrigues Alves Margarido
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias, 11, Piracicaba, SP 13418-900, Brazil
| | - Carlos Takeshi Hotta
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Jonas Weissmann Gaiarsa
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
| | - Augusto Lima Diniz
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Mauro de Medeiros Oliveira
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Sávio de Siqueira Ferreira
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
| | - Milton Yutaka Nishiyama
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
- Laboratório Especial de Toxinologia Aplicada, Instituto Butantan, Av. Vital Brasil, 1500, São Paulo, SP05503-900, Brazil
| | - Felipe ten-Caten
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Geovani Tolfo Ragagnin
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
| | - Pablo de Morais Andrade
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
| | - Robson Francisco de Souza
- Departamento de Microbiologia, Instituto de Ciências Biomédicas, Universidade de São Paulo, Av.Professor Lineu Prestes, 1734, São Paulo, SP 05508-900, Brazil
| | - Gianlucca Gonçalves Nicastro
- Departamento de Microbiologia, Instituto de Ciências Biomédicas, Universidade de São Paulo, Av.Professor Lineu Prestes, 1734, São Paulo, SP 05508-900, Brazil
| | - Ravi Pandya
- Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America
| | - Changsoo Kim
- Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223,United States of America
- Department of Crop Science, Chungnam National University, 99 Daehak Ro Yuseong Gu, Deajeon,34134, South Korea
| | - Hui Guo
- Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223,United States of America
| | - Alan Mitchell Durham
- Departamento de Ciências da Computação, Instituto de Matemática e Estatística, Universidade de São Paulo, Rua do Matão, 1010, São Paulo, SP 05508-090, Brazil
| | - Monalisa Sampaio Carneiro
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Washington Luis km 235, Araras, SP 13.565-905, Brazil
| | - Jisen Zhang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
| | - Xingtan Zhang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
| | - Qing Zhang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
| | - Ray Ming
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
- Department of Plant Biology, University of Illinois at Urbana-Champaign, 201 W. Gregory Dr. Urbana, Urbana, Illinois 61801, United States of America
| | - Michael C Schatz
- Cold Spring Harbor Laboratory, One Bungtown Road, Koch Building #1119, Cold Spring Harbor, NY11724, United States of America
- Departments of Computer Science and Biology, Johns Hopkins University, 3400 North Charles Street,Baltimore, MD 21218-2608, United States of America
| | - Bob Davidson
- Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223,United States of America
| | - David Heckerman
- Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America
| |
Collapse
|
13
|
Debray K, Marie-Magdelaine J, Ruttink T, Clotault J, Foucher F, Malécot V. Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae). BMC Evol Biol 2019; 19:152. [PMID: 31340752 PMCID: PMC6657147 DOI: 10.1186/s12862-019-1479-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 07/16/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of other loci to decipher taxa relationships. Making the best possible selection of informative sequences regarding the taxa studied has emerged as a new issue. Here, we developed a general procedure to mine genomic data, looking for orthologous single-copy loci capable of deciphering phylogenetic relationships below the generic rank. To develop our strategy, we chose the genus Rosa, a rapid-evolving lineage of the Rosaceae family in which several species genomes have recently been sequenced. We also compared our loci to conventional plastid markers, commonly used for phylogenetic inference in this genus. RESULTS We generated 1856 sequence tags in putative single-copy orthologous nuclear loci. Associated in silico primer pairs can potentially amplify fragments able to resolve a wide range of speciation events within the genus Rosa. Analysis of parsimony-informative site content showed the value of non-coding genomic regions to obtain variable sequences despite the fact that they may be more difficult to target in less related species. Dozens of nuclear loci outperform the conventional plastid phylogenetic markers in terms of phylogenetic informativeness, for both recent and ancient evolutionary divergences. However, conflicting phylogenetic signals were found between nuclear gene tree topologies and the species-tree topology, shedding light on the many patterns of hybridization and/or incomplete lineage sorting that occur in the genus Rosa. CONCLUSIONS With recently published genome sequence data, we developed a set of single-copy orthologous nuclear loci to resolve species-level phylogenomics in the genus Rosa. This genome-wide scale dataset contains hundreds of highly variable loci which phylogenetic interest was assessed in terms of phylogenetic informativeness and topological conflict. Our target identification procedure can easily be reproduced to identify new highly informative loci for other taxonomic groups and ranks.
Collapse
Affiliation(s)
- Kevin Debray
- IRHS, Agrocampus-Ouest, INRA, UNIV Angers, SFR 4207 QuaSaV, Beaucouzé, France.
| | | | - Tom Ruttink
- ILVO, Flanders Research Institute for Agriculture, Fisheries and Food, Plant Sciences Unit, Melle, Belgium
| | - Jérémy Clotault
- IRHS, Agrocampus-Ouest, INRA, UNIV Angers, SFR 4207 QuaSaV, Beaucouzé, France
| | - Fabrice Foucher
- IRHS, Agrocampus-Ouest, INRA, UNIV Angers, SFR 4207 QuaSaV, Beaucouzé, France
| | - Valéry Malécot
- IRHS, Agrocampus-Ouest, INRA, UNIV Angers, SFR 4207 QuaSaV, Beaucouzé, France.
| |
Collapse
|
14
|
Improved genomic resources and new bioinformatic workflow for the carcinogenic parasite Clonorchis sinensis: Biotechnological implications. Biotechnol Adv 2018; 36:894-904. [DOI: 10.1016/j.biotechadv.2018.02.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 02/08/2018] [Accepted: 02/13/2018] [Indexed: 12/27/2022]
|
15
|
Zubimendi JP, Martinatto A, Valacco MP, Moreno S, Andreo CS, Drincovich MF, Tronconi MA. The complex allosteric and redox regulation of the fumarate hydratase and malate dehydratase reactions of Arabidopsis thaliana Fumarase 1 and 2 gives clues for understanding the massive accumulation of fumarate. FEBS J 2018; 285:2205-2224. [PMID: 29688630 DOI: 10.1111/febs.14483] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Revised: 02/22/2018] [Accepted: 04/19/2018] [Indexed: 11/30/2022]
Abstract
Arabidopsis thaliana possesses two fumarase genes (FUM), AtFUM1 (At2g47510) encoding for the mitochondrial Krebs cycle-associated enzyme and AtFUM2 (At5g50950) for the cytosolic isoform required for fumarate massive accumulation. Here, the comprehensive biochemical studies of AtFUM1 and AtFUM2 shows that they are active enzymes with similar kinetic parameters but differential regulation. For both enzymes, fumarate hydratase (FH) activity is favored over the malate dehydratase (MD) activity; however, MD is the most regulated activity with several allosteric activators. Oxalacetate, glutamine, and/or asparagine are modulators causing the MD reaction to become preferred over the FH reaction. Activity profiles as a function of pH suggest a suboptimal FUM activity in Arabidopsis cells; moreover, the direction of the FUM reaction is sensitive to pH changes. Under mild oxidation conditions, AtFUMs form high mass molecular aggregates, which present both FUM activities decreased to a different extent. The biochemical properties of oxidized AtFUMs (oxAtFUMs) were completely reversed by NADPH-supplied Arabidopsis leaf extracts, suggesting that the AtFUMs redox regulation can be accomplished in vivo. Mass spectrometry analyses indicate the presence of an active site-associated intermolecular disulfide bridge in oxAtFUMs. Finally, a phylogenetic approach points out that other plant species may also possess cytosolic FUM2 enzymes mainly encoded by paralogous genes, indicating that the evolutionary history of this trait has been drawn through a process of parallel evolution. Overall, according to our results, a multilevel regulatory pattern of FUM activities emerges, supporting the role of this enzyme as a carbon flow monitoring point through the organic acid metabolism in plants.
Collapse
Affiliation(s)
- Juan P Zubimendi
- Centro de Estudios Fotosintéticos y Bioquímicos (CEFOBI), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Santa Fe, Argentina
| | - Andrea Martinatto
- Centro de Estudios Fotosintéticos y Bioquímicos (CEFOBI), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Santa Fe, Argentina
| | - Maria P Valacco
- Departamento de Química Biológica, Facultad de Ciencias exactas y Naturales, Universidad de Buenos Aires (UBA), Argentina
| | - Silvia Moreno
- Departamento de Química Biológica, Facultad de Ciencias exactas y Naturales, Universidad de Buenos Aires (UBA), Argentina
| | - Carlos S Andreo
- Centro de Estudios Fotosintéticos y Bioquímicos (CEFOBI), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Santa Fe, Argentina
| | - María F Drincovich
- Centro de Estudios Fotosintéticos y Bioquímicos (CEFOBI), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Santa Fe, Argentina
| | - Marcos A Tronconi
- Centro de Estudios Fotosintéticos y Bioquímicos (CEFOBI), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Santa Fe, Argentina
| |
Collapse
|
16
|
Evolutionarily Conserved Alternative Splicing Across Monocots. Genetics 2017; 207:465-480. [PMID: 28839042 DOI: 10.1534/genetics.117.300189] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 08/11/2017] [Indexed: 12/22/2022] Open
Abstract
One difficulty when identifying alternative splicing (AS) events in plants is distinguishing functional AS from splicing noise. One way to add confidence to the validity of a splice isoform is to observe that it is conserved across evolutionarily related species. We use a high throughput method to identify junction-based conserved AS events from RNA-Seq data across nine plant species, including five grass monocots (maize, sorghum, rice, Brachpodium, and foxtail millet), plus two nongrass monocots (banana and African oil palm), the eudicot Arabidopsis, and the basal angiosperm Amborella In total, 9804 AS events were found to be conserved between two or more species studied. In grasses containing large regions of conserved synteny, the frequency of conserved AS events is twice that observed for genes outside of conserved synteny blocks. In plant-specific RS and RS2Z subfamilies of the serine/arginine (SR) splice-factor proteins, we observe both conservation and divergence of AS events after the whole genome duplication in maize. In addition, plant-specific RS and RS2Z splice-factor subfamilies are highly connected with R2R3-MYB in STRING functional protein association networks built using genes exhibiting conserved AS. Furthermore, we discovered that functional protein association networks constructed around genes harboring conserved AS events are enriched for phosphatases, kinases, and ubiquitylation genes, which suggests that AS may participate in regulating signaling pathways. These data lay the foundation for identifying and studying conserved AS events in the monocots, particularly across grass species, and this conserved AS resource identifies an additional layer between genotype to phenotype that may impact future crop improvement efforts.
Collapse
|
17
|
Hao DC, Gu X, Xiao P. Anemone medicinal plants: ethnopharmacology, phytochemistry and biology. Acta Pharm Sin B 2017; 7:146-158. [PMID: 28303220 PMCID: PMC5343163 DOI: 10.1016/j.apsb.2016.12.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Revised: 11/28/2016] [Accepted: 11/29/2016] [Indexed: 12/12/2022] Open
Abstract
The Ranunculaceae genus Anemone (order Ranunculales), comprising more than 150 species, mostly herbs, has long been used in folk medicine and worldwide ethnomedicine. Various medicinal compounds have been found in Anemone plants, especially triterpenoid saponins, some of which have shown anti-cancer activities. Some Anemone compounds and extracts display immunomodulatory, anti-inflammatory, antioxidant, and antimicrobial activities. More than 50 species have ethnopharmacological uses, which provide clues for modern drug discovery. Anemone compounds exert anticancer and other bioactivities via multiple pathways. However, a comprehensive review of the Anemone medicinal resources is lacking. We here summarize the ethnomedical knowledge and recent progress on the chemical and pharmacological diversity of Anemone medicinal plants, as well as the emerging molecular mechanisms and functions of these medicinal compounds. The phylogenetic relationships of Anemone species were reconstructed based on nuclear ITS and chloroplast markers. The molecular phylogeny is largely congruent with the morphology-based classification. Commonly used medicinal herbs are distributed in each subgenus and section, and chemical and biological studies of more unexplored taxa are warranted. Gene expression profiling and relevant "omics" platforms could reveal differential effects of phytometabolites. Genomics, transcriptomics, proteomics, and metabolomics should be highlighted in deciphering novel therapeutic mechanisms and utilities of Anemone phytometabolites.
Collapse
Affiliation(s)
- Da-Cheng Hao
- Biotechnology Institute, School of Environment and Chemical Engineering, Dalian Jiaotong University, Dalian 116028, China
| | - Xiaojie Gu
- Biotechnology Institute, School of Environment and Chemical Engineering, Dalian Jiaotong University, Dalian 116028, China
| | - Peigen Xiao
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| |
Collapse
|
18
|
Fofana B, Ghose K, McCallum J, You FM, Cloutier S. UGT74S1 is the key player in controlling secoisolariciresinol diglucoside (SDG) formation in flax. BMC PLANT BIOLOGY 2017; 17:35. [PMID: 28152982 PMCID: PMC5290659 DOI: 10.1186/s12870-017-0982-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Accepted: 01/23/2017] [Indexed: 05/02/2023]
Abstract
BACKGROUND Flax lignan, commonly known as secoisolariciresinol (SECO) diglucoside (SDG), has recently been reported with health-promoting activities, including its positive impact in metabolic diseases. However, not much was reported on the biosynthesis of SDG and its monoglucoside (SMG) until lately. Flax UGT74S1 was recently reported to sequentially glucosylate SECO into SMG and SDG in vitro. However, whether this gene is the only UGT achieving SECO glucosylation in flax was not known. RESULTS Flax genome-wide mining for UGTs was performed. Phylogenetic and gene duplication analyses, heterologous gene expression and enzyme assays were conducted to identify family members closely related to UGT74S1 and to establish their roles in SECO glucosylation. A total of 299 different UGTs were identified, of which 241 (81%) were duplicated. Flax UGTs diverged 2.4-153.6 MYA and 71% were found to be under purifying selection pressure. UGT74S1, a single copy gene located on chromosome 7, displayed no evidence of duplication and was deemed to be under positive selection pressure. The phylogenetic analysis identified four main clusters where cluster 4, which included UGT74S1, was the most diverse. The duplicated UGT74S4 and UGT74S3, located on chromosomes 8 and 14, respectively, were the most closely related to UGT74S1 and were differentially expressed in different tissues. Heterologous expression levels of UGT74S1, UGT74S4 and UGT74S3 proteins were similar but UGT74S4 and UGT74S3 glucosylation activity towards SECO was seven fold less than UGT74S1. In addition, they both failed to produce SDG, suggesting neofunctionalization following their divergence from UGT74S1. CONCLUSIONS We showed that UGT74S1 is closely related to two duplicated genes, UGT74S4 and UGT74S3 which, unlike UGT74S1, failed to glucosylate SMG into SDG. The study suggests that UGT74S1 may be the key player in controlling SECO glucosylation into SDG in flax although its closely related genes may also contribute to a minor extent in supplying the SMG precursor to UGT74S1.
Collapse
Affiliation(s)
- Bourlaye Fofana
- Charlottetown Research and Development Centre, Agriculture and Agri-Food Canada, 440 University Avenue, Charlottetown, Prince Edward Island C1A 4N6 Canada
| | - Kaushik Ghose
- Charlottetown Research and Development Centre, Agriculture and Agri-Food Canada, 440 University Avenue, Charlottetown, Prince Edward Island C1A 4N6 Canada
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX 79409 USA
| | - Jason McCallum
- Charlottetown Research and Development Centre, Agriculture and Agri-Food Canada, 440 University Avenue, Charlottetown, Prince Edward Island C1A 4N6 Canada
| | - Frank M. You
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100 Unit 100, Morden, Manitoba R6M 1Y5 Canada
| | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario K1A 0C6 Canada
| |
Collapse
|
19
|
Li Z, Defoort J, Tasdighian S, Maere S, Van de Peer Y, De Smet R. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms. THE PLANT CELL 2016; 28:326-44. [PMID: 26744215 PMCID: PMC4790876 DOI: 10.1105/tpc.15.00877] [Citation(s) in RCA: 143] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 01/04/2016] [Indexed: 05/02/2023]
Abstract
Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes.
Collapse
Affiliation(s)
- Zhen Li
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Jonas Defoort
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Setareh Tasdighian
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Steven Maere
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium Genomics Research Institute, University of Pretoria, Pretoria 0028, South Africa
| | - Riet De Smet
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| |
Collapse
|
20
|
Hao DC, Xiao PG. Genomics and Evolution in Traditional Medicinal Plants: Road to a Healthier Life. Evol Bioinform Online 2015; 11:197-212. [PMID: 26461812 PMCID: PMC4597484 DOI: 10.4137/ebo.s31326] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Revised: 08/24/2015] [Accepted: 08/31/2015] [Indexed: 02/07/2023] Open
Abstract
Medicinal plants have long been utilized in traditional medicine and ethnomedicine worldwide. This review presents a glimpse of the current status of and future trends in medicinal plant genomics, evolution, and phylogeny. These dynamic fields are at the intersection of phytochemistry and plant biology and are concerned with the evolution mechanisms and systematics of medicinal plant genomes, origin and evolution of the plant genotype and metabolic phenotype, interaction between medicinal plant genomes and their environment, the correlation between genomic diversity and metabolite diversity, and so on. Use of the emerging high-end genomic technologies can be expanded from crop plants to traditional medicinal plants, in order to expedite medicinal plant breeding and transform them into living factories of medicinal compounds. The utility of molecular phylogeny and phylogenomics in predicting chemodiversity and bioprospecting is also highlighted within the context of natural-product-based drug discovery and development. Representative case studies of medicinal plant genome, phylogeny, and evolution are summarized to exemplify the expansion of knowledge pedigree and the paradigm shift to the omics-based approaches, which update our awareness about plant genome evolution and enable the molecular breeding of medicinal plants and the sustainable utilization of plant pharmaceutical resources.
Collapse
Affiliation(s)
- Da-Cheng Hao
- Biotechnology Institute, School of Environment and Chemical Engineering, Dalian Jiaotong University, Dalian, P. R. China
| | - Pei-Gen Xiao
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing, P. R. China
| |
Collapse
|
21
|
Fopa Fomeju B, Falentin C, Lassalle G, Manzanares-Dauleux MJ, Delourme R. Comparative genomic analysis of duplicated homoeologous regions involved in the resistance of Brassica napus to stem canker. FRONTIERS IN PLANT SCIENCE 2015; 6:772. [PMID: 26442081 PMCID: PMC4585320 DOI: 10.3389/fpls.2015.00772] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 09/08/2015] [Indexed: 05/18/2023]
Abstract
All crop species are current or ancient polyploids. Following whole genome duplication, structural and functional modifications result in differential gene content or regulation in the duplicated regions, which can play a fundamental role in the diversification of genes underlying complex traits. We have investigated this issue in Brassica napus, a species with a highly duplicated genome, with the aim of studying the structural and functional organization of duplicated regions involved in quantitative resistance to stem canker, a disease caused by the fungal pathogen Leptosphaeria maculans. Genome-wide association analysis on two oilseed rape panels confirmed that duplicated regions of ancestral blocks E, J, R, U, and W were involved in resistance to stem canker. The structural analysis of the duplicated genomic regions showed a higher gene density on the A genome than on the C genome and a better collinearity between homoeologous regions than paralogous regions, as overall in the whole B. napus genome. The three ancestral sub-genomes were involved in the resistance to stem canker and the fractionation profile of the duplicated regions corresponded to what was expected from results on the B. napus progenitors. About 60% of the genes identified in these duplicated regions were single-copy genes while less than 5% were retained in all the duplicated copies of a given ancestral block. Genes retained in several copies were mainly involved in response to stress, signaling, or transcription regulation. Genes with resistance-associated markers were mainly retained in more than two copies. These results suggested that some genes underlying quantitative resistance to stem canker might be duplicated genes. Genes with a hydrolase activity that were retained in one copy or R-like genes might also account for resistance in some regions. Further analyses need to be conducted to indicate to what extent duplicated genes contribute to the expression of the resistance phenotype.
Collapse
Affiliation(s)
| | - Cyril Falentin
- Institut National de la Recherche Agronomique, UMR1349 IGEPPLe Rheu, France
| | - Gilles Lassalle
- Institut National de la Recherche Agronomique, UMR1349 IGEPPLe Rheu, France
| | | | - Régine Delourme
- Institut National de la Recherche Agronomique, UMR1349 IGEPPLe Rheu, France
| |
Collapse
|
22
|
Granados Mendoza C, Naumann J, Samain MS, Goetghebeur P, De Smet Y, Wanke S. A genome-scale mining strategy for recovering novel rapidly-evolving nuclear single-copy genes for addressing shallow-scale phylogenetics in Hydrangea. BMC Evol Biol 2015; 15:132. [PMID: 26141718 PMCID: PMC4491267 DOI: 10.1186/s12862-015-0416-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 06/09/2015] [Indexed: 12/21/2022] Open
Abstract
Background Identifying orthologous molecular markers that potentially resolve relationships at and below species level has been a major challenge in molecular phylogenetics over the past decade. Non-coding regions of nuclear low- or single-copy markers are a vast and promising source of data providing information for shallow-scale phylogenetics. Taking advantage of public transcriptome data from the One Thousand Plant Project (1KP), we developed a genome-scale mining strategy for recovering potentially orthologous single-copy markers to address low-scale phylogenetics. Our marker design targeted the amplification of intron-rich nuclear single-copy regions from genomic DNA. As a case study we used Hydrangea section Cornidia, one of the most recently diverged lineages within Hydrangeaceae (Cornales), for comparing the performance of three of these nuclear markers to other “fast” evolving plastid markers. Results Our data mining and filtering process retrieved 73 putative nuclear single-copy genes which are potentially useful for resolving phylogenetic relationships at a range of divergence depths within Cornales. The three assessed nuclear markers showed considerably more phylogenetic signal for shallow evolutionary depths than conventional plastid markers. Phylogenetic signal in plastid markers increased less markedly towards deeper evolutionary divergences. Potential phylogenetic noise introduced by nuclear markers was lower than their respective phylogenetic signal across all evolutionary depths. In contrast, plastid markers showed higher probabilities for introducing phylogenetic noise than signal at the deepest evolutionary divergences within the tribe Hydrangeeae (Hydrangeaceae). Conclusions While nuclear single-copy markers are highly informative for shallow evolutionary depths without introducing phylogenetic noise, plastid markers might be more appropriate for resolving deeper-level divergences such as the backbone relationships of the Hydrangeaceae family and deeper, at which non-coding parts of nuclear markers could potentially introduce noise due to elevated rates of evolution. The herein developed and demonstrated transcriptome based mining strategy has a great potential for the design of novel and highly informative nuclear markers for a range of plant groups and evolutionary scales. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0416-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Carolina Granados Mendoza
- Department of Biology, Research Group Spermatophytes, Ghent University, K.L. Ledeganckstraat 35, 9000, Ghent, Belgium. .,Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Apartado Postal 70-367, 04510, Coyoacán, Distrito Federal, Mexico.
| | - Julia Naumann
- Institut für Botanik, Technische Universität Dresden, Zellescher Weg 20b, 01062, Dresden, Germany.
| | - Marie-Stéphanie Samain
- Department of Biology, Research Group Spermatophytes, Ghent University, K.L. Ledeganckstraat 35, 9000, Ghent, Belgium. .,Instituto de Ecología, A.C., Centro Regional del Bajío, Avenida Lázaro Cárdenas 253, 61600, Pátzcuaro, Michoacán, Mexico.
| | - Paul Goetghebeur
- Department of Biology, Research Group Spermatophytes, Ghent University, K.L. Ledeganckstraat 35, 9000, Ghent, Belgium.
| | - Yannick De Smet
- Department of Biology, Research Group Spermatophytes, Ghent University, K.L. Ledeganckstraat 35, 9000, Ghent, Belgium.
| | - Stefan Wanke
- Institut für Botanik, Technische Universität Dresden, Zellescher Weg 20b, 01062, Dresden, Germany.
| |
Collapse
|
23
|
Single-copy gene based 50 K SNP chip for genetic studies and molecular breeding in rice. Sci Rep 2015; 5:11600. [PMID: 26111882 PMCID: PMC4481378 DOI: 10.1038/srep11600] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2015] [Accepted: 05/26/2015] [Indexed: 11/17/2022] Open
Abstract
Single nucleotide polymorphism (SNP) is the most abundant DNA sequence variation present in plant genomes. Here, we report the design and validation of a unique genic-SNP genotyping chip for genetic and evolutionary studies as well as molecular breeding applications in rice. The chip incorporates 50,051 SNPs from 18,980 different genes spanning 12 rice chromosomes, including 3,710 single-copy (SC) genes conserved between wheat and rice, 14,959 SC genes unique to rice, 194 agronomically important cloned rice genes and 117 multi-copy rice genes. Assays with this chip showed high success rate and reproducibility because of the SC gene based array with no sequence redundancy and cross-hybridisation problems. The usefulness of the chip in genetic diversity and phylogenetic studies of cultivated and wild rice germplasm was demonstrated. Furthermore, its efficacy was validated for analysing background recovery in improved mega rice varieties with submergence tolerance developed through marker-assisted backcross breeding.
Collapse
|