1
|
Han S, Zhang S, Yi R, Bi D, Ding H, Yang J, Ye Y, Xu W, Wu L, Zhuo R, Kan X. Phylogenomics and plastomics offer new evolutionary perspectives on Kalanchoideae (Crassulaceae). ANNALS OF BOTANY 2024; 133:585-604. [PMID: 38359907 PMCID: PMC11037489 DOI: 10.1093/aob/mcae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 02/08/2024] [Indexed: 02/17/2024]
Abstract
BACKGROUND AND AIMS Kalanchoideae is one of three subfamilies within Crassulaceae and contains four genera. Despite previous efforts, the phylogeny of Kalanchoideae remains inadequately resolved with persistent issues including low support, unstructured topologies and polytomies. This study aimed to address two central objectives: (1) resolving the pending phylogenetic questions within Kalanchoideae by using organelle-scale 'barcodes' (plastomes) and nuclear data; and (2) investigating interspecific diversity patterns among Kalanchoideae plastomes. METHODS To explore the plastome evolution in Kalanchoideae, we newly sequenced 38 plastomes representing all four constituent genera (Adromischus, Cotyledon, Kalanchoe and Tylecodon). We performed comparative analyses of plastomic features, including GC and gene contents, gene distributions at the IR (inverted repeat) boundaries, nucleotide divergence, plastomic tRNA (pttRNA) structures and codon aversions. Additionally, phylogenetic inferences were inferred using both the plastomic dataset (79 genes) and nuclear dataset (1054 genes). KEY RESULTS Significant heterogeneities were observed in plastome lengths among Kalanchoideae, strongly correlated with LSC (large single copy) lengths. Informative diversities existed in the gene content at SSC/IRa (small single copy/inverted repeat a), with unique patterns individually identified in Adromischus leucophyllus and one major Kalanchoe clade. The ycf1 gene was assessed as a shared hypervariable region among all four genera, containing nine lineage-specific indels. Three pttRNAs exhibited unique structures specific to Kalanchoideae and the genera Adromischus and Kalanchoe. Moreover, 24 coding sequences revealed a total of 41 lineage-specific unused codons across all four constituent genera. The phyloplastomic inferences clearly depicted internal branching patterns in Kalanchoideae. Most notably, by both plastid- and nuclear-based phylogenies, our research offers the first evidence that Kalanchoe section Eukalanchoe is not monophyletic. CONCLUSIONS This study conducted comprehensive analyses on 38 newly reported Kalanchoideae plastomes. Importantly, our results not only reconstructed well-resolved phylogenies within Kalanchoideae, but also identified highly informative unique markers at the subfamily, genus and species levels. These findings significantly enhance our understanding of the evolutionary history of Kalanchoideae.
Collapse
Affiliation(s)
- Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Hengwu Ding
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jianke Yang
- School of Basic Medical Sciences, Wannan Medical College, Wuhu 241000, China
| | - Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Wenzhong Xu
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Longhua Wu
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China
| | - Renying Zhuo
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding of Zhejiang Province, Research Institute of Subtropical Forestry, Chinese Academy of Forestry, Hangzhou 311400, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| |
Collapse
|
2
|
Yang J, Ye Y, Yi R, Bi D, Zhang S, Han S, Kan X. A new perspective on codon usage, selective pressure, and phylogenetic implications of the plastomes in the Telephium clade (Crassulaceae). Gene 2024; 892:147871. [PMID: 37797779 DOI: 10.1016/j.gene.2023.147871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 09/13/2023] [Accepted: 10/02/2023] [Indexed: 10/07/2023]
Abstract
The Telephium clade of the Crassulaceae family contains many medicinal, ornamental, and ecologically restorative plants. However, the phylogenetic relationships within the clade remain debated, and comprehensive analyses of codon usage and selection pressure in Telephium plastomes are limited. In this study, we assembled and annotated four plastomes and performed extensive analyses. The plastomes exhibited a typical quadripartite structure and high conservation. The lengths ranged from 151,357 bp to 151,641 bp with 134 genes identified. The GC content was the highest within IR, followed by LSC, and lowest in the SSC region. Meanwhile, a unique inversion was observed within the LSC region of Meterostachys sikokianus. Polymorphisms analysis revealed minimum nucleotide diversity in the IR regions, with over ten highly polymorphic regions identified. Phylogenetically, two subclades formed within the monophyletic Telephium clade, with Umbilicus as the sister group to the remaining Hylotelephium subclade members. Notably, no significant positive selection was found among the 79 plastid genes, which showed varying evolutionary patterns. However, 19 genes contained codons under positive selection. The specific functions of these sites require further investigation. Synonymous codon usage was biased and conserved across the tested plastomes, shaped by natural selection, mutations and other factors of varying influence. We also identified 34 taxon-specific codon aversion motifs from 49 plastid genes. Our plastomic analyses elucidate phylogenetic relationships and evolutionary patterns in this medicinal clade, providing a foundation for further research on these ecologically and pharmaceutically important plants.
Collapse
Affiliation(s)
- Jianke Yang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China; School of Basic Medical Sciences, Wannan Medical College, Wuhu, Anhui, China
| | - Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China
| | - Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China
| | - De Bi
- College of Landscape Engineering, Suzhou Polytechnic Institute of Agriculture, Suzhou, Jiangsu, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China
| | - Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, Anhui, China.
| |
Collapse
|
3
|
Khandia R, Pandey MK, Rzhepakovsky IV, Khan AA, Alexiou A. Synonymous Codon Variant Analysis for Autophagic Genes Dysregulated in Neurodegeneration. Mol Neurobiol 2023; 60:2252-2267. [PMID: 36637744 DOI: 10.1007/s12035-022-03081-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 09/27/2022] [Indexed: 01/14/2023]
Abstract
Neurodegenerative disorders are often a culmination of the accumulation of abnormally folded proteins and defective organelles. Autophagy is a process of removing these defective proteins, organelles, and harmful substances from the body, and it works to maintain homeostasis. If autophagic removal of defective proteins has interfered, it affects neuronal health. Some of the autophagic genes are specifically found to be associated with neurodegenerative phenotypes. Non-functional, mutated, or gene copies having silent mutations, often termed synonymous variants, might explain this. However, these synonymous variant which codes for exactly similar proteins have different translation rates, stability, and gene expression profiling. Hence, it would be interesting to study the pattern of synonymous variant usage. In the study, synonymous variant usage in various transcripts of autophagic genes ATG5, ATG7, ATG8A, ATG16, and ATG17/FIP200 reported to cause neurodegeneration (if dysregulated) is studied. These genes were analyzed for their synonymous variant usage; nucleotide composition; any possible nucleotide skew in a gene; physical properties of autophagic protein including GRAVY and AROMA; hydropathicity; instability index; and frequency of acidic, basic, neutral amino acids; and gene expression level. The study will help understand various evolutionary forces acting on these genes and the possible augmentation of a gene if showing unusual behavior.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, 462026, India.
| | - Megha Katare Pandey
- Department of Translational Medicine, All India Institute of Medical Sciences, Bhopal, 462020, India
| | | | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451, Saudi Arabia.
| | - Athanasios Alexiou
- Novel Global Community Educational Foundation, Hebersham, Australia
- AFNP Med, Wien, Austria
| |
Collapse
|
4
|
Ding H, Han S, Ye Y, Bi D, Zhang S, Yi R, Gao J, Yang J, Wu L, Kan X. Ten Plastomes of Crassula (Crassulaceae) and Phylogenetic Implications. BIOLOGY 2022; 11:1779. [PMID: 36552287 PMCID: PMC9775174 DOI: 10.3390/biology11121779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/05/2022] [Accepted: 12/05/2022] [Indexed: 12/12/2022]
Abstract
The genus Crassula is the second-largest genus in the family Crassulaceae, with about 200 species. As an acknowledged super-barcode, plastomes have been extensively utilized for plant evolutionary studies. Here, we first report 10 new plastomes of Crassula. We further focused on the structural characterizations, codon usage, aversion patterns, and evolutionary rates of plastomes. The IR junction patterns-IRb had 110 bp expansion to rps19-were conservative among Crassula species. Interestingly, we found the codon usage patterns of matK gene in Crassula species are unique among Crassulaceae species with elevated ENC values. Furthermore, subgenus Crassula species have specific GC-biases in the matK gene. In addition, the codon aversion motifs from matK, pafI, and rpl22 contained phylogenetic implications within Crassula. The evolutionary rates analyses indicated all plastid genes of Crassulaceae were under the purifying selection. Among plastid genes, ycf1 and ycf2 were the most rapidly evolving genes, whereas psaC was the most conserved gene. Additionally, our phylogenetic analyses strongly supported that Crassula is sister to all other Crassulaceae species. Our findings will be useful for further evolutionary studies within the Crassula and Crassulaceae.
Collapse
Affiliation(s)
- Hengwu Ding
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - De Bi
- College of Landscape Engineering, Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jinming Gao
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jianke Yang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Longhua Wu
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| |
Collapse
|
5
|
Han S, Wang R, Hong X, Wu C, Zhang S, Kan X. Plastomes of Bletilla (Orchidaceae) and Phylogenetic Implications. Int J Mol Sci 2022; 23:ijms231710151. [PMID: 36077549 PMCID: PMC9456473 DOI: 10.3390/ijms231710151] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 08/22/2022] [Accepted: 09/02/2022] [Indexed: 11/16/2022] Open
Abstract
The genus Bletilla is a small genus of only five species distributed across Asia, including B. chartacea, B. foliosa, B. formosana, B. ochracea and B. striata, which is of great medicinal importance. Furthermore, this genus is a member of the key tribe Arethuseae (Orchidaceae), harboring an extremely complicated taxonomic history. Recently, the monophyletic status of Bletilla has been challenged, and the phylogenetic relationships within this genus are still unclear. The plastome, which is rich in both sequence and structural variation, has emerged as a powerful tool for understanding plant evolution. Along with four new plastomes, this work is committed to exploring plastomic markers to elucidate the phylogeny of Bletilla. Our results reveal considerable plastomic differences between B. sinensis and the other three taxa in many aspects. Most importantly, the specific features of the IR junction patterns, novel pttRNA structures and codon aversion motifs can serve as useful molecular markers for Bletilla phylogeny. Moreover, based on maximum likelihood and Bayesian inference methods, our phylogenetic analyses based on two datasets of Arethuseae strongly imply that Bletilla is non-monophyletic. Accordingly, our findings from this study provide novel potential markers for species identification, and shed light on the evolution of Bletilla and Arethuseae.
Collapse
Affiliation(s)
- Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu 241000, China
| | - Rongbin Wang
- Institute of Chinese Medicine Resources, Anhui College of Traditional Chinese Medicine, Wuhu 241002, China
- Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Xin Hong
- Anhui Provincial Engineering Laboratory of Wetland Ecosystem Protection and Restoration, School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China
| | - Cuilian Wu
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu 241000, China
- Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu 241000, China
- Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu 241000, China
- Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Correspondence: ; Tel.: +86-139-5537-2268
| |
Collapse
|
6
|
Han S, Bi D, Yi R, Ding H, Wu L, Kan X. Plastome evolution of Aeonium and Monanthes (Crassulaceae): insights into the variation of plastomic tRNAs, and the patterns of codon usage and aversion. PLANTA 2022; 256:35. [PMID: 35809200 DOI: 10.1007/s00425-022-03950-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Accepted: 06/24/2022] [Indexed: 06/15/2023]
Abstract
This study reported 13 new plastomes from Aeonium and Monanthes, and observed new markers for phylogeny and DNA barcoding, such as novel tRNA structures and codon usage bias and aversion. The Macaronesian clade of Crassulaceae consists of three genera: Aichryson, with about 15 species; Monanthes, with about 10 species; Aeonium, with about 40 species. Within this clade, Aeonium, known as "the botanical equivalent of Darwin's finches", is regarded as an excellent model plant for researching adaptive evolution. Differing from the well-resolved relationships among three genera of the Macaronesian clade, the internal branching patterns within the genus Aeonium are largely unclear. In this study, we first reported 13 new plastomes from genus Aeonium and the closely related genus Monanthes. We further performed comprehensive analyses of the plastomes, with focuses on the secondary structures of pttRNAs and the patterns of codon usage and aversion. With a typical circular and quadripartite structure, the 13 plastomes ranged from 149,900 to 151,030 bp in size, and the unique pattern in IR junctions might become a family-specific marker for Crassulaceae species. Surprisingly, the π values of plastomes from Monanthes were almost twice those from Aeonium. Most importantly, we strongly recommend that highly polymorphic regions, novel putative pttRNA structures, patterns of codon usage bias and aversion derived from plastomes might have phylogenetic implications, and could act as new markers for DNA barcoding of plants. The results of phylogenetic analyses strongly supported a clear internal branching pattern in Macaronesian clade (represented by Aeonium and Monanthes), with higher nodal support values. The findings reported here will provide new insights into the variation of pttRNAs, and the patterns of codon usage and aversion of the family Crassulaceae.
Collapse
Affiliation(s)
- Shiyun Han
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
| | - De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou, 215000, Jiangsu, China
| | - Ran Yi
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
| | - Hengwu Ding
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
| | - Longhua Wu
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing, 210008, Jiangsu, China
| | - Xianzhao Kan
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China.
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu, 241000, Anhui, China.
| |
Collapse
|
7
|
Determination of the Amino Acid Recruitment Order in Early Life by Genome-Wide Analysis of Amino Acid Usage Bias. Biomolecules 2022; 12:biom12020171. [PMID: 35204672 PMCID: PMC8961565 DOI: 10.3390/biom12020171] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 01/14/2022] [Accepted: 01/18/2022] [Indexed: 12/11/2022] Open
Abstract
The mechanisms shaping the amino acids recruitment pattern into the proteins in the early life history presently remains a huge mystery. In this study, we conducted genome-wide analyses of amino acids usage and genetic codons structure in 7270 species across three domains of life. The carried-out analyses evidenced ubiquitous usage bias of amino acids that were likely independent from codon usage bias. Taking advantage of codon usage bias, we performed pseudotime analysis to re-determine the chronological order of the species emergence, which inspired a new species relationship by tracing the imprint of codon usage evolution. Furthermore, the multidimensional data integration showed that the amino acids A, D, E, G, L, P, R, S, T and V might be the first recruited into the last universal common ancestry (LUCA) proteins. The data analysis also indicated that the remaining amino acids most probably were gradually incorporated into proteogenesis process in the course of two long-timescale parallel evolutionary routes: I→F→Y→C→M→W and K→N→Q→H. This study provides new insight into the origin of life, particularly in terms of the basic protein composition of early life. Our work provides crucial information that will help in a further understanding of protein structure and function in relation to their evolutionary history.
Collapse
|
8
|
McKinnon LM, Miller JB, Whiting MF, Kauwe JSK, Ridge PG. A comprehensive analysis of the phylogenetic signal in ramp sequences in 211 vertebrates. Sci Rep 2021; 11:622. [PMID: 33436653 PMCID: PMC7803996 DOI: 10.1038/s41598-020-78803-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 11/23/2020] [Indexed: 01/24/2023] Open
Abstract
Ramp sequences increase translational speed and accuracy when rare, slowly-translated codons are found at the beginnings of genes. Here, the results of the first analysis of ramp sequences in a phylogenetic construct are presented. Ramp sequences were compared from 247 vertebrates (114 Mammalian and 133 non-mammalian), where the presence and absence of ramp sequences was analyzed as a binary character in a parsimony and maximum likelihood framework. Additionally, ramp sequences were mapped to the Open Tree of Life synthetic tree to determine the number of parallelisms and reversals that occurred, and those results were compared to random permutations. Parsimony and maximum likelihood analyses of the presence and absence of ramp sequences recovered phylogenies that are highly congruent with established phylogenies. Additionally, 81% of vertebrate mammalian ramps and 81.2% of other vertebrate ramps had less parallelisms and reversals than the mean from 1000 randomly permuted trees. A chi-square analysis of completely orthologous ramp sequences resulted in a p-value < 0.001 as compared to random chance. Ramp sequences recover comparable phylogenies as other phylogenomic methods. Although not all ramp sequences appear to have a phylogenetic signal, more ramp sequences track speciation than expected by random chance. Therefore, ramp sequences may be used in conjunction with other phylogenomic approaches if many orthologs are taken into account. However, phylogenomic methods utilizing few orthologs should be cautious in incorporating ramp sequences because individual ramp sequences may provide conflicting signals.
Collapse
Affiliation(s)
- Lauren M McKinnon
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Michael F Whiting
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
- Monte L. Bean Museum, Brigham Young University, Provo, UT, 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Perry G Ridge
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA.
| |
Collapse
|
9
|
Hodgman MW, Miller JB, Meurs TE, Kauwe JSK. CUBAP: an interactive web portal for analyzing codon usage biases across populations. Nucleic Acids Res 2020; 48:11030-11039. [PMID: 33045750 PMCID: PMC7641757 DOI: 10.1093/nar/gkaa863] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/18/2020] [Accepted: 09/22/2020] [Indexed: 12/19/2022] Open
Abstract
Synonymous codon usage significantly impacts translational and transcriptional efficiency, gene expression, the secondary structure of both mRNA and proteins, and has been implicated in various diseases. However, population-specific differences in codon usage biases remain largely unexplored. Here, we present a web server, https://cubap.byu.edu, to facilitate analyses of codon usage biases across populations (CUBAP). Using the 1000 Genomes Project, we calculated and visually depict population-specific differences in codon frequencies, codon aversion, identical codon pairing, co-tRNA codon pairing, ramp sequences, and nucleotide composition in 17,634 genes. We found that codon pairing significantly differs between populations in 35.8% of genes, allowing us to successfully predict the place of origin for African and East Asian individuals with 98.8% and 100% accuracy, respectively. We also used CUBAP to identify a significant bias toward decreased CTG pairing in the immunity related GTPase M (IRGM) gene in East Asian and African populations, which may contribute to the decreased association of rs10065172 with Crohn's disease in those populations. CUBAP facilitates in-depth gene-specific and codon-specific visualization that will aid in analyzing candidate genes identified in genome-wide association studies, identifying functional implications of synonymous variants, predicting population-specific impacts of synonymous variants and categorizing genetic biases unique to certain populations.
Collapse
Affiliation(s)
- Matthew W Hodgman
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Taylor E Meurs
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|
10
|
Miller JB, McKinnon LM, Whiting MF, Kauwe JSK, Ridge PG. Codon Pairs are Phylogenetically Conserved: A comprehensive analysis of codon pairing conservation across the Tree of Life. PLoS One 2020; 15:e0232260. [PMID: 32401752 PMCID: PMC7219770 DOI: 10.1371/journal.pone.0232260] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 04/10/2020] [Indexed: 11/27/2022] Open
Abstract
Identical codon pairing and co-tRNA codon pairing increase translational efficiency within genes when two codons that encode the same amino acid are translated by the same tRNA before it diffuses from the ribosome. We examine the phylogenetic signal in both identical and co-tRNA codon pairing across 23 428 species using alignment-free and parsimony methods. We determined that conserved codon pairing typically has a smaller window size than the length of a ribosome, and codon pairing tracks phylogenies across various taxonomic groups. We report a comprehensive analysis of codon pairing, including the extent to which each codon pairs. Our parsimony method generally recovers phylogenies that are more congruent with the established phylogenies than our alignment-free method. However, four of the ten taxonomic groups did not have sufficient orthologous codon pairings and were therefore analyzed using only the alignment-free methods. Since the recovered phylogenies using only codon pairing largely match phylogenies from the Open Tree of Life and the NCBI taxonomy, and are comparable to trees recovered by other algorithms, we propose that codon pairing biases are phylogenetically conserved and should be considered in conjunction with other phylogenomic techniques.
Collapse
Affiliation(s)
- Justin B. Miller
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Lauren M. McKinnon
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Michael F. Whiting
- Department of Biology, Brigham Young University, Provo, UT, United States of America
- M.L. Bean Museum, Brigham Young University, Provo, UT, United States of America
| | - John S. K. Kauwe
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Perry G. Ridge
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| |
Collapse
|
11
|
Miller JB, McKinnon LM, Whiting MF, Ridge PG. Codon use and aversion is largely phylogenetically conserved across the tree of life. Mol Phylogenet Evol 2019; 144:106697. [PMID: 31805345 DOI: 10.1016/j.ympev.2019.106697] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 04/10/2019] [Accepted: 11/29/2019] [Indexed: 01/11/2023]
Abstract
Using parsimony, we analyzed codon usages across 12,337 species and 25,727 orthologous genes to rank specific genes and codons according to their phylogenetic signal. We examined each codon within each ortholog to determine the codon usage for each species. In total, 890,814 codons were parsimony informative. Next, we compared species that used a codon with species that did not use the codon. We assessed each codon's congruence with species relationships provided in the Open Tree of Life (OTL) and determined the statistical probability of observing these results by random chance. We determined that 25,771 codons had no parallelisms or reversals when mapped to the OTL. Codon usages from orthologous genes spanning many species were 1109× more likely to be congruent with species relationships in the OTL than would be expected by random chance. Using the OTL as a reference, we show that codon usage is phylogenetically conserved within orthologous genes in archaea, bacteria, plants, mammals, and other vertebrates. We also show how to use our provided framework to test different tree hypotheses by confirming the placement of turtles as sister taxa to archosaurs.
Collapse
Affiliation(s)
- Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Lauren M McKinnon
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Michael F Whiting
- Department of Biology, Brigham Young University, Provo, UT 84602, USA; M.L. Bean Museum, Brigham Young University, Provo, UT 84602, USA
| | - Perry G Ridge
- Department of Biology, Brigham Young University, Provo, UT 84602, USA.
| |
Collapse
|
12
|
Miller JB, McKinnon LM, Whiting MF, Ridge PG. CAM: an alignment-free method to recover phylogenies using codon aversion motifs. PeerJ 2019; 7:e6984. [PMID: 31198636 PMCID: PMC6555396 DOI: 10.7717/peerj.6984] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 04/17/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Common phylogenomic approaches for recovering phylogenies are often time-consuming and require annotations for orthologous gene relationships that are not always available. In contrast, alignment-free phylogenomic approaches typically use structure and oligomer frequencies to calculate pairwise distances between species. We have developed an approach to quickly calculate distances between species based on codon aversion. METHODS Utilizing a novel alignment-free character state, we present CAM, an alignment-free approach to recover phylogenies by comparing differences in codon aversion motifs (i.e., the set of unused codons within each gene) across all genes within a species. Synonymous codon usage is non-random and differs between organisms, between genes, and even within a single gene, and many genes do not use all possible codons. We report a comprehensive analysis of codon aversion within 229,742,339 genes from 23,428 species across all kingdoms of life, and we provide an alignment-free framework for its use in a phylogenetic construct. For each species, we first construct a set of codon aversion motifs spanning all genes within that species. We define the pairwise distance between two species, A and B, as one minus the number of shared codon aversion motifs divided by the total codon aversion motifs of the species, A or B, containing the fewest motifs. This approach allows us to calculate pairwise distances even when substantial differences in the number of genes or a high rate of divergence between species exists. Finally, we use neighbor-joining to recover phylogenies. RESULTS Using the Open Tree of Life and NCBI Taxonomy Database as expected phylogenies, our approach compares well, recovering phylogenies that largely match expected trees and are comparable to trees recovered using maximum likelihood and other alignment-free approaches. Our technique is much faster than maximum likelihood and similar in accuracy to other alignment-free approaches. Therefore, we propose that codon aversion be considered a phylogenetically conserved character that may be used in future phylogenomic studies. AVAILABILITY CAM, documentation, and test files are freely available on GitHub at https://github.com/ridgelab/cam.
Collapse
Affiliation(s)
- Justin B. Miller
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Lauren M. McKinnon
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Michael F. Whiting
- Department of Biology, Brigham Young University, Provo, UT, United States of America
- Brigham Young University, M.L. Bean Museum, Provo, UT, United States of America
| | - Perry G. Ridge
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| |
Collapse
|