1
|
Shen L, Chen S, Liang M, Qu S, Feng S, Wang D, Wang G. Comparative analysis of codon usage bias in chloroplast genomes of ten medicinal species of Rutaceae. BMC PLANT BIOLOGY 2024; 24:424. [PMID: 38764045 PMCID: PMC11103831 DOI: 10.1186/s12870-024-04999-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 04/08/2024] [Indexed: 05/21/2024]
Abstract
Rutaceae family comprises economically important plants due to their extensive applications in spices, food, oil, medicine, etc. The Rutaceae plants is able to better utilization through biotechnology. Modern biotechnological approaches primarily rely on the heterologous expression of functional proteins in different vectors. However, several proteins are difficult to express outside their native environment. The expression potential of functional genes in heterologous systems can be maximized by replacing the rare synonymous codons in the vector with preferred optimal codons of functional genes. Codon usage bias plays a critical role in biogenetic engineering-based research and development. In the current study, 727 coding sequences (CDSs) obtained from the chloroplast genomes of ten Rutaceae plant family members were analyzed for codon usage bias. The nucleotide composition analysis of codons showed that these codons were rich in A/T(U) bases and preferred A/T(U) endings. Analyses of neutrality plots, effective number of codons (ENC) plots, and correlations between ENC and codon adaptation index (CAI) were conducted, which revealed that natural selection is a major driving force for the Rutaceae plant family's codon usage bias, followed by base mutation. In the ENC vs. CAI plot, codon usage bias in the Rutaceae family had a negligible relationship with gene expression level. For each sample, we screened 12 codons as preferred and high-frequency codons simultaneously, of which GCU encoding Ala, UUA encoding Leu, and AGA encoding Arg were the most preferred codons. Taken together, our study unraveled the synonymous codon usage pattern in the Rutaceae family, providing valuable information for the genetic engineering of Rutaceae plant species in the future.
Collapse
Affiliation(s)
- Lianwen Shen
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China Ministry of Education, Southwest Forestry University, Kunming, 650224, China
- Key Laboratory for Forest Genetics and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, 650224, China
- Guizhou Academy of Forestry, Guiyang, 550005, China
| | | | - Mei Liang
- Guizhou Province Forestry Science and Technology Extension Station, Guiyang, 550000, China
| | - Shang Qu
- Guizhou Academy of Forestry, Guiyang, 550005, China
| | - Shijing Feng
- College of Forestry, Guizhou University, Guiyang, 550025, China
| | - Dawei Wang
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China Ministry of Education, Southwest Forestry University, Kunming, 650224, China.
- Key Laboratory for Forest Genetics and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, 650224, China.
| | - Gang Wang
- Guizhou Academy of Forestry, Guiyang, 550005, China.
| |
Collapse
|
2
|
Zhao R, Li H, Wu G, Wang YF. Codon usage bias analysis in the mitochondrial genomes of five Rhingia Scopoli (Diptera, Syrphidae, Eristalinae) species. Gene 2024; 917:148466. [PMID: 38615984 DOI: 10.1016/j.gene.2024.148466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 03/26/2024] [Accepted: 04/11/2024] [Indexed: 04/16/2024]
Abstract
This study presents the sequencing and annotation of mitochondrial genomes from five Rhingia species of the family Syrphidae, focusing on codon bias. Each species possessed 22 tRNAs genes, 13 protein-coding genes, 2 rRNAs genes, and a control region, without any observed gene rearrangements. Nucleotide composition analysis revealed a higher AT content compared with GC content, indicating AT enrichment. Neutrality plot, Parity rule 2 bias, and effective number of codons plot analyses collectively indicated that natural selection primarily influences the codon usage bias in the five Rhingia species. Relative synonymous codon usage analysis identified the optimal codons for Rhingia binotata, R. fromosana, R. campestris, R. louguanensis, and R. xanthopoda as 10, 14, 10, 11, and 12, respectively, all ending with A/U and exhibiting AT preference. Phylogenetic analysis, based on maximum likelihood and Bayesian inference methods applied to three datasets, confirmed the monophyly of Rhingia. In conclusion, this research establishes a foundation for understanding the phylogenetic evolution and codon usage patterns in Rhingia, offering valuable for future studies.
Collapse
Affiliation(s)
- Rui Zhao
- Shaanxi Key Laboratory of Bio-Resources, State Key Laboratory of Biological Resources and Ecological Environment of Qinling-Bashan, Qinling-Bashan Mountains Bioresources Comprehensive Development C.I.C., School of Biological Science & Engineering, Shaanxi University of Technology, Hanzhong, Shaanxi, China
| | - Hu Li
- Shaanxi Key Laboratory of Bio-Resources, State Key Laboratory of Biological Resources and Ecological Environment of Qinling-Bashan, Qinling-Bashan Mountains Bioresources Comprehensive Development C.I.C., School of Biological Science & Engineering, Shaanxi University of Technology, Hanzhong, Shaanxi, China.
| | - Gang Wu
- Shaanxi Key Laboratory of Bio-Resources, State Key Laboratory of Biological Resources and Ecological Environment of Qinling-Bashan, Qinling-Bashan Mountains Bioresources Comprehensive Development C.I.C., School of Biological Science & Engineering, Shaanxi University of Technology, Hanzhong, Shaanxi, China
| | - Yi-Fan Wang
- Shaanxi Key Laboratory of Bio-Resources, State Key Laboratory of Biological Resources and Ecological Environment of Qinling-Bashan, Qinling-Bashan Mountains Bioresources Comprehensive Development C.I.C., School of Biological Science & Engineering, Shaanxi University of Technology, Hanzhong, Shaanxi, China
| |
Collapse
|
3
|
Khandia R, Gurjar P, Kamal MA, Greig NH. Relative synonymous codon usage and codon pair analysis of depression associated genes. Sci Rep 2024; 14:3502. [PMID: 38346990 PMCID: PMC10861588 DOI: 10.1038/s41598-024-51909-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 01/11/2024] [Indexed: 02/15/2024] Open
Abstract
Depression negatively impacts mood, behavior, and mental and physical health. It is the third leading cause of suicides worldwide and leads to decreased quality of life. We examined 18 genes available at the genetic testing registry (GTR) from the National Center for Biotechnological Information to investigate molecular patterns present in depression-associated genes. Different genotypes and differential expression of the genes are responsible for ensuing depression. The present study, investigated codon pattern analysis, which might play imperative roles in modulating gene expression of depression-associated genes. Of the 18 genes, seven and two genes tended to up- and down-regulate, respectively, and, for the remaining genes, different genotypes, an outcome of SNPs were responsible alone or in combination with differential expression for different conditions associated with depression. Codon context analysis revealed the abundance of identical GTG-GTG and CTG-CTG pairs, and the rarity of methionine-initiated codon pairs. Information based on codon usage, preferred codons, rare, and codon context might be used in constructing a deliverable synthetic construct to correct the gene expression level of the human body, which is altered in the depressive state. Other molecular signatures also revealed the role of evolutionary forces in shaping codon usage.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, 462026, MP, India.
| | - Pankaj Gurjar
- Centre for Global Health Research, Saveetha Medical College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamilnadu, India
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, NSW, Australia
| | - Mohammad Amjad Kamal
- Joint Laboratory of Artificial Intelligence in Healthcare, Institutes for Systems Genetics and West China School of Nursing, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
- Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, 1207, Bangladesh
- Enzymoics, Novel Global Community Educational Foundation, 7 Peterlee place, Hebersham, NSW, 2770, Australia
| | - Nigel H Greig
- Translational Gerontology Branch, Intramural Research Program, National Institute on Aging, NIH, Baltimore, MD, 21224, USA.
| |
Collapse
|
4
|
Ji H, Liu J, Chen Y, Yu X, Luo C, Sang L, Zhou J, Liao H. Bioinformatic Analysis of Codon Usage Bias of HSP20 Genes in Four Cruciferous Species. PLANTS (BASEL, SWITZERLAND) 2024; 13:468. [PMID: 38498447 PMCID: PMC10892267 DOI: 10.3390/plants13040468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Revised: 01/31/2024] [Accepted: 02/04/2024] [Indexed: 03/20/2024]
Abstract
Heat shock protein 20 (HSP20) serves as a chaperone and plays roles in numerous biological processes, but the codon usage bias (CUB) of its genes has remained unexplored. This study identified 140 HSP20 genes from four cruciferous species, Arabidopsis thaliana, Brassica napus, Brassica rapa, and Camelina sativa, that were identified from the Ensembl plants database, and we subsequently investigated their CUB. As a result, the base composition analysis revealed that the overall GC content of HSP20 genes was below 50%. The overall GC content significantly correlated with the constituents at three codon positions, implying that both mutation pressure and natural selection might contribute to the CUB. The relatively high ENc values suggested that the CUB of the HSP20 genes in four cruciferous species was relatively weak. Subsequently, ENc exhibited a negative correlation with gene expression levels. Analyses, including ENc-plot analysis, neutral analysis, and PR2 bias, revealed that natural selection mainly shaped the CUB patterns of HSP20 genes in these species. In addition, a total of 12 optimal codons (ΔRSCU > 0.08 and RSCU > 1) were identified across the four species. A neighbor-joining phylogenetic analysis based on coding sequences (CDS) showed that the 140 HSP20 genes were strictly and distinctly clustered into 12 subfamilies. Principal component analysis and cluster analysis based on relative synonymous codon usage (RSCU) values supported the fact that the CUB pattern was consistent with the genetic relationship at the gene level and (or) species levels. These results will not only enrich the HSP20 gene resource but also advance our understanding of the CUB of HSP20 genes, which may underlie the theoretical basis for exploration of their genetic and evolutionary pattern.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Jiayu Zhou
- School of Life Science and Engineering, Southwest Jiaotong University, Chengdu 610031, China; (H.J.); (J.L.); (Y.C.); (X.Y.); (C.L.); (L.S.)
| | - Hai Liao
- School of Life Science and Engineering, Southwest Jiaotong University, Chengdu 610031, China; (H.J.); (J.L.); (Y.C.); (X.Y.); (C.L.); (L.S.)
| |
Collapse
|
5
|
Zhang K, Wang Y, Zhang Y, Shan X. Codon usage characterization and phylogenetic analysis of the mitochondrial genome in Hemerocallis citrina. BMC Genom Data 2024; 25:6. [PMID: 38218810 PMCID: PMC10788020 DOI: 10.1186/s12863-024-01191-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 01/04/2024] [Indexed: 01/15/2024] Open
Abstract
BACKGROUND Hemerocallis citrina Baroni is a traditional vegetable crop widely cultivated in eastern Asia for its high edible, medicinal, and ornamental value. The phenomenon of codon usage bias (CUB) is prevalent in various genomes and provides excellent clues for gaining insight into organism evolution and phylogeny. Comprehensive analysis of the CUB of mitochondrial (mt) genes can provide rich genetic information for improving the expression efficiency of exogenous genes and optimizing molecular-assisted breeding programmes in H. citrina. RESULTS Here, the CUB patterns in the mt genome of H. citrina were systematically analyzed, and the possible factors shaping CUB were further evaluated. Composition analysis of codons revealed that the overall GC (GCall) and GC at the third codon position (GC3) contents of mt genes were lower than 50%, presenting a preference for A/T-rich nucleotides and A/T-ending codons in H. citrina. The high values of the effective number of codons (ENC) are indicative of fairly weak CUB. Significant correlations of ENC with the GC3 and codon counts were observed, suggesting that not only compositional constraints but also gene length contributed greatly to CUB. Combined ENC-plot, neutrality plot, and Parity rule 2 (PR2)-plot analyses augmented the inference that the CUB patterns of the H. citrina mitogenome can be attributed to multiple factors. Natural selection, mutation pressure, and other factors might play a major role in shaping the CUB of mt genes, although natural selection is the decisive factor. Moreover, we identified a total of 29 high-frequency codons and 22 optimal codons, which exhibited a consistent preference for ending in A/T. Subsequent relative synonymous codon usage (RSCU)-based cluster and mt protein coding gene (PCG)-based phylogenetic analyses suggested that H. citrina is close to Asparagus officinalis, Chlorophytum comosum, Allium cepa, and Allium fistulosum in evolutionary terms, reflecting a certain correlation between CUB and evolutionary relationships. CONCLUSIONS There is weak CUB in the H. citrina mitogenome that is subject to the combined effects of multiple factors, especially natural selection. H. citrina was found to be closely related to Asparagus officinalis, Chlorophytum comosum, Allium cepa, and Allium fistulosum in terms of their evolutionary relationships as well as the CUB patterns of their mitogenomes. Our findings provide a fundamental reference for further studies on genetic modification and phylogenetic evolution in H. citrina.
Collapse
Affiliation(s)
- Kun Zhang
- College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China.
- Key Laboratory of Organic Dry Farming for Special Crops in Datong City, Datong, Shanxi, China.
| | - Yiheng Wang
- State Key Laboratory of Vegetable Biobreeding, Tianjin Academy of Agricultural Sciences, Tianjin, China
| | - Yue Zhang
- College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China
| | - Xiaofei Shan
- College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China
| |
Collapse
|
6
|
He J, Huang Y, Li L, Lin S, Ma M, Wang Y, Lin S. Novel Plastid Genome Characteristics in Fugacium kawagutii and the Trend of Accelerated Evolution of Plastid Proteins in Dinoflagellates. Genome Biol Evol 2024; 16:evad237. [PMID: 38155596 PMCID: PMC10781511 DOI: 10.1093/gbe/evad237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/19/2023] [Accepted: 12/22/2023] [Indexed: 12/30/2023] Open
Abstract
Typical (peridinin-containing) dinoflagellates possess plastid genomes composed of small plasmids named "minicircles". Despite the ecological importance of dinoflagellate photosynthesis in corals and marine ecosystems, the structural characteristics, replication dynamics, and evolutionary forcing of dinoflagellate plastid genomes remain poorly understood. Here, we sequenced the plastid genome of the symbiodiniacean species Fugacium kawagutii and conducted comparative analyses. We identified psbT-coding minicircles, features previously not found in Symbiodiniaceae. The copy number of F. kawagutii minicircles showed a strong diel dynamics, changing between 3.89 and 34.3 copies/cell and peaking in mid-light period. We found that F. kawagutii minicircles are the shortest among all dinoflagellates examined to date. Besides, the core regions of the minicircles are highly conserved within genus in Symbiodiniaceae. Furthermore, the codon usage bias of the plastid genomes in Heterocapsaceae, Amphidiniaceae, and Prorocentraceae species are greatly influenced by selection pressure, and in Pyrocystaceae, Symbiodiniaceae, Peridiniaceae, and Ceratiaceae species are influenced by both natural selection pressure and mutation pressure, indicating a family-level distinction in codon usage evolution in dinoflagellates. Phylogenetic analysis using 12 plastid-encoded proteins and five nucleus-encoded plastid proteins revealed accelerated evolution trend of both plastid- and nucleus-encoded plastid proteins in peridinin- and fucoxanthin-dinoflagellate plastids compared to plastid proteins of nondinoflagellate algae. These findings shed new light on the structure and evolution of plastid genomes in dinoflagellates, which will facilitate further studies on the evolutionary forcing and function of the diverse dinoflagellate plastids. The accelerated evolution documented here suggests plastid-encoded sequences are potentially useful for resolving closely related dinoflagellates.
Collapse
Affiliation(s)
- Jiamin He
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Yulin Huang
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Ling Li
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Sitong Lin
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Minglei Ma
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Yujie Wang
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Senjie Lin
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
- Department of Marine Sciences, University of Connecticut, Groton, CT 06340, USA
| |
Collapse
|
7
|
Ji C, Zhang Y, Feng Y, Zhang X, Gong F, Yao H, Sun X, Pan Z. Circular replication-associated protein-encoding single-stranded DNA virus with risk of spillover is widely prevalent in domestic animals in China. Virus Res 2024; 339:199204. [PMID: 37607596 PMCID: PMC10654594 DOI: 10.1016/j.virusres.2023.199204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 08/18/2023] [Accepted: 08/19/2023] [Indexed: 08/24/2023]
Abstract
Circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA viruses are highly diverse and have a broad range of hosts. In this study, we report the detection of Bo-Circo-like virus AH20-1 in the feces of diarrheal cattle. The virus has a circular genome of 3,912 nucleotides, three major putative open reading frames, and encodes a Rep gene of 310 amino acids. We found that the virus is closely related to the Bo-Circo-like virus CH strain, which belongs to the novel Kirkoviridae family. Furthermore, we conducted a nationwide surveillance program and found that the virus is prevalent in China (23.6%, 205/868), with the BCLa subtype being the predominant strain. Our findings suggest that the virus can infect sheep, highlighting the potential for cross-species transmission. Our pressure analysis indicates that the CRESS-DNA Kirkoviridae family has broad host adaptation, and that selection pressure played an important role in the evolution of its Rep genes. Our study underscores the need for continued epidemiological surveillance of this virus due to its widespread prevalence in our ruminant population and potential for cross-species transmission.
Collapse
Affiliation(s)
- Chengyuan Ji
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Yao Zhang
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Yiqiu Feng
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Xinqin Zhang
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Fengju Gong
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Huochun Yao
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China
| | - Xueqiang Sun
- China Animal Health and Epidemiology Center, Key Laboratory of Animal Biosafety Risk Prevention and Control (South), Qingdao 266000, China.
| | - Zihao Pan
- MOE Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing 210095, China..
| |
Collapse
|
8
|
Desingu PA, Rubeni TP, Nagarajan K, Sundaresan NR. Sign of APOBEC editing, purifying selection, frameshift, and in-frame nonsense mutations in the microevolution of lumpy skin disease virus. Front Microbiol 2023; 14:1214414. [PMID: 38033577 PMCID: PMC10682384 DOI: 10.3389/fmicb.2023.1214414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 10/25/2023] [Indexed: 12/02/2023] Open
Abstract
The lumpy skin disease virus (LSDV), which mostly affects ruminants and causes huge-economic loss, was endemic in Africa, caused outbreaks in the Middle East, and was recently detected in Russia, Serbia, Greece, Bulgaria, Kazakhstan, China, Taiwan, Vietnam, Thailand, and India. However, the role of evolutionary drivers such as codon selection, negative/purifying selection, APOBEC editing, and genetic variations such as frameshift and in-frame nonsense mutations in the LSDVs, which cause outbreaks in cattle in various countries, are still largely unknown. In the present study, a frameshift mutation in LSDV035, LSDV019, LSDV134, and LSDV144 genes and in-frame non-sense mutations in LSDV026, LSDV086, LSDV087, LSDV114, LSDV130, LSDV131, LSDV145, LSDV154, LSDV155, LSDV057, and LSDV081 genes were revealed among different clusters. Based on the available complete genome sequences, the prototype wild-type cluster-1.2.1 virus has been found in other than Africa only in India, the wild-type cluster-1.2.2 virus found in Africa were spread outside Africa, and the recombinant viruses spreading only in Asia and Russia. Although LSD viruses circulating in different countries form a specific cluster, the viruses detected in each specific country are distinguished by frameshift and in-frame nonsense mutations. Furthermore, the present study has brought to light that the selection pressure for codons usage bias is mostly exerted by purifying selection, and this process is possibly caused by APOBEC editing. Overall, the present study sheds light on microevolutions in LSDV, expected to help in future studies towards disturbed ORFs, epidemiological diagnostics, attenuation/vaccine reverts, and predicting the evolutionary direction of LSDVs.
Collapse
Affiliation(s)
| | - T. P. Rubeni
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bengaluru, India
| | - K. Nagarajan
- Department of Veterinary Pathology, Madras Veterinary College, Chennai, India
- Veterinary and Animal Sciences University (TANUVAS), Chennai, India
| | | |
Collapse
|
9
|
Li Y, Hu X, Xiao M, Huang J, Lou Y, Hu F, Fu X, Li Y, He H, Cheng J. An analysis of codon utilization patterns in the chloroplast genomes of three species of Coffea. BMC Genom Data 2023; 24:42. [PMID: 37558997 PMCID: PMC10413492 DOI: 10.1186/s12863-023-01143-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/28/2023] [Indexed: 08/11/2023] Open
Abstract
BACKGROUND The chloroplast genome of plants is known for its small size and low mutation and recombination rates, making it a valuable tool in plant phylogeny, molecular evolution, and population genetics studies. Codon usage bias, an important evolutionary feature, provides insights into species evolution, gene function, and the expression of exogenous genes. Coffee, a key crop in the global tropical agricultural economy, trade, and daily life, warrants investigation into its codon usage bias to guide future research, including the selection of efficient heterologous expression systems for coffee genetic transformation. RESULTS Analysis of the codon utilization patterns in the chloroplast genomes of three Coffea species revealed a high degree of similarity among them. All three species exhibited similar base compositions, with high A/T content and low G/C content and a preference for A/T-ending codons. Among the 30 high-frequency codons identified, 96.67% had A/T endings. Fourteen codons were identified as ideal. Multiple mechanisms, including natural selection, were found to influence the codon usage patterns in the three coffee species, as indicated by ENc-GC3s mapping, PR2 analysis, and neutral analysis. Nicotiana tabacum and Saccharomyces cerevisiae have potential value as the heterologous expression host for three species of coffee genes. CONCLUSION This study highlights the remarkable similarity in codon usage patterns among the three coffee genomes, primarily driven by natural selection. Understanding the gene expression characteristics of coffee and elucidating the laws governing its genetic evolution are facilitated by investigating the codon preferences in these species. The findings can enhance the efficacy of exogenous gene expression and serve as a basis for future studies on coffee evolution.
Collapse
Affiliation(s)
- Yaqi Li
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Xiang Hu
- Institute of Tropical Eco-Agricultural, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, China
| | - Mingkun Xiao
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Jiaxiong Huang
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Yuqiang Lou
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Faguang Hu
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Xingfei Fu
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Yanan Li
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
| | - Hongyan He
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China
- Yunnan Academy of Agricultural Engineering, Kunming, Yunnan, China
| | - Jinhuan Cheng
- Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, China.
- Yunnan Academy of Agricultural Engineering, Kunming, Yunnan, China.
| |
Collapse
|
10
|
Fu Y, Liang F, Li C, Warren A, Shin MK, Li L. Codon Usage Bias Analysis in Macronuclear Genomes of Ciliated Protozoa. Microorganisms 2023; 11:1833. [PMID: 37513005 PMCID: PMC10384029 DOI: 10.3390/microorganisms11071833] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/12/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
Ciliated protozoa (ciliates) are unicellular eukaryotes, several of which are important model organisms for molecular biology research. Analyses of codon usage bias (CUB) of the macronuclear (MAC) genome of ciliates can promote a better understanding of the genetic mode and evolutionary history of these organisms and help optimize codons to improve gene editing efficiency in model ciliates. In this study, the following indices were calculated: the guanine-cytosine (GC) content, the frequency of the nucleotides at the third position of codons (T3, C3, A3, G3), the effective number of codons (ENc), GC content at the 3rd position of synonymous codons (GC3s), and the relative synonymous codon usage (RSCU). Parity rule 2 plot analysis, Neutrality plot analysis, ENc plot analysis, and correlation analysis were employed to explore the main influencing factors of CUB. The results showed that the GC content in the MAC genomes of each of 21 ciliate species, the genomes of which were relatively complete, was lower than 50%, and the base compositions of GC and GC3s were markedly distinct. Synonymous codon analysis revealed that the codons in most of the 21 ciliates ended with A or T and four codons were the general putative optimal codons. Collectively, our results indicated that most of the ciliates investigated preferred using the codons with anof AT-ending and that codon usage bias was affected by gene mutation and natural selection.
Collapse
Affiliation(s)
- Yu Fu
- Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
| | - Fasheng Liang
- Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
| | - Congjun Li
- Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
| | - Alan Warren
- Department of Life Sciences, Natural History Museum, London SW7 5BD, UK
| | - Mann Kyoon Shin
- Department of Biology, University of Ulsan, Ulsan 44610, Republic of Korea
| | - Lifang Li
- Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
| |
Collapse
|
11
|
Wang ZK, Liu Y, Zheng HY, Tang MQ, Xie SQ. Comparative Analysis of Codon Usage Patterns in Nuclear and Chloroplast Genome of Dalbergia (Fabaceae). Genes (Basel) 2023; 14:genes14051110. [PMID: 37239470 DOI: 10.3390/genes14051110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Revised: 05/04/2023] [Accepted: 05/16/2023] [Indexed: 05/28/2023] Open
Abstract
The Dalbergia plants are widely distributed across more than 130 tropical and subtropical countries and have significant economic and medicinal value. Codon usage bias (CUB) is a critical feature for studying gene function and evolution, which can provide a better understanding of biological gene regulation. In this study, we comprehensively analyzed the CUB patterns of the nuclear genome, chloroplast genome, and gene expression, as well as systematic evolution of Dalbergia species. Our results showed that the synonymous and optimal codons in the coding regions of both nuclear and chloroplast genome of Dalbergia preferred ending with A/U at the third codon base. Natural selection was the primary factor affecting the CUB features. Furthermore, in highly expressed genes of Dalbergia odorifera, we found that genes with stronger CUB exhibited higher expression levels, and these highly expressed genes tended to favor the use of G/C-ending codons. In addition, the branching patterns of the protein-coding sequences and the chloroplast genome sequences were very similar in the systematic tree, and different with the cluster from the CUB of the chloroplast genome. This study highlights the CUB patterns and features of Dalbergia species in different genomes, explores the correlation between CUB preferences and gene expression, and further investigates the systematic evolution of Dalbergia, providing new insights into codon biology and the evolution of Dalbergia plants.
Collapse
Affiliation(s)
- Zu-Kai Wang
- Key Laboratory of Genetics and Germplasm Innovation of Tropical Special Forest Trees and Ornamental Plants (Ministry of Education), Hainan Key Laboratory for Biology of Tropical Ornamental Plant Germplasm, School of Forestry, Hainan University, Haikou 570228, China
| | - Yi Liu
- Key Laboratory of Genetics and Germplasm Innovation of Tropical Special Forest Trees and Ornamental Plants (Ministry of Education), Hainan Key Laboratory for Biology of Tropical Ornamental Plant Germplasm, School of Forestry, Hainan University, Haikou 570228, China
| | - Hao-Yue Zheng
- Key Laboratory of Genetics and Germplasm Innovation of Tropical Special Forest Trees and Ornamental Plants (Ministry of Education), Hainan Key Laboratory for Biology of Tropical Ornamental Plant Germplasm, School of Forestry, Hainan University, Haikou 570228, China
| | - Min-Qiang Tang
- Key Laboratory of Genetics and Germplasm Innovation of Tropical Special Forest Trees and Ornamental Plants (Ministry of Education), Hainan Key Laboratory for Biology of Tropical Ornamental Plant Germplasm, School of Forestry, Hainan University, Haikou 570228, China
| | - Shang-Qian Xie
- Key Laboratory of Genetics and Germplasm Innovation of Tropical Special Forest Trees and Ornamental Plants (Ministry of Education), Hainan Key Laboratory for Biology of Tropical Ornamental Plant Germplasm, School of Forestry, Hainan University, Haikou 570228, China
| |
Collapse
|
12
|
Khandia R, Pandey MK, Rzhepakovsky IV, Khan AA, Alexiou A. Synonymous Codon Variant Analysis for Autophagic Genes Dysregulated in Neurodegeneration. Mol Neurobiol 2023; 60:2252-2267. [PMID: 36637744 DOI: 10.1007/s12035-022-03081-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 09/27/2022] [Indexed: 01/14/2023]
Abstract
Neurodegenerative disorders are often a culmination of the accumulation of abnormally folded proteins and defective organelles. Autophagy is a process of removing these defective proteins, organelles, and harmful substances from the body, and it works to maintain homeostasis. If autophagic removal of defective proteins has interfered, it affects neuronal health. Some of the autophagic genes are specifically found to be associated with neurodegenerative phenotypes. Non-functional, mutated, or gene copies having silent mutations, often termed synonymous variants, might explain this. However, these synonymous variant which codes for exactly similar proteins have different translation rates, stability, and gene expression profiling. Hence, it would be interesting to study the pattern of synonymous variant usage. In the study, synonymous variant usage in various transcripts of autophagic genes ATG5, ATG7, ATG8A, ATG16, and ATG17/FIP200 reported to cause neurodegeneration (if dysregulated) is studied. These genes were analyzed for their synonymous variant usage; nucleotide composition; any possible nucleotide skew in a gene; physical properties of autophagic protein including GRAVY and AROMA; hydropathicity; instability index; and frequency of acidic, basic, neutral amino acids; and gene expression level. The study will help understand various evolutionary forces acting on these genes and the possible augmentation of a gene if showing unusual behavior.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, 462026, India.
| | - Megha Katare Pandey
- Department of Translational Medicine, All India Institute of Medical Sciences, Bhopal, 462020, India
| | | | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451, Saudi Arabia.
| | - Athanasios Alexiou
- Novel Global Community Educational Foundation, Hebersham, Australia
- AFNP Med, Wien, Austria
| |
Collapse
|
13
|
Xiong B, Wang T, Huang S, Liao L, Wang X, Deng H, Zhang M, He J, Sun G, He S, Wang Z. Analysis of Codon Usage Bias in Xyloglucan Endotransglycosylase (XET) Genes. Int J Mol Sci 2023; 24:ijms24076108. [PMID: 37047091 PMCID: PMC10094191 DOI: 10.3390/ijms24076108] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 03/17/2023] [Accepted: 03/21/2023] [Indexed: 04/14/2023] Open
Abstract
Xyloglucan endotransglycosylase (XET) genes are widely distributed in most plants, but the codon usage bias of XET genes has remained uncharacterized. Thus, we analyzed the codon usage bias using 4500 codons of 20 XET genes to elucidate the genetic and evolutionary patterns. Phylogenetic and hierarchical cluster analyses revealed that the 20 XET genes belonged to two groups. The closer the genetic distance, the more similar the codon usage preference. The codon usage bias of most XET genes was weak, but there was also some codon usage bias. AGA, AGG, AUC, and GUG were the top four codons (RSCU > 1.5) in the 20 XET genes. CitXET had a stronger codon usage bias, and there were eight optimal codons of CitXET (i.e., AGA, AUU, UCU, CUU, CCA, GCU, GUU, and AAA). The RSCU values underwent a correspondence analysis. The two main factors affecting codon usage bias (i.e., Axes 1 and 2) accounted for 54.8% and 17.6% of the total variation, respectively. Multiple correspondence analysis revealed that XET genes were widely distributed, with Group 1 genes being closer to Axis 1 than Group 2 genes, which were closer to Axis 2. Codons with A/U at the third codon position were distributed closer to Axis 1 than codons with G/C at the third codon position. PgXET, ZmXET, VlXET, VrXET, and PcXET were biased toward codons ending with G/C. In contrast, CitXET, DpXET, and BrpXET were strongly biased toward codons ending with A/U, indicating that these XET genes have a strong codon usage bias. Translational selection and base composition (especially A and U at the third codon position), followed by mutation pressure and natural selection, may be the most important factors affecting codon usage of 20 XET genes. These results may be useful in clarifying the codon usage bias of XET genes and the relevant evolutionary characteristics.
Collapse
Affiliation(s)
- Bo Xiong
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Tie Wang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Shengjia Huang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Ling Liao
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Xun Wang
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Honghong Deng
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Mingfei Zhang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Jiaxian He
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Guochao Sun
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Siya He
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| | - Zhihui Wang
- College of Horticulture, Sichuan Agricultural University, Chengdu 611130, China
- Institute of Pomology and Olericulture, Sichuan Agricultural University, Chengdu 611130, China
| |
Collapse
|
14
|
Bi D, Han S, Zhou J, Zhao M, Zhang S, Kan X. Codon Usage Analyses Reveal the Evolutionary Patterns among Plastid Genes of Saxifragales at a Larger-Sampling Scale. Genes (Basel) 2023; 14:genes14030694. [PMID: 36980966 PMCID: PMC10048229 DOI: 10.3390/genes14030694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/07/2023] [Accepted: 03/10/2023] [Indexed: 03/14/2023] Open
Abstract
Saxifragales is a 15-family order of early-divergent Eudicots with a rich morphological diversity and an ancient rapid radiation. Codon usage bias (CUB) analyses have emerged as an essential tool for understanding the evolutionary dynamics in genes. Thus far, the codon utilization patterns had only been reported in four separate genera within Saxifragales. This study provides a comprehensive assessment of the codon manipulation based on 50 plastid genes, covering 11 constituent families at a larger sampling scale. Our results first showed a high preference for AT bases and AT-ending codons. We then used effective number of codons (ENC) to assess a range of codon bias levels in the plastid genes. We also detected high-informative intrafamilial differences of ENC in three families. Subsequently, parity rule 2 (PR2) plot analyses revealed both family-unique and order-shared bias patterns. Most importantly, the ENC plots and neutrality analyses collectively supported the dominant roles of selection in the CUB of Saxifragales plastid genes. Notably, the phylogenetic affinities inferred by both ML and BI methods were consistent with each other, and they all comprised two primary clades and four subclades. These findings significantly enhance our understanding of the evolutionary processes of the Saxifrage order, and could potentially inspire more CUB analyses at higher taxonomic levels.
Collapse
Affiliation(s)
- De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jun Zhou
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Maojin Zhao
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Correspondence: ; Tel.: +86-139-5537-2268
| |
Collapse
|
15
|
Fu X, Xie DF, Zhou YY, Cheng RY, Zhang XY, Zhou SD, He XJ. Phylogeny and adaptive evolution of subgenus Rhizirideum (Amaryllidaceae, Allium) based on plastid genomes. BMC PLANT BIOLOGY 2023; 23:70. [PMID: 36726056 PMCID: PMC9890777 DOI: 10.1186/s12870-022-03993-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 12/09/2022] [Indexed: 06/01/2023]
Abstract
The subgenus Rhizirideum in the genus Allium consists of 38 species worldwide and forms five sections (A. sect. Rhizomatosa, A. sect. Tenuissima, A. sect. Rhizirideum, A. sect. Eduardia, and A. sect. Caespitosoprason), A. sect. Caespitosoprason being merged into A. sect. Rhizomatosa recently. Previous studies on this subgenus mainly focused on separate sections. To investigate the inter-section and inter-subgenera phylogenetic relationships and adaptive evolution of A. subg. Rhizirideum, we selected thirteen representative species, which cover five sections of this subgenus and can represent four typical phenotypes of it. We conducted the comparative plastome analysis with our thirteen plastomes. And phylogenetic inferences with CDSs and complete sequences of plastomes of our thirteen species and another fifty-four related species were also performed. As a result, the A. subg. Rhizirideum plastomes were relatively conservative in structure, IR/SC borders, codon usage, and repeat sequence. In phylogenetic results, the inter-subgenera relationships among A. subg. Rhizirideum and other genus Allium subgenera were generally similar to the previous reports. In contrast, the inter-section relationships within our subgenus A. subg. Rhizirideum were newly resolved in this study. A. sect. Rhizomatosa and A. sect. Tenuissima were sister branches, which were then clustered with A. sect. Rhizirideum and A. sect. Eduardia successively. However, Allium Polyrhizum Turcz. ex Regel, type species of A. sect. Caespitosoprason, was resolved as the basal taxon of A. subg. Rhizirideum. Allium siphonanthum J. M. Xu was also found in clade A. subg. Cyathophora instead of clade A. subg. Rhizirideum. The selective pressure analysis was also conducted, and most protein-coding genes were under purifying selection. At the same time, just one gene, ycf2, was found under positive selection, and another three genes (rbcL, ycf1a, ycf1b) presented relaxed selection, which were all involved in the photosynthesis. The low temperature, dry climate, and high altitude of the extreme habitats where A. subg. Rhizirideum species grow might impose intense natural selection forces on their plastome genes for photosynthesis. In summary, our research provides new insights into the phylogeny and adaptive evolution of A. subg. Rhizirideum. Moreover, we suggest that the positions of the A. subg. Rhizirideum species A. polyrhizum and A. siphonanthum should be reconsidered.
Collapse
Affiliation(s)
- Xiao Fu
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Deng-Feng Xie
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Yu-Yang Zhou
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Rui-Yu Cheng
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Xiang-Yi Zhang
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Song-Dong Zhou
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China
| | - Xing-Jin He
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, 610065, Chengdu, Sichuan, The People's Republic of China.
| |
Collapse
|
16
|
Gupta MK, Vadde R. Next-generation development and application of codon model in evolution. Front Genet 2023; 14:1091575. [PMID: 36777719 PMCID: PMC9911445 DOI: 10.3389/fgene.2023.1091575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open
Abstract
To date, numerous nucleotide, amino acid, and codon substitution models have been developed to estimate the evolutionary history of any sequence/organism in a more comprehensive way. Out of these three, the codon substitution model is the most powerful. These models have been utilized extensively to detect selective pressure on a protein, codon usage bias, ancestral reconstruction and phylogenetic reconstruction. However, due to more computational demanding, in comparison to nucleotide and amino acid substitution models, only a few studies have employed the codon substitution model to understand the heterogeneity of the evolutionary process in a genome-scale analysis. Hence, there is always a question of how to develop more robust but less computationally demanding codon substitution models to get more accurate results. In this review article, the authors attempted to understand the basis of the development of different types of codon-substitution models and how this information can be utilized to develop more robust but less computationally demanding codon substitution models. The codon substitution model enables to detect selection regime under which any gene or gene region is evolving, codon usage bias in any organism or tissue-specific region and phylogenetic relationship between different lineages more accurately than nucleotide and amino acid substitution models. Thus, in the near future, these codon models can be utilized in the field of conservation, breeding and medicine.
Collapse
|
17
|
Sahoo S, Rakshit R. The pattern of coding sequences in the chloroplast genome of Atropa belladonna and a comparative analysis with other related genomes in the nightshade family. Genomics Inform 2022; 20:e43. [PMID: 36617650 PMCID: PMC9847383 DOI: 10.5808/gi.22045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/12/2022] [Indexed: 12/31/2022] Open
Abstract
Atropa belladonna is a valuable medicinal plant and a commercial source of tropane alkaloids, which are frequently utilized in therapeutic practice. In this study, bioinformaticmethodologies were used to examine the pattern of coding sequences and the factors thatmight influence codon usage bias in the chloroplast genome of Atropa belladonna andother nightshade genomes. The chloroplast engineering being a promising field in modernbiotechnology, the characterization of chloroplast genome is very important. The resultsrevealed that the chloroplast genomes of Nicotiana tabacum, Solanum lycopersicum, Capsicum frutescens, Datura stramonium, Lyciumbarbarum, Solanum melongena, and Solanumtuberosum exhibited comparable codon usage patterns. In these chloroplast genomes, weobserved a weak codon usage bias. According to the correspondence analysis, the genesisof the codon use bias in these chloroplast genes might be explained by natural selection,directed mutational pressure, and other factors. GC12 and GC3S were shown to have nomeaningful relationship. Further research revealed that natural selection primarily shapedthe codon usage in A. belladonna and other nightshade genomes for translational efficiency. The sequencing properties of these chloroplast genomes were also investigated by investing the occurrences of palindromes and inverted repeats, which would be useful forfuture research on medicinal plants.
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat 743372, India,*Corresponding author E-mail:
| | - Ria Rakshit
- Department of Botany, Baruipur College, Baruipur 743610, India
| |
Collapse
|
18
|
Genetic Diversity and Characterization of Circular Replication (Rep)-Encoding Single-Stranded (CRESS) DNA Viruses. Microbiol Spectr 2022; 10:e0105722. [PMID: 36346238 PMCID: PMC9769708 DOI: 10.1128/spectrum.01057-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The CRESS-DNA viruses are the ubiquitous virus detected in almost all eukaryotic life trees and play an essential role in the maintaining ecosystem of the globe. Still, their genetic diversity is not fully understood. Here, we bring to light the genetic diversity of replication (Rep) and capsid (Cap) proteins of CRESS-DNA viruses. We divided the Rep protein of the CRESS-DNA virus into 10 clusters using CLANS and phylogenetic analyses. Also, most of the Rep protein in Rep cluster 1 (R1) and R2 (Circoviridae, Smacoviridae, Nanoviridae, and CRESSV1-5) contain the Viral_Rep superfamily and P-loop_NTPase superfamily domains, while the Rep protein of viruses in other clusters has no such characterized functional domain. The Circoviridae, Nanoviridae, and CRESSV1-3 viruses contain two domains, such as Viral_Rep and P-loop_NTPase; the CRESSV4 and CRESSV5 viruses have only the Viral_Rep domain; most of the sequences in the pCRESS-related group have only P-loop_NTPase; and Smacoviridae do not have these two domains. Further, we divided the Cap protein of the CRESS-DNA virus into 20 clusters using CLANS and phylogenetic analyses. The Rep and Cap proteins of Circoviridae and Smacoviridae are grouped into a specific cluster. Cap protein of CRESS-DNA viruses grouped with one cluster and Rep protein with another cluster. Further, our study reveals that selection pressure plays a significant role in the evolution of CRESS-DNA viruses' Rep and Cap genes rather than mutational pressure. We hope this study will help determine the genetic diversity of CRESS-DNA viruses as more sequences are discovered in the future. IMPORTANCE The genetic diversity of CRESS-DNA viruses is not fully understood. CRESS-DNA viruses are classified as CRESSV1 to CRESSV6 using only Rep protein. This study revealed that the Rep protein of the CRESS-DNA viruses is classified as CRESSV1 to CRESSV6 groups and the new Smacoviridae-related, CRESSV2-related, pCRESS-related, Circoviridae-related, and 1 to 4 outgroups, according to the Viral_Rep and P-loop_NTPase domain organization, CLANS, and phylogenetic analysis. Furthermore, for the first time in this study, the Cap protein of CRESS-DNA viruses was classified into 20 distinct clusters by CLANS and phylogenetic analysis. Through this classification, the genetic diversity of CRESS-DNA viruses clarifies the possibility of recombinations in Cap and Rep proteins. Finally, it has been shown that selection pressure plays a significant role in the evolution and genetic diversity of Cap and Rep proteins. This study explains the genetic diversity of CRESS-DNA viruses and hopes that it will help classify future detected viruses.
Collapse
|
19
|
Desingu PA, Rubeni TP, Sundaresan NR. Evolution of monkeypox virus from 2017 to 2022: In the light of point mutations. Front Microbiol 2022; 13:1037598. [PMID: 36590408 PMCID: PMC9795006 DOI: 10.3389/fmicb.2022.1037598] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 11/24/2022] [Indexed: 12/23/2022] Open
Abstract
Monkeypox virus (MPXV) causing multi-country outbreak-2022 is related to viruses caused outbreak-2017-2018 in West Africa. Still not fully understood which proteins of the MPXV discovered in Nigeria in 2017 have mutated through different lineages to the extent that it could cause a multi-country outbreak in 2022; similarly, codon usage bias, host adaptation indices, and the role of selection or mutation pressure in the mutated genes are also not fully studied. Here we report that according to the available sequence data this monkeypox virus acquires point mutations in multiple proteins in each period, and these point mutations accumulate and become a virus that can root outbreak-2022. Viruses exported from Nigeria to Singapore, Israel, and the United Kingdom in 2018-2019 were developed as evolutionary ancestors to B.1 viruses (MPXVs causing multi-country outbreak-2022) through MPXV/United States/2021/MD virus. Although these exported viruses have different amino acid mutations in different proteins, amino acid mutations in 10 proteins are common among them. The MPXV-United Kingdom-P2 virus evolved with only mutations in these 10 proteins and further evolved into MPXV/United States/2021/MD with amino acid mutations in 26 (including amino acid mutations in 10 proteins of the MPXV-United States-P2) proteins. It is noteworthy that specific amino acid mutations in these 22/26 (presence in MPXV/United States/2021/MD) proteins are present in B.1 viruses. Further, analysis of Relative Synonymous Codon Usage (RSCU), Synonymous Codon Usage Fraction (SCUF), and Effective Number of Codons (ENc) revealed codon usage bias in genes that exhibited nucleotide mutations in lineage B.1. Also, host adaptation indices analyzes such as Codon Adaptation Index (CAI), Expected-CAI (eCAI), Relative Codon Deoptimization Index (RCDI) and Expected value for the RCDI (eRCDI) analyzes reveal that the genes that demonstrated nucleotide mutations in lineage B.1 are favorable for human adaptation. Similarly, ENc-GC3s plot, Neutrality plot, and Parity Rule 2 (PR2)-bias plot analyzes suggest a major role of selection pressure than mutation pressure in the evolution of genes displaying nucleotide mutations in lineage B.1. Overall, from 2017 to 2022, MPXV's mutation and spread suggests that this virus continues to evolve through point mutation in the genes according to the available sequence data.
Collapse
|
20
|
Comparison of Boraginales Plastomes: Insights into Codon Usage Bias, Adaptive Evolution, and Phylogenetic Relationships. DIVERSITY 2022. [DOI: 10.3390/d14121104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The Boraginales (Boraginaceae a.l.) comprise more than 2450 species worldwide. However, little knowledge exists of the characteristics of the complete plastid genome. In this study, three new sequences representing the first pt genome of Heliotropiaceae and Cordiaceae were assembled and compared with other Boraginales species. The pt genome sizes of Cordia dichotoma, Heliotropium arborescens, and Tournefortia montana were 151,990 bp, 156,243 bp, and 155,891 bp, respectively. Multiple optimal codons were identified, which may provide meaningful information for enhancing the gene expression of Boraginales species. Furthermore, codon usage bias analyses revealed that natural selection and other factors may dominate codon usage patterns in the Boraginales species. The boundaries of the IR/LSC and IR/SSC regions were significantly different, and we also found a signal of obvious IR region expansion in the pt genome of Nonea vesicaria and Arnebia euchroma. Genes with high nucleic acid diversity (pi) values were also calculated, which may be used as potential DNA barcodes to investigate the phylogenetic relationships in Boraginales. psaI, rpl33, rpl36, and rps19 were found to be under positive selection, and these genes play an important role in our understanding of the adaptive evolution of the Boraginales species. Phylogenetic analyses implied that Boraginales can be divided into two groups. The existence of two subfamilies (Lithospermeae and Boragineae) in Boraginaceae is also strongly supported. Our study provides valuable information on pt genome evolution and phylogenetic relationships in the Boraginales species.
Collapse
|
21
|
Gao Y, Lu Y, Song Y, Jing L. Analysis of codon usage bias of WRKY transcription factors in Helianthus annuus. BMC Genom Data 2022; 23:46. [PMID: 35725374 PMCID: PMC9210703 DOI: 10.1186/s12863-022-01064-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 06/13/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
The phenomenon of codon usage bias is known to exist in many genomes and is mainly determined by mutation and selection. Codon usage bias analysis is a suitable strategy for identifying the principal evolutionary driving forces in different organisms. Sunflower (Helianthus annuus L.) is an annual crop that is cultivated worldwide as ornamentals, food plants and for their valuable oil. The WRKY family genes in plants play a central role in diverse regulation and multiple stress responses. Evolutionary analysis of WRKY family genes of H. annuus can provide rich genetic information for developing hybridization resources of the genus Helianthus.
Results
Bases composition analysis showed the average GC content of WRKY genes of H. annuus was 43.42%, and the average GC3 content was 39.60%, suggesting that WRKY gene family prefers A/T(U) ending codons. There were 29 codons with relative synonymous codon usage (RSCU) greater than 1 and 22 codons ending with A and U base. The effective number of codons (ENC) and codon adaptation index (CAI) in WRKY genes ranged from 43.47–61.00 and 0.14–0.26, suggesting that the codon bias was weak and WRKY genes expression level was low. Neutrality analysis found a significant correlation between GC12 and GC3. ENC-plot showed most genes on or close to the expected curve, suggesting that mutational bias played a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that the usage of AT and GC was disproportionate. A total of three codons were identified as the optimal codons.
Conclusion
Apart from natural selection effects, most of the genetic evolution in the H. annuus WRKY genome might be driven by mutation pressure. Our results provide a theoretical foundation for elaborating the genetic architecture and mechanisms of H. annuus and contributing to enrich H. annuus genetic resources.
Collapse
|
22
|
Bansal S, Mallikarjuna MG, Balamurugan A, Nayaka SC, Prakash G. Composition and Codon Usage Pattern Results in Divergence of the Zinc Binuclear Cluster ( Zn(II)2Cys6) Sequences among Ascomycetes Plant Pathogenic Fungi. J Fungi (Basel) 2022; 8:1134. [PMID: 36354901 PMCID: PMC9694491 DOI: 10.3390/jof8111134] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 10/22/2022] [Accepted: 10/23/2022] [Indexed: 07/29/2023] Open
Abstract
Zinc binuclear cluster proteins (ZBC; Zn(II)2Cys6) are unique to the fungi kingdom and associated with a series of functions, viz., the utilization of macromolecules, stress tolerance, and most importantly, host-pathogen interactions by imparting virulence to the pathogen. Codon usage bias (CUB) is the phenomenon of using synonymous codons in a non-uniform fashion during the translation event, which has arisen because of interactions among evolutionary forces. The Zn(II)2Cys6 coding sequences from nine Ascomycetes plant pathogenic species and model system yeast were analysed for compositional and codon usage bias patterns. The clustering analysis diverged the Ascomycetes fungi into two clusters. The nucleotide compositional and relative synonymous codon usage (RSCU) analysis indicated GC biasness toward Ascomycetes fungi compared with the model system S. cerevisiae, which tends to be AT-rich. Further, plant pathogenic Ascomycetes fungi belonging to cluster-2 showed a higher number of GC-rich high-frequency codons than cluster-1 and was exclusively AT-rich in S. cerevisiae. The current investigation also showed the mutual effect of the two evolutionary forces, viz. natural selection and compositional constraints, on the CUB of Zn(II)2Cys6 genes. The perseverance of GC-rich codons of Zn(II)2Cys6 in Ascomycetes could facilitate the invasion process. The findings of the current investigation show the role of CUB and nucleotide composition in the evolutionary divergence of Ascomycetes plant pathogens and paves the way to target specific codons and sequences to modulate host-pathogen interactions through genome editing and functional genomics tools.
Collapse
Affiliation(s)
- Shilpi Bansal
- Division of Plant Pathology, ICAR—Indian Agricultural Research Institute, New Delhi 110012, India
| | | | - Alexander Balamurugan
- Division of Plant Pathology, ICAR—Indian Agricultural Research Institute, New Delhi 110012, India
| | - S. Chandra Nayaka
- Department of Studies in Applied Botany and Biotechnology, University of Mysore, Mysore 570005, India
| | - Ganesan Prakash
- Division of Plant Pathology, ICAR—Indian Agricultural Research Institute, New Delhi 110012, India
| |
Collapse
|
23
|
Khandia R, Pandey M, Rzhepakovsky IV, Khan AA, Legaz I. Codon Pattern and Compositional Constraints Determination of Genes Associated with Chronic Periodontitis. Genes (Basel) 2022; 13:genes13111934. [PMID: 36360171 PMCID: PMC9689538 DOI: 10.3390/genes13111934] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/10/2022] [Accepted: 10/18/2022] [Indexed: 11/29/2022] Open
Abstract
Genome-wide association studies showed the relationship of NIN, ABHD12B, WHAMM, AP3B2, and SIGLEC5 with chronic periodontitis. The study’s objective was to investigate different molecular patterns and evolutionary forces acting on the mentioned genes. The investigation of molecular patterns encompasses the study of compositional parameters, expression profile, physical properties of genes, codon preferences, degree of codon bias, determination of the most influential codons, and assessment of actions of evolutionary forces, such as mutations and natural selection. The overall compositional analysis revealed the dominance of A and G nucleotides compared to T and C. A relatively low codon usage bias is observed. The CTG codon is the most overused codon, followed by TCC. The genes, AP3B2 and SIGLEC5, preferred GC-ending codons, while NIN, ABHD12B, and WHAMM preferred AT-ending codons. The presence of directional mutational force and natural selection was found to operate codon usage in genes envisaged, and selective forces were dominant over mutational forces. Apart from mutation and selection forces, compositional constraints also played imperative roles. The study enriched our knowledge of specific molecular patterns associated with the set of genes significantly associated with chronic periodontitis. Further studies are warranted to identify more genetic signatures associated with the disease.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah Universty, Bhopal 462026, India
- Correspondence: or (R.K.); (I.L.)
| | - Megha Pandey
- Translational Medicine Center, All India Institute of Medical Sciences, Bhopal 462020, India
| | | | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia
| | - Isabel Legaz
- Department of Legal and Forensic Medicine, Biomedical Research Institute (IMIB), Regional Campus of International Excellence “Campus Mare Nostrum”, Faculty of Medicine, University of Murcia, E-30120 Murcia, Spain
- Correspondence: or (R.K.); (I.L.)
| |
Collapse
|
24
|
Zhang Z, Quan S, Niu J, Guo C, Kang C, Liu J, Yuan X. Comprehensive Identification and Analyses of the GRF Gene Family in the Whole-Genome of Four Juglandaceae Species. Int J Mol Sci 2022; 23:ijms232012663. [PMID: 36293519 PMCID: PMC9604165 DOI: 10.3390/ijms232012663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 10/15/2022] [Accepted: 10/19/2022] [Indexed: 11/05/2022] Open
Abstract
The GRF gene family plays an important role in plant growth and development as regulators involved in plant hormone signaling and metabolism. However, the Juglandaceae GRF gene family remains to be studied. Here, we identified 15, 15, 19, and 20 GRF genes in J. regia, C. illinoinensis, J. sigillata, and J. mandshurica, respectively. The phylogeny shows that the Juglandaceae family GRF is divided into two subfamilies, the ε-group and the non-ε-group, and that selection pressure analysis did not detect amino acid loci subject to positive selection pressure. In addition, we found that the duplications of the Juglandaceae family GRF genes were all segmental duplication events, and a total of 79 orthologous gene pairs and one paralogous homologous gene pair were identified in four Juglandaceae families. The Ka/KS ratios between these homologous gene pairs were further analyzed, and the Ka/KS values were all less than 1, indicating that purifying selection plays an important role in the evolution of the Juglandaceae family GRF genes. The codon bias of genes in the GRF family of Juglandaceae species is weak, and is affected by both natural selection pressure and base mutation, and translation selection plays a dominant role in the mutation pressure in codon usage. Finally, expression analysis showed that GRF genes play important roles in pecan embryo development and walnut male and female flower bud development, but with different expression patterns. In conclusion, this study will serve as a rich genetic resource for exploring the molecular mechanisms of flower bud differentiation and embryo development in Juglandaceae. In addition, this is the first study to report the GRF gene family in the Juglandaceae family; therefore, our study will provide guidance for future comparative and functional genomic studies of the GRF gene family in the Juglandaceae specie.
Collapse
Affiliation(s)
- Zhongrong Zhang
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| | - Shaowen Quan
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| | - Jianxin Niu
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
- Correspondence:
| | - Caihua Guo
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| | - Chao Kang
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| | - Jinming Liu
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| | - Xing Yuan
- Department of Horticulture, College of Agriculture, Shihezi University, Shihezi 832003, China
- Xinjiang Production and Construction Corps Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization, Shihezi 832003, China
| |
Collapse
|
25
|
Zhao Y, Qu D, Ma Y. Characterization of the Chloroplast Genome of Argyranthemum frutescens and a Comparison with Other Species in Anthemideae. Genes (Basel) 2022; 13:genes13101720. [PMID: 36292605 PMCID: PMC9602088 DOI: 10.3390/genes13101720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 09/13/2022] [Accepted: 09/22/2022] [Indexed: 11/16/2022] Open
Abstract
Argyranthemum frutescens, which belongs to the Anthemideae (Asteraceae), is widely cultivated as an ornamental plant. In this study, the complete chloroplast genome of A. frutescens was obtained based on the sequences generated by Illumina HiSeq. The chloroplast genome of A. frutescens was 149,626 base pairs (bp) in length, containing a pair of inverted repeats (IR, 24,510 bp) regions separated by a small single-copy (SSC, 18,352 bp) sequence and a large single-copy (LSC, 82,254 bp) sequence. The genome contained 132 genes, consisting of 85 coding DNA sequences, 37 tRNA genes, and 8 rRNA genes, with nineteen genes duplicated in the IR region. A comparison chloroplast genome analysis among ten species from the tribe of Anthemideae revealed that the chloroplast genome size varied, but the genome structure, gene content, and oligonucleotide repeats were highly conserved. Highly divergent regions, e.g., ycf1, trnK-psbK, petN-psbM intronic, were detected. Phylogenetic analysis supported Argyranthemum as a separate genus. The findings of this study will be helpful in the exploration of the phylogenetic relationships of the tribe of Anthemideae and contribute to the breeding improvement of A. frutescens.
Collapse
|
26
|
Tayşi N, Kaymaz Y, Ateş D, Sari H, Toker C, Tanyolaç MB. Complete chloroplast genome sequence of Lens ervoides and comparison to Lens culinaris. Sci Rep 2022; 12:15068. [PMID: 36064865 PMCID: PMC9445179 DOI: 10.1038/s41598-022-17877-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 08/02/2022] [Indexed: 12/05/2022] Open
Abstract
Lens is a member of the Papilionoideae subfamily of Fabaceae and is generally used as a source of vegetable protein as part of human diets in many regions worldwide. Chloroplast (cp) genomes are highly active genetic components of plants and can be utilized as molecular markers for various purposes. As one of the wild lentil species, the Lens ervoides cp genome has been sequenced for the first time in this study using next-generation sequencing. The de novo assembly of the cp genome resulted in a single 122,722 bp sequence as two separate coexisting structural haplotypes with similar lengths. Results indicated that the cp genome of L. ervoides belongs to the inverted repeat lacking clade. Several noteworthy divergences within the coding regions were observed in ndhB, ndhF, rbcL, rpoC2, and ycf2 genes. Analysis of relative synonymous codon usage showed that certain genes, psbN, psaI, psbI, psbE, psbK, petD, and ndhC, preferred using biased codons more often and therefore might have elevated expression and translation efficiencies. Overall, this study exhibited the divergence level between the wild-type and cultured lentil cp genomes and pointed to certain regions that can be utilized as distinction markers for various goals.
Collapse
Affiliation(s)
- Nurbanu Tayşi
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Yasin Kaymaz
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Duygu Ateş
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Hatice Sari
- Department of Field Crops, Faculty of Agriculture, Akdeniz University, Antalya, Turkey
| | - Cengiz Toker
- Department of Field Crops, Faculty of Agriculture, Akdeniz University, Antalya, Turkey
| | - M Bahattin Tanyolaç
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey.
| |
Collapse
|
27
|
Revealing the Complete Chloroplast Genome of an Andean Horticultural Crop, Sweet Cucumber (Solanum muricatum), and Its Comparison with Other Solanaceae Species. DATA 2022. [DOI: 10.3390/data7090123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Sweet cucumber (Solanum muricatum) sect. Basarthrum is a neglected horticultural crop native to the Andean region. It is naturally distributed very close to other two Solanum crops of high importance, potatoes, and tomatoes. To date, molecular tools for this crop remain undetermined. In this study, the complete sweet cucumber chloroplast (cp) genome was obtained and compared with seven Solanaceae species. The cp genome of S. muricatum was 155,681 bp in length and included a large single copy (LSC) region of 86,182 bp and a small single-copy (SSC) region of 18,360 bp, separated by a pair of inverted repeats (IR) regions of 25,568 bp. The cp genome possessed 87 protein-coding genes (CDS), 37 transfer RNA (tRNA) genes, eight ribosomal RNA (rRNA) genes, and one pseudogene. Furthermore, 48 perfect microsatellites were identified. These repeats were mainly located in the noncoding regions. Whole cp genome comparative analysis revealed that the SSC and LSC regions showed more divergence than IR regions. Similar to previous studies, our phylogenetic analysis showed that S. muricatum is a sister species to members of sections Petota + Lycopersicum + Etuberosum. We expect that this first sweet cucumber chloroplast genome will provide potential molecular markers and genomic resources to shed light on the genetic diversity and population studies of S. muricatum, which will allow us to identify varieties and ecotypes. Finally, the features and the structural differentiation will provide us with information about the genes of interest, generating tools for the most precise selection of the best individuals of sweet cucumber, in less time and with fewer resources.
Collapse
|
28
|
Desingu PA, Nagarajan K, Dhama K. SARS-CoV-2 gained a novel spike protein S1-N-Terminal Domain (S1-NTD). ENVIRONMENTAL RESEARCH 2022; 211:113047. [PMID: 35292244 PMCID: PMC8917877 DOI: 10.1016/j.envres.2022.113047] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 02/24/2022] [Accepted: 02/25/2022] [Indexed: 05/20/2023]
Abstract
The clue behind the SARS-CoV-2 origin is still a matter of debate. Here, we report that SARS-CoV-2 has gained a novel spike protein S1-N-terminal domain (S1-NTD). In our CLuster ANalysis of Sequences (CLANS) analysis, SARS-CoV/SARS-CoV-2 S1-NTDs displayed a close relationship with OC43 and HKU1. However, in the complete and S1-NTD-free spike protein, SARS-CoV/SARS-CoV-2 revealed closeness with MERS-CoV. Further, we have divided the S1-NTD of SARS-CoV-2 related viruses into three distinct types (Type-I to III S1-NTD) and the S1-NTD of viruses associated with SARS-CoVs into another three classes (Type-A to C S1-NTD) using CLANS and phylogenetic analyses. In particular, the results of our study indicate that SARS-CoV-2, RaTG13, and BANAL-20-52 viruses carry Type-I-S1-NTD and other SARS-CoV-2-related-bat viruses have Type-II and III. In addition, it was revealed that the Pangolin-GX and Pangolin-Guangdong lineages inherited Type-I-like and Type-II-like S1-NTD, respectively. Then our CLANS study shows the potential for evolution of Type-I and Type-III S1-NTD from SARS-CoV-related viruses Type-A and Type-B S1-NTDs, respectively. Furthermore, our analysis clarifies the possibility that Type-II S1-NTDs may have evolved from Type-A-S1-NTD of SARS-CoV-related viruses through Type-I S1-NTDs. We also observed that BANAL-20-103, BANAL-20-236, and Pangolin-Guangdong-lineage viruses containing Type-II-like S1-NTD are very close to SARS-CoV-2 in spike genetic areas other than S1-NTD. Possibly, it suggests that the common ancestor spike gene of SARS-CoV-2/RaTG13/BANAL-20-52-like virus may have evolved by recombining the Pangolin-Guangdong/BANAL-20-103/BANAL-20-236-like spike gene to Pangolin-GX-like Type-I-like-S1-NTD in the unsampled bat or undiscovered intermediate host or possibly pangolin. These may then have evolved into SARS-CoV-2, RaTG13, and BANAL-20-52 virus spike genes by host jump mediated evolution. The potential function of the novel Type-I-S1-NTD and other types of S1-NTDs needs to be studied further to understand better its importance in the ongoing COVID-19 outbreak and for future pandemic preparedness.
Collapse
Affiliation(s)
| | - K Nagarajan
- Department of Veterinary Pathology, Madras Veterinary College, Tamil Nadu Veterinary and Animal Sciences University (TANUVAS), Vepery, Chennai, 600007, India
| | - Kuldeep Dhama
- Avian Diseases Section, Division of Pathology, ICAR-Indian Veterinary Research Institute (IVRI), Izatnagar, U.P, 243 122, India
| |
Collapse
|
29
|
Khandia R, Saeed M, Alharbi AM, Ashraf GM, Greig NH, Kamal MA. Codon Usage Bias Correlates With Gene Length in Neurodegeneration Associated Genes. Front Neurosci 2022; 16:895607. [PMID: 35860292 PMCID: PMC9289476 DOI: 10.3389/fnins.2022.895607] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/08/2022] [Indexed: 11/13/2022] Open
Abstract
Codon usage analysis is a crucial part of molecular characterization and is used to determine the factors affecting the evolution of a gene. The length of a gene is an important parameter that affects the characteristics of the gene, such as codon usage, compositional parameters, and sometimes, its functions. In the present study, we investigated the association of various parameters related to codon usage with the length of genes. Gene expression is affected by nucleotide disproportion. In sixty genes related to neurodegenerative disorders, the G nucleotide was the most abundant and the T nucleotide was the least. The nucleotide T exhibited a significant association with the length of the gene at both the overall compositional level and the first and second codon positions. Codon usage bias (CUB) of these genes was affected by pyrimidine and keto skews. Gene length was found to be significantly correlated with codon bias in neurodegeneration associated genes. In gene segments with lengths below 1,200 bp and above 2,400 bp, CUB was positively associated with length. Relative synonymous CUB, which is another measure of CUB, showed that codons TTA, GTT, GTC, TCA, GGT, and GGA exhibited a positive association with length, whereas codons GTA, AGC, CGT, CGA, and GGG showed a negative association. GC-ending codons were preferred over AT-ending codons. Overall analysis indicated that the association between CUB and length varies depending on the segment size; however, CUB of 1,200–2,000 bp gene segments appeared not affected by gene length. In synopsis, analysis suggests that length of the genes correlates with various imperative molecular signatures including A/T nucleotide disproportion and codon choices. In the present study we additionally evaluated various molecular features and their correlation with different indices of codon usage, like the Codon Adaptation Index (CAI) and Relative Dynonymous Codon Usage (RSCU) of codons. We also considered the impact of gene fragment size on different molecular features in genes related to neurodegeneration. This analysis will aid our understanding of and in potentially modulating gene expression in cases of defective gene functioning in clinical settings.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
- *Correspondence: Rekha Khandia, ;
| | - Mohd. Saeed
- Department of Biology, College of Sciences, University of Hail, Hail, Saudi Arabia
| | - Ahmed M. Alharbi
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, University of Hail, Hail, Saudi Arabia
| | - Ghulam Md. Ashraf
- Pre-clinical Research Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Nigel H. Greig
- Drug Design and Development Section, Translational Gerontology Branch, Intramural Research Program National Institute on Aging, NIH, Baltimore, MD, United States
| | - Mohammad Amjad Kamal
- Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, Bangladesh
- Enzymoics, Novel Global Community Educational Foundation, Hebersham, NSW, Australia
| |
Collapse
|
30
|
Khandia R, Sharma A, Alqahtani T, Alqahtani AM, Asiri YI, Alqahtani S, Alharbi AM, Kamal MA. Strong Selectional Forces Fine-Tune CpG Content in Genes Involved in Neurological Disorders as Revealed by Codon Usage Patterns. Front Neurosci 2022; 16:887929. [PMID: 35757545 PMCID: PMC9226491 DOI: 10.3389/fnins.2022.887929] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 04/04/2022] [Indexed: 02/05/2023] Open
Abstract
Neurodegenerative disorders cause irreversible damage to the neurons and adversely affect the quality of life. Protein misfolding and their aggregation in specific parts of the brain, mitochondrial dysfunction, calcium load, proteolytic stress, and oxidative stress are among the causes of neurodegenerative disorders. In addition, altered metabolism has been associated with neurodegeneration as evidenced by reductions in glutamine and alanine in transient global amnesia patients, higher homocysteine-cysteine disulfide, and lower methionine decline in serum urea have been observed in Alzheimer's disease patients. Neurodegeneration thus appears to be a culmination of altered metabolism. The study's objective is to analyze various attributes like composition, physical properties of the protein, and factors like selectional and mutational forces, influencing codon usage preferences in a panel of genes involved directly or indirectly in metabolism and contributing to neurodegeneration. Various parameters, including gene composition, dinucleotide analysis, Relative synonymous codon usage (RSCU), Codon adaptation index (CAI), neutrality and parity plots, and different protein indices, were computed and analyzed to determine the codon usage pattern and factors affecting it. The correlation of intrinsic protein properties such as the grand average of hydropathicity index (GRAVY), isoelectric point, hydrophobicity, and acidic, basic, and neutral amino acid content has been found to influence codon usage. In genes up to 800 amino acids long, the GC3 content was highly variable, while GC12 content was relatively constant. An optimum CpG content is present in genes to maintain a high expression level as required for genes involved in metabolism. Also observed was a low codon usage bias with a higher protein expression level. Compositional parameters and nucleotides at the second position of codons played essential roles in explaining the extent of bias. Overall analysis indicated that the dominance of selection pressure and compositional constraints and mutational forces shape codon usage.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
| | - Anushri Sharma
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
| | - Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
| | - Ali M Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
| | - Yahya I Asiri
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
| | - Saud Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
| | - Ahmed M Alharbi
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, University of Hail, Hail, Saudi Arabia
| | - Mohammad Amjad Kamal
- Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China.,King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia.,Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, Bangladesh.,Enzymoics, Novel Global Community Educational Foundation, Hebersham, NSW, Australia
| |
Collapse
|
31
|
Saha J, Dey S, Pal A. Whole genome sequencing and comparative genomic analyses of Pseudomonas aeruginosa strain isolated from arable soil reveal novel insights into heavy metal resistance and codon biology. Curr Genet 2022; 68:481-503. [PMID: 35763098 DOI: 10.1007/s00294-022-01245-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/14/2022] [Accepted: 06/06/2022] [Indexed: 11/03/2022]
Abstract
Elevated concentration of non-essential persistent heavy metals and metalloids in the soil is detrimental to essential soil microbes and plants, resulting in diminished diversity and biomass. Thus, isolation, screening, and whole genomic analysis of potent strains of bacteria from arable lands with inherent capabilities of heavy metal resistance and plant growth promotion hold the key for bio remedial applications. This study is an attempt to do the same. In this study, a potent strain of Pseudomonas aeruginosa was isolated from paddy fields, followed by metabolic profiling using FTIR, metal uptake analysis employing ICP-MS, whole genome sequencing and comparative codon usage analysis. ICP-MS study provided insights into a high degree of Cd uptake during the exponential phase of growth under cumulative metal stress to Cd, Zn and Co, which was further corroborated by the detection of cadA gene along with czcCBA operon in the genome upon performing whole-genome sequencing. This potent strain of Pseudomonas aeruginosa also harboured genes, such as copA, chrA, znuA, mgtE, corA, and others conferring resistance against different heavy metals, such as Cd, Zn, Co, Cu, Cr, etc. A comparative codon usage bias analysis at the genomic and genic level, whereby several heavy metal resistant genes were considered in the backdrop of two housekeeping genes among 40 Pseudomonas spp. indicated the presence of a relatively strong codon usage bias in the studied strain. With this work, an effort was made to explore heavy metal-resistant bacteria (isolated from arable soil) and whole genome sequence analysis to get insight into metal resistance for future bio remedial applications.
Collapse
Affiliation(s)
- Jayanti Saha
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India
| | - Sourav Dey
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India
| | - Ayon Pal
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India.
| |
Collapse
|
32
|
Shafat Z, Ahmed A, Parvez MK, Parveen S. Analysis of codon usage patterns in open reading frame 4 of hepatitis E viruses. BENI-SUEF UNIVERSITY JOURNAL OF BASIC AND APPLIED SCIENCES 2022; 11:65. [PMID: 35573872 PMCID: PMC9086417 DOI: 10.1186/s43088-022-00244-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 04/19/2022] [Indexed: 12/01/2022] Open
Abstract
Background Hepatitis E virus (HEV) is a member of the family Hepeviridae and causes acute HEV infections resulting in thousands of deaths worldwide. The zoonotic nature of HEV in addition to its tendency from human to human transmission has led scientists across the globe to work on its different aspects. HEV also accounts for about 30% mortality rates in case of pregnant women. The genome of HEV is organized into three open reading frames (ORFs): ORF1 ORF2 and ORF3. A reading frame encoded protein ORF4 has recently been discovered which is exclusive to GT 1 isolates of HEV. The ORF4 is suggested to play crucial role in pregnancy-associated pathology and enhanced replication. Though studies have documented the ORF4’s importance, the genetic features of ORF4 protein genes in terms of compositional patterns have not been elucidated. As codon usage performs critical role in establishment of the host–pathogen relationship, therefore, the present study reports the codon usage analysis (based on nucleotide sequences of HEV ORF4 available in the public database) in three hosts along with the factors influencing the codon usage patterns of the protein genes of ORF4 of HEV. Results The nucleotide composition analysis indicated that ORF4 protein genes showed overrepresentation of C nucleotide and while A nucleotide was the least-represented, with random distribution of G and T(U) nucleotides. The relative synonymous codon usage (RSCU) analysis revealed biasness toward C/G-ended codons (over U/A) in all three natural HEV-hosts (human, rat and ferret). It was observed that all the ORF4 genes were richly endowed with GC content. Further, our results showed the occurrence of both coincidence and antagonistic codon usage patterns among HEV-hosts. The findings further emphasized that both mutational and selection forces influenced the codon usage patterns of ORF4 protein genes. Conclusions To the best of our knowledge, this is first bioinformatics study evaluating codon usage patterns in HEV ORF4 protein genes. The findings from this study are expected to increase our understanding toward significant factors involved in evolutionary changes of ORF4. Supplementary Information The online version contains supplementary material available at 10.1186/s43088-022-00244-w.
Collapse
|
33
|
He Z, Ding S, Guo J, Qin L, Xu X. Synonymous Codon Usage Analysis of Three Narcissus Potyviruses. Viruses 2022; 14:v14050846. [PMID: 35632588 PMCID: PMC9143068 DOI: 10.3390/v14050846] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 04/13/2022] [Accepted: 04/18/2022] [Indexed: 02/04/2023] Open
Abstract
Narcissus degeneration virus (NDV), narcissus late season yellows virus (NLSYV) and narcissus yellow stripe virus (NYSV), which belong to the genus Potyvirus of the family Potyviridae, cause significant losses in the ornamental value and quality of narcissus. Several previous studies have explored the genetic diversity and evolution rate of narcissus viruses, but the analysis of the synonymous codons of the narcissus viruses is still unclear. Herein, the coat protein (CP) of three viruses is used to analyze the viruses’ phylogeny and codon usage pattern. Phylogenetic analysis showed that NYSV, NDV and NLSYV isolates were divided into five, three and five clusters, respectively, and these clusters seemed to reflect the geographic distribution. The effective number of codon (ENC) values indicated a weak codon usage bias in the CP coding region of the three narcissus viruses. ENC-plot and neutrality analysis showed that the codon usage bias of the three narcissus viruses is all mainly influenced by natural selection compared with the mutation pressure. The three narcissus viruses shared the same best optimal codon (CCA) and the synonymous codon prefers to use codons ending with A/U, compared to C/G. Our study shows the codon analysis of different viruses on the same host for the first time, which indicates the importance of the evolutionary-based design to control these viruses.
Collapse
Affiliation(s)
- Zhen He
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou 225009, China; (S.D.); (L.Q.); (X.X.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China
- Correspondence: or
| | - Shiwen Ding
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou 225009, China; (S.D.); (L.Q.); (X.X.)
| | - Jiyuan Guo
- Department of Resources and Environment, Moutai Institute, Zunyi 564507, China;
| | - Lang Qin
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou 225009, China; (S.D.); (L.Q.); (X.X.)
| | - Xiaowei Xu
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou 225009, China; (S.D.); (L.Q.); (X.X.)
| |
Collapse
|
34
|
Comparative Plastome Analysis of Three Amaryllidaceae Subfamilies: Insights into Variation of Genome Characteristics, Phylogeny, and Adaptive Evolution. BIOMED RESEARCH INTERNATIONAL 2022; 2022:3909596. [PMID: 35372568 PMCID: PMC8970886 DOI: 10.1155/2022/3909596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 01/19/2022] [Accepted: 02/05/2022] [Indexed: 11/17/2022]
Abstract
In the latest APG IV classification system, Amaryllidaceae is placed under the order of Asparagus and includes three subfamilies: Agapanthoideae, Allioideae, and Amaryllidoideae, which include many economically important crops. With the development of molecular phylogeny, research on the phylogenetic relationship of Amaryllidaceae has become more convenient. However, the current comparative analysis of Amaryllidaceae at the whole chloroplast genome level is still lacking. In this study, we sequenced 18 Allioideae plastomes and combined them with publicly available data (a total of 41 plastomes), including 21 Allioideae species, 1 Agapanthoideae species, 14 Amaryllidoideae species, and 5 Asparagaceae species. Comparative analyses were performed including basic characteristics of genome structure, codon usage, repeat elements, IR boundary, and genome divergence. Phylogenetic relationships were detected using single-copy genes (SCGs) and ribosomal internal transcribed spacer sequences (ITS), and the branch-site model was also employed to conduct the positive selection analysis. The results indicated that all Amaryllidaceae species showed a highly conserved typical tetrad structure. The GC content and five codon usage indexes in Allioideae species were lower than those in the other two subfamilies. Comparison analysis of Bayesian and ML phylogeny based on SCGs strongly supports the monophyly of three subfamilies and the sisterhood among them. Besides, positively selected genes (PSGs) were detected in each of the three subfamilies. Almost all genes with significant posterior probabilities for codon sites were associated with self-replication and photosynthesis. Our study investigated the three subfamilies of Amaryllidaceae at the whole chloroplast genome level and suggested the key role of selective pressure in the adaptation and evolution of Amaryllidaceae.
Collapse
|
35
|
Wang Z, Cai Q, Wang Y, Li M, Wang C, Wang Z, Jiao C, Xu C, Wang H, Zhang Z. Comparative Analysis of Codon Bias in the Chloroplast Genomes of Theaceae Species. Front Genet 2022; 13:824610. [PMID: 35360853 PMCID: PMC8961065 DOI: 10.3389/fgene.2022.824610] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Theaceae species are dicotyledonous angiosperms with extremely high ornamental and economic value. The chloroplast genome is traditionally used to study species evolution, expression of chloroplast genes and chloroplast transformation. Codon usage bias (CUB) analysis is beneficial for investigations of evolutionary relationships and can be used to improve gene expression efficiency in genetic transformation research. However, there are relatively few systematic studies of the CUB in the chloroplast genomes of Theaceae species. In this study, CUB and nucleotide compositions parameters were determined by the scripts written in the Perl language, CodonW 1.4.2, CU.Win2000, RStudio and SPSS 23.0. The chloroplast genome data of 40 Theaceae species were obtained to analyse the codon usage (CU) characteristics of the coding regions and the influence of the source of variation on CUB. To explore the relationship between the CUB and gene expression levels in these 40 Theaceae plastomes, the synonymous codon usage order (SCUO) and measure independent of length and composition (MILC) values were determined. Finally, phylogenetic analysis revealed the genetic evolutionary relationships among these Theaceae species. Our results showed that based on the chloroplast genomes of these 40 Theaceae species, the CUB was for codons containing A/T bases and those that ended with A/T bases. Moreover, there was great commonality in the CUB of the Theaceae species according to comparative analysis of relative synonymous codon usage (RSCU) and relative frequency of synonymous codon (RFSC): these species had 29 identical codons with bias (RSCU > 1), and there were 19 identical high-frequency codons. The CUB of Theaceae species is mainly affected by natural selection. The SCUO value of the 40 Theaceae species was 0.23 or 0.24, and the chloroplast gene expression level was moderate, according to MILC values. Additionally, we observed a positive correlation between the SCUO and MILC values, which indicated that CUB might affect gene expression. Furthermore, the phylogenetic analysis showed that the evolutionary relationships in these 40 Theaceae species were relatively conserved. A systematic study on the CUB and expression of Theaceae species provides further evidence for their evolution and phylogeny.
Collapse
Affiliation(s)
- Zhanjun Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Qianwen Cai
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Yue Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Minhui Li
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chenchen Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoxia Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chunyan Jiao
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Congcong Xu
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Hongyan Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoliang Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
- *Correspondence: Zhaoliang Zhang,
| |
Collapse
|
36
|
Khandia R, Ali Khan A, Alexiou A, Povetkin SN, Nikolaevna VM. Codon Usage Analysis of Pro-Apoptotic Bim Gene Isoforms. J Alzheimers Dis 2022; 86:1711-1725. [DOI: 10.3233/jad-215691] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Background: Bim is a Bcl-2 homology 3 (BH3)-only proteins, a group of pro-apoptotic proteins involved in physiological and pathological conditions. Both the overexpression and under-expression of Bim protein are associated with the diseased condition, and various isoforms of Bim protein are present with differential apoptotic potential. Objective: The present study attempted to envisage the association of various molecular signatures with the codon choices of Bim isoforms. Methods: Molecular signatures like composition, codon usage, nucleotide skews, the free energy of mRNA transcript, physical properties of proteins, codon adaptation index, relative synonymous codon usage, and dinucleotide odds ratio were determined and analyzed for their associations with codon choices of Bim gene. Results: Skew analysis of the Bim gene indicated the preference of C nucleotide over G, A, and T and preference of G over T and A nucleotides was observed. An increase in C content at the first and third codon position increased gene expression while it decreased at the second codon position. Compositional constraints on nucleotide C at all three codon positions affected gene expression. The analysis revealed an exceptionally high usage of CpC dinucleotide in all the envisaged 31 isoforms of Bim. We correlated it with the requirement of rapid demethylation machinery to fine-tune the Bimgene expression. Also, mutational pressure played a dominant role in shaping codon usage bias in Bim isoforms. Conclusion: An exceptionally high usage of CpC dinucleotide in all the envisaged 31 isoforms of Bim indicates a high order selectional force to fine tune Bim gene expression.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
| | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Athanasios Alexiou
- Novel Global Community Educational Foundation, Australia & AFNP Med, Austria
| | | | | |
Collapse
|
37
|
Zhang Y, Shen Z, Meng X, Zhang L, Liu Z, Liu M, Zhang F, Zhao J. Codon usage patterns across seven Rosales species. BMC PLANT BIOLOGY 2022; 22:65. [PMID: 35123393 PMCID: PMC8817548 DOI: 10.1186/s12870-022-03450-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 01/31/2022] [Indexed: 05/03/2023]
Abstract
BACKGROUND Codon usage bias (CUB) analysis is an effective method for studying specificity, evolutionary relationships, and mRNA translation and discovering new genes among various species. In general, CUB analysis is mainly performed within one species or between closely related species and no such study has been applied among species with distant genetic relationships. Here, seven Rosales species with high economic value were selected to conduct CUB analysis. RESULTS The results showed that the average GC1, GC2 and GC3 contents were 51.08, 40.52 and 43.12%, respectively, indicating that the A/T content is more abundant and the Rosales species prefer A/T as the last codon. Neutrality plot and ENc plot analysis revealed that natural selection was the main factor leading to CUB during the evolution of Rosales species. All 7 Rosales species contained three high-frequency codons, AGA, GTT and TTG, encoding Arg, Val and Leu, respectively. The 7 Rosales species differed in high-frequency codon pairs and the distribution of GC3, though the usage patterns of closely related species were more consistent. The results of the biclustering heat map among 7 Rosales species and 20 other species were basically consistent with the results of genome data, suggesting that CUB analysis is an effective method for revealing evolutionary relationships among species at the family or order level. In addition, chlorophytes prefer using G/C as ending codon, while monocotyledonous and dicotyledonous plants prefer using A/T as ending codon. CONCLUSIONS The CUB pattern among Rosales species was mainly affected by natural selection. This work is the first to highlight the CUB patterns and characteristics of Rosales species and provides a new perspective for studying genetic relationships across a wide range of species.
Collapse
Affiliation(s)
- Yao Zhang
- College of Life Science, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Plant Physiology and Molecular Pathology, Hebei Agricultural University, Baoding, China
| | - Zenan Shen
- High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
| | - Xiangrui Meng
- College of Life Science, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Plant Physiology and Molecular Pathology, Hebei Agricultural University, Baoding, China
| | - Liman Zhang
- College of Life Science, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Plant Physiology and Molecular Pathology, Hebei Agricultural University, Baoding, China
| | - Zhiguo Liu
- Research Center of Chinese Jujube, Hebei Agricultural University, Baoding, China
| | - Mengjun Liu
- Research Center of Chinese Jujube, Hebei Agricultural University, Baoding, China
| | - Fa Zhang
- High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190 China
| | - Jin Zhao
- College of Life Science, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Plant Physiology and Molecular Pathology, Hebei Agricultural University, Baoding, China
| |
Collapse
|
38
|
Morton BR. Context-Dependent Mutation Dynamics, Not Selection, Explains the Codon Usage Bias of Most Angiosperm Chloroplast Genes. J Mol Evol 2022; 90:17-29. [PMID: 34932159 PMCID: PMC8821512 DOI: 10.1007/s00239-021-10038-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2021] [Accepted: 11/17/2021] [Indexed: 01/12/2023]
Abstract
Two competing proposals about the degree to which selection affects codon usage of angiosperm chloroplast genes are examined. The first, based on observations that codon usage does not match expectations under the naïve assumption that base composition will be identical at all neutral sites, is that selection plays a significant role. The second is that codon usage is determined almost solely by mutation bias and drift, with selection influencing only one or two highly expressed genes, in particular psbA. First it is shown that, as a result of an influence of neighboring base composition on mutation dynamics, compositional biases are expected to be widely divergent at different sites in the absence of selection. The observed mutation properties are then used to predict expected neutral codon usage biases and to show that observed deviations from the naïve expectations are in fact expected given the context-dependent mutational dynamics. It is also shown that there is a match between the observed and expected codon usage when context effects are taken into consideration, with psbA being a notable exception. Overall, the data support the model that selection is not a widespread factor affecting the codon usage of angiosperm chloroplast genes and highlight the need to have an accurate model of mutational dynamics.
Collapse
Affiliation(s)
- Brian R Morton
- Department of Biology, Barnard College, Columbia University, 3009 Broadway, New York, NY, 10027, USA.
| |
Collapse
|
39
|
Rahman SU, Abdullah M, Khan AW, Haq MIU, Haq NU, Aziz A, Tao S. A detailed comparative analysis of codon usage bias in Alongshan virus. Virus Res 2022; 308:198646. [PMID: 34822954 DOI: 10.1016/j.virusres.2021.198646] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 11/05/2021] [Accepted: 11/19/2021] [Indexed: 11/25/2022]
Abstract
Alongshan virus (ALSV) is an emerging tick-borne pathogen that infects humans, causing febrile disease. ALSV uses Ixodes Persulcatus ticks to infect humans with a wide range of signs, from asymptomatic to encephalitis-like syndrome. There is an increasing public health concern about the ALSV infection. To get insight into the impacts of viral relations with their hosts on viral ability, survival, and evasion from hosts immune systems remain unknown. The codon usage is a driving force in viral genome evolution; therefore, we enrolled 41 ALSV strains in codon usage analysis to elucidate the molecular evolutionary dynamics of ALSV. The results indicate that the overall codon usage among ALSV isolates is relatively similar and slightly biased. Base compositions for the cds were in order of G >A >C >U and in the third position of codons G3 >A3 >C3 >T3. The RSCU values revealed that the more frequently used codons were mostly GC ended. Different codon preferences in ALSV genes in relation to codon usage of H. sapiens and Ixodes Persulcatus genes were found. Neutrality plot was determined to reveal the superiority of natural selection over directional mutation pressure in causing CUB based on GC12 versus GC3 contents. The results of these studies suggest that the emergence of ALSV in China, Russia and Finland may also be reflected in ALSV codon usage. Altogether, the presence of both mutation pressure and natural selection effect in shaping the codon usage patterns of ALSV.
Collapse
Affiliation(s)
- Siddiq Ur Rahman
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan; College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China.
| | - Muhammad Abdullah
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan
| | - Abdul Wajid Khan
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan
| | - Muhammad Inam Ul Haq
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan
| | - Noor Ul Haq
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan
| | - Abdul Aziz
- Department of Computer Science & Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa 27200, Pakistan
| | - Shiheng Tao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China.
| |
Collapse
|
40
|
Saldaña CL, Rodriguez-Grados P, Chávez-Galarza JC, Feijoo S, Guerrero-Abad JC, Vásquez HV, Maicelo JL, Jhoncon JH, Arbizu CI. Unlocking the Complete Chloroplast Genome of a Native Tree Species from the Amazon Basin, Capirona ( Calycophyllum Spruceanum, Rubiaceae), and Its Comparative Analysis with Other Ixoroideae Species. Genes (Basel) 2022; 13:genes13010113. [PMID: 35052453 PMCID: PMC8774758 DOI: 10.3390/genes13010113] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/31/2021] [Accepted: 01/05/2022] [Indexed: 11/21/2022] Open
Abstract
Capirona (Calycophyllum spruceanum Benth.) belongs to subfamily Ixoroideae, one of the major lineages in the Rubiaceae family, and is an important timber tree. It originated in the Amazon Basin and has widespread distribution in Bolivia, Peru, Colombia, and Brazil. In this study, we obtained the first complete chloroplast (cp) genome of capirona from the department of Madre de Dios located in the Peruvian Amazon. High-quality genomic DNA was used to construct libraries. Pair-end clean reads were obtained by PE 150 library and the Illumina HiSeq 2500 platform. The complete cp genome of C. spruceanum has a 154,480 bp in length with typical quadripartite structure, containing a large single copy (LSC) region (84,813 bp) and a small single-copy (SSC) region (18,101 bp), separated by two inverted repeat (IR) regions (25,783 bp). The annotation of C. spruceanum cp genome predicted 87 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, 37 transfer RNA (tRNA) genes, and one pseudogene. A total of 41 simple sequence repeats (SSR) of this cp genome were divided into mononucleotides (29), dinucleotides (5), trinucleotides (3), and tetranucleotides (4). Most of these repeats were distributed in the noncoding regions. Whole chloroplast genome comparison with the other six Ixoroideae species revealed that the small single copy and large single copy regions showed more divergence than inverted regions. Finally, phylogenetic analyses resolved that C. spruceanum is a sister species to Emmenopterys henryi and confirms its position within the subfamily Ixoroideae. This study reports for the first time the genome organization, gene content, and structural features of the chloroplast genome of C. spruceanum, providing valuable information for genetic and evolutionary studies in the genus Calycophyllum and beyond.
Collapse
Affiliation(s)
- Carla L. Saldaña
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
| | - Pedro Rodriguez-Grados
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
- Facultad de Ciencias, Universidad Nacional José Faustino Sánchez Carrión, Av. Mercedes Indacochea Nro. 609, Huacho 15136, Peru
| | - Julio C. Chávez-Galarza
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
| | - Shefferson Feijoo
- Estación Experimental Agraria San Bernardo, Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Carretera Cusco, Puerto Maldonado, Tambopata, Madre de Dios 17000, Peru;
| | - Juan Carlos Guerrero-Abad
- Dirección de Recursos Genéticos y Biotecnología, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru;
| | - Héctor V. Vásquez
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
| | - Jorge L. Maicelo
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
| | - Jorge H. Jhoncon
- Centro de Investigación de Plantas Andinas y Nativas, Facultad de Ciencias, Universidad Nacional de Educación Enrique Guzmán y Valle, Av. Enrique Guzmán y Valle s/n, Lima 15472, Peru;
- Unidad de Investigación, Perú Maca SAC, Panamericana Sur KM. 37.2 Mz. D1. Lote 03A, Lima 15823, Peru
| | - Carlos I. Arbizu
- Dirección de Desarrollo Tecnológico Agrario, Instituto Nacional de Innovación Agraria (INIA), Av. La Molina 1981, Lima 15024, Peru; (C.L.S.); (P.R.-G.); (J.C.C.-G.); (H.V.V.); (J.L.M.)
- Correspondence:
| |
Collapse
|
41
|
Ren J, Tian J, Jiang H, Zhu XX, Mutie FM, Wanga VO, Ding SX, Yang JX, Dong X, Chen LL, Cai XZ, Hu GW. Comparative and Phylogenetic Analysis Based on the Chloroplast Genome of Coleanthus subtilis (Tratt.) Seidel, a Protected Rare Species of Monotypic Genus. FRONTIERS IN PLANT SCIENCE 2022; 13:828467. [PMID: 35283921 PMCID: PMC8908325 DOI: 10.3389/fpls.2022.828467] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 01/31/2022] [Indexed: 05/13/2023]
Abstract
Coleanthus subtilis (Tratt.) Seidel (Poaceae) is an ephemeral grass from the monotypic genus Coleanthus Seidl, which grows on wet muddy areas such as fishponds or reservoirs. As a rare species with strict habitat requirements, it is protected at international and national levels. In this study, we sequenced its whole chloroplast genome for the first time using the next-generation sequencing (NGS) technology on the Illumina platform, and performed a comparative and phylogenetic analysis with the related species in Poaceae. The complete chloroplast genome of C. subtilis is 135,915 bp in length, with a quadripartite structure having two 21,529 bp inverted repeat regions (IRs) dividing the entire circular genome into a large single copy region (LSC) of 80,100 bp and a small single copy region (SSC) of 12,757 bp. The overall GC content is 38.3%, while the GC contents in LSC, SSC, and IR regions are 36.3%, 32.4%, and 43.9%, respectively. A total of 129 genes were annotated in the chloroplast genome, including 83 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. The accD gene and the introns of both clpP and rpoC1 genes were missing. In addition, the ycf1, ycf2, ycf15, and ycf68 were pseudogenes. Although the chloroplast genome structure of C. subtilis was found to be conserved and stable in general, 26 SSRs and 13 highly variable loci were detected, these regions have the potential to be developed as important molecular markers for the subfamily Pooideae. Phylogenetic analysis with species in Poaceae indicated that Coleanthus and Phippsia were sister groups, and provided new insights into the relationship between Coleanthus, Zingeria, and Colpodium. This study presents the initial chloroplast genome report of C. subtilis, which provides an essential data reference for further research on its origin.
Collapse
Affiliation(s)
- Jing Ren
- College of Life Sciences, Hunan Normal University, Changsha, China
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
| | - Jing Tian
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Hui Jiang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xin-Xin Zhu
- College of Life Sciences, Xinyang Normal University, Xinyang, China
| | - Fredrick Munyao Mutie
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Vincent Okelo Wanga
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Shi-Xiong Ding
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jia-Xin Yang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xiang Dong
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ling-Ling Chen
- College of Life Sciences, Hunan Normal University, Changsha, China
| | - Xiu-Zhen Cai
- College of Life Sciences, Hunan Normal University, Changsha, China
- *Correspondence: Xiu-Zhen Cai,
| | - Guang-Wan Hu
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
- Guang-Wan Hu,
| |
Collapse
|
42
|
Andargie M, Congyi Z. Genome-wide analysis of codon usage in sesame ( Sesamum indicum L.). Heliyon 2022; 8:e08687. [PMID: 35106386 PMCID: PMC8789531 DOI: 10.1016/j.heliyon.2021.e08687] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/20/2021] [Accepted: 12/24/2021] [Indexed: 10/28/2022] Open
Abstract
Sesamum indicum is an ancient oil crop grown in tropical and subtropical areas of the world. We have analyzed 23,538 coding sequences (CDS) of S. indicum to understand the factors shaping codon usage in this important oil crop plant. We identified eleven highly preferred codons in S. indicum that have AT-endings. The slope of a neutrality plot was less than one while effective number of codons (ENC) plot showed distribution above and below the standard curve. There is a significant relationship between protein length and relative synonymous codon usage (RSCU) at the primary axis while there is a weak correlation between protein length and Nc values. Correspondence analysis conducted on RSCU values differentiated CDS based on their GC content and their characteristic feature and showed a discrete distribution. Moreover, by determining codon usage, we found out that majority of the lignan biosynthesis related genes showed a weaker codon usage bias. These results provide insights into understanding codon evolution in sesame.
Collapse
Affiliation(s)
- Mebeaselassie Andargie
- University of Goettingen, Molecular Phytopathology and Mycotoxin Research, Grisebachstrasse 6, 37077 Goettingen, Germany
| | - Zhu Congyi
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization (MOA), Guangdong Province Key Laboratory of Tropical and Subtropical Fruit Tree Research, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| |
Collapse
|
43
|
An extensive evaluation of codon usage pattern and bias of structural proteins p30, p54 and, p72 of the African swine fever virus (ASFV). Virusdisease 2021; 32:810-822. [PMID: 34901328 DOI: 10.1007/s13337-021-00719-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 06/23/2021] [Indexed: 10/20/2022] Open
Abstract
African swine fever virus (ASFV) belongs to the family of Asfarviridae to the genus Asfivirus. ASF virus causes hemorrhage illness with a high mortality rate and hence, commercial loss in the swine community. The ASFV has been categorized by variation in codon usage that is caused by high mutation rates and natural selection. The evolution is caused mainly due to the mutation pressure and regulating the protein gene expression. Based on publicly accessible nucleotide sequences of the ASFV and its host (pig & tick), codon usage bias analysis was performed since an approved effective vaccination is not available to date, it is very important to analyze the codon usage bias of the p30, p54, and p72 proteins of ASFV to produce an effective and efficient vaccine to control the disease. Even though the codon usage bias analyses have been evaluated earlier, the evaluation of the codon usage pattern specific to p30, p54, and p72 of ASFV is inadequate. In all the protein-coding sequences, nucleotide base and codons terminating with base T were most frequent and the mean effective number of codons (Nc) was high, indicating the presence of codon usage bias. The GC contents and dinucleotide frequencies also indicated the codon usage bias of the ASFV pig and tick. The Nc plot, parity plot, neutrality plot analysis, revealed natural selection, as well as mutation pressure, were the major constraints in altering the codon bias of ASF virus. codon usage bias analysis was performed with no substantial differences in codon usage of the ASFV in pig and tick. Supplementary Information The online version contains supplementary material available at 10.1007/s13337-021-00719-x.
Collapse
|
44
|
Ramdhan RS, Hade NA, Algafari RN. Association of multiple SNPs at slc2a9 exon 8 in gout disease in Iraqi population: A molecular study. Meta Gene 2021. [DOI: 10.1016/j.mgene.2021.100942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|
45
|
Chakraborty S, Basumatary P, Nath D, Paul S, Uddin A. Compositional features and pattern of codon usage for mitochondrial CO genes among reptiles. Mitochondrion 2021; 62:111-121. [PMID: 34793987 DOI: 10.1016/j.mito.2021.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 11/02/2021] [Accepted: 11/10/2021] [Indexed: 11/27/2022]
Abstract
The phenomenon of non-random occurrence of synonymous nucleotide triplets (codons) in the coding sequences of genes is the codon usage bias (CUB). In this study, we used bioinformatic tool kit to analyze the compositional pattern and CUB of mitogenes namely COI, COII and COIII across different orders of reptiles. Estimation of overall base composition in the protein-coding sequences of COI, COII and COIII genes of the reptilian orders revealed an uneven usage of nucleotides. The overall count of A nucleotide was found to be the highest while the overall count of G nucleotide was the least. The CO genes across the three reptilian orders were prominently AT biased. Comparison of the GC proportion at each codon position displayed that GC1 percentage ranked the highest in all the three CO genes of the reptilian orders. SCUO values indicated weaker CUB, while considerable variation of SCUO values existed in the three CO genes across the studied reptiles. Relative synonymous codon usage (RSCU) values indicated that mostly the A ending codons were preferred. Based on the parameters namely neutrality plot, mutational responsive index and translational selection, we could conclude that natural selection was the major evolutionary force in COI, COII and COIII genes in the studied reptilian orders. However, correspondence analysis, parity plot and correlation studies indicated the existence of mutation pressure as well on the CO genes.
Collapse
Affiliation(s)
- Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| | | | - Durbba Nath
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Sunanda Paul
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi788150, Assam, India.
| |
Collapse
|
46
|
Epidemiology and Evolution of Emerging Porcine Circovirus-like Viruses in Pigs with Hemorrhagic Dysentery and Diarrhea Symptoms in Central China from 2018 to 2021. Viruses 2021; 13:v13112282. [PMID: 34835090 PMCID: PMC8624291 DOI: 10.3390/v13112282] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 11/08/2021] [Accepted: 11/10/2021] [Indexed: 12/15/2022] Open
Abstract
Porcine circovirus-like virus (PCLV) is a type of circular Rep-encoding single-stranded DNA virus and may be associated with the development of diarrheal symptoms in pigs. In this study, we retrospectively analyzed three years of past cases in Anhui, China, and reported a case of hemorrhagic enteritis and death in a pregnant sow possibly caused by PCLV. In addition, we analyzed the evolutionary characteristics of PCLV and found that mutation, recombination and selective pressure all played an important role in the evolution of PCLV. We identified N15D and T17S as well as L56T, T58R, K59Q, M62R, L75I and R190K mutations in two different branches, and we noted recombination events in the Rep of a group of Chinese strains. Analysis of selection pressure revealed that PCLV gained more positive selection, indicating that the virus is in a continuous evolutionary state. The PR2 plot, ENC-plot and neutrality analysis showed a greater role of natural selection than that of mutational pressure in the formation of codon usage patterns. This study is the first to identify PCLV in sows with hemorrhagic dysentery and death, and it provides new epidemiological information on PCLV infection in pigs in China.
Collapse
|
47
|
Wang J, Lin Y, Xi M. Analysis of Codon Usage Patterns of Six Sequenced Brachypodium distachyon Lines Reveals a Declining CG Skew of the CDSs from the 5'-ends to the 3'-ends. Genes (Basel) 2021; 12:1467. [PMID: 34680862 PMCID: PMC8535453 DOI: 10.3390/genes12101467] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/09/2021] [Accepted: 09/20/2021] [Indexed: 02/01/2023] Open
Abstract
Brachypodium distachyon, a new monocotyledonous model plant, has received wide attention in biological research due to its small genome and numerous genetic resources. Codon usage bias is an important feature of genes and genomes, and it can be used in transgenic and evolutionary studies. In this study, the nucleotide compositions and patterns of codon usage bias were calculated using Codon W. Additionally, an ENC plot, Parity rule 2 and correspondence analyses were used to explore the major factors influencing codon usage bias patterns. The numbers of hydrogen bonds and skews were used to analyze the GC trend in the 5'-ends of the coding sequences. The results showed that minor differences in the codon usage bias patterns were revealed by the ENC plot, Parity rule 2 and correspondence analyses. The analyses of the CG-skew and the number of hydrogen bonds showed a declining trend in the number of cytosines at the 5'-ends of the CDSs (from the 5'-ends to the 3'-ends), indicating that GC may play a major role in codon usage bias. In addition, our results laid a foundation for the study of codon usage bias patterns in Brachypodium genus and suggested that the GC plays a major role in determining these patterns.
Collapse
Affiliation(s)
- Jianyong Wang
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| | - Yujing Lin
- Shanghai Center for Plant Stress Biology and Center for Excellence in Molecular Plant Sciences, University of Chinese Academy of Sciences, Shanghai 200032, China;
| | - Mengli Xi
- Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education, Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
| |
Collapse
|
48
|
Gupta S, Singh R. Comparative study of codon usage profiles of Zingiber officinale and its associated fungal pathogens. Mol Genet Genomics 2021; 296:1121-1134. [PMID: 34181071 DOI: 10.1007/s00438-021-01808-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 06/22/2021] [Indexed: 01/08/2023]
Abstract
Codon usage bias influences the genetic features prevalent in genomes of all the organisms. It also plays a crucial role in establishing the host-pathogen relationship. The present study elucidates the role of codon usage pattern regarding the predilection of fungal pathogens Aspergillus flavus, Aspergillus niger, Fusarium oxysporum and Colletotrichum gloeosporioides towards host plant Zingiber officinale. We found a similar trend of codon usage pattern operative in plant and fungal pathogens. This concurrence might be attributed for the colonization of fungal pathogens in Z. officinale. The transcriptome of both plant and pathogens showed bias towards GC-ending codons. Natural selection and mutational pressure seem to be accountable for shaping the codon usage pattern of host and pathogen. We also identified some distinctive preferred codons in A. flavus, F. oxysporum and Z. officinale that could be regarded as signature codons for the identification of these organisms. Knowledge of favored, avoided and unique codons will help to devise strategies for reducing spice losses due to fungal pathogens.
Collapse
Affiliation(s)
- Suruchi Gupta
- Plant Biotechnology Division, CSIR-Indian Institute of Integrative Medicine, Jammu, 180001, India
| | - Ravail Singh
- Plant Biotechnology Division, CSIR-Indian Institute of Integrative Medicine, Jammu, 180001, India.
- Academy of Scientific and Innovative Research (AcSIR), Jammu, 180001, India.
- DZMB Senckenberg am Meer, Wilhelmshaven, Germany.
| |
Collapse
|
49
|
Bordoloi H, Nirmala SR. Codon usage bias analysis of genes linked with esophagus cancer. Bioinformation 2021; 17:731-740. [PMID: 35540696 PMCID: PMC9049095 DOI: 10.6026/97320630017731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Revised: 08/30/2021] [Accepted: 08/30/2021] [Indexed: 11/23/2022] Open
Abstract
Esophageal cancer involves multiple genetic alternations. A systematic codon usage bias analysis was completed to investigate the bias among the esophageal cancer responsive genes. GC-rich genes were low (average effective number of codon value was 49.28). CAG and GTA are over-represented and under-represented codons, respectively. Correspondence analysis, neutrality plot, and parity rule 2 plot analysis confirmed the dominance over mutation pressure in modulating the codon usage pattern of genes linked with esophageal cancer.
Collapse
Affiliation(s)
- Hemashree Bordoloi
- Deptartment of Electronics and Communication Engineering, Gauhati University, Assam, Indi
- Department of Electronics and Communication Engineering, Assam Don Bosco University, Assam, India
| | - SR Nirmala
- School of Electronics and Communication Engineering, KLE Technological University, Karnataka, India
| |
Collapse
|
50
|
Complete Chloroplast Genome Sequence of Fortunella venosa (Champ. ex Benth.) C.C.Huang (Rutaceae): Comparative Analysis, Phylogenetic Relationships, and Robust Support for Its Status as an Independent Species. FORESTS 2021. [DOI: 10.3390/f12080996] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Fortunella venosa (Rutaceae) is an endangered species endemic to China and its taxonomic status has been controversial. The genus Fortunella contains a variety of important economic plants with high value in food, medicine, and ornamental. However, the placement of Genus Fortunella into Genus Citrus has led to controversy on its taxonomy and Systematics. In this present research, the Chloroplast genome of F. venosa was sequenced using the second-generation sequencing, and its structure and phylogenetic relationship analyzed. The results showed that the Chloroplast genome size of F. venosa was 160,265 bp, with a typical angiosperm four-part ring structure containing a large single copy region (LSC) (87,597 bp), a small single copy region (SSC) (18,732 bp), and a pair of inverted repeat regions (IRa\IRb) (26,968 bp each). There are 134 predicted genes in Chloroplast genome, including 89 protein-coding genes, 8 rRNAs, and 37 tRNAs. The GC-content of the whole Chloroplast genome was 43%, with the IR regions having a higher GC content than the LSC and the SSC regions. There were no rearrangements present in the Chloroplast genome; however, the IR regions showed obvious contraction and expansion. A total of 108 simple sequence repeats (SSRs) were present in the entire chloroplast genome and the nucleotide polymorphism was high in LSC and SSC. In addition, there is a preference for codon usage with the non-coding regions being more conserved than the coding regions. Phylogenetic analysis showed that species of Fortunella are nested in the genus of Citrus and the independent species status of F. venosa is supported robustly, which is significantly different from F. japonica. These findings will help in the development of DNA barcodes that can be useful in the study of the systematics and evolution of the genus Fortunella and the family Rutaceae.
Collapse
|