1
|
Kurmi A, Sen P, Dash M, Ray SK, Satapathy SS. Differentially used codons among essential genes in bacteria identified by machine learning-based analysis. Mol Genet Genomics 2024; 299:72. [PMID: 39060647 DOI: 10.1007/s00438-024-02163-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024]
Abstract
Codon usage bias (CUB), the uneven usage of synonymous codons encoding the same amino acid, differs among genes within and across bacteria genomes. CUB is known to be influenced by gene expression and accordingly, CUB differs between the high-expression and low-expression genes in several bacteria. In this article, we have extended codon usage study considering gene essentiality as a feature. Using machine learning (ML) based approaches, we have analysed Relative Synonymous Codon Usage (RSCU) values between essential and non-essential genes in Escherichia coli and thirty-four other bacterial genomes whose gene essentiality features were available in public databases. We observed significant differences in codon usage patterns between essential and non-essential genes for majority of the bacterial genomes and accordingly, ML based classifiers achieved high area under curve (AUC) scores, with a minimum score of 70.0 across twenty-eight organisms. Further, importance of the codons towards classifying genes found to differ among the codons in each genome. Arg codon CGT and Gly codon GGT were observed to be the most preferred codons among essential genes in Escherichia coli. Interestingly, some of the codons like CGT, ATA, GGT and GGG observed to be contributing consistently towards classifying essential genes across thirty-five bacteria genomes studied. In other hand, codons TGY and CAY encoding amino acids Cys and His respectively were among the least contributing codons towards classification among all these bacteria. This study demonstrates the gene essentiality based differences in synonymous codon usage in bacteria genomes and presents a common codon usage pattern across bacteria.
Collapse
Affiliation(s)
- Annushree Kurmi
- Department of Computer Science and Engineering, Tezpur University, Napaam, Assam, 784028, India
- Department of Computer Science and Engineering, The Assam Kaziranga University, Jorhat, Assam, 785006, India
| | - Piyali Sen
- Department of Computer Science and Engineering, Tezpur University, Napaam, Assam, 784028, India
| | - Madhusmita Dash
- Department of Electronics and Communication Engineering, NIT, Jote, Arunachal Pradesh, 791113, India
| | - Suvendra Kumar Ray
- Department of Molecular Biology and Biotechnology, Tezpur University, Napaam, Assam, 784028, India
| | | |
Collapse
|
2
|
Gurjar P, Khan AA, Alanazi AM, Vasil'ev VG, Zouganelis G, Alexiou A. Molecular Dissection of Herpes Simplex Virus Type 1 to Elucidate Molecular Mechanisms Behind Latency and Comparison of Its Codon Usage Patterns with Genes Modulated During Alzheimer's Disease as a Part of Host-Pathogen Interaction. J Alzheimers Dis 2024; 97:1111-1123. [PMID: 38306057 DOI: 10.3233/jad-231083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
BACKGROUND Herpes simplex virus type 1 (HSV-1) is associated with Alzheimer's disease, which goes into a cycle of latency and reactivation. The present study was envisaged to understand the reasons for latency and specific molecular patterns present in the HSV-1. OBJECTIVE The objective is the molecular dissection of Herpes simplex virus type 1 to elucidate molecular mechanisms behind latency and compare its codon usage patterns with genes modulated during Alzheimer's disease as a part of host-pathogen interaction. METHODS In the present study, we tried to investigate the potential reasons for the latency of HSV-1 virus bioinformatically by determining the CpG patterns. Also, we investigated the codon usage pattern, the presence of rare codons, codon context, and protein properties. RESULTS The top 222 codon pairs graded based on their frequency in the HSV-1 genome revealed that with only one exception (CUG-UUU), all other codon pairs have codons ending with G/C. Considering it an extension of host-pathogen interaction, we compared HSV-1 codon usage with that of codon usage of genes modulated during Alzheimer's disease, and we found that CGT and TTT are only two codons that exhibited similar codon usage patterns and other codons showed statistically highly significant different codon preferences. Dinucleotide CpG tends to mutate to TpG, suggesting the presence of mutational forces and the imperative role of CpG methylation in HSV-1 latency. CONCLUSIONS Upon comparison of codon usage between HSV-1 and Alzheimer's disease genes, no similarities in codon usage were found as a part of host-pathogen interaction. CpG methylation plays an imperative role in latency HSV-1.
Collapse
Affiliation(s)
- Pankaj Gurjar
- Centre for Global Health Research, Saveetha Medical College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia
| | - Azmat Ali Khan
- Department of Pharmaceutical Chemistry, Pharmaceutical Biotechnology Laboratory, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Amer M Alanazi
- Department of Pharmaceutical Chemistry, Pharmaceutical Biotechnology Laboratory, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | | | - George Zouganelis
- School of Human Sciences, College of Life and Natural Sciences, University of Derby, Derby, UK
| | - Athanasios Alexiou
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia
- AFNP Med, Vienna, Austria
| |
Collapse
|
3
|
Cornet F, Blanchais C, Dusfour-Castan R, Meunier A, Quebre V, Sekkouri Alaoui H, Boudsoq F, Campos M, Crozat E, Guynet C, Pasta F, Rousseau P, Ton Hoang B, Bouet JY. DNA Segregation in Enterobacteria. EcoSal Plus 2023; 11:eesp00382020. [PMID: 37220081 PMCID: PMC10729935 DOI: 10.1128/ecosalplus.esp-0038-2020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 04/13/2023] [Indexed: 01/28/2024]
Abstract
DNA segregation ensures that cell offspring receive at least one copy of each DNA molecule, or replicon, after their replication. This important cellular process includes different phases leading to the physical separation of the replicons and their movement toward the future daughter cells. Here, we review these phases and processes in enterobacteria with emphasis on the molecular mechanisms at play and their controls.
Collapse
Affiliation(s)
- François Cornet
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Corentin Blanchais
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Romane Dusfour-Castan
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Alix Meunier
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Valentin Quebre
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Hicham Sekkouri Alaoui
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - François Boudsoq
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Manuel Campos
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Estelle Crozat
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Catherine Guynet
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Franck Pasta
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Philippe Rousseau
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Bao Ton Hoang
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| | - Jean-Yves Bouet
- Laboratoire de Microbiologie et Génétique Moléculaires (LMGM), Centre de Biologie Intégrative (CBI), CNRS, Université de Toulouse, Toulouse, France
| |
Collapse
|
4
|
Sophiarani Y, Chakraborty S. Synonymous sites for accessibility around microRNA binding sites in bacterial spot and speck disease resistance genes of tomato. Funct Integr Genomics 2023; 23:247. [PMID: 37468805 DOI: 10.1007/s10142-023-01178-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 05/15/2023] [Accepted: 07/13/2023] [Indexed: 07/21/2023]
Abstract
The major causes of mass tomato infections in both covered and open ground are agents of bacterial spot and bacterial speck diseases. MicroRNAs (miRNAs) are 16-21 nucleotides in length, non-coding RNAs that inhibit translation and trigger mRNA degradation. MiRNAs play a significant part in plant resistance to abiotic and biotic stresses by mediating gene regulation via post-transcriptional RNA silencing. In this study, we analyzed a collection of bacterial resistance genes of tomato and their binding sites for tomato miRNAs and Pseudomonas syringe pv. tomato miRNAs. Our study found that two genes, bacterial spot disease resistance gene (Bs4) and bacterial speck disease resistance gene (Prf), have a 7mer-m8 perfect seed match with miRNAs. Bs4 was targeted by one tomato miRNA (sly-miR9470-3p) and three Pseudomonas syringe pv. tomato miRNAs (PSTJ4_3p_27246, PSTJ4_3p_27246, and PSTJ4_3p_27246). Again, Prf gene was found to be targeted by two tomato miRNAs namely, sly-miR9469-5p and sly-miR9474-3p. The accessibility of the miRNA-target site and its flanking regions and the relationship between relative synonymous codon usage and tRNAs were compared. Strong access to miRNA targeting regions and decreased rate of translations suggested that miRNAs might be efficient in binding to their particular targets. We also found the existence of rare codons, which suggests that it could enhance miRNA targeting even more. The codon usage pattern analysis of the two genes revealed that both were AT-rich (Bs4 = 63.2%; Prf = 60.8%). We found a low codon usage bias in both genes, suggesting that selective restriction might regulate them. The silencing property of miRNAs would allow researchers to discover the involvement of plant miRNAs in pathogen invasion. However, the efficient validation of direct targets of miRNAs is an urgent need that might be highly beneficial in enhancing plant resistance to multiple pathogenic diseases.
Collapse
Affiliation(s)
- Yengkhom Sophiarani
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India.
| |
Collapse
|
5
|
Saha J, Dey S, Pal A. Whole genome sequencing and comparative genomic analyses of Pseudomonas aeruginosa strain isolated from arable soil reveal novel insights into heavy metal resistance and codon biology. Curr Genet 2022; 68:481-503. [PMID: 35763098 DOI: 10.1007/s00294-022-01245-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/14/2022] [Accepted: 06/06/2022] [Indexed: 11/03/2022]
Abstract
Elevated concentration of non-essential persistent heavy metals and metalloids in the soil is detrimental to essential soil microbes and plants, resulting in diminished diversity and biomass. Thus, isolation, screening, and whole genomic analysis of potent strains of bacteria from arable lands with inherent capabilities of heavy metal resistance and plant growth promotion hold the key for bio remedial applications. This study is an attempt to do the same. In this study, a potent strain of Pseudomonas aeruginosa was isolated from paddy fields, followed by metabolic profiling using FTIR, metal uptake analysis employing ICP-MS, whole genome sequencing and comparative codon usage analysis. ICP-MS study provided insights into a high degree of Cd uptake during the exponential phase of growth under cumulative metal stress to Cd, Zn and Co, which was further corroborated by the detection of cadA gene along with czcCBA operon in the genome upon performing whole-genome sequencing. This potent strain of Pseudomonas aeruginosa also harboured genes, such as copA, chrA, znuA, mgtE, corA, and others conferring resistance against different heavy metals, such as Cd, Zn, Co, Cu, Cr, etc. A comparative codon usage bias analysis at the genomic and genic level, whereby several heavy metal resistant genes were considered in the backdrop of two housekeeping genes among 40 Pseudomonas spp. indicated the presence of a relatively strong codon usage bias in the studied strain. With this work, an effort was made to explore heavy metal-resistant bacteria (isolated from arable soil) and whole genome sequence analysis to get insight into metal resistance for future bio remedial applications.
Collapse
Affiliation(s)
- Jayanti Saha
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India
| | - Sourav Dey
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India
| | - Ayon Pal
- Microbiology and Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, West Bengal, 733134, India.
| |
Collapse
|
6
|
Chakraborty S, Nath D. A Study on microRNAs Targeting the Genes Overexpressed in Lung Cancer and their Codon Usage Patterns. Mol Biotechnol 2022; 64:1095-1119. [DOI: 10.1007/s12033-022-00491-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Accepted: 04/04/2022] [Indexed: 10/18/2022]
|
7
|
Silencing lung cancer genes using miRNAs identified by 7mer-seed matching. Comput Biol Chem 2021; 92:107483. [PMID: 33932780 DOI: 10.1016/j.compbiolchem.2021.107483] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Revised: 03/19/2021] [Accepted: 04/03/2021] [Indexed: 12/20/2022]
Abstract
Lung cancer (LC) is the main cause of cancer-associated deaths in both men and women globally with a very high mortality rate. The microRNAs (miRNAs) are a class of noncoding RNAs consisting of 18-25 nucleotides. They inhibit translation of protein through binding to complementary target mRNAs. The non-coding miRNAs are recognized as potent biomarkers for detection, development and treatment of malignancy. In this study, we screened a set of 12 genes over expressed in small cell lung cancer, non small cell lung cancer and the genes involved in both categories and their binding sites for human miRNAs as no work was reported yet. Screening of human miRNAs revealed that a few genes showed numerous miRNA binding sites. Free energy values of mRNA sequences revealed that they might acquire compact folded structure causing complexity for miRNAs to interact. GC content in the target site was relatively higher than that of their flanks. It was observed through analysis of cosine similarity metric and compAI parameters that the genes related to lung cancer were encoded with non optimal codons and thus might be translationally less efficient for producing polypeptides. Gene ontology analysis was carried out to understand the diverse functions of these 12 genes.
Collapse
|
8
|
Riley LW. Distinguishing Pathovars from Nonpathovars: Escherichia coli. Microbiol Spectr 2020; 8:10.1128/microbiolspec.ame-0014-2020. [PMID: 33385193 PMCID: PMC10773148 DOI: 10.1128/microbiolspec.ame-0014-2020] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Indexed: 02/07/2023] Open
Abstract
Escherichia coli is one of the most well-adapted and pathogenically versatile bacterial organisms. It causes a variety of human infections, including gastrointestinal illnesses and extraintestinal infections. It is also part of the intestinal commensal flora of humans and other mammals. Groups of E. coli that cause diarrhea are often described as intestinal pathogenic E. coli (IPEC), while those that cause infections outside of the gut are called extraintestinal pathogenic E. coli (ExPEC). IPEC can cause a variety of diarrheal illnesses as well as extraintestinal syndromes such as hemolytic-uremic syndrome. ExPEC cause urinary tract infections, bloodstream infection, sepsis, and neonatal meningitis. IPEC and ExPEC have thus come to be referred to as pathogenic variants of E. coli or pathovars. While IPEC can be distinguished from commensal E. coli based on their characteristic virulence factors responsible for their associated clinical manifestations, ExPEC cannot be so easily distinguished. IPEC most likely have reservoirs outside of the human intestine but it is unclear if ExPEC represent nothing more than commensal E. coli that breach a sterile barrier to cause extraintestinal infections. This question has become more complicated by the advent of whole genome sequencing (WGS) that has raised a new question about the taxonomic characterization of E. coli based on traditional clinical microbiologic and phylogenetic methods. This review discusses how molecular epidemiologic approaches have been used to address these questions, and how answers to these questions may contribute to our better understanding of the epidemiology of infections caused by E. coli. *This article is part of a curated collection.
Collapse
Affiliation(s)
- Lee W Riley
- Division of Infectious Diseases and Vaccinology, School of Public Health, University of California, Berkeley, CA 94720
| |
Collapse
|
9
|
Saha J, Saha BK, Pal Sarkar M, Roy V, Mandal P, Pal A. Comparative Genomic Analysis of Soil Dwelling Bacteria Utilizing a Combinational Codon Usage and Molecular Phylogenetic Approach Accentuating on Key Housekeeping Genes. Front Microbiol 2019; 10:2896. [PMID: 31921071 PMCID: PMC6928123 DOI: 10.3389/fmicb.2019.02896] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 12/02/2019] [Indexed: 01/02/2023] Open
Abstract
Soil is a diversified and complex ecological niche, home to a myriad of microorganisms particularly bacteria. The physico-chemical complexities of soil results in a plethora of physiological variations to exist within the different types of soil dwelling bacteria, giving rise to a wide variation in genome structure and complexity. This serves as an attractive proposition to analyze and compare the genome of a large number soil bacteria to comprehend their genome complexity and evolution. In this study a combination of codon usage and molecular phylogenetics of the whole genome and key housekeeping genes like infB (translation initiation factor 2), trpB (tryptophan synthase, beta subunit), atpD (ATP synthase, beta subunit), and rpoB (RNA polymerase, beta subunit) of 92 soil bacterial species spread across the entire eubacterial domain and residing in different soil types was performed. The results indicated the direct relationship of genome size with codon bias and coding frequency in the studied bacteria. The codon usage profile demonstrated by the gene trpB was found to be relatively different from the rest of the housekeeping genes with a large number of bacteria having a greater percentage of genes with Nc values less than the Nc of trpB. The results from the overall codon usage bias profile also depicted that the codon usage bias in the key housekeeping genes of soil bacteria was majorly due to selectional pressure and not mutation. The analysis of hydrophobicity of the gene product encoded by the rpoB coding sequences demonstrated tight clustering across all the soil bacteria suggesting conservation of protein structure for maintenance of form and function. The phylogenetic affinities inferred using 16S rRNA gene and the housekeeping genes demonstrated conflicting signals with trpB gene being the noisiest one. The housekeeping gene atpD was found to depict the least amount of evolutionary change in the soil bacteria considered in this study except in two Clostridium species. The phylogenetic and codon usage analysis of the soil bacteria consistently demonstrated the relatedness of Azotobacter chroococcum with different species of the genus Pseudomonas.
Collapse
Affiliation(s)
- Jayanti Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Barnan K. Saha
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Monalisha Pal Sarkar
- Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Vivek Roy
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Parimal Mandal
- Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| | - Ayon Pal
- Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj, India
| |
Collapse
|
10
|
Yang ZK, Luo H, Zhang Y, Wang B, Gao F. Pan-genomic analysis provides novel insights into the association of E.coli with human host and its minimal genome. Bioinformatics 2018; 35:1987-1991. [DOI: 10.1093/bioinformatics/bty938] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 11/06/2018] [Accepted: 11/08/2018] [Indexed: 11/13/2022] Open
Affiliation(s)
- Zhi-Kai Yang
- Department of Physics, School of Science
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China
- SinoGenoMax Co., Ltd./Chinese National Human Genome Center, Beijing, China
| | - Hao Luo
- Department of Physics, School of Science
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China
| | - Yanming Zhang
- SinoGenoMax Co., Ltd./Chinese National Human Genome Center, Beijing, China
| | - Baijing Wang
- SinoGenoMax Co., Ltd./Chinese National Human Genome Center, Beijing, China
| | - Feng Gao
- Department of Physics, School of Science
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China
| |
Collapse
|