Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: de Avila e Silva S, Echeverrigaray S, Gerhardt GJ. BacPP: Bacterial promoter prediction—A tool for accurate sigma-factor specific assignment in enterobacteria. J Theor Biol 2011;287:92-9. [DOI: 10.1016/j.jtbi.2011.07.017] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2010] [Revised: 05/20/2011] [Accepted: 07/21/2011] [Indexed: 10/17/2022]

For:	de Avila e Silva S, Echeverrigaray S, Gerhardt GJ. BacPP: Bacterial promoter prediction—A tool for accurate sigma-factor specific assignment in enterobacteria. J Theor Biol 2011;287:92-9. [DOI: 10.1016/j.jtbi.2011.07.017] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2010] [Revised: 05/20/2011] [Accepted: 07/21/2011] [Indexed: 10/17/2022]

Number

Cited by Other Article(s)

Kader Chowdhury QMM, Islam S, Narayanan L, Ogunleye SC, Wang S, Thu D, Freitag NE, Lawrence ML, Abdelhamed H. An insight into the role of branched-chain α-keto acid dehydrogenase (BKD) complex in branched-chain fatty acid biosynthesis and virulence of Listeria monocytogenes. J Bacteriol 2024;206:e0003324. [PMID: 38899896 PMCID: PMC11270904 DOI: 10.1128/jb.00033-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 05/31/2024] [Indexed: 06/21/2024] Open

Abstract

Listeria monocytogenes is a foodborne bacterial pathogen that causes listeriosis. Positive regulatory factor A (PrfA) is a pleiotropic master activator of virulence genes of L. monocytogenes that becomes active upon the entry of the bacterium into the cytosol of infected cells. L. monocytogenes can survive and multiply at low temperatures; this is accomplished through the maintenance of appropriate membrane fluidity via branched-chain fatty acid (BCFA) synthesis. Branched-chain α-keto acid dehydrogenase (BKD), which is composed of four polypeptides encoded by lpd, bkdA1, bkdA2, and bkdB, is known to play a vital role in BCFA biosynthesis. Here, we constructed BKD-deficient Listeria strains by in-frame deletion of lpd, bkdA1, bkdA2, and bkdB genes. To determine the role in in vivo and in vitro, mouse model challenges, plaque assay in murine L2 fibroblast, and intracellular replication in J744A.1 macrophage were conducted. BKD-deficient strains exhibited defects in BCFA composition, virulence, and PrfA-regulon function within the host cells. Transcriptomics analysis revealed that the transcript level of the PrfA-regulon was lower in ΔbkdA1 strain than those in the wild-type. This study demonstrates that L. monocytogenes strains lacking BKD complex components were defective in PrfA-regulon function, and full activation of wild-type prfA may not occur within host cells in the absence of BKD. Further study will investigate the consequences of BKD deletion on PrfA function through altering BCFA catabolism.IMPORTANCEListeria monocytogenes is the causative agent of listeriosis, a disease with a high mortality rate. In this study, we have shown that the deletion of BKD can impact the function of PrfA and the PrfA-regulon. The production of virulence proteins within host cells is necessary for L. monocytogenes to promote its intracellular survival and is likely dependent on membrane integrity. We thus report a link between L. monocytogenes membrane integrity and the function of PrfA. This knowledge will increase our understanding of L. monocytogenes pathogenesis, which may provide insight into the development of antimicrobial agents.

Collapse

Dulyayangkul P, Sealey JE, Lee WWY, Satapoomin N, Reding C, Heesom KJ, Williams PB, Avison MB. Improving nitrofurantoin resistance prediction in Escherichia coli from whole-genome sequence by integrating NfsA/B enzyme assays. Antimicrob Agents Chemother 2024;68:e0024224. [PMID: 38767379 PMCID: PMC11232377 DOI: 10.1128/aac.00242-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 04/13/2024] [Indexed: 05/22/2024] Open

Zheng L, Shen J, Chen R, Hu Y, Zhao W, Leung ELH, Dai L. Genome engineering of the human gut microbiome. J Genet Genomics 2024;51:479-491. [PMID: 38218395 DOI: 10.1016/j.jgg.2024.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Revised: 01/02/2024] [Accepted: 01/03/2024] [Indexed: 01/15/2024]

Paul S, Olymon K, Martinez GS, Sarkar S, Yella VR, Kumar A. MLDSPP: Bacterial Promoter Prediction Tool Using DNA Structural Properties with Machine Learning and Explainable AI. J Chem Inf Model 2024;64:2705-2719. [PMID: 38258978 DOI: 10.1021/acs.jcim.3c02017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]

Abstract

Bacterial promoters play a crucial role in gene expression by serving as docking sites for the transcription initiation machinery. However, accurately identifying promoter regions in bacterial genomes remains a challenge due to their diverse architecture and variations. In this study, we propose MLDSPP (Machine Learning and Duplex Stability based Promoter prediction in Prokaryotes), a machine learning-based promoter prediction tool, to comprehensively screen bacterial promoter regions in 12 diverse genomes. We leveraged biologically relevant and informative DNA structural properties, such as DNA duplex stability and base stacking, and state-of-the-art machine learning (ML) strategies to gain insights into promoter characteristics. We evaluated several machine learning models, including Support Vector Machines, Random Forests, and XGBoost, and assessed their performance using accuracy, precision, recall, specificity, F1 score, and MCC metrics. Our findings reveal that XGBoost outperformed other models and current state-of-the-art promoter prediction tools, namely Sigma70pred and iPromoter2L, achieving F1-scores >95% in most systems. Significantly, the use of one-hot encoding for representing nucleotide sequences complements these structural features, enhancing our XGBoost model's predictive capabilities. To address the challenge of model interpretability, we incorporated explainable AI techniques using Shapley values. This enhancement allows for a better understanding and interpretation of the predictions of our model. In conclusion, our study presents MLDSPP as a novel, generic tool for predicting promoter regions in bacteria, utilizing original downstream sequences as nonpromoter controls. This tool has the potential to significantly advance the field of bacterial genomics and contribute to our understanding of gene regulation in diverse bacterial systems.

Collapse

Yang G, Li J, Hu J, Shi JY. Recognition of cyanobacteria promoters via Siamese network-based contrastive learning under novel non-promoter generation. Brief Bioinform 2024;25:bbae193. [PMID: 38701419 PMCID: PMC11066903 DOI: 10.1093/bib/bbae193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/08/2024] [Accepted: 04/05/2024] [Indexed: 05/05/2024] Open

Martinez GS, Perez-Rueda E, Kumar A, Dutt M, Maya CR, Ledesma-Dominguez L, Casa PL, Kumar A, de Avila e Silva S, Kelvin DJ. CDBProm: the Comprehensive Directory of Bacterial Promoters. NAR Genom Bioinform 2024;6:lqae018. [PMID: 38385146 PMCID: PMC10880602 DOI: 10.1093/nargab/lqae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/12/2024] [Accepted: 01/29/2024] [Indexed: 02/23/2024] Open

Affiliation(s)

Gustavo Sganzerla Martinez Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Ernesto Perez-Rueda Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autonóma de México, Unidad Académica del Estado de Yucatán, Mérida 97302, Yucatán, Mexico
Anuj Kumar Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Mansi Dutt Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Cinthia Rodríguez Maya Facultad de Ciencias e Ingeniería, Universidad Nacional Autonoma de Mexico, Mexico City 04510, Mexico
Leonardo Ledesma-Dominguez Instituto de Investigaciones en Matematicas Aplicadas y en Sistemas, Universidad Nacional Autonoma de Mexico, Mexico City 04510, Mexico
Pedro Lenz Casa Biotechnology Institute, Universidade de Caxias do Sul, Caxias do Sul, Rio Grande do Sul 95070-560, Brazil
Aditya Kumar Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
Scheila de Avila e Silva Biotechnology Institute, Universidade de Caxias do Sul, Caxias do Sul, Rio Grande do Sul 95070-560, Brazil
David J Kelvin Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada

Collapse

Aguilar-Carrillo Y, Soto-Urzúa L, Martínez-Martínez MDLÁ, Becerril-Ramírez M, Martínez-Morales LJ. Computational Analysis of the Tripartite Interaction of Phasins (PhaP4 and 5)-Sigma Factor (σ²⁴)-DNA of Azospirillum brasilense Sp7. Polymers (Basel) 2024;16:611. [PMID: 38475295 DOI: 10.3390/polym16050611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 02/04/2024] [Accepted: 02/07/2024] [Indexed: 03/14/2024] Open

Park JH, Lee S, Shin E, Abdi Nansa S, Lee SJ. The Transposition of Insertion Sequences in Sigma-Factor- and LysR-Deficient Mutants of Deinococcus geothermalis. Microorganisms 2024;12:328. [PMID: 38399731 PMCID: PMC10892881 DOI: 10.3390/microorganisms12020328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 01/29/2024] [Accepted: 02/01/2024] [Indexed: 02/25/2024] Open

Ligeti B, Szepesi-Nagy I, Bodnár B, Ligeti-Nagy N, Juhász J. ProkBERT family: genomic language models for microbiome applications. Front Microbiol 2024;14:1331233. [PMID: 38282738 PMCID: PMC10810988 DOI: 10.3389/fmicb.2023.1331233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 12/11/2023] [Indexed: 01/30/2024] Open

Abstract

Background

In the evolving landscape of microbiology and microbiome analysis, the integration of machine learning is crucial for understanding complex microbial interactions, and predicting and recognizing novel functionalities within extensive datasets. However, the effectiveness of these methods in microbiology faces challenges due to the complex and heterogeneous nature of microbial data, further complicated by low signal-to-noise ratios, context-dependency, and a significant shortage of appropriately labeled datasets. This study introduces the ProkBERT model family, a collection of large language models, designed for genomic tasks. It provides a generalizable sequence representation for nucleotide sequences, learned from unlabeled genome data. This approach helps overcome the above-mentioned limitations in the field, thereby improving our understanding of microbial ecosystems and their impact on health and disease.

Methods

ProkBERT models are based on transfer learning and self-supervised methodologies, enabling them to use the abundant yet complex microbial data effectively. The introduction of the novel Local Context-Aware (LCA) tokenization technique marks a significant advancement, allowing ProkBERT to overcome the contextual limitations of traditional transformer models. This methodology not only retains rich local context but also demonstrates remarkable adaptability across various bioinformatics tasks.

Results

In practical applications such as promoter prediction and phage identification, the ProkBERT models show superior performance. For promoter prediction tasks, the top-performing model achieved a Matthews Correlation Coefficient (MCC) of 0.74 for E. coli and 0.62 in mixed-species contexts. In phage identification, ProkBERT models consistently outperformed established tools like VirSorter2 and DeepVirFinder, achieving an MCC of 0.85. These results underscore the models' exceptional accuracy and generalizability in both supervised and unsupervised tasks.

Conclusions

The ProkBERT model family is a compact yet powerful tool in the field of microbiology and bioinformatics. Its capacity for rapid, accurate analyses and its adaptability across a spectrum of tasks marks a significant advancement in machine learning applications in microbiology. The models are available on GitHub (https://github.com/nbrg-ppcu/prokbert) and HuggingFace (https://huggingface.co/nerualbioinfo) providing an accessible tool for the community.

Collapse

Lechtenberg T, Wynands B, Wierckx N. Engineering 5-hydroxymethylfurfural (HMF) oxidation in Pseudomonas boosts tolerance and accelerates 2,5-furandicarboxylic acid (FDCA) production. Metab Eng 2024;81:262-272. [PMID: 38154655 DOI: 10.1016/j.ymben.2023.12.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 12/12/2023] [Accepted: 12/21/2023] [Indexed: 12/30/2023]

Zhu Q, Bai X, Li Q, Zhang M, Hu G, Pan K, Liu H, Ke Z, Hong Q, Qiu J. PcaR, a GntR/FadR Family Transcriptional Repressor Controls the Transcription of Phenazine-1-Carboxylic Acid 1,2-Dioxygenase Gene Cluster in Sphingomonas histidinilytica DS-9. Appl Environ Microbiol 2023;89:e0212122. [PMID: 37191535 PMCID: PMC10304782 DOI: 10.1128/aem.02121-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 04/29/2023] [Indexed: 05/17/2023] Open

Abstract

In our previous study, the phenazine-1-carboxylic acid (PCA) 1,2-dioxygenase gene cluster (pcaA1A2A3A4 cluster) in Sphingomonas histidinilytica DS-9 was identified to be responsible for the conversion of PCA to 1,2-dihydroxyphenazine (Ren Y, Zhang M, Gao S, Zhu Q, et al. 2022. Appl Environ Microbiol 88:e00543-22). However, the regulatory mechanism of the pcaA1A2A3A4 cluster has not been elucidated yet. In this study, the pcaA1A2A3A4 cluster was found to be transcribed as two divergent operons: pcaA3-ORF5205 (named A3-5205 operon) and pcaA1A2-ORF5208-pcaA4-ORF5210 (named A1-5210 operon). The promoter regions of the two operons were overlapped. PcaR acts as a transcriptional repressor of the pcaA1A2A3A4 cluster, and it belongs to GntR/FadR family transcriptional regulator. Gene disruption of pcaR can shorten the lag phase of PCA degradation. The results of electrophoretic mobility shift assay and DNase I footprinting showed that PcaR binds to a 25-bp motif in the ORF5205-pcaA1 intergenic promoter region to regulate the expression of two operons. The 25-bp motif covers the -10 region of the promoter of A3-5205 operon and the -35 region and -10 region of the promoter of A1-5210 operon. The TNGT/ANCNA box within the motif was essential for PcaR binding to the two promoters. PCA acted as an effector of PcaR, preventing it from binding to the promoter region and repressing the transcription of the pcaA1A2A3A4 cluster. In addition, PcaR represses its own transcription, and this repression can be relieved by PCA. This study reveals the regulatory mechanism of PCA degradation in strain DS-9, and the identification of PcaR increases the variety of regulatory model of the GntR/FadR-type regulator. IMPORTANCE Sphingomonas histidinilytica DS-9 is a phenazine-1-carboxylic acid (PCA)-degrading strain. The 1,2-dioxygenase gene cluster (pcaA1A2A3A4 cluster, encoding dioxygenase PcaA1A2, reductase PcaA3, and ferredoxin PcaA4) is responsible for the initial degradation step of PCA and widely distributed in Sphingomonads, but its regulatory mechanism has not been investigated yet. In this study, a GntR/FadR-type transcriptional regulator PcaR repressing the transcription of pcaA1A2A3A4 cluster and pcaR gene was identified and characterized. The binding site of PcaR in ORF5205-pcaA1 intergenic promoter region contains a TNGT/ANCNA box, which is important for the binding. These findings enhance our understanding of the molecular mechanism of PCA degradation.

Collapse

Affiliation(s)

Qian Zhu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Xuekun Bai Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Qian Li Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Mingliang Zhang Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Gang Hu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Kaihua Pan Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Hongfei Liu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Zhijian Ke School of Biological and Chemical Engineering, Ningbo Tech University, Ningbo, Zhejiang, People’s Republic of China
Qing Hong Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China
Jiguo Qiu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, Jiangsu, People’s Republic of China

Collapse

Sharma D, Sharma K, Mishra A, Siwach P, Mittal A, Jayaram B. Molecular dynamics simulation-based trinucleotide and tetranucleotide level structural and energy characterization of the functional units of genomic DNA. Phys Chem Chem Phys 2023;25:7323-7337. [PMID: 36825435 DOI: 10.1039/d2cp04820e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]

Xu SQ, Wang X, Xu L, Wang KX, Jiang YH, Zhang FY, Hong Q, He J, Liu SJ, Qiu JG. The MocR family transcriptional regulator DnfR has multiple binding sites and regulates Dirammox gene transcription in Alcaligenes faecalis JQ135. Environ Microbiol 2023;25:675-688. [PMID: 36527381 DOI: 10.1111/1462-2920.16318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 12/14/2022] [Indexed: 12/23/2022]

Affiliation(s)

Si-Qiong Xu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Xiao Wang Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Lu Xu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Ke-Xin Wang Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Yin-Hu Jiang Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Fu-Yin Zhang Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Qing Hong Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Jian He Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China
Shuang-Jiang Liu State Key Laboratory of Microbial Resources, and Environmental Microbiology Research Center at Institute of Microbiology, Chinese Academy of Sciences, Beijing, China State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
Ji-Guo Qiu Key Laboratory of Agricultural and Environmental Microbiology, Ministry of Agriculture and Rural Affairs, College of Life Sciences, Nanjing Agricultural University, Nanjing, China

Collapse

Explainable artificial intelligence as a reliable annotator of archaeal promoter regions. Sci Rep 2023;13:1763. [PMID: 36720898 PMCID: PMC9889792 DOI: 10.1038/s41598-023-28571-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 01/20/2023] [Indexed: 02/02/2023] Open

A Four-Step Platform to Optimize Growth Conditions for High-Yield Production of Siderophores in Cyanobacteria. Metabolites 2023;13:metabo13020154. [PMID: 36837773 PMCID: PMC9967094 DOI: 10.3390/metabo13020154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 01/16/2023] [Accepted: 01/17/2023] [Indexed: 01/22/2023] Open

Lluka T, Stokes JM. Antibiotic discovery in the artificial intelligence era. Ann N Y Acad Sci 2023;1519:74-93. [PMID: 36447334 DOI: 10.1111/nyas.14930] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

Survey of mycobacterial fluoroquinolone resistance protein conservon (mfp conservon) in Mycobacteriaceae and identification of its promoter activity. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Park SK, Mohr G, Yao J, Russell R, Lambowitz AM. Group II intron-like reverse transcriptases function in double-strand break repair. Cell 2022;185:3671-3688.e23. [PMID: 36113466 PMCID: PMC9530004 DOI: 10.1016/j.cell.2022.08.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 06/16/2022] [Accepted: 08/14/2022] [Indexed: 01/26/2023]

Coppens L, Wicke L, Lavigne R. SAPPHIRE.CNN: Implementation of dRNA-seq-driven, species-specific promoter prediction using convolutional neural networks. Comput Struct Biotechnol J 2022;20:4969-4974. [PMID: 36147675 PMCID: PMC9478156 DOI: 10.1016/j.csbj.2022.09.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 09/03/2022] [Accepted: 09/05/2022] [Indexed: 11/22/2022] Open

Chhaya A, Sharma A, Dattu Hade M, Kaur J, Dikshit KL. Transcript analysis and expression of the glbO gene, encoding truncated hemoglobin,O, of M. smegmatis implicate its role under hypoxia and oxidative stress. Gene X 2022;841:146759. [PMID: 35933051 DOI: 10.1016/j.gene.2022.146759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 07/24/2022] [Indexed: 12/12/2022] Open

Dall'Alba G, Casa PL, Abreu FPD, Notari DL, de Avila E Silva S. A Survey of Biological Data in a Big Data Perspective. BIG DATA 2022;10:279-297. [PMID: 35394342 DOI: 10.1089/big.2020.0383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Machine learning and statistics shape a novel path in archaeal promoter annotation. BMC Bioinformatics 2022;23:171. [PMID: 35538405 PMCID: PMC9087966 DOI: 10.1186/s12859-022-04714-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 05/05/2022] [Indexed: 11/29/2022] Open

Zill D, Lettau E, Lorent C, Seifert F, Singh P, Lauterbach L. Crucial role of the chaperonin GroES/EL for heterologous production of the soluble methane monooxygenase from Methylomonas methanica MC09. Chembiochem 2022;23:e202200195. [PMID: 35385600 PMCID: PMC9324122 DOI: 10.1002/cbic.202200195] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Indexed: 11/15/2022]

Ye Q, Shin E, Lee C, Choi N, Kim Y, Yoon KS, Lee SJ. Transposition of insertion sequences by dielectric barrier discharge plasma and gamma irradiation in the radiation-resistant bacterium Deinococcus geothermalis. J Microbiol Methods 2022;196:106473. [DOI: 10.1016/j.mimet.2022.106473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Revised: 04/19/2022] [Accepted: 04/19/2022] [Indexed: 12/27/2022]

Xu H, Yang C, Tian X, Chen Y, Liu WQ, Li J. Regulatory Part Engineering for High-Yield Protein Synthesis in an All-Streptomyces-Based Cell-Free Expression System. ACS Synth Biol 2022;11:570-578. [PMID: 35129330 DOI: 10.1021/acssynbio.1c00587] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zhang M, Jia C, Li F, Li C, Zhu Y, Akutsu T, Webb GI, Zou Q, Coin LJM, Song J. Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction. Brief Bioinform 2022;23:6502561. [PMID: 35021193 PMCID: PMC8921625 DOI: 10.1093/bib/bbab551] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 11/12/2021] [Accepted: 11/30/2021] [Indexed: 01/13/2023] Open

Affiliation(s)

Meng Zhang
Cangzhi Jia Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
Fuyi Li
Chen Li
Yan Zhu
Tatsuya Akutsu
Geoffrey I Webb Department of Data Science and Artificial Intelligence, Monash University, Melbourne, VIC 3800, Australia,Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
Quan Zou Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
Lachlan J M Coin Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
Jiangning Song Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:

Collapse

Bhukya R, Kumari A, Amilpur S, Dasari CM. PPred-PCKSM: A multi-layer predictor for identifying promoter and its variants using position based features. Comput Biol Chem 2022;97:107623. [DOI: 10.1016/j.compbiolchem.2022.107623] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 01/02/2022] [Accepted: 01/05/2022] [Indexed: 11/03/2022]

Current and emerging tools of computational biology to improve the detoxification of mycotoxins. Appl Environ Microbiol 2021;88:e0210221. [PMID: 34878810 DOI: 10.1128/aem.02102-21] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Chevez-Guardado R, Peña-Castillo L. Promotech: a general tool for bacterial promoter recognition. Genome Biol 2021;22:318. [PMID: 34789306 PMCID: PMC8597233 DOI: 10.1186/s13059-021-02514-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 10/11/2021] [Indexed: 12/14/2022] Open

A Genome-Scale Antibiotic Screen in Serratia marcescens Identifies YdgH as a Conserved Modifier of Cephalosporin and Detergent Susceptibility. Antimicrob Agents Chemother 2021;65:e0078621. [PMID: 34491801 DOI: 10.1128/aac.00786-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Martinez-Hernandez F, Diop A, Garcia-Heredia I, Bobay LM, Martinez-Garcia M. Unexpected myriad of co-occurring viral strains and species in one of the most abundant and microdiverse viruses on Earth. ISME JOURNAL 2021;16:1025-1035. [PMID: 34775488 DOI: 10.1038/s41396-021-01150-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 10/15/2021] [Accepted: 10/28/2021] [Indexed: 11/09/2022]

Imported One-Day-Old Chicks as Trojan Horses for Multidrug-Resistant Priority Pathogens Harboring mcr-9, rmtG and Extended-Spectrum β-Lactamase Genes. Appl Environ Microbiol 2021;88:e0167521. [PMID: 34731047 DOI: 10.1128/aem.01675-21] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

Antimicrobial resistance is a critical issue that is no longer restricted to hospital settings, but also represents a growing problem involving intensive animal production systems. In this study, we have performed a microbiological and molecular investigation of priority pathogens carrying transferable resistance genes to critical antimicrobials in one-day-old chickens imported from Brazil to Uruguay. Bacterial identification was performed by MALDI-TOF mass spectrometry and antibiotic susceptibility was determined by Sensititre. Antimicrobial resistance genes were sought by polymerase chain reaction and clonality was assessed by PFGE. Four multidrug-resistant (MDR) representative strains were sequenced by Illumina and/or Oxford Nanopore Technologies. Twenty-eight MDR isolates identified as Escherichia coli (n= 14), Enterobacter cloacae (n= 11) and Klebsiella pneumoniae (n= 3). While resistance to oxyiminocephalosporins was due to bla_CTX-M-2, bla_CTX-M-8, bla_CTX-M-15, bla_CTX-M-55 and bla_CMY-2, plasmid-mediated quinolone resistance was associated with qnrB19, qnrE1, and qnrB2 genes. Finally, resistance to aminoglycosides and fosfomycin was due to the presence of 16S rRNA methyltransferase rmtG and fosA-type genes, respectively. Short and long-read genome sequencing of E. cloacae ODC-Eclo3 strain revealed the presence of IncQ/rmtG (pUR-EC3.1, 7400-pb), IncHI2A/mcr-9.1/bla_CTX-M-2 [pUR-EC3.2, ST16 (pMLST), 408,436-bp] and IncN2/qnrB19/aacC3/aph(3'')-Ib (pUR-EC3.3) resistance plasmids. Strikingly, the bla_CTX-M-2 gene was carried by a novel Tn1696-like composite transposon designated Tn7337. In summary, we report that imported one-day-old chicks can act as Trojan horses for the hidden spread of WHO critical priority MDR pathogens harboring mcr-9, rmtG and extended-spectrum β-lactamase genes in poultry farms, which is a critical issue within a One Health perspective. Importance section Antimicrobial resistance is considered a significant problem for global health, including within the concept of "One Health", therefore, the food chain is a link that connects human and animal health directly. In this work, we searched for microorganisms resistant to antibiotics considered critical for human health in intestinal microbiota of one-day-old baby chicks imported to Uruguay from Brazil. We described antibiotic-resistant genes to antibiotics named as to watch or reserve for the WHO, such as rmtG or mcr9.1, which confers resistance to all the aminoglycosides and colistin, respectively, among others genes, and their presence in new mobile genetic elements that favor its dissemination. The sustained entry of these microorganisms evades the sanitary measures implemented by the countries and production establishments to reduce the selection of resistant microorganisms. These silently imported resistant microorganisms could explain a considerable part of the antimicrobial resistance problems found in the production stages of the system.

Collapse

Martinez GS, Sarkar S, Kumar A, Pérez‐Rueda E, de Avila e Silva S. Characterization of promoters in archaeal genomes based on DNA structural parameters. Microbiologyopen 2021;10:e1230. [PMID: 34713600 PMCID: PMC8553660 DOI: 10.1002/mbo3.1230] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 11/10/2022] Open

Wang CY, Liu LC, Wu YC, Zhang YX. Identification and Validation of Four Novel Promoters for Gene Engineering with Broad Suitability across Species. J Microbiol Biotechnol 2021;31:1154-1162. [PMID: 34226414 PMCID: PMC9706022 DOI: 10.4014/jmb.2103.03049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/24/2021] [Accepted: 06/27/2021] [Indexed: 12/15/2022]

Abstract

The transcriptional capacities of target genes are strongly influenced by promoters, whereas few studies have focused on the development of robust, high-performance and cross-species promoters for wide application in different bacteria. In this work, four novel promoters (P_k.rtufB, P_k.r1, P_k.r2, and P_k.r3) were predicted from Ketogulonicigenium robustum and their inconsistency in the -10 and -35 region nucleotide sequences indicated they were different promoters. Their activities were evaluated by using green fluorescent protein (gfp) as a reporter in different species of bacteria, including K. vulgare SPU B805, Pseudomonas putida KT2440, Paracoccus denitrificans PD1222, Bacillus licheniformis and Raoultella ornithinolytica, due to their importance in metabolic engineering. Our results showed that the four promoters had different activities, with P_k.r1 showing the strongest activity in almost all of the experimental bacteria. By comparison with the commonly used promoters of E. coli (tufB, lac, lacUV5), K. vulgare (Psdh, Psndh) and P. putida KT2440 (JE111411), the four promoters showed significant differences due to only 12.62% nucleotide similarities, and relatively higher ability in regulating target gene expression. Further validation experiments confirmed their ability in initiating the target minCD cassette because of the shape changes under the promoter regulation. The overexpression of sorbose dehydrogenase and cytochrome c551 by P_k.r1 and P_k.r2 resulted in a 22.75% enhancement of 2-KGA yield, indicating their potential for practical application in metabolic engineering. This study demonstrates an example of applying bioinformatics to find new biological components for gene operation and provides four novel promoters with broad suitability, which enriches the usable range of promoters to realize accurate regulation in different genetic backgrounds.

Collapse

Martinez GS, de Ávila e Silva S, Kumar A, Pérez-Rueda E. DNA structural and physical properties reveal peculiarities in promoter sequences of the bacterium Escherichia coli K-12. SN APPLIED SCIENCES 2021. [DOI: 10.1007/s42452-021-04713-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open

Wilson EH, Groom JD, Sarfatis MC, Ford SM, Lidstrom ME, Beck DAC. A Computational Framework for Identifying Promoter Sequences in Nonmodel Organisms Using RNA-seq Data Sets. ACS Synth Biol 2021;10:1394-1405. [PMID: 33988977 DOI: 10.1021/acssynbio.1c00017] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Mishra A, Dhanda S, Siwach P, Aggarwal S, Jayaram B. A novel method SEProm for prokaryotic promoter prediction based on DNA structure and energetics. Bioinformatics 2020;36:2375-2384. [PMID: 31909789 DOI: 10.1093/bioinformatics/btz941] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Revised: 11/08/2019] [Accepted: 01/02/2020] [Indexed: 11/13/2022] Open

Benchmarking Bacterial Promoter Prediction Tools: Potentialities and Limitations. mSystems 2020;5:5/4/e00439-20. [PMID: 32843538 PMCID: PMC7449607 DOI: 10.1128/msystems.00439-20] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Abstract

The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives.

The promoter region is a key element required for the production of RNA in bacteria. While new high-throughput technology allows massively parallel mapping of promoter elements, we still mainly rely on bioinformatics tools to predict such elements in bacterial genomes. Additionally, despite many different prediction tools having become popular to identify bacterial promoters, no systematic comparison of such tools has been performed. Here, we performed a systematic comparison between several widely used promoter prediction tools (BPROM, bTSSfinder, BacPP, CNNProm, IBBP, Virtual Footprint, iPro70-FMWin, 70ProPred, iPromoter-2L, and MULTiPly) using well-defined sequence data sets and standardized metrics to determine how well those tools performed related to each other. For this, we used data sets of experimentally validated promoters from Escherichia coli and a control data set composed of randomly generated sequences with similar nucleotide distributions. We compared the performance of the tools using metrics such as specificity, sensitivity, accuracy, and Matthews correlation coefficient (MCC). We show that the widely used BPROM presented the worse performance among the compared tools, while four tools (CNNProm, iPro70-FMWin, 70ProPred, and iPromoter-2L) offered high predictive power. Of these tools, iPro70-FMWin exhibited the best results for most of the metrics used. We present here some potentials and limitations of available tools, and we hope that future work can build upon our effort to systematically characterize this useful class of bioinformatics tools.

IMPORTANCE The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives.

Collapse

Liu X, Guo Z, He T, Ren M. Prediction and analysis of prokaryotic promoters based on sequence features. Biosystems 2020;197:104218. [PMID: 32755610 DOI: 10.1016/j.biosystems.2020.104218] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 07/03/2020] [Accepted: 07/21/2020] [Indexed: 10/23/2022]

Ren H, Shi C, Zhao H. Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways. iScience 2020;23:100795. [PMID: 31926431 PMCID: PMC6957853 DOI: 10.1016/j.isci.2019.100795] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 11/24/2019] [Accepted: 12/19/2019] [Indexed: 01/09/2023] Open

Lenzini L, Di Patti F, Livi R, Fondi M, Fani R, Mengoni A. A Method for the Structure-Based, Genome-Wide Analysis of Bacterial Intergenic Sequences Identifies Shared Compositional and Functional Features. Genes (Basel) 2019;10:genes10100834. [PMID: 31652625 PMCID: PMC6826451 DOI: 10.3390/genes10100834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 10/07/2019] [Accepted: 10/16/2019] [Indexed: 11/16/2022] Open

Coelho RV, Dall'Alba G, de Avila E Silva S, Echeverrigaray S, Delamare APL. Toward Algorithms for Automation of Postgenomic Data Analyses: Bacillus subtilis Promoter Prediction with Artificial Neural Network. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2019;24:300-309. [PMID: 31573385 DOI: 10.1089/omi.2019.0041] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Image-based promoter prediction: a promoter prediction method based on evolutionarily generated patterns. Sci Rep 2018;8:17695. [PMID: 30523308 PMCID: PMC6283834 DOI: 10.1038/s41598-018-36308-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Accepted: 11/12/2018] [Indexed: 11/18/2022] Open

Dall'Alba G, Casa PL, Notari DL, Adami AG, Echeverrigaray S, de Avila E Silva S. Analysis of the nucleotide content of Escherichia coli promoter sequences related to the alternative sigma factors. J Mol Recognit 2018;32:e2770. [PMID: 30458580 DOI: 10.1002/jmr.2770] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Revised: 10/23/2018] [Accepted: 10/24/2018] [Indexed: 01/26/2023]

Coelho RV, de Avila E Silva S, Echeverrigaray S, Delamare APL. Bacillus subtilis promoter sequences data set for promoter prediction in Gram-positive bacteria. Data Brief 2018;19:264-270. [PMID: 29892645 PMCID: PMC5993011 DOI: 10.1016/j.dib.2018.05.025] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 04/02/2018] [Accepted: 05/07/2018] [Indexed: 11/28/2022] Open

Chang SC, Lee CY. OpaR and RpoS are positive regulators of a virulence factor PrtA in Vibrio parahaemolyticus. Microbiology (Reading) 2018;164:221-231. [DOI: 10.1099/mic.0.000591] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

Shahmuradov IA, Mohamad Razali R, Bougouffa S, Radovanovic A, Bajic VB. bTSSfinder: a novel tool for the prediction of promoters in cyanobacteria and Escherichia coli. Bioinformatics 2017;33:334-340. [PMID: 27694198 PMCID: PMC5408793 DOI: 10.1093/bioinformatics/btw629] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Accepted: 09/27/2016] [Indexed: 12/01/2022] Open

Kernan T, West AC, Banta S. Characterization of endogenous promoters for control of recombinant gene expression in Acidithiobacillus ferrooxidans. Biotechnol Appl Biochem 2017;64:793-802. [DOI: 10.1002/bab.1546] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 11/16/2016] [Indexed: 11/06/2022]

Kumar A, Manivelan V, Bansal M. Structural features of DNA are conserved in the promoter region of orthologous genes across different strains ofHelicobacter pylori. FEMS Microbiol Lett 2016;363:fnw207. [DOI: 10.1093/femsle/fnw207] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/25/2016] [Indexed: 12/19/2022] Open

Abbas MM, Mohie-Eldin MM, EL-Manzalawy Y. Assessing the effects of data selection and representation on the development of reliable E. coli sigma 70 promoter region predictors. PLoS One 2015;10:e0119721. [PMID: 25803493 PMCID: PMC4372424 DOI: 10.1371/journal.pone.0119721] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Accepted: 01/26/2015] [Indexed: 11/27/2022] Open

Abstract

As the number of sequenced bacterial genomes increases, the need for rapid and reliable tools for the annotation of functional elements (e.g., transcriptional regulatory elements) becomes more desirable. Promoters are the key regulatory elements, which recruit the transcriptional machinery through binding to a variety of regulatory proteins (known as sigma factors). The identification of the promoter regions is very challenging because these regions do not adhere to specific sequence patterns or motifs and are difficult to determine experimentally. Machine learning represents a promising and cost-effective approach for computational identification of prokaryotic promoter regions. However, the quality of the predictors depends on several factors including: i) training data; ii) data representation; iii) classification algorithms; iv) evaluation procedures. In this work, we create several variants of E. coli promoter data sets and utilize them to experimentally examine the effect of these factors on the predictive performance of E. coli σ70 promoter models. Our results suggest that under some combinations of the first three criteria, a prediction model might perform very well on cross-validation experiments while its performance on independent test data is drastically very poor. This emphasizes the importance of evaluating promoter region predictors using independent test data, which corrects for the over-optimistic performance that might be estimated using the cross-validation procedure. Our analysis of the tested models shows that good prediction models often perform well despite how the non-promoter data was obtained. On the other hand, poor prediction models seems to be more sensitive to the choice of non-promoter sequences. Interestingly, the best performing sequence-based classifiers outperform the best performing structure-based classifiers on both cross-validation and independent test performance evaluation experiments. Finally, we propose a meta-predictor method combining two top performing sequence-based and structure-based classifiers and compare its performance with some of the state-of-the-art E. coli σ70 promoter prediction methods.

Collapse