1
|
Abstract
The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives. The promoter region is a key element required for the production of RNA in bacteria. While new high-throughput technology allows massively parallel mapping of promoter elements, we still mainly rely on bioinformatics tools to predict such elements in bacterial genomes. Additionally, despite many different prediction tools having become popular to identify bacterial promoters, no systematic comparison of such tools has been performed. Here, we performed a systematic comparison between several widely used promoter prediction tools (BPROM, bTSSfinder, BacPP, CNNProm, IBBP, Virtual Footprint, iPro70-FMWin, 70ProPred, iPromoter-2L, and MULTiPly) using well-defined sequence data sets and standardized metrics to determine how well those tools performed related to each other. For this, we used data sets of experimentally validated promoters from Escherichia coli and a control data set composed of randomly generated sequences with similar nucleotide distributions. We compared the performance of the tools using metrics such as specificity, sensitivity, accuracy, and Matthews correlation coefficient (MCC). We show that the widely used BPROM presented the worse performance among the compared tools, while four tools (CNNProm, iPro70-FMWin, 70ProPred, and iPromoter-2L) offered high predictive power. Of these tools, iPro70-FMWin exhibited the best results for most of the metrics used. We present here some potentials and limitations of available tools, and we hope that future work can build upon our effort to systematically characterize this useful class of bioinformatics tools. IMPORTANCE The correct mapping of promoter elements is a crucial step in microbial genomics. Also, when combining new DNA elements into synthetic sequences, predicting the potential generation of new promoter sequences is critical. Over the last years, many bioinformatics tools have been created to allow users to predict promoter elements in a sequence or genome of interest. Here, we assess the predictive power of some of the main prediction tools available using well-defined promoter data sets. Using Escherichia coli as a model organism, we demonstrated that while some tools are biased toward AT-rich sequences, others are very efficient in identifying real promoters with low false-negative rates. We hope the potentials and limitations presented here will help the microbiology community to choose promoter prediction tools among many available alternatives.
Collapse
|
2
|
A novel method for identifying and distinguishing Cryptococcus neoformans and Cryptococcus gattii by surface-enhanced Raman scattering using positively charged silver nanoparticles. Sci Rep 2020; 10:12480. [PMID: 32719360 PMCID: PMC7385644 DOI: 10.1038/s41598-020-68978-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 06/30/2020] [Indexed: 11/08/2022] Open
Abstract
There are approximately 1 million cryptococcal infections per year among HIV+ individuals, resulting in nearly 625,000 deaths. Cryptococcus neoformans and Cryptococcus gattii are the two most common species that cause human cryptococcosis. These two species of Cryptococcus have differences in pathogenicity, diagnosis, and treatment. Cryptococcal infections are usually difficult to identify because of their slow growth in vitro. In addition, the long detection cycle of Cryptococcus in clinical specimens makes the diagnosis of Cryptococcal infections difficult. Here, we used positively charged silver nanoparticles (AgNPs+) as a substrate to distinguish between C. neoformans and C. gattii in clinical specimens directly via surface-enhanced Raman scattering (SERS) and spectral analysis. The AgNPs+ self-assembled on the surface of the fungal cell wall via electrostatic aggregation, leading to enhanced SERS signals that were better than the standard substrate negatively charged silver nanoparticles (AgNPs). The SERS spectra could also be used as a sample database in the multivariate analysis via orthogonal partial least-squares discriminant analysis. This novel SERS detection method can clearly distinguish between the two Cryptococcus species using principal component analysis. The accuracy of the training data and test data was 100% after a tenfold crossover validation.
Collapse
|
3
|
Jervis AJ, Carbonell P, Vinaixa M, Dunstan MS, Hollywood KA, Robinson CJ, Rattray NJW, Yan C, Swainston N, Currin A, Sung R, Toogood H, Taylor S, Faulon JL, Breitling R, Takano E, Scrutton NS. Machine Learning of Designed Translational Control Allows Predictive Pathway Optimization in Escherichia coli. ACS Synth Biol 2019; 8:127-136. [PMID: 30563328 DOI: 10.1021/acssynbio.8b00398] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The field of synthetic biology aims to make the design of biological systems predictable, shrinking the huge design space to practical numbers for testing. When designing microbial cell factories, most optimization efforts have focused on enzyme and strain selection/engineering, pathway regulation, and process development. In silico tools for the predictive design of bacterial ribosome binding sites (RBSs) and RBS libraries now allow translational tuning of biochemical pathways; however, methods for predicting optimal RBS combinations in multigene pathways are desirable. Here we present the implementation of machine learning algorithms to model the RBS sequence-phenotype relationship from representative subsets of large combinatorial RBS libraries allowing the accurate prediction of optimal high-producers. Applied to a recombinant monoterpenoid production pathway in Escherichia coli, our approach was able to boost production titers by over 60% when screening under 3% of a library. To facilitate library screening, a multiwell plate fermentation procedure was developed, allowing increased screening throughput with sufficient resolution to discriminate between high and low producers. High producers from one library did not translate during scale-up, but the reduced screening requirements allowed rapid rescreening at the larger scale. This methodology is potentially compatible with any biochemical pathway and provides a powerful tool toward predictive design of bacterial production chassis.
Collapse
Affiliation(s)
- Adrian J. Jervis
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Pablo Carbonell
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Maria Vinaixa
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Mark S. Dunstan
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Katherine A. Hollywood
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Christopher J. Robinson
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Nicholas J. W. Rattray
- Strathclyde Institute of Pharmacy and Biomedical Sciences, Strathclyde University, 161 Cathedral Street, Glasgow G4 0RE, United Kingdom
| | - Cunyu Yan
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Neil Swainston
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Andrew Currin
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Rehana Sung
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Helen Toogood
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Sandra Taylor
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Jean-Loup Faulon
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
- MICALIS, INRA-AgroParisTech, Domaine de Vilvert, 78352 Jouy en Josas Cedex, France
| | - Rainer Breitling
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Eriko Takano
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| | - Nigel S. Scrutton
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM), Manchester Institute of Biotechnology and School of Chemistry, University of Manchester, Manchester M1 7DN, United Kingdom
| |
Collapse
|
4
|
Li J, Wang C, Kang H, Shao L, Hu L, Xiao R, Wang S, Gu B. Label-free identification carbapenem-resistant Escherichia coli based on surface-enhanced resonance Raman scattering. RSC Adv 2018; 8:4761-4765. [PMID: 35539553 PMCID: PMC9078027 DOI: 10.1039/c7ra13063e] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 01/23/2018] [Indexed: 11/21/2022] Open
Abstract
In this study, a surface-enhanced resonance Raman scattering (SERRS) method has been developed for the accurate detection and identification of carbapenem-resistant and carbapenem-sensitive Escherichia coli. A total of 89 human isolates of Enterobacteriaceae, comprising 41 strains of carbapenem-sensitive E. coli (CSEC) and 48 strains of carbapenem-resistant E. coli (CREC), were tested to assess the feasibility of our proposed SERRS method as a clinical tool, and the results showed almost 100% accuracy.
Collapse
Affiliation(s)
- Jia Li
- Medical Technology Institute of Xuzhou Medical University Xuzhou 221004 PR China
| | - Chongwen Wang
- Beijing Institute of Radiation Medicine Beijing 100850 PR China
- Department of Laboratory Medicine, Affiliated Hospital of Xuzhou Medical University Xuzhou 221004 PR China
| | - Haiquan Kang
- Medical Technology Institute of Xuzhou Medical University Xuzhou 221004 PR China
- Department of Laboratory Medicine, Affiliated Hospital of Xuzhou Medical University Xuzhou 221004 PR China
| | - Liting Shao
- Beijing Institute of Radiation Medicine Beijing 100850 PR China
| | - Lulu Hu
- Medical Technology Institute of Xuzhou Medical University Xuzhou 221004 PR China
| | - Rui Xiao
- Beijing Institute of Radiation Medicine Beijing 100850 PR China
| | - Shengqi Wang
- Medical Technology Institute of Xuzhou Medical University Xuzhou 221004 PR China
- Beijing Institute of Radiation Medicine Beijing 100850 PR China
| | - Bing Gu
- Medical Technology Institute of Xuzhou Medical University Xuzhou 221004 PR China
- Department of Laboratory Medicine, Affiliated Hospital of Xuzhou Medical University Xuzhou 221004 PR China
| |
Collapse
|