Diament A, Weiner I, Shahar N, Landman S, Feldman Y, Atar S, Avitan M, Schweitzer S, Yacoby I, Tuller T. ChimeraUGEM: unsupervised gene expression modeling in any given organism.
Bioinformatics 2020;
35:3365-3371. [PMID:
30715207 DOI:
10.1093/bioinformatics/btz080]
[Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 01/07/2019] [Accepted: 01/30/2019] [Indexed: 01/06/2023] Open
Abstract
MOTIVATION
Regulation of the amount of protein that is synthesized from genes has proved to be a serious challenge in terms of analysis and prediction, and in terms of engineering and optimization, due to the large diversity in expression machinery across species.
RESULTS
To address this challenge, we developed a methodology and a software tool (ChimeraUGEM) for predicting gene expression as well as adapting the coding sequence of a target gene to any host organism. We demonstrate these methods by predicting protein levels in seven organisms, in seven human tissues, and by increasing in vivo the expression of a synthetic gene up to 26-fold in the single-cell green alga Chlamydomonas reinhardtii. The underlying model is designed to capture sequence patterns and regulatory signals with minimal prior knowledge on the host organism and can be applied to a multitude of species and applications.
AVAILABILITY AND IMPLEMENTATION
Source code (MATLAB, C) and binaries are freely available for download for non-commercial use at http://www.cs.tau.ac.il/~tamirtul/ChimeraUGEM/, and supported on macOS, Linux and Windows.
SUPPLEMENTARY INFORMATION
Supplementary data are available at Bioinformatics online.
Collapse