1
|
Costello A, Peterson AA, Chen PH, Bagirzadeh R, Lanster DL, Badran AH. Genetic Code Expansion History and Modern Innovations. Chem Rev 2024. [PMID: 39466033 DOI: 10.1021/acs.chemrev.4c00275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
The genetic code is the foundation for all life. With few exceptions, the translation of nucleic acid messages into proteins follows conserved rules, which are defined by codons that specify each of the 20 proteinogenic amino acids. For decades, leading research groups have developed a catalogue of innovative approaches to extend nature's amino acid repertoire to include one or more noncanonical building blocks in a single protein. In this review, we summarize advances in the history of in vitro and in vivo genetic code expansion, and highlight recent innovations that increase the scope of biochemically accessible monomers and codons. We further summarize state-of-the-art knowledge in engineered cellular translation, as well as alterations to regulatory mechanisms that improve overall genetic code expansion. Finally, we distill existing limitations of these technologies into must-have improvements for the next generation of technologies, and speculate on future strategies that may be capable of overcoming current gaps in knowledge.
Collapse
Affiliation(s)
- Alan Costello
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - Alexander A Peterson
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - Pei-Hsin Chen
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
- Doctoral Program in Chemical and Biological Sciences The Scripps Research Institute; La Jolla, California 92037, United States
| | - Rustam Bagirzadeh
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - David L Lanster
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
- Doctoral Program in Chemical and Biological Sciences The Scripps Research Institute; La Jolla, California 92037, United States
| | - Ahmed H Badran
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| |
Collapse
|
2
|
Jiang R, Yuan S, Zhou Y, Wei Y, Li F, Wang M, Chen B, Yu H. Strategies to overcome the challenges of low or no expression of heterologous proteins in Escherichia coli. Biotechnol Adv 2024; 75:108417. [PMID: 39038691 DOI: 10.1016/j.biotechadv.2024.108417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 07/18/2024] [Accepted: 07/19/2024] [Indexed: 07/24/2024]
Abstract
Protein expression is a critical process in diverse biological systems. For Escherichia coli, a widely employed microbial host in industrial catalysis and healthcare, researchers often face significant challenges in constructing recombinant expression systems. To maximize the potential of E. coli expression systems, it is essential to address problems regarding the low or absent production of certain target proteins. This article presents viable solutions to the main factors posing challenges to heterologous protein expression in E. coli, which includes protein toxicity, the intrinsic influence of gene sequences, and mRNA structure. These strategies include specialized approaches for managing toxic protein expression, addressing issues related to mRNA structure and codon bias, advanced codon optimization methodologies that consider multiple factors, and emerging optimization techniques facilitated by big data and machine learning.
Collapse
Affiliation(s)
- Ruizhao Jiang
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Shuting Yuan
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Yilong Zhou
- Tanwei College, Tsinghua University, Beijing 100084, China
| | - Yuwen Wei
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Fulong Li
- Beijing Evolyzer Co.,Ltd., 100176, China
| | | | - Bo Chen
- Beijing Evolyzer Co.,Ltd., 100176, China
| | - Huimin Yu
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China; Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
3
|
Anastassov S, Filo M, Khammash M. Inteins: A Swiss army knife for synthetic biology. Biotechnol Adv 2024; 73:108349. [PMID: 38552727 DOI: 10.1016/j.biotechadv.2024.108349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 03/21/2024] [Accepted: 03/23/2024] [Indexed: 04/13/2024]
Abstract
Inteins are proteins found in nature that execute protein splicing. Among them, split inteins stand out for their versatility and adaptability, presenting creative solutions for addressing intricate challenges in various biological applications. Their exquisite attributes, including compactness, reliability, orthogonality, low toxicity, and irreversibility, make them of interest to various fields including synthetic biology, biotechnology and biomedicine. In this review, we delve into the inherent challenges of using inteins, present approaches for overcoming these challenges, and detail their reliable use for specific cellular tasks. We will discuss the use of conditional inteins in areas like cancer therapy, drug screening, patterning, infection treatment, diagnostics and biocontainment. Additionally, we will underscore the potential of inteins in executing basic logical operations with practical implications. We conclude by showcasing their potential in crafting complex genetic circuits for performing computations and feedback control that achieves robust perfect adaptation.
Collapse
Affiliation(s)
- Stanislav Anastassov
- Department of Biosystems Science and Engineering, ETH Zürich, Basel 4056, Switzerland
| | - Maurice Filo
- Department of Biosystems Science and Engineering, ETH Zürich, Basel 4056, Switzerland
| | - Mustafa Khammash
- Department of Biosystems Science and Engineering, ETH Zürich, Basel 4056, Switzerland.
| |
Collapse
|
4
|
Chakarborty S, Irshad IU, Mahima, Sharma AK. TIR predictor and optimizer: Web-tools for accurate prediction of translation initiation rate and precision gene design in Saccharomyces cerevisiae. Biotechnol J 2024; 19:e2400081. [PMID: 38719586 DOI: 10.1002/biot.202400081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Revised: 04/15/2024] [Accepted: 04/16/2024] [Indexed: 05/14/2024]
Abstract
Translation initiation is the primary determinant of the rate of protein production. The variation in the rate with which this step occurs can cause up to three orders of magnitude differences in cellular protein levels. Several mRNA features, including mRNA stability in proximity to the start codon, coding sequence length, and presence of specific motifs in the mRNA molecule, have been shown to influence the translation initiation rate. These molecular factors acting at different strengths allow precise control of in vivo translation initiation rate and thus the rate of protein synthesis. However, despite the paramount importance of translation initiation rate in protein synthesis, accurate prediction of the absolute values of initiation rate remains a challenge. In fact, as of now, there is no available model for predicting the initiation rate in Saccharomyces cerevisiae. To address this, we train a machine learning model for predicting the in vivo initiation rate in S. cerevisiae transcripts. The model is trained using a diverse set of mRNA transcripts, enabling the comparison of initiation rates across different transcripts. Our model exhibited excellent accuracy in predicting the translation initiation rate and demonstrated its effectiveness with both endogenous and exogenous transcripts. Then, by combining the machine learning model with the Monte-Carlo search algorithm, we have also devised a method to optimize the nucleotide sequence of any gene to achieve a specific target initiation rate. The machine learning model we've developed for predicting translation initiation rates, along with the gene optimization method, are deployed as a web server. Both web servers are accessible for free at the following link: ajeetsharmalab.com/TIRPredictor. Thus, this research advances our fundamental understanding of translation initiation processes, with direct applications in biotechnology.
Collapse
Affiliation(s)
| | | | - Mahima
- Department of Physics, Indian Institute of Technology Jammu, Jammu, India
| | - Ajeet K Sharma
- Department of Physics, Indian Institute of Technology Jammu, Jammu, India
- Department of Biosciences and Bioengineering, Indian Institute of Technology Jammu, Jammu, India
| |
Collapse
|
5
|
Gu X, Qi Y, El-Kebir M. DERNA Enables Pareto Optimal RNA Design. J Comput Biol 2024; 31:179-196. [PMID: 38416637 DOI: 10.1089/cmb.2023.0283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024] Open
Abstract
The design of an RNA sequence v that encodes an input target protein sequence w is a crucial aspect of messenger RNA (mRNA) vaccine development. There are an exponential number of possible RNA sequences for a single target protein due to codon degeneracy. These potential RNA sequences can assume various secondary structure conformations, each with distinct minimum free energy (MFE), impacting thermodynamic stability and mRNA half-life. Furthermore, the presence of species-specific codon usage bias, quantified by the codon adaptation index (CAI), plays a vital role in translation efficiency. While earlier studies focused on optimizing either MFE or CAI, recent research has underscored the advantages of simultaneously optimizing both objectives. However, optimizing one objective comes at the expense of the other. In this work, we present the Pareto Optimal RNA Design problem, aiming to identify the set of Pareto optimal solutions for which no alternative solutions exist that exhibit better MFE and CAI values. Our algorithm DEsign RNA (DERNA) uses the weighted sum method to enumerate the Pareto front by optimizing convex combinations of both objectives. We use dynamic programming to solve each convex combination in O ( | w | 3 ) time and O ( | w | 2 ) space. Compared with a CDSfold, previous approach that only optimizes MFE, we show on a benchmark data set that DERNA obtains solutions with identical MFE but superior CAI. Moreover, we show that DERNA matches the performance in terms of solution quality of LinearDesign, a recent approach that similarly seeks to balance MFE and CAI. We conclude by demonstrating our method's potential for mRNA vaccine design for the SARS-CoV-2 spike protein.
Collapse
Affiliation(s)
- Xinyu Gu
- Department of Computer Science and University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Yuanyuan Qi
- Department of Computer Science and University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Mohammed El-Kebir
- Department of Computer Science and University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Cancer Center at Illinois, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
6
|
Yamchi A, Rahimi M, Javan B, Abdollahi D, Salmanian M, Shahbazi M. Evaluation of the impact of polypeptide-p on diabetic rats upon its cloning, expression, and secretion in Saccharomyces boulardii. Arch Microbiol 2023; 206:37. [PMID: 38142245 DOI: 10.1007/s00203-023-03773-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 11/16/2023] [Accepted: 11/25/2023] [Indexed: 12/25/2023]
Abstract
This study was designed to evaluate the effectiveness of recombinant polypeptide-p derived from Momordica charantia on diabetic rats. In this research, the optimized sequence of polypeptide-p gene fused to a secretion signal tag was cloned into the expression vector and transformed into probiotic Saccharomyces boulardii. The production of recombinant secretion protein was verified by western blotting, HPLC, and mass spectrometry. To assay recombinant yeast bioactivity in the gut, diabetic rats were orally fed wild-type and recombinant S. boulardii, in short SB and rSB, respectively, at two low and high doses as well as glibenclamide as a reference drug. In untreated diabetic and treated diabetic + SB rats (low and high doses), the blood glucose increased from 461, 481, and 455 (mg/dl), respectively, to higher than 600 mg/dl on the 21st day. Whereas glibenclamide and rSB treatments showed a significant reduction in the blood glucose level. The result of this study promised a safe plant-source supplement for diabetes through probiotic orchestration.
Collapse
Affiliation(s)
- Ahad Yamchi
- Department of Biotechnology, Gorgan University of Agricultural Sciences and Natural Resources, Gorgan, Iran.
- Genetic Engineering and Molecular Genetics, Gorgan University of Agricultural Science and Natural Resources, P.O. Box: 4934174515, Gorgan, Iran.
| | - Maryam Rahimi
- Department of Horticulture, University of Zabol, Zabol, Iran
| | - Bita Javan
- Medical Cellular and Molecular Research Center, Golestan University of Medical Sciences, Gorgan, Iran
| | - Dorsa Abdollahi
- Department of Biotechnology, Gorgan University of Agricultural Sciences and Natural Resources, Gorgan, Iran
| | - Mojgan Salmanian
- Department of Animal Science and Poultry Nutrition, Gorgan University of Agricultural Sciences and Natural Resources, Gorgan, Iran
| | - Majid Shahbazi
- Medical Cellular and Molecular Research Center, Golestan University of Medical Sciences, Gorgan, Iran
| |
Collapse
|
7
|
Zhang H, Zhang L, Lin A, Xu C, Li Z, Liu K, Liu B, Ma X, Zhao F, Jiang H, Chen C, Shen H, Li H, Mathews DH, Zhang Y, Huang L. Algorithm for optimized mRNA design improves stability and immunogenicity. Nature 2023; 621:396-403. [PMID: 37130545 PMCID: PMC10499610 DOI: 10.1038/s41586-023-06127-z] [Citation(s) in RCA: 77] [Impact Index Per Article: 77.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 04/25/2023] [Indexed: 05/04/2023]
Abstract
Messenger RNA (mRNA) vaccines are being used to combat the spread of COVID-19 (refs. 1-3), but they still exhibit critical limitations caused by mRNA instability and degradation, which are major obstacles for the storage, distribution and efficacy of the vaccine products4. Increasing secondary structure lengthens mRNA half-life, which, together with optimal codons, improves protein expression5. Therefore, a principled mRNA design algorithm must optimize both structural stability and codon usage. However, owing to synonymous codons, the mRNA design space is prohibitively large-for example, there are around 2.4 × 10632 candidate mRNA sequences for the SARS-CoV-2 spike protein. This poses insurmountable computational challenges. Here we provide a simple and unexpected solution using the classical concept of lattice parsing in computational linguistics, where finding the optimal mRNA sequence is analogous to identifying the most likely sentence among similar-sounding alternatives6. Our algorithm LinearDesign finds an optimal mRNA design for the spike protein in just 11 minutes, and can concurrently optimize stability and codon usage. LinearDesign substantially improves mRNA half-life and protein expression, and profoundly increases antibody titre by up to 128 times in mice compared to the codon-optimization benchmark on mRNA vaccines for COVID-19 and varicella-zoster virus. This result reveals the great potential of principled mRNA design and enables the exploration of previously unreachable but highly stable and efficient designs. Our work is a timely tool for vaccines and other mRNA-based medicines encoding therapeutic proteins such as monoclonal antibodies and anti-cancer drugs7,8.
Collapse
Affiliation(s)
- He Zhang
- Baidu Research USA, Sunnyvale, CA, USA
- School of EECS, Oregon State University, Corvallis, OR, USA
| | - Liang Zhang
- Baidu Research USA, Sunnyvale, CA, USA
- School of EECS, Oregon State University, Corvallis, OR, USA
- Vaccine Center, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| | - Ang Lin
- StemiRNA Therapeutics, Shanghai, China
- Vaccine Center, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| | | | - Ziyu Li
- Baidu Research USA, Sunnyvale, CA, USA
| | - Kaibo Liu
- Baidu Research USA, Sunnyvale, CA, USA
- School of EECS, Oregon State University, Corvallis, OR, USA
| | - Boxiang Liu
- Baidu Research USA, Sunnyvale, CA, USA
- Department of Pharmacy, National University of Singapore, Singapore, Singapore
| | | | | | | | | | | | | | - David H Mathews
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, NY, USA.
- Center for RNA Biology, University of Rochester Medical Center, Rochester, NY, USA.
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY, USA.
- Coderna.ai, Inc., Sunnyvale, CA, USA.
| | - Yujian Zhang
- StemiRNA Therapeutics, Shanghai, China.
- , Gaithersburg, MD, USA.
| | - Liang Huang
- Baidu Research USA, Sunnyvale, CA, USA.
- School of EECS, Oregon State University, Corvallis, OR, USA.
- Coderna.ai, Inc., Sunnyvale, CA, USA.
| |
Collapse
|
8
|
Shin HC, Bochkov YA, Kim K, Gern JE, Jarjour NN, Esnault S. A motif in the 5'untranslated region of messenger RNAs regulates protein synthesis in a S6 kinase-dependent manner. Adv Biol Regul 2023; 89:100975. [PMID: 37302177 PMCID: PMC10735251 DOI: 10.1016/j.jbior.2023.100975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/05/2023] [Indexed: 06/13/2023]
Abstract
The 5' untranslated regions (UTRs) in messenger RNAs (mRNAs) play an important role in the regulation of protein synthesis. We had previously identified a group of mRNAs that includes human semaphorin 7A (SEMA7A) whose translation is upregulated by the Erk/p90S6K pathway in human eosinophils, with a potential negative impact in asthma and airway inflammation. In the current study, we aimed to find a common 5'UTR regulatory cis-element, and determine its impact on protein synthesis. We identified a common and conserved 5'UTR motif GGCTG-[(C/G)T(C/G)]n-GCC that was present in this group of mRNAs. Mutations of the first two GG bases in this motif in SEMA7A 5'UTR led to a complete loss of S6K activity dependence for maximal translation. In conclusion, the newly identified 5'UTR motif present in SEMA7A has a critical role in regulating S6K-dependent protein synthesis.
Collapse
Affiliation(s)
- Hyun-Chul Shin
- Department of Chemistry Education, Korea National University of Education, Cheongju-si, Chungcheonbuk-do, Republic of Korea
| | - Yury A Bochkov
- Department of Pediatrics, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Kangsan Kim
- Department of Chemistry Education, Korea National University of Education, Cheongju-si, Chungcheonbuk-do, Republic of Korea
| | - James E Gern
- Department of Pediatrics, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA; Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Nizar N Jarjour
- Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Stephane Esnault
- Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
9
|
Bykova A, Saura A, Glazko GV, Roche-Lima A, Yurchenko V, Rogozin IB. The 29-nucleotide deletion in SARS-CoV: truncated versions of ORF8 are under purifying selection. BMC Genomics 2023; 24:387. [PMID: 37430204 DOI: 10.1186/s12864-023-09482-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 06/23/2023] [Indexed: 07/12/2023] Open
Abstract
BACKGROUND Accessory proteins have diverse roles in coronavirus pathobiology. One of them in SARS-CoV (the causative agent of the severe acute respiratory syndrome outbreak in 2002-2003) is encoded by the open reading frame 8 (ORF8). Among the most dramatic genomic changes observed in SARS-CoV isolated from patients during the peak of the pandemic in 2003 was the acquisition of a characteristic 29-nucleotide deletion in ORF8. This deletion cause splitting of ORF8 into two smaller ORFs, namely ORF8a and ORF8b. Functional consequences of this event are not entirely clear. RESULTS Here, we performed evolutionary analyses of ORF8a and ORF8b genes and documented that in both cases the frequency of synonymous mutations was greater than that of nonsynonymous ones. These results suggest that ORF8a and ORF8b are under purifying selection, thus proteins translated from these ORFs are likely to be functionally important. Comparisons with several other SARS-CoV genes revealed that another accessory gene, ORF7a, has a similar ratio of nonsynonymous to synonymous mutations suggesting that ORF8a, ORF8b, and ORF7a are under similar selection pressure. CONCLUSIONS Our results for SARS-CoV echo the known excess of deletions in the ORF7a-ORF7b-ORF8 complex of accessory genes in SARS-CoV-2. A high frequency of deletions in this gene complex might reflect recurrent searches in "functional space" of various accessory protein combinations that may eventually produce more advantageous configurations of accessory proteins similar to the fixed deletion in the SARS-CoV ORF8 gene.
Collapse
Affiliation(s)
- Anastassia Bykova
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, 710 00, Czech Republic
| | - Andreu Saura
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, 710 00, Czech Republic
| | - Galina V Glazko
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
| | - Abiel Roche-Lima
- Center for Collaborative Research in Health Disparities-RCMI Program, Medical Sciences Campus, University of Puerto Rico, San Juan, PR, 00936, USA
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, 710 00, Czech Republic.
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
10
|
Sarabandi S, Pourtaghi H. Whole genome sequence analysis of CPV-2 isolates from 1998 to 2020. Virol J 2023; 20:138. [PMID: 37400901 DOI: 10.1186/s12985-023-02102-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 06/14/2023] [Indexed: 07/05/2023] Open
Abstract
Canine parvovirus-2 (CPV-2) is a virus with worldwide spread causing canine gastroenteritis. New strains of this virus have unique characteristics and are resistant to some vaccine strains. Therefore, understanding the root causes of resistance has proven to be of increasing concern to many scientists. This study collected 126 whole genome sequences of CPV-2 subtypes with specific collection dates from the NCBI data bank. The whole genome sequences of CPV-2 collected from different countries were analyzed to detect the new substitutions and update these mutations. The result indicated 12, 7, and 10 mutations in NS1, VP1, and VP2, in that respective order. Moreover, the A5G and Q370R mutations of VP2 are the most common changes in the recent isolates of the CPV-2C subtype, and the new N93K residue of VP2 is speculated to be the cause of vaccine failure. To summarize, the observed mutations, which are increasing over time, causes several changes in viral characteristic. A comprehensive understanding of these mutations can lead us to control potential future epidemics associated with this virus more efficiently.
Collapse
Affiliation(s)
- Sajed Sarabandi
- Department of Pathobiology, Islamic Azad University, Karaj Branch, Karaj, Iran
| | - Hadi Pourtaghi
- Department of Microbiology, Islamic Azad University, Karaj Branch, Karaj, Iran.
| |
Collapse
|
11
|
Himmelstrand K, Brandström Durling M, Karlsson M, Stenlid J, Olson Å. Multiple rearrangements and low inter- and intra-species mitogenome sequence variation in the Heterobasidion annosum s.l. species complex. Front Microbiol 2023; 14:1159811. [PMID: 37275157 PMCID: PMC10234125 DOI: 10.3389/fmicb.2023.1159811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 03/16/2023] [Indexed: 06/07/2023] Open
Abstract
Introduction Mitochondria are essential organelles in the eukaryotic cells and responsible for the energy production but are also involved in many other functions including virulence of some fungal species. Although the evolution of fungal mitogenomes have been studied at some taxonomic levels there are still many things to be learned from studies of closely related species. Methods In this study, we have analyzed 60 mitogenomes in the five species of the Heterobasidion annosum sensu lato complex that all are necrotrophic pathogens on conifers. Results and Discussion Compared to other fungal genera the genomic and genetic variation between and within species in the complex was low except for multiple rearrangements. Several translocations of large blocks with core genes have occurred between the five species and rearrangements were frequent in intergenic areas. Mitogenome lengths ranged between 108 878 to 116 176 bp, mostly as a result of intron variation. There was a high degree of homology of introns, homing endonuclease genes, and intergenic ORFs among the five Heterobasidion species. Three intergenic ORFs with unknown function (uORF6, uORF8 and uORF9) were found in all five species and was located in conserved synteny blocks. A 13 bp long GC-containing self-complementary palindrome was discovered in many places in the five species that were optional in presence/absence. The within species variation is very low, among 48 H. parviporum mitogenomes, there was only one single intron exchange, and SNP frequency was 0.28% and indel frequency 0.043%. The overall low variation in the Heterobasidion annosum sensu lato complex suggests a slow evolution of the mitogenome.
Collapse
Affiliation(s)
| | | | | | | | - Åke Olson
- Uppsala BioCenter, Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
12
|
McFadden A, Martin K, Foster G, Vierra M, Lundquist EW, Everts RE, Martin E, Volz E, McLoone K, Brooks SA, Lafayette C. 5'UTR Variant in KIT Associated with White Spotting in Horses. J Equine Vet Sci 2023:104563. [PMID: 37182614 DOI: 10.1016/j.jevs.2023.104563] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 05/10/2023] [Accepted: 05/10/2023] [Indexed: 05/16/2023]
Abstract
Mutations in KIT, a gene that influences melanoblast migration and pigmentation, often result in mammalian white spotting. As of February 2023, over 30 KIT variants associated with white spotting were documented in Equus caballus (horse). Here we report an association of increased white spotting on the skin and coat with a variant in the 5'UTR of KIT (rs1149701677: g.79,618,649A>C). Horses possessing at least one alternate allele demonstrate phenotypic characteristics similar to other KIT mutations: clear borders around unpigmented regions on the body, face, and limbs. Using a quantitative measure of depigmentation, we observed an average white score of 10.70 among individuals with rs1149701677, while the average score of the control, homozygous reference sample was 2.23 (p=1.892e-11, n=109, t-test). The rs1149701677 site has a cross-species conservation score of 3.4, one of the highest scores across the KIT 5'UTR, implying regulatory importance for this site. Ensembl also predicted a "moderately impactful" functional effect for the rs1149701677 variant. We propose that this single nucleotide variant likely alters the regulation of KIT, which in turn may disrupt melanoblast migration causing an increase in white spotting on the coat. Alternatively, the rs1149701677 variant may be in linkage with another nearby variant with an as-yet-undiscovered functional impact. We propose to term this new allele "Holiday White" or W35 based on conventional nomenclature.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Erin Volz
- Etalon Inc, Menlo Park, CA 94025, USA
| | | | - Samantha A Brooks
- Department of Animal Sciences, UF Genetics Institute University of Florida, Gainesville, FL 32611-0910, USA
| | | |
Collapse
|
13
|
Duan H, Zhang S, Zarai Y, Öllinger R, Wu Y, Sun L, Hu C, He Y, Tian G, Rad R, Kong X, Cheng Y, Tuller T, Wolf DA. eIF3 mRNA selectivity profiling reveals eIF3k as a cancer-relevant regulator of ribosome content. EMBO J 2023:e112362. [PMID: 37155573 DOI: 10.15252/embj.2022112362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 03/04/2023] [Accepted: 04/20/2023] [Indexed: 05/10/2023] Open
Abstract
eIF3, whose subunits are frequently overexpressed in cancer, regulates mRNA translation from initiation to termination, but mRNA-selective functions of individual subunits remain poorly defined. Using multiomic profiling upon acute depletion of eIF3 subunits, we observed that while eIF3a, b, e, and f markedly differed in their impact on eIF3 holo-complex formation and translation, they were each required for cancer cell proliferation and tumor growth. Remarkably, eIF3k showed the opposite pattern with depletion promoting global translation, cell proliferation, tumor growth, and stress resistance through repressing the synthesis of ribosomal proteins, especially RPS15A. Whereas ectopic expression of RPS15A mimicked the anabolic effects of eIF3k depletion, disruption of eIF3 binding to the 5'-UTR of RSP15A mRNA negated them. eIF3k and eIF3l are selectively downregulated in response to endoplasmic reticulum and oxidative stress. Supported by mathematical modeling, our data uncover eIF3k-l as a mRNA-specific module which, through controlling RPS15A translation, serves as a rheostat of ribosome content, possibly to secure spare translational capacity that can be mobilized during stress.
Collapse
Affiliation(s)
- Haoran Duan
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Siqiong Zhang
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Yoram Zarai
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Rupert Öllinger
- Institute of Molecular Oncology and Functional Genomics and Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technical University of Munich, Munich, Germany
| | - Yanmeng Wu
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Li Sun
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Cheng Hu
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Yaohui He
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Guiyou Tian
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Roland Rad
- Institute of Molecular Oncology and Functional Genomics and Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technical University of Munich, Munich, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Internal Medicine II, Klinikum rechts der Isar, Technical University Munich, Munich, Germany
| | - Xiangquan Kong
- Department of Radiation Oncology, Xiamen Humanity Hospital, Fujian Medical University, Xiamen, China
| | - Yabin Cheng
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv, Israel
| | - Dieter A Wolf
- State Key Laboratory of Stress Biology and Fujian Provincial Key Laboratory of Innovative Drug Target Research, School of Pharmaceutical Sciences, Xiamen University, Xiamen, China
- Department of Internal Medicine II, Klinikum rechts der Isar, Technical University Munich, Munich, Germany
| |
Collapse
|
14
|
Nieuwkoop T, Terlouw BR, Stevens KG, Scheltema R, de Ridder D, van der Oost J, Claassens N. Revealing determinants of translation efficiency via whole-gene codon randomization and machine learning. Nucleic Acids Res 2023; 51:2363-2376. [PMID: 36718935 PMCID: PMC10018363 DOI: 10.1093/nar/gkad035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 12/14/2022] [Accepted: 01/16/2023] [Indexed: 02/01/2023] Open
Abstract
It has been known for decades that codon usage contributes to translation efficiency and hence to protein production levels. However, its role in protein synthesis is still only partly understood. This lack of understanding hampers the design of synthetic genes for efficient protein production. In this study, we generated a synonymous codon-randomized library of the complete coding sequence of red fluorescent protein. Protein production levels and the full coding sequences were determined for 1459 gene variants in Escherichia coli. Using different machine learning approaches, these data were used to reveal correlations between codon usage and protein production. Interestingly, protein production levels can be relatively accurately predicted (Pearson correlation of 0.762) by a Random Forest model that only relies on the sequence information of the first eight codons. In this region, close to the translation initiation site, mRNA secondary structure rather than Codon Adaptation Index (CAI) is the key determinant of protein production. This study clearly demonstrates the key role of codons at the start of the coding sequence. Furthermore, these results imply that commonly used CAI-based codon optimization of the full coding sequence is not a very effective strategy. One should rather focus on optimizing protein production via reducing mRNA secondary structure formation with the first few codons.
Collapse
Affiliation(s)
| | | | - Katherine G Stevens
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, University of Utrecht, Padualaan 8, 3584 CH Utrecht, The Netherlands
- Netherlands Proteomics Center, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | - Richard A Scheltema
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, University of Utrecht, Padualaan 8, 3584 CH Utrecht, The Netherlands
- Netherlands Proteomics Center, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University, Wageningen, Droevendaalsesteeg 1, 6708 PB, The Netherlands
| | - John van der Oost
- Correspondence may also be addressed to John van der Oost. Tel: +31 317483740;
| | | |
Collapse
|
15
|
Fages-Lartaud M, Mueller Y, Elie F, Courtade G, Hohmann-Marriott MF. Standard Intein Gene Expression Ramps (SIGER) for Protein-Independent Expression Control. ACS Synth Biol 2023; 12:1058-1071. [PMID: 36920366 PMCID: PMC10127266 DOI: 10.1021/acssynbio.2c00530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
Coordination of multigene expression is one of the key challenges of metabolic engineering for the development of cell factories. Constraints on translation initiation and early ribosome kinetics of mRNA are imposed by features of the 5'UTR in combination with the start of the gene, referred to as the "gene ramp", such as rare codons and mRNA secondary structures. These features strongly influence the translation yield and protein quality by regulating the ribosome distribution on mRNA strands. The utilization of genetic expression sequences, such as promoters and 5'UTRs in combination with different target genes, leads to a wide variety of gene ramp compositions with irregular translation rates, leading to unpredictable levels of protein yield and quality. Here, we present the Standard Intein Gene Expression Ramp (SIGER) system for controlling protein expression. The SIGER system makes use of inteins to decouple the translation initiation features from the gene of a target protein. We generated sequence-specific gene expression sequences for two inteins (DnaB and DnaX) that display defined levels of protein expression. Additionally, we used inteins that possess the ability to release the C-terminal fusion protein in vivo to avoid the impairment of protein functionality by the fused intein. Overall, our results show that SIGER systems are unique tools to mitigate the undesirable effects of gene ramp variation and to control the relative ratios of enzymes involved in molecular pathways. As a proof of concept of the potential of the system, we also used a SIGER system to express two difficult-to-produce proteins, GumM and CBM73.
Collapse
Affiliation(s)
- Maxime Fages-Lartaud
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Yasmin Mueller
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Florence Elie
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Gaston Courtade
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Martin Frank Hohmann-Marriott
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway.,United Scientists CORE (Limited), Dunedin 9016, Aotearoa, New Zealand
| |
Collapse
|
16
|
A dynamical stochastic model of yeast translation across the cell cycle. Heliyon 2023; 9:e13101. [PMID: 36793957 PMCID: PMC9922973 DOI: 10.1016/j.heliyon.2023.e13101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 01/04/2023] [Accepted: 01/16/2023] [Indexed: 01/27/2023] Open
Abstract
Translation is a central step in gene expression, however its quantitative and time-resolved regulation is poorly understood. We developed a discrete, stochastic model for protein translation in S. cerevisiae in a whole-transcriptome, single-cell context. A "base case" scenario representing an average cell highlights translation initiation rates as the main co-translational regulatory parameters. Codon usage bias emerges as a secondary regulatory mechanism through ribosome stalling. Demand for anticodons with low abundancy is shown to cause above-average ribosome dwelling times. Codon usage bias correlates strongly both with protein synthesis rates and elongation rates. Applying the model to a time-resolved transcriptome estimated by combining data from FISH and RNA-Seq experiments, it could be shown that increased total transcript abundance during the cell cycle decreases translation efficiency at single transcript level. Translation efficiency grouped by gene function shows highest values for ribosomal and glycolytic genes. Ribosomal proteins peak in S phase while glycolytic proteins rank highest in later cell cycle phases.
Collapse
|
17
|
Panda A, Tuller T. Determinants of associations between codon and amino acid usage patterns of microbial communities and the environment inferred based on a cross-biome metagenomic analysis. NPJ Biofilms Microbiomes 2023; 9:5. [PMID: 36693851 PMCID: PMC9873608 DOI: 10.1038/s41522-023-00372-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 01/11/2023] [Indexed: 01/25/2023] Open
Abstract
Codon and amino acid usage were associated with almost every aspect of microbial life. However, how the environment may impact the codon and amino acid choice of microbial communities at the habitat level is not clearly understood. Therefore, in this study, we analyzed codon and amino acid usage patterns of a large number of environmental samples collected from diverse ecological niches. Our results suggested that samples derived from similar environmental niches, in general, show overall similar codon and amino acid distribution as compared to samples from other habitats. To substantiate the relative impact of the environment, we considered several factors, such as their similarity in GC content, or in functional or taxonomic abundance. Our analysis demonstrated that none of these factors can fully explain the trends that we observed at the codon or amino acid level implying a direct environmental influence on them. Further, our analysis demonstrated different levels of selection on codon bias in different microbial communities with the highest bias in host-associated environments such as the digestive system or oral samples and the lowest level of selection in soil and water samples. Considering a large number of metagenomic samples here we showed that microorganisms collected from similar environmental backgrounds exhibit similar patterns of codon and amino acid usage irrespective of the location or time from where the samples were collected. Thus our study suggested a direct impact of the environment on codon and amino usage of microorganisms that cannot be explained considering the influence of other factors.
Collapse
Affiliation(s)
- Arup Panda
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel.
| |
Collapse
|
18
|
Zabolotskii AI, Kozlovskiy SV, Katrukha AG. The Influence of the Nucleotide Composition of Genes and Gene Regulatory Elements on the Efficiency of Protein Expression in Escherichia coli. BIOCHEMISTRY (MOSCOW) 2023; 88:S176-S191. [PMID: 37069120 DOI: 10.1134/s0006297923140109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Recombinant proteins expressed in Escherichia coli are widely used in biochemical research and industrial processes. At the same time, achieving higher protein expression levels and correct protein folding still remains the key problem, since optimization of nutrient media, growth conditions, and methods for induction of protein synthesis do not always lead to the desired result. Often, low protein expression is determined by the sequences of the expressed genes and their regulatory regions. The genetic code is degenerated; 18 out of 20 amino acids are encoded by more than one codon. Choosing between synonymous codons in the coding sequence can significantly affect the level of protein expression and protein folding due to the influence of the gene nucleotide composition on the probability of formation of secondary mRNA structures that affect the ribosome binding at the translation initiation phase, as well as the ribosome movement along the mRNA during elongation, which, in turn, influences the mRNA degradation and the folding of the nascent protein. The nucleotide composition of the mRNA untranslated regions, in particular the promoter and Shine-Dalgarno sequences, also affects the efficiency of mRNA transcription, translation, and degradation. In this review, we describe the genetic principles that determine the efficiency of protein production in Escherichia coli.
Collapse
Affiliation(s)
- Artur I Zabolotskii
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia.
| | | | - Alexey G Katrukha
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
19
|
Mahadevan S. Silence of the mutations. J Biosci 2022. [DOI: 10.1007/s12038-022-00320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
20
|
Fages‐Lartaud M, Hundvin K, Hohmann‐Marriott MF. Mechanisms governing codon usage bias and the implications for protein expression in the chloroplast of Chlamydomonas reinhardtii. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 112:919-945. [PMID: 36071273 PMCID: PMC9828097 DOI: 10.1111/tpj.15970] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 05/30/2023]
Abstract
Chloroplasts possess a considerably reduced genome that is decoded via an almost minimal set of tRNAs. These features make an excellent platform for gaining insights into fundamental mechanisms that govern protein expression. Here, we present a comprehensive and revised perspective of the mechanisms that drive codon selection in the chloroplast of Chlamydomonas reinhardtii and the functional consequences for protein expression. In order to extract this information, we applied several codon usage descriptors to genes with different expression levels. We show that highly expressed genes strongly favor translationally optimal codons, while genes with lower functional importance are rather affected by directional mutational bias. We demonstrate that codon optimality can be deduced from codon-anticodon pairing affinity and, for a small number of amino acids (leucine, arginine, serine, and isoleucine), tRNA concentrations. Finally, we review, analyze, and expand on the impact of codon usage on protein yield, secondary structures of mRNA, translation initiation and termination, and amino acid composition of proteins, as well as cotranslational protein folding. The comprehensive analysis of codon choice provides crucial insights into heterologous gene expression in the chloroplast of C. reinhardtii, which may also be applicable to other chloroplast-containing organisms and bacteria.
Collapse
Affiliation(s)
- Maxime Fages‐Lartaud
- Department of BiotechnologyNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| | - Kristoffer Hundvin
- Department of BiotechnologyNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| | | |
Collapse
|
21
|
Ray S, Dandpat SS, Chatterjee S, Walter NG. Precise tuning of bacterial translation initiation by non-equilibrium 5'-UTR unfolding observed in single mRNAs. Nucleic Acids Res 2022; 50:8818-8833. [PMID: 35892287 PMCID: PMC9410914 DOI: 10.1093/nar/gkac635] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 06/15/2022] [Accepted: 07/14/2022] [Indexed: 11/21/2022] Open
Abstract
Noncoding, structured 5′-untranslated regions (5′-UTRs) of bacterial messenger RNAs (mRNAs) can control translation efficiency by forming structures that either recruit or repel the ribosome. Here we exploit a 5′-UTR embedded preQ1-sensing, pseudoknotted translational riboswitch to probe how binding of a small ligand controls recruitment of the bacterial ribosome to the partially overlapping Shine-Dalgarno (SD) sequence. Combining single-molecule fluorescence microscopy with mutational analyses, we find that the stability of 30S ribosomal subunit binding is inversely correlated with the free energy needed to unfold the 5′-UTR during mRNA accommodation into the mRNA binding cleft. Ligand binding to the riboswitch stabilizes the structure to both antagonize 30S recruitment and accelerate 30S dissociation. Proximity of the 5′-UTR and stability of the SD:anti-SD interaction both play important roles in modulating the initial 30S-mRNA interaction. Finally, depletion of small ribosomal subunit protein S1, known to help resolve structured 5′-UTRs, further increases the energetic penalty for mRNA accommodation. The resulting model of rapid standby site exploration followed by gated non-equilibrium unfolding of the 5′-UTR during accommodation provides a mechanistic understanding of how translation efficiency is governed by riboswitches and other dynamic structure motifs embedded upstream of the translation initiation site of bacterial mRNAs.
Collapse
Affiliation(s)
- Sujay Ray
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Shiba S Dandpat
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Surajit Chatterjee
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Nils G Walter
- Single-Molecule Analysis Group, Department of Chemistry and Center for RNA Biomedicine, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
22
|
Duan Y, Zhang X, Zhai W, Zhang J, Zhang X, Xu G, Li H, Deng Z, Shi J, Xu Z. Deciphering the Rules of Ribosome Binding Site Differentiation in Context Dependence. ACS Synth Biol 2022; 11:2726-2740. [PMID: 35877551 DOI: 10.1021/acssynbio.2c00139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The ribosome binding site (RBS) is a crucial element regulating translation. However, the activity of RBS is poorly predictable, because it is strongly affected by the local possible secondary structure, that is, context dependence. By the Flowseq technique, over 20 000 RBS variants were sorted and sequenced, and the translation of multiple genes under the same RBS was quantitatively characterized to evaluate the context dependence of each RBS variant in E. coli. Two regions, (-7 to -2) and (-17 to -12), of RBS were predicted with a higher possibility to pair with each other to slow down the translation initiation. Associations between phenotypes and the intrinsic factors suspected to affect translation efficiency and context dependence of the RBS, including nucleotide bias at each position, free energy, and conservation, were disentangled. The results showed that translation efficiency was influenced more significantly by conservation of the SD region (-16 to -8), while an AC-rich spacer region (-7 to -1) was associated with low context dependence. We confirmed these characteristics using a series of synthesized RBSs. The average correlation between multiple reporters was significantly higher for RBSs with an AC-rich spacer (0.714) compared with a GU-rich spacer (0.286). Overall, we proposed general design criteria to improve programmability and minimize context dependence of RBS. The characteristics unraveled here can be adapted to other bacteria for fine-tuning target-gene expression.
Collapse
Affiliation(s)
- Yanting Duan
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Xiaojuan Zhang
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Weiji Zhai
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Jinpeng Zhang
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Xiaomei Zhang
- School of Life Science and Health Engineering, Jiangnan University, Wuxi 214122, China.,Jiangsu Engineering Research Center for Bioactive Products Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China
| | - Guoqiang Xu
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| | - Hui Li
- School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
| | - Zhaohong Deng
- School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
| | - Jinsong Shi
- School of Life Science and Health Engineering, Jiangnan University, Wuxi 214122, China.,Jiangsu Engineering Research Center for Bioactive Products Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China
| | - Zhenghong Xu
- Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.,National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
23
|
Design of typical genes for heterologous gene expression. Sci Rep 2022; 12:9625. [PMID: 35688911 PMCID: PMC9187722 DOI: 10.1038/s41598-022-13089-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Accepted: 05/20/2022] [Indexed: 11/09/2022] Open
Abstract
Heterologous protein expression is an important method for analysing cellular functions of proteins, in genetic circuit engineering and in overexpressing proteins for biopharmaceutical applications and structural biology research. The degeneracy of the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, plays an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyse the influence of a profiled codon usage adaptation approach on protein expression levels in the eukaryotic model organism Saccharomyces cerevisiae. We selected green fluorescent protein (GFP) and human α-synuclein (αSyn) as representatives for stable and intrinsically disordered proteins and representing a benchmark and a challenging test case. A new approach was implemented to design typical genes resembling the codon usage of any subset of endogenous genes. Using this approach, synthetic genes for GFP and αSyn were generated, heterologously expressed and evaluated in yeast. We demonstrate that GFP is expressed at high levels, and that the toxic αSyn can be adapted to endogenous, low-level expression. The new software is publicly available as a web-application for performing host-specific protein adaptations to a set of the most commonly used model organisms ( https://odysseus.motorprotein.de ).
Collapse
|
24
|
Miller JB, Meurs TE, Hodgman MW, Song B, Miller KN, Ebbert MTW, Kauwe JSK, Ridge PG. The Ramp Atlas: facilitating tissue and cell-specific ramp sequence analyses through an intuitive web interface. NAR Genom Bioinform 2022; 4:lqac039. [PMID: 35664804 PMCID: PMC9155233 DOI: 10.1093/nargab/lqac039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 03/01/2022] [Accepted: 05/24/2022] [Indexed: 11/14/2022] Open
Abstract
Ramp sequences occur when the average translational efficiency of codons near the 5′ end of highly expressed genes is significantly lower than the rest of the gene sequence, which counterintuitively increases translational efficiency by decreasing downstream ribosomal collisions. Here, we show that the relative codon adaptiveness within different tissues changes the existence of a ramp sequence without altering the underlying genetic code. We present the first comprehensive analysis of tissue and cell type-specific ramp sequences and report 3108 genes with ramp sequences that change between tissues and cell types, which corresponds with increased gene expression within those tissues and cells. The Ramp Atlas (https://ramps.byu.edu/) allows researchers to query precomputed ramp sequences in 18 388 genes across 62 tissues and 66 cell types and calculate tissue-specific ramp sequences from user-uploaded FASTA files through an intuitive web interface. We used The Ramp Atlas to identify seven SARS-CoV-2 genes and seven human SARS-CoV-2 entry factor genes with tissue-specific ramp sequences that may help explain viral proliferation within those tissues. We anticipate that The Ramp Atlas will facilitate personalized and creative tissue-specific ramp sequence analyses for both human and viral genes that will increase our ability to utilize this often-overlooked regulatory region.
Collapse
Affiliation(s)
- Justin B Miller
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - Taylor E Meurs
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Matthew W Hodgman
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - Benjamin Song
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Kyle N Miller
- Department of Computer Science, Utah Valley University, Orem, UT 84058, USA
| | - Mark T W Ebbert
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Perry G Ridge
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|
25
|
Kim DJ, Kim J, Lee DH, Lee J, Woo HM. DeepTESR: A Deep Learning Framework to Predict the Degree of Translational Elongation Short Ramp for Gene Expression Control. ACS Synth Biol 2022; 11:1719-1726. [PMID: 35502843 DOI: 10.1021/acssynbio.2c00202] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Controlling translational elongation is essential for efficient protein synthesis. Ribosome profiling has revealed that the speed of ribosome movement is correlated with translational efficiency in the translational elongation ramp. In this work, we present a new deep learning model, called DeepTESR, to predict the degree of translational elongation short ramp (TESR) from mRNA sequence. The proposed deep learning model exhibited superior performance in predicting the TESR scores for 226 981 TESR sequences, resulting in the mean absolute error (MAE) of 0.285 and a coefficient of determination R2 of 0.627, superior to the conventional machine learning models (e.g., MAE of 0.335 and R2 of 0.571 for LightGBM). We experimentally validated that heterologous fluorescence expression of proteins with randomly selected TESR was moderately correlated with the predictions. Furthermore, a genome-wide analysis of TESR prediction in the 4305 coding sequences of Escherichia coli showed conserved TESRs over the clusters of orthologous groups. In this sense, DeepTESR can be used to predict the degree of TESR for gene expression control and to decipher the mechanism of translational control with ribosome profiling. DeepTESR is available at https://github.com/fmblab/DeepTESR.
Collapse
|
26
|
Wang X, Zhao B, Du J, Xu Y, Zhu X, Zhou J, Rao S, Du G, Chen J, Liu S. Active secretion of a thermostable transglutaminase variant in Escherichia coli. Microb Cell Fact 2022; 21:74. [PMID: 35488338 PMCID: PMC9052465 DOI: 10.1186/s12934-022-01801-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 04/19/2022] [Indexed: 12/02/2022] Open
Abstract
Background Streptomyces mobaraenesis transglutaminase (smTG) is widely used to generate protein crosslinking or attachment of small molecules. However, the low thermostability is a main obstacle for smTG application. In addition, it is still hard to achieve the secretory expression of active smTG in E. coli, which benefits the enzyme evolution. In this study, a combined strategy was conducted to improve the thermostability and secretory expression of active smTG in E. coli. Results First, the thermostable S. mobaraenesis transglutaminase variant S2P-S23V-Y24N-S199A-K294L (TGm1) was intracellularly expressed in pro-enzyme form in E. coli. Fusing the pro-region of Streptomyces hygroscopicus transglutaminase (proH) and TrxA achieved a 9.78 U/mL of intracellular smTG activity, 1.37-fold higher than the TGm1 fused with its native pro-region. After in vitro activation by dispase, the TGm1 with proH yielded FRAPD-TGm1, exhibiting 0.95 ℃ and 94.25% increases in melting temperature and half-life at 60 ℃ compared to FRAP-TGm1 derived from the expression using its native pro-region, respectively. Second, the TGm1 with proH was co-expressed with transglutaminase activating protease and chaperones (DnaK, DnaJ, and GrpE) in E. coli, achieving 9.51 U/mL of intracellular FRAPD-TGm1 without in vitro activation. Third, the pelB signal peptide was used to mediate the secretory expression of active TGm in E. coli, yielding 0.54 U/mL of the extracellular FRAPD-TGm1. A script was developed to shuffle the codon of pelB and calculate the corresponding mRNA folding energy. A 1.8-fold increase in the extracellular expression of FRAPD-TGm1 was achieved by the Top-9 pelB sequence derived from the coding sequences with the lowest mRNA folding energy. Last, deleting the gene of Braun’s lipoprotein further increased the extracellular yield of FRAPD-TGm1 by 31.2%, reached 1.99 U/mL. Conclusions The stabilized FRAPD-smTG here could benefit the enzyme application in food and non-food sectors, while the E. coli system that enables secretory expression of active smTG will facilitate the directed evolution for further improved catalytic properties. The combined strategy (N-terminal modification, co-expression with chaperones, mRNA folding energy optimization of signal peptide, and lipoprotein deletion) may also improve the secretory expression of other functional proteins in E. coli. Supplementary Information The online version contains supplementary material available at 10.1186/s12934-022-01801-9.
Collapse
Affiliation(s)
- Xinglong Wang
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Beichen Zhao
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Jianhui Du
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Yameng Xu
- Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Xuewen Zhu
- Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Jingwen Zhou
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Shengqi Rao
- College of Food Science and Engineering, Yangzhou University, Yangzhou, 214122, Jiangsu, China
| | - Guocheng Du
- Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Jian Chen
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
| | - Song Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China. .,Science Center for Future Foods, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.
| |
Collapse
|
27
|
Leppek K, Byeon GW, Kladwang W, Wayment-Steele HK, Kerr CH, Xu AF, Kim DS, Topkar VV, Choe C, Rothschild D, Tiu GC, Wellington-Oguri R, Fujii K, Sharma E, Watkins AM, Nicol JJ, Romano J, Tunguz B, Diaz F, Cai H, Guo P, Wu J, Meng F, Shi S, Participants E, Dormitzer PR, Solórzano A, Barna M, Das R. Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics. Nat Commun 2022; 13:1536. [PMID: 35318324 PMCID: PMC8940940 DOI: 10.1038/s41467-022-28776-w] [Citation(s) in RCA: 114] [Impact Index Per Article: 57.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 02/07/2022] [Indexed: 02/07/2023] Open
Abstract
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured "superfolder" mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
Collapse
Affiliation(s)
- Kathrin Leppek
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Gun Woo Byeon
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | | | - Craig H Kerr
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Adele F Xu
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Do Soon Kim
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - Ved V Topkar
- Program in Biophysics, Stanford University, Stanford, CA, 94305, USA
| | - Christian Choe
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Daphna Rothschild
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Gerald C Tiu
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | | | - Kotaro Fujii
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Eesha Sharma
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - John J Nicol
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
| | - Jonathan Romano
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
- Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, New York, 14260, USA
| | - Bojan Tunguz
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
- NVIDIA Corporation, 2788 San Tomas Expy, Santa Clara, CA, 95051, USA
| | - Fernando Diaz
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Hui Cai
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Pengbo Guo
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Jiewei Wu
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Fanyu Meng
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Shuai Shi
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
| | - Eterna Participants
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA
| | - Philip R Dormitzer
- Pfizer Vaccine Research and Development, Pearl River, NY, USA
- GlaxoSmithKline, 1000 Winter St., Waltham, MA, 02453, USA
| | | | - Maria Barna
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA.
- Program in Biophysics, Stanford University, Stanford, CA, 94305, USA.
- Eterna Massive Open Laboratory, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
28
|
Ding Z, Guan F, Xu G, Wang Y, Yan Y, Zhang W, Wu N, Yao B, Huang H, Tuller T, Tian J. MPEPE, a predictive approach to improve protein expression in E. coli based on deep learning. Comput Struct Biotechnol J 2022; 20:1142-1153. [PMID: 35317239 PMCID: PMC8913310 DOI: 10.1016/j.csbj.2022.02.030] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 02/27/2022] [Accepted: 02/28/2022] [Indexed: 12/20/2022] Open
Abstract
The expression of proteins in Escherichia coli is often essential for their characterization, modification, and subsequent application. Gene sequence is the major factor contributing expression. In this study, we used the expression data from 6438 heterologous proteins under the same expression condition in E. coli to construct a deep learning classifier for screening high- and low-expression proteins. In conjunction with conserved residue analysis to minimize functional disruption, a mutation predictor for enhanced protein expression (MPEPE) was proposed to identify mutations conducive to protein expression. MPEPE identified mutation sites in laccase 13B22 and the glucose dehydrogenase FAD-AtGDH, that significantly increased both expression levels and activity of these proteins. Additionally, a significant correlation of 0.46 between the predicted high level expression propensity with the constructed models and the protein abundance of endogenous genes in E. coli was also been detected. Therefore, the study provides foundational insights into the relationship between specific amino acid usage, codon usage, and protein expression, and is essential for research and industrial applications.
Collapse
Affiliation(s)
- Zundan Ding
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Feifei Guan
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Guoshun Xu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Yuchen Wang
- College of Life Science, Northwest Normal University, Lanzhou 730070, China
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yaru Yan
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Wei Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Ningfeng Wu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Bin Yao
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Huoqing Huang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Tamir Tuller
- Department of Biomedical Engineering, the Engineering Faculty, Tel Aviv University, Tel-Aviv, Israel
| | - Jian Tian
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
29
|
Neumann T, Tuller T. Modeling the ribosomal small subunit dynamic in Saccharomyces cerevisiae based on TCP-seq data. Nucleic Acids Res 2022; 50:1297-1316. [PMID: 35100399 PMCID: PMC8860609 DOI: 10.1093/nar/gkac021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 12/31/2021] [Accepted: 01/07/2022] [Indexed: 11/13/2022] Open
Abstract
Translation Complex Profile Sequencing (TCP-seq), a protocol that was developed and implemented on Saccharomyces cerevisiae, provides the footprints of the small subunit (SSU) of the ribosome (with additional factors) across the entire transcriptome of the analyzed organism. In this study, based on the TCP-seq data, we developed for the first-time a predictive model of the SSU density and analyzed the effect of transcript features on the dynamics of the SSU scan in the 5′UTR. Among others, our model is based on novel tools for detecting complex statistical relations tailored to TCP-seq. We quantitatively estimated the effect of several important features, including the context of the upstream AUG, the upstream ORF length and the mRNA folding strength. Specifically, we suggest that around 50% of the variance related to the read counts (RC) distribution near a start codon can be attributed to the AUG context score. We provide the first large scale direct quantitative evidence that shows that indeed AUG context affects the small sub-unit movement. In addition, we suggest that strong folding may cause the detachment of the SSU from the mRNA. We also identified a number of novel sequence motifs that can affect the SSU scan; some of these motifs affect transcription factors and RNA binding proteins. The results presented in this study provide a better understanding of the biophysical aspects related to the SSU scan along the 5′UTR and of translation initiation in S. cerevisiae, a fundamental step toward a comprehensive modeling of initiation.
Collapse
Affiliation(s)
- Tamar Neumann
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv 6997801, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
30
|
Abstract
In the past 20 years, the mRNA vaccine technology has evolved from the first proof of concept to the first licensed vaccine against emerging pandemics such as SARS-CoV-2. Two mRNA vaccines targeting SARS-CoV-2 have received emergency use authorization by US FDA, conditional marketing authorization by EMA, as well as multiple additional national regulatory authorities. The simple composition of an mRNA encoding the antigen formulated in a lipid nanoparticle enables a fast adaptation to new emerging pathogens. This can speed up vaccine development in pandemics from antigen and sequence selection to clinical trial to only a few months. mRNA vaccines are well tolerated and efficacious in animal models for multiple pathogens and will further contribute to the development of vaccines for other unaddressed diseases. Here, we give an overview of the mRNA vaccine design and factors for further optimization of this new promising technology and discuss current knowledge on the mode of action of mRNA vaccines interacting with the innate and adaptive immune system.
Collapse
|
31
|
Vinokour S, Tuller T. Determinants of efficient modulation of ribosomal traffic jams. Comput Struct Biotechnol J 2021; 19:6064-6079. [PMID: 34849209 PMCID: PMC8605386 DOI: 10.1016/j.csbj.2021.10.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 10/17/2021] [Accepted: 10/20/2021] [Indexed: 11/28/2022] Open
Abstract
mRNA translation is the process which consumes most of the cellular energy. Thus, this process is under strong evolutionary selection for its optimization and rational optimization or reduction of the translation efficiency can impact the cell growth rate. Algorithms for modulating cell growth rate can have various applications in biotechnology, medicine, and agriculture. In this study, we demonstrate that the analysis of these algorithms can also be used for understanding translation. We specifically describe and analyze various generic algorithms, based on comprehensive computational models and whole cell simulations of translation, for introducing silent mutations that can either reduce or increase ribosomal traffic jams along the mRNA. As a result, more or less resources are available, for the cell, promoting improved or reduced cells growth-rate, respectively. We then explore the cost of these algorithms' performance, in terms of their computational time, the number of mutations they introduce, the modified genomic region, the effect on local translation rates, and the properties of the modified genes. Among others, we show that mRNA levels of a gene are much stronger predictors for the effect of its engineering on the ribosomal pool than the ribosomal density of the gene. We also demonstrate that the mutations at the ends of the coding regions have a stronger effect on the ribosomal pool. Furthermore, we report two optimization algorithms that exhibit a tread-off between the number of mutations they introduce and their executing time. The reported results here are fundamental both for understanding the biophysics and evolution of translation, as well as for developing efficient approaches for its engineering.
Collapse
Affiliation(s)
- Sophie Vinokour
- Department of Biomedical Engineering, Engineering Faculty, Tel Aviv University, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Engineering Faculty, Tel Aviv University, Israel
- The Sagol School of Neuroscience, Tel Aviv University, Tel-Aviv 69978, Israel
- Corresponding author at: Department of Biomedical Engineering, Engineering Faculty, Tel Aviv University, Israel.
| |
Collapse
|
32
|
Tietze L, Lale R. Importance of the 5' regulatory region to bacterial synthetic biology applications. Microb Biotechnol 2021; 14:2291-2315. [PMID: 34171170 PMCID: PMC8601185 DOI: 10.1111/1751-7915.13868] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 06/03/2021] [Accepted: 06/04/2021] [Indexed: 01/02/2023] Open
Abstract
The field of synthetic biology is evolving at a fast pace. It is advancing beyond single-gene alterations in single hosts to the logical design of complex circuits and the development of integrated synthetic genomes. Recent breakthroughs in deep learning, which is increasingly used in de novo assembly of DNA components with predictable effects, are also aiding the discipline. Despite advances in computing, the field is still reliant on the availability of pre-characterized DNA parts, whether natural or synthetic, to regulate gene expression in bacteria and make valuable compounds. In this review, we discuss the different bacterial synthetic biology methodologies employed in the creation of 5' regulatory regions - promoters, untranslated regions and 5'-end of coding sequences. We summarize methodologies and discuss their significance for each of the functional DNA components, and highlight the key advances made in bacterial engineering by concentrating on their flaws and strengths. We end the review by outlining the issues that the discipline may face in the near future.
Collapse
Affiliation(s)
- Lisa Tietze
- PhotoSynLabDepartment of BiotechnologyFaculty of Natural SciencesNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| | - Rahmi Lale
- PhotoSynLabDepartment of BiotechnologyFaculty of Natural SciencesNorwegian University of Science and TechnologyTrondheimN‐7491Norway
| |
Collapse
|
33
|
Bhandari BK, Lim CS, Remus DM, Chen A, van Dolleweerd C, Gardner PP. Analysis of 11,430 recombinant protein production experiments reveals that protein yield is tunable by synonymous codon changes of translation initiation sites. PLoS Comput Biol 2021; 17:e1009461. [PMID: 34610008 PMCID: PMC8519471 DOI: 10.1371/journal.pcbi.1009461] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 10/15/2021] [Accepted: 09/19/2021] [Indexed: 12/16/2022] Open
Abstract
Recombinant protein production is a key process in generating proteins of interest in the pharmaceutical industry and biomedical research. However, about 50% of recombinant proteins fail to be expressed in a variety of host cells. Here we show that the accessibility of translation initiation sites modelled using the mRNA base-unpairing across the Boltzmann's ensemble significantly outperforms alternative features. This approach accurately predicts the successes or failures of expression experiments, which utilised Escherichia coli cells to express 11,430 recombinant proteins from over 189 diverse species. On this basis, we develop TIsigner that uses simulated annealing to modify up to the first nine codons of mRNAs with synonymous substitutions. We show that accessibility captures the key propensity beyond the target region (initiation sites in this case), as a modest number of synonymous changes is sufficient to tune the recombinant protein expression levels. We build a stochastic simulation model and show that higher accessibility leads to higher protein production and slower cell growth, supporting the idea of protein cost, where cell growth is constrained by protein circuits during overexpression.
Collapse
Affiliation(s)
- Bikash K. Bhandari
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Chun Shen Lim
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Daniela M. Remus
- Callaghan Innovation Protein Science and Engineering, University of Canterbury, Christchurch, New Zealand
| | - Augustine Chen
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Craig van Dolleweerd
- Biomolecular Interaction Center, University of Canterbury, Christchurch, New Zealand
| | - Paul P. Gardner
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
- Biomolecular Interaction Center, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
34
|
Bahiri Elitzur S, Cohen-Kupiec R, Yacobi D, Fine L, Apt B, Diament A, Tuller T. Prokaryotic rRNA-mRNA interactions are involved in all translation steps and shape bacterial transcripts. RNA Biol 2021; 18:684-698. [PMID: 34586043 DOI: 10.1080/15476286.2021.1978767] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Abstract
The well-established Shine-Dalgarno model suggests that translation initiation in bacteria is regulated via base-pairing between ribosomal RNA (rRNA) and mRNA. We used novel computational analyses and modelling of 823 bacterial genomes coupled with experiments to demonstrate that rRNA-mRNA interactions are diverse and regulate all translation steps from pre-initiation to termination. Previous research has reported the significant influence of rRNA-mRNA interactions, mainly in the initiation phase of translation. The results reported in this paper suggest that, in addition to the rRNA-mRNA interactions near the start codon that trigger initiation in bacteria, rRNA-mRNA interactions affect all sub-stages of the translation process (pre-initiation, initiation, elongation, termination). As these interactions dictate translation efficiency, they serve as an evolutionary driving force for shaping transcripts in bacteria while considering trade-offs between the effects of different interactions across different transcript regions on translation efficacy and efficiency. We observed selection for strong interactions in regions where such interactions are likely to enhance initiation, regulate early elongation, and ensure translation termination fidelity. We discovered selection against strong interactions and for intermediate interactions in coding regions and presented evidence that these patterns maximize elongation efficiency while also enhancing initiation. These finding are relevant to all biomedical disciplines due to the centrality of the translation process and the effect of rRNA-mRNA interactions on transcript evolution.
Collapse
Affiliation(s)
| | | | - Dana Yacobi
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Larissa Fine
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Boaz Apt
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Alon Diament
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel.,The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
35
|
Dunkelmann DL, Oehm SB, Beattie AT, Chin JW. A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design. Nat Chem 2021; 13:1110-1117. [PMID: 34426682 DOI: 10.1038/s41557-021-00764-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/30/2021] [Indexed: 12/15/2022]
Abstract
Orthogonal (O) ribosome-mediated translation of O-mRNAs enables the incorporation of up to three distinct non-canonical amino acids (ncAAs) into proteins in Escherichia coli (E. coli). However, the general and efficient incorporation of multiple distinct ncAAs by O-ribosomes requires scalable strategies for both creating efficiently and specifically translated O-mRNAs, and the compact expression of multiple O-aminoacyl-tRNA synthetase (O-aaRS)/O-tRNA pairs. We automate the discovery of O-mRNAs that lead to up to 40 times more protein, and are up to 50-fold more orthogonal, than previous O-mRNAs; protein yields from our O-mRNAs match or exceed those from wild-type mRNAs. These advances enable a 33-fold increase in yield for incorporating three distinct ncAAs. We automate the creation of operons for O-tRNA genes, and develop operons for O-aaRS genes. Combining our advances creates a 68-codon, 24-amino-acid genetic code to efficiently incorporate four distinct ncAAs into a single protein in response to four distinct quadruplet codons.
Collapse
Affiliation(s)
| | - Sebastian B Oehm
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Adam T Beattie
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Jason W Chin
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK.
| |
Collapse
|
36
|
Bhandari BK, Lim CS, Gardner PP. TISIGNER.com: web services for improving recombinant protein production. Nucleic Acids Res 2021; 49:W654-W661. [PMID: 33744969 PMCID: PMC8265118 DOI: 10.1093/nar/gkab175] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 02/17/2021] [Accepted: 03/03/2021] [Indexed: 12/25/2022] Open
Abstract
Experiments that are planned using accurate prediction algorithms will mitigate failures in recombinant protein production. We have developed TISIGNER (https://tisigner.com) with the aim of addressing technical challenges to recombinant protein production. We offer three web services, TIsigner (Translation Initiation coding region designer), SoDoPE (Soluble Domain for Protein Expression) and Razor, which are specialised in synonymous optimisation of recombinant protein expression, solubility and signal peptide analysis, respectively. Importantly, TIsigner, SoDoPE and Razor are linked, which allows users to switch between the tools when optimising genes of interest.
Collapse
Affiliation(s)
- Bikash K Bhandari
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin 9054, New Zealand
| | - Chun Shen Lim
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin 9054, New Zealand
| | - Paul P Gardner
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin 9054, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch 8140, New Zealand
| |
Collapse
|
37
|
Kirby J, Geiselman GM, Yaegashi J, Kim J, Zhuang X, Tran-Gyamfi MB, Prahl JP, Sundstrom ER, Gao Y, Munoz N, Burnum-Johnson KE, Benites VT, Baidoo EEK, Fuhrmann A, Seibel K, Webb-Robertson BJM, Zucker J, Nicora CD, Tanjore D, Magnuson JK, Skerker JM, Gladden JM. Further engineering of R. toruloides for the production of terpenes from lignocellulosic biomass. BIOTECHNOLOGY FOR BIOFUELS 2021; 14:101. [PMID: 33883010 PMCID: PMC8058980 DOI: 10.1186/s13068-021-01950-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 04/07/2021] [Indexed: 05/05/2023]
Abstract
BACKGROUND Mitigation of climate change requires that new routes for the production of fuels and chemicals be as oil-independent as possible. The microbial conversion of lignocellulosic feedstocks into terpene-based biofuels and bioproducts represents one such route. This work builds upon previous demonstrations that the single-celled carotenogenic basidiomycete, Rhodosporidium toruloides, is a promising host for the production of terpenes from lignocellulosic hydrolysates. RESULTS This study focuses on the optimization of production of the monoterpene 1,8-cineole and the sesquiterpene α-bisabolene in R. toruloides. The α-bisabolene titer attained in R. toruloides was found to be proportional to the copy number of the bisabolene synthase (BIS) expression cassette, which in turn influenced the expression level of several native mevalonate pathway genes. The addition of more copies of BIS under a stronger promoter resulted in production of α-bisabolene at 2.2 g/L from lignocellulosic hydrolysate in a 2-L fermenter. Production of 1,8-cineole was found to be limited by availability of the precursor geranylgeranyl pyrophosphate (GPP) and expression of an appropriate GPP synthase increased the monoterpene titer fourfold to 143 mg/L at bench scale. Targeted mevalonate pathway metabolite analysis suggested that 3-hydroxy-3-methyl-glutaryl-coenzyme A reductase (HMGR), mevalonate kinase (MK) and phosphomevalonate kinase (PMK) may be pathway bottlenecks are were therefore selected as targets for overexpression. Expression of HMGR, MK, and PMK orthologs and growth in an optimized lignocellulosic hydrolysate medium increased the 1,8-cineole titer an additional tenfold to 1.4 g/L. Expression of the same mevalonate pathway genes did not have as large an impact on α-bisabolene production, although the final titer was higher at 2.6 g/L. Furthermore, mevalonate pathway intermediates accumulated in the mevalonate-engineered strains, suggesting room for further improvement. CONCLUSIONS This work brings R. toruloides closer to being able to make industrially relevant quantities of terpene from lignocellulosic biomass.
Collapse
Affiliation(s)
- James Kirby
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Department of Biomass Science and Conversion Technology, Sandia National Laboratories, Livermore, CA, 94550, USA
| | - Gina M Geiselman
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Department of Biomass Science and Conversion Technology, Sandia National Laboratories, Livermore, CA, 94550, USA
| | - Junko Yaegashi
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
- Chemical and Biological Processing Group, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Joonhoon Kim
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Chemical and Biological Processing Group, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Xun Zhuang
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Department of Biomass Science and Conversion Technology, Sandia National Laboratories, Livermore, CA, 94550, USA
| | - Mary Bao Tran-Gyamfi
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Department of Biomass Science and Conversion Technology, Sandia National Laboratories, Livermore, CA, 94550, USA
| | - Jan-Philip Prahl
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Advanced Biofuels and Bioproducts Process Development Unit, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
| | - Eric R Sundstrom
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Advanced Biofuels and Bioproducts Process Development Unit, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
| | - Yuqian Gao
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Nathalie Munoz
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- The Environmental Molecular Sciences Laboratory, Richland, WA, 99354, USA
| | - Kristin E Burnum-Johnson
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- The Environmental Molecular Sciences Laboratory, Richland, WA, 99354, USA
| | - Veronica T Benites
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Edward E K Baidoo
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Anna Fuhrmann
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
| | - Katharina Seibel
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
| | - Bobbie-Jo M Webb-Robertson
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Jeremy Zucker
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Carrie D Nicora
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Deepti Tanjore
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Advanced Biofuels and Bioproducts Process Development Unit, Lawrence Berkeley National Laboratory, Emeryville, CA, 94608, USA
| | - Jon K Magnuson
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA
- Chemical and Biological Processing Group, Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Jeffrey M Skerker
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- QB3-Berkeley, University of California, Berkeley, CA, 94704, USA
| | - John M Gladden
- Department of Energy, Agile BioFoundry, Emeryville, CA, 94608, USA.
- Department of Biomass Science and Conversion Technology, Sandia National Laboratories, Livermore, CA, 94550, USA.
| |
Collapse
|
38
|
Wang X, Chen J, Zhang J, Zhou Y, Zhang Y, Wang F, Li X. Engineering Escherichia coli for production of geraniol by systematic synthetic biology approaches and laboratory-evolved fusion tags. Metab Eng 2021; 66:60-67. [PMID: 33865982 DOI: 10.1016/j.ymben.2021.04.008] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 03/23/2021] [Accepted: 04/11/2021] [Indexed: 12/16/2022]
Abstract
Geraniol is a valuable monoterpene extensively used in the fragrance, food, and cosmetic industries. Increasing environmental concerns and supply gaps have motivated efforts to advance the microbial production of geraniol from renewable feedstocks. In this study, we first constructed a platform geraniol Escherichia coli strain by bioprospecting the key enzymes geranyl diphosphate synthase (GPPS) and geraniol synthase (GES) and selection of a host cell background. This strategy led to a 46.4-fold increase in geraniol titer to 964.3 mg/L. We propose that the expression level of eukaryotic GES can be further optimized through fusion tag evolution engineering. To this end, we manipulated GES to maximize flux towards the targeted product geraniol from precursor geranyl diphosphate (GPP) via the utilization of fusion tags. Additionally, we developed a high-throughput screening system to monitor fusion tag variants. This common plug-and-play toolbox proved to be a robust approach for systematic modulation of protein expression and can be used to tune biosynthetic metabolic pathways. Finally, by combining a modified E1* fusion tag, we achieved 2124.1 mg/L of geraniol in shake flask cultures, which reached 27.2% of the maximum theoretical yield and was the highest titer ever reported. We propose that this strategy has set a good reference for enhancing a broader range of terpenoid production in microbial cell factories, which might open new possibilities for the bio-production of other valuable chemicals.
Collapse
Affiliation(s)
- Xun Wang
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Jiaming Chen
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Jia Zhang
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Yujunjie Zhou
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Yu Zhang
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Fei Wang
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China
| | - Xun Li
- Jiangsu Provincial Key Lab for the Chemistry and Utilization of Agro-forest Biomass, Nanjing Forestry University, Nanjing, 210037, China; Jiangsu Key Laboratory of Biomass-based Green Fuels and Chemicals, Nanjing Forestry University, Nanjing, 210037, China; College of Chemical Engineering, Nanjing Forestry University, Nanjing, 210037, China.
| |
Collapse
|
39
|
Leppek K, Byeon GW, Kladwang W, Wayment-Steele HK, Kerr CH, Xu AF, Kim DS, Topkar VV, Choe C, Rothschild D, Tiu GC, Wellington-Oguri R, Fujii K, Sharma E, Watkins AM, Nicol JJ, Romano J, Tunguz B, Participants E, Barna M, Das R. Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.03.29.437587. [PMID: 33821271 PMCID: PMC8020971 DOI: 10.1101/2021.03.29.437587] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that "superfolder" mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
Collapse
Affiliation(s)
- Kathrin Leppek
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Gun Woo Byeon
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Craig H Kerr
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Adele F Xu
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Do Soon Kim
- Department of Biochemistry, Stanford University, California 94305, USA
| | - Ved V Topkar
- Program in Biophysics, Stanford University, Stanford, California 94305, USA
| | - Christian Choe
- Department of Bioengineering, Stanford University, Stanford, California 94305, USA
| | - Daphna Rothschild
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Gerald C Tiu
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | | | - Kotaro Fujii
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Eesha Sharma
- Department of Biochemistry, Stanford University, California 94305, USA
| | - Andrew M Watkins
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Jonathan Romano
- Eterna Massive Open Laboratory
- Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, New York, 14260, USA
| | - Bojan Tunguz
- Department of Biochemistry, Stanford University, California 94305, USA
| | | | - Maria Barna
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, California 94305, USA
| |
Collapse
|
40
|
Carmi G, Gorohovski A, Mukherjee S, Frenkel-Morgenstern M. Non-optimal codon usage preferences of coronaviruses determine their promiscuity for infecting multiple hosts. FEBS J 2021; 288:5201-5223. [PMID: 33756061 DOI: 10.1111/febs.15835] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 02/09/2021] [Accepted: 03/22/2021] [Indexed: 12/11/2022]
Abstract
Circulating animal coronaviruses occasionally infect humans. The SARS-CoV-2 is responsible for the current worldwide outbreak of COVID-19 that has resulted in 2 112 844 deaths as of late January 2021. We compared genetic code preferences in 496 viruses, including 34 coronaviruses and 242 corresponding hosts, to uncover patterns that distinguish single- and 'promiscuous' multiple-host-infecting viruses. Based on a codon usage preference score, promiscuous viruses were shown to significantly employ nonoptimal codons, namely codons that involve 'wobble' binding to anticodons, as compared to single-host viruses. The codon adaptation index (CAI) and the effective number of codons (ENC) were calculated for all viruses and hosts. Promiscuous viruses were less adapted hosts vs single-host viruses (P-value = 4.392e-11). All coronaviruses exploit nonoptimal codons to infect multiple hosts. We found that nonoptimal codon preferences at the beginning of viral coding sequences enhance the translational efficiency of viral proteins within the host. Finally, coronaviruses lack endogenous RNA degradation motifs to a significant degree, thereby increasing viral mRNA burden and infection load. To conclude, we found that promiscuously infecting coronaviruses prefer nonoptimal codon usage to remove degradation motifs from their RNAs and to dramatically increase their viral RNA production rates.
Collapse
Affiliation(s)
- Gon Carmi
- Cancer Genomics and BioComputing of Complex Diseases Laboratory, The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel
| | - Alessandro Gorohovski
- Cancer Genomics and BioComputing of Complex Diseases Laboratory, The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel
| | - Sumit Mukherjee
- Cancer Genomics and BioComputing of Complex Diseases Laboratory, The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel
| | - Milana Frenkel-Morgenstern
- Cancer Genomics and BioComputing of Complex Diseases Laboratory, The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel.,The Data Science Institute, Bar-Ilan University, Ramat Gan, Israel.,The Dangoor Center for Personalized Medicine, Bar-Ilan University, Ramat Gan, Israel
| |
Collapse
|
41
|
Gao NL, He Z, Zhu Q, Jiang P, Hu S, Chen WH. Selection for Cheaper Amino Acids Drives Nucleotide Usage at the Start of Translation in Eukaryotic Genes. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:949-957. [PMID: 33741525 PMCID: PMC9403032 DOI: 10.1016/j.gpb.2021.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Revised: 05/30/2019] [Accepted: 08/18/2019] [Indexed: 12/04/2022]
Abstract
Coding regions have complex interactions among multiple selective forces, which are manifested as biases in nucleotide composition. Previous studies have revealed a decreasing GC gradient from the 5′-end to 3′-end of coding regions in various organisms. We confirmed that this gradient is universal in eukaryotic genes, but the decrease only starts from the ∼ 25th codon. This trend is mostly found in nonsynonymous (ns) sites at which the GC gradient is universal across the eukaryotic genome. Increased GC contents at ns sites result in cheaper amino acids, indicating a universal selection for energy efficiency toward the N-termini of encoded proteins. Within a genome, the decreasing GC gradient is intensified from lowly to highly expressed genes (more and more protein products), further supporting this hypothesis. This reveals a conserved selective constraint for cheaper amino acids at the translation start that drives the increased GC contents at ns sites. Elevated GC contents can facilitate transcription but result in a more stable local secondary structure around the start codon and subsequently impede translation initiation. Conversely, the GC gradients at four-fold and two-fold synonymous sites vary across species. They could decrease or increase, suggesting different constraints acting at the GC contents of different codon sites in different species. This study reveals that the overall GC contents at the translation start are consequences of complex interactions among several major biological processes that shape the nucleotide sequences, especially efficient energy usage.
Collapse
Affiliation(s)
- Na L Gao
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; Institute for Computer Science and Cluster of Excellence on Plant Sciences, Heinrich Heine University, Duesseldorf 40225, Germany
| | - Zilong He
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China; State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Interdisciplinary Innovation Institute of Medicine and Engineering, Beihang University, Beijing 100191, China
| | - Qianhui Zhu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China; State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Puzi Jiang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Songnian Hu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China; State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
| |
Collapse
|
42
|
Variability in mRNA translation: a random matrix theory approach. Sci Rep 2021; 11:5300. [PMID: 33674667 PMCID: PMC7970873 DOI: 10.1038/s41598-021-84738-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 02/19/2021] [Indexed: 01/31/2023] Open
Abstract
The rate of mRNA translation depends on the initiation, elongation, and termination rates of ribosomes along the mRNA. These rates depend on many "local" factors like the abundance of free ribosomes and tRNA molecules in the vicinity of the mRNA molecule. All these factors are stochastic and their experimental measurements are also noisy. An important question is how protein production in the cell is affected by this considerable variability. We develop a new theoretical framework for addressing this question by modeling the rates as identically and independently distributed random variables and using tools from random matrix theory to analyze the steady-state production rate. The analysis reveals a principle of universality: the average protein production rate depends only on the of the set of possible values that the random variable may attain. This explains how total protein production can be stabilized despite the overwhelming stochasticticity underlying cellular processes.
Collapse
|
43
|
Hia F, Takeuchi O. The effects of codon bias and optimality on mRNA and protein regulation. Cell Mol Life Sci 2021; 78:1909-1928. [PMID: 33128106 PMCID: PMC11072601 DOI: 10.1007/s00018-020-03685-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 10/05/2020] [Accepted: 10/12/2020] [Indexed: 12/25/2022]
Abstract
The central dogma of molecular biology entails that genetic information is transferred from nucleic acid to proteins. Notwithstanding retro-transcribing genetic elements, DNA is transcribed to RNA which in turn is translated into proteins. Recent advancements have shown that each stage is regulated to control protein abundances for a variety of essential physiological processes. In this regard, mRNA regulation is essential in fine-tuning or calibrating protein abundances. In this review, we would like to discuss one of several mRNA-intrinsic features of mRNA regulation that has been gaining traction of recent-codon bias and optimality. Specifically, we address the effects of codon bias with regard to codon optimality in several biological processes centred on translation, such as mRNA stability and protein folding among others. Finally, we examine how different organisms or cell types, through this system, are able to coordinate physiological pathways to respond to a variety of stress or growth conditions.
Collapse
Affiliation(s)
- Fabian Hia
- Department of Medical Chemistry, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Osamu Takeuchi
- Department of Medical Chemistry, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
| |
Collapse
|
44
|
Xu K, Tong Y, Li Y, Tao J, Li J, Zhou J, Liu S. Rational Design of the N-Terminal Coding Sequence for Regulating Enzyme Expression in Bacillus subtilis. ACS Synth Biol 2021; 10:265-276. [PMID: 33464830 DOI: 10.1021/acssynbio.0c00309] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Synonymous mutation of the N-terminal coding sequence (NCS) has been used to regulate gene expression. We here developed a statistical model to predict the effect of the NCSs on protein expression in Bacillus subtilis WB600. First, a synonymous mutation was performed within the first 10 residues of a superfolder green fluorescent protein to generate a library of 172 NCS synonymous mutants with different expression levels. A prediction model was then developed, which adopted G/C frequency at the third position of each codon and minimum free energy of mRNA as the independent variables, using multiple regression analysis between the 11 sequence parameters of the NCS and their fluorescence intensities. By designing the NCS of the 10 signal peptides de novo according to the model, the extracellular yield of B. subtilis pullulanase fused to each signal peptide was up-regulated by up to 515% or down-regulated by at most 79%. This work provided a candidate tool for fine-tuning gene expression or enzyme production in B. subtilis.
Collapse
Affiliation(s)
- Kuidong Xu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, Wuxi 214122, China
| | - Yi Tong
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd., Changchun 130033, China
| | - Yi Li
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd., Changchun 130033, China
| | - Jin Tao
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd., Changchun 130033, China
| | - Jianghua Li
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jingwen Zhou
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, Wuxi 214122, China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, Wuxi 214122, China
| | - Song Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
45
|
do Couto Bordignon P, Pechmann S. Inferring translational heterogeneity from Saccharomyces cerevisiae ribosome profiling. FEBS J 2021; 288:4541-4559. [PMID: 33539640 DOI: 10.1111/febs.15748] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 01/27/2021] [Accepted: 02/02/2021] [Indexed: 11/30/2022]
Abstract
Translation of mRNAs into proteins by the ribosome is the most important step of protein biosynthesis. Accordingly, translation is tightly controlled and heavily regulated to maintain cellular homeostasis. Ribosome profiling (Ribo-seq) has revolutionized the study of translation by revealing many of its underlying mechanisms. However, equally many aspects of translation remain mysterious, in part also due to persisting challenges in the interpretation of data obtained from Ribo-seq experiments. Here, we show that some of the variability observed in Ribo-seq data has biological origins and reflects programmed heterogeneity of translation. Through a comparative analysis of Ribo-seq data from Saccharomyces cerevisiae, we systematically identify short 3-codon sequences that are differentially translated (DT) across mRNAs, that is, identical sequences that are translated sometimes fast and sometimes slowly beyond what can be attributed to variability between experiments. Remarkably, the thus identified DT sequences link to mechanisms known to regulate translation elongation and are enriched in genes important for protein and organelle biosynthesis. Our results thus highlight examples of translational heterogeneity that are encoded in the genomic sequences and tuned to optimizing cellular homeostasis. More generally, our work highlights the power of Ribo-seq to understand the complexities of translation regulation.
Collapse
|
46
|
McKinnon LM, Miller JB, Whiting MF, Kauwe JSK, Ridge PG. A comprehensive analysis of the phylogenetic signal in ramp sequences in 211 vertebrates. Sci Rep 2021; 11:622. [PMID: 33436653 PMCID: PMC7803996 DOI: 10.1038/s41598-020-78803-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 11/23/2020] [Indexed: 01/24/2023] Open
Abstract
Ramp sequences increase translational speed and accuracy when rare, slowly-translated codons are found at the beginnings of genes. Here, the results of the first analysis of ramp sequences in a phylogenetic construct are presented. Ramp sequences were compared from 247 vertebrates (114 Mammalian and 133 non-mammalian), where the presence and absence of ramp sequences was analyzed as a binary character in a parsimony and maximum likelihood framework. Additionally, ramp sequences were mapped to the Open Tree of Life synthetic tree to determine the number of parallelisms and reversals that occurred, and those results were compared to random permutations. Parsimony and maximum likelihood analyses of the presence and absence of ramp sequences recovered phylogenies that are highly congruent with established phylogenies. Additionally, 81% of vertebrate mammalian ramps and 81.2% of other vertebrate ramps had less parallelisms and reversals than the mean from 1000 randomly permuted trees. A chi-square analysis of completely orthologous ramp sequences resulted in a p-value < 0.001 as compared to random chance. Ramp sequences recover comparable phylogenies as other phylogenomic methods. Although not all ramp sequences appear to have a phylogenetic signal, more ramp sequences track speciation than expected by random chance. Therefore, ramp sequences may be used in conjunction with other phylogenomic approaches if many orthologs are taken into account. However, phylogenomic methods utilizing few orthologs should be cautious in incorporating ramp sequences because individual ramp sequences may provide conflicting signals.
Collapse
Affiliation(s)
- Lauren M McKinnon
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Michael F Whiting
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
- Monte L. Bean Museum, Brigham Young University, Provo, UT, 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Perry G Ridge
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA.
| |
Collapse
|
47
|
Exploring Potential Signals of Selection for Disordered Residues in Prokaryotic and Eukaryotic Proteins. GENOMICS PROTEOMICS & BIOINFORMATICS 2020; 18:549-564. [PMID: 33346088 PMCID: PMC8377245 DOI: 10.1016/j.gpb.2020.06.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 03/29/2020] [Accepted: 06/10/2020] [Indexed: 11/22/2022]
Abstract
Intrinsically disordered proteins (IDPs) are an important class of proteins in all domains of life for their functional importance. However, how nature has shaped the disorder potential of prokaryotic and eukaryotic proteins is still not clearly known. Randomly generated sequences are free of any selective constraints, thus these sequences are commonly used as null models. Considering different types of random protein models, here we seek to understand how the disorder potential of natural eukaryotic and prokaryotic proteins differs from random sequences. Comparing proteome-wide disorder content between real and random sequences of 12 model organisms, we noticed that eukaryotic proteins are enriched in disordered regions compared to random sequences, but in prokaryotes such regions are depleted. By analyzing the position-wise disorder profile, we show that there is a generally higher disorder near the N- and C-terminal regions of eukaryotic proteins as compared to the random models; however, either no or a weak such trend was found in prokaryotic proteins. Moreover, here we show that this preference is not caused by the amino acid or nucleotide composition at the respective sites. Instead, these regions were found to be endowed with a higher fraction of protein–protein binding sites, suggesting their functional importance. We discuss several possible explanations for this pattern, such as improving the efficiency of protein–protein interaction, ribosome movement during translation, and post-translational modification. However, further studies are needed to clearly understand the biophysical mechanisms causing the trend.
Collapse
|
48
|
Abstract
The encoded biosynthesis of proteins provides the ultimate paradigm for high-fidelity synthesis of long polymers of defined sequence and composition, but it is limited to polymerizing the canonical amino acids. Recent advances have built on genetic code expansion - which commonly permits the cellular incorporation of one type of non-canonical amino acid into a protein - to enable the encoded incorporation of several distinct non-canonical amino acids. Developments include strategies to read quadruplet codons, use non-natural DNA base pairs, synthesize completely recoded genomes and create orthogonal translational components with reprogrammed specificities. These advances may enable the genetically encoded synthesis of non-canonical biopolymers and provide a platform for transforming the discovery and evolution of new materials and therapeutics.
Collapse
Affiliation(s)
| | - Jason W Chin
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK.
| |
Collapse
|
49
|
Algorithms for ribosome traffic engineering and their potential in improving host cells' titer and growth rate. Sci Rep 2020; 10:21202. [PMID: 33273552 PMCID: PMC7713304 DOI: 10.1038/s41598-020-78260-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 11/20/2020] [Indexed: 11/08/2022] Open
Abstract
mRNA translation is a fundamental cellular process consuming most of the intracellular energy; thus, it is under extensive evolutionary selection for optimization, and its efficiency can affect the host's growth rate. We describe a generic approach for improving the growth rate (fitness) of any organism by introducing synonymous mutations based on comprehensive computational models. The algorithms introduce silent mutations that may improve the allocation of ribosomes in the cells via the decreasing of their traffic jams during translation respectively. As a result, resources availability in the cell changes leading to improved growth-rate. We demonstrate experimentally the implementation of the method on Saccharomyces cerevisiae: we show that by introducing a few mutations in two computationally selected genes the mutant's titer increased. Our approach can be employed for improving the growth rate of any organism providing the existence of data for inferring models, and with the relevant genomic engineering tools; thus, it is expected to be extremely useful in biotechnology, medicine, and agriculture.
Collapse
|
50
|
Hodgman MW, Miller JB, Meurs TE, Kauwe JSK. CUBAP: an interactive web portal for analyzing codon usage biases across populations. Nucleic Acids Res 2020; 48:11030-11039. [PMID: 33045750 PMCID: PMC7641757 DOI: 10.1093/nar/gkaa863] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/18/2020] [Accepted: 09/22/2020] [Indexed: 12/19/2022] Open
Abstract
Synonymous codon usage significantly impacts translational and transcriptional efficiency, gene expression, the secondary structure of both mRNA and proteins, and has been implicated in various diseases. However, population-specific differences in codon usage biases remain largely unexplored. Here, we present a web server, https://cubap.byu.edu, to facilitate analyses of codon usage biases across populations (CUBAP). Using the 1000 Genomes Project, we calculated and visually depict population-specific differences in codon frequencies, codon aversion, identical codon pairing, co-tRNA codon pairing, ramp sequences, and nucleotide composition in 17,634 genes. We found that codon pairing significantly differs between populations in 35.8% of genes, allowing us to successfully predict the place of origin for African and East Asian individuals with 98.8% and 100% accuracy, respectively. We also used CUBAP to identify a significant bias toward decreased CTG pairing in the immunity related GTPase M (IRGM) gene in East Asian and African populations, which may contribute to the decreased association of rs10065172 with Crohn's disease in those populations. CUBAP facilitates in-depth gene-specific and codon-specific visualization that will aid in analyzing candidate genes identified in genome-wide association studies, identifying functional implications of synonymous variants, predicting population-specific impacts of synonymous variants and categorizing genetic biases unique to certain populations.
Collapse
Affiliation(s)
- Matthew W Hodgman
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Taylor E Meurs
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|