Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Berezovsky IN, Kilosanidze GT, Tumanyan VG, Kisselev LL. Amino acid composition of protein termini are biased in different manners. Protein Eng 1999;12:23-30. [PMID: 10065707 DOI: 10.1093/protein/12.1.23] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

For:	Berezovsky IN, Kilosanidze GT, Tumanyan VG, Kisselev LL. Amino acid composition of protein termini are biased in different manners. Protein Eng 1999;12:23-30. [PMID: 10065707 DOI: 10.1093/protein/12.1.23] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Number

Cited by Other Article(s)

Mir MH, Parmar S, Singh C, Kalia D. Location-agnostic site-specific protein bioconjugation via Baylis Hillman adducts. Nat Commun 2024;15:859. [PMID: 38286847 PMCID: PMC10825175 DOI: 10.1038/s41467-024-45124-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 01/15/2024] [Indexed: 01/31/2024] Open

Owen MD, Sacks C, Bathina S, Emmins RA, Dickson AJ. Characterising the structural and cellular role of immunoglobulin C-terminal lysine in secretory pathways. J Biotechnol 2023;374:38-48. [PMID: 37495115 DOI: 10.1016/j.jbiotec.2023.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 07/19/2023] [Indexed: 07/28/2023]

Abstract

Improved understanding of expression of recombinant immunoglobulin (IgG)-based therapies can decrease manufacturing process costs and bring down costs to patients. Deletion of C-terminal Lysine (C-Lys) from IgG molecules has been shown to greatly impact yield. This study set out to characterise structural components of IgG C-terminal variants which modulate protein expression by examination of the consequences of mutations at the C-terminal of IgG on expression and by the use of fluorescent C-terminal fragment fusion proteins. Cell-based and cell-free experiments were also implemented to characterise how the C-terminal differentially engages with cellular pathways to modulate expression. IgG variants engineered by removal of the C-terminal Lys were expressed at significantly lower rates than control variants by CHO (and HEK) cells. Engineered constructs of mCherry fused with short regions of the C-terminal regions of IgG mimicked the ordering of expressability observed for IgG variants. These fluorescent C-terminal fragment fusions offered the potential to profile how sequences (and point mutations) modified expression. Via combinations of cell and cell-free systems, screening across a range of variants of IgG and mCherry reporter constructs has shown that interactions between specific C-terminal amino acid sequences and the ribosome can regulate the rate and extent of expression. This study highlights the importance of amino acid sequence regulatory events determining the efficiency of production of desirable recombinant proteins, showing that wildtype C-terminal lysine is a necessary capping molecule for IgG1 expression. From a wider perspective, these data are especially significant towards the design of novel entities. The approach has also provided information about novel short C-terminal tags which may be used to provide selective synthesis of specific subunits in the production of multisubunit products. Alternative strategies for removing C-terminal amino acid heterogeneity whilst maintaining efficient rates of expression have been provided.

Collapse

De Rosa L, Di Stasi R, Romanelli A, D’Andrea LD. Exploiting Protein N-Terminus for Site-Specific Bioconjugation. Molecules 2021;26:3521. [PMID: 34207845 PMCID: PMC8228110 DOI: 10.3390/molecules26123521] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 06/07/2021] [Accepted: 06/07/2021] [Indexed: 11/29/2022] Open

Influence of nascent polypeptide positive charges on translation dynamics. Biochem J 2021;477:2921-2934. [PMID: 32797214 DOI: 10.1042/bcj20200303] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 07/17/2020] [Accepted: 07/23/2020] [Indexed: 01/05/2023]

Weber M, Burgos R, Yus E, Yang J, Lluch‐Senar M, Serrano L. Impact of C-terminal amino acid composition on protein expression in bacteria. Mol Syst Biol 2020;16:e9208. [PMID: 32449593 PMCID: PMC7246954 DOI: 10.15252/msb.20199208] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Revised: 04/07/2020] [Accepted: 04/09/2020] [Indexed: 11/30/2022] Open

Bao W, Yuan CA, Zhang Y, Han K, Nandi AK, Honig B, Huang DS. Mutli-Features Prediction of Protein Translational Modification Sites. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:1453-1460. [PMID: 28961121 DOI: 10.1109/tcbb.2017.2752703] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Androsiuk P, Jastrzębski JP, Paukszto Ł, Okorski A, Pszczółkowska A, Chwedorzewska KJ, Koc J, Górecki R, Giełwanowska I. The complete chloroplast genome of Colobanthus apetalus (Labill.) Druce: genome organization and comparison with related species. PeerJ 2018;6:e4723. [PMID: 29844954 PMCID: PMC5970550 DOI: 10.7717/peerj.4723] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2018] [Accepted: 04/17/2018] [Indexed: 02/02/2023] Open

Abstract

Colobanthus apetalus is a member of the genus Colobanthus, one of the 86 genera of the large family Caryophyllaceae which groups annual and perennial herbs (rarely shrubs) that are widely distributed around the globe, mainly in the Holarctic. The genus Colobanthus consists of 25 species, including Colobanthus quitensis, an extremophile plant native to the maritime Antarctic. Complete chloroplast (cp) genomes are useful for phylogenetic studies and species identification. In this study, next-generation sequencing (NGS) was used to identify the cp genome of C. apetalus. The complete cp genome of C. apetalus has the length of 151,228 bp, 36.65% GC content, and a quadripartite structure with a large single copy (LSC) of 83,380 bp and a small single copy (SSC) of 17,206 bp separated by inverted repeats (IRs) of 25,321 bp. The cp genome contains 131 genes, including 112 unique genes and 19 genes which are duplicated in the IRs. The group of 112 unique genes features 73 protein-coding genes, 30 tRNA genes, four rRNA genes and five conserved chloroplast open reading frames (ORFs). A total of 12 forward repeats, 10 palindromic repeats, five reverse repeats and three complementary repeats were detected. In addition, a simple sequence repeat (SSR) analysis revealed 41 (mono-, di-, tri-, tetra-, penta- and hexanucleotide) SSRs, most of which were AT-rich. A detailed comparison of C. apetalus and C. quitensis cp genomes revealed identical gene content and order. A phylogenetic tree was built based on the sequences of 76 protein-coding genes that are shared by the eleven sequenced representatives of Caryophyllaceae and C. apetalus, and it revealed that C. apetalus and C. quitensis form a clade that is closely related to Silene species and Agrostemma githago. Moreover, the genus Silene appeared as a polymorphic taxon. The results of this study expand our knowledge about the evolution and molecular biology of Caryophyllaceae.

Collapse

Bao W, You ZH, Huang DS. CIPPN: computational identification of protein pupylation sites by using neural network. Oncotarget 2017;8:108867-108879. [PMID: 29312575 PMCID: PMC5752488 DOI: 10.18632/oncotarget.22335] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Accepted: 09/03/2017] [Indexed: 11/25/2022] Open

Santiago-Frangos A, Jeliazkov JR, Gray JJ, Woodson SA. Acidic C-terminal domains autoregulate the RNA chaperone Hfq. eLife 2017;6:27049. [PMID: 28826489 PMCID: PMC5606850 DOI: 10.7554/elife.27049] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Accepted: 08/03/2017] [Indexed: 11/15/2022] Open

Requião RD, Fernandes L, de Souza HJA, Rossetto S, Domitrovic T, Palhano FL. Protein charge distribution in proteomes and its impact on translation. PLoS Comput Biol 2017;13:e1005549. [PMID: 28531225 PMCID: PMC5460897 DOI: 10.1371/journal.pcbi.1005549] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Revised: 06/06/2017] [Accepted: 05/02/2017] [Indexed: 11/25/2022] Open

Abstract

As proteins are synthesized, the nascent polypeptide must pass through a negatively charged exit tunnel. During this stage, positively charged stretches can interact with the ribosome walls and slow the translation. Therefore, charged polypeptides may be important factors that affect protein expression. To determine the frequency and distribution of positively and negatively charged stretches in different proteomes, the net charge was calculated for every 30 consecutive amino acid residues, which corresponds to the length of the ribosome exit tunnel. The following annotated and reviewed proteins in the UniProt database (Swiss-Prot) were analyzed: 551,705 proteins from different organisms and a total of 180 million protein segments. We observed that there were more negative than positive stretches and that super-charged positive sequences (i.e., net charges ≥ 14) were underrepresented in the proteomes. Overall, the proteins were more positively charged at their N-termini and C-termini, and this feature was present in most organisms and subcellular localizations. To investigate whether the N-terminal charges affect the elongation rates, previously published ribosomal profiling data obtained from S. cerevisiae, without translation-interfering drugs, were analyzed. We observed a nonlinear effect of the charge on the ribosome occupancy in which values ≥ +5 and ≤ -6 showed increased and reduced ribosome densities, respectively. These groups also showed different distributions across 80S monosomes and polysomes. Basic polypeptides are more common within short proteins that are translated by monosomes, whereas negative stretches are more abundant in polysome-translated proteins. These findings suggest that the nascent peptide charge impacts translation and can be one of the factors that regulate translation efficiency and protein expression.

Which factors shape the sequence of amino acids that will form a protein? The biochemical features of amino acids, such as their charge and hydrophobicity, are important drivers of protein tridimensional folding, which creates interaction sites for binding other molecules and directs proteins to specific cellular compartments. These features all impact the activity of the proteins after they are produced. Another less obvious factor that influences the protein’s primary structure may be how efficiently a given amino acid sequence is produced by the ribosome. It is known that a repetitive stretch of positively charged amino acids may interact with the negative charges in the ribosome exit tunnel, slowing, or even halting, translation. By analyzing the charge of protein stretches in different organisms, we observed that proteins tend to present positively charged stretches at their extremities, and high charge values can slow (for positive charges) or speed (for negative charges) translation. An interesting consequence of this trend is that proteins that are translated in high quantities by several ribosomes at the same RNA (polysomes) tend to have more negatively charged stretches than proteins that are translated by a single ribosome per RNA (monosomes).

Collapse

Self-Referential Encoding on Modules of Anticodon Pairs-Roots of the Biological Flow System. Life (Basel) 2017;7:life7020016. [PMID: 28383509 PMCID: PMC5492138 DOI: 10.3390/life7020016] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Revised: 03/24/2017] [Accepted: 03/26/2017] [Indexed: 12/22/2022] Open

Chen D, Disotuar MM, Xiong X, Wang Y, Chou DHC. Selective N-terminal functionalization of native peptides and proteins. Chem Sci 2017;8:2717-2722. [PMID: 28553506 PMCID: PMC5426342 DOI: 10.1039/c6sc04744k] [Citation(s) in RCA: 113] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 01/06/2017] [Indexed: 12/12/2022] Open

Bao W, Jiang Z. Prediction of Lysine Pupylation Sites with Machine Learning Methods. INTELLIGENT COMPUTING THEORIES AND APPLICATION 2017. [DOI: 10.1007/978-3-319-63312-1_36] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Charneski CA, Hurst LD. Positive Charge Loading at Protein Termini Is Due to Membrane Protein Topology, Not a Translational Ramp. Mol Biol Evol 2013;31:70-84. [DOI: 10.1093/molbev/mst169] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Hansted JG, Pietikäinen L, Hög F, Sperling-Petersen HU, Mortensen KK. Expressivity tag: a novel tool for increased expression in Escherichia coli. J Biotechnol 2011;155:275-83. [PMID: 21801766 DOI: 10.1016/j.jbiotec.2011.07.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Revised: 07/07/2011] [Accepted: 07/11/2011] [Indexed: 11/18/2022]

Asada M, Hirakawa H, Kuhara S. Classification of Bacteria Based on the Biases of Terminal Amino Acid Residues. Protein J 2011;30:290-7. [DOI: 10.1007/s10930-011-9332-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Takahashi H, Yokota A, Takenawa T, Iwakura M. Sequence Perturbation Analysis: Addressing Amino Acid Indices to Elucidate the C-Terminal Role of Escherichia Coli Dihydrofolate Reductase. ACTA ACUST UNITED AC 2009;145:751-62. [DOI: 10.1093/jb/mvp034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Guimarães RC, Moreira CHC, de Farias ST. A self-referential model for the formation of the genetic code. Theory Biosci 2008;127:249-70. [PMID: 18493811 DOI: 10.1007/s12064-008-0043-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2007] [Accepted: 04/11/2008] [Indexed: 10/22/2022]

Abstract

A model for the formation of the genetic code is presented where protein synthesis is directed initially by tRNA dimers. Proteins that are resistant to degradation and efficient RNA-binders protect the RNAs. Replication becomes elongational producing poly-tRNAs from which the mRNAs and ribosomes are derived. Attributions are successively fixed to tRNAs paired through the perfect palindromic anticodons, with the same bases at the extremities (5'ANA: UNU 3'; GNG: CNC; principal dinucleotides, pDiN). The 5' degeneracy is then developed. The first pairs to be encoded correspond to the hydropathy correlation outliers (Gly-CC: Pro-GG and Ser-GA: Ser-CU) and to the sector of homogeneous pDiN, composed by two pyrimidines or two purines. These amino acids are preferred in the N-ends of proteins, stabilizers of proteins against catabolism and strong RNA-binders. The next pairs complete the sector of homogeneous pDiN (Asp, Glu-UC: Leu-AG and Asn, Lys-UU: Phe-AA). This set of nine amino acids forms the protein cores with the predominant aperiodic conformation. Next enter the pairs with mixed pDiN (one purine and one pyrimidine), the RY attributions composing the protein N-ends and the YR attributions the C-ends. The last pair contains the main punctuation signs (Ile, Met, iMet-AU: Tyr, Stop-UA). The model indicates that genetic information emerged during the process of formation of the coding/decoding system and that genes were defined by the proteins. Stable proteins constructed the nucleoprotein system by binding to the RNAs that produced them. In this circular rationale, genes are memories in a metabolic system for production of proteins that stabilize it. The simplicity and the highly deterministic character of the process suggest that the Last Universal Common Ancestor populations could be composed, in early stages, of lineages bearing similar genetic codes.

Collapse

C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families. BMC Genomics 2007;8:191. [PMID: 17594486 PMCID: PMC1929074 DOI: 10.1186/1471-2164-8-191] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2006] [Accepted: 06/26/2007] [Indexed: 12/28/2022] Open

Abstract

Background

The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio.

Results

We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*), the ER-retention signal (K/HDEL*), the ER-retrieval signal for membrane bound proteins (KKxx*), the prenylation signal (CC*) and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists between species, among kingdoms and across eukaryotes. Motifs of note include a serine-acidic peptide (DSD*) as well as several lysine enriched motifs found in nearly all eukaryotic genomes examined.

Conclusion

We have successfully generated a high confidence representation of eukaryotic motifs anchored at the C-terminus. A high incidence of true-positives in our results suggests that several previously unidentified tripeptide patterns are strong candidates for representing novel peptide motifs of a widely employed nature in the C-terminal biology of eukaryotes. Our application of comparative genomics, statistical over-representation and the adjustment for protein family homology has generated several hypotheses concerning the C-terminal topology as it pertains to sorting and potential protein interaction signals. This approach to background reduction could be expanded for application to protein motif prediction in the protein interior. A parallel N-terminal analysis is presented as supplementary data.

Collapse

Li W, Zou H, Tao M. Sequences downstream of the start codon and their relations to G + C content and optimal growth temperature in prokaryotic genomes. Antonie van Leeuwenhoek 2007;92:417-27. [PMID: 17562217 DOI: 10.1007/s10482-007-9170-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 03/30/2007] [Indexed: 11/29/2022]

Farias STD, Moreira CHC, Guimarães RC. Structure of the genetic code suggested by the hydropathy correlation between anticodons and amino acid residues. ORIGINS LIFE EVOL B 2007;37:83-103. [PMID: 16955335 DOI: 10.1007/s11084-006-9008-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2005] [Accepted: 11/08/2005] [Indexed: 10/24/2022]

Kochetov AV. Alternative translation start sites and their significance for eukaryotic proteomes. Mol Biol 2006. [DOI: 10.1134/s0026893306050049] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Bogdanov AA, Karpov VL. RNA-protein interactions at the initial and terminal stages of protein biosynthesis as investigated by Lev Kisselev (on the occasion of his 70th anniversary). BIOCHEMISTRY (MOSCOW) 2006;71:915-24. [PMID: 16978156 DOI: 10.1134/s0006297906080141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Bahir I, Linial M. Functional grouping based on signatures in protein termini. Proteins 2006;63:996-1004. [PMID: 16475191 DOI: 10.1002/prot.20903] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Cridge AG, Major LL, Mahagaonkar AA, Poole ES, Isaksson LA, Tate WP. Comparison of characteristics and function of translation termination signals between and within prokaryotic and eukaryotic organisms. Nucleic Acids Res 2006;34:1959-73. [PMID: 16614446 PMCID: PMC1435984 DOI: 10.1093/nar/gkl074] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Kuznetsov IB, Hwang S. A novel sensitive method for the detection of user-defined compositional bias in biological sequences. Bioinformatics 2006;22:1055-63. [PMID: 16500936 DOI: 10.1093/bioinformatics/btl049] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Volkova OA, Titov SE, Kochetov AV. Correlation between the contexts of the translation initiation signal and the N-terminal sequence of arabidopsis, yeast, mouse, and human proteins. Biophysics (Nagoya-shi) 2006. [DOI: 10.1134/s0006350906070037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Laio A, Micheletti C. Are structural biases at protein termini a signature of vectorial folding? Proteins 2005;62:17-23. [PMID: 16281293 DOI: 10.1002/prot.20712] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Krishna MMG, Englander SW. The N-terminal to C-terminal motif in protein folding and function. Proc Natl Acad Sci U S A 2005;102:1053-8. [PMID: 15657118 PMCID: PMC545867 DOI: 10.1073/pnas.0409114102] [Citation(s) in RCA: 123] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Gatto GJ, Berg JM. Nonrandom tripeptide sequence distributions at protein carboxyl termini. Genome Res 2003;13:617-23. [PMID: 12671002 PMCID: PMC430173 DOI: 10.1101/gr.667603] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Chung JJ, Yang H, Li M. Genome-wide analyses of carboxyl-terminal sequences. Mol Cell Proteomics 2003;2:173-81. [PMID: 12682279 DOI: 10.1074/mcp.m300008-mcp200] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Scheglmann D, Werner K, Eiselt G, Klinger R. Role of paired basic residues of protein C-termini in phospholipid binding. Protein Eng Des Sel 2002;15:521-8. [PMID: 12082171 DOI: 10.1093/protein/15.6.521] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Maurer-Stroh S, Eisenhaber B, Eisenhaber F. N-terminal N-myristoylation of proteins: refinement of the sequence motif and its taxon-specific differences. J Mol Biol 2002;317:523-40. [PMID: 11955007 DOI: 10.1006/jmbi.2002.5425] [Citation(s) in RCA: 150] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

N-terminal N-myristoylation is a lipid anchor modification of eukaryotic and viral proteins targeting them to membrane locations, thus changing the cellular function of modified proteins. Protein myristoylation is critical in many pathways; e.g. in signal transduction, apoptosis, or alternative extracellular protein export. The myristoyl-CoA:protein N-myristoyltransferase (NMT) recognizes the sequence motif of appropriate substrate proteins at the N terminus and attaches the lipid moiety to the absolutely required N-terminal glycine residue. Reliable recognition of capacity for N-terminal myristoylation from the substrate protein sequence alone is desirable for proteome-wide function annotation projects but the existing PROSITE motif is not practical, since it produces huge numbers of false positive and even some false negative predictions. As a first step towards a new prediction method, it is necessary to refine the sequence motif coding for N-terminal N-myristoylation. Relying on the in-depth study of the amino acid sequence variability of substrate proteins, on binding site analyses in X-ray structures or 3D homology models for NMTs from various taxa, and on consideration of biochemical data extracted from the scientific literature, we found indications that, at least within a complete substrate protein, the N-terminal 17 protein residues experience different types of variability restrictions. We identified three motif regions: region 1 (positions 1-6) fitting the binding pocket; region 2 (positions 7-10) interacting with the NMT's surface at the mouth of the catalytic cavity; and region 3 (positions 11-17) comprising a hydrophilic linker. Each region was characterized by physical requirements to single sequence positions or groups of positions regarding volume, polarity, backbone flexibility and other typical properties of amino acids (http://mendel.imp.univie.ac.at/myristate/). These specificity differences are confined partly to taxonomic ranges and are proposed for the design of NMT inhibitors in pathogenic fungal and protozoan systems including Aspergillus fumigatus, Leishmania major, Trypanosoma cruzi, Trypanosoma brucei, Giardia intestinalis, Entamoeba histolytica, Pneumocystis carinii, Strongyloides stercoralis and Schistosoma mansoni. An exhaustive search for NMT-homologues led to the discovery of two putative entomopoxviral NMTs.

Collapse

Stenström CM, Holmgren E, Isaksson LA. Cooperative effects by the initiation codon and its flanking regions on translation initiation. Gene 2001;273:259-65. [PMID: 11595172 DOI: 10.1016/s0378-1119(01)00584-4] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Villar HO, Koehler RT. Amino acid preferences of small, naturally occurring polypeptides. Biopolymers 2000;53:226-32. [PMID: 10679627 DOI: 10.1002/(sici)1097-0282(200003)53:3<226::aid-bip2>3.0.co;2-#] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]