1
|
Tatarinova TV, Chekalin E, Nikolsky Y, Bruskin S, Chebotarov D, McNally KL, Alexandrov N. Nucleotide diversity analysis highlights functionally important genomic regions. Sci Rep 2016; 6:35730. [PMID: 27774999 PMCID: PMC5075931 DOI: 10.1038/srep35730] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Accepted: 09/30/2016] [Indexed: 12/15/2022] Open
Abstract
We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3′ UTRs, the area rich with regulatory regions.
Collapse
Affiliation(s)
- Tatiana V Tatarinova
- Center for Personalized Medicine and Spatial Sciences Institute, University of Southern California, Los Angeles, CA, USA.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russian Federation
| | | | - Yuri Nikolsky
- Vavilov Institute of General Genetics, Moscow, Russia.,F1 Genomics, San Diego, CA, USA.,School of Systems Biology, George Mason University, VA, USA
| | | | - Dmitry Chebotarov
- International Rice Research Institute, Los Baños, Laguna 4031, Philippines
| | - Kenneth L McNally
- International Rice Research Institute, Los Baños, Laguna 4031, Philippines
| | | |
Collapse
|
2
|
Hu J, Ni Y, Dryman BA, Meng XJ, Zhang C. Immunogenicity study of plant-made oral subunit vaccine against porcine reproductive and respiratory syndrome virus (PRRSV). Vaccine 2012; 30:2068-74. [PMID: 22300722 DOI: 10.1016/j.vaccine.2012.01.059] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2011] [Revised: 01/14/2012] [Accepted: 01/19/2012] [Indexed: 01/11/2023]
Abstract
Currently, killed-virus and modified-live PRRSV vaccines are used to control porcine reproductive and respiratory syndrome disease (PRRS). However, very limited efficacy of killed-virus vaccines and serious safety concerns for modified-live virus vaccines demand the development of novel PRRSV vaccines. In this report, we investigated the possibility of using transgenic plants as a cost-effective and scalable system for production and delivery of a viral protein as an oral subunit vaccine against PRRSV. Corn calli were genetically engineered to produce PRRSV viral envelope-associated M protein. Both serum and intestine mucosal antigen-specific antibodies were induced by oral administration of the transgenic plant tissues to mice. In addition, serum and mucosal antibodies showed virus neutralization activity. The neutralization antibody titers after the final boost reached 6.7 in serum and 3.7 in fecal extracts, respectively. A PRRSV-specific IFN-γ response was also detected in splenocytes of vaccinated animals. These results demonstrate that transgenic corn plants are an efficient subunit vaccine production and oral delivery system for generation of both systemic and mucosal immune responses against PRRSV.
Collapse
MESH Headings
- Administration, Oral
- Animals
- Antibodies, Neutralizing/analysis
- Antibodies, Neutralizing/blood
- Antibodies, Viral/analysis
- Antibodies, Viral/blood
- Feces/chemistry
- Female
- Interferon-gamma/metabolism
- Leukocytes, Mononuclear/immunology
- Mice
- Mice, Inbred BALB C
- Mucous Membrane/immunology
- Plants, Genetically Modified/genetics
- Porcine respiratory and reproductive syndrome virus/genetics
- Porcine respiratory and reproductive syndrome virus/immunology
- Spleen/immunology
- Vaccines, Edible/administration & dosage
- Vaccines, Edible/genetics
- Vaccines, Edible/immunology
- Vaccines, Subunit/administration & dosage
- Vaccines, Subunit/genetics
- Vaccines, Subunit/immunology
- Vaccines, Synthetic/administration & dosage
- Vaccines, Synthetic/genetics
- Vaccines, Synthetic/immunology
- Viral Matrix Proteins/genetics
- Viral Matrix Proteins/immunology
- Viral Vaccines/administration & dosage
- Viral Vaccines/genetics
- Viral Vaccines/immunology
- Zea mays/genetics
Collapse
Affiliation(s)
- Jianzhong Hu
- Department of Biological Systems Engineering, Virginia Tech, Blacksburg, VA 24061, USA
| | | | | | | | | |
Collapse
|
3
|
Hamby SE, Thomas NST, Cooper DN, Chuzhanova N. A meta-analysis of single base-pair substitutions in translational termination codons ('nonstop' mutations) that cause human inherited disease. Hum Genomics 2011; 5:241-64. [PMID: 21712188 PMCID: PMC3525242 DOI: 10.1186/1479-7364-5-4-241] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
'Nonstop' mutations are single base-pair substitutions that occur within translational termination (stop) codons and which can lead to the continued and inappropriate translation of the mRNA into the 3'-untranslated region. We have performed a meta-analysis of the 119 nonstop mutations (in 87 different genes) known to cause human inherited disease, examining the sequence context of the mutated stop codons and the average distance to the next alternative in-frame stop codon downstream, in comparison with their counterparts from control (non-mutated) gene sequences. A paucity of alternative in-frame stop codons was noted in the immediate vicinity (0-49 nucleotides downstream) of the mutated stop codons as compared with their control counterparts (p = 7.81 × 10-4). This implies that at least some nonstop mutations with alternative stop codons in close proximity will not have come to clinical attention, possibly because they will have given rise to stable mRNAs (not subject to nonstop mRNA decay) that are translatable into proteins of near-normal length and biological function. A significant excess of downstream in-frame stop codons was, however, noted in the range 150-199 nucleotides from the mutated stop codon (p = 8.55 × 10-4). We speculate that recruitment of an alternative stop codon at greater distance from the mutated stop codon may trigger nonstop mRNA decay, thereby decreasing the amount of protein product and yielding a readily discernible clinical phenotype. Confirmation or otherwise of this postulate must await the emergence of a clearer understanding of the mechanism of nonstop mRNA decay in mammalian cells.
Collapse
Affiliation(s)
- Stephen E Hamby
- School of Science and Technology, Nottingham Trent University, UK
| | | | | | | |
Collapse
|
4
|
Mort M, Ivanov D, Cooper DN, Chuzhanova NA. A meta-analysis of nonsense mutations causing human genetic disease. Hum Mutat 2008; 29:1037-47. [PMID: 18454449 DOI: 10.1002/humu.20763] [Citation(s) in RCA: 316] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | | | | | | |
Collapse
|
5
|
Zhouravleva GA, Moskalenko SE, Murina OA, Inge-Vechtomov SG. Viable nonsense mutants for the SUP45 gene in the yeast Saccharomyces cerevisiae are lethal at increased temperature. RUSS J GENET+ 2007. [DOI: 10.1134/s1022795407100079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Wu G, Nie L, Freeland SJ. The effects of differential gene expression on coding sequence features: Analysis by one-way ANOVA. Biochem Biophys Res Commun 2007; 358:1108-13. [PMID: 17517370 DOI: 10.1016/j.bbrc.2007.05.043] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2007] [Accepted: 05/08/2007] [Indexed: 10/23/2022]
Abstract
It is well-established that non-random patterns in coding DNA sequence (CDS) features can be partially explained by translational selection. Recent extensions of microarray and proteomic expression data have stimulated many genome-wide investigations of the relationships between gene expression and various CDS features. However, only modest correlations have been found. Here we introduced the one-way ANOVA, a more powerful extension of previous grouping methods, to re-examine these relationships at the whole genome scale for Saccharomyces cerevisiae, where genome-wide protein abundance has been recently quantified. Our results clarify that coding sequence features are inappropriate for use as genome-wide estimators for protein expression levels. This analysis also demonstrates that one-way ANOVA is a powerful and simple method to explore the influence of gene expression on CDS features.
Collapse
Affiliation(s)
- Gang Wu
- Department of Biological Sciences, University of Maryland at Baltimore County, Baltimore, MD 21250, USA.
| | | | | |
Collapse
|
7
|
Nie L, Wu G, Zhang W. Correlation of mRNA expression and protein abundance affected by multiple sequence features related to translational efficiency in Desulfovibrio vulgaris: a quantitative analysis. Genetics 2006; 174:2229-43. [PMID: 17028312 PMCID: PMC1698625 DOI: 10.1534/genetics.106.065862] [Citation(s) in RCA: 163] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The modest correlation between mRNA expression and protein abundance in large-scale data sets is explained in part by experimental challenges, such as technological limitations, and in part by fundamental biological factors in the transcription and translation processes. Among various factors affecting the mRNA-protein correlation, the roles of biological factors related to translation are poorly understood. In this study, using experimental mRNA expression and protein abundance data collected from Desulfovibrio vulgaris by DNA microarray and liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) proteomic analysis, we quantitatively examined the effects of several translational-efficiency-related sequence features on mRNA-protein correlation. Three classes of sequence features were investigated according to different translational stages: (i) initiation, Shine-Dalgarno sequences, start codon identity, and start codon context; (ii) elongation, codon usage and amino acid usage; and (iii) termination, stop codon identity and stop codon context. Surprisingly, although it is widely accepted that translation initiation is the rate-limiting step for translation, our results showed that the mRNA-protein correlation was affected the most by the features at elongation stages, i.e., codon usage and amino acid composition (5.3-15.7% and 5.8-11.9% of the total variation of mRNA-protein correlation, respectively), followed by stop codon context and the Shine-Dalgarno sequence (3.7-5.1% and 1.9-3.8%, respectively). Taken together, all sequence features contributed to 15.2-26.2% of the total variation of mRNA-protein correlation. This study provides the first comprehensive quantitative analysis of the mRNA-protein correlation in bacterial D. vulgaris and adds new insights into the relative importance of various sequence features in prokaryotic protein translation.
Collapse
Affiliation(s)
- Lei Nie
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington, DC 20057, USA
| | | | | |
Collapse
|
8
|
Liu Q. Comparative analysis of base biases around the stop codons in six eukaryotes. Biosystems 2006; 81:281-9. [PMID: 15979780 DOI: 10.1016/j.biosystems.2005.05.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2005] [Revised: 05/11/2005] [Accepted: 05/14/2005] [Indexed: 11/17/2022]
Abstract
Using full-length cDNA sequences, a comparative analysis of sequence patterns around the stop codons in six eukaryotes was performed. Here, it was showed that the codon immediately before and after the stop codons (defined as -1 codon and +1 codon, respectively) were much more biased than other examined positions, especially at the second position of -1 codons and the first position of +1 codons which were rich in As/Us and purines, respectively, for most species. The author speculated that strongly biased sequence pattern from position -2 to +4 might act as an extended translation termination signal. Translation termination was catalyzed by release factors that recognized the stop codons. The multiple amino acid sequence alignment of eukaryotic release factor 1 (eRF1) of 20 species showed that there were 16 residue sites that were strictly conserved, especially the invariant amino acids Ile70 and Lys71. Accordingly, it could be inferred that those candidate amino acids might involve in the recognition process. Moreover, the possible stop signal recognition hypothesis was also discussed herein.
Collapse
Affiliation(s)
- Qingpo Liu
- Department of Agronomy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310029, China.
| |
Collapse
|
9
|
Handy DE, Hang G, Scolaro J, Metes N, Razaq N, Yang Y, Loscalzo J. Aminoglycosides decrease glutathione peroxidase-1 activity by interfering with selenocysteine incorporation. J Biol Chem 2005; 281:3382-8. [PMID: 16354666 PMCID: PMC1472404 DOI: 10.1074/jbc.m511295200] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Cellular glutathione peroxidase is a key intracellular antioxidant enzyme that contains a selenocysteine residue at its active site. Selenium, a selenocysteine incorporation sequence in the 3'-untranslated region of the glutathione peroxidase mRNA, and other translational cofactors are necessary for "read-through" of a UGA stop codon that specifies selenocysteine incorporation. Aminoglycoside antibiotics facilitate read-through of premature stop codons in prokayotes and eukaryotes. We studied the effects of G418, an aminoglycoside, on cellular glutathione peroxidase expression and function in mammalian cells. Insertion of a selenocysteine incorporation element along with a UGA codon into a reporter construct allows for read-through only in the presence of selenium. G418 increased read-through in selenium-replete cells as well as in the absence of selenium. G418 treatment increased immunodetectable endogenous or recombinant glutathione peroxidase but reduced the specific activity of the enzyme. Tandem mass spectrometry experiments indicated that G418 caused a substitution of l-arginine for selenocysteine. These data show that G418 can affect the biosynthesis of this key antioxidant enzyme by promoting substitution at the UGA codon.
Collapse
Affiliation(s)
- Diane E Handy
- Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | | | |
Collapse
|
10
|
Liu Q, Xue Q. Comparative studies on sequence characteristics around translation initiation codon in four eukaryotes. J Genet 2005; 84:317-22. [PMID: 16385165 DOI: 10.1007/bf02715803] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Qingpo Liu
- Department of Agronomy, College of Agriculture and Biotechnology, Zhejiang University, 310029 Hangzhou, People's Republic of China.
| | | |
Collapse
|
11
|
Arakawa K, Suzuki H, Fujishima K, Fujimoto K, Ueda S, Matsui M, Tomita M. A Comprehensive Software Suite for the Analysis of cDNAs. GENOMICS, PROTEOMICS & BIOINFORMATICS 2005; 3:179-88. [PMID: 16487083 PMCID: PMC5172547 DOI: 10.1016/s1672-0229(05)03023-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We have developed a comprehensive software suite for bioinformatics research of cDNAs; it is aimed at rapid characterization of the features of genes and the proteins they code. Methods implemented include the detection of translation initiation and termination signals, statistical analysis of codon usage, comparative study of amino acid composition, comparative modeling of the structures of product proteins, prediction of alternative splice forms, and metabolic pathway reconstruction. The software package is freely available under the GNU General Public License at http://www.g-language.org/data/cdna/.
Collapse
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan.
| | | | | | | | | | | | | |
Collapse
|
12
|
Abstract
Great advances have been made in the past three decades in understanding the molecular mechanics underlying protein synthesis in bacteria, but our understanding of the corresponding events in eukaryotic organisms is only beginning to catch up. In this review we describe the current state of our knowledge and ignorance of the molecular mechanics underlying eukaryotic translation. We discuss the mechanisms conserved across the three kingdoms of life as well as the important divergences that have taken place in the pathway.
Collapse
Affiliation(s)
- Lee D Kapp
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, Maryland 21205-2185, USA.
| | | |
Collapse
|
13
|
Shabalina SA, Ogurtsov AY, Rogozin IB, Koonin EV, Lipman DJ. Comparative analysis of orthologous eukaryotic mRNAs: potential hidden functional signals. Nucleic Acids Res 2004; 32:1774-82. [PMID: 15031317 PMCID: PMC390323 DOI: 10.1093/nar/gkh313] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Sequencing of multiple, nearly complete eukaryotic genomes creates opportunities for detecting previously unnoticed, subtle functional signals in non-coding regions. A genome-wide comparative analysis of orthologous sets of mammalian and yeast mRNAs revealed distinct patterns of evolutionary conservation at the boundaries of the untranslated regions (UTRs) and the coding region (CDS). Elevated sequence conservation was detected in approximately 30 nt regions around the start codon. There seems to be a complementary relationship between sequence conservation in the approximately 30 nt regions of the 5'-UTR immediately upstream of the start codon and that in the synonymous positions of the 5'-terminal 30 nt of the CDS: in mammalian mRNAs, the 5'-UTR shows a greater conservation than the CDS, whereas the opposite trend holds for yeast mRNAs. Unexpectedly, a approximately 30 nt region downstream of the stop codon shows a substantially lower level of sequence conservation than the downstream portions of the 3'-UTRs. However, the sequence in this poorly conserved 30 nt portion of the 3'-UTR is non-random in that it has a higher GC content than the rest of the UTR. It is hypothesized that the elevated sequence conservation in the region immediately upstream of the start codon is related to the requirement for initiation factor binding during pre-initiation ribosomal scanning. In contrast, the poorly conserved region downstream of the stop codon could be involved in the post- termination scanning and dissociation of the ribosomes from the mRNA, which requires only the mRNA-ribosome interaction. Additionally, it was found that the choice of the stop codon in mammals, but not in yeasts, and the context in the immediate vicinity of the stop codons in both mammals and yeasts are subject to strong selection. Thus, genome-wide analysis of orthologous gene sets allows detection of previously unrecognized patterns of sequence conservation, which are likely to reflect hidden functional signals, such as ribosomal filters that could regulate translation by modulating the interaction between the mRNA and ribosomes.
Collapse
Affiliation(s)
- Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|