1
|
Bulfoni M, Bouyioukos C, Zakaria A, Nigon F, Rapone R, Del Maestro L, Ait-Si-Ali S, Scharfmann R, Cosson B. Glucose controls co-translation of structurally related mRNAs via the mTOR and eIF2 pathways in human pancreatic beta cells. Front Endocrinol (Lausanne) 2022; 13:949097. [PMID: 35992129 PMCID: PMC9388909 DOI: 10.3389/fendo.2022.949097] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 07/11/2022] [Indexed: 11/13/2022] Open
Abstract
Pancreatic beta cell response to glucose is critical for the maintenance of normoglycemia. A strong transcriptional response was classically described in rodent models but, interestingly, not in human cells. In this study, we exposed human pancreatic beta cells to an increased concentration of glucose and analysed at a global level the mRNAs steady state levels and their translationalability. Polysome profiling analysis showed an early acute increase in protein synthesis and a specific translation regulation of more than 400 mRNAs, independently of their transcriptional regulation. We clustered the co-regulated mRNAs according to their behaviour in translation in response to glucose and discovered common structural and sequence mRNA features. Among them mTOR- and eIF2-sensitive elements have a predominant role to increase mostly the translation of mRNAs encoding for proteins of the translational machinery. Furthermore, we show that mTOR and eIF2α pathways are independently regulated in response to glucose, participating to a translational reshaping to adapt beta cell metabolism. The early acute increase in the translation machinery components prepare the beta cell for further protein demand due to glucose-mediated metabolism changes.
Collapse
Affiliation(s)
- Manuel Bulfoni
- Université Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
| | - Costas Bouyioukos
- Université Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
| | - Albatoul Zakaria
- Université Paris Cité, Institut Cochin, INSERM, CNRS, Paris, France
| | - Fabienne Nigon
- Université Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
| | - Roberta Rapone
- Université Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
| | | | | | | | - Bertrand Cosson
- Université Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
- *Correspondence: Bertrand Cosson,
| |
Collapse
|
2
|
Lackey L, Coria A, Ghosh AJ, Grayeski P, Hatfield A, Shankar V, Platig J, Xu Z, Ramos SBV, Silverman EK, Ortega VE, Cho MH, Hersh CP, Hobbs BD, Castaldi P, Laederach A. Alternative poly-adenylation modulates α1-antitrypsin expression in chronic obstructive pulmonary disease. PLoS Genet 2021; 17:e1009912. [PMID: 34784346 PMCID: PMC8631626 DOI: 10.1371/journal.pgen.1009912] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 11/30/2021] [Accepted: 10/25/2021] [Indexed: 01/07/2023] Open
Abstract
α1-anti-trypsin (A1AT), encoded by SERPINA1, is a neutrophil elastase inhibitor that controls the inflammatory response in the lung. Severe A1AT deficiency increases risk for Chronic Obstructive Pulmonary Disease (COPD), however, the role of A1AT in COPD in non-deficient individuals is not well known. We identify a 2.1-fold increase (p = 2.5x10-6) in the use of a distal poly-adenylation site in primary lung tissue RNA-seq in 82 COPD cases when compared to 64 controls and replicate this in an independent study of 376 COPD and 267 controls. This alternative polyadenylation event involves two sites, a proximal and distal site, 61 and 1683 nucleotides downstream of the A1AT stop codon. To characterize this event, we measured the distal ratio in human primary tissue short read RNA-seq data and corroborated our results with long read RNA-seq data. Integrating these results with 3' end RNA-seq and nanoluciferase reporter assay experiments we show that use of the distal site yields mRNA transcripts with over 50-fold decreased translation efficiency and A1AT expression. We identified seven RNA binding proteins using enhanced CrossLinking and ImmunoPrecipitation precipitation (eCLIP) with one or more binding sites in the SERPINA1 3' UTR. We combined these data with measurements of the distal ratio in shRNA knockdown experiments, nuclear and cytoplasmic fractionation, and chemical RNA structure probing. We identify Quaking Homolog (QKI) as a modulator of SERPINA1 mRNA translation and confirm the role of QKI in SERPINA1 translation with luciferase reporter assays. Analysis of single-cell RNA-seq showed differences in the distribution of the SERPINA1 distal ratio among hepatocytes, macrophages, αβ-Tcells and plasma cells in the liver. Alveolar Type 1,2, dendritic cells and macrophages also vary in their distal ratio in the lung. Our work reveals a complex post-transcriptional mechanism that regulates alternative polyadenylation and A1AT expression in COPD.
Collapse
Affiliation(s)
- Lela Lackey
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Auyon J. Ghosh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Phil Grayeski
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Abigail Hatfield
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Vijay Shankar
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - John Platig
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zhonghui Xu
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Silvia B. V. Ramos
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Edwin K. Silverman
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Victor E. Ortega
- Department of Internal Medicine, Division of Respiratory Medicine, Center for Individualized Medicine, Mayo Clinic, Scottsdale, Arizona, United States of America
| | - Michael H. Cho
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Craig P. Hersh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Brian D. Hobbs
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Peter Castaldi
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Internal Medicine and Primary Care, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
3
|
Genomic Space of MGMT in Human Glioma Revisited: Novel Motifs, Regulatory RNAs, NRF1, 2, and CTCF Involvement in Gene Expression. Int J Mol Sci 2021; 22:ijms22052492. [PMID: 33801310 PMCID: PMC7958331 DOI: 10.3390/ijms22052492] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/18/2021] [Accepted: 02/25/2021] [Indexed: 01/08/2023] Open
Abstract
Background: The molecular regulation of increased MGMT expression in human brain tumors, the associated regulatory elements, and linkages of these to its epigenetic silencing are not understood. Because the heightened expression or non-expression of MGMT plays a pivotal role in glioma therapeutics, we applied bioinformatics and experimental tools to identify the regulatory elements in the MGMT and neighboring EBF3 gene loci. Results: Extensive genome database analyses showed that the MGMT genomic space was rich in and harbored many undescribed RNA regulatory sequences and recognition motifs. We extended the MGMT’s exon-1 promoter to 2019 bp to include five overlapping alternate promoters. Consensus sequences in the revised promoter for (a) the transcriptional factors CTCF, NRF1/NRF2, GAF, (b) the genetic switch MYC/MAX/MAD, and (c) two well-defined p53 response elements in MGMT intron-1, were identified. A putative protein-coding or non-coding RNA sequence was located in the extended 3′ UTR of the MGMT transcript. Eleven non-coding RNA loci coding for miRNAs, antisense RNA, and lncRNAs were identified in the MGMT-EBF3 region and six of these showed validated potential for curtailing the expression of both MGMT and EBF3 genes. ChIP analysis verified the binding site in MGMT promoter for CTCF which regulates the genomic methylation and chromatin looping. CTCF depletion by a pool of specific siRNA and shRNAs led to a significant attenuation of MGMT expression in human GBM cell lines. Computational analysis of the ChIP sequence data in ENCODE showed the presence of NRF1 in the MGMT promoter and this occurred only in MGMT-proficient cell lines. Further, an enforced NRF2 expression markedly augmented the MGMT mRNA and protein levels in glioma cells. Conclusions: We provide the first evidence for several new regulatory components in the MGMT gene locus which predict complex transcriptional and posttranscriptional controls with potential for new therapeutic avenues.
Collapse
|
4
|
K-mer-Based Motif Analysis in Insect Species across Anopheles, Drosophila, and Glossina Genera and Its Application to Species Classification. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019; 2019:4259479. [PMID: 31827584 PMCID: PMC6881769 DOI: 10.1155/2019/4259479] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 09/18/2019] [Accepted: 09/28/2019] [Indexed: 11/17/2022]
Abstract
Short k-mer sequences from DNA are both conserved and diverged across species owing to their functional significance in speciation, which enables their use in many species classification algorithms. In the present study, we developed a methodology to analyze the DNA k-mers of whole genome, 5' UTR, intron, and 3' UTR regions from 58 insect species belonging to three genera of Diptera that include Anopheles, Drosophila, and Glossina. We developed an improved algorithm to predict and score k-mers based on a scheme that normalizes k-mer scores in different genomic subregions. This algorithm takes advantage of the information content of the whole genome as opposed to other algorithms or studies that analyze only a small group of genes. Our algorithm uses k-mers of lengths 7-9 bp for the whole genome, 5' and 3' UTR regions as well as the intronic regions. Taxonomical relationships based on the whole-genome k-mer signatures showed that species of the three genera clustered together quite visibly. We also improved the scoring and filtering of these k-mers for accurate species identification. The whole-genome k-mer content correlation algorithm showed that species within a single genus correlated tightly with each other as compared to other genera. The genomes of two Aedes and one Culex species were also analyzed to demonstrate how newly sequenced species can be classified using the algorithm. Furthermore, working with several dozen species has enabled us to assign a whole-genome k-mer signature for each of the 58 Dipteran species by making all-to-all pairwise comparison of the k-mer content. These signatures were used to compare the similarity between species and to identify clusters of species displaying similar signatures.
Collapse
|
5
|
Jiang Z, Yang J, Dai A, Wang Y, Li W, Xie Z. Ribosome profiling reveals translational regulation of mammalian cells in response to hypoxic stress. BMC Genomics 2017; 18:638. [PMID: 28826393 PMCID: PMC5563900 DOI: 10.1186/s12864-017-3996-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Accepted: 08/01/2017] [Indexed: 12/30/2022] Open
Abstract
Background Retinal pigment epithelium (RPE) cells transfer oxygen and nutrients from choroid to the neural retina. Reduced oxygen to RPE perturbs development and functions of blood vessels in retina. Previous efforts of genome-wide studies have been largely focused on transcriptional changes of cells in response to hypoxia. Recently developed ribosome profiling provides an opportunity to study genome-wide translational changes. To gain systemic insights into the transcriptional and translational regulation of cellular in response to hypoxic stress, we used simultaneous RNA sequencing and ribosome profiling on an RPE cells line, ARPE-19, under hypoxia condition. Results Both HIF-1α and EPAS1 (HIF-2α) proteins were stabilized in ARPE-19 under hypoxic stress treatment at 1 h, 2 h and 4 h. Analysis of simultaneous RNA sequencing and ribosome profiling data showed genome-wide gene expression changes at both transcriptional and translational levels. Comparative analysis of ribosome profiling and RNA-seq data revealed that hypoxia induced changes of more genes at the translational than the transcriptional levels. Ribosomes densities at 5′ untranslated region (UTR) significantly increased under hypoxic stress. Interestingly, the increase in ribosome densities at 5′ UTR is positively correlated with the presence of upstream open reading frames (uORFs) in the 5′ UTR of mRNAs. Conclusion Our results characterized translational profiles of mRNAs for a RPE cell line in response to hypoxia. In particular, uORFs play important roles in the regulation of translation efficiency by affecting ribosomes loading onto mRNAs. This study provides the first attempt to understand translational response of mammalian cells under hypoxic condition. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3996-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zhiwen Jiang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Lab of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Jiaqi Yang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Lab of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Aimei Dai
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Lab of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yuming Wang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Lab of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Wei Li
- Retinal Neurobiology Section, National Eye Institute, Bethesda, MD, USA
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Lab of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
6
|
In silico analysis of deleterious single nucleotide polymorphisms in human BUB1 mitotic checkpoint serine/threonine kinase B gene. Meta Gene 2016; 9:142-50. [PMID: 27331020 PMCID: PMC4913181 DOI: 10.1016/j.mgene.2016.05.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Revised: 04/12/2016] [Accepted: 05/06/2016] [Indexed: 11/23/2022] Open
Abstract
One of the major challenges in the analysis of human genetic variation is to distinguish mutations that are functionally neutral from those that contribute to disease. BubR1 is a key protein mediating spindle-checkpoint activation that plays a role in the inhibition of the anaphase-promoting complex/cyclosome (APC/C), delaying the onset of anaphase and ensuring proper chromosome segregation. Owing to the importance of BUB1B gene in mitotic checkpoint a functional analysis using different in silico approaches was undertaken to explore the possible associations between genetic mutations and phenotypic variation. In this work we found that 3 nsSNPs I82N, P334L and R814H have a functional effect on protein function and stability. A literature search revealed that R814H was already implicated in human diseases. Additionally, 2 SNPs in the 5' UTR region was predicted to exhibit a pattern change in the internal ribosome entry site (IRES), and eight MicroRNA binding sites were found to be highly affected due to 3' UTR SNPs. These in silico predictions will provide useful information in selecting the target SNPs that are likely to have functional impact on the BUB1B gene.
Collapse
|
7
|
The transcriptomes of the crucian carp complex (Carassius auratus) provide insights into the distinction between unisexual triploids and sexual diploids. Int J Mol Sci 2014; 15:9386-406. [PMID: 24871367 PMCID: PMC4100101 DOI: 10.3390/ijms15069386] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Revised: 05/15/2014] [Accepted: 05/16/2014] [Indexed: 01/03/2023] Open
Abstract
Both sexual reproduction and unisexual reproduction are adaptive strategies for species survival and evolution. Unisexual animals have originated largely by hybridization, which tends to elevate their heterozygosity. However, the extent of genetic diversity resulting from hybridization and the genomic differences that determine the type of reproduction are poorly understood. In Carassius auratus, sexual diploids and unisexual triploids coexist. These two forms are similar morphologically but differ markedly in their modes of reproduction. Investigation of their genomic differences will be useful to study genome diversity and the development of reproductive mode. We generated transcriptomes for the unisexual and sexual populations. Genes were identified using homology searches and an ab initio method. Estimation of the synonymous substitution rate in the orthologous pairs indicated that the hybridization of gibel carp occurred 2.2 million years ago. Microsatellite genotyping in each individual from the gibel carp population indicated that most gibel carp genes were not tri-allelic. Molecular function and pathway comparisons suggested few gene expansions between them, except for the progesterone-mediated oocyte maturation pathway, which is enriched in gibel carp. Differential expression analysis identified highly expressed genes in gibel carp. The transcriptomes provide information on genetic diversity and genomic differences, which should assist future studies in functional genomics.
Collapse
|
8
|
Rao YS, Chai XW, Wang ZF, Nie QH, Zhang XQ. Impact of GC content on gene expression pattern in chicken. Genet Sel Evol 2013; 45:9. [PMID: 23557030 PMCID: PMC3641017 DOI: 10.1186/1297-9686-45-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2012] [Accepted: 03/16/2013] [Indexed: 11/21/2022] Open
Abstract
Background GC content varies greatly between different genomic regions in many eukaryotes. In order to determine whether this organization named isochore organization influences gene expression patterns, the relationship between GC content and gene expression has been investigated in man and mouse. However, to date, this question is still a matter for debate. Among the avian species, chicken (Gallus gallus) is the best studied representative with a complete genome sequence. The distinctive features and organization of its sequence make it a good model to explore important issues in genome structure and evolution. Methods Only nuclear genes with complete information on protein-coding sequence with no evidence of multiple-splicing forms were included in this study. Chicken protein coding sequences, complete mRNA sequences (or full length cDNA sequences), and 5′ untranslated region sequences (5′ UTR) were downloaded from Ensembl and chicken expression data originated from a previous work. Three indices i.e. expression level, expression breadth and maximum expression level were used to measure the expression pattern of a given gene. CpG islands were identified using hgTables of the UCSC Genome Browser. Correlation analysis between variables was performed by SAS Proprietary Software Release 8.1. Results In chicken, the GC content of 5′ UTR is significantly and positively correlated with expression level, expression breadth, and maximum expression level, whereas that of coding sequences and introns and at the third coding position are negatively correlated with expression level and expression breadth, and not correlated with maximum expression level. These significant trends are independent of recombination rate, chromosome size and gene density. Furthermore, multiple linear regression analysis indicated that GC content in genes could explain approximately 10% of the variation in gene expression. Conclusions GC content is significantly associated with gene expression pattern and could be one of the important regulation factors in the chicken genome.
Collapse
Affiliation(s)
- You Sheng Rao
- Department of Biological Technology, Jiangxi Educational Institute, Jiangxi, Nanchang 330029, China
| | | | | | | | | |
Collapse
|
9
|
Salari R, Kimchi-Sarfaty C, Gottesman MM, Przytycka TM. Sensitive measurement of single-nucleotide polymorphism-induced changes of RNA conformation: application to disease studies. Nucleic Acids Res 2012; 41:44-53. [PMID: 23125360 PMCID: PMC3592397 DOI: 10.1093/nar/gks1009] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Single-nucleotide polymorphisms (SNPs) are often linked to critical phenotypes such as diseases or responses to vaccines, medications and environmental factors. However, the specific molecular mechanisms by which a causal SNP acts is usually not obvious. Changes in RNA secondary structure emerge as a possible explanation necessitating the development of methods to measure the impact of single-nucleotide variation on RNA structure. Despite the recognition of the importance of considering the changes in Boltzmann ensemble of RNA conformers in this context, a formal method to perform directly such comparison was lacking. Here, we solved this problem and designed an efficient method to compute the relative entropy between the Boltzmann ensembles of the native and a mutant structure. On the basis of this theoretical progress, we developed a software tool, remuRNA, and investigated examples of its application. Comparing the impact of common SNPs naturally occurring in populations with the impact of random point mutations, we found that structural changes introduced by common SNPs are smaller than those introduced by random point mutations. This suggests a natural selection against mutations that significantly change RNA structure and demonstrates, surprisingly, that randomly inserted point mutations provide inadequate estimation of random mutations effects. Subsequently, we applied remuRNA to determine which of the disease-associated non-coding SNPs are potentially related to RNA structural changes.
Collapse
Affiliation(s)
- Raheleh Salari
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
10
|
Ritz J, Martin JS, Laederach A. Evaluating our ability to predict the structural disruption of RNA by SNPs. BMC Genomics 2012; 13 Suppl 4:S6. [PMID: 22759654 PMCID: PMC3303743 DOI: 10.1186/1471-2164-13-s4-s6] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The structure of RiboNucleic Acid (RNA) has the potential to be altered by a Single Nucleotide Polymorphism (SNP). Disease-associated SNPs mapping to non-coding regions of the genome that are transcribed into RiboNucleic Acid (RNA) can potentially affect cellular regulation (and cause disease) by altering the structure of the transcript. We performed a large-scale meta-analysis of Selective 2'-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) data, which probes the structure of RNA. We found that several single point mutations exist that significantly disrupt RNA secondary structure in the five transcripts we analyzed. Thus, every RNA that is transcribed has the potential to be a “RiboSNitch;” where a SNP causes a large conformational change that alters regulatory function. Predicting the SNPs that will have the largest effect on RNA structure remains a contemporary computational challenge. We therefore benchmarked the most popular RNA structure prediction algorithms for their ability to identify mutations that maximally affect structure. We also evaluated metrics for rank ordering the extent of the structural change. Although no single algorithm/metric combination dramatically outperformed the others, small differences in AUC (Area Under the Curve) values reveal that certain approaches do provide better agreement with experiment. The experimental data we analyzed nonetheless show that multiple single point mutations exist in all RNA transcripts that significantly disrupt structure in agreement with the predictions.
Collapse
Affiliation(s)
- Justin Ritz
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | | |
Collapse
|
11
|
Wang JT, Li JT, Zhang XF, Sun XW. Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio). BMC Genomics 2012; 13:96. [PMID: 22424280 PMCID: PMC3352309 DOI: 10.1186/1471-2164-13-96] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2011] [Accepted: 03/19/2012] [Indexed: 11/23/2022] Open
Abstract
Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future.
Collapse
Affiliation(s)
- Jin-Tu Wang
- The Centre for Applied Aquatic Genomics, Chinese Academy of Fishery Sciences, Beijing 100141, China
| | | | | | | |
Collapse
|
12
|
Martin JS, Halvorsen M, Davis-Neulander L, Ritz J, Gopinath C, Beauregard A, Laederach A. Structural effects of linkage disequilibrium on the transcriptome. RNA (NEW YORK, N.Y.) 2012; 18:77-87. [PMID: 22109839 PMCID: PMC3261746 DOI: 10.1261/rna.029900.111] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
A majority of SNPs (single nucleotide polymorphisms) map to noncoding and intergenic regions of the genome. Noncoding SNPs are often identified in genome-wide association studies (GWAS) as strongly associated with human disease. Two such disease-associated SNPs in the 5' UTR of the human FTL (Ferritin Light Chain) gene are predicted to alter the ensemble of structures adopted by the mRNA. High-accuracy single nucleotide resolution chemical mapping reveals that these SNPs result in substantial changes in the structural ensemble in agreement with the computational prediction. Furthermore six rescue mutations are correctly predicted to restore the mRNA to its wild-type ensemble. Our data confirm that the FTL 5' UTR is a "RiboSNitch," an RNA that changes structure if a particular disease-associated SNP is present. The structural change observed is analogous to that of a bacterial Riboswitch in that it likely regulates translation. These data further suggest that specific pairs of SNPs in high linkage disequilibrium (LD) will form RNA structure-stabilizing haplotypes (SSHs). We identified 484 SNP pairs that form SSHs in UTRs of the human genome, and in eight of the 10 SSH-containing transcripts, SNP pairs stabilize RNA protein binding sites. The ubiquitous nature of SSHs in the transcriptome suggests that certain haplotypes are conserved to avoid RiboSNitch formation.
Collapse
Affiliation(s)
- Joshua S. Martin
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Matthew Halvorsen
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Lauren Davis-Neulander
- Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York 12208, USA
| | - Justin Ritz
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Chetna Gopinath
- Biomedical Sciences Department, University at Albany, Albany, New York 12208, USA
| | - Arthur Beauregard
- Biomedical Sciences Department, University at Albany, Albany, New York 12208, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
- Corresponding author.E-mail .
| |
Collapse
|
13
|
Halvorsen M, Martin JS, Broadaway S, Laederach A. Disease-associated mutations that alter the RNA structural ensemble. PLoS Genet 2010; 6:e1001074. [PMID: 20808897 PMCID: PMC2924325 DOI: 10.1371/journal.pgen.1001074] [Citation(s) in RCA: 244] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2010] [Accepted: 07/15/2010] [Indexed: 12/28/2022] Open
Abstract
Genome-wide association studies (GWAS) often identify disease-associated mutations in intergenic and non-coding regions of the genome. Given the high percentage of the human genome that is transcribed, we postulate that for some observed associations the disease phenotype is caused by a structural rearrangement in a regulatory region of the RNA transcript. To identify such mutations, we have performed a genome-wide analysis of all known disease-associated Single Nucleotide Polymorphisms (SNPs) from the Human Gene Mutation Database (HGMD) that map to the untranslated regions (UTRs) of a gene. Rather than using minimum free energy approaches (e.g. mFold), we use a partition function calculation that takes into consideration the ensemble of possible RNA conformations for a given sequence. We identified in the human genome disease-associated SNPs that significantly alter the global conformation of the UTR to which they map. For six disease-states (Hyperferritinemia Cataract Syndrome, β-Thalassemia, Cartilage-Hair Hypoplasia, Retinoblastoma, Chronic Obstructive Pulmonary Disease (COPD), and Hypertension), we identified multiple SNPs in UTRs that alter the mRNA structural ensemble of the associated genes. Using a Boltzmann sampling procedure for sub-optimal RNA structures, we are able to characterize and visualize the nature of the conformational changes induced by the disease-associated mutations in the structural ensemble. We observe in several cases (specifically the 5′ UTRs of FTL and RB1) SNP–induced conformational changes analogous to those observed in bacterial regulatory Riboswitches when specific ligands bind. We propose that the UTR and SNP combinations we identify constitute a “RiboSNitch,” that is a regulatory RNA in which a specific SNP has a structural consequence that results in a disease phenotype. Our SNPfold algorithm can help identify RiboSNitches by leveraging GWAS data and an analysis of the mRNA structural ensemble. Genome-wide association studies identify mutations in the human genome that correlate with a particular disease. It is common to find mutations associated with disease in the non-coding region of the genome. These non-coding mutations are more difficult to interpret at a molecular level, because they do not affect the protein sequence. In this study, we analyze disease-associated mutations in non-coding regions of our genome in the context of their structural effect on the message of genetic information in our cells, Ribonucleic Acid (RNA). We focus in particular on the regulatory parts of our genes known as untranslated regions. We find that certain disease-associated mutations in these regulatory untranslated regions have a significant effect on the structure of the RNA message. We call these elements “RiboSNitches,” because they act like switches turning on and off genes, but are caused by Single Nucleotide Polymorphisms (SNPs), which are single point mutations in our genome. The RiboSNitches we identify are potentially a new class of pharmaceutical targets, as it is possible to change the structure of RNA with small drug-like molecules.
Collapse
Affiliation(s)
- Matthew Halvorsen
- Biomedical Sciences Department, University at Albany, Albany, New York, United States of America
| | - Joshua S. Martin
- Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York, United States of America
| | - Sam Broadaway
- Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York, United States of America
| | - Alain Laederach
- Biomedical Sciences Department, University at Albany, Albany, New York, United States of America
- Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York, United States of America
- * E-mail:
| |
Collapse
|
14
|
Wu TY, Hsieh CC, Hong JJ, Chen CY, Tsai YS. IRSS: a web-based tool for automatic layout and analysis of IRES secondary structure prediction and searching system in silico. BMC Bioinformatics 2009; 10:160. [PMID: 19473520 PMCID: PMC2698906 DOI: 10.1186/1471-2105-10-160] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 05/27/2009] [Indexed: 12/21/2022] Open
Abstract
Background Internal ribosomal entry sites (IRESs) provide alternative, cap-independent translation initiation sites in eukaryotic cells. IRES elements are important factors in viral genomes and are also useful tools for bi-cistronic expression vectors. Most existing RNA structure prediction programs are unable to deal with IRES elements. Results We designed an IRES search system, named IRSS, to obtain better results for IRES prediction. RNA secondary structure prediction and comparison software programs were implemented to construct our two-stage strategy for the IRSS. Two software programs formed the backbone of IRSS: the RNAL fold program, used to predict local RNA secondary structures by minimum free energy method; and the RNA Align program, used to compare predicted structures. After complete viral genome database search, the IRSS have low error rate and up to 72.3% sensitivity in appropriated parameters. Conclusion IRSS is freely available at this website . In addition, all source codes, precompiled binaries, examples and documentations are downloadable for local execution. This new search approach for IRES elements will provide a useful research tool on IRES related studies.
Collapse
Affiliation(s)
- Tzong-Yuan Wu
- Department of Bioscience Technology, Chung Yuan Christian University, Chung-Li, Taiwan.
| | | | | | | | | |
Collapse
|
15
|
Yu JY, Chung KH, Deo M, Thompson RC, Turner DL. MicroRNA miR-124 regulates neurite outgrowth during neuronal differentiation. Exp Cell Res 2008; 314:2618-33. [PMID: 18619591 DOI: 10.1016/j.yexcr.2008.06.002] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2008] [Revised: 06/02/2008] [Accepted: 06/03/2008] [Indexed: 12/19/2022]
Abstract
MicroRNAs (miRNAs) are small RNAs with diverse regulatory roles. The miR-124 miRNA is expressed in neurons in the developing and adult nervous system. Here we show that overexpression of miR-124 in differentiating mouse P19 cells promotes neurite outgrowth, while blocking miR-124 function delays neurite outgrowth and decreases acetylated alpha-tubulin. Altered neurite outgrowth also was observed in mouse primary cortical neurons when miR-124 expression was increased, or when miR-124 function was blocked. In uncommitted P19 cells, miR-124 expression led to disruption of actin filaments and stabilization of microtubules. Expression of miR-124 also decreased Cdc42 protein and affected the subcellular localization of Rac1, suggesting that miR-124 may act in part via alterations to members of the Rho GTPase family. Furthermore, constitutively active Cdc42 or Rac1 attenuated neurite outgrowth promoted by miR-124. To obtain a broader perspective, we identified mRNAs downregulated by miR-124 in P19 cells using microarrays. mRNAs for proteins involved in cytoskeletal regulation were enriched among mRNAs downregulated by miR-124. A miR-124 variant with an additional 5' base failed to promote neurite outgrowth and downregulated substantially different mRNAs. These results indicate that miR-124 contributes to the control of neurite outgrowth during neuronal differentiation, possibly by regulation of the cytoskeleton.
Collapse
Affiliation(s)
- Jenn-Yah Yu
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, MI 48109-2200, USA
| | | | | | | | | |
Collapse
|
16
|
Tanney A, Oliver GR, Farztdinov V, Kennedy RD, Mulligan JM, Fulton CE, Farragher SM, Field JK, Johnston PG, Harkin DP, Proutski V, Mulligan KA. Generation of a non-small cell lung cancer transcriptome microarray. BMC Med Genomics 2008; 1:20. [PMID: 18513400 PMCID: PMC2426710 DOI: 10.1186/1755-8794-1-20] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2007] [Accepted: 05/30/2008] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Non-small cell lung cancer (NSCLC) is the leading cause of cancer mortality worldwide. At present no reliable biomarkers are available to guide the management of this condition. Microarray technology may allow appropriate biomarkers to be identified but present platforms are lacking disease focus and are thus likely to miss potentially vital information contained in patient tissue samples. METHODS A combination of large-scale in-house sequencing, gene expression profiling and public sequence and gene expression data mining were used to characterise the transcriptome of NSCLC and the data used to generate a disease-focused microarray - the Lung Cancer DSA research tool. RESULTS Built on the Affymetrix GeneChip platform, the Lung Cancer DSA research tool allows for interrogation of ~60,000 transcripts relevant to Lung Cancer, tens of thousands of which are unavailable on leading commercial microarrays. CONCLUSION We have developed the first high-density disease specific transcriptome microarray. We present the array design process and the results of experiments carried out to demonstrate the array's utility. This approach serves as a template for the development of other disease transcriptome microarrays, including non-neoplastic diseases.
Collapse
Affiliation(s)
- Austin Tanney
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Gavin R Oliver
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Vadim Farztdinov
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Richard D Kennedy
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Jude M Mulligan
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Ciaran E Fulton
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Susan M Farragher
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - John K Field
- Roy Castle Lung Cancer Research Programme, The University of Liverpool Cancer Research Centre, 200 London Road, Liverpool, L3 9TA, UK
| | - Patrick G Johnston
- Centre for Cancer Research and Cell Biology, Queen's University of Belfast, 97 Lisburn Road, Belfast, BT9 7BL, UK
| | - D Paul Harkin
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Vitali Proutski
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Karl A Mulligan
- Almac Diagnostics Ltd, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| |
Collapse
|
17
|
Abstract
In recent years, genome-wide detection of alternative splicing based on Expressed Sequence Tag (EST) sequence alignments with mRNA and genomic sequences has dramatically expanded our understanding of the role of alternative splicing in functional regulation. This chapter reviews the data, methodology, and technical challenges of these genome-wide analyses of alternative splicing, and briefly surveys some of the uses to which such alternative splicing databases have been put. For example, with proper alternative splicing database schema design, it is possible to query genome-wide for alternative splicing patterns that are specific to particular tissues, disease states (e.g., cancer), gender, or developmental stages. EST alignments can be used to estimate exon inclusion or exclusion level of alternatively spliced exons and evolutionary changes for various species can be inferred from exon inclusion level. Such databases can also help automate design of probes for RT-PCR and microarrays, enabling high throughput experimental measurement of alternative splicing.
Collapse
|
18
|
Gow JM, Chinn LW, Kroetz DL. The effects of ABCB1 3'-untranslated region variants on mRNA stability. Drug Metab Dispos 2007; 36:10-5. [PMID: 17940136 DOI: 10.1124/dmd.107.017087] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Genetic variation in ABCB1, encoding P-glycoprotein (P-gp), is a potential cause of interindividual variation in drug response. Numerous studies have focused on the effects of coding region variants on P-gp expression and function, whereas few noncoding region variants have been investigated. The 3'-untranslated region (UTR) regulates mRNA levels or stability via RNA-protein interactions with mRNA degradation machinery. mRNA stability is a key regulatory step controlling ABCB1 mRNA expression that ultimately affects P-gp levels and function. We hypothesized that ABCB1 3'-UTR polymorphisms alter mRNA stability by disrupting RNA-protein interactions. An ethnically diverse panel of DNA samples was sequenced to identify 3'-UTR polymorphisms and determine allele frequencies. The three most common variants, along with reference ABCB1, were stably expressed in cells in order to measure mRNA half-life. The calculated half-life for ABCB1 reference in HEK293 cells was 9.4 +/- 1.3 h and was similar to that estimated for the 3'-UTR variants. Endogenous ABCB1 mRNA decay was similar in lymphoblastoid cell lines carrying 3'-UTR variant and reference alleles. Although the examined ABCB1 3'-UTR variants have no effect on ABCB1 mRNA stability, these data represent one of the first attempts to determine the influence of genetic variation in UTRs on ABCB1 mRNA levels.
Collapse
Affiliation(s)
- Jason M Gow
- The Department of Biopharmaceutical Sciences, University of California, San Francisco, California 94158-2911, USA
| | | | | |
Collapse
|
19
|
Koscianska E, Baev V, Skreka K, Oikonomaki K, Rusinov V, Tabler M, Kalantidis K. Prediction and preliminary validation of oncogene regulation by miRNAs. BMC Mol Biol 2007; 8:79. [PMID: 17877811 PMCID: PMC2096627 DOI: 10.1186/1471-2199-8-79] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2007] [Accepted: 09/18/2007] [Indexed: 12/19/2022] Open
Abstract
Background MicroRNAs (miRNAs) are one of the most abundant groups of regulatory genes in multicellular organisms, playing important roles in many fundamental cellular processes. More than four hundred miRNAs have been identified in humans and the deregulation of miRNA expression has been also shown in many cancers. Despite the postulated involvement of miRNAs in tumourigenesis, there are only a few examples where an oncogene or a tumour suppressor has been identified as a miRNA target. Results Here, we present an in silico analysis of potential miRNA- oncogene interactions. Moreover, we have tested the validity of two possible interactions of miRNAs with genes related to cancer. We present evidence for the down-regulation of c-MYC, one of the most potent and frequently deregulated oncogenes, by let-7 miRNA, via the predicted binding site in the 3'UTR, and verify the suppression of BCL-2 by miR16. Conclusion In this work both bioinformatic and experimental approaches for the prediction and validation of possible targets for miRNAs have been used. A list of putative targets for different oncomirs, validation of which would be of special interest, is proposed and two such interactions have been experimentally validated.
Collapse
MESH Headings
- 3' Untranslated Regions/genetics
- Animals
- Base Sequence
- Cloning, Molecular/methods
- Computational Biology/methods
- Databases, Genetic
- Gene Expression Regulation, Neoplastic/genetics
- Gene Targeting/methods
- Genes, Tumor Suppressor/drug effects
- Genes, bcl-2/drug effects
- Genes, bcl-2/genetics
- Genes, myc/drug effects
- Genes, myc/genetics
- Humans
- MicroRNAs/genetics
- MicroRNAs/pharmacology
- Models, Genetic
- Molecular Sequence Data
- Oncogenes/drug effects
- Oncogenes/genetics
- Predictive Value of Tests
- RNA Interference
- Sequence Analysis, RNA/methods
Collapse
Affiliation(s)
- Edyta Koscianska
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
- Laboratory of Cancer Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Vesselin Baev
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
- Department of Plant Physiology and Molecular Biology, University of Plovdiv 24, Tsar Assen St, 4000 Plovdiv, Bulgaria
| | - Konstantinia Skreka
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
- Department of Biology, University of Crete, Heraklion, Greece
| | - Katerina Oikonomaki
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
- Department of Biology, University of Crete, Heraklion, Greece
| | - Ventsislav Rusinov
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
- Department of Plant Physiology and Molecular Biology, University of Plovdiv 24, Tsar Assen St, 4000 Plovdiv, Bulgaria
| | - Martin Tabler
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
| | - Kriton Kalantidis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology, Hellas, PO Box 1385, GR-71110, Heraklion/Crete, Greece
| |
Collapse
|
20
|
Cvijović M, Dalevi D, Bilsland E, Kemp GJL, Sunnerhagen P. Identification of putative regulatory upstream ORFs in the yeast genome using heuristics and evolutionary conservation. BMC Bioinformatics 2007; 8:295. [PMID: 17686169 PMCID: PMC1964767 DOI: 10.1186/1471-2105-8-295] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2007] [Accepted: 08/08/2007] [Indexed: 11/30/2022] Open
Abstract
Background The translational efficiency of an mRNA can be modulated by upstream open reading frames (uORFs) present in certain genes. A uORF can attenuate translation of the main ORF by interfering with translational reinitiation at the main start codon. uORFs also occur by chance in the genome, in which case they do not have a regulatory role. Since the sequence determinants for functional uORFs are not understood, it is difficult to discriminate functional from spurious uORFs by sequence analysis. Results We have used comparative genomics to identify novel uORFs in yeast with a high likelihood of having a translational regulatory role. We examined uORFs, previously shown to play a role in regulation of translation in Saccharomyces cerevisiae, for evolutionary conservation within seven Saccharomyces species. Inspection of the set of conserved uORFs yielded the following three characteristics useful for discrimination of functional from spurious uORFs: a length between 4 and 6 codons, a distance from the start of the main ORF between 50 and 150 nucleotides, and finally a lack of overlap with, and clear separation from, neighbouring uORFs. These derived rules are inherently associated with uORFs with properties similar to the GCN4 locus, and may not detect most uORFs of other types. uORFs with high scores based on these rules showed a much higher evolutionary conservation than randomly selected uORFs. In a genome-wide scan in S. cerevisiae, we found 34 conserved uORFs from 32 genes that we predict to be functional; subsequent analysis showed the majority of these to be located within transcripts. A total of 252 genes were found containing conserved uORFs with properties indicative of a functional role; all but 7 are novel. Functional content analysis of this set identified an overrepresentation of genes involved in transcriptional control and development. Conclusion Evolutionary conservation of uORFs in yeasts can be traced up to 100 million years of separation. The conserved uORFs have certain characteristics with respect to length, distance from each other and from the main start codon, and folding energy of the sequence. These newly found characteristics can be used to facilitate detection of other conserved uORFs.
Collapse
Affiliation(s)
- Marija Cvijović
- Department of Cell and Molecular Biology, Lundberg Laboratory, Göteborg University, PO Box 462 SE-405 30 Göteborg, Sweden
- Max-Planck Institute for Molecular Genetics, Ihnestraße 63, D-14195 Berlin, Germany
| | - Daniel Dalevi
- Department of Computer Science and Engineering, Chalmers University of Technology, SE-412 96 Göteborg, Sweden
| | - Elizabeth Bilsland
- Department of Cell and Molecular Biology, Lundberg Laboratory, Göteborg University, PO Box 462 SE-405 30 Göteborg, Sweden
- Biochemistry Department, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | - Graham JL Kemp
- Department of Computer Science and Engineering, Chalmers University of Technology, SE-412 96 Göteborg, Sweden
| | - Per Sunnerhagen
- Department of Cell and Molecular Biology, Lundberg Laboratory, Göteborg University, PO Box 462 SE-405 30 Göteborg, Sweden
| |
Collapse
|
21
|
Jiang C, Han L, Su B, Li WH, Zhao Z. Features and Trend of Loss of Promoter-Associated CpG Islands in the Human and Mouse Genomes. Mol Biol Evol 2007; 24:1991-2000. [PMID: 17591602 DOI: 10.1093/molbev/msm128] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
CpG islands (CGIs) are often considered as gene markers, but the number of CGIs varies among mammalian genomes that have similar numbers of genes. In this study, we investigated the distribution of CGIs in the promoter regions of 3,197 human-mouse orthologous gene pairs and found that the mouse genome has notably fewer CGIs in the promoter regions and less pronounced CGI characteristics than does the human genome. We further inferred CGI's ancestral state using the dog genome as a reference and examined the nucleotide substitution pattern and the mutational direction in the conserved regions of human and mouse CGIs. The results reveal many losses of CGIs in both genomes but the loss rate in the mouse lineage is two to four times the rate in the human lineage. We found an intriguing feature of CGI loss, namely that the loss of a CGI usually starts from erosion at the both edges and gradually moves towards the center. We found functional bias in the genes that have lost promoter-associated CGIs in the human or mouse lineage. Finally, our analysis indicates that the association of CGIs with housekeeping genes is not as strong as previously estimated. Our study provides a detailed view of the evolution of promoter-associated CGIs in the human and mouse genomes and our findings are helpful for understanding the evolution of mammalian genomes and the role of CGIs in gene function.
Collapse
Affiliation(s)
- Cizhong Jiang
- Department of Psychiatry and Center for the Study of Biological Complexity, Virginia Commonwealth, USA
| | | | | | | | | |
Collapse
|
22
|
Corà D, Di Cunto F, Provero P, Silengo L, Caselle M. Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrepresented upstream motifs. BMC Bioinformatics 2004; 5:57. [PMID: 15137914 PMCID: PMC449910 DOI: 10.1186/1471-2105-5-57] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2003] [Accepted: 05/11/2004] [Indexed: 11/30/2022] Open
Abstract
Background Transcriptional regulation is a key mechanism in the functioning of the cell, and is mostly effected through transcription factors binding to specific recognition motifs located upstream of the coding region of the regulated gene. The computational identification of such motifs is made easier by the fact that they often appear several times in the upstream region of the regulated genes, so that the number of occurrences of relevant motifs is often significantly larger than expected by pure chance. Results To exploit this fact, we construct sets of genes characterized by the statistical overrepresentation of a certain motif in their upstream regions. Then we study the functional characterization of these sets by analyzing their annotation to Gene Ontology terms. For the sets showing a statistically significant specific functional characterization, we conjecture that the upstream motif characterizing the set is a binding site for a transcription factor involved in the regulation of the genes in the set. Conclusions The method we propose is able to identify many known binding sites in S. cerevisiae and new candidate targets of regulation by known transcritpion factors. Its application to less well studied organisms is likely to be valuable in the exploration of their regulatory interaction network.
Collapse
Affiliation(s)
- Davide Corà
- Dipartimento di Fisica Teorica, Università di Torino, and INFN, sezione di Torino, Italy
| | - Ferdinando Di Cunto
- Dipartimento di Genetica, Biologia e Biochimica, Università di Torino, Torino, Italy
| | | | - Lorenzo Silengo
- Dipartimento di Genetica, Biologia e Biochimica, Università di Torino, Torino, Italy
| | - Michele Caselle
- Dipartimento di Fisica Teorica, Università di Torino, and INFN, sezione di Torino, Italy
| |
Collapse
|
23
|
Scheetz TE, Laffin JJ, Berger B, Holte S, Baumes SA, Brown R, Chang S, Coco J, Conklin J, Crouch K, Donohue M, Doonan G, Estes C, Eyestone M, Fishler K, Gardiner J, Guo L, Johnson B, Keppel C, Kreger R, Lebeck M, Marcelino R, Miljkovich V, Perdue M, Qui L, Rehmann J, Reiter RS, Rhoads B, Schaefer K, Smith C, Sunjevaric I, Trout K, Wu N, Birkett CL, Bischof J, Gackle B, Gavin A, Grundstad AJ, Mokrzycki B, Moressi C, O'Leary B, Pedretti K, Roberts C, Robinson NL, Smith M, Tack D, Trivedi N, Kucaba T, Freeman T, Lin JJC, Bonaldo MF, Casavant TL, Sheffield VC, Soares MB. High-throughput gene discovery in the rat. Genome Res 2004; 14:733-41. [PMID: 15060017 PMCID: PMC383320 DOI: 10.1101/gr.1414204] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The rat is an important animal model for human diseases and is widely used in physiology. In this article we present a new strategy for gene discovery based on the production of ESTs from serially subtracted and normalized cDNA libraries, and we describe its application for the development of a comprehensive nonredundant collection of rat ESTs. Our new strategy appears to yield substantially more EST clusters per ESTs sequenced than do previous approaches that did not use serial subtraction. However, multiple rounds of library subtraction resulted in high frequencies of otherwise rare internally primed cDNAs, defining the limits of this powerful approach. To date, we have generated >200,000 3' ESTs from >100 cDNA libraries representing a wide range of tissues and developmental stages of the laboratory rat. Most importantly, we have contributed to approximately 50,000 rat UniGene clusters. We have identified, arrayed, and derived 5' ESTs from >30,000 unique rat cDNA clones. Complete information, including radiation hybrid mapping data, is also maintained locally at http://genome.uiowa.edu/clcg.html. All of the sequences described in this article have been submitted to the dbEST division of the NCBI.
Collapse
Affiliation(s)
- Todd E Scheetz
- Center for Bioinformatics and Computational Biology, The University of Iowa, Iowa City, Iowa 52242, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Vilela C, McCarthy JEG. Regulation of fungal gene expression via short open reading frames in the mRNA 5'untranslated region. Mol Microbiol 2003; 49:859-67. [PMID: 12890013 DOI: 10.1046/j.1365-2958.2003.03622.x] [Citation(s) in RCA: 113] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We review how the expression of fungal mRNAs can be controlled by ribosome interactions with short upstream open reading frames (uORFs) within the 5'untranslated region. The efficiency of uAUG recognition modulates the impact of a uORF but steps during and after translation of the uORF also influence uORF function. The post-termination behaviour of ribosomes, therefore, plays a major role in determining the expression level of these main ORFs. Translation of a uORF can produce a cis-acting peptide that causes effector molecule-dependent stalling of the ribosomes at the end of the uORF. In other cases it is the length or position, or other features of the uORF, rather than the peptide it encodes, that determine the efficiency with which ribosomes reinitiate translation downstream of it. Whether the form of the ribosome that resumes scanning after termination is the 40S subunit alone or the entire 80S ribosome is not known. Translation of the uORF can also control gene expression by affecting the stability of the mRNA. Finally, trans-acting factors may participate in the regulatory mechanisms. Future work will need not only to provide more information on the mechanisms underlying the known cases of uORF-mediated control but also to define the full complement of uORF-containing mRNAs in at least one fungal organism.
Collapse
Affiliation(s)
- Cristina Vilela
- Posttranscriptional Control Group, Department of Biomolecular Sciences, UMIST, PO Box 88, Manchester M60 1QD, UK
| | | |
Collapse
|
25
|
Rutter JL, Goldstein AM, Dávila MR, Tucker MA, Struewing JP. CDKN2A point mutations D153spl(c.457G>T) and IVS2+1G>T result in aberrant splice products affecting both p16INK4a and p14ARF. Oncogene 2003; 22:4444-8. [PMID: 12853981 DOI: 10.1038/sj.onc.1206564] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The CDKN2A gene, which encodes the proteins p16(INK4a) and p14(ARF), is located on chromosome 9p21. Germline mutations at this locus increase susceptibility to cutaneous malignant melanoma (CMM). In general, missense and nonsense mutations are primarily responsible for defective p16(INK4a) and possibly p14(ARF) protein function and account for approximately 20% of inherited CMM cases. We report a G>T transversion mutation in the last nucleotide of exon 2, affecting the aspartic acid residue at position 153 of CDKN2A-p16(INK4a) in a proband with melanoma. If splicing were unaffected, this mutation would change Asp to Tyr. RT-PCR analysis, however, revealed that this mutation, which we have termed D153spl(c.457G>T), and a previously described mutation at the next nucleotide, IVS2+1G>T, result in identical aberrant splicing affecting both p16(INK4a) and p14(ARF). The two main alternate splice products for each of the two normal transcripts includes a 74 bp deletion in exon 2, revealing a cryptic splice site, and the complete skipping of exon 2. The dual inactivation of p16(INK4a) and p14(ARF) may contribute to the CMM in these families.
Collapse
Affiliation(s)
- Joni L Rutter
- Laboratory of Population Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, DHHS, Bethesda, MD, USA
| | | | | | | | | |
Collapse
|
26
|
Procissi A, Piazza P, Tonelli C. A maize r1 gene is regulated post-transcriptionally by differential splicing of its leader. PLANT MOLECULAR BIOLOGY 2002; 49:239-248. [PMID: 11999378 DOI: 10.1023/a:1014959230492] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Anthocyanin biosynthesis in Zea mays is controlled by regulatory genes of the r1/b1 family that encode bHLH transcription factors. Analysis of the 381 nucleotide leader sequence of a member of this family, Sn, discloses the presence of five ATG triplets upstream of the coding region and three upstream open reading frames (uORFs) of 38, 15 and 13 amino acids respectively. RT-PCR studies revealed that a splicing event occurs in the leader region in the different tissues tested. Splicing deletes 146 nucleotides which include uORF2 and uORF3. By trans-activation experiments in maize protoplasts we find that the spliced leader, compared to the non-spliced one, reduces the number of pigmented protoplasts by four-fold. We suggest a multilevel regulation of the Sn transcription factor acting not only at the transcriptional but also at the post-transcriptional level.
Collapse
Affiliation(s)
- A Procissi
- Dipartimento di Genetica e di Biologia dei Microorganismi, Università degli Studi di Milano, Italy
| | | | | |
Collapse
|
27
|
Gregorio-King CC, McLeod JL, Collier FM, Collier GR, Bolton KA, Van Der Meer GJ, Apostolopoulos J, Kirkland MA. MERP1: a mammalian ependymin-related protein gene differentially expressed in hematopoietic cells. Gene 2002; 286:249-57. [PMID: 11943480 DOI: 10.1016/s0378-1119(02)00434-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
We have utilized differential display polymerase chain reaction to investigate the gene expression of hematopoietic progenitor cells from adult bone marrow and umbilical cord blood. A differentially expressed gene was identified in CD34+ hematopoietic progenitor cells, with low expression in CD34- cells. We have obtained the full coding sequence of this gene which we designated human mammalian ependymin-related protein 1 (MERP1). Expression of MERP1 was found in a variety of normal human tissues, and is 4- and 10-fold higher in adult bone marrow and umbilical cord blood CD34+ cells, respectively, compared to CD34- cells. Additionally, MERP1 expression in a hematopoietic stem cell enriched population was down-regulated with proliferation and differentiation. Conceptual translation of the MERP1 open reading frame reveals significant homology to two families of glycoprotein calcium-dependant cell adhesion molecules: ependymins and protocadherins.
Collapse
Affiliation(s)
- Claudia C Gregorio-King
- Stem Cell Laboratory, The Douglas Hocking Research Institute, Barwon Health, The Geelong Hospital, Geelong, VIC, 3220, Australia.
| | | | | | | | | | | | | | | |
Collapse
|
28
|
Landry JR, Medstrand P, Mager DL. Repetitive elements in the 5' untranslated region of a human zinc-finger gene modulate transcription and translation efficiency. Genomics 2001; 76:110-6. [PMID: 11549323 DOI: 10.1006/geno.2001.6604] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A substantial proportion of the human genome consists of repetitive sequences. Although most of these sequences are nonessential for the organism, retroelements, such as Alu sequences, L1s, and HERVs (human endogenous retroviruses), have recently been implicated in the regulation of various genes. Our laboratory previously identified a novel, alternatively spliced zinc-finger gene, ZNF177, which incorporates Alu L1, and HERV segments into the 5' untranslated region (UTR) of transcripts. In this study, we investigated the genomic structure and functional significance of the repetitive sequences in the 5' UTR of ZNF177 mRNAs. Using luciferase and GFP reporter constructs, we assessed the effect of the HERV, Alu, and L1 sequences on gene expression levels. Our results indicate that the presence of the retroelement sequences, particularly the Alu and L1 segments which form one 5' UTR exon, modifies the expression level of both reporter genes. We present evidence that the Alu and L1 sequences alter both the RNA and protein levels of reporter genes by increasing transcription efficiency while decreasing translation efficiency. Our findings indicate that the Alu and L1 repeats in the 5' UTR of ZNF177 exert a positive transcriptional enhancer effect, but repress translation of the zinc finger gene. In addition, our analysis of a 5' UTR database suggests that 4% of human 5' UTRs harbor Alu sequences, indicating that the expression of many genes might be influenced by Alu repeats. These results illustrate the complex regulatory effects that retroelements can have on human gene expression.
Collapse
Affiliation(s)
- J R Landry
- Terry Fox Laboratory, British Columbia Cancer Agency and Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | |
Collapse
|
29
|
Shibata K, Abe S, Davies E. Structure of the coding region and mRNA variants of the apyrase gene from pea (Pisum sativum). ACTA PHYSIOLOGIAE PLANTARUM 2001; 23:3-13. [PMID: 12238526 DOI: 10.1007/s11738-001-0016-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Partial amino acid sequences of a 49 kDa apyrase (ATP diphosphohydrolase, EC 3.6.1.5) from the cytoskeletal fraction of etiolated pea stems were used to derive oligonucleotide DNA primers to generate a cDNA fragment of pea apyrase mRNA by RT-PCR and these primers were used to screen a pea stem cDNA library. Two almost identical cDNAs differing in just 6 nucleotides within the coding regions were found, and these cDNA sequences were used to clone genomic fragments by PCR. Two nearly identical gene fragments containing 8 exons and 7 introns were obtained. One of them (H-type) encoded the mRNA sequence described by Hsieh et al. (1996) (DDBJ/EMBL/GenBank Z32743), while the other (S-type) differed by the same 6 nucleotides as the mRNAs, suggesting that these genes may be alleles. The six nucleotide differences between these two alleles were found solely in the first exon, and these mutation sites had two types of consensus sequences. These mRNAs were found with varying lengths of 3' untranslated regions (3'-UTR). There are some similarities between the 3'-UTR of these mRNAs and those of actin and actin binding proteins in plants. The putative roles of the 3'-UTR and alternative polyadenylation sites are discussed in relation to their possible role in targeting the mRNAs to different subcellular compartments.
Collapse
Affiliation(s)
- K Shibata
- Laboratory of Molecular Cell Biology, Department of Biological Resources, Faculty of Agriculture, Ehime University, Matsuyama, Japan
| | | | | |
Collapse
|
30
|
David RB, Blom AK, Sjaastad OV, Harbitz I. The porcine erythropoietin gene: cDNA sequence, genomic sequence and expression analyses in piglets. Domest Anim Endocrinol 2001; 20:137-47. [PMID: 11311851 DOI: 10.1016/s0739-7240(01)00089-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The porcine erythropoietin (EPO) gene and its cDNA have been cloned and characterized. The cDNA encodes a protein of 194 amino acids. The gene structure and sequence show a high degree of homology to the corresponding human and murine gene. Steroid hormone receptor binding sites are present both in the promoter and in the 3' flanking region of the gene, which also contains an oxygen-sensing sequence. The promoter lacks classical promoter elements such as TATA and CAAT boxes. Expression analyses using a competitive RT-PCR assay showed that the kidneys contain about ten times more erythropoietin mRNA than the liver in five-week-old piglets, thus indicating that the shift from mainly hepatic to mainly renal EPO production has taken place at this age. The testes showed a higher ratio of EPO mRNA to total RNA than the liver. Spleen showed very low levels of expression, while no expression of erythropoietin mRNA was detected in brain tissue, bone marrow, lung, lymph nodes, and ovaries.
Collapse
Affiliation(s)
- R B David
- Department of Biochemistry, Physiology and Nutrition, The Norwegian School of Veterinary Science, P.O. Box 8146 Dep., N-0033 1, Oslo, Norway
| | | | | | | |
Collapse
|
31
|
Scherf M, Klingenhoff A, Werner T. Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: a novel context analysis approach. J Mol Biol 2000; 297:599-606. [PMID: 10731414 DOI: 10.1006/jmbi.2000.3589] [Citation(s) in RCA: 206] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We present a new algorithm called PromoterInspector to locate eukaryotic polymase II promoter regions in large genomic sequences with a high degree of specificity. PromoterInspector focuses on the genetic context of promoters, rather than their exact location. Application of PromoterInspector can serve as a crucial pre-processing step for other methods to locate exactly, or to analyze promoters. PromoterInspector does not depend on heuristics, because it is purely based on libraries of IUPAC words extracted from training sequences by an unsupervised learning approach. We compared PromoterInspector to in silico promoter prediction tools using the sequences from the review by J.W. Fickett. PromoterInspector compared favourably on Fickett's evaluation scheme. A true positive to false positive ratio of 2.3 was obtained, surpassing the best ratio of 0.6, reported for TSSG. The application of our method to several large genomic sequences of over 1.3 million base-pairs in total resulted in even more specific predictions. The coverage of annotated promoters was comparable to other in silico promoter prediction methods, while the true positive predictions increased by up to 100% of total matches. PromoterInspector scans 100 kb in less than one minute on a workstation, and thus is especially applicable for large genome analysis. The method is available at http://genomatix.gsf. de/cgi-bin/promoterinspector/promoterinspector.pl.
Collapse
Affiliation(s)
- M Scherf
- Institute of Mammalian Genetics, GSF-National Research Center for Environment and Health, Ingolstädter Landstrasse 1, Neuherberg, D-85758, Germany
| | | | | |
Collapse
|
32
|
Pesole G, Gissi C, Catalano D, Grillo G, Licciulli F, Liuni S, Attimonelli M, Saccone C. MitoNuc and MitoAln: two related databases of nuclear genes coding for mitochondrial proteins. Nucleic Acids Res 2000; 28:163-5. [PMID: 10592211 PMCID: PMC102385 DOI: 10.1093/nar/28.1.163] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/1999] [Revised: 09/03/1999] [Accepted: 09/16/1999] [Indexed: 11/14/2022] Open
Abstract
Mitochondria, besides their central role in energy metabolism, have recently been found to be involved in a number of basic processes of cell life and to contribute to the pathogenesis of many degenerative diseases. All functions of mitochondria depend on the interaction of nuclear and organellar genomes. Mitochondrial genomes have been extensively sequenced and analysed and the data collected in several specialised databases. In order to collect information on nuclear coded mitochondrial proteins we developed MitoNuc and MitoAln, two related databases containing, respectively, detailed information on sequenced nuclear genes coding for mitochondrial proteins in Metazoa and yeast, and the multiple alignments of the relevant homologous protein coding regions. MitoNuc and MitoAln retrieval through SRS at http://bio-www.ba.cnr.it:8000/srs6/ can easily allow the extraction of sequence data, subsequences defined by specific features and nucleotide or amino acid multiple alignments.
Collapse
Affiliation(s)
- G Pesole
- Dipartimento di Fisiologia e Biochimica Generali, Università di Milano, via Celoria 26, 20122 Milano, Italy.
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Jacobs GH, Stockwell PA, Schrieber MJ, Tate WP, Brown CM. Transterm: a database of messenger RNA components and signals. Nucleic Acids Res 2000; 28:293-5. [PMID: 10592251 PMCID: PMC102492 DOI: 10.1093/nar/28.1.293] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/1999] [Accepted: 10/11/1999] [Indexed: 11/14/2022] Open
Abstract
Transterm facilitates studies of messenger RNAs and translational control signals. Each messenger RNA (mRNA) from GenBank is extracted and broken into its functional components, its coding sequence, initiation context, termination context, flanking sequence representing its 5' UTR (untranslated region), 3' UTR and translational signals. In addition, numerical parameters characterising each coding region in Transterm, including codon and GC bias, are available. For each species in Transterm, the initiation and termination regions are aligned by their start or stop codons and presented as base frequency matrices and tables of the information content of the bases in the alignments. Users can obtain summaries of characteristics of the mRNAs for species of their choice and search for translational signals both in the Transterm database and in their own sequence. The current release contains data from over 10 000 species, including the complete genomes of 20 prokaryotes and three eukaryotes. Both flat-file and relational database forms of Transterm are accessible via the WWW at http://biochem.otago.ac.nz/Transterm/
Collapse
Affiliation(s)
- G H Jacobs
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | | | | | | |
Collapse
|
34
|
Lescure A, Gautheret D, Carbon P, Krol A. Novel selenoproteins identified in silico and in vivo by using a conserved RNA structural motif. J Biol Chem 1999; 274:38147-54. [PMID: 10608886 DOI: 10.1074/jbc.274.53.38147] [Citation(s) in RCA: 213] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Selenocysteine is incorporated into selenoproteins by an in-frame UGA codon whose readthrough requires the selenocysteine insertion sequence (SECIS), a conserved hairpin in the 3'-untranslated region of eukaryotic selenoprotein mRNAs. To identify new selenoproteins, we developed a strategy that obviates the need for prior amino acid sequence information. A computational screen was used to scan nucleotide sequence data bases for sequences presenting a potential SECIS secondary structure. The computer-selected hairpins were then assayed in vivo for their functional capacities, and the cDNAs corresponding to the SECIS winners were identified. Four of them encoded novel selenoproteins as confirmed by in vivo experiments. Among these, SelZf1 and SelZf2 share a common domain with mitochondrial thioredoxin reductase-2. The three proteins, however, possess distinct N-terminal domains. We found that another protein, SelX, displays sequence similarity to a protein involved in bacterial pilus formation. For the first time, four novel selenoproteins were discovered based on a computational screen for the RNA hairpin directing selenocysteine incorporation.
Collapse
Affiliation(s)
- A Lescure
- Unité Propre de Recherche CNRS 9002, Structure des Macromolécules Biologiques et Mécanismes de Reconnaissance, Institut de Biologie Moleculaire et Cellulaire, 15, Rue René Descartes, 67084 Strasbourg Cedex, France
| | | | | | | |
Collapse
|
35
|
Abstract
The efficiency of AUG start codon recognition in translation initiation is modulated by its sequence context. Here we investigated a non-redundant set of 5914 human genes and show that this context is different in genes located in different isochores. In particular, of the two main consensus start sequences, RCCaugR is five-fold more represented than AARaugR in genes from the GC-rich H3 isochores compared to genes from the GC-poor L isochores. Furthermore, genes located in GC-rich isochores have shorter 5' UTRs and stronger avoidance of upstream AUG than genes located in GC-poor isochores. This suggests that genes requiring highly efficient translation are located in GC-rich isochores and genes requiring fine modulation of expression are located in GC-poor isochores. This is in agreement with independent data from the literature concerning the location of housekeeping and tissue-specific genes, respectively.
Collapse
Affiliation(s)
- G Pesole
- Dipartimento di Fisiologia e Biochimica Generali, Università di Milano, via Celoria 26, 20133, Milan, Italy.
| | | | | |
Collapse
|
36
|
Pesole G, Liuni S. Internet resources for the functional analysis of 5' and 3' untranslated regions of eukaryotic mRNAs. Trends Genet 1999; 15:378. [PMID: 10461207 DOI: 10.1016/s0168-9525(99)01795-3] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- G Pesole
- Dipartimento di Fisiologia e Biochimica Generali, Università di Milano, Via Celoria, 26, 20133 Milano, Italy.
| | | |
Collapse
|