1001
|
Toyoda T, Shinozaki K. Tiling array-driven elucidation of transcriptional structures based on maximum-likelihood and Markov models. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2005; 43:611-21. [PMID: 16098113 DOI: 10.1111/j.1365-313x.2005.02470.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Tiling arrays of high-density oligonucleotide probes spanning the entire genome are powerful tools for the discovery of new genes. However, it is difficult to determine the structure of the spliced product of a structurally unknown gene from noisy array signals only. Here we introduce a statistical method that estimates the precise splicing points and the exon/intron structure of a structurally unknown gene by maximizing the odds or the ratio of posterior probabilities of the structure under the observation of array signal intensities and nucleic acid sequences. Our method more accurately predicted the gene structures than the simple threshold-based method, and more correctly estimated the expression values of structurally unknown genes than the window-based method. It was observed that the Markov model contributed to the precision of splice points, and that the statistical significance of expression (P-value) represented the reliability of the estimated gene structure and expression value well. We have implemented the method as a program ARTADE (ARabidopsis Tiling Array-based Detection of Exons) and applied it to the Arabidopsis thaliana whole-genome array data analysis. The database of the predicted results and the ARTADE program are available at http://omicspace.riken.jp/ARTADE/.
Collapse
Affiliation(s)
- Tetsuro Toyoda
- Phenome Informatics Team, Functional Genomics Research Group, Genomic Sciences Center, Japan
| | | |
Collapse
|
1002
|
Xuan Z, Zhao F, Wang J, Chen G, Zhang MQ. Genome-wide promoter extraction and analysis in human, mouse, and rat. Genome Biol 2005; 6:R72. [PMID: 16086854 PMCID: PMC1273639 DOI: 10.1186/gb-2005-6-8-r72] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2005] [Revised: 05/23/2005] [Accepted: 07/11/2005] [Indexed: 01/27/2023] Open
Abstract
Large-scale and high-throughput genomics research needs reliable and comprehensive genome-wide promoter annotation resources. We have conducted a systematic investigation on how to improve mammalian promoter prediction by incorporating both transcript and conservation information. This enabled us to build a better multispecies promoter annotation pipeline and hence to create CSHLmpd (Cold Spring Harbor Laboratory Mammalian Promoter Database) for the biomedical research community, which can act as a starting reference system for more refined functional annotations.
Collapse
Affiliation(s)
- Zhenyu Xuan
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Fang Zhao
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Jinhua Wang
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Gengxin Chen
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Michael Q Zhang
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| |
Collapse
|
1003
|
Abstract
DNA microarray technology revolutionized gene-expression analysis in molecular biology to observe patterns of gene expression in genomic scale. We review the biological aspects of genome-wide gene-expression activity in tumors specially focusing on the analysis of enzyme coding genes. First, the methods for analyzing gene-expression data for the study of metabolome in silico are discussed showing SV40T antigen expressing liver tumor data as an example. Next, an application for tumor metabolome analysis utilizing a reference set of gene-expression profiles is shown.
Collapse
Affiliation(s)
- Hidemasa Bono
- Division of Functional Genomics and Systems Medicine, Research Center for Genomic Medicine, Saitama Medical School, 1397-1 Yamane, Hidaka, Saitama 350-1241, Japan.
| | | |
Collapse
|
1004
|
Li H, Inoue M, Yabuki T, Aoki M, Seki E, Matsuda T, Nunokawa E, Motoda Y, Kobayashi A, Terada T, Shirouzu M, Koshiba S, Lin YJ, Güntert P, Suzuki H, Hayashizaki Y, Kigawa T, Yokoyama S. Solution structure of the mouse enhancer of rudimentary protein reveals a novel fold. JOURNAL OF BIOMOLECULAR NMR 2005; 32:329-34. [PMID: 16211485 DOI: 10.1007/s10858-005-7959-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2005] [Accepted: 05/18/2005] [Indexed: 05/04/2023]
Affiliation(s)
- Hua Li
- RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Yokohama, Tsurumi, 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1005
|
Abstract
Mammalian cells harbor numerous small non-protein-coding RNAs, including small nucleolar RNAs (snoRNAs), microRNAs (miRNAs), short interfering RNAs (siRNAs) and small double-stranded RNAs, which regulate gene expression at many levels including chromatin architecture, RNA editing, RNA stability, translation, and quite possibly transcription and splicing. These RNAs are processed by multistep pathways from the introns and exons of longer primary transcripts, including protein-coding transcripts. Most show distinctive temporal- and tissue-specific expression patterns in different tissues, including embryonal stem cells and the brain, and some are imprinted. Small RNAs control a wide range of developmental and physiological pathways in animals, including hematopoietic differentiation, adipocyte differentiation and insulin secretion in mammals, and have been shown to be perturbed in cancer and other diseases. The extent of transcription of non-coding sequences and the abundance of small RNAs suggests the existence of an extensive regulatory network on the basis of RNA signaling which may underpin the development and much of the phenotypic variation in mammals and other complex organisms and which may have different genetic signatures from sequences encoding proteins.
Collapse
Affiliation(s)
- John S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia.
| | | |
Collapse
|
1006
|
Abstract
Many decisions about genome sequencing projects are directed by perceived gaps in the tree of life, or towards model organisms. With the goal of a better understanding of biology through the lens of evolution, however, there are additional genomes that are worth sequencing. One such rationale for whole-genome sequencing is discussed here, along with other important strategies for understanding the phenotypic divergence of species.
Collapse
Affiliation(s)
- David A Liberles
- Computational Biology Unit, Bergen Centre for Computational Science, University of Bergen, 5020 Bergen, Norway.
| |
Collapse
|
1007
|
Wiemann S, Kolb-Kokocinski A, Poustka A. Alternative pre-mRNA processing regulates cell-type specific expression of the IL4l1 and NUP62 genes. BMC Biol 2005; 3:16. [PMID: 16029492 PMCID: PMC1198218 DOI: 10.1186/1741-7007-3-16] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2005] [Accepted: 07/19/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Given the complexity of higher organisms, the number of genes encoded by their genomes is surprisingly small. Tissue specific regulation of expression and splicing are major factors enhancing the number of the encoded products. Commonly these mechanisms are intragenic and affect only one gene. RESULTS Here we provide evidence that the IL4I1 gene is specifically transcribed from the apparent promoter of the upstream NUP62 gene, and that the first two exons of NUP62 are also contained in the novel IL4I1_2 variant. While expression of IL4I1 driven from its previously described promoter is found mostly in B cells, the expression driven by the NUP62 promoter is restricted to cells in testis (Sertoli cells) and in the brain (e.g., Purkinje cells). Since NUP62 is itself ubiquitously expressed, the IL4I1_2 variant likely derives from cell type specific alternative pre-mRNA processing. CONCLUSION Comparative genomics suggest that the promoter upstream of the NUP62 gene originally belonged to the IL4I1 gene and was later acquired by NUP62 via insertion of a retroposon. Since both genes are apparently essential, the promoter had to serve two genes afterwards. Expression of the IL4I1 gene from the "NUP62" promoter and the tissue specific involvement of the pre-mRNA processing machinery to regulate expression of two unrelated proteins indicate a novel mechanism of gene regulation.
Collapse
Affiliation(s)
- Stefan Wiemann
- Molecular Genome Analysis, German Cancer Research Center, Im Neuenheimer Feld 580, Heidelberg, 69120, Germany
| | - Anja Kolb-Kokocinski
- Molecular Genome Analysis, German Cancer Research Center, Im Neuenheimer Feld 580, Heidelberg, 69120, Germany
| | - Annemarie Poustka
- Molecular Genome Analysis, German Cancer Research Center, Im Neuenheimer Feld 580, Heidelberg, 69120, Germany
| |
Collapse
|
1008
|
Munroe SH. Diversity of antisense regulation in eukaryotes: multiple mechanisms, emerging patterns. J Cell Biochem 2005; 93:664-71. [PMID: 15389973 DOI: 10.1002/jcb.20252] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
High-throughput analysis of RNA molecules in multicellular eukaryotes has revealed an abundance of complementary antisense RNAs that are transcribed from separate or overlapping genes. In mammals these include many novel non-coding RNAs of unknown function. This unexpected complexity of the mammalian transcriptome suggests that expression of many genes is regulated post-transcriptionally by mechanisms mediated by RNA-RNA base pairing. The recent discovery of the widespread expression of microRNAs in animals and plants provides a prototypic example of such regulation in eukaryotes. However, there are likely to be numerous other types of antisense regulation in eukaryotes, many as yet uncharacterized.
Collapse
Affiliation(s)
- Stephen H Munroe
- Department of Biological Sciences, Marquette University, Milwaukee, Wisconsin 53201, USA.
| |
Collapse
|
1009
|
Wakui M, Morel L, Butfiloski EJ, Kim C, Sobel ES. Genetic Dissection of Systemic Lupus Erythematosus Pathogenesis: Partial Functional Complementation betweenSle1andSle3/5Demonstrates Requirement for Intracellular Coexpression for Full Phenotypic Expression of Lupus. THE JOURNAL OF IMMUNOLOGY 2005; 175:1337-45. [PMID: 16002739 DOI: 10.4049/jimmunol.175.2.1337] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Sle1 on chromosome 1 and Sle3/5 on chromosome 7 are two of the most critical lupus susceptibility loci of the New Zealand Black/White-derived NZM2410 mouse strain. In contrast to C57BL/6 mice congenic for either Sle1 (B6.Sle1) or Sle3/5 (B6.Sle3/5), strains that express only a modest lupus-related phenotype, the bicongenic B6.Sle1.Sle3/5 strain has a robust phenotype, suggesting a critical role for epistatic interactions in lupus pathogenesis. Mixed chimera experiments indicated that the two loci are functionally expressed by different cell populations and predicted that phenotypic expression of the phenotypic features of the B6.Sle1.Sle3/5 strain could be fully reproduced with a combination of B6.Sle1 and B6.Sle3/5 bone marrow. Contrary to our expectations, there was only a partial functional complementation in these mixed chimeras. Spleen enlargement, CD4:CD8 ratio elevation, and epitope spreading of autoantibodies were fully developed in B6+B6.Sle1.Sle3/5 but not in B6.Sle1+B6.Sle3/5 mixed chimeras. This study is the first to present evidence that the pathways mediated by two critical lupus susceptibility loci derived from the New Zealand White strain must be integrated intracellularly for epistatic interactions to occur. Our mixed chimera approach continues to provide novel insights into the functional genetic pathways underlying this important murine model of systemic autoimmunity.
Collapse
Affiliation(s)
- Masatoshi Wakui
- Department of Medicine and Division of Rheumatology and Clinical Immunology, College of Medicine, University of Florida, Gainesville, FL 32610, USA
| | | | | | | | | |
Collapse
|
1010
|
Sheikh K, Förster J, Nielsen LK. Modeling hybridoma cell metabolism using a generic genome-scale metabolic model of Mus musculus. Biotechnol Prog 2005; 21:112-21. [PMID: 15903248 DOI: 10.1021/bp0498138] [Citation(s) in RCA: 132] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The reconstructed cellular metabolic network of Mus musculus, based on annotated genomic data, pathway databases, and currently available biochemical and physiological information, is presented. Although incomplete, it represents the first attempt to collect and characterize the metabolic network of a mammalian cell on the basis of genomic data. The reaction network is generic in nature and attempts to capture the carbon, energy, and nitrogen metabolism of the cell. The metabolic reactions were compartmentalized between the cytosol and the mitochondria, including transport reactions between the compartments and the extracellular medium. The reaction list consists of 872 internal metabolites involved in a total of 1220 reactions, whereof 473 relate to known open reading frames. Initial in silico analysis of the reconstructed model is presented.
Collapse
Affiliation(s)
- Kashif Sheikh
- Department of Chemical Engineering, The University of Queensland, St. Lucia, 4072 QLD, Australia, and Fluxome Sciences A/S, Søltofts Plads, Building 223, 2800 Lyngby, Denmark
| | | | | |
Collapse
|
1011
|
Riaño-Pachón DM, Dreyer I, Mueller-Roeber B. Orphan transcripts in Arabidopsis thaliana: identification of several hundred previously unrecognized genes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2005; 43:205-12. [PMID: 15998307 DOI: 10.1111/j.1365-313x.2005.02438.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Expressed sequence tags (ESTs) represent a huge resource for the discovery of previously unknown genetic information and functional genome assignment. In this study we screened a collection of 178 292 ESTs from Arabidopsis thaliana by testing them against previously annotated genes of the Arabidopsis genome. We identified several hundreds of new transcripts that match the Arabidopsis genome at so far unassigned loci. The transcriptional activity of these loci was independently confirmed by comparison with the Salk Whole Genome Array Data. To a large extent, the newly identified transcriptionally active genomic regions do not encode 'classic' proteins, but instead generate non-coding RNAs and/or small peptide-coding RNAs of presently unknown biological function. More than 560 transcripts identified in this study are not represented by the Affymetrix GeneChip arrays currently widely used for expression profiling in A. thaliana. Our data strongly support the hypothesis that numerous previously unknown genes exist in the Arabidopsis genome.
Collapse
Affiliation(s)
- Diego Mauricio Riaño-Pachón
- Department of Molecular Biology, Institute for Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 25, Haus 20, D-14476 Golm/Potsdam, Germany
| | | | | |
Collapse
|
1012
|
Abstract
The Gene Ontology project (http://www.geneontology.org/) produces structured, controlled vocabularies and gene product annotations. Gene products are classified according to the cellular locations and biological process in which they act, and the molecular functions that they carry out. We annotate gene products from a broad range of model species and provide support for those groups that wish to contribute annotation of further model species. The Gene Ontology facilitates the exchange of information between groups of scientists studying similar processes in different model organisms, and so provides a broad range of opportunities for plant scientists.
Collapse
Affiliation(s)
- Jennifer I Clark
- EMBL-EBI Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, United Kingdom.
| | | | | |
Collapse
|
1013
|
Abstract
The past four years have seen an explosion in the number of detected RNA transcripts with no apparent protein-coding potential. This has led to speculation that non-protein-coding RNAs (ncRNAs) might be as important as proteins in the regulation of vital cellular functions. However, there has been significantly less progress in actually demonstrating the functions of these transcripts. In this article, we review the results of recent experiments that show that transcription of non-protein-coding RNA is far more widespread than was previously anticipated. Although some ncRNAs act as molecular switches that regulate gene expression, the function of many ncRNAs is unknown. New experimental and computational approaches are emerging that will help determine whether these newly identified transcription products are evidence of important new biochemical pathways or are merely 'junk' RNA generated by the cell as a by-product of its functional activities.
Collapse
Affiliation(s)
- Alexander Hüttenhofer
- Division of Genomics and RNomics, Innsbruck Medical University-Biocenter, Fritz-Pregl-Strasse 3, 6020 Innsbruck, Austria.
| | | | | |
Collapse
|
1014
|
Abstract
A central issue in the regulation of gene expression is the physical association of transcription factors with relevant promoter sequences. Recently, technological advances have allowed researchers to analyze these processes on a genomic scale. In particular, the combination of the chromatin immunoprecipitation (ChIP) technique with microarray analysis (the 'ChIP to chip' experiment) is providing a wealth of new and surprising data on transcription factor-chromatin interactions. These advances are reviewed here. We also discuss future challenges in the area.
Collapse
Affiliation(s)
- Devanjan Sikder
- Department of Internal Medicine, Center for Biomedical Inventions, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, Texas 75390-8573, USA.
| | | |
Collapse
|
1015
|
Sharov AA, Dudekula DB, Ko MSH. Genome-wide assembly and analysis of alternative transcripts in mouse. Genome Res 2005; 15:748-54. [PMID: 15867436 PMCID: PMC1088304 DOI: 10.1101/gr.3269805] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
To build a mouse gene index with the most comprehensive coverage of alternative transcription/splicing (ATS), we developed an algorithm and a fully automated computational pipeline for transcript assembly from expressed sequences aligned to the genome. We identified 191,946 genomic loci, which included 27,497 protein-coding genes and 11,906 additional gene candidates (e.g., nonprotein-coding, but multiexon). Comparison of the resulting gene index with TIGR, UniGene, DoTS, and ESTGenes databases revealed that it had a greater number of transcripts, a greater average number of exons and introns with proper splicing sites per gene, and longer ORFs. The 27,497 protein-coding genes had 77,138 transcripts, i.e., 2.8 transcripts per gene on average. Close examination of transcripts led to a combinatorial table of 23 types of ATS units, only nine of which were previously described, i.e., 14 types of alternative splicing, seven types of alternative starts, and two types of alternative termination. The 47%, 18%, and 14% of 20,323 multiexon protein-coding genes with proper splice sites had alternative splicings, alternative starts, and alternative terminations, respectively. The gene index with the comprehensive ATS will provide a useful platform for analyzing the nature and mechanism of ATS, as well as for designing the accurate exon-based DNA microarrays. The sequence data from this study have been submitted to GenBank under accession numbers: CK329321-CK334090; CF891695-CF906652; CF906741-CF916750; CK334091-CK347104; CK387035-CK393993; CN660032-CN690720; CN690721-CN725493.
Collapse
Affiliation(s)
- Alexei A Sharov
- Developmental Genomics and Aging Section, Laboratory of Genetics, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA
| | | | | |
Collapse
|
1016
|
Arai R, Kukimoto-Niino M, Uda-Tochio H, Morita S, Uchikubo-Kamo T, Akasaka R, Etou Y, Hayashizaki Y, Kigawa T, Terada T, Shirouzu M, Yokoyama S. Crystal structure of an enhancer of rudimentary homolog (ERH) at 2.1 Angstroms resolution. Protein Sci 2005; 14:1888-93. [PMID: 15937287 PMCID: PMC2253356 DOI: 10.1110/ps.051484505] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The enhancer of rudimentary gene, e(r), of Drosophila melanogaster encodes an enhancer of rudimentary (ER) protein with functions implicated in pyrimidine biosynthesis and the cell cycle. The ER homolog (ERH) is highly conserved among vertebrates, invertebrates, and plants. Xenopus laevis ERH was reported to be a transcriptional repressor. Here we report the 2.1 Angstroms crystal structure of murine ERH (Protein Data Bank ID 1WZ7), determined by the multiwavelength anomalous dispersion (MAD) method. The monomeric structure of ERH comprises a single domain consisting of three alpha-helices and four beta-strands, which is a novel fold. In the crystal structure, ERH assumes a dimeric structure, through interactions between the beta-sheet regions. The formation of an ERH dimer is consistent with the results of analytical ultracentrifugation. The residues at the core region and at the dimer interface are highly conserved, suggesting the conservation of the dimer formation as well as the monomer fold. The long flexible loop (44 approximately 53) is also significantly conserved, suggesting that this loop region may be important for the functions of ERH. In addition, the putative phosphorylation sites are located at the start of the beta2-strand (Thr18) and at the start of the alpha1-helix (Ser24), implying that the phosphorylation might cause some structural changes.
Collapse
Affiliation(s)
- Ryoichi Arai
- Protein Research Group, RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Japan
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1017
|
Abstract
Over the past two decades a variety of mechanisms regulating cellular differentiation have been uncovered. These include signaling by morphogens or membrane-associated ligands and asymmetric segregation of cytoplasmic components. Most of these processes are driven by protein coding genes. Here I describe another possible cellular differentiation mechanism that involves asymmetric segregation of microRNAs, a group of recently discovered non-protein coding genes that have been shown to be involved in differentiation.
Collapse
Affiliation(s)
- Isaac Bentwich
- Rosetta Genomics, 10 Plaut Street, Rehovot 76706, Israel.
| |
Collapse
|
1018
|
Lee S, Bao J, Zhou G, Shapiro J, Xu J, Shi RZ, Lu X, Clark T, Johnson D, Kim YC, Wing C, Tseng C, Sun M, Lin W, Wang J, Yang H, Wang J, Du W, Wu CI, Zhang X, Wang SM. Detecting novel low-abundant transcripts in Drosophila. RNA (NEW YORK, N.Y.) 2005; 11:939-46. [PMID: 15923377 PMCID: PMC1370778 DOI: 10.1261/rna.7239605] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Increasing evidence suggests that low-abundant transcripts may play fundamental roles in biological processes. In an attempt to estimate the prevalence of low-abundant transcripts in eukaryotic genomes, we performed a transcriptome analysis in Drosophila using the SAGE technique. We collected 244,313 SAGE tags from transcripts expressed in Drosophila embryonic, larval, pupae, adult, and testicular tissue. From these SAGE tags, we identified 40,823 unique SAGE tags. Our analysis showed that 55% of the 40,823 unique SAGE tags are novel without matches in currently known Drosophila transcripts, and most of the novel SAGE tags have low copy numbers. Further analysis indicated that these novel SAGE tags represent novel low-abundant transcripts expressed from loci outside of currently annotated exons including the intergenic and intronic regions, and antisense of the currently annotated exons in the Drosophila genome. Our study reveals the presence of a significant number of novel low-abundant transcripts in Drosophila, and highlights the need to isolate these novel low-abundant transcripts for further biological studies.
Collapse
Affiliation(s)
- Sanggyu Lee
- Department of Medicine, University of Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1019
|
Loebel DAF, Tsoi B, Wong N, Tam PPL. A conserved noncoding intronic transcript at the mouse Dnm3 locus. Genomics 2005; 85:782-9. [PMID: 15885504 DOI: 10.1016/j.ygeno.2005.02.001] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2004] [Accepted: 02/02/2005] [Indexed: 01/01/2023]
Abstract
A 6-kb antisense transcript (Dnm3os) contained within an intron of the mouse Dnm3 gene has been identified in a screen for genes that may be regulated by the basic helix-loop-helix transcription factor Twist during mouse development. The antisense transcript is highly conserved between vertebrate species, but does not appear to encode a protein. We show that expression of the Dnm3 and Dnm3os transcripts overlaps during embryogenesis and in adult tissues, except that Dnm3 is most highly expressed in adult brain and testis and expressed at lower levels in embryos, whereas the antisense transcript is most strongly expressed in embryos and gravid uterus. Both Dnm3 and Dnm3os are downregulated in branchial arch tissue of Twist-null embryos. The conservation and restricted expression pattern of this noncoding transcript suggest that it may perform an important function during embryonic development and further suggest a connection between the regulation of Dnm3 and this novel intronic, antisense transcript.
Collapse
Affiliation(s)
- David A F Loebel
- Embryology Unit, Children's Medical Research Institute, Locked Bag 23, Wentworthville, NSW 2145, Australia.
| | | | | | | |
Collapse
|
1020
|
Young TL, Matsuda T, Cepko CL. The noncoding RNA taurine upregulated gene 1 is required for differentiation of the murine retina. Curr Biol 2005; 15:501-12. [PMID: 15797018 DOI: 10.1016/j.cub.2005.02.027] [Citation(s) in RCA: 397] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2004] [Revised: 02/02/2005] [Accepted: 02/03/2005] [Indexed: 11/28/2022]
Abstract
BACKGROUND With the advent of genome-wide analyses, it is becoming evident that a large number of noncoding RNAs (ncRNAs) are expressed in vertebrates. However, of the thousands of ncRNAs identified, the functions of relatively few have been established. RESULTS In a screen for genes upregulated by taurine in developing retinal cells, we identified a gene that appears to be a ncRNA. Taurine Upregulated Gene 1 (TUG1) is a spliced, polyadenylated RNA that does not encode any open reading frame greater than 82 amino acids in its full-length, 6.7 kilobase (kb) RNA sequence. Analyses of Northern blots and in situ hybridization revealed that TUG1 is expressed in the developing retina and brain, as well as in adult tissues. In the newborn retina, knockdown of TUG1 with RNA interference (RNAi) resulted in malformed or nonexistent outer segments of transfected photoreceptors. Immunofluorescent staining and microarray analyses suggested that this loss of proper photoreceptor differentiation is a result of the disregulation of photoreceptor gene expression. CONCLUSIONS A function for a newly identified ncRNA, TUG1, has been established. TUG1 is necessary for the proper formation of photoreceptors in the developing rodent retina.
Collapse
Affiliation(s)
- T L Young
- Howard Hughes Medical Institute, Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | | | | |
Collapse
|
1021
|
Schmidt-Ott KM, Yang J, Chen X, Wang H, Paragas N, Mori K, Li JY, Lu B, Costantini F, Schiffer M, Bottinger E, Barasch J. Novel Regulators of Kidney Development from the Tips of the Ureteric Bud. J Am Soc Nephrol 2005; 16:1993-2002. [PMID: 15917337 DOI: 10.1681/asn.2004121127] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Mammalian nephrogenesis depends on the interaction between the ureteric bud and the metanephric mesenchyme. As the ureteric bud undergoes branching and segmentation, the stalks differentiate into the collecting system of the mature kidney, while the tip cells interact with the adjacent cells of the metanephric mesenchyme, inducing their conversion into nephrons. This induction is mediated by secreted factors. For identifying novel mediators, the tips of the ureteric tree were isolated and microarray analyses were performed using manually refined, multistep gene ontology annotations. For identifying conserved factors, two databases were developed, one from mouse E12.5 and one from rat E13.5 ureteric buds. The overlap of mouse and rat data sets yielded 20 different transcripts that were enriched in the ureteric bud compared with metanephric mesenchyme and predicted to code for secreted proteins. Real-time reverse transcriptase-PCR and in situ hybridization confirmed these identifications. One of the genes that was highly specific to the ureteric bud tip was cytokine-like factor 1 (CLF-1). Recombinant CLF-1 in complex with its physiologic ligand, cardiotrophin-like cytokine (CLC), triggered phosphorylation of signal transducer and activator of transcription 3 in mesenchyme, a pathway characteristic of mesenchymal-to-epithelial conversion. Indeed, when applied to isolated rat metanephric mesenchyme, CLF-1/CLC (3 nM) induced mature nephron structures expressing glomerular and tubular markers. These results underline the power of this first comprehensive gene expression analysis of the ureteric bud tip to identify bioactive molecules.
Collapse
Affiliation(s)
- Kai M Schmidt-Ott
- Department of Medicine, Columbia University College of Physicians and Surgeons, 630 W. 168th Street, New York, NY 10032, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1022
|
Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, Stern D, Tammana H, Helt G, Sementchenko V, Piccolboni A, Bekiranov S, Bailey DK, Ganesh M, Ghosh S, Bell I, Gerhard DS, Gingeras TR. Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution. Science 2005; 308:1149-54. [PMID: 15790807 DOI: 10.1126/science.1108625] [Citation(s) in RCA: 868] [Impact Index Per Article: 43.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Sites of transcription of polyadenylated and nonpolyadenylated RNAs for 10 human chromosomes were mapped at 5-base pair resolution in eight cell lines. Unannotated, nonpolyadenylated transcripts comprise the major proportion of the transcriptional output of the human genome. Of all transcribed sequences, 19.4, 43.7, and 36.9% were observed to be polyadenylated, nonpolyadenylated, and bimorphic, respectively. Half of all transcribed sequences are found only in the nucleus and for the most part are unannotated. Overall, the transcribed portions of the human genome are predominantly composed of interlaced networks of both poly A+ and poly A- annotated transcripts and unannotated transcripts of unknown function. This organization has important implications for interpreting genotype-phenotype associations, regulation of gene expression, and the definition of a gene.
Collapse
MESH Headings
- Cell Line
- Cell Line, Tumor
- Cell Nucleus/metabolism
- Chromosomes, Human/genetics
- Chromosomes, Human, Pair 13/genetics
- Chromosomes, Human, Pair 14/genetics
- Chromosomes, Human, Pair 19/genetics
- Chromosomes, Human, Pair 20/genetics
- Chromosomes, Human, Pair 21/genetics
- Chromosomes, Human, Pair 22/genetics
- Chromosomes, Human, Pair 6/genetics
- Chromosomes, Human, Pair 7/genetics
- Chromosomes, Human, X/genetics
- Chromosomes, Human, Y/genetics
- Computational Biology
- Cytosol/metabolism
- DNA, Complementary
- DNA, Intergenic
- Exons
- Female
- Genome, Human
- Humans
- Introns
- Male
- Molecular Sequence Data
- Nucleic Acid Amplification Techniques
- Oligonucleotide Array Sequence Analysis
- Physical Chromosome Mapping
- RNA Splicing
- RNA, Messenger/analysis
- Transcription, Genetic
Collapse
Affiliation(s)
- Jill Cheng
- Affymetrix Inc., Santa Clara, CA 95051, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1023
|
Kriventseva EV, Koutsos AC, Blass C, Kafatos FC, Christophides GK, Zdobnov EM. AnoEST: toward A. gambiae functional genomics. Genome Res 2005; 15:893-9. [PMID: 15899967 PMCID: PMC1142480 DOI: 10.1101/gr.3756405] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Here, we present an analysis of 215,634 EST and cDNA sequences of a major vector of human malaria Anopheles gambiae structured into the AnoEST database. The expressed sequences are grouped into clusters using genomic sequence as template and associated with inferred functional annotation, including the following: corresponding Ensembl gene prediction, putative orthologous genes in other species, homology to known proteins, protein domains, associated Gene Ontology terms, and corresponding classification into broad GO-slim functional groups. AnoEST is a vital resource for interpretation of expression profiles derived using recently developed A. gambiae cDNA microarrays. Using these cDNA microarrays, we have experimentally confirmed the expression of 7961 clusters during mosquito development. Of these, 3100 are not associated with currently predicted genes. Moreover, we found that clusters with confirmed expression are nonbiased with respect to the current gene annotation or homology to known proteins. Consequently, we expect that many as yet unconfirmed clusters are likely to be actual A. gambiae genes. [AnoEST is publicly available at http://komar.embl.de, and is also accessible as a Distributed Annotation Service (DAS).].
Collapse
|
1024
|
Nishida H, Tomaru Y, Oho Y, Hayashizaki Y. Naturally occurring antisense RNA of histone H2a in mouse cultured cell lines. BMC Genet 2005; 6:23. [PMID: 15892893 PMCID: PMC1156883 DOI: 10.1186/1471-2156-6-23] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2004] [Accepted: 05/14/2005] [Indexed: 11/30/2022] Open
Abstract
Background An antisense transcript of histone H2a that has no significant protein-coding region has been cloned from a mouse full-length cDNA library. In the present study, we evaluated this transcript by using RT-PCR and compared the expression patterns of the sense and antisense transcripts by using quantitative RT-PCR (qRT-PCR). Results This antisense RNA was expressed in three mouse cell lines. We call it ASH2a. ASH2a includes not only the complementary sequence of the transcript of Hist2h2aa2 (a replication-dependent histone H2a gene), but also that of the promoter of Hist2h2aa2. The upstream genomic sequence of the transcription start site of the ASH2a-coding gene (ASH2a) lacks both CCAAT and TATA boxes. This absence suggests that the regulation of ASH2a is different from that of the replication-dependent histone H2a genes. Findings from qRT-PCR indicated that the expression pattern of ASH2a was different from that of Hist2h2aa2. Expression of Hist2h2aa2 peaked at 2 to 4 h during S-phase, but that of ASH2a peaked at 1 h. Conclusion We showed the existence of ASH2a, a histone H2a antisense RNA, in mouse cultured cells. The expression pattern of ASH2a is different from that of the sense RNA.
Collapse
Affiliation(s)
- Hiromi Nishida
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Yasuhiro Tomaru
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Yuko Oho
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Yoshihide Hayashizaki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
1025
|
Matsumura Y, Shimokawa K, Hayashizaki Y, Ikeo K, Tateno Y, Kawai J. Development of a spot reliability evaluation score for DNA microarrays. Gene 2005; 350:149-60. [PMID: 15788151 DOI: 10.1016/j.gene.2005.02.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2004] [Revised: 01/28/2005] [Accepted: 02/08/2005] [Indexed: 10/25/2022]
Abstract
We developed a reliability index named SRED (Spot Reliability Evaluation Score for DNA microarrays) that represents the probability that the calibrated gene expression level from a DNA microarray would be less than a factor of 2 different from that of quantitative real-time polymerase chain reaction assays whose dynamic quantification range is treated statistically to be similar to that of the DNA microarray. To define the SRED score, two parameters, the reproducibility of measurement value and the relative expression value were selected from nine candidate parameters. The SRED score supplies the probability that the expression level in each spot of a microarray is less than a certain-fold different compared to other expression profiling data, such as QRT-PCR. This score was applied to approximately 1,500,000 points of the expression profile in the RIKEN Expression Array Database.
Collapse
Affiliation(s)
- Yonehiro Matsumura
- Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | | | |
Collapse
|
1026
|
Ko MSH. Molecular biology of preimplantation embryos: primer for philosophical discussions. Reprod Biomed Online 2005; 10 Suppl 1:80-7. [PMID: 15820015 DOI: 10.1016/s1472-6483(10)62212-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This article is based on a presentation at the First International Conference on Ethics, Science and Moral Philosophy of Assisted Human Reproduction. The goal is to provide scientific background for the discussion of philosophic issues. Recent advances in the systematic molecular analysis of preimplantation embryos are summarized, including the molecular identification of nearly all genes involved in preimplantation development and their detailed expression patterns. Notwithstanding a quantum leap in molecular understanding of preimplantation embryos, molecular evidence seems to provide no decisive definition of a threshold for the beginning of human life during preimplantation development.
Collapse
Affiliation(s)
- Minoru S H Ko
- Developmental Genomics and Ageing Section, Laboratory of Genetics, National Institute on Ageing, National Institutes of Health, Baltimore, MD 21224, USA
| |
Collapse
|
1027
|
Pulakat L, Cooper S, Knowle D, Mandavia C, Bruhl S, Hetrick M, Gavini N. Ligand-dependent complex formation between the Angiotensin II receptor subtype AT2 and Na+/H+ exchanger NHE6 in mammalian cells. Peptides 2005; 26:863-73. [PMID: 15808917 DOI: 10.1016/j.peptides.2004.12.015] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/02/2004] [Revised: 12/14/2004] [Accepted: 12/14/2004] [Indexed: 11/22/2022]
Abstract
Involvement of Angiotensin II (Ang II) in the regulation of sodium levels by modulating the Na+/H+ exchangers is demonstrated in many tissues. Screening of a mouse 17-day fetus cDNA library with the Angiotensin II receptor AT2 as the bait in yeast two-hybrid assay led us to identify an AT2-interacting mouse fetus peptide that shared 98% amino acid identity with the corresponding region of the human NHE6. NCBI Blast search showed that the clone 6430520C02 (GenBank Accession # AK032326) of the mouse genome project carried the complete sequence of this new mouse NHE6 isoform. The human and mouse NHE6 peptides share 97% overall homology. Further analysis showed that the region spanning the third intracellular loop and C-terminal cytoplasmic tail of the AT2 directly interacted with a 182 amino acid region that spans the predicted 5th intracellular loop and the initial part of the C-terminus of the mouse NHE6 in yeast two-hybrid assay. This 182-amino acid region that interacted with the AT2 also shares 98% homology with the corresponding region of rat NHE6 and therefore is highly conserved across species. We detected widespread expression of this NHE6 isoform in several rat tissues including 10-day fetus, 17-day fetus, and 30-day post-natal tissues of heart, brain, kidney and muscle. Moreover, the AT2 co-immunoiprecipitated with a hemagglutinin tagged NHE6 when expressed in human cell line MCF-7, and activated by AngII. This ligand-dependent complex formation between the AT2 and NHE6 suggests that the hormone Ang II may act as a regulator of NHE6, and Ang II-mediated direct protein-protein interaction between AT2 and NHE6 could be a mechanism for modulating the functions of the ubiquitously expressed NHE6 in different tissues.
Collapse
Affiliation(s)
- Lakshmi Pulakat
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH 43403, USA.
| | | | | | | | | | | | | |
Collapse
|
1028
|
Rogers AN, Vanburen DG, Hedblom EE, Tilahun ME, Telfer JC, Baldwin CL. Gammadelta T cell function varies with the expressed WC1 coreceptor. THE JOURNAL OF IMMUNOLOGY 2005; 174:3386-93. [PMID: 15749871 DOI: 10.4049/jimmunol.174.6.3386] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
WC1 molecules are transmembrane glycoproteins belonging to the scavenger receptor cysteine-rich family and uniquely expressed on gammadelta T cells. Although participation of WC1+ gammadelta T cells in immune responses is well established, very little is understood regarding the significance of expressing different forms of the WC1 molecule. Two forms previously identified by mAbs, i.e., WC1.1 and WC1.2, are expressed by largely nonoverlapping subpopulations of gammadelta T cells. In this study it was shown that expression of the WC1.1 coreceptor was the main indicator of proliferation and IFN-gamma production in response to autologous and bacterial Ags as well as for IFN-gamma production without proliferation in Th1-polarizing, IL-12-containing cultures. Nevertheless, after culture in either Th1-polarizing or neutral conditions, mRNA was present for both T-bet and GATA-3 as well as for IL-12Rbeta2 in WC1.1+ and WC1.2+ subpopulations, and neither produced IL-4 under any conditions. Although the steady decrease in the proportion of WC1.1+ cells, but not WC1.2+ cells, within PBMC with animal aging suggested that the two subpopulations may have different roles in immune regulation, cells bearing either WC1.1 or WC1.2 expressed mRNA for regulatory cytokines IL-10 and TGF-beta, with TGF-beta being constitutively expressed by ex vivo cells. Overall, the results demonstrate that the form of the WC1 coreceptor expressed on gammadelta T cells divides them into functional subsets according to IFN-gamma production and proliferative capacity to specific stimuli as well as with regard to representation within PBMC. Finally, evidence is provided for minor differences in the intracytoplasmic tail sequences of WC1.1 and WC1.2 that may affect signaling.
Collapse
MESH Headings
- Aging/immunology
- Amino Acid Sequence
- Animals
- Base Sequence
- Cattle
- Cell Proliferation
- Cells, Cultured
- DNA/genetics
- Gene Expression
- In Vitro Techniques
- Interferon-gamma/biosynthesis
- Membrane Glycoproteins/chemistry
- Membrane Glycoproteins/genetics
- Membrane Glycoproteins/immunology
- Membrane Glycoproteins/metabolism
- Molecular Sequence Data
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Receptors, Antigen, T-Cell, gamma-delta/metabolism
- Receptors, Immunologic/chemistry
- Receptors, Immunologic/genetics
- Receptors, Immunologic/metabolism
- Receptors, Interleukin-2/metabolism
- Sequence Homology, Amino Acid
- T-Lymphocyte Subsets/cytology
- T-Lymphocyte Subsets/immunology
Collapse
Affiliation(s)
- Aric N Rogers
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, MA 01003, USA
| | | | | | | | | | | |
Collapse
|
1029
|
Morris KA, Snir E, Pompeia C, Koroleva IV, Kachar B, Hayashizaki Y, Carninci P, Soares MB, Beisel KW. Differential expression of genes within the cochlea as defined by a custom mouse inner ear microarray. J Assoc Res Otolaryngol 2005; 6:75-89. [PMID: 15735932 PMCID: PMC2504641 DOI: 10.1007/s10162-004-5046-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2004] [Accepted: 11/19/2004] [Indexed: 11/24/2022] Open
Abstract
Microarray analyses have contributed greatly to the rapid understanding of functional genomics through the identification of gene networks as well as gene discovery. To facilitate functional genomics of the inner ear, we have developed a mouse inner-ear-pertinent custom microarray chip (CMA-IE1). Nonredundant cDNA clones were obtained from two cDNA library resources: the RIKEN subtracted inner ear set and the NIH organ of Corti library. At least 2000 cDNAs unique to the inner ear were present on the chip. Comparisons were performed to examine the relative expression levels of these unique cDNAs within the organ of Corti, lateral wall, and spiral ganglion. Total RNA samples were obtained from the three cochlear-dissected fractions from adult CF-1 mice. The total RNA was linearly amplified, and a dendrimer-based system was utilized to enhance the hybridization signal. Differentially expressed genes were verified by comparison to known gene expression patterns in the cochlea or by correlation with genes and gene families deduced to be present in the three tissue types. Approximately 22-25% of the genes on the array had significant levels of expression. A number of differentially expressed genes were detected in each tissue fraction. These included genes with known functional roles, hypothetical genes, and various unknown or uncharacterized genes. Four of the differentially expressed genes found in the organ of Corti are linked to deafness loci. None of these are hypothetical or unknown genes.
Collapse
Affiliation(s)
- Ken A. Morris
- Department of Biomedical Sciences, Creighton University, 2500 California Plaza, Omaha, NE 68178 USA
| | - Einat Snir
- Pediatrics-Genetics, Iowa University, Iowa, IA 52242 USA
| | - Celine Pompeia
- Section on Structural Cell Biology, NIDCD/NIH, Bethesda, MD 20892 USA
| | | | - Bechara Kachar
- Section on Structural Cell Biology, NIDCD/NIH, Bethesda, MD 20892 USA
| | - Yoshihide Hayashizaki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Tsukuba, Japan
| | - Piero Carninci
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Tsukuba, Japan
| | | | - Kirk W. Beisel
- Department of Biomedical Sciences, Creighton University, 2500 California Plaza, Omaha, NE 68178 USA
| |
Collapse
|
1030
|
Chao H, Spicer AP. Natural antisense mRNAs to hyaluronan synthase 2 inhibit hyaluronan biosynthesis and cell proliferation. J Biol Chem 2005; 280:27513-22. [PMID: 15843373 DOI: 10.1074/jbc.m411544200] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We report the identification of a natural antisense mRNA of hyaluronan synthase 2 that we have chosen to designate as HASNT (for HA synthase 2 antisense) in human and mouse. HASNT is transcribed from the opposite strand of the HAS2 gene locus and is represented by several independent expressed sequence tags in human. Portions of the mouse Hasnt gene were identified through an exon-trapping approach. Sequence conservation is extremely low between human and mouse HASNT, and it is not clear whether these mRNAs contain functional open reading frames. HASNT has an alternate splice site in both human and mouse. This splice site is located at an identical position within the gene in both species and results in mRNAs of two different lengths. In each species, the antisense portion of the HASNT gene is complementary to the first exon of HAS2, which represents the 5'-untranslated region. To study the biological activity of HASNT, two human expressed sequence tag clones, representing long and short HASNT splice variants, were cloned into a tetracycline-inducible vector and were stably transfected into human osteosarcoma U2-OS Tet-on cells. The long and short HASNT-expressing cells had a reduction in HAS2 mRNA levels up to 94 and 86%, respectively, whereas hyaluronan biosynthesis was inhibited by 40 and 37%, respectively. Cell proliferation was reduced throughout the time frame of the experiment. Exogenous high molecular mass hyaluronan failed to rescue the suppressed cell proliferation, whereas adenoviral-mediated overexpression of hyaluronan synthase 3, which stimulated endogenous hyaluronan biosynthesis, was able to rescue. Collectively, our data suggest that natural antisense mRNAs of HAS2 are able to regulate HAS2 mRNA levels and hyaluronan biosynthesis in a cell culture model system and may have an important and novel regulatory role in the control of HAS2, HA biosynthesis, and HA-dependent cell functions in vivo.
Collapse
MESH Headings
- 5' Untranslated Regions
- Adenoviridae/genetics
- Alternative Splicing
- Animals
- Base Sequence
- Blotting, Northern
- COS Cells
- Cell Adhesion
- Cell Line, Tumor
- Cell Proliferation
- Culture Media, Serum-Free/pharmacology
- DNA, Complementary/metabolism
- Databases, Genetic
- Exons
- Expressed Sequence Tags
- Genetic Vectors
- Glucuronosyltransferase/chemistry
- Glucuronosyltransferase/genetics
- Glucuronosyltransferase/metabolism
- Humans
- Hyaluronan Synthases
- Hyaluronic Acid/antagonists & inhibitors
- Hyaluronic Acid/biosynthesis
- Mice
- Models, Genetic
- Molecular Sequence Data
- Oligonucleotides, Antisense/chemistry
- Oligonucleotides, Antisense/genetics
- Open Reading Frames
- Plasmids/metabolism
- RNA, Messenger/metabolism
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Homology, Nucleic Acid
- Tetracycline/pharmacology
- Time Factors
- Transfection
Collapse
Affiliation(s)
- Hsu Chao
- Center for Extracellular Matrix Biology, Institute of Biosciences and Technology, Texas A&M University System Health Science Center, Houston, Texas 77030, USA
| | | |
Collapse
|
1031
|
Han K, Yeo G, An P, Burge CB, Grabowski PJ. A combinatorial code for splicing silencing: UAGG and GGGG motifs. PLoS Biol 2005; 3:e158. [PMID: 15828859 PMCID: PMC1079783 DOI: 10.1371/journal.pbio.0030158] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2004] [Accepted: 03/04/2005] [Indexed: 12/28/2022] Open
Abstract
Alternative pre-mRNA splicing is widely used to regulate gene expression by tuning the levels of tissue-specific mRNA isoforms. Few regulatory mechanisms are understood at the level of combinatorial control despite numerous sequences, distinct from splice sites, that have been shown to play roles in splicing enhancement or silencing. Here we use molecular approaches to identify a ternary combination of exonic UAGG and 5′-splice-site-proximal
GGGG motifs that functions cooperatively to silence the brain-region-specific CI cassette exon (exon 19) of the glutamate NMDA R1 receptor (GRIN1) transcript. Disruption of three components of the motif pattern converted the CI cassette into a constitutive exon, while predominant skipping was conferred when the same components were introduced, de novo, into a heterologous constitutive exon. Predominant exon silencing was directed by the motif pattern in the presence of six competing exonic splicing enhancers, and this effect was retained after systematically repositioning the two exonic UAGGs within the CI cassette. In this system, hnRNP A1 was shown to mediate silencing while hnRNP H antagonized silencing. Genome-wide computational analysis combined with RT-PCR testing showed that a class of skipped human and mouse exons can be identified by searches that preserve the sequence and spatial configuration of the UAGG and
GGGG motifs. This analysis suggests that the multi-component silencing code may play an important role in the tissue-specific regulation of the CI cassette exon, and that it may serve more generally as a molecular language to allow for intricate adjustments and the coordination of splicing patterns from different genes.
Many genes are alternatively spliced, but the signals that regulate the process are unclear. These authors have found a sequence motif that appears to function at many alternatively spliced genes
Collapse
Affiliation(s)
- Kyoungha Han
- 1Department of Biological Sciences, University of PittsburghPittsburgh, PennsylvaniaUnited States of America
| | - Gene Yeo
- 2Department of Brain and Cognitive Sciences, Massachusetts Institute of TechnologyBoston, MassachusettsUnited States of America
| | - Ping An
- 1Department of Biological Sciences, University of PittsburghPittsburgh, PennsylvaniaUnited States of America
| | - Christopher B Burge
- 3Department of Biology, Massachusetts Institute of TechnologyBoston, MassachusettsUnited States of America
| | - Paula J Grabowski
- 1Department of Biological Sciences, University of PittsburghPittsburgh, PennsylvaniaUnited States of America
| |
Collapse
|
1032
|
Pang KC, Stephen S, Engström PG, Tajul-Arifin K, Chen W, Wahlestedt C, Lenhard B, Hayashizaki Y, Mattick JS. RNAdb--a comprehensive mammalian noncoding RNA database. Nucleic Acids Res 2005; 33:D125-30. [PMID: 15608161 PMCID: PMC540043 DOI: 10.1093/nar/gki089] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
In recent years, there have been increasing numbers of transcripts identified that do not encode proteins, many of which are developmentally regulated and appear to have regulatory functions. Here, we describe the construction of a comprehensive mammalian noncoding RNA database (RNAdb) which contains over 800 unique experimentally studied non-coding RNAs (ncRNAs), including many associated with diseases and/or developmental processes. The database is available at http://research.imb.uq.edu.au/RNAdb and is searchable by many criteria. It includes microRNAs and snoRNAs, but not infrastructural RNAs, such as rRNAs and tRNAs, which are catalogued elsewhere. The database also includes over 1100 putative antisense ncRNAs and almost 20 000 putative ncRNAs identified in high-quality murine and human cDNA libraries, with more to be added in the near future. Many of these RNAs are large, and many are spliced, some alternatively. The database will be useful as a foundation for the emerging field of RNomics and the characterization of the roles of ncRNAs in mammalian gene expression and regulation.
Collapse
Affiliation(s)
- Ken C Pang
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland 4072, Australia
| | | | | | | | | | | | | | | | | |
Collapse
|
1033
|
Tateno Y, Saitou N, Okubo K, Sugawara H, Gojobori T. DDBJ in collaboration with mass-sequencing teams on annotation. Nucleic Acids Res 2005; 33:D25-8. [PMID: 15608189 PMCID: PMC539974 DOI: 10.1093/nar/gki020] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the past year, we at DDBJ (DNA Data Bank of Japan; http://www.ddbj.nig.ac.jp) collected and released 1 066 084 entries or 718 072 425 bases including the whole chromosome 22 of chimpanzee, the whole-genome shotgun sequences of silkworm and various others. On the other hand, we hosted workshops for human full-length cDNA annotation and participated in jamborees of mouse full-length cDNA annotation. The annotated data are made public at DDBJ. We are also in collaboration with a RIKEN team to accept and release the CAGE (Cap Analysis Gene Expression) data under a new category, MGA (Mass Sequences for Genome Annotation). The data will be useful for studying gene expression control in many aspects.
Collapse
Affiliation(s)
- Y Tateno
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Yata, Mishima, 411-8540, Japan.
| | | | | | | | | |
Collapse
|
1034
|
Militello KT, Patel V, Chessler AD, Fisher JK, Kasper JM, Gunasekera A, Wirth DF. RNA polymerase II synthesizes antisense RNA in Plasmodium falciparum. RNA (NEW YORK, N.Y.) 2005; 11:365-370. [PMID: 15703443 PMCID: PMC1370726 DOI: 10.1261/rna.7940705] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2004] [Accepted: 12/10/2004] [Indexed: 05/24/2023]
Abstract
The recent identification of antisense RNA in the transcriptomes of many eukaryotes has generated enormous interest. The presence of antisense RNA in Plasmodium falciparum, the causative agent of severe malaria, remains controversial. Elucidation of the mechanism of antisense RNA in P. falciparum synthesis is critical in order to demonstrate the origin and function of these transcripts. Therefore, a systematic analysis of antisense and sense RNA synthesis was performed using direct labeling experiments. Nuclear run on experiments with single-stranded DNA probes demonstrated that antisense RNA is synthesized in the nucleus at several genomic loci. Antisense RNA synthesis is sensitive to the potent RNA polymerase II inhibitor alpha-amanitin. Antisense and sense transcription was also detected in nuclei isolated from synchronized parasites, suggesting concurrent synthesis. In summary, our experiments directly demonstrate that antisense RNA synthesis is a common transcriptional phenomenon in P. falciparum, and is catalyzed by RNA polymerase II.
Collapse
Affiliation(s)
- Kevin T Militello
- Harvard School of Public Health, Department of Immunology and Infectious Diseases, 665 Huntington Avenue, Building I, Room 705, Boston, MA 02115, USA
| | | | | | | | | | | | | |
Collapse
|
1035
|
Kiyosawa H, Mise N, Iwase S, Hayashizaki Y, Abe K. Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. Genome Res 2005; 15:463-74. [PMID: 15781571 PMCID: PMC1074361 DOI: 10.1101/gr.3155905] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Genome-wide in silico analysis identified thousands of natural sense-antisense transcript (SAT) pairs in the mouse transcriptome. We investigated their expression using strand-specific oligo-microarray that distinguishes expression of sense and antisense RNA from 1947 SAT pairs. The majority of the predicted SATs are expressed at various steady-state levels in various tissues, and cluster analysis of the array data demonstrated that the ratio of sense and antisense expression for some of the SATs fluctuated markedly among these tissues, while the rest was unchanged. Surprisingly, further analyses indicated that vast amounts of multiple-sized transcripts are expressed from the SAT loci, which tended to be poly(A) negative, and nuclear localized. The tendency that the SATs are often not polyadenylated is conserved, even in the randomly chosen SAT genes in the plant Arabidopsis thaliana. Such common characteristics imply general roles of the SATs in regulation of gene expression.
Collapse
Affiliation(s)
- Hidenori Kiyosawa
- Technology and Development Team for Mammalian Cellular Dynamics, BioResource Center (BRC), RIKEN Tsukuba Institute, Tsukuba, Ibaraki, Japan 305-0074
| | | | | | | | | |
Collapse
|
1036
|
Zhang SSM, Xu X, Li J, Liu MG, Zhao H, Soares MB, Barnstable CJ, Fu XY. Comprehensive in silico functional specification of mouse retina transcripts. BMC Genomics 2005; 6:40. [PMID: 15777472 PMCID: PMC1083414 DOI: 10.1186/1471-2164-6-40] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2004] [Accepted: 03/18/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The retina is a well-defined portion of the central nervous system (CNS) that has been used as a model for CNS development and function studies. The full specification of transcripts in an individual tissue or cell type, like retina, can greatly aid the understanding of the control of cell differentiation and cell function. In this study, we have integrated computational bioinformatics and microarray experimental approaches to classify the tissue specificity and developmental distribution of mouse retina transcripts. RESULTS We have classified a set of retina-specific genes using sequence-based screening integrated with computational and retina tissue-specific microarray approaches. 33,737 non-redundant sequences were identified as retina transcript clusters (RTCs) from more than 81,000 mouse retina ESTs. We estimate that about 19,000 to 20,000 genes might express in mouse retina from embryonic to adult stages. 39.1% of the RTCs are not covered by 60,770 RIKEN full-length cDNAs. Through comparison with 2 million mouse ESTs, spectra of neural, retinal, late-generated retinal, and photoreceptor -enriched RTCs have been generated. More than 70% of these RTCs have data from biological experiments confirming their tissue-specific expression pattern. The highest-grade retina-enriched pool covered almost all the known genes encoding proteins involved in photo-transduction. CONCLUSION This study provides a comprehensive mouse retina transcript profile for further gene discovery in retina and suggests that tissue-specific transcripts contribute substantially to the whole transcriptome.
Collapse
Affiliation(s)
- Samuel Shao-Min Zhang
- Departments of Pathology, Yale University School of Medicine, 310 Cedar Street, New Haven, CT 06520, USA
- Departments of Ophthalmology and Visual Science, Yale University School of Medicine, 330 Cedar Street, New Haven, CT 06520, USA
| | - Xuming Xu
- Departments of Pathology, Yale University School of Medicine, 310 Cedar Street, New Haven, CT 06520, USA
- Departments of Ophthalmology and Visual Science, Yale University School of Medicine, 330 Cedar Street, New Haven, CT 06520, USA
| | - Jinming Li
- Department of Epidemiology and Public Health, Yale University School of Medicine, 60 College Street, New Haven, CT 06520, USA
| | - Mu-Gen Liu
- Departments of Pathology, Yale University School of Medicine, 310 Cedar Street, New Haven, CT 06520, USA
- Departments of Ophthalmology and Visual Science, Yale University School of Medicine, 330 Cedar Street, New Haven, CT 06520, USA
| | - Hongyu Zhao
- Department of Epidemiology and Public Health, Yale University School of Medicine, 60 College Street, New Haven, CT 06520, USA
- Department of Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA
| | - M Bento Soares
- Departments of Pediatrics, Biochemistry, Orthopaedics, Physiology and Biophysics, The University of Iowa Roy J. and Lucille A. Carver College of Medicine, 375 Newton Road, Iowa City, IA 52242, USA
| | - Colin J Barnstable
- Departments of Ophthalmology and Visual Science, Yale University School of Medicine, 330 Cedar Street, New Haven, CT 06520, USA
- Department of Neurobiology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA
| | - Xin-Yuan Fu
- Departments of Pathology, Yale University School of Medicine, 310 Cedar Street, New Haven, CT 06520, USA
| |
Collapse
|
1037
|
Mielenz D, Vettermann C, Hampel M, Lang C, Avramidou A, Karas M, Jäck HM. Lipid Rafts Associate with Intracellular B Cell Receptors and Exhibit a B Cell Stage-Specific Protein Composition. THE JOURNAL OF IMMUNOLOGY 2005; 174:3508-17. [PMID: 15749887 DOI: 10.4049/jimmunol.174.6.3508] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Lipid rafts serve as platforms for BCR signal transduction. To better define the molecular basis of these membrane microdomains, we used two-dimensional gel electrophoresis and mass spectrometry to characterize lipid raft proteins from mature as well as immature B cell lines. Of 51 specific raft proteins, we identified a total of 18 proteins by peptide mass fingerprinting. Among them, we found vacuolar ATPase subunits alpha-1 and beta-2, vimentin, gamma-actin, mitofilin, and prohibitin. None of these has previously been reported in lipid rafts of B cells. The differential raft association of three proteins, including a novel potential signaling molecule designated swiprosin-1, correlated with the stage-specific sensitivity of B cells to BCR-induced apoptosis. In addition, MHC class II molecules were detected in lipid rafts of mature, but not immature B cells. This intriguing finding points to a role for lipid rafts in regulating Ag presentation during B cell maturation. Finally, a fraction of the BCR in the B cell line CH27 was constitutively present in lipid rafts. Surprisingly, this fraction was neither expressed at the cell surface nor fully O-glycosylated. Thus, we conclude that partitioning the BCR into lipid rafts occurs in the endoplasmic reticulum/cis-Golgi compartment and may represent a control mechanism for surface transport.
Collapse
Affiliation(s)
- Dirk Mielenz
- Division of Molecular Immunology, Department of Internal Medicine III, University of Erlangen, Erlangen, Germany.
| | | | | | | | | | | | | |
Collapse
|
1038
|
Cowieson NP, Listwan P, Kurz M, Aagaard A, Ravasi T, Wells C, Huber T, Hume DA, Kobe B, Martin JL. Pilot studies on the parallel production of soluble mouse proteins in a bacterial expression system. ACTA ACUST UNITED AC 2005; 6:13-20. [PMID: 15965734 DOI: 10.1007/s10969-005-0462-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2004] [Accepted: 01/05/2005] [Indexed: 10/25/2022]
Abstract
We investigated the parallel production in medium throughput of mouse proteins, using protocols that involved recombinatorial cloning, protein expression screening and batch purification. The methods were scaled up to allow the simultaneous processing of tens or hundreds of protein samples. Scale-up was achieved in two stages. In an initial study, 30 targets were processed manually but with common protocols for all targets. In the second study, these protocols were applied to 96 target proteins that were processed in an automated manner. The success rates at each stage of the study were similar for both the manual and automated approaches. Overall, 15 of the selected 126 target mouse genes (12%) yielded soluble protein products in a bacterial expression system. This success rate compares favourably with other protein screening projects, particularly for eukaryotic proteins, and could be further improved by modifications at the cloning step.
Collapse
|
1039
|
Burns-Hamuro LL, Barraclough DM, Taylor SS. Identification and functional analysis of dual-specific A kinase-anchoring protein-2. Methods Enzymol 2005; 390:354-74. [PMID: 15488188 DOI: 10.1016/s0076-6879(04)90022-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
Abstract
Since the cloning of dual-specificity A kinase-anchoring protein 2 (D-AKAP2), there has been considerable progress in understanding the structural features of this AKAP and its interaction with protein kinase A (PKA). The domain organization of D-AKAP2 is quite unique, containing two tandem, putative RGS domains, a PKA-binding motif, and a PDZ (PSD95/Dlg/ZO1)-binding motif. Although the function of D-AKAP2 has remained elusive, several reports suggest that D-AKAP2 is targeted to cotransporters in the kidney and that it may play a role in regulating transporter activity. In addition, the finding that a single nucleotide polymorphism in the PKA-binding region of D-AKAP2 may contribute to increased morbidity and mortality emphasizes the potential importance of this protein in pathogenesis. The first part of this article focuses on initial efforts to identify and clone D-AKAP2, followed by tissue localization and expression profiles. The latter half of the article focuses on the domain organization of D-AKAP2 and its interaction with PKA. Finally, a comprehensive analysis of the PKA binding motif is described, which has led to the development of novel peptides derived from D-AKAP2 that can be useful tools in probing the function of this AKAP in cellular and animal models.
Collapse
|
1040
|
Ozawa T, Nishitani K, Sako Y, Umezawa Y. A high-throughput screening of genes that encode proteins transported into the endoplasmic reticulum in mammalian cells. Nucleic Acids Res 2005; 33:e34. [PMID: 15731327 PMCID: PMC549573 DOI: 10.1093/nar/gni032] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The compartments of eukaryotic cells maintain a distinct protein composition to perform a variety of specialized functions. We developed a new method for identifying the proteins that are transported to the endoplasmic reticulum (ER) in living mammalian cells. The principle is based on the reconstitution of two split fragments of enhanced green fluorescent protein (EGFP) by protein splicing with DnaE from Synechocystis PCC6803. Complementary DNA (cDNA) libraries fused to the N-terminal halves of DnaE and EGFP are introduced in mammalian cells with retroviruses. If an expressed protein is transported into the ER, the N-terminal half of EGFP meets its C-terminal half in the ER, and full-length EGFP is reconstituted by protein splicing. The fluorescent cells are isolated using fluorescence-activated cell sorting and the cDNAs are sequenced. The developed method was able to accurately identify cDNAs that encode proteins transported to the ER. We identified 27 novel proteins as the ER-targeting proteins. The present method overcomes the limitation of the previous GFP- or epitope-tagged methods, using which it was difficult to identify the ER-targeting proteins in a high-throughput manner.
Collapse
Affiliation(s)
- Takeaki Ozawa
- Department of Chemistry, School of Science, The University of TokyoHongo, Bunkyo-ku, Tokyo 113-0033, Japan
- Japan Science and Technology CorporationTokyo, Japan
- PREST, Japan Science and Technology Agency4-1-8 Honcho Kawaguchi, Saitama, Japan
| | - Kengo Nishitani
- Department of Chemistry, School of Science, The University of TokyoHongo, Bunkyo-ku, Tokyo 113-0033, Japan
- Japan Science and Technology CorporationTokyo, Japan
| | - Yusuke Sako
- Department of Chemistry, School of Science, The University of TokyoHongo, Bunkyo-ku, Tokyo 113-0033, Japan
- Japan Science and Technology CorporationTokyo, Japan
| | - Yoshio Umezawa
- Department of Chemistry, School of Science, The University of TokyoHongo, Bunkyo-ku, Tokyo 113-0033, Japan
- Japan Science and Technology CorporationTokyo, Japan
- To whom correspondence should be addressed. Tel: +81 3 5841 4351; Fax: +81 3 5841 8349;
| |
Collapse
|
1041
|
Mottagui-Tabar S, Faghihi MA, Mizuno Y, Engström PG, Lenhard B, Wasserman WW, Wahlestedt C. Identification of functional SNPs in the 5-prime flanking sequences of human genes. BMC Genomics 2005; 6:18. [PMID: 15717931 PMCID: PMC550646 DOI: 10.1186/1471-2164-6-18] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2004] [Accepted: 02/17/2005] [Indexed: 11/10/2022] Open
Abstract
Background Over 4 million single nucleotide polymorphisms (SNPs) are currently reported to exist within the human genome. Only a small fraction of these SNPs alter gene function or expression, and therefore might be associated with a cell phenotype. These functional SNPs are consequently important in understanding human health. Information related to functional SNPs in candidate disease genes is critical for cost effective genetic association studies, which attempt to understand the genetics of complex diseases like diabetes, Alzheimer's, etc. Robust methods for the identification of functional SNPs are therefore crucial. We report one such experimental approach. Results Sequence conserved between mouse and human genomes, within 5 kilobases of the 5-prime end of 176 GPCR genes, were screened for SNPs. Sequences flanking these SNPs were scored for transcription factor binding sites. Allelic pairs resulting in a significant score difference were predicted to influence the binding of transcription factors (TFs). Ten such SNPs were selected for mobility shift assays (EMSA), resulting in 7 of them exhibiting a reproducible shift. The full-length promoter regions with 4 of the 7 SNPs were cloned in a Luciferase based plasmid reporter system. Two out of the 4 SNPs exhibited differential promoter activity in several human cell lines. Conclusions We propose a method for effective selection of functional, regulatory SNPs that are located in evolutionary conserved 5-prime flanking regions (5'-FR) regions of human genes and influence the activity of the transcriptional regulatory region. Some SNPs behave differently in different cell types.
Collapse
Affiliation(s)
- Salim Mottagui-Tabar
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| | - Mohammad A Faghihi
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| | - Yosuke Mizuno
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| | - Pär G Engström
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| | - Boris Lenhard
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| | - Wyeth W Wasserman
- Centre for Molecular Medicine and Therapeutics, University of British Columbia, Vancouver, BC V5Z 4H4, Canada
| | - Claes Wahlestedt
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17177 Stockholm, Sweden
| |
Collapse
|
1042
|
Zdobnov EM, Campillos M, Harrington ED, Torrents D, Bork P. Protein coding potential of retroviruses and other transposable elements in vertebrate genomes. Nucleic Acids Res 2005; 33:946-54. [PMID: 15716312 PMCID: PMC549403 DOI: 10.1093/nar/gki236] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We suggest an annotation strategy for genes encoded by retroviruses and transposable elements (RETRA genes) based on a set of marker protein domains. Usually RETRA genes are masked in vertebrate genomes prior to the application of automated gene prediction pipelines under the assumption that they provide no selective advantage to the host. Yet, we show that about 1000 genes in four vertebrate gene sets analyzed contain at least one RETRA gene marker domain. Using the conservation of genomic neighborhood (synteny), we were able to discriminate between RETRA genes with putative functionality in the vertebrates and those that probably function only in the context of mobile elements. We identified 35 such genes in human, along with their corresponding mouse and rat orthologs; which included almost all known human genes with similarity to mobile elements. The results also imply that the vast majority of the remaining RETRA genes in current gene sets are unlikely to encode vertebrate functions. To automatically annotate RETRA genes in other vertebrate genomes, we provide as a tool a set of marker protein domains and a manually refined list of domesticated or ancestral RETRA genes for rescuing genes with vertebrate functions.
Collapse
Affiliation(s)
| | | | | | | | - Peer Bork
- EMBLMeyerhofstrasse 1, 69117 Heidelberg, Germany
- MDC Berlin-BuchRobert-Roessle-Strasse 10, Germany
- To whom correspondence should be addressed at EMBL Heidelberg, Meyerhofstrasse 1, D-69117 Heidelberg, Germany. Tel: +49 6221 387 526; Fax: +49 6221 387 517;
| |
Collapse
|
1043
|
Pan Q, Shai O, Misquitta C, Zhang W, Saltzman AL, Mohammad N, Babak T, Siu H, Hughes TR, Morris QD, Frey BJ, Blencowe BJ. Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. Mol Cell 2005; 16:929-41. [PMID: 15610736 DOI: 10.1016/j.molcel.2004.12.004] [Citation(s) in RCA: 234] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2004] [Revised: 11/18/2004] [Accepted: 12/08/2004] [Indexed: 01/27/2023]
Abstract
We describe the application of a microarray platform, which combines information from exon body and splice-junction probes, to perform a quantitative analysis of tissue-specific alternative splicing (AS) for thousands of exons in mammalian cells. Through this system, we have analyzed global features of AS in major mouse tissues. The results provide numerous inferences for the functions of tissue-specific AS, insights into how the evolutionary history of exons can impact on their inclusion levels, and also information on how global regulatory properties of AS define tissue type. Like global transcription profiles, global AS profiles reflect tissue identity. Interestingly, we find that transcription and AS act independently on different sets of genes in order to define tissue-specific expression profiles. These results demonstrate the utility of our quantitative microarray platform and data for revealing important global regulatory features of AS.
Collapse
Affiliation(s)
- Qun Pan
- Banting and Best Department of Medical Research, University of Toronto, 112 College Street, Toronto, Ontario M5G 1L6, Canada
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1044
|
Petrovsky N, Silva D. The power of an integrated informatic and molecular approach to type 1 diabetes research. Ann N Y Acad Sci 2005; 1037:216-24. [PMID: 15699520 DOI: 10.1196/annals.1337.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Recent years have witnessed an explosive growth in available biological data. This includes a tremendous quantity of sequence data (e.g., biological structures, genetic and physical maps, pathways) generated by genome and transcriptome projects focused on humans, mice, and a multitude of other species. Diabetes research stands to greatly benefit from this data, which is distributed across public and private databases and the scientific literature. The increasing quantity and complexity of this biological data necessitates use of novel bioinformatics strategies for its efficient retrieval, analysis, and interpretation. Bioinformatic capability is becoming increasingly indispensable for fast and comprehensive analysis of biological data by diabetes researchers. There is great potential for diabetes scientists and clinicians to take advantage of recent bioinformatics and knowledge discovery developments to radically transform and advance this field of research. This paper will review advances in the field of bioinformatics relevant to diabetes research and preview a new specialty diabetes database, Diabetaeta, that we are creating to serve as a central bioinformatic portal for type 1 diabetes research, as well as serving as a public repository for beta cell gene and protein expression data.
Collapse
Affiliation(s)
- Nikolai Petrovsky
- Director, Diabetes and Endocrinology, Flinders Medical Centre, Bedford Park, South Australia, Australia.
| | | |
Collapse
|
1045
|
Nameki N, Tochio N, Koshiba S, Inoue M, Yabuki T, Aoki M, Seki E, Matsuda T, Fujikura Y, Saito M, Ikari M, Watanabe M, Terada T, Shirouzu M, Yoshida M, Hirota H, Tanaka A, Hayashizaki Y, Güntert P, Kigawa T, Yokoyama S. Solution structure of the PWWP domain of the hepatoma-derived growth factor family. Protein Sci 2005; 14:756-64. [PMID: 15689505 PMCID: PMC2279273 DOI: 10.1110/ps.04975305] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Among the many PWWP-containing proteins, the largest group of homologous proteins is related to hepatoma-derived growth factor (HDGF). Within a well-conserved region at the extreme N-terminus, HDGF and five HDGF-related proteins (HRPs) always have a PWWP domain, which is a module found in many chromatin-associated proteins. In this study, we determined the solution structure of the PWWP domain of HDGF-related protein-3 (HRP-3) by NMR spectroscopy. The structure consists of a five-stranded beta-barrel with a PWWP-specific long loop connecting beta2 and beta3 (PR-loop), followed by a helical region including two alpha-helices. Its structure was found to have a characteristic solvent-exposed hydrophobic cavity, which is composed of an abundance of aromatic residues in the beta1/beta2 loop (beta-beta arch) and the beta3/beta4 loop. A similar ligand binding cavity occurs at the corresponding position in the Tudor, chromo, and MBT domains, which have structural and probable evolutionary relationships with PWWP domains. These findings suggest that the PWWP domains of the HDGF family bind to some component of chromatin via the cavity.
Collapse
Affiliation(s)
- Nobukazu Nameki
- RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1046
|
Yamano Y, Ohyama K, Ohta M, Sano T, Ritani A, Shimada J, Ashida N, Yoshida E, Ikehara K, Morishima I. A novel spermatogenesis related factor-2 (SRF-2) gene expression affected by TCDD treatment. Endocr J 2005; 52:75-81. [PMID: 15758561 DOI: 10.1507/endocrj.52.75] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We have cloned a gene which is specifically expressed at the stage of sexual maturation in the rat testis by means of differential display, and have named it spermatogenesis-related factor-2 (SRF-2). Testicular expression was first detected at 5 weeks of age, and its level of the expression increased up to 7 weeks, and was maintained even at 63 weeks. Its cDNA was 2,789 bp in length and encoded an open reading frame of 718 amino acids. This gene was mainly expressed in the spermatocyte, judging from the result of in situ hybridization. The hypothetical gene product had a motif highly homologous with RabGAP/TBC protein. Taken together, this gene is considered to have some important functions for meiosis. The gene expression was significantly decreased by treatment with TCDD, a candidate endocrine disruptor, when administered to male rats of the nursling period. Body weight and testis weight were decreased by the treatment, but even then the sperm concentration in cauda epididymis was not changed significantly. SRF-2 gene may be a promising biomarker to construct a detection system of uncertain endocrine disruptors.
Collapse
Affiliation(s)
- Yoshiaki Yamano
- Department of Biochemistry and Biotechnology, Faculty of Agriculture, Tottori University, Tottori, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
1047
|
Shiraishi M, Loutzenhiser RD, Walsh MP. A highly sensitive method for quantification of myosin light chain phosphorylation by capillary isoelectric focusing with laser-induced fluorescence detection. Electrophoresis 2005; 26:571-80. [PMID: 15690429 DOI: 10.1002/elps.200410119] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Activation of myosin II by phosphorylation of the 20 kDa regulatory light chains (LC20) has been implicated in numerous contractile and motile events, e.g., smooth muscle contraction, cytokinesis, and cell migration. The ability to analyze LC20 phosphorylation in minute samples is critical to determine the importance of LC20 phosphorylation in diverse physiological processes. We have developed a method for the separation and quantification of unphosphorylated, monophosphorylated, and diphosphorylated LC20 with a detection limit of 1 pg (50 amol). LC20 is initially isolated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and transblotted to a polyvinlyidene difluoride (PVDF) membrane. The region of the membrane containing the LC20 band (identified by electrophoresis of purified LC20 in a neighboring lane) is cut out and fluorescently labeled with Alexa Fluor 488 C5 maleimide. The labeled LC20 is eluted from the membrane with detergent and subjected to capillary isoelectric focusing (CIEF) to separate unphosphorylated, mono-, and diphosphorylated LC20, which are detected and quantified by laser-induced fluorescence (LIF). A linear relationship between log(peak area) and log(LC20 amount) is observed over the range of 50 amol-150 fmol. Quantification of LC20 phosphorylation by CIEF with LIF detection was compared with three commonly used methods with much lower levels of sensitivity: urea/glycerol-PAGE with Western blotting, phosphorylation by [gamma-32P]ATP with Cerenkov counting, and phosphorylation by [gamma-32P]ATP followed by SDS-PAGE, autoradiography, and scanning densitometry. All four methods gave very similar quantitative results, the major difference being that the new method exhibits 3000-fold enhanced sensitivity. This method is therefore applicable to quantitative analysis of phosphorylation of minute quantities of LC20.
Collapse
Affiliation(s)
- Mitsuya Shiraishi
- Smooth Muscle Research Group, University of Calgary, Faculty of Medicine, Calgary, Alberta, Canada
| | | | | |
Collapse
|
1048
|
Perincheri S, Dingle RWC, Peterson ML, Spear BT. Hereditary persistence of alpha-fetoprotein and H19 expression in liver of BALB/cJ mice is due to a retrovirus insertion in the Zhx2 gene. Proc Natl Acad Sci U S A 2005; 102:396-401. [PMID: 15626755 PMCID: PMC544306 DOI: 10.1073/pnas.0408555102] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2004] [Indexed: 11/18/2022] Open
Abstract
The alpha-fetoprotein (AFP) and H19 genes are transcribed at high levels in the mammalian fetal liver but are rapidly repressed postnatally. This repression in the liver is controlled, at least in part, by the Afr1 gene. Afr1 was defined >25 years ago when BALB/cJ mice were found to have 5- to 20-fold higher adult serum AFP levels compared with all other mouse strains; subsequent studies showed that this elevation was due to higher Afp expression in the liver. H19, which has become a model for genomic imprinting, was identified initially in a screen for Afr1-regulated genes. The BALB/cJ allele (Afr1(b)) is recessive to the wild-type allele (Afr1(a)), consistent with the idea that Afr1 functions as a repressor. By high-resolution mapping, we identified a gene that maps to the Afr1 interval on chromosome 15 and encodes a putative zinc fingers and homeoboxes (ZHX) protein. In BALB/cJ mice, this gene contains a murine endogenous retrovirus within its first intron and produces predominantly an aberrant transcript that no longer encodes a functional protein. Liver-specific overexpression of a Zhx2 transgene restores wild-type H19 repression on a BALB/cJ background, confirming that this gene is responsible for hereditary persistence of Afp and H19 in the livers of BALB/cJ mice. Thus we have identified a genetically defined transcription factor that is involved in developmental gene silencing in mammals. We present a model to explain the liver-specific phenotype in BALB/cJ mice, even though Afr1 is a ubiquitously expressed gene.
Collapse
Affiliation(s)
- Sudhir Perincheri
- Department of Microbiology, Immunology, and Molecular Genetics and Markey Cancer Center, University of Kentucky College of Medicine, Lexington, KY 40536
| | | | | | | |
Collapse
|
1049
|
Ng P, Wei CL, Sung WK, Chiu KP, Lipovich L, Ang CC, Gupta S, Shahab A, Ridwan A, Wong CH, Liu ET, Ruan Y. Gene identification signature (GIS) analysis for transcriptome characterization and genome annotation. Nat Methods 2005; 2:105-11. [PMID: 15782207 DOI: 10.1038/nmeth733] [Citation(s) in RCA: 174] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2004] [Accepted: 12/15/2004] [Indexed: 11/09/2022]
Abstract
We have developed a DNA tag sequencing and mapping strategy called gene identification signature (GIS) analysis, in which 5' and 3' signatures of full-length cDNAs are accurately extracted into paired-end ditags (PETs) that are concatenated for efficient sequencing and mapped to genome sequences to demarcate the transcription boundaries of every gene. GIS analysis is potentially 30-fold more efficient than standard cDNA sequencing approaches for transcriptome characterization. We demonstrated this approach with 116,252 PET sequences derived from mouse embryonic stem cells. Initial analysis of this dataset identified hundreds of previously uncharacterized transcripts, including alternative transcripts of known genes. We also uncovered several intergenically spliced and unusual fusion transcripts, one of which was confirmed as a trans-splicing event and was differentially expressed. The concept of paired-end ditagging described here for transcriptome analysis can also be applied to whole-genome analysis of cis-regulatory and other DNA elements and represents an important technological advance for genome annotation.
Collapse
Affiliation(s)
- Patrick Ng
- Genome Institute of Singapore, 60 Biopolis Street, Genome 02-01, Singapore 138672
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1050
|
Irizarry KJL, Merriman B, Bahamonde ME, Wong ML, Licinio J. The evolution of signaling complexity suggests a mechanism for reducing the genomic search space in human association studies. Mol Psychiatry 2005; 10:14-26. [PMID: 15618953 DOI: 10.1038/sj.mp.4001576] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
The size complexity of the human genome has been traditionally viewed as an obstacle that frustrates efforts aimed at identifying the genetic correlates of complex human phenotypes. As such complex phenotypes are attributed to the combined action of numerous genomic loci, attempts to identify the underlying multi-locus interactions may produce a combinatorial sum of false positives that drown out the real signal. Faced with such grim prospects for successfully identifying the genetic basis of complex phenotypes, many geneticists simply disregard epistatic interactions altogether. However, the emerging picture from systems biology is that the cellular programs encoded by the genome utilize nested signaling hierarchies to integrate a number of loosely coupled, semiautonomous, and functionally distinct genetic networks. The current view of these modules is that connections encoding inter-module signaling are relatively sparse, while the gene-to-gene (protein-to-protein) interactions within a particular module are typically denser. We believe that each of these modules is encoded by a finite set of discontinuous, sequence-specific, genomic intervals that are functionally linked to association rules, which correlate directly to features in the environment. Furthermore, because these environmental association rules have evolved incrementally over time, we explore theoretical models of cellular evolution to better understand the role of evolution in genomic complexity. Specifically, we present a conceptual framework for (1) reducing genomic complexity by partitioning the genome into subsets composed of functionally distinct genetic modules and (2) improving the selection of coding region SNPs, which results in an increased probability of identifying functionally relevant SNPs. Additionally, we introduce the notion of 'genomic closure,' which provides a quantitative measure of how functionally insulated a specific genetic module might be from the influence of the rest of the genome. We suggest that the development and use of theoretical models can provide insight into the nature of biological systems and may lead to significant improvements in computational algorithms designed to reduce the complexity of the human genome.
Collapse
Affiliation(s)
- K J L Irizarry
- Center for Pharmacogenomics and Clinical Pharmacology, Neuropsychiatric Institute, David Geffen School of Medicine, University California, Los Angeles, CA 90095, USA.
| | | | | | | | | |
Collapse
|