1051
|
Arakawa K, Suzuki H, Fujishima K, Fujimoto K, Ueda S, Matsui M, Tomita M. A Comprehensive Software Suite for the Analysis of cDNAs. GENOMICS, PROTEOMICS & BIOINFORMATICS 2005; 3:179-88. [PMID: 16487083 PMCID: PMC5172547 DOI: 10.1016/s1672-0229(05)03023-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We have developed a comprehensive software suite for bioinformatics research of cDNAs; it is aimed at rapid characterization of the features of genes and the proteins they code. Methods implemented include the detection of translation initiation and termination signals, statistical analysis of codon usage, comparative study of amino acid composition, comparative modeling of the structures of product proteins, prediction of alternative splice forms, and metabolic pathway reconstruction. The software package is freely available under the GNU General Public License at http://www.g-language.org/data/cdna/.
Collapse
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan.
| | | | | | | | | | | | | |
Collapse
|
1052
|
Hubbard SJ, Grafham DV, Beattie KJ, Overton IM, McLaren SR, Croning MDR, Boardman PE, Bonfield JK, Burnside J, Davies RM, Farrell ER, Francis MD, Griffiths-Jones S, Humphray SJ, Hyland C, Scott CE, Tang H, Taylor RG, Tickle C, Brown WRA, Birney E, Rogers J, Wilson SA. Transcriptome analysis for the chicken based on 19,626 finished cDNA sequences and 485,337 expressed sequence tags. Genome Res 2005; 15:174-83. [PMID: 15590942 PMCID: PMC540287 DOI: 10.1101/gr.3011405] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2004] [Accepted: 10/04/2004] [Indexed: 12/22/2022]
Abstract
We present an analysis of the chicken (Gallus gallus) transcriptome based on the full insert sequences for 19,626 cDNAs, combined with 485,337 EST sequences. The cDNA data set has been functionally annotated and describes a minimum of 11,929 chicken coding genes, including the sequence for 2260 full-length cDNAs together with a collection of noncoding (nc) cDNAs that have been stringently filtered to remove untranslated regions of coding mRNAs. The combined collection of cDNAs and ESTs describe 62,546 clustered transcripts and provide transcriptional evidence for a total of 18,989 chicken genes, including 88% of the annotated Ensembl gene set. Analysis of the ncRNAs reveals a set that is highly conserved in chickens and mammals, including sequences for 14 pri-miRNAs encoding 23 different miRNAs. The data sets described here provide a transcriptome toolkit linked to physical clones for bioinformaticians and experimental biologists who wish to use chicken systems as a low-cost, accessible alternative to mammals for the analysis of vertebrate development, immunology, and cell biology.
Collapse
Affiliation(s)
- Simon J Hubbard
- Faculty of Life Sciences, The University of Manchester, Manchester, M60 1QD, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1053
|
Liu ET. Genomic technologies and the interrogation of the transcriptome. Mech Ageing Dev 2005; 126:153-9. [PMID: 15610774 DOI: 10.1016/j.mad.2004.09.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Functional genomics refers to the study of whole genomes and the function of its constituent parts to explain biological processes. Though these investigations may involve whole proteome analysis, the primary focus is on the transcriptome and how it is regulated. Recent advances in technologies that can interrogate cellular transcripts on a genome-wide scale seek the complete disclosure of the transcriptome over time-intervals and across many different cellular states. This massively complex data when viewed as a whole can provide surprisingly precise assessment of cellular conditions. Moreover, these data can define hierarchies of importance and have shown us new transcriptional elements. Herein, we describe the technologies and the experimental strategies to study the transcriptome that would be pertinent to cancer and ageing research.
Collapse
Affiliation(s)
- Edison T Liu
- Genome Institute of Singapore, 60 Biopolis Street, #02-01, Genome Building, Singapore 138672, Singapore.
| |
Collapse
|
1054
|
Marionneau C, Couette B, Liu J, Li H, Mangoni ME, Nargeot J, Lei M, Escande D, Demolombe S. Specific pattern of ionic channel gene expression associated with pacemaker activity in the mouse heart. J Physiol 2005; 562:223-34. [PMID: 15498808 PMCID: PMC1665484 DOI: 10.1113/jphysiol.2004.074047] [Citation(s) in RCA: 274] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2004] [Accepted: 10/21/2004] [Indexed: 11/08/2022] Open
Abstract
Even though sequencing of the mammalian genome has led to the discovery of a large number of ionic channel genes, identification of the molecular determinants of cellular electrical properties in different regions of the heart has been rarely obtained. We developed a high-throughput approach capable of simultaneously assessing the expression pattern of ionic channel repertoires from different regions of the mouse heart. By using large-scale real-time RT-PCR, we have profiled 71 channels and related genes in the sinoatrial node (SAN), atrioventricular node (AVN), the atria (A) and ventricles (V). Hearts from 30 adult male C57BL/6 mice were microdissected and RNA was isolated from six pools of five mice each. TaqMan data were analysed using the threshold cycle (C(t)) relative quantification method. Cross-contamination of each region was checked with expression of the atrial and ventricular myosin light chains. Two-way hierarchical clustering analysis of the 71 genes successfully classified the six pools from the four distinct regions. In comparison with the A, the SAN and AVN were characterized by higher expression of Nav beta 1, Nav beta 3, Cav1.3, Cav3.1 and Cav alpha 2 delta 2, and lower expression of Kv4.2, Cx40, Cx43 and Kir3.1. In addition, the SAN was characterized by higher expression of HCN1 and HCN4, and lower expression of RYR2, Kir6.2, Cav beta 2 and Cav gamma 4. The AVN was characterized by higher expression of Nav1.1, Nav1.7, Kv1.6, Kvbeta1, MinK and Cav gamma 7. Other gene expression profiles discriminate between the ventricular and the atrial myocardium. The present study provides the first genome-scale regional ionic channel expression profile in the mouse heart.
Collapse
Affiliation(s)
- Céline Marionneau
- L'institut du thorax, INSERM U533, Faculté de Médecine, 44035 Nantes cedex, France
| | | | | | | | | | | | | | | | | |
Collapse
|
1055
|
Pollet N, Muncke N, Verbeek B, Li Y, Fenger U, Delius H, Niehrs C. An atlas of differential gene expression during early Xenopus embryogenesis. Mech Dev 2004; 122:365-439. [PMID: 15763213 DOI: 10.1016/j.mod.2004.11.009] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2004] [Revised: 10/11/2004] [Accepted: 11/07/2004] [Indexed: 01/24/2023]
Abstract
We have carried out a large-scale, semi-automated whole-mount in situ hybridization screen of 8369 cDNA clones in Xenopus laevis embryos. We confirm that differential gene expression is prevalent during embryogenesis since 24% of the clones are expressed non-ubiquitously and 8% are organ or cell type specific marker genes. Sequence analysis and clustering yielded 723 unique genes displaying a differential expression pattern. Of these, 18% were already described in Xenopus, 47% have homologs and 35% are lacking significant sequence similarity in databases. Many of them encode known developmental regulators. We classified 363 of the 723 genes for which a Gene Ontology annotation for molecular function could be attributed and found 'DNA binding' and 'enzyme' the most represented terms. The most common protein domains encoded in these embryonic, differentially expressed genes are the homeobox and RNA Recognition Motif (RRM). Fifty-nine putative orthologs of human disease genes, and 254 organ or cell specific marker genes were identified. Markers were found for nasal placode and archenteron roof, organs for which a specific marker was previously unavailable. Markers were also found for novel subdomains of various other organs. The tissues for which most markers were found are muscle and epidermis. Expression of cell cycle regulators fell in two classes, containing proliferation-promoting and anti-proliferative genes, respectively. We identified 66 new members of the BMP4, chromatin, endoplasmic reticulum, and karyopherin synexpression groups, thus providing a first glimpse of their probable cellular roles. Cluster analysis of tissues to measure tissue relatedness yielded some unorthodox affinities besides expectable lineage relationships. In conclusion, this study represents an atlas of gene expression patterns, which reveals embryonic regionalization, provides novel marker genes, and makes predictions about the functional role of unknown genes.
Collapse
Affiliation(s)
- Nicolas Pollet
- Division of Molecular Embryology, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany.
| | | | | | | | | | | | | |
Collapse
|
1056
|
Full-length cDNAs from chicken bursal lymphocytes to facilitate gene function analysis. Genome Biol 2004; 6:R6. [PMID: 15642098 PMCID: PMC549067 DOI: 10.1186/gb-2004-6-1-r6] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2004] [Revised: 10/26/2004] [Accepted: 12/07/2004] [Indexed: 12/04/2022] Open
Abstract
This article reports a cDNA collection representing more than 2000 new, full-length transcripts from a high-quality cDNA library. A large number of cDNA inserts were sequenced from a high-quality library of chicken bursal lymphocyte cDNAs. Comparisons to public gene databases indicate that the cDNA collection represents more than 2,000 new, full-length transcripts. This resource defines the structure and the coding potential of a large fraction of B-cell specific and housekeeping genes whose function can be analyzed by disruption in the chicken DT40 B-cell line.
Collapse
|
1057
|
Lenhard B, Sandelin A, Mendoza L, Engström P, Jareborg N, Wasserman WW. Identification of conserved regulatory elements by comparative genome analysis. J Biol 2004; 2:13. [PMID: 12760745 PMCID: PMC193685 DOI: 10.1186/1475-4924-2-13] [Citation(s) in RCA: 180] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2002] [Revised: 03/21/2003] [Accepted: 04/08/2003] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND For genes that have been successfully delineated within the human genome sequence, most regulatory sequences remain to be elucidated. The annotation and interpretation process requires additional data resources and significant improvements in computational methods for the detection of regulatory regions. One approach of growing popularity is based on the preferential conservation of functional sequences over the course of evolution by selective pressure, termed 'phylogenetic footprinting'. Mutations are more likely to be disruptive if they appear in functional sites, resulting in a measurable difference in evolution rates between functional and non-functional genomic segments. RESULTS We have devised a flexible suite of methods for the identification and visualization of conserved transcription-factor-binding sites. The system reports those putative transcription-factor-binding sites that are both situated in conserved regions and located as pairs of sites in equivalent positions in alignments between two orthologous sequences. An underlying collection of metazoan transcription-factor-binding profiles was assembled to facilitate the study. This approach results in a significant improvement in the detection of transcription-factor-binding sites because of an increased signal-to-noise ratio, as demonstrated with two sets of promoter sequences. The method is implemented as a graphical web application, ConSite, which is at the disposal of the scientific community at http://www.phylofoot.org/. CONCLUSIONS Phylogenetic footprinting dramatically improves the predictive selectivity of bioinformatic approaches to the analysis of promoter sequences. ConSite delivers unparalleled performance using a novel database of high-quality binding models for metazoan transcription factors. With a dynamic interface, this bioinformatics tool provides broad access to promoter analysis with phylogenetic footprinting.
Collapse
Affiliation(s)
- Boris Lenhard
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
| | - Albin Sandelin
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
| | - Luis Mendoza
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
- Current address: Serono Research and Development, CH-1121 Geneva 20, Switzerland
| | - Pär Engström
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
| | - Niclas Jareborg
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
- Current address: AstraZeneca Research and Development, S-151 85 Södertälje, Sweden
| | - Wyeth W Wasserman
- Center for Genomics and Bioinformatics, Karolinska Institutet, 171 77 Stockholm, Sweden
- Current address: Centre for Molecular Medicine and Therapeutics, University of British Columbia, Vancouver, BC V5Z 4H4, Canada
| |
Collapse
|
1058
|
Niida A, Hiroko T, Kasai M, Furukawa Y, Nakamura Y, Suzuki Y, Sugano S, Akiyama T. DKK1, a negative regulator of Wnt signaling, is a target of the beta-catenin/TCF pathway. Oncogene 2004; 23:8520-6. [PMID: 15378020 DOI: 10.1038/sj.onc.1207892] [Citation(s) in RCA: 445] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Wnt signaling plays an important role in embryonic development and tumorigenesis. These biological effects are exerted by activation of the beta-catenin/TCF transcription complex and consequent regulation of a set of downstream genes. TCF-binding elements have been found in the promoter regions of many TCF target genes and characterized by a highly conserved consensus sequence. Utilizing this consensus sequence, we performed an in silico screening for new TCF target genes. Through computational screening and subsequent experimental analysis, we identified a novel TCF target gene, DKK1, which has been shown to be a potent inhibitor of Wnt signaling. Our finding suggests the existence of a novel feedback loop in Wnt signaling.
Collapse
Affiliation(s)
- Atsushi Niida
- Laboratory of Molecular and Genetic Information, Institute of Molecular and Cellular Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan
| | | | | | | | | | | | | | | |
Collapse
|
1059
|
Baguet A, Epler J, Wen KW, Bix M. A Leishmania major response locus identified by interval-specific congenic mapping of a T helper type 2 cell bias-controlling quantitative trait locus. ACTA ACUST UNITED AC 2004; 200:1605-12. [PMID: 15596523 PMCID: PMC2211989 DOI: 10.1084/jem.20040334] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
The propensity of naive CD4 T cells to become T helper (Th) type 2 cells correlates with susceptibility to infection by the protozoal parasite Leishmania major. Using genetic linkage analysis, we earlier identified Dice1 as a Th2 cell bias-controlling quantitative trait locus on chromosome 16. Using interval-specific congenic mapping, we now resolve Dice1 into two independent genetic loci, Dice1.1 and Dice1.2, which control Il4 expression from naive Th cells and thereby indirectly control Th2 cell bias. Interestingly, only one of the two congenic intervals containing Dice1.1 and Dice1.2, respectively, also contained an L. major response locus, indicating that L. major responsiveness can be insensitive to determinants that influence Th2 cell bias by controlling naive T cell Il4 expression. These results lay the groundwork for identifying the Dice1.1 and Dice1.2 genes controlling naive T cell Il4 expression and L. major responses, and for testing whether these control other Th2 cell–dependent processes such as worm expulsion, allergic asthma, and dermatitis.
Collapse
Affiliation(s)
- Aurelie Baguet
- Dept. of Immunology, University of Washington, 1959 N.E. Pacific St., HSC I607I, Seattle, WA 98195, USA
| | | | | | | |
Collapse
|
1060
|
Wahl MB, Heinzmann U, Imai K. LongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse. Bioinformatics 2004; 21:1393-400. [PMID: 15591356 DOI: 10.1093/bioinformatics/bti207] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Owing to its increased tag length, LongSAGE tags are expected to be more reliable in direct assignment to genome sequences. Therefore, we evaluated the use of LongSAGE data in genome annotation by using our LongSAGE dataset of 202 015 tags (consisting of 41 718 unique tags), experimentally generated from mouse embryonic tail libraries. RESULTS A fraction of LongSAGE tags could not be unambiguously assigned to its gene, due to the presence of widely conserved sequences downstream of particular CATG anchor sites. The presence of alternative forms of transcripts was confirmed in 45% of all detected genes. Surprisingly, a large fraction of LongSAGE tags with hits to the genome (66%) could not be assigned to any gene annotated in EnsEMBL. Among such cases, 2098 LongSAGE tags fell into a region containing a putative gene predicted by GenScan, providing experimental evidence for the presence of real genes, while 9112 genes were found out to be left out or wrongly annotated by the EnsEMBL pipeline. CONCLUSIONS LongSAGE transcriptome data can significantly improve the genome annotation by identifying novel genes and alternative transcripts, even in the case of thus far best-characterized organisms like the mouse. CONTACT imai@gsf.de.
Collapse
Affiliation(s)
- Matthias B Wahl
- Institute of Developmental Genetics, GSF-National Research Center for Environment and Health, Ingolstädter Landstrasse 1, D-85764 Neuherberg, Germany
| | | | | |
Collapse
|
1061
|
Stamm S, Ben-Ari S, Rafalska I, Tang Y, Zhang Z, Toiber D, Thanaraj TA, Soreq H. Function of alternative splicing. Gene 2004; 344:1-20. [PMID: 15656968 DOI: 10.1016/j.gene.2004.10.022] [Citation(s) in RCA: 671] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2004] [Revised: 09/10/2004] [Accepted: 10/21/2004] [Indexed: 02/06/2023]
Abstract
Alternative splicing is one of the most important mechanisms to generate a large number of mRNA and protein isoforms from the surprisingly low number of human genes. Unlike promoter activity, which primarily regulates the amount of transcripts, alternative splicing changes the structure of transcripts and their encoded proteins. Together with nonsense-mediated decay (NMD), at least 25% of all alternative exons are predicted to regulate transcript abundance. Molecular analyses during the last decade demonstrate that alternative splicing determines the binding properties, intracellular localization, enzymatic activity, protein stability and posttranslational modifications of a large number of proteins. The magnitude of the effects range from a complete loss of function or acquisition of a new function to very subtle modulations, which are observed in the majority of cases reported. Alternative splicing factors regulate multiple pre-mRNAs and recent identification of physiological targets shows that a specific splicing factor regulates pre-mRNAs with coherent biological functions. Therefore, evidence is now accumulating that alternative splicing coordinates physiologically meaningful changes in protein isoform expression and is a key mechanism to generate the complex proteome of multicellular organisms.
Collapse
Affiliation(s)
- Stefan Stamm
- Institute for Biochemistry, University of Erlangen, Fahrstrasse 17, 91054 Erlangen, Germany.
| | | | | | | | | | | | | | | |
Collapse
|
1062
|
Abstract
MOTIVATION In this paper, we shall examine the evolution of domain architectures across 62 genomes of known phylogeny including all kingdoms of life. We look in particular at the possibility of convergent evolution, with a view to determining the extent to which the architectures observed in the genomes are due to functional necessity or evolutionary descent. We used domains of known structure, because from this and other information we know their evolutionary relationships. We use a range of methods including phylogenetic grouping, sequence similarity/alignment, mutation rates and comparative genomics to approach this difficult problem from several angles. RESULTS Although we do not claim an exhaustive analysis, we conclude that between 0.4 and 4% of sequences are involved in convergent evolution of domain architectures, and expect the actual number to be close to the lower bound. We also made two incidental observations, albeit on a small sample: the events leading to convergent evolution appear to be random with no functional or structural preferences, and changes in the number of tandem repeat domains occur more readily than changes which alter the domain composition. CONCLUSION The principal conclusion is that the observed domain architectures of the sequences in the genomes are driven by evolutionary descent rather than functional necessity. CONTACT gough@supfam.org.
Collapse
Affiliation(s)
- Julian Gough
- RIKEN Genomic Sciences Centre, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
| |
Collapse
|
1063
|
Zheng ZM. Regulation of alternative RNA splicing by exon definition and exon sequences in viral and mammalian gene expression. J Biomed Sci 2004; 11:278-94. [PMID: 15067211 PMCID: PMC2442652 DOI: 10.1007/bf02254432] [Citation(s) in RCA: 132] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2003] [Accepted: 11/12/2003] [Indexed: 12/16/2022] Open
Abstract
Intron removal from a pre-mRNA by RNA splicing was once thought to be controlled mainly by intron splicing signals. However, viral and other eukaryotic RNA exon sequences have recently been found to regulate RNA splicing, polyadenylation, export, and nonsense-mediated RNA decay in addition to their coding function. Regulation of alternative RNA splicing by exon sequences is largely attributable to the presence of two major cis-acting elements in the regulated exons, the exonic splicing enhancer (ESE) and the suppressor or silencer (ESS). Two types of ESEs have been verified from more than 50 genes or exons: purine-rich ESEs, which are the more common, and non-purine-rich ESEs. In contrast, the sequences of ESSs identified in approximately 20 genes or exons are highly diverse and show little similarity to each other. Through interactions with cellular splicing factors, an ESE or ESS determines whether or not a regulated splice site, usually an upstream 3' splice site, will be used for RNA splicing. However, how these elements function precisely in selecting a regulated splice site is only partially understood. The balance between positive and negative regulation of splice site selection likely depends on the cis-element's identity and changes in cellular splicing factors under physiological or pathological conditions.
Collapse
Affiliation(s)
- Zhi-Ming Zheng
- HIV and AIDS Malignancy Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
1064
|
Horisawa K, Tateyama S, Ishizaka M, Matsumura N, Takashima H, Miyamoto-Sato E, Doi N, Yanagawa H. In vitro selection of Jun-associated proteins using mRNA display. Nucleic Acids Res 2004; 32:e169. [PMID: 15576676 PMCID: PMC535696 DOI: 10.1093/nar/gnh167] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Although yeast two-hybrid assay and biochemical methods combined with mass spectrometry have been successfully employed for the analyses of protein-protein interactions in the field of proteomics, these methods encounter various difficulties arising from the usage of living cells, including inability to analyze toxic proteins and restriction of testable interaction conditions. Totally in vitro display technologies such as ribosome display and mRNA display are expected to circumvent these difficulties. In this study, we applied an mRNA display technique to screening for interactions of a basic leucine zipper domain of Jun protein in a mouse brain cDNA library. By performing iterative affinity selection and sequence analyses, we selected 16 novel Jun-associated protein candidates in addition to four known interactors. By means of real-time PCR and pull-down assay, 10 of the 16 newly discovered candidates were confirmed to be direct interactors with Jun in vitro. Furthermore, interaction of 6 of the 10 proteins with Jun was observed in cultured cells by means of co-immunoprecipitation and observation of subcellular localization. These results demonstrate that this in vitro display technology is effective for the discovery of novel protein-protein interactions and can contribute to the comprehensive mapping of protein-protein interactions.
Collapse
Affiliation(s)
- Kenichi Horisawa
- Department of Biosciences and Informatics, Faculty of Science and Technology, Keio University, Yokohama 223-8522, Japan
| | | | | | | | | | | | | | | |
Collapse
|
1065
|
Ronshaugen M, Levine M. Visualization of trans-Homolog Enhancer-Promoter Interactions at the Abd-B Hox Locus in the Drosophila Embryo. Dev Cell 2004; 7:925-32. [PMID: 15572134 DOI: 10.1016/j.devcel.2004.11.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2004] [Revised: 11/01/2004] [Accepted: 11/05/2004] [Indexed: 11/29/2022]
Abstract
The Hox gene Abdominal-B (Abd-B) controls the morphogenesis of posterior abdominal segments in Drosophila. Expression is regulated by a series of 3' enhancers that are themselves transcribed. RNA FISH was used to visualize nascent transcripts associated with coding and noncoding regions of Abd-B in developing embryos. Confocal imaging suggests that distal enhancers often loop to the Abd-B promoter region. Surprisingly, enhancers located on one chromosome frequently associate with the Abd-B transcription unit located on the other homolog. These trans-homolog interactions can be interpreted as the direct visualization of a genetic phenomenon known as transvection, whereby certain mutations in Abd-B can be rescued in trans by the other copy of the gene. A 10 kb sequence in the 3' flanking region mediates tight pairing of Abd-B alleles, thereby facilitating trans looping of distal enhancers. Such trans-homolog interactions might be a common mechanism of gene regulation in higher metazoans.
Collapse
Affiliation(s)
- Matthew Ronshaugen
- Department of Molecular Cell Biology, Division of Genetics, Center for Integrative Genomics, University of California, Berkeley, Berkeley, CA 94720, USA.
| | | |
Collapse
|
1066
|
Kasukawa T, Katayama S, Kawaji H, Suzuki H, Hume DA, Hayashizaki Y. Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis. Genomics 2004; 84:913-21. [PMID: 15533708 DOI: 10.1016/j.ygeno.2004.08.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2004] [Accepted: 08/16/2004] [Indexed: 10/26/2022]
Abstract
The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at .
Collapse
Affiliation(s)
- Takeya Kasukawa
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045, Japan.
| | | | | | | | | | | |
Collapse
|
1067
|
Sato Y, Suganami H, Hamada C, Yoshimura I, Yoshida T, Yoshimura K. Designing a multistage, SNP-based, genome screen for common diseases. J Hum Genet 2004; 49:669-676. [PMID: 15635486 DOI: 10.1007/s10038-004-0205-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2004] [Accepted: 09/18/2004] [Indexed: 10/26/2022]
Abstract
A genome-wide linkage equilibrium mapping is an emerging strategy to identify risk-modifying genes for common diseases, despite unsettled controversies upon many aspects, including its premises, designs, marker choices and cost benefits. One large-scale attempt in Japan aims to identify disease-associated single nucleotide polymorphisms (SNPs) for five diseases among the Japanese population: Alzheimer's disease, gastric cancer, diabetes, hypertension and asthma. Following an initial screening of c.a. 100,000 SNPs on 940 subjects (five diseases x 188 patients) to select about 2,000 SNPs, we compared which subsequent screening design is more appropriate, and an additional one or two screens to further narrow down any disease-associated SNPs within a fixed total volume of 15,040,000 typings (2,000 SNPs x five diseases x 1,504 subjects, comprising 752 cases and 752 controls). We employed a Monte Carlo simulation to evaluate the probability of identifying truly disease-associated SNPs. The results suggest the single additional stage design (i.e., total two-stage design including the initial screening of 100,000 SNPs) was more practicable for the simple reason that the gain in probability is considered insufficient relative to an associated increase in study complexity in the three-stage design.
Collapse
Affiliation(s)
- Yasunori Sato
- Faculty of Engineering, Tokyo University of Science, 1-3 Kagurazaka, Shinjyuku-ku, Tokyo, 162-8601, Japan.
- Genetics Division, National Cancer Center Research Institute, Tokyo, Japan.
| | - Hideki Suganami
- Faculty of Engineering, Tokyo University of Science, 1-3 Kagurazaka, Shinjyuku-ku, Tokyo, 162-8601, Japan
- Biostatistics and Data Management Department, Kowa Co., Ltd., Tokyo, Japan
| | - Chikuma Hamada
- Faculty of Engineering, Tokyo University of Science, 1-3 Kagurazaka, Shinjyuku-ku, Tokyo, 162-8601, Japan
| | - Isao Yoshimura
- Faculty of Engineering, Tokyo University of Science, 1-3 Kagurazaka, Shinjyuku-ku, Tokyo, 162-8601, Japan
| | - Teruhiko Yoshida
- Genetics Division, National Cancer Center Research Institute, Tokyo, Japan
| | - Kimio Yoshimura
- Genetics Division, National Cancer Center Research Institute, Tokyo, Japan
| |
Collapse
|
1068
|
Kelleher RJ, Govindarajan A, Tonegawa S. Translational regulatory mechanisms in persistent forms of synaptic plasticity. Neuron 2004; 44:59-73. [PMID: 15450160 DOI: 10.1016/j.neuron.2004.09.013] [Citation(s) in RCA: 455] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Memory and synaptic plasticity exhibit distinct temporal phases, with long-lasting forms distinguished by their dependence on macromolecular synthesis. Prevailing models for the molecular mechanisms underlying long-lasting synaptic plasticity have largely focused on transcriptional regulation. However, a growing body of evidence now supports a crucial role for neuronal activity-dependent mRNA translation, which may occur in dendrites for a subset of neuronal mRNAs. Recent work has begun to define the signaling mechanisms coupling synaptic activation to the protein synthesis machinery. The ERK and mTOR signaling pathways have been shown to regulate the activity of the general translational machinery, while the translation of particular classes of mRNAs is additionally controlled by gene-specific mechanisms. Rapid enhancement of the synthesis of a diverse array of neuronal proteins through such mechanisms provides the components necessary for persistent forms of LTP and LTD. These findings have important implications for the synapse specificity and associativity of protein synthesis-dependent changes in synaptic strength.
Collapse
Affiliation(s)
- Raymond J Kelleher
- Howard Hughes Medical Institute, The Picower Center for Learning and Memory, RIKEN-MIT Neuroscience Research Center, Center for Cancer Research, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | | |
Collapse
|
1069
|
Abstract
The ENCyclopedia Of DNA Elements (ENCODE) Project aims to identify all functional elements in the human genome sequence. The pilot phase of the Project is focused on a specified 30 megabases (approximately 1%) of the human genome sequence and is organized as an international consortium of computational and laboratory-based scientists working to develop and apply high-throughput approaches for detecting all sequence elements that confer biological function. The results of this pilot phase will guide future efforts to analyze the entire human genome.
Collapse
|
1070
|
Watahiki A, Waki K, Hayatsu N, Shiraki T, Kondo S, Nakamura M, Sasaki D, Arakawa T, Kawai J, Harbers M, Hayashizaki Y, Carninci P. Libraries enriched for alternatively spliced exons reveal splicing patterns in melanocytes and melanomas. Nat Methods 2004; 1:233-9. [PMID: 15782199 DOI: 10.1038/nmeth719] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2004] [Accepted: 09/27/2004] [Indexed: 01/27/2023]
Abstract
It is becoming increasingly clear that alternative splicing enables the complex development and homeostasis of higher organisms. To gain a better understanding of how splicing contributes to regulatory pathways, we have developed an alternative splicing library approach for the identification of alternatively spliced exons and their flanking regions by alternative splicing sequence enriched tags sequencing. Here, we have applied our approach to mouse melan-c melanocyte and B16-F10Y melanoma cell lines, in which 5,401 genes were found to be alternatively spliced. These genes include those encoding important regulatory factors such as cyclin D2, Ilk, MAPK12, MAPK14, RAB4, melastatin 1 and previously unidentified splicing events for 436 genes. Real-time PCR further identified cell line-specific exons for Tmc6, Abi1, Sorbs1, Ndel1 and Snx16. Thus, the ASL approach proved effective in identifying splicing events, which suggest that alternative splicing is important in melanoma development.
Collapse
Affiliation(s)
- Akira Watahiki
- Genome Science Laboratory, RIKEN, Wako main campus, 2-1 Hirosawa, Wako, Saitama, 351-0198 Japan
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1071
|
Gerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, Klein SL, Old S, Rasooly R, Good P, Guyer M, Peck AM, Derge JG, Lipman D, Collins FS, Jang W, Sherry S, Feolo M, Misquitta L, Lee E, Rotmistrovsky K, Greenhut SF, Schaefer CF, Buetow K, Bonner TI, Haussler D, Kent J, Kiekhaus M, Furey T, Brent M, Prange C, Schreiber K, Shapiro N, Bhat NK, Hopkins RF, Hsie F, Driscoll T, Soares MB, Casavant TL, Scheetz TE, Brown-stein MJ, Usdin TB, Toshiyuki S, Carninci P, Piao Y, Dudekula DB, Ko MSH, Kawakami K, Suzuki Y, Sugano S, Gruber CE, Smith MR, Simmons B, Moore T, Waterman R, Johnson SL, Ruan Y, Wei CL, Mathavan S, Gunaratne PH, Wu J, Garcia AM, Hulyk SW, Fuh E, Yuan Y, Sneed A, Kowis C, Hodgson A, Muzny DM, McPherson J, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madari A, Young AC, Wetherby KD, Granite SJ, Kwong PN, Brinkley CP, Pearson RL, Bouffard GG, Blakesly RW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YSN, Griffith M, Griffith OL, Krzywinski MI, Liao N, Morin R, Morrin R, et alGerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, Klein SL, Old S, Rasooly R, Good P, Guyer M, Peck AM, Derge JG, Lipman D, Collins FS, Jang W, Sherry S, Feolo M, Misquitta L, Lee E, Rotmistrovsky K, Greenhut SF, Schaefer CF, Buetow K, Bonner TI, Haussler D, Kent J, Kiekhaus M, Furey T, Brent M, Prange C, Schreiber K, Shapiro N, Bhat NK, Hopkins RF, Hsie F, Driscoll T, Soares MB, Casavant TL, Scheetz TE, Brown-stein MJ, Usdin TB, Toshiyuki S, Carninci P, Piao Y, Dudekula DB, Ko MSH, Kawakami K, Suzuki Y, Sugano S, Gruber CE, Smith MR, Simmons B, Moore T, Waterman R, Johnson SL, Ruan Y, Wei CL, Mathavan S, Gunaratne PH, Wu J, Garcia AM, Hulyk SW, Fuh E, Yuan Y, Sneed A, Kowis C, Hodgson A, Muzny DM, McPherson J, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madari A, Young AC, Wetherby KD, Granite SJ, Kwong PN, Brinkley CP, Pearson RL, Bouffard GG, Blakesly RW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YSN, Griffith M, Griffith OL, Krzywinski MI, Liao N, Morin R, Morrin R, Palmquist D, Petrescu AS, Skalska U, Smailus DE, Stott JM, Schnerch A, Schein JE, Jones SJM, Holt RA, Baross A, Marra MA, Clifton S, Makowski KA, Bosak S, Malek J. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res 2004; 14:2121-7. [PMID: 15489334 PMCID: PMC528928 DOI: 10.1101/gr.2596504] [Show More Authors] [Citation(s) in RCA: 419] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
Collapse
|
1072
|
Marsischky G, LaBaer J. Many paths to many clones: a comparative look at high-throughput cloning methods. Genome Res 2004; 14:2020-8. [PMID: 15489321 DOI: 10.1101/gr.2528804] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The creation of genome-scale clone resources is a difficult and costly process, making it essential to maximize the efficiency of each step of clone creation. In this review, we compare the available commercial and open-source recombinational cloning methods with regard to their use in creating comprehensive open reading frame (ORF) clone collections with an emphasis on the properties requisite to use in a high-throughput setting. The most efficient strategy to the creation of ORF clone resources is to build a master clone collection that serves as a quality validated source for producing collections of expression clones. We examine the methods for recombinational cloning available for both the creation of master clones and their conversion into expression clones. Alternative approaches to creating clones involving mixing of cloning methods, including gap-repair cloning, are also explored.
Collapse
Affiliation(s)
- Gerald Marsischky
- Institute of Proteomics, Harvard Medical School, Department of Biological Chemistry and Molecular Pharmacology, Boston, Massachusetts 02115, USA.
| | | |
Collapse
|
1073
|
Furey TS, Diekhans M, Lu Y, Graves TA, Oddy L, Randall-Maher J, Hillier LW, Wilson RK, Haussler D. Analysis of human mRNAs with the reference genome sequence reveals potential errors, polymorphisms, and RNA editing. Genome Res 2004; 14:2034-40. [PMID: 15489323 PMCID: PMC528917 DOI: 10.1101/gr.2467904] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The NCBI Reference Sequence (RefSeq) project and the NIH Mammalian Gene Collection (MGC) together define a set of approximately 30,000 nonredundant human mRNA sequences with identified coding regions representing 17,000 distinct loci. These high-quality mRNA sequences allow for the identification of transcribed regions in the human genome sequence, and many researchers accept them as the correct representation of each defined gene sequence. Computational comparison of these mRNA sequences and the recently published essentially finished human genome sequence reveals several thousand undocumented nonsynonymous substitution and frame shift discrepancies between the two resources. Additional analysis is undertaken to verify that the euchromatic human genome is sufficiently complete--containing nearly the whole mRNA collection, thus allowing for a comprehensive analysis to be undertaken. Many of the discrepancies will prove to be genuine polymorphisms in the human population, somatic cell genomic variants, or examples of RNA editing. It is observed that the genome sequence variant has significant additional support from other mRNAs and ESTs, almost four times more often than does the mRNA variant, suggesting that the genome sequence is more accurate. In approximately 15% of these cases, there is substantial support for both variants, suggestive of an undocumented polymorphism. An initial screening against a 24-individual genomic DNA diversity panel verified 60% of a small set of potential single nucleotide polymorphisms from which successful results could be obtained. We also find statistical evidence that a few of these discrepancies are due to RNA editing. Overall, these results suggest that the mRNA collections may contain a substantial number of errors. For current and future mRNA collections, it may be prudent to fully reconcile each genome sequence discrepancy, classifying each as a polymorphism, site of RNA editing or somatic cell variation, or genome sequence error.
Collapse
Affiliation(s)
- Terrence S Furey
- Center for Biomolecular Science and Engineering, Department of Computer Science, University of California, Santa Cruz, Santa Cruz, California 95064, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
1074
|
Yuille M, Korn B, Moore T, Farmer AA, Carrino J, Prange C, Hayashizaki Y. The responsibility to share: sharing the responsibility. Genome Res 2004; 14:2015-9. [PMID: 15489320 DOI: 10.1101/gr.2677304] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- Martin Yuille
- Medical Research Council geneservice, Genome Campus, Hinxton, Cambridge, CB10 1SB, UK.
| | | | | | | | | | | | | |
Collapse
|
1075
|
Bonaldo MF, Bair TB, Scheetz TE, Snir E, Akabogu I, Bair JL, Berger B, Crouch K, Davis A, Eyestone ME, Keppel C, Kucaba TA, Lebeck M, Lin JL, de Melo AIR, Rehmann J, Reiter RS, Schaefer K, Smith C, Tack D, Trout K, Sheffield VC, Lin JJC, Casavant TL, Soares MB. 1274 full-open reading frames of transcripts expressed in the developing mouse nervous system. Genome Res 2004; 14:2053-63. [PMID: 15489326 PMCID: PMC528920 DOI: 10.1101/gr.2601304] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
As part of the trans-National Institutes of Health (NIH) Mouse Brain Molecular Anatomy Project (BMAP), and in close coordination with the NIH Mammalian Gene Collection Program (MGC), we initiated a large-scale project to clone, identify, and sequence the complete open reading frame (ORF) of transcripts expressed in the developing mouse nervous system. Here we report the analysis of the ORF sequence of 1274 cDNAs, obtained from 47 full-length-enriched cDNA libraries, constructed by using a novel approach, herein described. cDNA libraries were derived from size-fractionated cytoplasmic mRNA isolated from brain and eye tissues obtained at several embryonic stages and postnatal days. Altogether, including the full-ORF MGC sequences derived from these libraries by the MGC sequencing team, NIH_BMAP full-ORF sequences correspond to approximately 20% of all transcripts currently represented in mouse MGC. We show that NIH_BMAP clones comprise 68% of mouse MGC cDNAs > or =5 kb, and 54% of those > or =4 kb, as of March 15, 2004. Importantly, we identified transcripts, among the 1274 full-ORF sequences, that are exclusively or predominantly expressed in brain and eye tissues, many of which encode yet uncharacterized proteins.
Collapse
Affiliation(s)
- Maria F Bonaldo
- Department of Pediatrics, The University of Iowa, Iowa City, Iowa 52242, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1076
|
Wu JQ, Garcia AM, Hulyk S, Sneed A, Kowis C, Yuan Y, Steffen D, McPherson JD, Gunaratne PH, Gibbs RA. Large-scale RT-PCR recovery of full-length cDNA clones. Biotechniques 2004; 36:690-6, 698-700. [PMID: 15088387 DOI: 10.2144/04364dd03] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Pseudogenes, alternative transcripts, noncoding RNA, and polymorphisms each add extensive complexity to the mammalian transcriptome and confound estimation of the total number of genes. Despite advanced algorithms for gene prediction and several large-scale efforts to obtain cDNA clones for all human open reading frames (ORFs), no single collection is complete. To enhance this effort, we have developed a high-throughput pipeline for reverse transcription PCR (RT-PCR) gene recovery. Most importantly, novel molecular strategies for improving RT-PCR yield of transcripts that have been difficult to isolate by other means and computational strategies for clone sequence validation have been developed and optimized. This systematic gene recovery pipeline allows both rescue of predicted human and rat genes and provides insight into the complexity of the transcriptome through comparisons with existing data sets.
Collapse
Affiliation(s)
- Jia Qian Wu
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
1077
|
Brasch MA, Hartley JL, Vidal M. ORFeome cloning and systems biology: standardized mass production of the parts from the parts-list. Genome Res 2004; 14:2001-9. [PMID: 15489318 DOI: 10.1101/gr.2769804] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Together with metabolites, proteins and RNAs form complex biological systems through highly intricate networks of physical and functional interactions. Large-scale studies aimed at a molecular understanding of the structure, function, and dynamics of proteins and RNAs in the context of cellular networks require novel approaches and technologies. This Special Issue of Genome Research features strategies for the high-throughput construction and manipulation of complete sets of protein-encoding open reading frames (ORFeome), gene promoters (promoterome), and noncoding RNAs, as predicted from genome and transcriptome sequences. Here we discuss the use of a recombinational cloning system that allows efficiency, adaptability, and compatibility in the generation of ORFeome, promoterome, and other resources.
Collapse
|
1078
|
Meier-Noorden M, Flindt S, Kalinke U, Hinz T. A CpG-rich bidirectional promoter induces the T-cell death-associated gene 51 and downregulates an inversely oriented transcript during early T-cell activation. Gene 2004; 338:197-207. [PMID: 15315823 DOI: 10.1016/j.gene.2004.05.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2004] [Revised: 04/16/2004] [Accepted: 05/06/2004] [Indexed: 11/18/2022]
Abstract
The human T-cell death-associated gene 51 (TDAG51) is upregulated upon lymphocyte stimulation and in the context of ER stress. Moreover, TDAG51 plays a role in programmed cell death and tumorigenesis. We performed an extensive TDAG51 promoter analysis and found a strong CpG-rich bidirectional promoter within the first 582 nucleotides of the TDAG51 reference DNA complementary to RNA (cDNA). Upon stimulation of primary human T cells, this promoter modulated the downregulation of a newly detected head-to-head oriented transcript. Mapping of the transcription start points revealed that the 5' regions of the TDAG51 mRNA and of the newly identified transcript did not overlap in T cells. Thus, the TDAG51 locus shows an operon-like organization of two head-to-head oriented transcripts that are inversely regulated in T lymphocytes by a CpG-rich bidirectional promoter.
Collapse
Affiliation(s)
- Marc Meier-Noorden
- Division of Immunology, Paul-Ehrlich-Institut, Paul-Ehrlich-Str. 51-59, D-63225 Langen, Germany
| | | | | | | |
Collapse
|
1079
|
Genomic structure and cloning of two transcript isoforms of human Sp8. BMC Genomics 2004; 5:86. [PMID: 15533246 PMCID: PMC534095 DOI: 10.1186/1471-2164-5-86] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2004] [Accepted: 11/08/2004] [Indexed: 11/29/2022] Open
Abstract
Background The Specificity proteins (Sp) are a family of transcription factors that have three highly conserved zinc-fingers located towards the carboxy-terminal that bind GC-boxes and assist in the initiation of gene transcription. Human Sp1-7 genes have been characterized. Recently, the phenotype of Sp8 null mice has been described, being tailless and having severe truncation of both fore and hind limbs. They also have malformed brains with defective closure of the anterior and posterior neuropore during brain development. Results The human Sp8 gene is a three-exon gene that maps to 7p21.3, close to the related Sp4 gene. From an osteosarcoma cell line we cloned two transcript variants that use two different first exons and have a common second exon. One clone encodes a 508-residue protein, Sp8L (isoform 1) and the other a shorter 490-residue protein, Sp8S (isoform 2). These two isoforms are conserved being found also in mice and zebrafish. Analysis of the Sp8L protein sequence reveals an amino-terminal hydrophobic Sp-motif that is disrupted in Sp8S, a buttonhead box and three C2H2 zinc-fingers. Sp8 mRNA expression was detected in a wide range of tissues at a low level, with the highest levels being found in brain. Treatment of the murine pluripotent cell line C3H10T1/2 with 100 ng/mL BMP-2 induced Sp8 mRNA after 24 hours. Conclusions There is conservation of the two Sp8 protein isoforms between primates, rodents and fish, suggesting that the isoforms have differing roles in gene regulation. Sp8 may play a role in chondrogenic/osteoblastic differentiation in addition to its role in brain and limb development.
Collapse
|
1080
|
Abstract
Life science in the 21st century is developing rapidly through the structural analysis of biomolecules, the completion of the human genome sequence and the analysis of transcriptomes. The mouse transcriptome has been comprehensively analyzed using a gene discovery approach to collect full-length cDNA (FL-cDNA) clones. The framework of the transcriptome was then mapped out by an international Functional ANnoTation Of Mouse cDNA (FANTOM) effort, and a significant new population of noncoding transcripts was discovered. The geographical analogy of a second "RNA continent," separate from the "continent" of expressed proteins, aids the visualization of this concept. An unexpected number of variations was discovered in the mouse transcriptome. The animal transcriptome has evolved to produce several transcripts and proteins from a single "transcriptional unit". Transcriptome analysis has given rise to the FL-cDNA database and to the 60 770 FANTOM FL-cDNA clone set, and the DNABook was developed as an easier way to distribute these clones. In conjunction with genome sequence databases, transcriptome databases and clone banks will be platforms for developing advanced databases of gene function (e.g. the Genome Function Database). This will enable life science to make rapid progress towards understanding life as a system of molecules.
Collapse
Affiliation(s)
- Yoshihide Hayashizaki
- Genome Exploration Research Group, Genomic Science Center (GSC), RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045 Japan.
| | | |
Collapse
|
1081
|
Tiwari R, Geliebter J, Lucchese A, Mittelman A, Kanduc D. Computational peptide dissection of Melan-a/MART-1 oncoprotein antigenicity. Peptides 2004; 25:1865-71. [PMID: 15501517 DOI: 10.1016/j.peptides.2004.07.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2004] [Revised: 07/14/2004] [Accepted: 07/14/2004] [Indexed: 12/22/2022]
Abstract
We have mapped the linear antigenic determinant of a commercial MAb raised in the mouse against the melanoma-associated-antigen Melan-A/MART-1. The B cell epitope on the Melan-A/MART-1 oncoprotein is located in the 15-mer amino acid sequence 101-115 PPAYEKLSAEQSPPP, within residues 102-106. The definition of the antigenic sequence on Melan-A/MART-1 oncoprotein was reached following analyses of MHC II binding potential and similarity level to the mouse proteome, that put into evidence the 15-mer amino acid sequence 101-115 PPAYEKLSAEQSPPP as the top scoring peptide in binding H2-A(d) molecules and the epitopic sequence residues 102-106 (i.e., the peptide sequence PAYEK) as having low-similarity level to the mouse proteome. Dot-blot epitope mapping immunoassay identified proline residue 102 as critical, based on its effect on antibody recognition. The present study adds to previous companion reports in validating the hypothesis that low-similarity to the host's proteome and binding potential to MHC II molecules are essential concurring factors in the modulation of the pool of epitopic sequences.
Collapse
Affiliation(s)
- Raj Tiwari
- Department of Microbiology and Immunology, New York Medical College, Valhalla, NY 10595, USA
| | | | | | | | | |
Collapse
|
1082
|
Afjehi-Sadat L, Shin JH, Felizardo M, Lee K, Slavc I, Lubec G. Detection of hypothetical proteins in 10 individual human tumor cell lines. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2004; 1747:67-80. [PMID: 15680240 DOI: 10.1016/j.bbapap.2004.09.024] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2004] [Revised: 09/27/2004] [Accepted: 09/27/2004] [Indexed: 12/20/2022]
Abstract
The search for new structures in tumors by genomics and proteomics methods is a major goal in tumor biology and may lead to the detection of markers or antigens for the generation of tumor vaccines. The aim of this study was to identify proteins that have been predicted so far based upon their nucleic acid sequence only or show poor identity to known proteins in tumor cell lines. Cell lines of neuroblastoma, colorectal, cervix carcinoma, adenocarcinoma of the ovary, lung and breast cancer, promyelocytic leukaemia, rhabdomyosarcoma, osteosarcoma and malignant melanoma were used. Cell lysates were run on 2D gel electrophoresis with subsequent in-gel digestion and MALDI-TOF-TOF analysis. A series of 10 hypothetical proteins (HPs) were observed and three of these proteins, hypothetical protein (Q9BTE6), CGI-83 protein (Q9Y392) and similar to CG11334 (Q9BV20), were so far described in tumors exclusively. The other seven proteins were already detected at the transcriptional level in normal and tumor cell lines or tissues. In conclusion, the three HPs observed in lung cancer and malignant melanoma may be candidates for development of tumor markers and generation of tumor vaccines.
Collapse
Affiliation(s)
- Leila Afjehi-Sadat
- Medical University of Vienna, Division of Basic Science, Department of Pediatrics, Währinger Gürtel 18, A-1090 Vienna, Austria
| | | | | | | | | | | |
Collapse
|
1083
|
Suzuki Y, Yamashita R, Shirota M, Sakakibara Y, Chiba J, Mizushima-Sugano J, Nakai K, Sugano S. Sequence comparison of human and mouse genes reveals a homologous block structure in the promoter regions. Genome Res 2004; 14:1711-8. [PMID: 15342556 PMCID: PMC515316 DOI: 10.1101/gr.2435604] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Comparative sequence analysis was carried out for the regions adjacent to experimentally validated transcriptional start sites (TSSs), using 3324 pairs of human and mouse genes. We aligned the upstream putative promoter sequences over the 1-kb proximal regions and found that the sequence conservation could not be further extended at, on average, 510 bp upstream positions of the TSSs. This discontinuous manner of the sequence conservation revealed a "block" structure in about one-third of the putative promoter regions. Consistently, we also observed that G+C content and CpG frequency were significantly different inside and outside the blocks. Within the blocks, the sequence identity was uniformly 65% regardless of their length. About 90% of the previously characterized transcription factor binding sites were located within those blocks. In 46% of the blocks, the 5' ends were bounded by interspersed repetitive elements, some of which may have nucleated the genomic rearrangements. The length of the blocks was shortest in the promoters of genes encoding transcription factors and of genes whose expression patterns are brain specific, which suggests that the evolutional diversifications in the transcriptional modulations should be the most marked in these populations of genes.
Collapse
Affiliation(s)
- Yutaka Suzuki
- Human Genome Center, The Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, 108-8639, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
1084
|
|
1085
|
|
1086
|
Greenfield NJ, Kostyukova AS, Hitchcock-DeGregori SE. Structure and tropomyosin binding properties of the N-terminal capping domain of tropomodulin 1. Biophys J 2004; 88:372-83. [PMID: 15475586 PMCID: PMC1305014 DOI: 10.1529/biophysj.104.051128] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Two families of actin regulatory proteins are the tropomodulins and tropomyosins. Tropomodulin binds to tropomyosin (TM) and to the pointed end of actin filaments and "caps" the pointed end (i.e., inhibits its polymerization and depolymerization). Tropomodulin 1 has two distinct actin-capping regions: a folded C-terminal domain (residues 160-359), which does not bind to TM, and a conserved, N-terminal region, within residues 1-92 that binds TM and requires TM for capping activity. NMR and circular dichroism were used to determine the structure of a peptide containing residues 1-92 of tropomodulin (Tmod1(1-92)) and to define its TM binding site. Tmod1(1-92) is mainly disordered with only one helical region, residues 24-35. This helix forms part of the TM binding domain, residues 1-35, which become more ordered upon binding a peptide containing the N-terminus of an alpha-TM. Mutation of L27 to E or G in the Tmod helix reduces TM affinity. Residues 49-92 are required for capping but do not bind TM. Of these, residues 67-75 have the sequence of an amphipathic helix, but are not helical. Residues 55-62 and 76-92 display negative 1H-15N heteronuclear Overhauser enhancements showing they are flexible. The conformational dynamics of these residues may be important for actin capping activity.
Collapse
Affiliation(s)
- Norma J Greenfield
- Department of Neuroscience and Cell Biology, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, New Jersey 08854-5635, USA.
| | | | | |
Collapse
|
1087
|
Suzuki H, Okunishi R, Hashizume W, Katayama S, Ninomiya N, Osato N, Sato K, Nakamura M, Iida J, Kanamori M, Hayashizaki Y. Identification of region-specific transcription factor genes in the adult mouse brain by medium-scale real-time RT-PCR. FEBS Lett 2004; 573:214-8. [PMID: 15328000 DOI: 10.1016/j.febslet.2004.07.068] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2004] [Revised: 07/26/2004] [Accepted: 07/27/2004] [Indexed: 11/20/2022]
Abstract
We established a medium-scale real-time RT-PCR system focusing on transcription factors and applied it to their expression profiles in the adult mouse 11 brain regions (http://genome.gsc.riken.jp/qRT-PCR/). Almost 90% of the examined genes showed significant expression in at least one region. We successfully extracted 179 region-specific genes by clustering analysis. Interestingly, the transcription factors involved in the development of the pituitary were still expressed in the adult pituitary, suggesting that they also play important roles in maintenance of the pituitary. These results provide unique molecular markers that may account for the molecular basis of the unique functions of specific brain regions.
Collapse
Affiliation(s)
- Harukazu Suzuki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1088
|
Motojima K. 17β -Hydroxysteroid dehydrogenase type 11 is a major peroxisome proliferator-activated receptor α-regulated gene in mouse intestine. ACTA ACUST UNITED AC 2004; 271:4141-6. [PMID: 15479243 DOI: 10.1111/j.1432-1033.2004.04352.x] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In order to study the role of peroxisome proliferator-activated receptor alpha in mouse intestine, its agonist-induced proteins were identified by peptide mass fingerprinting followed by Northern blot analysis using their cDNAs. One of the most remarkably induced proteins was identified as 17beta-hydroxysterol dehydrogenase type 11. Its very rapid induction by various agonists was most efficient in intestine and then in liver. These findings together with recently reported results showing the enzyme family's wide substrate spectrum, including not only glucocorticoids and sex steroids but also bile acids, fatty acids and branched chain amino acids, suggest new roles for both peroxisome proliferator-activated receptor alpha and 17beta-hydroxysterol dehydrogenase type 11 in lipid metabolism and/or detoxification in the intestine.
Collapse
Affiliation(s)
- Kiyoto Motojima
- Department of Biochemistry, Meiji Pharmaceutical University, Kiyose, Tokyo, Japan.
| |
Collapse
|
1089
|
Rodriguez A, Griffiths-Jones S, Ashurst JL, Bradley A. Identification of mammalian microRNA host genes and transcription units. Genome Res 2004; 14:1902-10. [PMID: 15364901 PMCID: PMC524413 DOI: 10.1101/gr.2722704] [Citation(s) in RCA: 1453] [Impact Index Per Article: 69.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2004] [Accepted: 07/27/2004] [Indexed: 12/13/2022]
Abstract
To derive a global perspective on the transcription of microRNAs (miRNAs) in mammals, we annotated the genomic position and context of this class of noncoding RNAs (ncRNAs) in the human and mouse genomes. Of the 232 known mammalian miRNAs, we found that 161 overlap with 123 defined transcription units (TUs). We identified miRNAs within introns of 90 protein-coding genes with a broad spectrum of molecular functions, and in both introns and exons of 66 mRNA-like noncoding RNAs (mlncRNAs). In addition, novel families of miRNAs based on host gene identity were identified. The transcription patterns of all miRNA host genes were curated from a variety of sources illustrating spatial, temporal, and physiological regulation of miRNA expression. These findings strongly suggest that miRNAs are transcribed in parallel with their host transcripts, and that the two different transcription classes of miRNAs ('exonic' and 'intronic') identified here may require slightly different mechanisms of biogenesis.
Collapse
Affiliation(s)
- Antony Rodriguez
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | | | | | | |
Collapse
|
1090
|
Ohtsuka S, Iwase K, Kato M, Seki N, Shimizu-Yabe A, Miyauchi O, Sakao E, Kanazawa M, Yamamoto S, Kohno Y, Takiguchi M. An mRNA amplification procedure with directional cDNA cloning and strand-specific cRNA synthesis for comprehensive gene expression analysis. Genomics 2004; 84:715-29. [PMID: 15475249 DOI: 10.1016/j.ygeno.2004.06.012] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2003] [Accepted: 06/28/2004] [Indexed: 12/21/2022]
Abstract
We developed an integrated system suitable for comprehensive gene expression studies including construction and analysis of cDNA microarrays starting from a small amount of mRNA. We amplified total mRNA first as cDNA mixtures by polymerase chain reaction and then as strand-specific cRNA mixtures by in vitro transcription. These amplified cDNA and cRNA enabled determination of mRNA levels by hybridization analyses such as Southern, Northern, reverse-Northern macroarray, and cDNA microarray analyses, as well as construction of the cDNA library with a unidirectional cDNA insert. By using strand-specific cRNA derived from rat primary-cultured hepatocytes, we detected putative antisense transcripts for the metallothionein gene. cDNA microarray analysis for genes regulated by glucocorticoids and glucagon in the hepatocytes revealed that a number of genes involved in signal transduction and transcriptional regulation were up- or down-regulated. The present system is widely applicable to gene expression analysis with limited amounts of RNA samples.
Collapse
MESH Headings
- Animals
- Base Sequence
- Cells, Cultured
- Cloning, Molecular
- DNA, Complementary/genetics
- DNA, Complementary/metabolism
- Gene Amplification
- Gene Expression
- Gene Library
- Glucagon/pharmacology
- Glucocorticoids/pharmacology
- Hepatocytes/drug effects
- Hepatocytes/metabolism
- Metallothionein/genetics
- Mice
- Molecular Sequence Data
- Neurons/drug effects
- Neurons/metabolism
- Oligonucleotide Array Sequence Analysis
- Polymerase Chain Reaction
- RNA, Antisense/genetics
- RNA, Complementary/biosynthesis
- RNA, Complementary/genetics
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Rats
- Transcription, Genetic/genetics
Collapse
Affiliation(s)
- Satoko Ohtsuka
- Department of Biochemistry and Genetics, Chiba University Graduate School of Medicine, Chiba 260-8670, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1091
|
Collins JE, Wright CL, Edwards CA, Davis MP, Grinham JA, Cole CG, Goward ME, Aguado B, Mallya M, Mokrab Y, Huckle EJ, Beare DM, Dunham I. A genome annotation-driven approach to cloning the human ORFeome. Genome Biol 2004; 5:R84. [PMID: 15461802 PMCID: PMC545604 DOI: 10.1186/gb-2004-5-10-r84] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2004] [Revised: 07/16/2004] [Accepted: 08/11/2004] [Indexed: 11/10/2022] Open
Abstract
We have developed a systematic approach to generating cDNA clones containing full-length open reading frames (ORFs), exploiting knowledge of gene structure from genomic sequence. Each ORF was amplified by PCR from a pool of primary cDNAs, cloned and confirmed by sequencing. We obtained clones representing 70% of genes on human chromosome 22, whereas searching available cDNA clone collections found at best 48% from a single collection and 60% for all collections combined.
Collapse
Affiliation(s)
- John E Collins
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Charmain L Wright
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Carol A Edwards
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Current address: Department of Anatomy, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Matthew P Davis
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - James A Grinham
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Charlotte G Cole
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Melanie E Goward
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Begoña Aguado
- MRC Rosalind Franklin Centre for Genomics Research (formerly MRC UK Human Genome Mapping Resource Centre), Hinxton, Cambridge, CB10 1SB, UK
- Current address: Centro Nacional de Biotecnología (CNB), Campus de la Universidad Autónoma de Madrid, Cantoblanco, 28049 Madrid, Spain
| | - Meera Mallya
- MRC Rosalind Franklin Centre for Genomics Research (formerly MRC UK Human Genome Mapping Resource Centre), Hinxton, Cambridge, CB10 1SB, UK
- Current address: Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Hills Road, Cambridge, CB2 2XY, UK
| | - Younes Mokrab
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
- Current address: Department of Biochemistry, Sanger Building, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK
| | - Elizabeth J Huckle
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - David M Beare
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Ian Dunham
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
1092
|
Iida K, Seki M, Sakurai T, Satou M, Akiyama K, Toyoda T, Konagaya A, Shinozaki K. Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences. Nucleic Acids Res 2004; 32:5096-103. [PMID: 15452276 PMCID: PMC521658 DOI: 10.1093/nar/gkh845] [Citation(s) in RCA: 185] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We mapped RIKEN Arabidopsis full-length (RAFL) cDNAs to the Arabidopsis thaliana genome to search for alternative splicing events. We used 278,734 full-length and 3'/5' terminal reads of the sequences of 220,214 RAFL cDNA clones for the analysis. Eighty-nine percent of the cDNA sequences could be mapped to the genome and were clustered in 17,130 transcription units (TUs). Alternative splicing events were found in 1764 out of 15,214 TUs (11.6%) with multiple sequences. We collected full-length cDNA clones from plants grown under various environmental conditions or from various organs. We then analyzed the correlation between alternative splicing events and environmental stress conditions. Alternative splicing profiles changed according to environmental stress conditions and the various developmental stages of plant organs. In particular, cold-stress conditions affected alternative splicing profiles. The change in alternative splicing profiles under cold stress may be mediated by alternative splicing and transcriptional regulation of splicing factors.
Collapse
Affiliation(s)
- Kei Iida
- Plant Mutation Exploration Team, Plant Functional Genomics Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | | | | | | | | | | | | | | |
Collapse
|
1093
|
Fletcher HA, Owiafe P, Jeffries D, Hill P, Rook GAW, Zumla A, Doherty TM, Brookes RH. Increased expression of mRNA encoding interleukin (IL)-4 and its splice variant IL-4delta2 in cells from contacts of Mycobacterium tuberculosis, in the absence of in vitro stimulation. Immunology 2004; 112:669-73. [PMID: 15270739 PMCID: PMC1782524 DOI: 10.1111/j.1365-2567.2004.01922.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Expression of interleukin (IL)-4 is increased in tuberculosis and thought to be detrimental. We show here that in healthy contacts there is increased expression of its naturally occurring antagonist, IL-4delta2 (IL-4delta2). We identified contacts by showing that their peripheral blood mononuclear cells (PBMC) released interferon (IFN)-gamma in response to the Mycobacterium tuberculosis-specific antigen 6 kDa early secretory antigenic target (ESAT-6). Fresh unstimulated PBMC from these contacts contained higher levels of mRNA encoding IL-4delta2 (P=0.002) than did cells from ESAT-6 negative donors (noncontacts). These data indicate that contact with M. tuberculosis induces unusual, previously unrecognized, immunological events. We tentatively hypothesize that progression to active disease might depend upon the underlying ratio of IL-4 to IL-4delta2.
Collapse
Affiliation(s)
- Helen A Fletcher
- The Centre for Infectious Diseases and International Health, Windeyer Institute of Medical Sciences, Royal Free and University College Medical School, London, UK
| | | | | | | | | | | | | | | |
Collapse
|
1094
|
Brusic V. From immunoinformatics to immunomics. J Bioinform Comput Biol 2004; 1:179-81. [PMID: 15290787 DOI: 10.1142/s0219720003000034] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2002] [Revised: 01/29/2003] [Accepted: 02/05/2002] [Indexed: 11/18/2022]
Affiliation(s)
- Vladimir Brusic
- Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore.
| |
Collapse
|
1095
|
Abstract
The availability of entire genome sequences is expected to revolutionize the way in which biology and medicine are conducted for years to come. However, achieving this promise still requires significant effort in the areas of gene annotation, cloning and expression of thousands of known and heretofore unknown protein-encoding genes. Traditional technologies of manipulating genes are too cumbersome and inefficient when one is dealing with more than a few genes at a time. Entire libraries composed of all protein-encoding open reading frames (ORFs) cloned in highly flexible vectors will be needed to take full advantage of the information found in any genome sequence. The creation of such ORFeome resources using novel technologies for cloning and expressing entire proteomes constitutes an effective gateway from whole genome sequencing efforts to downstream 'omics' applications.
Collapse
Affiliation(s)
- Jean-François Rual
- Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute and Department of Genetics, Harvard Medical School, 44 Binney Street, Boston, MA 02115, USA
| | | | | |
Collapse
|
1096
|
Suzuki M, Hayashizaki Y. Mouse-centric comparative transcriptomics of protein coding and non-coding RNAs. Bioessays 2004; 26:833-43. [PMID: 15273986 DOI: 10.1002/bies.20084] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The largest transcriptome reported so far comprises 60,770 mouse full-length cDNA clones, and is an effective reference data set for comparative transcriptomics. The number of mouse cDNAs identified greatly exceeds the number of genes predicted from the sequenced human and mouse genomes. This is largely because of extensive alternative splicing and the presence of many non-coding RNAs (ncRNAs), which are difficult to predict from genomic sequences. Notably, ncRNAs are a major component of the transcriptomes of higher organisms, and many sense-antisense pairs have been identified. The ncRNAs function in a range of regulatory mechanisms for gene expression and other biological processes. They might also have contributed to the increased functional diversification of genomes during evolution. In this review, we discuss aspects of the transcriptome of various organisms in relation to the mouse data, in order to shed light on the regulatory mechanisms and physiological significance of these abundant RNAs.
Collapse
Affiliation(s)
- Masanori Suzuki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Kanagawa, Japan
| | | |
Collapse
|
1097
|
Kanamori M, Konno H, Osato N, Kawai J, Hayashizaki Y, Suzuki H. A genome-wide and nonredundant mouse transcription factor database. Biochem Biophys Res Commun 2004; 322:787-93. [PMID: 15336533 DOI: 10.1016/j.bbrc.2004.07.179] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2004] [Indexed: 10/26/2022]
Abstract
Here we describe the development of a genome-wide and nonredundant mouse transcription factor database and its viewer (http://genome.gsc.riken.gp/TFdb/). We systematically selected transcription factors with DNA-binding properties and their regulators on the basis of their LocusLink and Gene Ontology annotations. We also incorporated into our database information regarding the corresponding available cDNA clones and their structural properties. Because of these features, our database is unique and may provide useful information for systematic genome-wide studies of transcriptional regulation.
Collapse
Affiliation(s)
- Mutsumi Kanamori
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | | | | | | | | | | |
Collapse
|
1098
|
Boldogköi Z. Gene Network Polymorphism Is the Raw Material of Natural Selection: The Selfish Gene Network Hypothesis. J Mol Evol 2004; 59:340-57. [PMID: 15553089 DOI: 10.1007/s00239-004-2629-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Population genetics, the mathematical theory of modern evolutionary biology, defines evolution as the alteration of the frequency of distinct gene variants (alleles) differing in fitness over the time. The major problem with this view is that in gene and protein sequences we can find little evidence concerning the molecular basis of phenotypic variance, especially those that would confer adaptive benefit to the bearers. Some novel data, however, suggest that a large amount of genetic variation exists in the regulatory region of genes within populations. In addition, comparison of homologous DNA sequences of various species shows that evolution appears to depend more strongly on gene expression than on the genes themselves. Furthermore, it has been demonstrated in several systems that genes form functional networks, whose products exhibit interrelated expression profiles. Finally, it has been found that regulatory circuits of development behave as evolutionary units. These data demonstrate that our view of evolution calls for a new synthesis. In this article I propose a novel concept, termed the selfish gene network hypothesis, which is based on an overall consideration of the above findings. The major statements of this hypothesis are as follows. (1) Instead of individual genes, gene networks (GNs) are responsible for the determination of traits and behaviors. (2) The primary source of microevolution is the intraspecific polymorphism in GNs and not the allelic variation in either the coding or the regulatory sequences of individual genes. (3) GN polymorphism is generated by the variation in the regulatory regions of the component genes and not by the variance in their coding sequences. (4) Evolution proceeds through continuous restructuring of the composition of GNs rather than fixing of specific alleles or GN variants.
Collapse
Affiliation(s)
- Zsolt Boldogköi
- Laboratory of Neuromorphology, Department of Anatomy, Faculty of Medicine, University of Budapest, Budapest, Hungary.
| |
Collapse
|
1099
|
Kawai J, Hayashizaki Y. Mouse genome encyclopedia and genome network. Geriatr Gerontol Int 2004. [DOI: 10.1111/j.1447-0594.2004.00179.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
1100
|
Curwen V, Eyras E, Andrews TD, Clarke L, Mongin E, Searle SMJ, Clamp M. The Ensembl automatic gene annotation system. Genome Res 2004; 14:942-50. [PMID: 15123590 PMCID: PMC479124 DOI: 10.1101/gr.1858004] [Citation(s) in RCA: 285] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, and EST sequences. The gene-building system rests on top of the core Ensembl (MySQL) database schema and Perl Application Programming Interface (API), and the data generated are accessible through the Ensembl genome browser (http://www.ensembl.org). To date, the Ensembl predicted gene sets are available for the A. gambiae, C. briggsae, zebrafish, mouse, rat, and human genomes and have been heavily relied upon in the publication of the human, mouse, rat, and A. gambiae genome sequence analysis. Here we describe in detail the gene-building system and the algorithms involved. All code and data are freely available from http://www.ensembl.org.
Collapse
Affiliation(s)
- Val Curwen
- The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | | | | | | | | | | | | |
Collapse
|