1151
|
Visel A, Thaller C, Eichele G. GenePaint.org: an atlas of gene expression patterns in the mouse embryo. Nucleic Acids Res 2004; 32:D552-6. [PMID: 14681479 PMCID: PMC308763 DOI: 10.1093/nar/gkh029] [Citation(s) in RCA: 417] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
High-throughput instruments were recently developed to determine gene expression patterns on tissue sections by RNA in situ hybridization. The resulting images of gene expression patterns, chiefly of E14.5 mouse embryos, are accessible to the public at http://www.genepaint.org. This relational database is searchable for gene identifiers and RNA probe sequences. Moreover, patterns and intensity of expression in approximately 100 different embryonic tissues are annotated and can be searched using a standardized catalog of anatomical structures. A virtual microscope tool, the Zoom Image Server, was implemented in GenePaint.org and permits interactive zooming and panning across approximately 15,000 high-resolution images.
Collapse
Affiliation(s)
- Axel Visel
- Max Planck Institute of Experimental Endocrinology, Feodor-Lynen-Strasse 7, D-30625 Hannover, Germany
| | | | | |
Collapse
|
1152
|
Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, Eilbeck K, Lewis S, Marshall B, Mungall C, Richter J, Rubin GM, Blake JA, Bult C, Dolan M, Drabkin H, Eppig JT, Hill DP, Ni L, Ringwald M, Balakrishnan R, Cherry JM, Christie KR, Costanzo MC, Dwight SS, Engel S, Fisk DG, Hirschman JE, Hong EL, Nash RS, Sethuraman A, Theesfeld CL, Botstein D, Dolinski K, Feierbach B, Berardini T, Mundodi S, Rhee SY, Apweiler R, Barrell D, Camon E, Dimmer E, Lee V, Chisholm R, Gaudet P, Kibbe W, Kishore R, Schwarz EM, Sternberg P, Gwinn M, Hannick L, Wortman J, Berriman M, Wood V, de la Cruz N, Tonellato P, Jaiswal P, Seigfried T, White R. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res 2004; 32:D258-61. [PMID: 14681407 PMCID: PMC308770 DOI: 10.1093/nar/gkh036] [Citation(s) in RCA: 2807] [Impact Index Per Article: 133.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The Gene Ontology (GO) project (http://www. geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.
Collapse
|
1153
|
Yazaki J, Kojima K, Suzuki K, Kishimoto N, Kikuchi S. The Rice PIPELINE: a unification tool for plant functional genomics. Nucleic Acids Res 2004; 32:D383-7. [PMID: 14681439 PMCID: PMC308736 DOI: 10.1093/nar/gkh001] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The Rice Genome Research Project in Japan performs genome sequencing and comprehensive expression profiling, constructs genetic and physical maps, collects full-length cDNAs and generates mutant lines, all aimed at improving the breeding of the rice plant as a food source. The National Institute of Agrobiological Sciences in Tsukuba, Japan, has accumulated numerous rice biological resources and has already successfully produced a high-quality genome sequence, a high-density genetic map with 3000 markers, 30,000 full-length cDNAs, over 700 expression profiles with a 9000 cDNA microarray and 15,000 flanking sequences with Tos17 insertions in about 3765 mutant lines from about 50,000 transposon insertion lines. These resources are available in the public domain. A new unification tool for functional genomics, called Rice PIPELINE, has also been developed for the dynamic collection and compilation of genomics data (genome sequences, full-length cDNAs, gene expression profiles, mutant lines, cis elements) from various databases. The mission of Rice PIPELINE is to provide a unique scientific resource that pools publicly available rice genomic data for search by clone sequence, clone name, GenBank accession number, or keyword. The web-based form of Rice PIPELINE is available at http://cdna01.dna.affrc.go.jp/PIPE/.
Collapse
Affiliation(s)
- Junshi Yazaki
- Department of Molecular Biology, National Institute of Agrobiological Sciences, 2-1-2 Kannon-dai, Tsukuba, Ibaraki 305-8602, Japan
| | | | | | | | | |
Collapse
|
1154
|
Nagashima T, Matsuda H, Silva DG, Petrovsky N, Konagaya A, Schönbach C, Kasukawa T, Arakawa T, Carninci P, Kawai J, Hayashizaki Y. FREP: a database of functional repeats in mouse cDNAs. Nucleic Acids Res 2004; 32:D471-5. [PMID: 14681460 PMCID: PMC308857 DOI: 10.1093/nar/gkh123] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The FREP database (http://facts.gsc.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16,527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of polyadenylation sites in 3'UTR-located repeats, (iii) effect on translation, (iv) position in the protein- coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.
Collapse
Affiliation(s)
- Takeshi Nagashima
- Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1155
|
Suzuki Y, Yamashita R, Sugano S, Nakai K. DBTSS, DataBase of Transcriptional Start Sites: progress report 2004. Nucleic Acids Res 2004; 32:D78-81. [PMID: 14681363 PMCID: PMC308810 DOI: 10.1093/nar/gkh076] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
DBTSS (http://dbtss.hgc.jp) was originally constructed based on a collection of experimentally determined TSSs of human genes. Since its first release in 2002, it has been updated several times. First, the amount of stored data has increased significantly: e.g. the number of clones that match both the RefSeq mRNA set and the genome sequence has increased from 111,382 to 190,964, now covering 1,234 genes. Second, the positions of SNPs in dbSNP were displayed on the upstream regions of contained human genes. Third, DBTSS now covers other species such as mouse and the human malaria parasite. It will become a central database containing data for many more species with oligo-capping and related methods. Lastly, the database now serves for comparative promoter analyses: in the current version, comparative views of potentially orthologous promoters from human and mouse are presented with an additional function of searching potential transcription-factor binding sites, which are either conserved or diverged between species.
Collapse
Affiliation(s)
- Yutaka Suzuki
- Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan.
| | | | | | | |
Collapse
|
1156
|
Bult CJ, Blake JA, Richardson JE, Kadin JA, Eppig JT, Baldarelli RM, Barsanti K, Baya M, Beal JS, Boddy WJ, Bradt DW, Burkart DL, Butler NE, Campbell J, Corey R, Corbani LE, Cousins S, Dene H, Drabkin HJ, Frazer K, Garippa DM, Glass LH, Goldsmith CW, Grant PL, King BL, Lennon-Pierce M, Lewis J, Lu I, Lutz CM, Maltais LJ, McKenzie LM, Miers D, Modrusan D, Ni L, Ormsby JE, Qi D, Ramachandran S, Reddy TBK, Reed DJ, Sinclair R, Shaw DR, Smith CL, Szauter P, Taylor B, Vanden Borre P, Walker M, Washburn L, Witham I, Winslow J, Zhu Y. The Mouse Genome Database (MGD): integrating biology with the genome. Nucleic Acids Res 2004; 32:D476-81. [PMID: 14681461 PMCID: PMC308859 DOI: 10.1093/nar/gkh125] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The Mouse Genome Database (MGD) is one component of the Mouse Genome Informatics (MGI) system (http://www.informatics.jax.org), a community database resource for the laboratory mouse. MGD strives to provide a comprehensive knowledgebase about the mouse with experiments and data annotated from both literature and online sources. MGD curates and presents consensus and experimental data representations of genetic, genotype (sequence) and phenotype information including highly detailed reports about genes and gene products. Primary foci of integration are through representations of relationships between genes, sequences and phenotypes. MGD collaborates with other bioinformatics groups to curate a definitive set of information about the laboratory mouse and to build and implement the data and semantic standards that are essential for comparative genome analysis. Recent developments in MGD discussed here include an extensive integration of the mouse sequence data and substantial revisions in the presentation, query and visualization of sequence data.
Collapse
Affiliation(s)
- Carol J Bult
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1157
|
Nikaido I, Saito C, Wakamoto A, Tomaru Y, Arakawa T, Hayashizaki Y, Okazaki Y. EICO (Expression-based Imprint Candidate Organizer): finding disease-related imprinted genes. Nucleic Acids Res 2004; 32:D548-51. [PMID: 14681478 PMCID: PMC308827 DOI: 10.1093/nar/gkh093] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have developed an integrated database that is specialized for the study of imprinted disease genes. The database contains novel candidate imprinted genes identified by the RIKEN full-length mouse cDNA microarray study, information on validated single nucleotide polymorphisms (SNPs) to confirm imprinting using reciprocal mouse crosses and the predicted physical position of imprinting-related disease loci in the mouse and human genomes. It has two user-friendly search interfaces: the SNP-central view (MuSCAT: MoUse SNP CATalog) and the candidate gene-central view (CITE: Candidate Imprinted Transcripts by Expression). The database, EICO (Expression-based Imprint Candidate Organizer), can be accessed via the World Wide Web (http://fantom2.gsc.riken.jp/EICODB/) and the DAS client software. These data and interfaces facilitate understanding of the mechanism of imprinting in mammalian inherited traits.
Collapse
Affiliation(s)
- Itoshi Nikaido
- Division of Genomic Information Resource Exploration, Science of Biological Supramolecular Systems, Yokohama City University, Graduate School of Integrated Science, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | | | | | |
Collapse
|
1158
|
Xie T, Rowen L, Aguado B, Ahearn ME, Madan A, Qin S, Campbell RD, Hood L. Analysis of the gene-dense major histocompatibility complex class III region and its comparison to mouse. Genome Res 2004; 13:2621-36. [PMID: 14656967 PMCID: PMC403804 DOI: 10.1101/gr.1736803] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In mammals, the Major Histocompatibility Complex class I and II gene clusters are separated by an approximately 700-kb stretch of sequence called the MHC class III region, which has been associated with susceptibility to numerous diseases. To facilitate understanding of this medically important and architecturally interesting portion of the genome, we have sequenced and analyzed both the human and mouse class III regions. The cross-species comparison has facilitated the identification of 60 genes in human and 61 in mouse, including a potential RNA gene for which the introns are more conserved across species than the exons. Delineation of global organization, gene structure, alternative splice forms, protein similarities, and potential cis-regulatory elements leads to several conclusions: (1) The human MHC class III region is the most gene-dense region of the human genome: >14% of the sequence is coding, approximately 72% of the region is transcribed, and there is an average of 8.5 genes per 100 kb. (2) Gene sizes, number of exons, and intergenic distances are for the most part similar in both species, implying that interspersed repeats have had little impact in disrupting the tight organization of this densely packed set of genes. (3) The region contains a heterogeneous mixture of genes, only a few of which have a clearly defined and proven function. Although many of the genes are of ancient origin, some appear to exist only in mammals and fish, implying they might be specific to vertebrates. (4) Conserved noncoding sequences are found primarily in or near the 5'-UTR or the first intron of genes, and seldom in the intergenic regions. Many of these conserved blocks are likely to be cis-regulatory elements.
Collapse
Affiliation(s)
- Tao Xie
- Institute for Systems Biology, Seattle, Washington 98103, USA
| | | | | | | | | | | | | | | |
Collapse
|
1159
|
Kasai T, Inoue M, Koshiba S, Yabuki T, Aoki M, Nunokawa E, Seki E, Matsuda T, Matsuda N, Tomo Y, Shirouzu M, Terada T, Obayashi N, Hamana H, Shinya N, Tatsuguchi A, Yasuda S, Yoshida M, Hirota H, Matsuo Y, Tani K, Suzuki H, Arakawa T, Carninci P, Kawai J, Hayashizaki Y, Kigawa T, Yokoyama S. Solution structure of a BolA-like protein from Mus musculus. Protein Sci 2004; 13:545-8. [PMID: 14718656 PMCID: PMC2286707 DOI: 10.1110/ps.03401004] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The BolA-like proteins are widely conserved from prokaryotes to eukaryotes. The BolA-like proteins seem to be involved in cell proliferation or cell-cycle regulation, but the molecular function is still unknown. Here we determined the structure of a mouse BolA-like protein. The overall topology is alphabetabetaalphaalphabetaalpha, in which beta(1) and beta(2) are antiparallel, and beta(3) is parallel to beta(2). This fold is similar to the class II KH fold, except for the absence of the GXXG loop, which is well conserved in the KH fold. The conserved residues in the BolA-like proteins are assembled on the one side of the protein.
Collapse
Affiliation(s)
- Takuma Kasai
- Protein Research Group, RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1160
|
Noor MAF, Michalak P, Donze D. Characterization of a male-predominant antisense transcript underexpressed in hybrids of Drosophila pseudoobscura and D. persimilis. Genetics 2004; 165:1823-30. [PMID: 14704168 PMCID: PMC1462881 DOI: 10.1093/genetics/165.4.1823] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Characterizing genes that are misregulated in hybrids may elucidate the genetic basis of hybrid sterility or other hybrid dysfunctions that contribute to speciation. Previously, a small segment of a male-predominant transcript that is underexpressed in adult male hybrids of Drosophila pseudoobscura and D. persimilis relative to pure species was identified in a differential display screen. Here, we obtained the full sequence of this 1330-bp transcript and determined that it is an antisense message with high sequence similarity to the D. melanogaster TRAP100 gene, part of the Mediator protein complex that regulates transcriptional initiation during development. Both the sense and the antisense messages are transcribed in D. pseudoobscura, but only the sense message (TRAP100) is transcribed in D. melanogaster complex species. Unlike the antisense message, the sense message is transcribed similarly in D. pseudoobscura males and females and in hybrids of D. pseudoobscura and D. persimilis. The high sequence similarity between distantly related species suggests that the sense message is functionally constrained within the genus. We speculate that the antisense transcript may have evolved a role in male-specific post-transcriptional regulation of TRAP100 in the D. pseudoobscura lineage and that its underexpression in sterile hybrid males may cause an overproduction of TRAP100 protein, possibly yielding deleterious effects.
Collapse
Affiliation(s)
- Mohamed A F Noor
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA.
| | | | | |
Collapse
|
1161
|
Hou J, Yashiro K, Okazaki Y, Saijoh Y, Hayashizaki Y, Hamada H. Identification of a novel left-right asymmetrically expressed gene in the mouse belonging to the BPI/PLUNC superfamily. Dev Dyn 2004; 229:373-9. [PMID: 14745963 DOI: 10.1002/dvdy.10450] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
In the process of left-right (L-R) axis formation in the mouse, the node plays a critical role as a structure where the initial breaking of L-R symmetry occurs. Here, we report on the gene LPlunc1, a member of BPI/PLUNC gene superfamily, which is asymmetrically expressed in the developing mouse node. LPlunc1 protein is secreted as a processed form of relative molecular mass 54K-60K and shares sequence features with the other members of BPI/PLUNC superfamily, including the N-terminal and C-terminal homology domains, each of which is considered to form a lipid binding pocket. LPlunc1 is transiently expressed in the crown cells of the node asymmetrically. This expression pattern of Lplunc1 highly overlaps with that of Nodal, a major player during the L-R formation. Interestingly, this asymmetric expression pattern is randomized in the iv mutant and reversed in the inv mutant, indicating that LPlunc1 is downstream of iv and inv. Our results suggest a link between lipid binding/transfer and the axis development.
Collapse
Affiliation(s)
- Juan Hou
- Developmental Genetics Group, Graduate School of Frontier Biosciences, Osaka University, and CREST, Japan Science and Technology Corporation (JST), Osaka, Japan
| | | | | | | | | | | |
Collapse
|
1162
|
Karsten SL, Kudo LC, Geschwind DH. Microarray Platforms: Introduction and Application to Neurobiology. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2004; 60:1-23. [PMID: 15474585 DOI: 10.1016/s0074-7742(04)60001-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Affiliation(s)
- Stanislav L Karsten
- Department of Neurology, Program in Neurogenetics, David Geffen School of Medicine, University of California, Los Angeles 90095, USA
| | | | | |
Collapse
|
1163
|
Ko MSH. Embryogenomics of pre-implantation mammalian development: current status. Reprod Fertil Dev 2004. [DOI: 10.1071/rd03080] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Pre-implantation development is marked by many critical molecular events, including the maternal to zygotic transition and the first differentiation of cells. Understanding such events is important, for both basic reproductive biology and practical applications, including regenerative medicine and livestock production. Scarcity of materials has hampered the progress of the field, but systematic genomics approaches are beginning to be applied to the study of pre-implantation development, resulting in unprecedented amounts of data about the pre-implantation process. The first step in embryogenomics is to collect and sequence cDNAs (expressed sequence tags (ESTs)) for genes that are expressed and function in these early embryos. Mouse work is the most advanced, with 140111 ESTs derived from all stages of pre-implantation development currently available in the public sequence database. For other mammals, at present only approximately 1000 ESTs can be found in the public database, but efforts by several groups are generating cDNA libraries and ESTs. In the present review, the current status of the implementation of these investigative tools for mammalian pre-implantation embryos is discussed.
Collapse
|
1164
|
Fujimori S, Washio T, Higo K, Ohtomo Y, Murakami K, Matsubara K, Kawai J, Carninci P, Hayashizaki Y, Kikuchi S, Tomita M. A novel feature of microsatellites in plants: a distribution gradient along the direction of transcription. FEBS Lett 2003; 554:17-22. [PMID: 14596907 DOI: 10.1016/s0014-5793(03)01041-x] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
A computer-based analysis was conducted to assess the characteristics of microsatellites in transcribed regions of rice and Arabidopsis. In addition, two mammals were simultaneously analyzed for a comparative analysis. Our analyses confirmed a novel plant-specific feature in which there is a gradient in microsatellite density along the direction of transcription. It was also confirmed that pyrimidine-rich microsatellites are found intensively near the transcription start sites, specifically in the two plants, but not in the mammals. Our results suggest that microsatellites located at high frequency in the 5'-flanking regions of plant genes can potentially act as factors in regulating gene expression.
Collapse
Affiliation(s)
- Shigeo Fujimori
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0035, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1165
|
Shiraki T, Kondo S, Katayama S, Waki K, Kasukawa T, Kawaji H, Kodzius R, Watahiki A, Nakamura M, Arakawa T, Fukuda S, Sasaki D, Podhajska A, Harbers M, Kawai J, Carninci P, Hayashizaki Y. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc Natl Acad Sci U S A 2003; 100:15776-81. [PMID: 14663149 PMCID: PMC307644 DOI: 10.1073/pnas.2136655100] [Citation(s) in RCA: 519] [Impact Index Per Article: 23.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2003] [Indexed: 12/22/2022] Open
Abstract
We introduce cap analysis gene expression (CAGE), which is based on preparation and sequencing of concatamers of DNA tags deriving from the initial 20 nucleotides from 5' end mRNAs. CAGE allows high-throughout gene expression analysis and the profiling of transcriptional start points (TSP), including promoter usage analysis. By analyzing four libraries (brain, cortex, hippocampus, and cerebellum), we redefined more accurately the TSPs of 11-27% of the analyzed transcriptional units that were hit. The frequency of CAGE tags correlates well with results from other analyses, such as serial analysis of gene expression, and furthermore maps the TSPs more accurately, including in tissue-specific cases. The high-throughput nature of this technology paves the way for understanding gene networks via correlation of promoter usage and gene transcriptional factor expression.
Collapse
Affiliation(s)
- Toshiyuki Shiraki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Yokohama Institute 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1166
|
Hurowitz EH, Brown PO. Genome-wide analysis of mRNA lengths in Saccharomyces cerevisiae. Genome Biol 2003; 5:R2. [PMID: 14709174 PMCID: PMC395734 DOI: 10.1186/gb-2003-5-1-r2] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2003] [Revised: 11/18/2003] [Accepted: 11/21/2003] [Indexed: 11/17/2022] Open
Abstract
A novel 'Virtual Northern' method provides a practical and efficient method for genome-scale analysis of mRNA transcript lengths. A study in Saccharomyces cerevisiae has revealed that approximately 12-15% of the yeast genome is represented in untranslated sequences of mRNAs. Background Although the protein-coding sequences in the Saccharomyces cerevisiae genome have been studied and annotated extensively, much less is known about the extent and characteristics of the untranslated regions of yeast mRNAs. Results We developed a 'Virtual Northern' method, using DNA microarrays for genome-wide systematic analysis of mRNA lengths. We used this method to measure mRNAs corresponding to 84% of the annotated open reading frames (ORFs) in the S. cerevisiae genome, with high precision and accuracy (measurement errors ± 6-7%). We found a close linear relationship between mRNA lengths and the lengths of known or predicted translated sequences; mRNAs were typically around 300 nucleotides longer than the translated sequences. Analysis of genes deviating from that relationship identified ORFs with annotation errors, ORFs that appear not to be bona fide genes, and potentially novel genes. Interestingly, we found that systematic differences in the total length of the untranslated sequences in mRNAs were related to the functions of the encoded proteins. Conclusions The Virtual Northern method provides a practical and efficient method for genome-scale analysis of transcript lengths. Approximately 12-15% of the yeast genome is represented in untranslated sequences of mRNAs. A systematic relationship between the lengths of the untranslated regions in yeast mRNAs and the functions of the proteins they encode may point to an important regulatory role for these sequences.
Collapse
MESH Headings
- Blotting, Northern/methods
- DNA Transposable Elements/genetics
- DNA, Intergenic/genetics
- Gene Expression Profiling/methods
- Gene Expression Regulation, Fungal/genetics
- Genes, Fungal/genetics
- Genes, Overlapping/genetics
- Genetic Variation/genetics
- Genome, Fungal
- Introns/genetics
- Multigene Family/genetics
- Open Reading Frames/genetics
- RNA/genetics
- RNA, Fungal/genetics
- RNA, Messenger/genetics
- RNA, Mitochondrial
- RNA, Ribosomal/genetics
- RNA, Small Nuclear/genetics
- RNA, Transfer/genetics
- Saccharomyces cerevisiae/genetics
- Saccharomyces cerevisiae Proteins/genetics
- Sequence Homology, Nucleic Acid
- Transcription, Genetic/genetics
Collapse
Affiliation(s)
- Evan H Hurowitz
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305-5307, USA
| | - Patrick O Brown
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305-5307, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305-5428, USA
| |
Collapse
|
1167
|
Sharov AA, Piao Y, Matoba R, Dudekula DB, Qian Y, VanBuren V, Falco G, Martin PR, Stagg CA, Bassey UC, Wang Y, Carter MG, Hamatani T, Aiba K, Akutsu H, Sharova L, Tanaka TS, Kimber WL, Yoshikawa T, Jaradat SA, Pantano S, Nagaraja R, Boheler KR, Taub D, Hodes RJ, Longo DL, Schlessinger D, Keller J, Klotz E, Kelsoe G, Umezawa A, Vescovi AL, Rossant J, Kunath T, Hogan BLM, Curci A, D'Urso M, Kelso J, Hide W, Ko MSH. Transcriptome analysis of mouse stem cells and early embryos. PLoS Biol 2003; 1:E74. [PMID: 14691545 PMCID: PMC300684 DOI: 10.1371/journal.pbio.0000074] [Citation(s) in RCA: 135] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2003] [Accepted: 10/13/2003] [Indexed: 12/26/2022] Open
Abstract
Understanding and harnessing cellular potency are fundamental in biology and are also critical to the future therapeutic use of stem cells. Transcriptome analysis of these pluripotent cells is a first step towards such goals. Starting with sources that include oocytes, blastocysts, and embryonic and adult stem cells, we obtained 249,200 high-quality EST sequences and clustered them with public sequences to produce an index of approximately 30,000 total mouse genes that includes 977 previously unidentified genes. Analysis of gene expression levels by EST frequency identifies genes that characterize preimplantation embryos, embryonic stem cells, and adult stem cells, thus providing potential markers as well as clues to the functional features of these cells. Principal component analysis identified a set of 88 genes whose average expression levels decrease from oocytes to blastocysts, stem cells, postimplantation embryos, and finally to newborn tissues. This can be a first step towards a possible definition of a molecular scale of cellular potency. The sequences and cDNA clones recovered in this work provide a comprehensive resource for genes functioning in early mouse embryos and stem cells. The nonrestricted community access to the resource can accelerate a wide range of research, particularly in reproductive and regenerative medicine. 250,000 EST sequences from oocytes, blastocysts, and embryonic and adult stem cells contribute to the annotation of the mouse genome and suggest genes that contribute to the unique features of these developmental stages and cell types
Collapse
Affiliation(s)
- Alexei A Sharov
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Yulan Piao
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Ryo Matoba
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Dawood B Dudekula
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Yong Qian
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Vincent VanBuren
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Geppino Falco
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Patrick R Martin
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Carole A Stagg
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Uwem C Bassey
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Yuxia Wang
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Mark G Carter
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Toshio Hamatani
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Kazuhiro Aiba
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Hidenori Akutsu
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Lioudmila Sharova
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Tetsuya S Tanaka
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Wendy L Kimber
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | | | - Saied A Jaradat
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Serafino Pantano
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Ramaiah Nagaraja
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Kenneth R Boheler
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Dennis Taub
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | - Richard J Hodes
- 1National Institute on AgingBaltimore, MarylandUnited States of America
- 2National Cancer InstituteBethesda, MarylandUnited States of America
| | - Dan L Longo
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| | | | - Jonathan Keller
- 3Basic Research Program, SAIC–Frederick, National Cancer Institute at FrederickFrederick, MarylandUnited States of America
| | - Emily Klotz
- 2National Cancer InstituteBethesda, MarylandUnited States of America
| | - Garnett Kelsoe
- 4Duke University Medical CenterDurham, North CarolinaUnited States of America
| | - Akihiro Umezawa
- 5National Research Institute for Child Health and DevelopmentTokyoJapan
| | - Angelo L Vescovi
- 6Institute for Stem Cell Research, Ospedale San RaffaeleMilanItaly
| | | | | | - Brigid L. M Hogan
- 4Duke University Medical CenterDurham, North CarolinaUnited States of America
| | - Anna Curci
- 8Institute of Genetics and Biophysics, Consiglio Nazionale delle RicercheNaplesItaly
| | - Michele D'Urso
- 8Institute of Genetics and Biophysics, Consiglio Nazionale delle RicercheNaplesItaly
| | - Janet Kelso
- 9South African National Bioinformatics Institute, University of the Western CapeBellvilleSouth Africa
| | - Winston Hide
- 9South African National Bioinformatics Institute, University of the Western CapeBellvilleSouth Africa
| | - Minoru S. H Ko
- 1National Institute on AgingBaltimore, MarylandUnited States of America
| |
Collapse
|
1168
|
Schwartz F, Duka A, Triantafyllidi E, Johns C, Duka I, Cui J, Gavras H. Serial analysis of gene expression in mouse kidney following angiotensin II administration. Physiol Genomics 2003; 16:90-8. [PMID: 14570981 DOI: 10.1152/physiolgenomics.00108.2003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
As a new line of inquiry into the molecular mechanisms underlying pathophysiological processes associated with angiotensin (ANG II)-dependent hypertension, we applied the method of serial analysis of gene expression (SAGE) to examine genome-wide transcription changes in the kidneys of mice that developed hypertension in response to chronic ANG II administration. Mice were infused subcutaneously via osmotic minipumps with ANG II for 7 days, and systolic blood pressure was measured by tail-cuff plethysmography. Subsequently, mice were euthanized, and the total RNA isolated from the kidneys was used to construct SAGE libraries. Comparison of 11,447 SAGE tags from the hypertensive kidneys, representing 5,740 unique transcripts, and 11,273 tags from the control kidneys, corresponding to 5,619 different transcripts, identified genes that are significantly ( P < 0.05) down- or upregulated in the hypertensive kidney. Our assessment of the genome-wide influence of ANG II resulted in the detection of several novel genes and in a recognition of potential new roles for the previously characterized genes, thus providing new probes with which to further explore the ANG II effects in normal and disease states.
Collapse
Affiliation(s)
- Faina Schwartz
- Department of Medicine, Hypertension Section, Boston University School of Medicine, Boston, Massachusetts 02118, USA.
| | | | | | | | | | | | | |
Collapse
|
1169
|
Futaki S, Hayashi Y, Yamashita M, Yagi K, Bono H, Hayashizaki Y, Okazaki Y, Sekiguchi K. Molecular basis of constitutive production of basement membrane components. Gene expression profiles of Engelbreth-Holm-Swarm tumor and F9 embryonal carcinoma cells. J Biol Chem 2003; 278:50691-701. [PMID: 12968032 DOI: 10.1074/jbc.m304985200] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Engelbreth-Holm-Swarm (EHS) tumors produce large amounts of basement membrane (BM) components that are widely used as cell culture substrates mimicking BM functions. To delineate the tissue/organ origin of the tumor and the mechanisms operating in the BM overproduction, a genome-wide expression profile of EHS tumor was analyzed using RIKEN cDNA microarrays containing approximately 40,000 mouse cDNA clones. Expression profiles of F9 embryonal carcinoma cells that produce laminin-1 and other BM components upon differentiation into parietal endoderm-like cells (designated F9-PE) were also analyzed. Hierarchical clustering analysis showed that the gene expression profiles of EHS and F9-PE were the most similar among 49 mouse tissues/organs in the RIKEN Expression Array Database, suggesting that EHS tumor is parietal endoderm-derived. Quantitative PCR analysis confirmed that not only BM components but also the machineries required for efficient production of BM components, such as enzymes involved in post-translational modification and molecular chaperones, were highly expressed in both EHS and F9-PE. Pairs of similar transcription factor isoforms, such as Gata4/Gata6, Sox7/Sox17, and Cited1/Cited2, were also highly expressed in both EHS tumor and F9-PE. Time course analysis of F9 differentiation showed that up-regulation of the transcription factors was associated with those of BM components, suggesting their involvement in parietal endoderm specification and overproduction of the BM components.
Collapse
Affiliation(s)
- Sugiko Futaki
- Sekiguchi Biomatrix Signaling Project, ERATO, Japanese Science and Technology Agency (JST), Aichi Medical University, 21 Karimata, Yazako Nagakute-cho, Aichi-gun, Aichi, 480-1195, Japan
| | | | | | | | | | | | | | | |
Collapse
|
1170
|
Zhang T, Haws P, Wu Q. Multiple variable first exons: a mechanism for cell- and tissue-specific gene regulation. Genome Res 2003; 14:79-89. [PMID: 14672974 PMCID: PMC314283 DOI: 10.1101/gr.1225204] [Citation(s) in RCA: 107] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A large family of neural protocadherin (Pcdh) proteins is encoded by three closely linked mammalian gene clusters (alpha, beta, and gamma). Pcdh alpha and gamma clusters have a striking genomic organization. Specifically, each "variable" exon is spliced to a common set of downstream "constant" exons within each cluster. Recent studies demonstrated that the cell-specific expression of each Pcdh gene is determined bya combination of variable-exon promoter activation and cis-splicing of the corresponding variable exon to the first constant exon. To determine whether there are other similarly organized gene clusters in mammalian genomes, we performed a genome-wide search and identified a large number of mammalian genes containing multiple variable first exons. Here we describe several clusters that contain about a dozen variable exons arrayed in tandem, including UDP glucuronosyltransferase (UGT1), plectin, neuronal nitric oxide synthase (NOS1), and glucocorticoid receptor (GR) genes. In all these cases, multiple variable first exons are each spliced to a common set of downstream constant exons to generate diverse functional mRNAs. As an example, we analyzed the tissue-specific expression profile of the mouse UGT1 repertoire and found that multiple isoforms are expressed in a tissue-specific manner. Therefore, this variable and constant genomic organization provides a genetic mechanism for directing distinct cell- and tissue-specific patterns of gene expression.
Collapse
Affiliation(s)
- Theresa Zhang
- Department of Bioinformatics, Merck Research Labs, Rahway, New Jersey 07065, USA
| | | | | |
Collapse
|
1171
|
Osato N, Yamada H, Satoh K, Ooka H, Yamamoto M, Suzuki K, Kawai J, Carninci P, Ohtomo Y, Murakami K, Matsubara K, Kikuchi S, Hayashizaki Y. Antisense transcripts with rice full-length cDNAs. Genome Biol 2003; 5:R5. [PMID: 14709177 PMCID: PMC395737 DOI: 10.1186/gb-2003-5-1-r5] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2003] [Revised: 10/22/2003] [Accepted: 11/07/2003] [Indexed: 11/29/2022] Open
Abstract
In this study, 687 sense-antisense transcript pairs from 32,127 full-length rice cDNA sequences were identified by aligning the cDNA sequences with rice genome sequences. The large number of pairs suggests that gene regulation by antisense transcripts occurs in plants and not only in animals. Background Natural antisense transcripts control gene expression through post-transcriptional gene silencing by annealing to the complementary sequence of the sense transcript. Because many genome and mRNA sequences have become available recently, genome-wide searches for sense-antisense transcripts have been reported, but few plant sense-antisense transcript pairs have been studied. The Rice Full-Length cDNA Sequencing Project has enabled computational searching of a large number of plant sense-antisense transcript pairs. Results We identified sense-antisense transcript pairs from 32,127 full-length rice cDNA sequences produced by this project and public rice mRNA sequences by aligning the cDNA sequences with rice genome sequences. We discovered 687 bidirectional transcript pairs in rice, including sense-antisense transcript pairs. Both sense and antisense strands of 342 pairs (50%) showed homology to at least one expressed sequence tag other than that of the pair. Microarray analysis showed 82 pairs (32%) out of 258 pairs on the microarray were more highly expressed than the median expression intensity of 21,938 rice transcriptional units. Both sense and antisense strands of 594 pairs (86%) had coding potential. Conclusions The large number of plant sense-antisense transcript pairs suggests that gene regulation by antisense transcripts occurs in plants and not only in animals. On the basis of our results, experiments should be carried out to analyze the function of plant antisense transcripts.
Collapse
MESH Headings
- DNA, Antisense/chemistry
- DNA, Antisense/classification
- DNA, Antisense/genetics
- DNA, Complementary/chemistry
- DNA, Complementary/classification
- DNA, Complementary/genetics
- DNA, Plant/chemistry
- DNA, Plant/classification
- DNA, Plant/genetics
- Gene Expression Profiling/methods
- Gene Expression Regulation, Plant/genetics
- Oligonucleotide Array Sequence Analysis/methods
- Oryza/genetics
- RNA Interference/physiology
- RNA, Antisense/classification
- RNA, Antisense/genetics
- RNA, Messenger/chemistry
- RNA, Messenger/classification
- RNA, Messenger/genetics
- RNA, Plant/chemistry
- RNA, Plant/classification
- RNA, Plant/genetics
- Sequence Homology, Nucleic Acid
- Transcription, Genetic/physiology
Collapse
Affiliation(s)
- Naoki Osato
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Science Center (GSC), RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, Japan 230-0045
| | - Hitomi Yamada
- Department of Molecular Biology, National Institute of Agrobiological Sciences (NIAS), Tsukuba, Ibaraki, Japan 305-8602
| | - Kouji Satoh
- Department of Molecular Biology, National Institute of Agrobiological Sciences (NIAS), Tsukuba, Ibaraki, Japan 305-8602
| | - Hisako Ooka
- Department of Molecular Biology, National Institute of Agrobiological Sciences (NIAS), Tsukuba, Ibaraki, Japan 305-8602
| | - Makoto Yamamoto
- Hitachi Software Engineering Company Ltd, Naka-ku, Yokohama, Kanagawa, Japan 231-0015
| | - Kohji Suzuki
- Hitachi Software Engineering Company Ltd, Naka-ku, Yokohama, Kanagawa, Japan 231-0015
| | - Jun Kawai
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Science Center (GSC), RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, Japan 230-0045
- Genome Science Laboratory, RIKEN Wako Main Campus, Wako, Saitama, Japan 351-0198
| | - Piero Carninci
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Science Center (GSC), RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, Japan 230-0045
- Genome Science Laboratory, RIKEN Wako Main Campus, Wako, Saitama, Japan 351-0198
| | - Yasuhiro Ohtomo
- Laboratory of Genome Sequencing and Analysis Group, Foundation of Advancement of International Science (FAIS), Tsukuba, Ibaraki, Japan 305-0062
| | - Kazuo Murakami
- Laboratory of Genome Sequencing and Analysis Group, Foundation of Advancement of International Science (FAIS), Tsukuba, Ibaraki, Japan 305-0062
| | - Kenichi Matsubara
- Laboratory of Genome Sequencing and Analysis Group, Foundation of Advancement of International Science (FAIS), Tsukuba, Ibaraki, Japan 305-0062
| | - Shoshi Kikuchi
- Department of Molecular Biology, National Institute of Agrobiological Sciences (NIAS), Tsukuba, Ibaraki, Japan 305-8602
| | - Yoshihide Hayashizaki
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Science Center (GSC), RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, Japan 230-0045
- Genome Science Laboratory, RIKEN Wako Main Campus, Wako, Saitama, Japan 351-0198
| |
Collapse
|
1172
|
de Bono B, Chothia C. Exegesis: a procedure to improve gene predictions and its use to find immunoglobulin superfamily proteins in the human and mouse genomes. Nucleic Acids Res 2003; 31:6096-103. [PMID: 14576296 PMCID: PMC275470 DOI: 10.1093/nar/gkg828] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Exegesis is a procedure to refine the gene predictions that are produced for complex genomes, e.g. those of humans and mice. It uses the program Genewise, sequences determined by experiment, experimental maps of gene segment libraries and a new browser that allows the user to rapidly inspect and compare multiple gene maps to regions of genomic sequences. The procedure should be of general use. Here, we use the procedure to find members of the immunoglobulin superfamily in the human and mouse genomes. To do this, Exegesis was used to process the original gene predictions from the automated Ensembl annotation pipeline. Exegesis produced (i) many more complete genes and new transcripts and (ii) a mapping of the immunoglobulin and T cell receptor gene libraries to the genome, which are largely absent in the Ensembl set.
Collapse
Affiliation(s)
- Bernard de Bono
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK.
| | | |
Collapse
|
1173
|
Yamada K, Lim J, Dale JM, Chen H, Shinn P, Palm CJ, Southwick AM, Wu HC, Kim C, Nguyen M, Pham P, Cheuk R, Karlin-Newmann G, Liu SX, Lam B, Sakano H, Wu T, Yu G, Miranda M, Quach HL, Tripp M, Chang CH, Lee JM, Toriumi M, Chan MMH, Tang CC, Onodera CS, Deng JM, Akiyama K, Ansari Y, Arakawa T, Banh J, Banno F, Bowser L, Brooks S, Carninci P, Chao Q, Choy N, Enju A, Goldsmith AD, Gurjal M, Hansen NF, Hayashizaki Y, Johnson-Hopson C, Hsuan VW, Iida K, Karnes M, Khan S, Koesema E, Ishida J, Jiang PX, Jones T, Kawai J, Kamiya A, Meyers C, Nakajima M, Narusaka M, Seki M, Sakurai T, Satou M, Tamse R, Vaysberg M, Wallender EK, Wong C, Yamamura Y, Yuan S, Shinozaki K, Davis RW, Theologis A, Ecker JR. Empirical analysis of transcriptional activity in the Arabidopsis genome. Science 2003; 302:842-6. [PMID: 14593172 DOI: 10.1126/science.1088305] [Citation(s) in RCA: 662] [Impact Index Per Article: 30.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Functional analysis of a genome requires accurate gene structure information and a complete gene inventory. A dual experimental strategy was used to verify and correct the initial genome sequence annotation of the reference plant Arabidopsis. Sequencing full-length cDNAs and hybridizations using RNA populations from various tissues to a set of high-density oligonucleotide arrays spanning the entire genome allowed the accurate annotation of thousands of gene structures. We identified 5817 novel transcription units, including a substantial amount of antisense gene transcription, and 40 genes within the genetically defined centromeres. This approach resulted in completion of approximately 30% of the Arabidopsis ORFeome as a resource for global functional experimentation of the plant proteome.
Collapse
|
1174
|
Clark MS, Edwards YJK, Peterson D, Clifton SW, Thompson AJ, Sasaki M, Suzuki Y, Kikuchi K, Watabe S, Kawakami K, Sugano S, Elgar G, Johnson SL. Fugu ESTs: new resources for transcription analysis and genome annotation. Genome Res 2003; 13:2747-53. [PMID: 14613980 PMCID: PMC403817 DOI: 10.1101/gr.1691503] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2003] [Accepted: 09/10/2003] [Indexed: 10/26/2022]
Abstract
The draft Fugu rubripes genome was released in 2002, at which time relatively few cDNAs were available to aid in the annotation of genes. The data presented here describe the sequencing and analysis of 24,398 expressed sequence tags (ESTs) generated from 15 different adult and juvenile Fugu tissues, 74% of which matched protein database entries. Analysis of the EST data compared with the Fugu genome data predicts that approximately 10,116 gene tags have been generated, covering almost one-third of Fugu predicted genes. This represents a remarkable economy of effort. Comparison with the Washington University zebrafish EST assemblies indicates strong conservation within fish species, but significant differences remain. This potentially represents divergence of sequence in the 5' terminal exons and UTRs between these two fish species, although clearly, complete EST data sets are not available for either species. This project provides new Fugu resources, and the analysis adds significant weight to the argument that EST programs remain an essential resource for genome exploitation and annotation. This is particularly timely with the increasing availability of draft genome sequence from different organisms and the mounting emphasis on gene function and regulation.
Collapse
Affiliation(s)
- Melody S Clark
- MRC Rosalind Franklin Centre for Genomics Research, (formerly known as the MRC UK HGMP Resource Centre), Genome Campus, Hinxton, Cambridge, CB10 1SB, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1175
|
Holmes G, Crooijmans R, Groenen M, Niswander L. ALC (adjacent to LMX1 in chick) is a novel dorsal limb mesenchyme marker. Gene Expr Patterns 2003; 3:735-41. [PMID: 14643681 DOI: 10.1016/s1567-133x(03)00139-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
During dorsal-ventral (DV) patterning of the vertebrate limb, WNT7A is expressed in dorsal limb ectoderm and activates the expression of LMX1 (in chick; Lmx1b in mouse) in dorsal limb mesenchyme, resulting in the appropriate development of dorsal cell fates. These two genes are the only known factors involved in directing dorsal patterning and the molecular events that link these two factors or that occur downstream of LMX1/1b are unknown. We have isolated a novel chick transcript, ALC (adjacent to LMX1 in chick). ALC is located 5.3 kb from the 5' end of LMX1 and is transcribed in the opposite direction. It is expressed in a sub-set of tissues expressing LMX1, most notably in the dorsal mesenchyme of the limb, and thus is the second gene discovered with such a distribution in the limb. Misexpression studies with viral constructs show that ALC is downstream of WNT7A but not of LMX1. ALC shows no homology to known genes and its function remains to be determined. However, similarly placed transcripts occur in the human and mouse genomes, and we demonstrate that a mouse transcript is also expressed in dorsal limb mesenchyme.
Collapse
Affiliation(s)
- Greg Holmes
- Molecular Biology Program, Memorial Sloan Kettering Cancer Center, 1275 York Ave, New York, NY 10021, USA
| | | | | | | |
Collapse
|
1176
|
Abbott DE, Pritchard C, Clegg NJ, Ferguson C, Dumpit R, Sikes RA, Nelson PS. Expressed sequence tag profiling identifies developmental and anatomic partitioning of gene expression in the mouse prostate. Genome Biol 2003; 4:R79. [PMID: 14659016 PMCID: PMC329418 DOI: 10.1186/gb-2003-4-12-r79] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2003] [Revised: 10/28/2003] [Accepted: 11/12/2003] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND The prostate gland is an organ with highly specialized functional attributes that serves to enhance the fertility of mammalian species. Much of the information pertaining to normal and pathological conditions affecting the prostate has been obtained through extensive developmental, biochemical and genetic analyses of rodent species. Although important insights can be obtained through detailed anatomical and histological assessments of mouse and rat models, further mechanistic explanations are greatly aided through studies of gene and protein expression. RESULTS In this article we characterize the repertoire of genes expressed in the normal developing mouse prostate through the analysis of 50,562 expressed sequence tags derived from 14 mouse prostate cDNA libraries. Sequence assemblies and annotations identified 15,009 unique transcriptional units of which more than 600 represent high quality assemblies without corresponding annotations in public gene expression databases. Quantitative analyses demonstrate distinct anatomical and developmental partitioning of prostate gene expression. This finding may assist in the interpretation of comparative studies between human and mouse and guide the development of new transgenic murine disease models. The identification of several novel genes is reported, including a new member of the beta-defensin gene family with prostate-restricted expression. CONCLUSIONS These findings suggest a potential role for the prostate as a defensive barrier for entry of pathogens into the genitourinary tract and, further, serve to emphasize the utility of the continued evaluation of transcriptomes from a diverse repertoire of tissues and cell types.
Collapse
Affiliation(s)
- Denise E Abbott
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Colin Pritchard
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Nigel J Clegg
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Camari Ferguson
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Ruth Dumpit
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Robert A Sikes
- Laboratory for Cancer Ontogeny and Therapeutics, Department of Biological Sciences, University of Delaware, Newark, DE 19716, USA
| | - Peter S Nelson
- Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
- Division of Clinical Research, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| |
Collapse
|
1177
|
Mattick JS. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. Bioessays 2003; 25:930-9. [PMID: 14505360 DOI: 10.1002/bies.10332] [Citation(s) in RCA: 317] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The central dogma of biology holds that genetic information normally flows from DNA to RNA to protein. As a consequence it has been generally assumed that genes generally code for proteins, and that proteins fulfil not only most structural and catalytic but also most regulatory functions, in all cells, from microbes to mammals. However, the latter may not be the case in complex organisms. A number of startling observations about the extent of non-protein-coding RNA (ncRNA) transcription in the higher eukaryotes and the range of genetic and epigenetic phenomena that are RNA-directed suggests that the traditional view of the structure of genetic regulatory systems in animals and plants may be incorrect. ncRNA dominates the genomic output of the higher organisms and has been shown to control chromosome architecture, mRNA turnover and the developmental timing of protein expression, and may also regulate transcription and alternative splicing. This paper re-examines the available evidence and suggests a new framework for considering and understanding the genomic programming of biological complexity, autopoietic development and phenotypic variation.
Collapse
Affiliation(s)
- John S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St. Lucia QLD 4072, Australia.
| |
Collapse
|
1178
|
Iourgenko V, Zhang W, Mickanin C, Daly I, Jiang C, Hexham JM, Orth AP, Miraglia L, Meltzer J, Garza D, Chirn GW, McWhinnie E, Cohen D, Skelton J, Terry R, Yu Y, Bodian D, Buxton FP, Zhu J, Song C, Labow MA. Identification of a family of cAMP response element-binding protein coactivators by genome-scale functional analysis in mammalian cells. Proc Natl Acad Sci U S A 2003; 100:12147-52. [PMID: 14506290 PMCID: PMC218727 DOI: 10.1073/pnas.1932773100] [Citation(s) in RCA: 300] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2003] [Indexed: 11/18/2022] Open
Abstract
This report describes an unbiased method for systematically determining gene function in mammalian cells. A total of 20,704 predicted human full-length cDNAs were tested for induction of the IL-8 promoter. A number of genes, including those for cytokines, receptors, adapters, kinases, and transcription factors, were identified that induced the IL-8 promoter through known regulatory sites. Proteins that acted through a cooperative interaction between an AP-1 and an unrecognized cAMP response element (CRE)-like site were also identified. A protein, termed transducer of regulated cAMP response element-binding protein (CREB) (TORC1), was identified that activated expression through the variant CRE and consensus CRE sites. TORC1 potently induced known CREB1 target genes, bound CREB1, and activated expression through a potent transcription activation domain. A functional Drosophila TORC gene was also identified. Thus, TORCs represent a family of highly conserved CREB coactivators that may control the potency and specificity of CRE-mediated responses.
Collapse
Affiliation(s)
- Vadim Iourgenko
- Department of Functional Genomics, Novartis Institute for Biomedical Research, 100 Technology Square, Cambridge, MA 02139, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1179
|
Bell SM, Schreiner CM, Waclaw RR, Campbell K, Potter SS, Scott WJ. Sp8 is crucial for limb outgrowth and neuropore closure. Proc Natl Acad Sci U S A 2003; 100:12195-200. [PMID: 14526104 PMCID: PMC218735 DOI: 10.1073/pnas.2134310100] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2003] [Indexed: 11/18/2022] Open
Abstract
In this report we describe the developmental expression and function of Sp8, a member of the Sp family of zinc finger transcription factors, and provide evidence that the legless transgene insertional mutant is a hypomorphic allele of the Sp8 gene. Sp8 is expressed during embryogenesis in the forming apical ectodermal ridge (AER), restricted regions of the central nervous system, and tail bud. Targeted deletion of the Sp8 gene gives a striking phenotype, with severe truncation of both forelimbs and hindlimbs, absent tail, as well as defects in anterior and posterior neuropore closure leading to exencephaly and spina bifida. Outgrowth of the limb depends on formation of the AER, a signaling center that forms at the limb bud apex. In Sp8 mutants, the AER precursor cells are induced and initially express multiple appropriate marker genes, but expression of these genes is not maintained and progression to a mature AER is blocked. These observations indicate that Sp8 functions downstream of Wnt3, Fgf10, and Bmpr1a in the signaling cascade that mediates AER formation.
Collapse
Affiliation(s)
- Sheila M Bell
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Avenue, Cincinnati, OH 45229, USA.
| | | | | | | | | | | |
Collapse
|
1180
|
Schultz N, Hamra FK, Garbers DL. A multitude of genes expressed solely in meiotic or postmeiotic spermatogenic cells offers a myriad of contraceptive targets. Proc Natl Acad Sci U S A 2003; 100:12201-6. [PMID: 14526100 PMCID: PMC218736 DOI: 10.1073/pnas.1635054100] [Citation(s) in RCA: 455] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Understanding mammalian spermatozoan development and the events surrounding fertilization has grown slowly, in part because of uncertainty about the number and identity of the cellular components involved. Determination of those transcripts expressed specifically by germ cells should provide an inclusive list of probable critical proteins. Here, total mouse testis transcript profiles were trimmed of transcripts found in cultures enriched in Sertoli or interstitial cells to yield a germ cell-enriched transcript profile. Monitoring of changes of this profile in the developing testis identified 1,652 genes whose transcript abundance increased markedly coincident with the onset of meiosis. Remarkably, 351 of these genes (approximately equal to 20%) appear to be expressed only in the male germline. Germ cell-specific transcripts are much less common earlier in testis development. Further analysis of the UniGene EST database coupled with quantitative PCR indicates that approximately 4% of the mouse genome is dedicated to expression in postmeiotic male germ cells. Most or many of the protein products of these transcripts are probably retained in mature spermatozoa. Targeted disruption of 19 of these genes has indicated that a majority have roles critical for normal fertility. Thus, we find an astonishing number of genes expressed specifically by male germ cells late in development. This extensive group provides a plethora of potential targets for germ cell-directed contraception and a staggering number of candidate proteins that could be critical for fertilization.
Collapse
Affiliation(s)
- Nikolaus Schultz
- Cecil H. and Ida Green Center for Reproductive Biology Sciences, Howard Hughes Medical Institute and Department of Pharmacology, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
| | | | | |
Collapse
|
1181
|
Gardiner K, Fortna A, Bechtel L, Davisson MT. Mouse models of Down syndrome: how useful can they be? Comparison of the gene content of human chromosome 21 with orthologous mouse genomic regions. Gene 2003; 318:137-47. [PMID: 14585506 DOI: 10.1016/s0378-1119(03)00769-8] [Citation(s) in RCA: 157] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
With an incidence of approximately 1 in 700 live births, Down syndrome (DS) remains the most common genetic cause of mental retardation. The phenotype is assumed to be due to overexpression of some number of the >300 genes encoded by human chromosome 21. Mouse models, in particular the chromosome 16 segmental trisomies, Ts65Dn and Ts1Cje, are indispensable for DS-related studies of gene-phenotype correlations. Here we compare the updated gene content of the finished sequence of human chromosome 21 (364 genes and putative genes) with the gene content of the homologous mouse genomic regions (291 genes and putative genes) obtained from annotation of the public sector C57Bl/6 draft sequence. Annotated genes fall into one of three classes. First, there are 170 highly conserved, human/mouse orthologues. Second, there are 83 minimally conserved, possible orthologues. Included among the conserved and minimally conserved genes are 31 antisense transcripts. Third, there are species-specific genes: 111 spliced human transcripts show no orthologues in the syntenic mouse regions although 13 have homologous sequences elsewhere in the mouse genomic sequence, and 38 spliced mouse transcripts show no identifiable human orthologues. While these species-specific genes are largely based solely on spliced EST data, a majority can be verified in RNA expression experiments. In addition, preliminary data suggest that many human-specific transcripts may represent a novel class of primate-specific genes. Lastly, updated functional annotation of orthologous genes indicates genes encoding components of several cellular pathways are dispersed throughout the orthologous mouse chromosomal regions and are not completely represented in the Down syndrome segmental mouse models. Together, these data point out the potential for existing mouse models to produce extraneous phenotypes and to fail to produce DS-relevant phenotypes.
Collapse
Affiliation(s)
- Katheleen Gardiner
- Eleanor Roosevelt Institute at the University of Denver, 1899 Gaylord Street, Denver, CO 80206-1210, USA.
| | | | | | | |
Collapse
|
1182
|
Dalla E, Verardo R, Lazarević D, Marchionni L, Reid JF, Bahar N, Klarić E, Marcuzzi G, Marzio R, Belgrano A, Licastro D, Schneider C. LNCIB human full-length cDNAs collection: towards a better comprehension of the human transcriptome. C R Biol 2003; 326:967-70. [PMID: 14744102 DOI: 10.1016/j.crvi.2003.09.029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
LNCIB has been producing a variety of human full-length-enriched, normalized and subtracted cDNA libraries from various cell lines and tissues in different developmental stages by using the CAP-Trapper method. By sequencing 23000 clones of these libraries we identified a pool of about 5800 good quality unique cDNAs. After BLAST analysis on Human RefSeq/Unigene databases, 1717 of these sequences remained with no or poor annotation. We show that cross-species comparative BLAST resulted as a valid tool for the annotation of orthologous genes.
Collapse
Affiliation(s)
- Emiliano Dalla
- Laboratorio Nazionale CIB, Functional Genomics Group, AREA Science Park, Padriciano 99, Trieste 34012, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1183
|
Tomaru Y, Kondo S, Suzuki M, Hayashizaki Y. A comprehensive search for HNF-3α-regulated genes in mouse hepatoma cells by 60K cDNA microarray and chromatin immunoprecipitation/PCR analysis. Biochem Biophys Res Commun 2003; 310:667-74. [PMID: 14521963 DOI: 10.1016/j.bbrc.2003.08.148] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
To characterize the regulatory pattern by a specific transcription regulatory factor, we used a combination of expression analysis with the mouse cDNA microarray composed of 60,000 cDNA clones and cross-linking/chromatin immunoprecipitation (X-ChIP) followed by comparative PCR. Overexpression of mouse hepatocyte nuclear factor-3alpha (HNF-3alpha) in a mouse hepatoma cell line resulted in accompanied perturbed expression of more than 1500 genes. Search for HNF-3alpha consensus recognition sequences in the upstream regions of their coding sequences, which were mapped on the mouse genome, enabled us to mine 300 genes as the potential HNF-3alpha-regulated genes and classify 135 annotated ones into several functional categories. Further X-ChIP/PCR analysis demonstrated in vivo binding of HNF-3alpha to the 5(')-flanking sequences of 25 members selected out of these genes. Besides known HNF-3alpha-regulated genes such as albumin and alpha-fetoprotein genes, the genes newly identified as the HNF-3alpha-regulated ones include three encoding CDP-diacylglycerol-inositol 3-phosphatidyltransferase, phosphatidylserine decarboxylase, and phospholipase A2, which are located en suite in the lipid metabolic pathway in liver. The potential usefulness of the present approach to extensive characterization of gene expression framework directed by a specific transcription regulatory factor is discussed.
Collapse
Affiliation(s)
- Yasuhiro Tomaru
- Laboratory of Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, 1-7-22 Suehiro-Cho, Tsurumi-Ku, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | |
Collapse
|
1184
|
Ji P, Diederichs S, Wang W, Böing S, Metzger R, Schneider PM, Tidow N, Brandt B, Buerger H, Bulk E, Thomas M, Berdel WE, Serve H, Müller-Tidow C. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene 2003; 22:8031-41. [PMID: 12970751 DOI: 10.1038/sj.onc.1206928] [Citation(s) in RCA: 1788] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Early-stage non-small cell lung cancer (NSCLC) can be cured by surgical resection, but a substantial fraction of patients ultimately dies due to distant metastasis. In this study, we used subtractive hybridization to identify gene expression differences in stage I NSCLC tumors that either did or did not metastasize in the course of disease. Individual clones (n=225) were sequenced and quantitative RT-PCR verified overexpression in metastasizing samples. Several of the identified genes (eIF4A1, thymosin beta4 and a novel transcript named MALAT-1) were demonstrated to be significantly associated with metastasis in NSCLC patients (n=70). The genes' association with metastasis was stage- and histology specific. The Kaplan-Meier analyses identified MALAT-1 and thymosin beta4 as prognostic parameters for patient survival in stage I NSCLC. The novel MALAT-1 transcript is a noncoding RNA of more than 8000 nt expressed from chromosome 11q13. It is highly expressed in lung, pancreas and other healthy organs as well as in NSCLC. MALAT-1 expressed sequences are conserved across several species indicating its potentially important function. Taken together, these data contribute to the identification of early-stage NSCLC patients that are at high risk to develop metastasis. The identification of MALAT-1 emphasizes the potential role of noncoding RNAs in human cancer.
Collapse
Affiliation(s)
- Ping Ji
- Department of Medicine, University of Münster, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1185
|
Oda R, Yaoi T, Okajima S, Kobashi H, Kubo T, Fushiki S. A novel marker for terminal Schwann cells, homocysteine-responsive ER-resident protein, as isolated by a single cell PCR-differential display. Biochem Biophys Res Commun 2003; 308:872-7. [PMID: 12927800 DOI: 10.1016/s0006-291x(03)01499-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Terminal Schwann cells (TSCs) that cover motor neuron terminals are known to play important roles in maintaining neuromuscular junctions, as well as in the repair process after nerve injury. However, molecular characteristics of TSCs remain unknown, because of the difficulties in analyzing them due to their paucity. We have established a method of selectively and efficiently collecting TSCs so that cDNA analysis can be done properly. The expression of 1-2% of whole mRNAs was compared between myelinating Schwann cells (MSCs) and TSCs, and it turned out that approximately one-third of the bands could be categorized as cell-type-specific bands. TSCs thus constitute a distinct entity from the viewpoint of gene expression. As one of the cDNA clones belonging to TSC-specific bands was identified homocysteine-responsive ER-resident protein (Herp), and in situ hybridization confirmed that Herp mRNA is expressed in TSCs on motor nerve terminals but not in MSCs, both in developing and adult rats. In conclusion, we have been able to identify Herp as a novel molecular marker for TSCs.
Collapse
Affiliation(s)
- Ryo Oda
- Department of Orthopaedic Surgery, Kyoto Prefectural University of Medicine Graduate School of Medical Science, Kawaramachi-Hirokoji, Kyoto 602-8566, Japan
| | | | | | | | | | | |
Collapse
|
1186
|
Kile BT, Hentges KE, Clark AT, Nakamura H, Salinger AP, Liu B, Box N, Stockton DW, Johnson RL, Behringer RR, Bradley A, Justice MJ. Functional genetic analysis of mouse chromosome 11. Nature 2003; 425:81-6. [PMID: 12955145 DOI: 10.1038/nature01865] [Citation(s) in RCA: 164] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2003] [Accepted: 06/10/2003] [Indexed: 12/30/2022]
Abstract
Now that the mouse and human genome sequences are complete, biologists need systematic approaches to determine the function of each gene. A powerful way to discover gene function is to determine the consequence of mutations in living organisms. Large-scale production of mouse mutations with the point mutagen N-ethyl-N-nitrosourea (ENU) is a key strategy for analysing the human genome because mouse mutants will reveal functions unique to mammals, and many may model human diseases. To examine genes conserved between human and mouse, we performed a recessive ENU mutagenesis screen that uses a balancer chromosome, inversion chromosome 11 (refs 4, 5). Initially identified in the fruitfly, balancer chromosomes are valuable genetic tools that allow the easy isolation of mutations on selected chromosomes. Here we show the isolation of 230 new recessive mouse mutations, 88 of which are on chromosome 11. This genetic strategy efficiently generates and maps mutations on a single chromosome, even as mutations throughout the genome are discovered. The mutations reveal new defects in haematopoiesis, craniofacial and cardiovascular development, and fertility.
Collapse
Affiliation(s)
- Benjamin T Kile
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1187
|
Abstract
Fifty years after the publication of DNA structure, the whole human genome sequence will be officially finished. This achievement marks the beginning of the task to catalogue every human gene and identify each of their function expression patterns. Currently, researchers estimate that there are about 30,000 human genes and approximately 70% of these can be automatically predicted using a combination of ab initio and similarity-based programs. However, to experimentally investigate every gene's function, the research community requires a high-quality annotation of alternative splicing, pseudogenes, and promoter regions that can only be provided by manual intervention. Manual curation of the human genome will be a long-term project as experimental data are continually produced to confirm or refine the predictions, and new features such as noncoding RNAs and enhancers have not been fully identified. Such a highly curated human gene-set made publicly available will be a great asset for the experimental community and for future comparative genome projects.
Collapse
Affiliation(s)
- Jennifer L Ashurst
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom.
| | | |
Collapse
|
1188
|
Wang J, Li S, Zhang Y, Zheng H, Xu Z, Ye J, Yu J, Wong GKS. Vertebrate gene predictions and the problem of large genes. Nat Rev Genet 2003; 4:741-9. [PMID: 12951575 DOI: 10.1038/nrg1160] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistently low false-negative rate. The incorporation of similarity information is meant to reduce the false-positive rate, but in doing so it increases the false-negative rate. The crucial variable is gene size (including introns)--genes of the most extreme sizes, especially very large genes, are most likely to be incorrectly predicted.
Collapse
Affiliation(s)
- Jun Wang
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China
| | | | | | | | | | | | | | | |
Collapse
|
1189
|
Abstract
Functional RNAs such as ribosomal RNA and structured domains of mRNA are targets for small molecule ligands that can act as modulators of the RNA biological activity. Natural ligands for RNA display a bewildering structural and chemical complexity that has yet to be matched by synthetic RNA binders. Comparison of natural and artificial ligands for RNA may help to direct future approaches to design and synthesize potent novel scaffolds for specific recognition of RNA targets.
Collapse
Affiliation(s)
- Thomas Hermann
- Department of Computational Chemistry & Structure, Anadys Parmaceuticals, Inc., 9050 Camino Santa Fe, San Diego, CA 92121, USA.
| |
Collapse
|
1190
|
Stone LS, Vulchanova L. The pain of antisense: in vivo application of antisense oligonucleotides for functional genomics in pain and analgesia. Adv Drug Deliv Rev 2003; 55:1081-112. [PMID: 12935946 DOI: 10.1016/s0169-409x(03)00105-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
As the genomic revolution continues to evolve, there is an increasing demand for efficient and reliable tools for functional characterization of individual gene products. Antisense oligonucleotide-mediated knockdown has been used successfully as a functional genomics tool in animal models of pain and analgesia yet skepticism regarding the validity and utility of antisense technology remains. Contributing to this uncertainty are the lack of systematic studies exploring antisense oligonucleotide use in vivo and the many technical and methodological challenges intrinsic to the method. This article reviews the contributions of antisense oligonucleotide-based studies to the field of pain and analgesia and the general principles of antisense technology. A special emphasis is placed on technical issues surrounding the successful application of antisense oligonucleotides in vivo, including sequence selection, antisense oligonucleotide chemistry, DNA controls, route of administration, uptake, dose-dependence, time-course and adequate evaluation of knockdown.
Collapse
Affiliation(s)
- Laura S Stone
- Department of Neuroscience, University of Minnesota, 6-125 Jackson Hall, 321 Church Street S.E., Minneapolis, MN 55455, USA.
| | | |
Collapse
|
1191
|
Abstract
The draft human genome sequence (about 3 billion base pairs) was completed in 2001. Humans have fewer protein-coding genes than expected, and most of these are highly conserved among animals. Humans and other complex organisms produce massive amounts of non-coding RNAs, which may form another level of genetic output that controls differentiation and development. Aside from classical monogenic diseases and other differences caused by mutations and polymorphisms in protein-coding genes, much of the variation between individuals, including that which may affect our predispositions to common diseases, is probably due to differences in the non-coding regions of the genome (ie, the control architecture of the system). Within 10 years we can expect to see: increased penetration of DNA diagnostic tests to assess risk of disease, to diagnose pathogens, to determine the best treatment regimens, and for individual identification; a range of new pharmaceuticals as well as new gene and cell therapies to repair damage, to optimise health and to minimise future disease risk; and medicine become increasingly personalised, with the knowledge of individual genetic make-up and lifestyle influences.
Collapse
Affiliation(s)
- John S Mattick
- Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia.
| |
Collapse
|
1192
|
Lobley A, Pierron V, Reynolds L, Allen L, Michalovich D. Identification of human and mouse CatSper3 and CatSper4 genes: characterisation of a common interaction domain and evidence for expression in testis. Reprod Biol Endocrinol 2003; 1:53. [PMID: 12932298 PMCID: PMC184451 DOI: 10.1186/1477-7827-1-53] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/01/2003] [Accepted: 08/01/2003] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND CatSper1 and CatSper2 are two recently identified channel-like proteins, which show sperm specific expression patterns. Through targeted mutagenesis in the mouse, CatSper1 has been shown to be required for fertility, sperm motility and for cAMP induced Ca2+ current in sperm. Both channels resemble a single pore forming repeat from a four repeat voltage dependent Ca2+ /Na+ channel. However, neither CatSper1 or CatSper2 have been shown to function as cation channels when transfected into cells, singly or in conjunction. As the pore forming units of voltage gated cation channels form a tetramer it has been suggested that the known CatSper proteins require additional subunits and/or interaction partners to function. RESULTS Using in silico gene identification and prediction techniques, we have identified two further members of the CatSper family, CatSper3 and Catsper4. Each carries a single channel-forming domain with the predicted pore-loop containing the consensus sequence TxDxW. Each of the new CatSper genes has evidence for expression in the testis. Furthermore we identified coiled-coil protein-protein interaction domains in the C-terminal tails of each of the CatSper channels, implying that CatSper channels 1,2,3 and 4 may interact directly or indirectly to form a functional tetramer. CONCLUSIONS The topological and sequence relationship of CatSper1 and CatSper2 to the four repeat Ca2+ /Na+ channels suggested other members of this family may exist. We have identified a further two novel CatSper genes, conserved in both the human and mouse genomes. Furthermore, all four of the CatSper proteins are predicted to contain a common coiled-coil protein-protein interaction domain in their C-terminal tail. Coupled with expression data this leads to the hypothesis that the CatSper proteins form a functional hetero-tetrameric channel in sperm.
Collapse
Affiliation(s)
- Anna Lobley
- Target Discovery, Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
- Discovery Informatics Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
| | - Valerie Pierron
- Discovery Biology, Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
| | - Lindsey Reynolds
- Discovery Biology, Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
| | - Liz Allen
- Discovery Biology, Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
| | - David Michalovich
- Target Discovery, Inpharmatica Ltd, 60 Charlotte Street, London W1T 2NU, UK
| |
Collapse
|
1193
|
Tuggle CK, Green JA, Fitzsimmons C, Woods R, Prather RS, Malchenko S, Soares BM, Kucaba T, Crouch K, Smith C, Tack D, Robinson N, O'Leary B, Scheetz T, Casavant T, Pomp D, Edeal BJ, Zhang Y, Rothschild MF, Garwood K, Beavis W. EST-based gene discovery in pig: virtual expression patterns and comparative mapping to human. Mamm Genome 2003; 14:565-79. [PMID: 12925889 DOI: 10.1007/s00335-002-2263-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2003] [Accepted: 04/03/2003] [Indexed: 10/26/2022]
Abstract
A molecular understanding of porcine reproduction is of biological interest and economic importance. Our Midwest Consortium has produced cDNA libraries containing the majority of genes expressed in major female reproductive tissues, and we have deposited into public databases 21,499 expressed sequence tag (EST) gene sequences from the 3' end of clones from these libraries. These sequences represent 10,574 different genes, based on sequence comparison among these data, and comparison with existing porcine ESTs and genes indicate as many as 4652 of these EST clusters are novel. In silico analysis identified sequences that are expressed in specific pig tissues or organs and confirmed the broad expression in pig for many genes ubiquitously expressed in human tissues. Furthermore, we have developed computer software to identify sequence similarity of these pig genes with their human counterparts, and to extract the mapping information of these human homologues from genome databases. We demonstrate the utility of this software for comparative mapping by localizing 61 genes on the porcine physical map for Chromosomes (Chrs) 5, 10, and 14.
Collapse
Affiliation(s)
- Christopher K Tuggle
- Center for Integrated Animal Genomics and Department of Animal Science, Iowa State University, 2255 Kildee Hall, Ames, Iowa 50011, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1194
|
Feng X, Carlton JM, Joy DA, Mu J, Furuya T, Suh BB, Wang Y, Barnwell JW, Su XZ. Single-nucleotide polymorphisms and genome diversity in Plasmodium vivax. Proc Natl Acad Sci U S A 2003; 100:8502-7. [PMID: 12799466 PMCID: PMC166258 DOI: 10.1073/pnas.1232502100] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2003] [Indexed: 01/26/2023] Open
Abstract
The study of genetic variation in malaria parasites has practical significance for developing strategies to control the disease. Vaccines based on highly polymorphic antigens may be confounded by allelic restriction of the host immune response. In response to drug pressure, a highly plastic genome may generate resistant mutants more easily than a monomorphic one. Additionally, the study of the distribution of genomic polymorphisms may provide information leading to the identification of genes associated with traits such as parasite development and drug resistance. Indeed, the age and diversity of the human malaria parasite Plasmodium falciparum has been the subject of recent debate, because an ancient parasite with a complex genome is expected to present greater challenges for drug and vaccine development. The genome diversity of the important human pathogen Plasmodium vivax, however, remains essentially unknown. Here we analyze an approximately 100-kb contiguous chromosome segment from five isolates, revealing 191 single-nucleotide polymorphisms (SNPs) and 44 size polymorphisms. The SNPs are not evenly distributed across the segment with blocks of high and low diversity. Whereas the majority (approximately 63%) of the SNPs are in intergenic regions, introns contain significantly less SNPs than intergenic sequences. Polymorphic tandem repeats are abundant and are more uniformly distributed at a frequency of about one polymorphic tandem repeat per 3 kb. These data show that P. vivax has a highly diverse genome, and provide useful information for further understanding the genome diversity of the parasite.
Collapse
Affiliation(s)
- Xiaorong Feng
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Jane M. Carlton
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Deirdre A. Joy
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Jianbing Mu
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Tetsuya Furuya
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Bernard B. Suh
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Yufeng Wang
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - John W. Barnwell
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| | - Xin-Zhuan Su
- Laboratory of Malaria and Vector Research,
National Institute of Allergy and Infectious Diseases, National Institutes of
Health, Bethesda, MD 20892; Parasite Genomics
Group, The Institute for Genomic Research, Rockville, MD 20850;
Department of Bioinformatics, American Type
Culture Collection, Manassas, VA 20110; and
Division of Parasitic Diseases, Centers for
Disease Control and Prevention, Atlanta, GA 30341
| |
Collapse
|
1195
|
Halees AS, Leyfer D, Weng Z. PromoSer: A large-scale mammalian promoter and transcription start site identification service. Nucleic Acids Res 2003; 31:3554-9. [PMID: 12824364 PMCID: PMC168956 DOI: 10.1093/nar/gkg549] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Proximal promoters have a major impact on transcriptional regulation. Studies of the sequence-based nature of this regulation usually require collection of proximal promoter sequences for large sets of co-regulated genes. We report a newly implemented web service that facilitates extraction of user specified regions around the transcription start site of all annotated human, mouse or rat genes. The transcription start sites have been identified computationally by considering alignments of a large number of partial and full-length mRNA sequences to genomic DNA, with provision for alternative promoters. The service is publicly available at http://biowulf.bu.edu/zlab/PromoSer/.
Collapse
Affiliation(s)
- Anason S Halees
- Bioinformatics Program, Boston University, 44 Cummington Street, Boston, MA 02215, USA
| | | | | |
Collapse
|
1196
|
Sorek R, Ast G. Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res 2003; 13:1631-7. [PMID: 12840041 PMCID: PMC403736 DOI: 10.1101/gr.1208803] [Citation(s) in RCA: 184] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2003] [Accepted: 03/26/2003] [Indexed: 12/22/2022]
Abstract
Comparison of the sequences of mouse and human genomes revealed a surprising number of nonexonic, nonexpressed conserved sequences, for which no function could be assigned. To study the possible correlation between these conserved intronic sequences and alternative splicing regulation, we developed a method to identify exons that are alternatively spliced in both human and mouse. We compiled two exon sets: one of alternatively spliced conserved exons and another of constitutively spliced conserved exons. We found that 77% of the conserved alternatively spliced exons were flanked on both sides by long conserved intronic sequences. In comparison, only 17% of the conserved constitutively spliced exons were flanked by such conserved intronic sequences. The average length of the conserved intronic sequences was 103 bases in the upstream intron and 94 bases in the downstream intron. The average identity levels in the immediately flanking intronic sequences were 88% and 80% for the upstream and downstream introns, respectively, higher than the conservation levels of 77% that were measured in promoter regions. Our results suggest that the function of many of the intronic sequence blocks that are conserved between human and mouse is the regulation of alternative splicing.
Collapse
Affiliation(s)
- Rotem Sorek
- Department of Human Genetics, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | | |
Collapse
|
1197
|
Lee BTK, Tan TW, Ranganathan S. MGAlignIt: A web service for the alignment of mRNA/EST and genomic sequences. Nucleic Acids Res 2003; 31:3533-6. [PMID: 12824360 PMCID: PMC168968 DOI: 10.1093/nar/gkg561] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Splicing is a biological phenomenon that removes the non-coding sequence from the transcripts to produce a mature transcript suitable for translation. To study this phenomenon, information on the intron-exon arrangement of a gene is essential, usually obtained by aligning mRNA/EST sequences to their cognate genomic sequences. MGAlign is a novel, rapid, memory efficient and practical method for aligning mRNA/EST and genome sequences. We present here a freely available web service, MGAlignIt (http://origin.bic.nus.edu.sg/mgalign/mgalignit), based on MGAlign. Besides the alignment itself, this web service allows users to effectively visualize the alignment in a graphical manner and to perform limited analysis on the alignment output. The server also permits the alignment to be saved in several forms, both graphical and text, suitable for further processing and analysis by other programs.
Collapse
Affiliation(s)
- Bernett T K Lee
- Department of Biochemistry, National University of Singapore, 8 Medical Drive, Singapore 117597, Singapore
| | | | | |
Collapse
|
1198
|
Grillo G, Licciulli F, Liuni S, Sbisà E, Pesole G. PatSearch: A program for the detection of patterns and structural motifs in nucleotide sequences. Nucleic Acids Res 2003; 31:3608-12. [PMID: 12824377 PMCID: PMC168955 DOI: 10.1093/nar/gkg548] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Regulation of gene expression at transcriptional and post-transcriptional level involves the interaction between short DNA or RNA tracts and the corresponding trans-acting protein factors. Detection of such cis-acting elements in genome-wide screenings may significantly contribute to genome annotation and comparative analysis as well as to target functional characterization experiments. We present here PatSearch, a flexible and fast pattern matcher able to search for specific combinations of oligonucleotide consensus sequences, secondary structure elements and position-weight matrices. It can also allow for mismatches/mispairings below a user fixed threshold. We report three different applications of the program in the search of complex patterns such as those of the iron responsive element hairpin-loop structure, the p53 responsive element and a promoter module containing CAAT-, TATA- and cap-boxes. PatSearch is available on the web at http://bighost.area.ba.cnr.it/BIG/PatSearch/.
Collapse
Affiliation(s)
- Giorgio Grillo
- Sezione di Bioinformatica e Genomica di Bari, Istituto Tecnologie Biomediche CNR, via Amendola 168/5, 70125 Bari, Italy
| | | | | | | | | |
Collapse
|
1199
|
Wells CA, Ravasi T, Faulkner GJ, Carninci P, Okazaki Y, Hayashizaki Y, Sweet M, Wainwright BJ, Hume DA. Genetic control of the innate immune response. BMC Immunol 2003; 4:5. [PMID: 12826024 PMCID: PMC194878 DOI: 10.1186/1471-2172-4-5] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2003] [Accepted: 06/26/2003] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Susceptibility to infectious diseases is directed, in part, by the interaction between the invading pathogen and host macrophages. This study examines the influence of genetic background on host-pathogen interactions, by assessing the transcriptional responses of macrophages from five inbred mouse strains to lipopolysaccharide (LPS), a major determinant of responses to gram-negative microorganisms. RESULTS The mouse strains examined varied greatly in the number, amplitude and rate of induction of genes expressed in response to LPS. The response was attenuated in the C3H/HeJlpsd strain, which has a mutation in the LPS receptor Toll-like receptor 4 (TLR4). Variation between mouse strains allowed clustering into early (C57Bl/6J and DBA/2J) and delayed (BALB/c and C3H/ARC) transcriptional phenotypes. There was no clear correlation between gene induction patterns and variation at the Bcg locus (Slc11A1) or propensity to bias Th1 versus Th2 T cell activation responses. CONCLUSION Macrophages from each strain responded to LPS with unique gene expression profiles. The variation apparent between genetic backgrounds provides insights into the breadth of possible inflammatory responses, and paradoxically, this divergence was used to identify a common transcriptional program that responds to TLR4 signalling, irrespective of genetic background. Our data indicates that many additional genetic loci control the nature and the extent of transcriptional responses promoted by a single pathogen-associated molecular pattern (PAMP), such as LPS.
Collapse
Affiliation(s)
- Christine A Wells
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Timothy Ravasi
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Geoffrey J Faulkner
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Piero Carninci
- Laboratory for Genome Exploration Research Group, Genomic Sciences Centre, Riken Yokohama Institute, Yokohama 230-0045
| | - Yasushi Okazaki
- Laboratory for Genome Exploration Research Group, Genomic Sciences Centre, Riken Yokohama Institute, Yokohama 230-0045
| | - Yoshihide Hayashizaki
- Laboratory for Genome Exploration Research Group, Genomic Sciences Centre, Riken Yokohama Institute, Yokohama 230-0045
| | - Matthew Sweet
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Brandon J Wainwright
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - David A Hume
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| |
Collapse
|
1200
|
Wang L, Bieber Urbauer RJ, Urbauer JL, Benson DR. House fly cytochrome b5 exhibits kinetically trapped hemin and selectivity in hemin binding. Biochem Biophys Res Commun 2003; 305:840-5. [PMID: 12767907 DOI: 10.1016/s0006-291x(03)00842-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report that cytochrome b(5) (cyt b(5)) from Musca domestica (house fly) is more thermally stable than all other microsomal (Mc) cytochromes b(5) that have been examined to date. It also exhibits a much higher barrier to equilibration of the two isomeric forms of the protein, which differ by a 180 degrees rotation about the alpha-gamma-meso axis of hemin (ferric heme). In fact, hemin is kinetically trapped in a nearly statistical 1.2:1 ratio of rotational forms in freshly expressed protein. The equilibrium ratio (5.5:1) is established only upon incubation at temperatures above 37 degrees C. House fly Mc cyt b(5) is only the second b-hemoprotein that has been shown to exhibit kinetically trapped hemin at room temperature or above, the first being cyt b(5) from the outer membrane of rat liver mitochondria (rat OM cyt b(5)). Finally, we show that the small excess of one orientational isomer over the other in freshly expressed protein results from selective binding of hemin by the apoprotein, a phenomenon that has not heretofore been established for any apocyt b(5).
Collapse
Affiliation(s)
- Lijun Wang
- Department of Chemistry, University of Kansas, Lawrence, KS 66045, USA
| | | | | | | |
Collapse
|