1
|
Pérez-Umphrey AA, Settlecowski AE, Elbers JP, Williams ST, Jonsson CB, Bonisoli-Alquati A, Snider AM, Taylor SS. Genetic variants associated with hantavirus infection in a reservoir host are related to regulation of inflammation and immune surveillance. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2023; 116:105525. [PMID: 37956745 DOI: 10.1016/j.meegid.2023.105525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/14/2023] [Accepted: 11/10/2023] [Indexed: 11/15/2023]
Abstract
The immunogenetics of wildlife populations influence the epidemiology and evolutionary dynamic of the host-pathogen system. Profiling immune gene diversity present in wildlife may be especially important for those species that, while not at risk of disease or extinction themselves, are host to diseases that are a threat to humans, other wildlife, or livestock. Hantaviruses (genus: Orthohantavirus) are globally distributed zoonotic RNA viruses with pathogenic strains carried by a diverse group of rodent hosts. The marsh rice rat (Oryzomys palustris) is the reservoir host of Orthohantavirus bayoui, a hantavirus that causes fatal cases of hantavirus cardiopulmonary syndrome in humans. We performed a genome wide association study (GWAS) using the rice rat "immunome" (i.e., all exons related to the immune response) to identify genetic variants associated with infection status in wild-caught rice rats naturally infected with their endemic strain of hantavirus. First, we created an annotated reference genome using 10× Chromium Linked Reads sequencing technology. This reference genome was used to create custom baits which were then used to target enrich prepared rice rat libraries (n = 128) and isolate their immunomes prior to sequencing. Top SNPs in the association test were present in four genes (Socs5, Eprs, Mrc1, and Il1f8) which have not been previously implicated in hantavirus infections. However, these genes correspond with other loci or pathways with established importance in hantavirus susceptibility or infection tolerance in reservoir hosts: the JAK/STAT, MHC, and NFκB. These results serve as informative markers for future exploration and highlight the importance of immune pathways that repeatedly emerge across hantavirus systems. Our work aids in creating cross-species comparisons for better understanding mechanisms of genetic susceptibility and host-pathogen coevolution in hantavirus systems.
Collapse
Affiliation(s)
- Anna A Pérez-Umphrey
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA.
| | - Amie E Settlecowski
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA
| | - Jean P Elbers
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA; Institute of Medical Genetics, Center for Pathobiochemistry and Genetics, Medical University of Vienna, Währinger Straße 10, 1090 Vienna, Austria
| | - S Tyler Williams
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA
| | - Colleen B Jonsson
- Department of Microbiology, Immunology and Biochemistry, College of Medicine, University of Tennessee Health Science Center, University of Tennessee, 858 Madison Ave., Memphis, TN 38163, USA
| | - Andrea Bonisoli-Alquati
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA; Department of Biological Sciences, California State Polytechnic University-Pomona, Pomona, CA 91768, USA
| | - Allison M Snider
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA
| | - Sabrina S Taylor
- School of Renewable Natural Resources, Louisiana State University and AgCenter, 227 RNR Building, Baton Rouge, LA 70803, USA
| |
Collapse
|
2
|
Acri DJ, You Y, Tate MD, Karahan H, Martinez P, McCord B, Sharify AD, John S, Kim B, Dabin LC, Philtjens S, Wijeratne HS, McCray TJ, Smith DC, Bissel SJ, Lamb BT, Lasagna-Reeves CA, Kim J. Network analysis identifies strain-dependent response to tau and tau seeding-associated genes. J Exp Med 2023; 220:e20230180. [PMID: 37606887 PMCID: PMC10443211 DOI: 10.1084/jem.20230180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 06/05/2023] [Accepted: 07/27/2023] [Indexed: 08/23/2023] Open
Abstract
Previous research demonstrated that genetic heterogeneity is a critical factor in modeling amyloid accumulation and other Alzheimer's disease phenotypes. However, it is unknown what mechanisms underlie these effects of genetic background on modeling tau aggregate-driven pathogenicity. In this study, we induced tau aggregation in wild-derived mice by expressing MAPT. To investigate the effect of genetic background on the action of tau aggregates, we performed RNA sequencing with brains of C57BL/6J, CAST/EiJ, PWK/PhJ, and WSB/EiJ mice (n = 64) and determined core transcriptional signature conserved in all genetic backgrounds and signature unique to wild-derived backgrounds. By measuring tau seeding activity using the cortex, we identified 19 key genes associated with tau seeding and amyloid response. Interestingly, microglial pathways were strongly associated with tau seeding activity in CAST/EiJ and PWK/PhJ backgrounds. Collectively, our study demonstrates that mouse genetic context affects tau-mediated alteration of transcriptome and tau seeding. The gene modules associated with tau seeding provide an important resource to better model tauopathy.
Collapse
Affiliation(s)
- Dominic J. Acri
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Medical Neuroscience Graduate Program, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Yanwen You
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Anatomy, Cell Biology and Physiology, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Mason D. Tate
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Medical Neuroscience Graduate Program, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Hande Karahan
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Pablo Martinez
- Department of Anatomy, Cell Biology and Physiology, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Brianne McCord
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - A. Daniel Sharify
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Sutha John
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Byungwook Kim
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Luke C. Dabin
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Stéphanie Philtjens
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - H.R. Sagara Wijeratne
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Biochemistry and Molecular Biology, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Tyler J. McCray
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Medical Neuroscience Graduate Program, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Daniel C. Smith
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Medical Neuroscience Graduate Program, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Stephanie J. Bissel
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Bruce T. Lamb
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Cristian A. Lasagna-Reeves
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Anatomy, Cell Biology and Physiology, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Center for Computational Biology and Bioinformatics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| | - Jungsu Kim
- Stark Neurosciences Research Institute, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
- Department of Medical and Molecular Genetics, Indiana UniversitySchool of Medicine, Indianapolis, IN, USA
| |
Collapse
|
3
|
Deng X, Yang F, Zhang L, Wang J, Liu B, Liang W, Tang J, Xie Y, He L. ECO: An Integrated Gene Expression Omnibus for Mouse Endothelial Cells In Vivo. Front Genet 2022; 13:844544. [PMID: 35309132 PMCID: PMC8931405 DOI: 10.3389/fgene.2022.844544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Accepted: 02/15/2022] [Indexed: 11/30/2022] Open
Abstract
Endothelial cell (EC) plays critical roles in vascular physiological and pathological processes. With the development of high-throughput technologies, transcriptomics analysis of EC has increased dramatically and a large amount of informative data have been generated. The dynamic patterns of gene expression in ECs under various conditions were revealed. Unfortunately, due to the lack of bioinformatics infrastructures, reuse of these large-scale datasets is challenging for many scientists. Here, by systematic re-analyzing, integrating, and standardizing of 203 RNA sequencing samples from freshly isolated mouse ECs under 71 conditions, we constructed an integrated mouse EC gene expression omnibus (ECO). The ECO database enables one-click retrieval of endothelial expression profiles from different organs under different conditions including disease models, genetic modifications, and clinically relevant treatments in vivo. The EC expression profiles are visualized with user-friendly bar-plots. It also provides a convenient search tool for co-expressed genes. ECO facilitates endothelial research with an integrated tool and resource for transcriptome analysis. The ECO database is freely available at https://heomics.shinyapps.io/ecodb/.
Collapse
Affiliation(s)
- Xiangyi Deng
- Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin Neurological Institute, Key Laboratory of Post-Neuroinjury Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education and Tianjin City, Tianjin, China
| | - Fan Yang
- Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin Neurological Institute, Key Laboratory of Post-Neuroinjury Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education and Tianjin City, Tianjin, China
| | - Lei Zhang
- Key Laboratory of Ministry of Education for Medicinal Plant Resource and Natural Pharmaceutical Chemistry, National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest of China, College of Life Sciences, Shaanxi Normal University, Xi’an, China
- Precision Medicine Center, the Second People’s Hospital of Huaihua, Huaihua, China
| | - Jianhao Wang
- Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin Neurological Institute, Key Laboratory of Post-Neuroinjury Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education and Tianjin City, Tianjin, China
| | - Boxuan Liu
- Precision Medicine Center, the Second People’s Hospital of Huaihua, Huaihua, China
| | - Wei Liang
- Precision Medicine Center, the Second People’s Hospital of Huaihua, Huaihua, China
| | - Jiefu Tang
- Trauma Center, First Affiliated Hospital of Hunan University of Medicine, Huaihua, China
| | - Yuan Xie
- Key Laboratory of Ministry of Education for Medicinal Plant Resource and Natural Pharmaceutical Chemistry, National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest of China, College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Liqun He
- Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin Neurological Institute, Key Laboratory of Post-Neuroinjury Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education and Tianjin City, Tianjin, China
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- *Correspondence: Liqun He,
| |
Collapse
|
4
|
Abstract
The science of genomes: only within the past few decades have scientists progressed from the analysis of a single or a small number of genes at once to the investigation of thousands of genes, going from the study of the units of inheritance to the investigation of the whole genome of an organism. The science of the genomes, or "genomics," initially dedicated to the determination of DNA sequences (the nucleotide order on a given fragment of DNA), has promptly expanded toward a more functional level--studying the expression profiles and the roles of both genes and proteins. The aim of the chapter is to review some basic assumptions and definitions that are the fabric of genomics, and to elucidate key concepts and approaches on which genomics rely.
Collapse
Affiliation(s)
- Luca Del Giacco
- Division of Functional and Reproductive Biology, Department of Biology, University of Milan, Milan-MI, Italy.
| | | |
Collapse
|
5
|
Doyle CE, Donaldson ME, Morrison EN, Saville BJ. Ustilago maydis transcript features identified through full-length cDNA analysis. Mol Genet Genomics 2011; 286:143-59. [PMID: 21750919 DOI: 10.1007/s00438-011-0634-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2011] [Accepted: 06/28/2011] [Indexed: 12/13/2022]
Abstract
Ustilago maydis is the model for investigating basidiomycete biotrophic plant pathogens. To further the annotation of its genome, 12,943 full-length cDNA sequences were used to construct databases for the promoter and untranslated regions of U. maydis genes. A subset of clones was sequenced to determine full cDNA sequences. These and the original ESTs were assembled into contigs representing 3,058, or 45%, of the predicted U. maydis genes. The new sequencing allowed the confirmation of 2,842 gene models, 690 of which contain an intron. The use of full-length cDNA clone sequences ensured that untranslated regions were physically linked to the open reading frames (ORFs), not merely aligned upstream of the start of transcription. Identified sequence features include: (1) over 500 potential short upstream ORFs, (2) 95 gene models that require further annotation, (3) one new potential ORF, (4) varying GC content in different gene regions, (5) a WebLogo motif for the start of translation, (6) the correlation of UTR length with transcript representation in cDNA libraries and with gene function categories, (7) a relationship between natural antisense transcripts and UTR length that differs from that of Saccharomyces cerevisiae, (8) a potential relationship between DNA replication and the control of transcription, and (9) new insights regarding mechanisms for the control of transcription and mRNA maturation in U. maydis.
Collapse
Affiliation(s)
- Colleen E Doyle
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, ON K9J 7B8, Canada
| | | | | | | |
Collapse
|
6
|
Begley DA, Krupke DM, Neuhauser SB, Richardson JE, Bult CJ, Eppig JT, Sundberg JP. The Mouse Tumor Biology Database (MTB): a central electronic resource for locating and integrating mouse tumor pathology data. Vet Pathol 2011; 49:218-23. [PMID: 21282667 DOI: 10.1177/0300985810395726] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The Mouse Tumor Biology Database (MTB) is designed to provide an electronic data storage, search, and analysis system for information on mouse models of human cancer. The MTB includes data on tumor frequency and latency, strain, germ line, and somatic genetics, pathologic notations, and photomicrographs. The MTB collects data from the primary literature, other public databases, and direct submissions from the scientific community. The MTB is a community resource that provides integrated access to mouse tumor data from different scientific research areas and facilitates integration of molecular, genetic, and pathologic data. Current status of MTB, search capabilities, data types, and future enhancements are described in this article.
Collapse
Affiliation(s)
- D A Begley
- The Jackson Laboratory, 600 Main St, Bar Harbor, ME 04609-1500, USA
| | | | | | | | | | | | | |
Collapse
|
7
|
Identification and characterization of full-length cDNAs in channel catfish (Ictalurus punctatus) and blue catfish (Ictalurus furcatus). PLoS One 2010; 5:e11546. [PMID: 20634964 PMCID: PMC2902525 DOI: 10.1371/journal.pone.0011546] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2010] [Accepted: 06/14/2010] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Genome annotation projects, gene functional studies, and phylogenetic analyses for a given organism all greatly benefit from access to a validated full-length cDNA resource. While increasingly common in model species, full-length cDNA resources in aquaculture species are scarce. METHODOLOGY AND PRINCIPAL FINDINGS Through in silico analysis of catfish (Ictalurus spp.) ESTs, a total of 10,037 channel catfish and 7,382 blue catfish cDNA clones were identified as potentially encoding full-length cDNAs. Of this set, a total of 1,169 channel catfish and 933 blue catfish full-length cDNA clones were selected for re-sequencing to provide additional coverage and ensure sequence accuracy. A total of 1,745 unique gene transcripts were identified from the full-length cDNA set, including 1,064 gene transcripts from channel catfish and 681 gene transcripts from blue catfish, with 416 transcripts shared between the two closely related species. Full-length sequence characteristics (ortholog conservation, UTR length, Kozak sequence, and conserved motifs) of the channel and blue catfish were examined in detail. Comparison of gene ontology composition between full-length cDNAs and all catfish ESTs revealed that the full-length cDNA set is representative of the gene diversity encoded in the catfish transcriptome. CONCLUSIONS This study describes the first catfish full-length cDNA set constructed from several cDNA libraries. The catfish full-length cDNA sequences, and data gleaned from sequence characteristics analysis, will be a valuable resource for ongoing catfish whole-genome sequencing and future gene-based studies of function and evolution in teleost fishes.
Collapse
|
8
|
Abstract
Recent progress in the analyses of the mouse transcriptome leads to unexpected discoveries. The mouse genomic sequences read by RNA polymerase II may be six times more than previously expected for human chromosomes. The transcript-abundant regions (named "transcription forests") occupy more than half of the genomic sequence and are divided by transcript-scarce regions (transcription deserts). Many of the coding mRNAs may have partially overlapping antisense RNAs. There are transcripts bridging several adjacent genes that were previously regarded as distinct ones. The transcription start sites appearing as cap analysis of gene expression (CAGE) tags are mapped on the mouse genomic sequences. Distributions of CAGE tags show that the shapes of mammalian gene promoters can be classified into four major categories. These shapes were conserved between mouse and human. Most of the gene has exonic transcription start sites, especially in the 3' untranslated region (3' UTR) sequences. The term "RNA continent" has been invented to express this unexpectedly complex and prodigious mouse transcriptome. More than a half of the RNA polymerase II transcripts are regarded as noncoding RNAs (ncRNAs). The great variety of ncRNAs in mammalian transcriptome implies that there are many functional ncRNAs in the cells. Especially, the evolutionarily conserved microRNAs play critical roles in mammalian development and other biological functions. Moreover, many other ncRNAs have also been shown to have biological significant functions, mainly in the regulation of gene expression. The functional survey of the RNA continent has just started. We will describe the state of the art of the RNA continent and its impact on the modern molecular biology, especially on the cancer research.
Collapse
Affiliation(s)
- Jun Yasuda
- Functional RNA Research Program, Frontier Research System, RIKEN Yokohama Institute, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | | |
Collapse
|
9
|
Abstract
In recent years, genome-wide detection of alternative splicing based on Expressed Sequence Tag (EST) sequence alignments with mRNA and genomic sequences has dramatically expanded our understanding of the role of alternative splicing in functional regulation. This chapter reviews the data, methodology, and technical challenges of these genome-wide analyses of alternative splicing, and briefly surveys some of the uses to which such alternative splicing databases have been put. For example, with proper alternative splicing database schema design, it is possible to query genome-wide for alternative splicing patterns that are specific to particular tissues, disease states (e.g., cancer), gender, or developmental stages. EST alignments can be used to estimate exon inclusion or exclusion level of alternatively spliced exons and evolutionary changes for various species can be inferred from exon inclusion level. Such databases can also help automate design of probes for RT-PCR and microarrays, enabling high throughput experimental measurement of alternative splicing.
Collapse
|
10
|
Mazzarelli JM, White P, Gorski R, Brestelli J, Pinney DF, Arsenlis A, Katokhin A, Belova O, Bogdanova V, Elisafenko E, Gubina M, Nizolenko L, Perelman P, Puzakov M, Shilov A, Trifonoff V, Vorobjeva N, Kolchanov N, Kaestner KH, Stoeckert CJ. Novel genes identified by manual annotation and microarray expression analysis in the pancreas. Genomics 2006; 88:752-761. [PMID: 16725306 DOI: 10.1016/j.ygeno.2006.04.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2006] [Accepted: 04/14/2006] [Indexed: 10/24/2022]
Abstract
The mouse PancChip, a microarray developed for studying endocrine pancreatic development and diabetes, represents over 13,000 cDNAs. After computationally assigning the cDNAs on the array to known genes, manual curation of the remaining sequences identified 211 novel transcripts. In microarray experiments, we found that 196 of these transcripts were expressed in total pancreas and/or pancreatic islets. Of 50 randomly selected clones from these 196 transcripts, 92% were confirmed as expressed by qRT-PCR. We evaluated the coding potential of the novel transcripts and found that 74% of the clones had low coding potential. Since the transcripts may be partial mRNAs, we examined their translated proteins for transmembrane or signal peptide domains and found that about 40 proteins had one of these predicted domains. Interestingly, when we investigated the novel transcripts for their overlap with noncoding microRNAs, we found that 1 of the novel transcripts overlapped a known microRNA gene.
Collapse
Affiliation(s)
- Joan M Mazzarelli
- Center for Bioinformatics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Peter White
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Regina Gorski
- Center for Bioinformatics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - John Brestelli
- Center for Bioinformatics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Deborah F Pinney
- Center for Bioinformatics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Athanasios Arsenlis
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Alexey Katokhin
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Olga Belova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Vera Bogdanova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | | | - Marina Gubina
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Lilia Nizolenko
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Polina Perelman
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Mikhail Puzakov
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | | | | | | | | | - Klaus H Kaestner
- Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Christian J Stoeckert
- Center for Bioinformatics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
11
|
Abstract
The house mouse has been used as a privileged model organism since the early days of genetics, and the numerous experiments made with this small mammal have regularly contributed to enrich our knowledge of mammalian biology and pathology, ranging from embryonic development to metabolic disease, histocompatibility, immunology, behavior, and cancer. Over the past two decades, a number of large-scale integrated and concerted projects have been undertaken that will probably open a new era in the genetics of the species. The sequencing of the genome, which will allow researchers to make comparisons with other mammals and identify regions conserved by evolution, is probably the most important project, but many other initiatives, such as the massive production of point or chromosomal mutations associated with comprehensive and standardized phenotyping of the mutant phenotypes, will help annotation of the approximately 25,000 genes packed in the mouse genome. In the same way, and as another consequence of the sequencing, the discovery of many single nucleotide polymorphisms and the development of new tools and resources, like the Collaborative Cross, will contribute to the development of modern quantitative genetics. It is clear that mouse genetics has changed dramatically over the last 10-15 years and its future looks promising.
Collapse
Affiliation(s)
- Jean Louis Guénet
- Département de Biologie du Développement, Institut Pasteur, 75724 Paris Cedex 15, France.
| |
Collapse
|
12
|
Win J, Kanneganti TD, Torto-Alalibo T, Kamoun S. Computational and comparative analyses of 150 full-length cDNA sequences from the oomycete plant pathogen Phytophthora infestans. Fungal Genet Biol 2006; 43:20-33. [PMID: 16380277 DOI: 10.1016/j.fgb.2005.10.003] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2005] [Revised: 10/05/2005] [Accepted: 10/05/2005] [Indexed: 11/16/2022]
Abstract
Phytophthora infestans is a devastating phytopathogenic oomycete that causes late blight on tomato and potato. Recent genome sequencing efforts of P. infestans and other Phytophthora species are generating vast amounts of sequence data providing opportunities to unlock the complex nature of pathogenesis. However, accurate annotation of Phytophthora genomes will be a significant challenge. Most of the information about gene structure in these species was gathered from a handful of genes resulting in significant limitations for development of ab initio gene-calling programs. In this study, we collected a total of 150 bioinformatically determined near full-length cDNA (FLcDNA) sequences of P. infestans that were predicted to contain full open reading frame sequences. We performed detailed computational analyses of these FLcDNA sequences to obtain a snapshot of P. infestans gene structure, gauge the degree of sequence conservation between P. infestans genes and those of Phytophthora sojae and Phytophthora ramorum, and identify patterns of gene conservation between P. infestans and various eukaryotes, particularly fungi, for which genome-wide translated protein sequences are available. These analyses helped us to define the structural characteristics of P. infestans genes using a validated data set. We also determined the degree of sequence conservation within the genus Phytophthora and identified a set of fast evolving genes. Finally, we identified a set of genes that are shared between Phytophthora and fungal phytopathogens but absent in animal fungal pathogens. These results confirm that plant pathogenic oomycetes and fungi share virulence components, and suggest that eukaryotic microbial pathogens that share similar lifestyles also share a similar set of genes independently of their phylogenetic relatedness.
Collapse
Affiliation(s)
- Joe Win
- Department of Plant Pathology, The Ohio State University, Ohio Agricultural Research and Development Center, Wooster, OH 44691, USA
| | | | | | | |
Collapse
|
13
|
Harhay GP, Sonstegard TS, Keele JW, Heaton MP, Clawson ML, Snelling WM, Wiedmann RT, Van Tassell CP, Smith TPL. Characterization of 954 bovine full-CDS cDNA sequences. BMC Genomics 2005; 6:166. [PMID: 16305752 PMCID: PMC1314900 DOI: 10.1186/1471-2164-6-166] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2005] [Accepted: 11/23/2005] [Indexed: 11/10/2022] Open
Abstract
Background Genome assemblies rely on the existence of transcript sequence to stitch together contigs, verify assembly of whole genome shotgun reads, and annotate genes. Functional genomics studies also rely on transcript sequence to create expression microarrays or interpret digital tag data produced by methods such as Serial Analysis of Gene Expression (SAGE). Transcript sequence can be predicted based on reconstruction from overlapping expressed sequence tags (EST) that are obtained by single-pass sequencing of random cDNA clones, but these reconstructions are prone to errors caused by alternative splice forms, transcripts from gene families with related sequences, and expressed pseudogenes. These errors confound genome assembly and annotation. The most useful transcript sequences are derived by complete insert sequencing of clones containing the entire length, or at least the full protein coding sequence (CDS) portion, of the source mRNA. While the bovine genome sequencing initiative is nearing completion, there is currently a paucity of bovine full-CDS mRNA and protein sequence data to support bovine genome assembly and functional genomics studies. Consequently, the production of high-quality bovine full-CDS cDNA sequences will enhance the bovine genome assembly and functional studies of bovine genes and gene products. The goal of this investigation was to identify and characterize the full-CDS sequences of bovine transcripts from clones identified in non-full-length enriched cDNA libraries. In contrast to several recent full-length cDNA investigations, these full-CDS cDNAs were selected, sequenced, and annotated without the benefit of the target organism's genomic sequence, by using comparison of bovine EST sequence to existing human mRNA to identify likely full-CDS clones for full-length insert cDNA (FLIC) sequencing. Results The predicted bovine protein lengths, 5' UTR lengths, and Kozak consensus sequences from 954 bovine FLIC sequences (bFLICs; average length 1713 nt, representing 762 distinct loci) are all consistent with previously sequenced mammalian full-length transcripts. Conclusion In most cases, the bFLICs span the entire CDS of the genes, providing the basis for creating predicted bovine protein sequences to support proteomics and comparative evolutionary research as well as functional genomics and genome annotation. The results demonstrate the utility of the comparative approach in obtaining predicted protein sequences in other species.
Collapse
Affiliation(s)
- Gregory P Harhay
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | | | - John W Keele
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | - Michael P Heaton
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | - Michael L Clawson
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | - Warren M Snelling
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | - Ralph T Wiedmann
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| | | | - Timothy PL Smith
- USDA-ARS-U.S. Meat Animal Research Center, Clay Center, NE 68901, USA
| |
Collapse
|
14
|
Wistow G. The NEIBank project for ocular genomics: data-mining gene expression in human and rodent eye tissues. Prog Retin Eye Res 2005; 25:43-77. [PMID: 16005676 DOI: 10.1016/j.preteyeres.2005.05.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
NEIBank is a project to gather and organize genomic resources for eye research. The first phase of this project covers the construction and sequence analysis of cDNA libraries from human and animal model eye tissues to develop an overview of the repertoire of genes expressed in the eye and a resource of cDNA clones for further studies. The sequence data are grouped and identified using the tools of bioinformatics and the results are displayed through a web site where they can be interrogated by keyword search, chromosome location, by Blast (sequence comparison) or by alignment on completed genomes. Many novel proteins and novel splice forms of known genes have already emerged from analysis of the accumulating data. This review provides an overview of the current state of the database for human eye tissues, with specific comparisons to some parallel data from mouse and rat, and with illustrative examples of the kinds of insights and discoveries these data can produce. One of the major themes that emerges is that at the molecular level human eye tissues have significant differences from those of rodents, encompassing species specific genes, alternative splice forms and great variation in levels of gene expression. These point to specific adaptations and mechanisms in the human eye and emphasize that care needs to be taken in the application of appropriate animal model systems.
Collapse
Affiliation(s)
- Graeme Wistow
- Section on Molecular Structure and Functional Genomics, National Eye Institute, National Institutes of Health, Building 7, Room 201, Bethesda, MD 20892-0703, USA.
| |
Collapse
|
15
|
Nagoshi E, Brown SA, Dibner C, Kornmann B, Schibler U. Circadian Gene Expression in Cultured Cells. Methods Enzymol 2005; 393:543-57. [PMID: 15817311 DOI: 10.1016/s0076-6879(05)93028-0] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
In mammals, circadian oscillators not only exist in specialized neurons of the suprachiasmatic nucleus, but in almost all peripheral cell types. These oscillators are operative even in established fibroblast cell lines, such as Rat-1 cells or NIH3T3 cells, and in primary fibroblasts from mouse embryos or adult animals. This can be demonstrated by treating such cells for a short time period with high concentrations of serum or chemicals that activate a large number of known signaling pathways. The possibility of studying circadian rhythms in cultured cells should facilitate the biochemical and genetic dissection of the circadian clockwork and should promote the discovery of new clock components.
Collapse
Affiliation(s)
- Emi Nagoshi
- Department of Molecular Biology, Sciences III, University of Geneva, CH-1211 Geneva-4, Switzerlan
| | | | | | | | | |
Collapse
|
16
|
Auffray C, Chen Z, Hood L, Soares B, Sugano S. Foreword: from the TRANSCRIPTOME conferences to the SYSTEMOSCOPE international consortium. C R Biol 2004; 326:867-75. [PMID: 14744093 DOI: 10.1016/j.crvi.2003.09.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This thematic issue issue of the Comptes rendus Biologies contains review articles, original papers and conference reports presented at the first two TRANSCRIPTOME conferences From Functional Genomics to Systems Biology and IMAGE Consortium Invitational workshops (Paris, November 2000 and Seattle, March 2002), and discussed during the inaugural meetings of the SYSTEMOSCOPE International Consortium (Paris, June 2003). We describe the founding principles, missions, working plan and policy for partnership and industrial development of SYSTEMOSCOPE to promote the study of the complexity of biological systems by integrating scientific, medical, ethical and economic issues in implementation of interdisciplinary projects for human health.
Collapse
Affiliation(s)
- Charles Auffray
- Genexpress, Functional Genomics and Systemic Biology for Health, CNRS FRE 2571, 7, rue Guy-Môquet, BP 8, 94801 Villejuif cedex, France.
| | | | | | | | | |
Collapse
|