26
|
Wier AM, Nyholm SV, Mandel MJ, Massengo-Tiassé RP, Schaefer AL, Koroleva I, Splinter-BonDurant S, Brown B, Manzella L, Snir E, Almabrazi H, Scheetz TE, de Fatima Bonaldo M, Casavant TL, Soares MB, Cronan JE, Reed JL, Ruby EG, McFall-Ngai MJ. Transcriptional patterns in both host and bacterium underlie a daily rhythm of anatomical and metabolic change in a beneficial symbiosis. Proc Natl Acad Sci U S A 2010; 107:2259-64. [PMID: 20133870 PMCID: PMC2836665 DOI: 10.1073/pnas.0909712107] [Citation(s) in RCA: 131] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Mechanisms for controlling symbiont populations are critical for maintaining the associations that exist between a host and its microbial partners. We describe here the transcriptional, metabolic, and ultrastructural characteristics of a diel rhythm that occurs in the symbiosis between the squid Euprymna scolopes and the luminous bacterium Vibrio fischeri. The rhythm is driven by the host's expulsion from its light-emitting organ of most of the symbiont population each day at dawn. The transcriptomes of both the host epithelium that supports the symbionts and the symbiont population itself were characterized and compared at four times over this daily cycle. The greatest fluctuation in gene expression of both partners occurred as the day began. Most notable was an up-regulation in the host of >50 cytoskeleton-related genes just before dawn and their subsequent down-regulation within 6 h. Examination of the epithelium by TEM revealed a corresponding restructuring, characterized by effacement and blebbing of its apical surface. After the dawn expulsion, the epithelium reestablished its polarity, and the residual symbionts began growing, repopulating the light organ. Analysis of the symbiont transcriptome suggested that the bacteria respond to the effacement by up-regulating genes associated with anaerobic respiration of glycerol; supporting this finding, lipid analysis of the symbionts' membranes indicated a direct incorporation of host-derived fatty acids. After 12 h, the metabolic signature of the symbiont population shifted to one characteristic of chitin fermentation, which continued until the following dawn. Thus, the persistent maintenance of the squid-vibrio symbiosis is tied to a dynamic diel rhythm that involves both partners.
Collapse
|
27
|
Keen HL, Halabi CM, Beyer AM, de Lange WJ, Liu X, Maeda N, Faraci FM, Casavant TL, Sigmund CD. Bioinformatic analysis of gene sets regulated by ligand-activated and dominant-negative peroxisome proliferator-activated receptor gamma in mouse aorta. Arterioscler Thromb Vasc Biol 2009; 30:518-25. [PMID: 20018933 DOI: 10.1161/atvbaha.109.200733] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
OBJECTIVE Drugs that activate peroxisome proliferator-activated receptor (PPAR) gamma improve glucose sensitivity and lower blood pressure, whereas dominant-negative mutations in PPARgamma cause severe insulin resistance and hypertension. We hypothesize that these PPARgamma mutants regulate target genes opposite to those of ligand-mediated activation, and we tested this hypothesis on a genomewide scale. METHODS AND RESULTS We integrated gene expression data in aorta specimens from mice treated with the PPARgamma ligand rosiglitazone with data from mice containing a globally expressed knockin of the PPARgamma P465L dominant-negative mutation. We also integrated our data with publicly available data sets containing the following: (1) gene expression profiles in many human tissues, (2) PPARgamma target genes in 3T3-L1 adipocytes, and (3) experimentally validated PPARgamma binding sites throughout the genome. Many classic PPARgamma target genes were induced by rosiglitazone and repressed by dominant-negative PPARgamma. A similar pattern was observed for about 90% of the gene sets regulated by both rosiglitazone and dominant-negative PPARgamma. Genes exhibiting this pattern of contrasting regulation were significantly enriched for nearby PPARgamma binding sites. CONCLUSIONS These results provide convincing evidence that the PPARgamma P465L mutation causes transcriptional effects that are opposite to those mediated by PPARgamma ligand, thus validating mice carrying the mutation as a model of PPARgamma interference.
Collapse
|
28
|
Hildebrand MS, DeLuca AP, Taylor KR, Hoskinson DP, Hur IA, Tack D, McMordie SJ, Huygen PLM, Casavant TL, Smith RJH. A contemporary review of AudioGene audioprofiling: a machine-based candidate gene prediction tool for autosomal dominant nonsyndromic hearing loss. Laryngoscope 2009; 119:2211-5. [PMID: 19780026 DOI: 10.1002/lary.20664] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
29
|
Hildebrand MS, Tack D, Deluca A, Hur IA, Van Rybroek JM, McMordie SJ, Muilenburg A, Hoskinson DP, Van Camp G, Pensak ML, Storper IS, Huygen PLM, Casavant TL, Smith RJH. Mutation in the COCH gene is associated with superior semicircular canal dehiscence. Am J Med Genet A 2009; 149A:280-5. [PMID: 19161137 DOI: 10.1002/ajmg.a.32618] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
30
|
Kalari KR, Casavant TL, Scheetz TE. A knowledge-based approach to predict intragenic deletions or duplications. ACTA ACUST UNITED AC 2008; 24:1975-9. [PMID: 18647756 DOI: 10.1093/bioinformatics/btn370] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
MOTIVATION Despite recent improvements in high-throughput or classic molecular biology approaches it is still challenging to identify intermediate resolution genomic variations (50 bp to 50 kb). Although array-based technologies can be used to detect copy number variations in the human genome they are biased to detect only the largest such deletions or duplications. Several studies have identified deletions or duplications occurring within a gene that directly cause or predispose to disease. We have developed a novel computational system, SPeeDD (system to prioritize deletions or duplications) that utilizes machine learning techniques to predict likely candidate regions that delete or duplicate exon(s) within a gene. RESULTS Data mining and machine learning methods were applied to identify sequence features that were predictive of homologous recombination events. The logistic model tree (LMT) method yielded the best results. Sensitivity varied from 20% to 71.6% depending on the specific machine learning model used, but specificity exceeded 90% for all methods evaluated. In addition, the SPeeDD system successfully predicted and prioritized a recently published novel BRCA1 mutation. CONCLUSIONS Results suggest that the SPeeDD system is effective at prioritizing candidate deletions and duplications within a gene. Use of SPeeDD enables more focused screening, which reduces the labor and associated costs of the molecular assays and may also lead to targeted design of new array-based screens to focus on candidate areas to accelerate the process of mutation discovery.
Collapse
|
31
|
O'Leary BM, Davis SG, Smith MF, Brown B, Kemp MB, Almabrazi H, Grundstad JA, Burns T, Leontiev V, Andorf J, Clark AF, Sheffield VC, Casavant TL, Scheetz TE, Stone EM, Braun TA. Transcript annotation prioritization and screening system (TrAPSS) for mutation screening. J Bioinform Comput Biol 2008; 5:1155-72. [PMID: 18172923 DOI: 10.1142/s0219720007003132] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2007] [Revised: 07/31/2007] [Accepted: 08/13/2007] [Indexed: 11/18/2022]
Abstract
When searching for disease-causing mutations with polymerase chain reaction (PCR)-based methods, candidate genes are usually screened in their entirety, exon by exon. Genomic resources (i.e. www.ncbi.nih.gov, www.ensembl.org, and genome.ucsc.edu) largely support this paradigm for mutation screening by making it easy to view and access sequence data associated with genes in their genomic context. However, the administrative burden of conducting mutation screening in potentially hundreds of genes and thousands of exons in thousands of patients is significant, even with the use of public genome resources. For example, the manual design of oligonucleotide primers for all exons of the 10 Leber's congenital amaurosis (LCA) genes (149 exons) represents a significant information management challenge. The Transcript Annotation Prioritization and Screening System (TrAPSS) is designed to accelerate mutation screening by (1) providing a gene-based local cache of candidate disease genes in a genomic context, (2) automating tasks associated with optimizing candidate disease gene screening and information management, and (3) providing the implementation of an algorithmic technique to utilize large amounts of heterogeneous genome annotation (e.g. conserved protein functional domains) so as to prioritize candidate genes.
Collapse
|
32
|
Thomas G, Grassi MA, Lee JR, Edwards AO, Gorin MB, Klein R, Casavant TL, Scheetz TE, Stone EM, Williams AB. IDOCS: Intelligent Distributed Ontology Consensus System—The Use of Machine Learning in Retinal Drusen Phenotyping. ACTA ACUST UNITED AC 2007; 48:2278-84. [PMID: 17460291 DOI: 10.1167/iovs.06-1022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
PURPOSE To use the power of knowledge acquisition and machine learning in the development of a collaborative computer classification system based on the features of age-related macular degeneration (AMD). METHODS A vocabulary was acquired from four AMD experts who examined 100 ophthalmoscopic images. The vocabulary was analyzed, hierarchically structured, and incorporated into a collaborative computer classification system called IDOCS. Using this system, three of the experts examined images from a second set of digital images compiled from more than 1000 patients with AMD. Images were annotated, and features were identified and defined. Decision trees, a machine learning method, were trained on the data collected and used to extract patterns. Interrelationships between the data from the different clinicians were investigated. RESULTS Six drusen classes in the structured vocabulary were largely sufficient to describe all the identified features. The decision trees classified the data with 76.86% to 88.5% accuracy and distilled patterns in the form of hierarchical trees composed of 5 to 15 nodes. Experts were largely consistent in their characterization of soft, and to a lesser extent, hard drusen, but diverge in definition of other drusen. Size and crystalline morphology were the main determinants of drusen type across all experts. CONCLUSIONS Machine learning is a powerful tool for the characterization of disease phenotypes. The creation of a defined feature set for AMD will facilitate the development of an IDOCS-based classification system.
Collapse
|
33
|
Ben-Shahar Y, Nannapaneni K, Casavant TL, Scheetz TE, Welsh MJ. Eukaryotic operon-like transcription of functionally related genes in Drosophila. Proc Natl Acad Sci U S A 2006; 104:222-7. [PMID: 17190802 PMCID: PMC1749324 DOI: 10.1073/pnas.0609683104] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Complex biological processes require coordinated function of many genes. One evolutionary solution to the problem of coordinately expressing functionally related genes in bacteria and nematodes is organization of genes in operons. Surprisingly, eukaryotic operons are considered rare outside the nematode lineage. In Drosophila melanogaster, we found lounge lizard (llz), which encodes a degenerin/ENaC cation channel, cotranscribed with CheB42a, a nonhomologous gene of unknown function residing <100 bp upstream. These two genes were transcribed from a single promoter as one primary transcript and were processed posttranscriptionally to generate individual mRNAs. The mechanism did not involve alternative splicing, and it differed from the trans splicing used in nematode operons. Both genes were expressed in the same tissues, and previous work suggested that both may be involved in courtship behavior. A bioinformatic approach identified numerous additional loci as potential Drosophila operons. These data reveal eukaryotic operon-like transcription of functionally related genes in Drosophila. The results also suggest that operon-based transcription may be more common in eukaryotes than previously appreciated.
Collapse
|
34
|
Kang Y, Moressi CJ, Scheetz TE, Xie L, Tran DT, Casavant TL, Ak P, Benham CJ, Davidson BL, McCray PB. Integration site choice of a feline immunodeficiency virus vector. J Virol 2006; 80:8820-3. [PMID: 16912328 PMCID: PMC1563849 DOI: 10.1128/jvi.00719-06] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We mapped 226 unique integration sites in human hepatoma cells following gene transfer with a feline immunodeficiency virus (FIV)-based lentivirus vector. FIV integrated across the entire length of the transcriptional units. Microarray data indicated that FIV integration favored actively transcribed genes. Approximately 21% of FIV integrations within transcriptional units occurred in genes regulated by the LEDGF/p75 transcriptional coactivator. DNA in regions of FIV insertion sites exhibited a "bendable" structure and a pattern of duplex destabilization favoring strand separation. FIV integration preferences are more similar to those of primate lentiviruses and distinct from those of Moloney murine leukemia virus, avian sarcoma leukosis virus, and foamy virus.
Collapse
|
35
|
Scheetz TE, Kim KYA, Swiderski RE, Philp AR, Braun TA, Knudtson KL, Dorrance AM, DiBona GF, Huang J, Casavant TL, Sheffield VC, Stone EM. Regulation of gene expression in the mammalian eye and its relevance to eye disease. Proc Natl Acad Sci U S A 2006; 103:14429-34. [PMID: 16983098 PMCID: PMC1636701 DOI: 10.1073/pnas.0602562103] [Citation(s) in RCA: 130] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We used expression quantitative trait locus mapping in the laboratory rat (Rattus norvegicus) to gain a broad perspective of gene regulation in the mammalian eye and to identify genetic variation relevant to human eye disease. Of >31,000 gene probes represented on an Affymetrix expression microarray, 18,976 exhibited sufficient signal for reliable analysis and at least 2-fold variation in expression among 120 F(2) rats generated from an SR/JrHsd x SHRSP intercross. Genome-wide linkage analysis with 399 genetic markers revealed significant linkage with at least one marker for 1,300 probes (alpha = 0.001; estimated empirical false discovery rate = 2%). Both contiguous and noncontiguous loci were found to be important in regulating mammalian eye gene expression. We investigated one locus of each type in greater detail and identified putative transcription-altering variations in both cases. We found an inserted cREL binding sequence in the 5' flanking sequence of the Abca4 gene associated with an increased expression level of that gene, and we found a mutation of the gene encoding thyroid hormone receptor beta2 associated with a decreased expression level of the gene encoding short-wavelength sensitive opsin (Opn1sw). In addition to these positional studies, we performed a pairwise analysis of gene expression to identify genes that are regulated in a coordinated manner and used this approach to validate two previously undescribed genes involved in the human disease Bardet-Biedl syndrome. These data and analytical approaches can be used to facilitate the discovery of additional genes and regulatory elements involved in human eye disease.
Collapse
|
36
|
Dillon RJ, Ivens AC, Churcher C, Holroyd N, Quail MA, Rogers ME, Soares MB, Bonaldo MF, Casavant TL, Lehane MJ, Bates PA. Analysis of ESTs from Lutzomyia longipalpis sand flies and their contribution toward understanding the insect-parasite relationship. Genomics 2006; 88:831-840. [PMID: 16887324 PMCID: PMC2675706 DOI: 10.1016/j.ygeno.2006.06.011] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2006] [Revised: 06/16/2006] [Accepted: 06/20/2006] [Indexed: 11/30/2022]
Abstract
An expressed sequence tag library has been generated from a sand fly vector of visceral leishmaniasis, Lutzomyia longipalpis. A normalized cDNA library was constructed from whole adults and 16,608 clones were sequenced from both ends and assembled into 10,203 contigs and singlets. Of these 58% showed significant similarity to known genes from other organisms, < 4% were identical to described sand fly genes, and 42% had no match to any database sequence. Our analyses revealed putative proteins involved in the barrier function of the gut (peritrophins, microvillar proteins, glutamine synthase), digestive physiology (secreted and membrane-anchored hydrolytic enzymes), and the immune response (gram-negative binding proteins, thioester proteins, scavenger receptors, galectins, signaling pathway factors, caspases, serpins, and peroxidases). Sequence analysis of this transcriptome dataset has provided new insights into genes that might be associated with the response of the vector to the development of Leishmania.
Collapse
|
37
|
Braun TA, Shankar SP, Davis S, O'Leary B, Scheetz TE, Clark AF, Sheffield VC, Casavant TL, Stone EM. Prioritizing regions of candidate genes for efficient mutation screening. Hum Mutat 2006; 27:195-200. [PMID: 16395665 DOI: 10.1002/humu.20247] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The availability of the complete sequence of the human genome has dramatically facilitated the search for disease-causing sequence variations. In fact, the rate-limiting step has shifted from the discovery and characterization of candidate genes to the actual screening of human populations and the subsequent interpretation of observed variations. In this study we tested the hypothesis that some segments of candidate genes are more likely than others to contain disease-causing variations and that these segments can be predicted bioinformatically. A bioinformatic technique, prioritization of annotated regions (PAR), was developed to predict the likelihood that a specific coding region of a gene will harbor a disease-causing mutation based on conserved protein functional domains and protein secondary structures. This method was evaluated by using it to analyze 710 genes that collectively harbor 4,498 previously identified mutations. Nearly 50% of the genes were recognized as disease-associated after screening only 9% of the complete coding sequence. The PAR technique identified 90% of the genes as containing at least one mutation, with less than 40% of the screening resources that traditional approaches would require. These results suggest that prioritization strategies such as PAR can accelerate disease-gene identification through more efficient use of screening resources.
Collapse
|
38
|
Chun CK, Scheetz TE, Bonaldo MDF, Brown B, Clemens A, Crookes-Goodson WJ, Crouch K, DeMartini T, Eyestone M, Goodson MS, Janssens B, Kimbell JL, Koropatnick TA, Kucaba T, Smith C, Stewart JJ, Tong D, Troll JV, Webster S, Winhall-Rice J, Yap C, Casavant TL, McFall-Ngai MJ, Soares MB. An annotated cDNA library of juvenile Euprymna scolopes with and without colonization by the symbiont Vibrio fischeri. BMC Genomics 2006; 7:154. [PMID: 16780587 PMCID: PMC1574308 DOI: 10.1186/1471-2164-7-154] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2006] [Accepted: 06/16/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biologists are becoming increasingly aware that the interaction of animals, including humans, with their coevolved bacterial partners is essential for health. This growing awareness has been a driving force for the development of models for the study of beneficial animal-bacterial interactions. In the squid-vibrio model, symbiotic Vibrio fischeri induce dramatic developmental changes in the light organ of host Euprymna scolopes over the first hours to days of their partnership. We report here the creation of a juvenile light-organ specific EST database. RESULTS We generated eleven cDNA libraries from the light organ of E. scolopes at developmentally significant time points with and without colonization by V. fischeri. Single pass 3' sequencing efforts generated 42,564 expressed sequence tags (ESTs) of which 35,421 passed our quality criteria and were then clustered via the UIcluster program into 13,962 nonredundant sequences. The cDNA clones representing these nonredundant sequences were sequenced from the 5' end of the vector and 58% of these resulting sequences overlapped significantly with the associated 3' sequence to generate 8,067 contigs with an average sequence length of 1,065 bp. All sequences were annotated with BLASTX (E-value < -03) and Gene Ontology (GO). CONCLUSION Both the number of ESTs generated from each library and GO categorizations are reflective of the activity state of the light organ during these early stages of symbiosis. Future analyses of the sequences identified in these libraries promise to provide valuable information not only about pathways involved in colonization and early development of the squid light organ, but also about pathways conserved in response to bacterial colonization across the animal kingdom.
Collapse
|
39
|
Steely HT, Dillow GW, Bian L, Grundstad J, Braun TA, Casavant TL, McCartney MD, Clark AF. Protein expression in a transformed trabecular meshwork cell line: proteome analysis. Mol Vis 2006; 12:372-83. [PMID: 16636656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2023] Open
Abstract
PURPOSE Characterization of the human trabecular meshwork (TM) proteome is hindered by the small mass of intact tissue and the slow growth of cultured cell strains. We have previously characterized a transformed TM cell strain (GTM3) that demonstrates many of the same protein expression and cell signaling systems of nontransformed cell strains. The aim of this study was to initiate a proteomic survey of GTM3 cells as the initial step toward characterization of the complete human TM proteome. METHODS GTM3 cells were cultured to confluence, harvested and solubilized in urea/Nonidet. The protein extract (600 mug) was focused in immobilized isoelectric focusing (IEF) strips, separated by 10% SDS PAGE, and visualized with colloidal Coomassie Blue. Spots of interest were excised, destained, and the contained proteins subjected to in-gel reduction, derivatization, and tryptic digestion. Tryptic peptides were extracted and analyzed by electrospray LC/MS/MS. Protein identification was made using the TurboSequest search algorithm and a recent version of the nonredundant human protein database downloaded from the National Center for Biotechnology Information (NCBI). RESULTS Eighty-seven (87) primary proteins and 93 variants of these proteins were identified. A website was created (TM proteome) that combines data such as graphic spot location within the gel, peptide sequence, apparent and calculated pI, apparent and calculated mass, percentage of coverage, and protein informatic website links. CONCLUSIONS Proteomic analysis of a transformed human TM cell line has been initiated combining preparative two-dimensional PAGE separation, LC/MS/MS analysis of major proteins, and bioinformatic cataloging of the data. Further investigation of data from the transformed cell strain will be used in a comparative fashion for spot identification of analytical proteomic gels of human TM tissue and cultured normal cells. These initial data will form the base from which the characterization of protein expression in the normal and glaucomatous TM can be accomplished.
Collapse
|
40
|
Chiang AP, Beck JS, Yen HJ, Tayeh MK, Scheetz TE, Swiderski RE, Nishimura DY, Braun TA, Kim KYA, Huang J, Elbedour K, Carmi R, Slusarski DC, Casavant TL, Stone EM, Sheffield VC. Homozygosity mapping with SNP arrays identifies TRIM32, an E3 ubiquitin ligase, as a Bardet-Biedl syndrome gene (BBS11). Proc Natl Acad Sci U S A 2006; 103:6287-92. [PMID: 16606853 PMCID: PMC1458870 DOI: 10.1073/pnas.0600158103] [Citation(s) in RCA: 296] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The identification of mutations in genes that cause human diseases has largely been accomplished through the use of positional cloning, which relies on linkage mapping. In studies of rare diseases, the resolution of linkage mapping is limited by the number of available meioses and informative marker density. One recent advance is the development of high-density SNP microarrays for genotyping. The SNP arrays overcome low marker informativity by using a large number of markers to achieve greater coverage at finer resolution. We used SNP microarray genotyping for homozygosity mapping in a small consanguineous Israeli Bedouin family with autosomal recessive Bardet-Biedl syndrome (BBS; obesity, pigmentary retinopathy, polydactyly, hypogonadism, renal and cardiac abnormalities, and cognitive impairment) in which previous linkage studies using short tandem repeat polymorphisms failed to identify a disease locus. SNP genotyping revealed a homozygous candidate region. Mutation analysis in the region of homozygosity identified a conserved homozygous missense mutation in the TRIM32 gene, a gene coding for an E3 ubiquitin ligase. Functional analysis of this gene in zebrafish and expression correlation analyses among other BBS genes in an expression quantitative trait loci data set demonstrate that TRIM32 is a BBS gene. This study shows the value of high-density SNP genotyping for homozygosity mapping and the use of expression correlation data for evaluation of candidate genes and identifies the proteasome degradation pathway as a pathway involved in BBS.
Collapse
|
41
|
Kalari KR, Casavant M, Bair TB, Keen HL, Comeron JM, Casavant TL, Scheetz TE. First exons and introns--a survey of GC content and gene structure in the human genome. In Silico Biol 2006; 6:237-42. [PMID: 16922687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Most transcriptional regulatory elements are located in non-coding DNA. In particular, some first introns play a vital role in transcriptional control and splicing. The length and GC-content of first exons and introns in complex organisms suggests that these structural units are likely to be important functional elements in large genomes. Hence, in this paper we perform a systematic comparison of exon-intron structure and GC content on all known genes in the human genome. Our in-silico analysis found that the GC content of introns and exons varies significantly depending on their length. On average, the first intron of a gene is significantly longer than other introns in the same gene. Our results also show that first introns and exons are more GC rich than last and internal. This study provides insight into the structure of eukaryotic genes. These results confirm and expand the previously identified regulatory potential of first exons and introns.
Collapse
|
42
|
Bischof JM, Chiang AP, Scheetz TE, Stone EM, Casavant TL, Sheffield VC, Braun TA. Genome-wide identification of pseudogenes capable of disease-causing gene conversion. Hum Mutat 2006; 27:545-52. [PMID: 16671097 DOI: 10.1002/humu.20335] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Pseudogenes are remnants of gene duplication (nonprocessed pseudogenes) and retrotransposition (processed pseudogenes) events. This study describes methods for identifying gene conversion candidates from predicted pseudogenes. Pseudogenes may accumulate and harbor sequence variations over time that become disease-causing mutations when transferred to genes by gene conversion. A total of 14,476 pseudogenes were identified, including 3,426 nonprocessed pseudogenes. In addition, 1,945 nonprocessed pseudogenes that are localized near their progenitor gene were evaluated for their possible role in gene conversion and disease. All 11 known, human cases of gene conversion (with deleterious effects) involving pseudogenes were successfully identified by these methods. Among the pseudogenes identified is a retinitis pigmentosa 9 (RP9) pseudogene that carries a c.509A>G mutation which produces a p.Asp170Gly substitution that is associated with the RP9 form of autosomal dominant retinitis pigmentosa (adRP). The c.509A>G mutation in RP9 is a previously unrecognized example of gene conversion between the progenitor gene and its pseudogene. Notably, two processed pseudogenes also contain mutations associated with diseases. An inosine monophosphate dehydrogenase 1 (IMPDH1) pseudogene carries a c.676G>A mutation that produces a p.Asp226Asn substitution that causes the retinitis pigmentosa 10 (RP10) form of adRP; and a phosphoglycerate kinase 1 (PGK1) pseudogene (PGK1P1) carries a c.837T>C mutation that produces a p.Ile252Thr substitution that is associated with a phosphoglycerate kinase deficiency. Ranking of nonprocessed pseudogenes as candidates for gene conversion was also performed based on the sequence characteristics of published cases of pseudogene-mediated gene conversion. All results and tools produced by this study are available for download at: http://genome.uiowa.edu/pseudogenes.
Collapse
|
43
|
Goodson MS, Kojadinovic M, Troll JV, Scheetz TE, Casavant TL, Soares MB, McFall-Ngai MJ. Identifying components of the NF-kappaB pathway in the beneficial Euprymna scolopes-Vibrio fischeri light organ symbiosis. Appl Environ Microbiol 2005; 71:6934-46. [PMID: 16269728 PMCID: PMC1287678 DOI: 10.1128/aem.71.11.6934-6946.2005] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The Toll/NF-kappaB pathway is a common, evolutionarily conserved innate immune pathway that modulates the responses of animal cells to microbe-associated molecular patterns (MAMPs). Because MAMPs have been implicated as critical elements in the signaling of symbiont-induced development, an expressed sequence tag library from the juvenile light organ of Euprymna scolopes was used to identify members of the Toll/NF-kappaB pathway. Full-length transcripts were identified by using 5' and 3' RACE PCR. Seven transcripts critical for MAMP-induced triggering of the Toll/NF-kappaB phosphorylation cascade have been identified, including receptors, signal transducers, and a transcription factor. Further investigations should elucidate the role of the Toll/NF-kappaB pathway in the initiation of the beneficial symbiosis between E. scolopes and Vibrio fischeri.
Collapse
|
44
|
Sibenaller ZA, Etame AB, Ali MM, Barua M, Braun TA, Casavant TL, Ryken TC. Genetic characterization of commonly used glioma cell lines in the rat animal model system. Neurosurg Focus 2005; 19:E1. [PMID: 16241103 DOI: 10.3171/foc.2005.19.4.2] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Object
Animal models have been used extensively to discern the molecular biology of diseases and to gain insight into treatments. Nevertheless, discrepancies in the effects of treatments and procedures have been encountered during the transition from these animal models to application of the information to clinical trials in humans. To assess the genetic similarities between human gliomas and four cell lines used routinely in animal models, the authors used microarray technology to characterize the similarities and differences in gene expression.
Methods
To define the changes in gene expression, normal rat astrocytes were compared with four rat glioma cell lines (C6, 9L, F98, and RG2). The data were analyzed using two different methods: fold-change analysis and statistical analysis with t statistics. The gene products that were highlighted after intersecting the lists generated by the two methods of analysis were scrutinized against changes in gene expression reported in the literature. Tumorigenesis involves three major steps: the accumulation of genetic alterations, uncontrolled growth, and selected survival of transformed cells. The discussion of the results focuses attention on genes whose primary function is in pathways involved in glioma proliferation, infiltration, and neovascularization. A comparative microarray analysis of differentially expressed genes for four of the commonly used rat tumor cell lines is presented here.
Conclusions
Due to the variances between the cell lines and results from analyses in humans, caution must be observed in interpreting as well as in the translation of information learned from animal models to its application in human trials.
Collapse
|
45
|
Zabner J, Scheetz TE, Almabrazi HG, Casavant TL, Huang J, Keshavjee S, McCray PB. CFTR DeltaF508 mutation has minimal effect on the gene expression profile of differentiated human airway epithelia. Am J Physiol Lung Cell Mol Physiol 2005; 289:L545-53. [PMID: 15937068 DOI: 10.1152/ajplung.00065.2005] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR), an epithelial chloride channel regulated by phosphorylation. Most of the disease-associated morbidity is the consequence of chronic lung infection with progressive tissue destruction. As an approach to investigate the cellular effects of CFTR mutations, we used large-scale microarray hybridization to contrast the gene expression profiles of well-differentiated primary cultures of human CF and non-CF airway epithelia grown under resting culture conditions. We surveyed the expression profiles for 10 non-CF and 10 DeltaF508 homozygote samples. Of the 22,283 genes represented on the Affymetrix U133A GeneChip, we found evidence of significant changes in expression in 24 genes by two-sample t-test (P < 0.00001). A second, three-filter method of comparative analysis found no significant differences between the groups. The levels of CFTR mRNA were comparable in both groups. There were no significant differences in the gene expression patterns between male and female CF specimens. There were 18 genes with significant increases and 6 genes with decreases in CF relative to non-CF samples. Although the function of many of the differentially expressed genes is unknown, one transcript that was elevated in CF, the KCl cotransporter (KCC4), is a candidate for further study. Overall, the results indicate that CFTR dysfunction has little direct impact on airway epithelial gene expression in samples grown under these conditions.
Collapse
|
46
|
Hackett JD, Scheetz TE, Yoon HS, Soares MB, Bonaldo MF, Casavant TL, Bhattacharya D. Insights into a dinoflagellate genome through expressed sequence tag analysis. BMC Genomics 2005; 6:80. [PMID: 15921535 PMCID: PMC1173104 DOI: 10.1186/1471-2164-6-80] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2005] [Accepted: 05/29/2005] [Indexed: 11/10/2022] Open
Abstract
Background Dinoflagellates are important marine primary producers and grazers and cause toxic "red tides". These taxa are characterized by many unique features such as immense genomes, the absence of nucleosomes, and photosynthetic organelles (plastids) that have been gained and lost multiple times. We generated EST sequences from non-normalized and normalized cDNA libraries from a culture of the toxic species Alexandrium tamarense to elucidate dinoflagellate evolution. Previous analyses of these data have clarified plastid origin and here we study the gene content, annotate the ESTs, and analyze the genes that are putatively involved in DNA packaging. Results Approximately 20% of the 6,723 unique (11,171 total 3'-reads) ESTs data could be annotated using Blast searches against GenBank. Several putative dinoflagellate-specific mRNAs were identified, including one novel plastid protein. Dinoflagellate genes, similar to other eukaryotes, have a high GC-content that is reflected in the amino acid codon usage. Highly represented transcripts include histone-like (HLP) and luciferin binding proteins and several genes occur in families that encode nearly identical proteins. We also identified rare transcripts encoding a predicted protein highly similar to histone H2A.X. We speculate this histone may be retained for its role in DNA double-strand break repair. Conclusion This is the most extensive collection to date of ESTs from a toxic dinoflagellate. These data will be instrumental to future research to understand the unique and complex cell biology of these organisms and for potentially identifying the genes involved in toxin production.
Collapse
|
47
|
Bonaldo MF, Bair TB, Scheetz TE, Snir E, Akabogu I, Bair JL, Berger B, Crouch K, Davis A, Eyestone ME, Keppel C, Kucaba TA, Lebeck M, Lin JL, de Melo AIR, Rehmann J, Reiter RS, Schaefer K, Smith C, Tack D, Trout K, Sheffield VC, Lin JJC, Casavant TL, Soares MB. 1274 full-open reading frames of transcripts expressed in the developing mouse nervous system. Genome Res 2004; 14:2053-63. [PMID: 15489326 PMCID: PMC528920 DOI: 10.1101/gr.2601304] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
As part of the trans-National Institutes of Health (NIH) Mouse Brain Molecular Anatomy Project (BMAP), and in close coordination with the NIH Mammalian Gene Collection Program (MGC), we initiated a large-scale project to clone, identify, and sequence the complete open reading frame (ORF) of transcripts expressed in the developing mouse nervous system. Here we report the analysis of the ORF sequence of 1274 cDNAs, obtained from 47 full-length-enriched cDNA libraries, constructed by using a novel approach, herein described. cDNA libraries were derived from size-fractionated cytoplasmic mRNA isolated from brain and eye tissues obtained at several embryonic stages and postnatal days. Altogether, including the full-ORF MGC sequences derived from these libraries by the MGC sequencing team, NIH_BMAP full-ORF sequences correspond to approximately 20% of all transcripts currently represented in mouse MGC. We show that NIH_BMAP clones comprise 68% of mouse MGC cDNAs > or =5 kb, and 54% of those > or =4 kb, as of March 15, 2004. Importantly, we identified transcripts, among the 1274 full-ORF sequences, that are exclusively or predominantly expressed in brain and eye tissues, many of which encode yet uncharacterized proteins.
Collapse
|
48
|
Shrout JD, Scheetz TE, Casavant TL, Parkin GF. Isolation and characterization of autotrophic, hydrogen-utilizing, perchlorate-reducing bacteria. Appl Microbiol Biotechnol 2004; 67:261-8. [PMID: 15834721 DOI: 10.1007/s00253-004-1725-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Revised: 06/25/2004] [Accepted: 07/09/2004] [Indexed: 10/26/2022]
Abstract
Recent studies have shown that perchlorate (ClO(4) (-)) can be degraded by some pure-culture and mixed-culture bacteria with the addition of hydrogen. This paper describes the isolation of two hydrogen-utilizing perchlorate-degrading bacteria capable of using inorganic carbon for growth. These autotrophic bacteria are within the genus Dechloromonas and are the first Dechloromonas species that are microaerophilic and incapable of growth at atmospheric oxygen concentrations. Dechloromonas sp. JDS5 and Dechloromonas sp. JDS6 are the first perchlorate-degrading autotrophs isolated from a perchlorate-contaminated site. Measured hydrogen thresholds were higher than for other environmentally significant, hydrogen-utilizing, anaerobic bacteria (e.g., halorespirers). The chlorite dismutase activity of these bacteria was greater for autotrophically grown cells than for cells grown heterotrophically on lactate. These bacteria used fumarate as an alternate electron acceptor, which is the first report of growth on an organic electron acceptor by perchlorate-reducing bacteria.
Collapse
|
49
|
Stone EM, Braun TA, Russell SR, Kuehn MH, Lotery AJ, Moore PA, Eastman CG, Casavant TL, Sheffield VC. Missense variations in the fibulin 5 gene and age-related macular degeneration. N Engl J Med 2004; 351:346-53. [PMID: 15269314 DOI: 10.1056/nejmoa040833] [Citation(s) in RCA: 217] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
BACKGROUND Age-related macular degeneration (AMD) is the most common cause of irreversible vision loss in the developed world. The study of a rare mendelian form of macular degeneration implicated fibulin genes in the pathogenesis of more common forms of this disease. We evaluated five fibulin genes in a large series of patients with AMD. METHODS We studied 402 patients with AMD and 429 control subjects from the same clinic population. Patients were examined by means of indirect ophthalmoscopy, slit-lamp microscopy, and fundus photography to establish the presence and phenotypic pattern of AMD. DNA samples were screened for sequence variations in five members of the fibulin gene family. RESULTS Amino acid-altering sequence variations were found in all five fibulin genes, many of which were observed only in patients with AMD. Several of the altered residues have been conserved during evolution. Seven of the 402 patients with AMD had amino acid-altering sequence variations in the fibulin 5 gene, whereas none were observed among 429 control subjects (P<0.01). In addition, these seven patients all had small, circular drusen, which are commonly referred to as basal laminar or cuticular drusen. CONCLUSIONS Missense mutations in the fibulin 5 gene were found in 1.7 percent of patients with AMD. Many variations in other fibulin genes were also found in these patients, and the evolutionary conservation of the affected residues suggests that several of these variations may also be involved in AMD.
Collapse
|
50
|
Zhao SH, Simmons DG, Cross JC, Scheetz TE, Casavant TL, Soares MB, Tuggle CK. PLET1 (C11orf34), a highly expressed and processed novel gene in pig and mouse placenta, is transcribed but poorly spliced in human. Genomics 2004; 84:114-25. [PMID: 15203209 DOI: 10.1016/j.ygeno.2004.02.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2003] [Accepted: 02/12/2004] [Indexed: 11/28/2022]
Abstract
Sequencing of porcine cDNAs identified a novel EST with high frequency in placenta tissue. Full-length PLET1 (placenta-expressed transcript 1, also called C11orf34) matched a mouse cDNA and many bovine and mouse ESTs but no human transcripts or ESTs. However, the porcine cDNA matched several putative exons within a human genomic DNA fragment on chromosome 11. This human locus is in a region of conserved synteny with pig chromosome 9, to which the porcine gene was subsequently mapped. RNA blot hybridization showed that this gene had high expression in porcine and mouse conceptus and throughout placenta development. In situ hybridization using mouse placenta showed PLET1 expression in trophoblast cells of the labyrinth, as well as in spongiotrophoblast and glycogen trophoblast cells. However, no expression of PLET1 was detected by RNA blot analysis of human placenta, although RT-PCR analysis detected very small amounts of partially spliced RNA that were significantly less abundant than the RNA levels in mouse placenta. Donor and acceptor splicing site sequences in the exons of the human gene are poorly conserved and may be the cause of inefficient splicing found specifically in human tissue. Our data correct GenomeScan annotation of this region of the human genome and describe functional gene discovery in mammals not recognized in human EST projects.
Collapse
|