1
|
Phillips CD, Butler B, Fondon JW, Mantilla-Meluk H, Baker RJ. Contrasting evolutionary dynamics of the developmental regulator PAX9, among bats, with evidence for a novel post-transcriptional regulatory mechanism. PLoS One 2013; 8:e57649. [PMID: 23469040 PMCID: PMC3585407 DOI: 10.1371/journal.pone.0057649] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2012] [Accepted: 01/23/2013] [Indexed: 11/18/2022] Open
Abstract
Morphological evolution can be the result of natural selection favoring modification of developmental signaling pathways. However, little is known about the genetic basis of such phenotypic diversity. Understanding these mechanisms is difficult for numerous reasons, yet studies in model organisms often provide clues about the major developmental pathways involved. The paired-domain gene, PAX9, is known to be a key regulator of development, particularly of the face and teeth. In this study, using a comparative genetics approach, we investigate PAX9 molecular evolution among mammals, focusing on craniofacially diversified (Phyllostomidae) and conserved (Vespertilionidae) bat families, and extend our comparison to other orders of mammal. Open-reading frame analysis disclosed signatures of selection, in which a small percentage of residues vary, and lineages acquire different combinations of variation through recurrent substitution and lineage specific changes. A few instances of convergence for specific residues were observed between morphologically convergent bat lineages. Bioinformatic analysis for unknown PAX9 regulatory motifs indicated a novel post-transcriptional regulatory mechanism involving a Musashi protein. This regulation was assessed through fluorescent reporter assays and gene knockdowns. Results are compatible with the hypothesis that the number of Musashi binding-elements in PAX9 mRNA proportionally regulates protein translation rate. Although a connection between morphology and binding element frequency was not apparent, results indicate this regulation would vary among craniofacially divergent bat species, but be static among conserved species. Under this model, Musashi's regulatory control of alternative human PAX9 isoforms would also vary. The presence of Musashi-binding elements within PAX9 of all mammals examined, chicken, zebrafish, and the fly homolog of PAX9, indicates this regulatory mechanism is ancient, originating basal to much of the animal phylogeny.
Collapse
Affiliation(s)
- Caleb D Phillips
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America.
| | | | | | | | | |
Collapse
|
2
|
McIver LJ, McCormick JF, Martin A, Fondon JW, Garner HR. Population-scale analysis of human microsatellites reveals novel sources of exonic variation. Gene 2012; 516:328-34. [PMID: 23274653 DOI: 10.1016/j.gene.2012.12.068] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2012] [Accepted: 12/15/2012] [Indexed: 11/29/2022]
Abstract
Using our microsatellite specific genotyping method, we analyzed tandem repeats, which are known to be highly variable with some recognized as biomarkers causative of disease, in over 500 individuals who were exon sequenced in a 1000 Genomes Project pilot study. We were able to genotype over 97% of the microsatellite loci in the targeted regions. A total of 25,115 variations were observed, including repeat length and single nucleotide polymorphisms, corresponding to an average of 45.6 variations per individual and a density of 1.1 variations per kilobase. Standard variant detection did not report 94.2% of the exonic repeat length variations in part because the alignment techniques are not ideal for repetitive regions. Additionally some standard variation detection tools rely on a database of known variations, making them less likely to call repeat length variations as only a small percent of these loci (~6000) have been accurately characterized. A subset of the hundreds of non-synonymous variations we identified was experimentally validated, indicating an accuracy of 96.5% for our microsatellite-based genotyping method, with some novel variants identified in genes associated with cancer. We propose that microsatellite-based genotyping be used as a part of large scale sequencing studies to identify novel variants.
Collapse
Affiliation(s)
- L J McIver
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | | | | | | | | |
Collapse
|
3
|
Schoenebeck JJ, Hutchinson SA, Byers A, Beale HC, Carrington B, Faden DL, Rimbault M, Decker B, Kidd JM, Sood R, Boyko AR, Fondon JW, Wayne RK, Bustamante CD, Ciruna B, Ostrander EA. Variation of BMP3 contributes to dog breed skull diversity. PLoS Genet 2012; 8:e1002849. [PMID: 22876193 PMCID: PMC3410846 DOI: 10.1371/journal.pgen.1002849] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Accepted: 06/06/2012] [Indexed: 12/21/2022] Open
Abstract
Since the beginnings of domestication, the craniofacial architecture of the domestic dog has morphed and radiated to human whims. By beginning to define the genetic underpinnings of breed skull shapes, we can elucidate mechanisms of morphological diversification while presenting a framework for understanding human cephalic disorders. Using intrabreed association mapping with museum specimen measurements, we show that skull shape is regulated by at least five quantitative trait loci (QTLs). Our detailed analysis using whole-genome sequencing uncovers a missense mutation in BMP3. Validation studies in zebrafish show that Bmp3 function in cranial development is ancient. Our study reveals the causal variant for a canine QTL contributing to a major morphologic trait. As a result of selective breeding practices, modern dogs display a multitude of head shapes. Breeds such as the Pug and Bulldog popularize one of these morphologies, termed “brachycephaly.” A short, upward-pointing snout, a massive and rounded head, and an underbite typify brachycephalic breeds. Here, we have coupled the phenotypes collected from museum skulls with the genotypes collected from dogs and identified five regions of the dog genome that are associated with canine brachycephaly. Fine mapping at one of these regions revealed a causal mutation in the gene BMP3. Bmp3's role in regulating cranial development is evolutionarily ancient, as zebrafish require its function to generate a normal craniofacial morphology. Our data begin to expose the genetic mechanisms unknowingly employed by breeders to create and diversify the cranial shape of dogs.
Collapse
Affiliation(s)
- Jeffrey J. Schoenebeck
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Sarah A. Hutchinson
- Program in Developmental and Stem Cell Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Alexandra Byers
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Holly C. Beale
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Blake Carrington
- Zebrafish Core, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Daniel L. Faden
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Maud Rimbault
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Brennan Decker
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Jeffrey M. Kidd
- Department of Genetics, Stanford School of Medicine, Stanford, California, United States of America
| | - Raman Sood
- Zebrafish Core, National Human Genome Research Institute, Bethesda, Maryland, United States of America
| | - Adam R. Boyko
- Department of Biomedical Sciences, Cornell University College of Veterinary Medicine, Ithaca, New York, United States of America
| | - John W. Fondon
- Department of Biology, University of Texas at Arlington, Arlington, Texas, United States of America
| | - Robert K. Wayne
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Carlos D. Bustamante
- Department of Genetics, Stanford School of Medicine, Stanford, California, United States of America
| | - Brian Ciruna
- Program in Developmental and Stem Cell Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Elaine A. Ostrander
- Cancer Genetics Branch, National Human Genome Research Institute, Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
4
|
Fondon JW, Martin A, Richards S, Gibbs RA, Mittelman D. Analysis of microsatellite variation in Drosophila melanogaster with population-scale genome sequencing. PLoS One 2012; 7:e33036. [PMID: 22427938 PMCID: PMC3299726 DOI: 10.1371/journal.pone.0033036] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Accepted: 02/07/2012] [Indexed: 11/29/2022] Open
Abstract
Genome sequencing technologies promise to revolutionize our understanding of genetics, evolution, and disease by making it feasible to survey a broad spectrum of sequence variation on a population scale. However, this potential can only be realized to the extent that methods for extracting and interpreting distinct forms of variation can be established. The error profiles and read length limitations of early versions of next-generation sequencing technologies rendered them ineffective for some sequence variant types, particularly microsatellites and other tandem repeats, and fostered the general misconception that such variants are inherently inaccessible to these platforms. At the same time, tandem repeats have emerged as important sources of functional variation. Tandem repeats are often located in and around genes, and frequent mutations in their lengths exert quantitative effects on gene function and phenotype, rapidly degrading linkage disequilibrium between markers and traits. Sensitive identification of these variants in large-scale next-gen sequencing efforts will enable more comprehensive association studies capable of revealing previously invisible associations. We present a population-scale analysis of microsatellite repeats using whole-genome data from 158 inbred isolates from the Drosophila Genetics Reference Panel, a collection of over 200 extensively phenotypically characterized isolates from a single natural population, to uncover processes underlying repeat mutation and to enable associations with behavioral, morphological, and life-history traits. Analysis of repeat variation from next-generation sequence data will also enhance studies of genome stability and neurodegenerative diseases.
Collapse
Affiliation(s)
- John W. Fondon
- Department of Biology, University of Texas at Arlington, Arlington, Texas, United States of America
| | - Andy Martin
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Stephen Richards
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - David Mittelman
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia, United States of America
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|
5
|
McIver LJ, Fondon JW, Skinner MA, Garner HR. Evaluation of microsatellite variation in the 1000 Genomes Project pilot studies is indicative of the quality and utility of the raw data and alignments. Genomics 2011; 97:193-9. [PMID: 21223998 DOI: 10.1016/j.ygeno.2011.01.001] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2010] [Revised: 12/14/2010] [Accepted: 01/03/2011] [Indexed: 12/18/2022]
Abstract
We performed an analysis of global microsatellite variation on the two kindreds sequenced at high depth (~20×-60×) in the 1000 Genomes Project pilot studies because alterations in these highly mutable repetitive sequences have been linked with many phenotypes and disease risks. The standard alignment technique performs poorly in microsatellite regions as a consequence of low effective coverage (~1×-5×) resulting in 79% of the informative loci exhibiting non-Mendelian inheritance patterns. We used a more stringent approach in computing robust allelotypes resulting in 94.4% of the 1095 informative repeats conforming to traditional inheritance. The high-confidence allelotypes were analyzed to obtain an estimate of the minimum polymorphism rate as a function of motif length, motif sequence, and distribution within the genome.
Collapse
Affiliation(s)
- L J McIver
- Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061-0477, USA
| | | | | | | |
Collapse
|
6
|
Fondon JW, Hammock EAD, Hannan AJ, King DG. Simple sequence repeats: genetic modulators of brain function and behavior. Trends Neurosci 2008; 31:328-34. [PMID: 18550185 DOI: 10.1016/j.tins.2008.03.006] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2007] [Revised: 03/17/2008] [Accepted: 03/18/2008] [Indexed: 02/03/2023]
Affiliation(s)
- John W Fondon
- McDermott Center for Human Growth and Development and Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | | | | | | |
Collapse
|
7
|
Abstract
The remarkable responsiveness of dog morphology to selection is a testament to the mutability of mammals. The genetic sources of this morphological variation are largely unknown, but some portion is due to tandem repeat length variation in genes involved in development. Previous analysis of tandem repeats in coding regions of developmental genes revealed fewer interruptions in repeat sequences in dogs than in the orthologous repeats in humans, as well as higher levels of polymorphism, but the fragmentary nature of the available dog genome sequence thwarted attempts to distinguish between locus-specific and genome-wide origins of this disparity. Using whole-genome analyses of the human and recently completed dog genomes, we show that dogs possess a genome-wide increase in the basal germ-line slippage mutation rate. Building on the approach that gave rise to the initial observation in dogs, we sequenced 55 coding repeat regions in 42 species representing 10 major carnivore clades and found that a genome-wide elevated slippage mutation rate is a derived character shared by diverse wild canids, distinguishing them from other Carnivora. A similarly heightened slippage profile was also detected in rodents, another taxon exhibiting high diversity and rapid evolvability. The correlation of enhanced slippage rates with major evolutionary radiations suggests that the possession of a "slippery" genome may bestow on some taxa greater potential for rapid evolutionary change.
Collapse
Affiliation(s)
- Jeffrey Laidlaw
- Department of Biochemistry, UT Southwestern Medical Center, Dallas, TX 75390, USA
| | | | | | | | | | | | | |
Collapse
|
8
|
Fondon JW, Garner HR. Detection of length-dependent effects of tandem repeat alleles by 3-D geometric decomposition of craniofacial variation. Dev Genes Evol 2006; 217:79-85. [PMID: 17066275 DOI: 10.1007/s00427-006-0113-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2006] [Accepted: 09/11/2006] [Indexed: 10/24/2022]
Abstract
Topologically conservative morphological transformations typify the succession of species in the fossil record and also typify more subtle morphological variation within species. Isolation and quantification of morphological variation along its various intermingled modes becomes increasingly difficult as the structures under consideration increase in complexity. Here, we describe a comparative morphometric and genomic study in dogs in which complex three-dimensional craniofacial variation is mathematically distilled into simpler geometric components to test the hypothesis that incremental mutations at developmental loci result in simple geometric deformations of morphology. Combinations of candidate transforms are computationally evaluated for their ability to accurately transform a reference three-dimensional skull model into those of distinct breeds. A set of five simple basis functions are found to be sufficient to describe most craniofacial variation among dogs. Allele lengths of amino acid repeat length variants in developmental regulator genes, which frequently have quantitative effects on phenotype, were compared to geometric terms using Pearson correlation and regression. The coordinated quantitative representation of both phenotype and genotype improves the statistical power for the detection of causative genotype-phenotype relationships and enabled the characterization of the influence of Runx-2 coding repeat length on craniofacial variation among domestic dogs.
Collapse
Affiliation(s)
- John W Fondon
- McDermott Center for Human Growth and Development, Department of Biochemistry, UT Southwestern Medical Center, Dallas, TX 75390-9152, USA.
| | | |
Collapse
|
9
|
Pertsemlidis A, Zelinka J, Fondon JW, Henderson RK, Otwinowski Z. Bayesian Statistical Studies of the Ramachandran Distribution. Stat Appl Genet Mol Biol 2005; 4:Article35. [PMID: 16646854 DOI: 10.2202/1544-6115.1165] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We describe a method for the generation of knowledge-based potentials and apply it to the observed torsional angles of known protein structures. The potential is derived using Bayesian reasoning, and is useful as a prior for further such reasoning in the presence of additional data. The potential takes the form of a probability density function, which is described by a small number of coefficients with the number of necessary coefficients determined by tests based on statistical significance and entropy. We demonstrate the methods in deriving one such potential corresponding to two dimensions, the Ramachandran plot. In contrast to traditional histogram-based methods, the function is continuous and differentiable. These properties allow us to use the function as a force term in the energy minimization of appropriately described structures. The method can easily be extended to other observable angles and higher dimensions, or to include sequence dependence and should find applications in structure determination and validation.
Collapse
|
10
|
Abstract
Mutations in cis-regulatory sequences have been implicated as being the predominant source of variation in morphological evolution. We offer a hypothesis that gene-associated tandem repeat expansions and contractions are a major source of phenotypic variation in evolution. Here, we describe a comparative genomic study of repetitive elements in developmental genes of 92 breeds of dogs. We find evidence for selection for divergence at coding repeat loci in the form of both elevated purity and extensive length polymorphism among different breeds. Variations in the number of repeats in the coding regions of the Alx-4 (aristaless-like 4) and Runx-2 (runt-related transcription factor 2) genes were quantitatively associated with significant differences in limb and skull morphology. We identified similar repeat length variation in the coding repeats of Runx-2, Twist, and Dlx-2 in several other species. The high frequency and incremental effects of repeat length mutations provide molecular explanations for swift, yet topologically conservative morphological evolution.
Collapse
Affiliation(s)
- John W Fondon
- Eugene McDermott Center for Human Growth and Development and Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX 75390-8591, USA.
| | | |
Collapse
|
11
|
Abstract
We present a point mutation classification method that contrasts SNP databases and has the potential to illuminate the relative mutational load of genes caused by codon bias. We group point variation gleaned from public databases by their wild-type and mutant codons, e.g. codon mutation classes (CMCs, 576 possible such as ACG-->ATG), whose frequencies in a database are assembled into a BLOSUM-style matrix describing the likelihood of observing all possible single base codon changes as tuned by the intertwined effects of mutation rate and selection. The rankings of the CMCs in any database are reshuffled according to the population stratification of the typical genotyping experiment producing that resource's data. Analysis of four independent databases reveals that a considerable fraction of mutation in functional genes can be described by a few CMCs regardless of gene identity or population stratification in the genotyping experiment. For example, the top 5% (29/576) of CMCs account for 27.4% of the observed variants in dbSNP while the bottom 5% account for only 0.02%. For non-synonymous disease-causing mutation, 40.8% are described by the top 5% of all possible non-silent CMCs (22/438). Overall, the most observed polymorphism is a G-->A transition at CpG dinucleotides causing ACG, TCG, GCG, and CCG to frequently undergo silent mutation in any gene due to the putative lack of impact on the protein product. In order to assess how well CMC spectrums estimate the aggregate non-synonymous mutational trends of a single gene, a CMC matrix was applied to seven unrelated genes to compute the most likely point mutations. In excess of 87% of these mutation predictions are historically known to play an important role in a disease state according to published literature. CMC-based mutation prediction may aid design and execution of direct association genotyping studies.
Collapse
Affiliation(s)
- Monica M Horvath
- McDermott Center for Human Growth and Development, The University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-8591, USA.
| | | | | |
Collapse
|
12
|
Abstract
Searching for similarities between biological sequences is the principal means by which bioinformatics contributes to our understanding of biology. Of the various informatics tools developed to accomplish this task, the most widely used is BLAST, the basic local alignment search tool. This article discusses the principles, workings, applications and potential pitfalls of BLAST, focusing on the implementation developed at the National Center for Biotechnology Information.
Collapse
Affiliation(s)
- A Pertsemlidis
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390-8591, USA.
| | | |
Collapse
|
13
|
Wren JD, Forgacs E, Fondon JW, Pertsemlidis A, Cheng SY, Gallardo T, Williams RS, Shohet RV, Minna JD, Garner HR. Repeat polymorphisms within gene regions: phenotypic and evolutionary implications. Am J Hum Genet 2000; 67:345-56. [PMID: 10889045 PMCID: PMC1287183 DOI: 10.1086/303013] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2000] [Accepted: 06/02/2000] [Indexed: 11/03/2022] Open
Abstract
We have developed an algorithm that predicted 11,265 potentially polymorphic tandem repeats within transcribed sequences. We estimate that 22% (2,207/9,717) of the annotated clusters within UniGene contain at least one potentially polymorphic locus. Our predictions were tested by allelotyping a panel of approximately 30 individuals for 5% of these regions, confirming polymorphism for more than half the loci tested. Our study indicates that tandem-repeat polymorphisms in genes are more common than is generally believed. Approximately 8% of these loci are within coding sequences and, if polymorphic, would result in frameshifts. Our catalogue of putative polymorphic repeats within transcribed sequences comprises a large set of potentially phenotypic or disease-causing loci. In addition, from the anomalous character of the repetitive sequences within unannotated clusters, we also conclude that the UniGene cluster count substantially overestimates the number of genes in the human genome. We hypothesize that polymorphisms in repeated sequences occur with some baseline distribution, on the basis of repeat homogeneity, size, and sequence composition, and that deviations from that distribution are indicative of the nature of selection pressure at that locus. We find evidence of selective maintenance of the ability of some genes to respond very rapidly, perhaps even on intragenerational timescales, to fluctuating selective pressures.
Collapse
Affiliation(s)
- J D Wren
- Program in Genetics, Southwestern Graduate School of Biomedical Sciences, Dallas, TX, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Wistuba II, Behrens C, Virmani AK, Mele G, Milchgrub S, Girard L, Fondon JW, Garner HR, McKay B, Latif F, Lerman MI, Lam S, Gazdar AF, Minna JD. High resolution chromosome 3p allelotyping of human lung cancer and preneoplastic/preinvasive bronchial epithelium reveals multiple, discontinuous sites of 3p allele loss and three regions of frequent breakpoints. Cancer Res 2000; 60:1949-60. [PMID: 10766185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
Allele loss involving chromosome arm 3p is one of the most frequent and earliest known genetic events in lung cancer pathogenesis and may affect several potential tumor suppressor gene regions. To further study the role of chromosome 3p allele loss in the pathogenesis of lung cancer, we performed high resolution loss of heterozygosity (LOH) studies on 97 lung cancer and 54 preneoplastic/preinvasive microdissected respiratory epithelial samples using a panel of 28 3p markers. Allelic losses of 3p were detected in 96% of the lung cancers and in 78% of the preneoplastic/preinvasive lesions. The allele losses were often multiple and discontinuous, with areas of LOH interspersed with areas of retention of heterozygosity. Most small cell lung carcinomas (91%) and squamous cell carcinomas (95%) demonstrated larger 3p segments of allele loss, whereas most (71%) of the adenocarcinomas and preneoplastic/preinvasive lesions had smaller chromosome areas of 3p allele loss. There was a progressive increase in the frequency and size of 3p allele loss regions with increasing severity of histopathological preneoplastic/preinvasive changes. In analyses of the specific parental allele lost comparing 42 preneoplastic/preinvasive foci with those lost in the lung cancer in the same patient (n = 10), the same parental allele was lost in 88% of 244 comparisons for 28 3p markers (P = 1.2 x 10(-36) for this occurring by chance). This indicates the occurrence of allele-specific loss in these foci similar to that seen in the tumor by a currently unknown mechanism. Analysis of all of the data indicated multiple regions of localized 3p allele loss including telomere-D3S1597, D3S1111-D3S2432, D3S2432-D3S1537, D3S1537, D3S1537-D3S1612, D3S4604/Luca19.1-D3S4622/Luca4.1, D3S4624/Luca2.1, D3S4624/Luca2.1-D3S1582, D3S1766, D3S1234-D3S1300 (FHIT/FRA3B region centered on D3S1300), D3S1284-D3S1577 (U2020/DUTT1 region centered on D3S1274), and D3S1511-centromere. A panel of six markers in the 600-kb 3p21.3 deletion region showed loss in 77% of the lung cancers, 70% of normal or preneoplastic/preinvasive lesions associated with lung cancer, and 49% of 47 normal, mildly abnormal, or preneoplastic/preinvasive lesions found in smokers without lung cancer; however, loss was seen in 0% of 18 epithelial samples from seven never smokers. The 600-kb 3p21.3 region and the 3p14.2 (FHIT/FRA3B) and 3p12 (U2020/DUTT1) regions were common, independent sites of breakpoints (retention of heterozygosity by some markers and LOH by other markers in the immediate region). We conclude that 3p allele loss is nearly universal in lung cancer pathogenesis; involves multiple, discrete, 3p LOH sites that often show a "discontinuous LOH" pattern in individual tumors; occurs in preneoplastic/preinvasive lesions in smokers with and without lung cancer (multiple lesions often lose the same parental allele); frequently involves breakpoints in at least three very small defined genomic regions; and appears to have allele loss and breakpoints first occurring in the 600-kb 3p21.3 region. These findings are consistent with previously reported LOH studies in a variety of tumors showing allele loss occurring by mitotic recombination and induced by oxidative damage.
Collapse
Affiliation(s)
- I I Wistuba
- Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, Dallas 75390, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Fondon JW, Mele GM, Brezinschek RI, Cummings D, Pande A, Wren J, O'Brien KM, Kupfer KC, Wei MH, Lerman M, Minna JD, Garner HR. Computerized polymorphic marker identification: experimental validation and a predicted human polymorphism catalog. Proc Natl Acad Sci U S A 1998; 95:7514-9. [PMID: 9636181 PMCID: PMC22669 DOI: 10.1073/pnas.95.13.7514] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/1998] [Accepted: 05/04/1998] [Indexed: 02/07/2023] Open
Abstract
A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called POMPOUS (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p21.3, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by POMPOUS to be variable in repeat size. We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42. Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. POMPOUS provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use POMPOUS to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.
Collapse
Affiliation(s)
- J W Fondon
- McDermott Center for Human Growth and Development and the Center for Biomedical Inventions, The University of Texas Southwestern Medical Center, Dallas, TX 75235, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
O'Brien KM, Fondon JW, Evans GA, Garner HR. Rescuing corrupted gel files from Model 377 and 373 DNA Sequencers. Biotechniques 1997; 22:1162-3. [PMID: 9187770 DOI: 10.2144/97226bc05] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Automated DNA sequencing requires the intensive use of computers to handle the large amount of data taken. When a computer failure occurs and the data are no longer accessible, all the expense and effort that went into the sequencing experiment is lost. By using the data storage architecture of Macintosh computers to our advantage, we may prevent this loss in the case of automatic sequencers from PE Applied Biosystems. The software required to allow the experimenter to do this has been written and is available free of charge.
Collapse
Affiliation(s)
- K M O'Brien
- Eugene McDermott Center for Human Growth and Development, University of Texas, Southwestern Medical Center, Dallas 75235-8591, USA
| | | | | | | |
Collapse
|