51
|
Nam K, Mugal C, Nabholz B, Schielzeth H, Wolf JBW, Backström N, Künstner A, Balakrishnan CN, Heger A, Ponting CP, Clayton DF, Ellegren H. Molecular evolution of genes in avian genomes. Genome Biol 2010; 11:R68. [PMID: 20573239 PMCID: PMC2911116 DOI: 10.1186/gb-2010-11-6-r68] [Citation(s) in RCA: 95] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2010] [Revised: 06/18/2010] [Accepted: 06/23/2010] [Indexed: 11/20/2022] Open
Abstract
Background Obtaining a draft genome sequence of the zebra finch (Taeniopygia guttata), the second bird genome to be sequenced, provides the necessary resource for whole-genome comparative analysis of gene sequence evolution in a non-mammalian vertebrate lineage. To analyze basic molecular evolutionary processes during avian evolution, and to contrast these with the situation in mammals, we aligned the protein-coding sequences of 8,384 1:1 orthologs of chicken, zebra finch, a lizard and three mammalian species. Results We found clear differences in the substitution rate at fourfold degenerate sites, being lowest in the ancestral bird lineage, intermediate in the chicken lineage and highest in the zebra finch lineage, possibly reflecting differences in generation time. We identified positively selected and/or rapidly evolving genes in avian lineages and found an over-representation of several functional classes, including anion transporter activity, calcium ion binding, cell adhesion and microtubule cytoskeleton. Conclusions Focusing specifically on genes of neurological interest and genes differentially expressed in the unique vocal control nuclei of the songbird brain, we find a number of positively selected genes, including synaptic receptors. We found no evidence that selection for beneficial alleles is more efficient in regions of high recombination; in fact, there was a weak yet significant negative correlation between ω and recombination rate, which is in the direction predicted by the Hill-Robertson effect if slightly deleterious mutations contribute to protein evolution. These findings set the stage for studies of functional genetics of avian genes.
Collapse
Affiliation(s)
- Kiwoong Nam
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, Uppsala, S-752 36, Sweden
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
52
|
Shumay E, Fowler JS, Volkow ND. Genomic features of the human dopamine transporter gene and its potential epigenetic States: implications for phenotypic diversity. PLoS One 2010; 5:e11067. [PMID: 20548783 PMCID: PMC2883569 DOI: 10.1371/journal.pone.0011067] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2010] [Accepted: 05/18/2010] [Indexed: 02/06/2023] Open
Abstract
Human dopamine transporter gene (DAT1 or SLC6A3) has been associated with various brain-related diseases and behavioral traits and, as such, has been investigated intensely in experimental- and clinical-settings. However, the abundance of research data has not clarified the biological mechanism of DAT regulation; similarly, studies of DAT genotype-phenotype associations yielded inconsistent results. Hence, our understanding of the control of the DAT protein product is incomplete; having this knowledge is critical, since DAT plays the major role in the brain's dopaminergic circuitry. Accordingly, we reevaluated the genomic attributes of the SLC6A3 gene that might confer sensitivity to regulation, hypothesizing that its unique genomic characteristics might facilitate highly dynamic, region-specific DAT expression, so enabling multiple regulatory modes. Our comprehensive bioinformatic analyzes revealed very distinctive genomic characteristics of the SLC6A3, including high inter-individual variability of its sequence (897 SNPs, about 90 repeats and several CNVs spell out all abbreviations in abstract) and pronounced sensitivity to regulation by epigenetic mechanisms, as evident from the GC-bias composition (0.55) of the SLC6A3, and numerous intragenic CpG islands (27 CGIs). We propose that this unique combination of the genomic features and the regulatory attributes enables the differential expression of the DAT1 gene and fulfills seemingly contradictory demands to its regulation; that is, robustness of region-specific expression and functional dynamics.
Collapse
Affiliation(s)
- Elena Shumay
- Brookhaven National Laboratory, Medical Department, Upton, New York, United States of America
- * E-mail: (ES); (JSF); (NDV)
| | - Joanna S. Fowler
- Brookhaven National Laboratory, Medical Department, Upton, New York, United States of America
- * E-mail: (ES); (JSF); (NDV)
| | - Nora D. Volkow
- National Institute on Drug Abuse, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (ES); (JSF); (NDV)
| |
Collapse
|
53
|
Fadista J, Thomsen B, Holm LE, Bendixen C. Copy number variation in the bovine genome. BMC Genomics 2010; 11:284. [PMID: 20459598 PMCID: PMC2902221 DOI: 10.1186/1471-2164-11-284] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2009] [Accepted: 05/06/2010] [Indexed: 12/12/2022] Open
Abstract
Background Copy number variations (CNVs), which represent a significant source of genetic diversity in mammals, have been shown to be associated with phenotypes of clinical relevance and to be causative of disease. Notwithstanding, little is known about the extent to which CNV contributes to genetic variation in cattle. Results We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologous of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences. Conclusions Together, this analysis provides a useful resource for assessment of the impact of CNVs regarding variation in bovine health and production traits.
Collapse
Affiliation(s)
- João Fadista
- Group of Molecular Genetics and Systems Biology, Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, Blichers Allé 20, DK-8830 Tjele, Denmark
| | | | | | | |
Collapse
|
54
|
Hehir-Kwa JY, Wieskamp N, Webber C, Pfundt R, Brunner HG, Gilissen C, de Vries BBA, Ponting CP, Veltman JA. Accurate distinction of pathogenic from benign CNVs in mental retardation. PLoS Comput Biol 2010; 6:e1000752. [PMID: 20421931 PMCID: PMC2858682 DOI: 10.1371/journal.pcbi.1000752] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2009] [Accepted: 03/19/2010] [Indexed: 11/18/2022] Open
Abstract
Copy number variants (CNVs) have recently been recognized as a common form of genomic variation in humans. Hundreds of CNVs can be detected in any individual genome using genomic microarrays or whole genome sequencing technology, but their phenotypic consequences are still poorly understood. Rare CNVs have been reported as a frequent cause of neurological disorders such as mental retardation (MR), schizophrenia and autism, prompting widespread implementation of CNV screening in diagnostics. In previous studies we have shown that, in contrast to benign CNVs, MR-associated CNVs are significantly enriched in genes whose mouse orthologues, when disrupted, result in a nervous system phenotype. In this study we developed and validated a novel computational method for differentiating between benign and MR-associated CNVs using structural and functional genomic features to annotate each CNV. In total 13 genomic features were included in the final version of a Naïve Bayesian Tree classifier, with LINE density and mouse knock-out phenotypes contributing most to the classifier's accuracy. After demonstrating that our method (called GECCO) perfectly classifies CNVs causing known MR-associated syndromes, we show that it achieves high accuracy (94%) and negative predictive value (99%) on a blinded test set of more than 1,200 CNVs from a large cohort of individuals with MR. These results indicate that this classification method will be of value for objectively prioritizing CNVs in clinical research and diagnostics. Rare copy number variants (CNVs) are a frequent cause of neurological disorders such as mental retardation (MR). However CNVs are also commonly identified in healthy individuals. It is therefore crucial for both diagnostic and research applications to be able to distinguish between disease-causing CNVs and “benign” CNVs occurring as normal genomic variation. Separating these two types can take advantage of significant differences in their genomic contents. For example, benign CNVs are enriched in repetitive sequences. By contrast, CNVs associated with MR tend to have high densities of functional elements, including genes whose mouse orthologues, when knocked-out, lead to specific nervous system abnormalities. We have developed a novel objective approach that is effective in distinguishing MR-associated CNVs from benign CNVs based on the presence of 13 genomic attributes. This method is able to achieve high accuracies in a cohort of CNVs known to cause MR and in a cohort of individuals with unexplained MR. The development of this technique promises to substantially improve the methodology for determining the pathogenicity of CNVs.
Collapse
Affiliation(s)
- Jayne Y. Hehir-Kwa
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Nienke Wieskamp
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Caleb Webber
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford, United Kingdom
| | - Rolph Pfundt
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Han G. Brunner
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Christian Gilissen
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Bert B. A. de Vries
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Chris P. Ponting
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford, United Kingdom
| | - Joris A. Veltman
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
- * E-mail:
| |
Collapse
|
55
|
Kuiper RP, Ligtenberg MJL, Hoogerbrugge N, Geurts van Kessel A. Germline copy number variation and cancer risk. Curr Opin Genet Dev 2010; 20:282-9. [PMID: 20381334 DOI: 10.1016/j.gde.2010.03.005] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Revised: 03/05/2010] [Accepted: 03/15/2010] [Indexed: 01/13/2023]
Abstract
The human genome is subject to substantial structural variation, including copy number variation (CNV). Constitutional CNVs may either represent benign polymorphic variants or be associated with disease, including cancer predisposition. Rare nonpolymorphic CNVs, that is DNA lesions that result in gene deletions, inversions, and/or fusions, may be responsible for a high cancer risk. In addition, we previously elucidated a mechanism by which CNV-based transcriptional read-through mediates inactivation of a neighboring gene through in cis hypermethylation of its promoter. This novel mechanism explains the etiology of a recurrent and strongly inherited tissue-restricted epimutation. Recently, we obtained supporting evidence for such a CNV-associated scenario, suggesting that it may be more prevalent than previously thought. We expect that copy number profiling in unexplained high-risk families will lead to the discovery of additional cancer-predisposing genes and/or mechanisms.
Collapse
Affiliation(s)
- Roland P Kuiper
- Department of Human Genetics, Radboud University Nijmegen Medical Centre, Nijmegen Centre for Molecular Life Sciences, Nijmegen, The Netherlands.
| | | | | | | |
Collapse
|
56
|
Birchler JA, Veitia RA. The gene balance hypothesis: implications for gene regulation, quantitative traits and evolution. THE NEW PHYTOLOGIST 2010; 186:54-62. [PMID: 19925558 PMCID: PMC2858765 DOI: 10.1111/j.1469-8137.2009.03087.x] [Citation(s) in RCA: 200] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The gene balance hypothesis states that the stoichiometry of members of multisubunit complexes affects the function of the whole because of the kinetics and mode of assembly. Gene regulatory mechanisms also would be governed by these principles. Here, we review the impact of this concept with regard to the effects on the genetics of quantitative traits, the fate of duplication of genes following polyploidization events or segmental duplication, the basis of aneuploid syndromes, the constraints on cis and trans variation in gene regulation and the potential involvement in hybrid incompatibilities.
Collapse
Affiliation(s)
- James A Birchler
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA.
| | | |
Collapse
|
57
|
Kong L, Lovell PV, Heger A, Mello CV, Ponting CP. Accelerated evolution of PAK3- and PIM1-like kinase gene families in the zebra finch, Taeniopygia guttata. Mol Biol Evol 2010; 27:1923-34. [PMID: 20237222 DOI: 10.1093/molbev/msq080] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Genes encoding protein kinases tend to evolve slowly over evolutionary time, and only rarely do they appear as recent duplications in sequenced vertebrate genomes. Consequently, it was a surprise to find two families of kinase genes that have greatly and recently expanded in the zebra finch (Taeniopygia guttata) lineage. In contrast to other amniotic genomes (including chicken) that harbor only single copies of p21-activated serine/threonine kinase 3 (PAK3) and proviral integration site 1 (PIM1) genes, the zebra finch genome appeared at first to additionally contain 67 PAK3-like (PAK3L) and 51 PIM1-like (PIM1L) protein kinase genes. An exhaustive analysis of these gene models, however, revealed most to be incomplete, owing to the absence of terminal exons. After reprediction, 31 PAK3L genes and 10 PIM1L genes remain, and all but three are predicted, from the retention of functional sites and open reading frames, to be enzymatically active. PAK3L, but not PIM1L, gene sequences show evidence of recurrent episodes of positive selection, concentrated within structures spatially adjacent to N- and C-terminal protein regions that have been discarded from zebra finch PAK3L genes. At least seven zebra finch PAK3L genes were observed to be expressed in testis, whereas two sequences were found transcribed in the brain, one broadly including the song nuclei and the other in the ventricular zone and in cells resembling Bergmann's glia in the cerebellar Purkinje cell layer. Two PIM1L sequences were also observed to be expressed with broad distributions in the zebra finch brain, one in both the ventricular zone and the cerebellum and apparently associated with glial cells and the other showing neuronal cell expression and marked enrichment in midbrain/thalamic nuclei. These expression patterns do not correlate with zebra finch-specific features such as vocal learning. Nevertheless, our results show how ancient and conserved intracellular signaling molecules can be co-opted, following duplication, thereby resulting in lineage-specific functions, presumably affecting the zebra finch testis and brain.
Collapse
Affiliation(s)
- Lesheng Kong
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | | | | | | | | |
Collapse
|
58
|
Abstract
Although mutation provides the fuel for phenotypic evolution, it also imposes a substantial burden on fitness through the production of predominantly deleterious alleles, a matter of concern from a human-health perspective. Here, recently established databases on de novo mutations for monogenic disorders are used to estimate the rate and molecular spectrum of spontaneously arising mutations and to derive a number of inferences with respect to eukaryotic genome evolution. Although the human per-generation mutation rate is exceptionally high, on a per-cell division basis, the human germline mutation rate is lower than that recorded for any other species. Comparison with data from other species demonstrates a universal mutational bias toward A/T composition, and leads to the hypothesis that genome-wide nucleotide composition generally evolves to the point at which the power of selection in favor of G/C is approximately balanced by the power of random genetic drift, such that variation in equilibrium genome-wide nucleotide composition is largely defined by variation in mutation biases. Quantification of the hazards associated with introns reveals that mutations at key splice-site residues are a major source of human mortality. Finally, a consideration of the long-term consequences of current human behavior for deleterious-mutation accumulation leads to the conclusion that a substantial reduction in human fitness can be expected over the next few centuries in industrialized societies unless novel means of genetic intervention are developed.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
59
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009. [PMID: 19597530 DOI: 10.1038/nrg2593.mechanisms] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
60
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009; 10:551-64. [PMID: 19597530 DOI: 10.1038/nrg2593] [Citation(s) in RCA: 846] [Impact Index Per Article: 56.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
61
|
Webber C, Hehir-Kwa JY, Nguyen DQ, de Vries BBA, Veltman JA, Ponting CP. Forging links between human mental retardation-associated CNVs and mouse gene knockout models. PLoS Genet 2009; 5:e1000531. [PMID: 19557186 PMCID: PMC2694283 DOI: 10.1371/journal.pgen.1000531] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Accepted: 05/22/2009] [Indexed: 12/20/2022] Open
Abstract
Rare copy number variants (CNVs) are frequently associated with common neurological disorders such as mental retardation (MR; learning disability), autism, and schizophrenia. CNV screening in clinical practice is limited because pathological CNVs cannot be distinguished routinely from benign CNVs, and because genes underlying patients' phenotypes remain largely unknown. Here, we present a novel, statistically robust approach that forges links between 148 MR–associated CNVs and phenotypes from ∼5,000 mouse gene knockout experiments. These CNVs were found to be significantly enriched in two classes of genes, those whose mouse orthologues, when disrupted, result in either abnormal axon or dopaminergic neuron morphologies. Additional enrichments highlighted correspondences between relevant mouse phenotypes and secondary presentations such as brain abnormality, cleft palate, and seizures. The strength of these phenotype enrichments (>100% increases) greatly exceeded molecular annotations (<30% increases) and allowed the identification of 78 genes that may contribute to MR and associated phenotypes. This study is the first to demonstrate how the power of mouse knockout data can be systematically exploited to better understand genetically heterogeneous neurological disorders. Mental retardation (MR; also known as learning disability) affects 1%–3% of people and is often associated with the presence of genomic copy number variations (CNVs) such as deletions and duplications. Most of these CNVs are rare and they often involve tens, sometimes hundreds, of genes. Pinpointing exactly which particular gene or genes are responsible for MR in an individual patient is therefore challenging and limits diagnostic applications. In this study, the functions of genes present within a large collection of MR–associated CNVs were investigated by comparing them to data from large-scale mouse knock-out experiments. We found that MR–associated CNVs contain greater than expected numbers of genes that give specific nervous system phenotypes when disrupted in the mouse. Not only does this study confirm that CNVs frequently cause MR, but it narrows down the list of genes whose changes lead to this disorder from thousands to several dozen. This reduced list of genes brings wide-spread genetic testing for MR one step closer. It also provides a better understanding of the biology behind MR that could, eventually, yield medical treatments.
Collapse
Affiliation(s)
- Caleb Webber
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Jayne Y. Hehir-Kwa
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Duc-Quang Nguyen
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Bert B. A. de Vries
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Joris A. Veltman
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
- * E-mail: (JV); (CPP)
| | - Chris P. Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- * E-mail: (JV); (CPP)
| |
Collapse
|
62
|
Lineage-specific biology revealed by a finished genome assembly of the mouse. PLoS Biol 2009; 7:e1000112. [PMID: 19468303 PMCID: PMC2680341 DOI: 10.1371/journal.pbio.1000112] [Citation(s) in RCA: 347] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2008] [Accepted: 04/03/2009] [Indexed: 02/06/2023] Open
Abstract
A finished clone-based assembly of the mouse genome reveals extensive recent sequence duplication during recent evolution and rodent-specific expansion of certain gene families. Newly assembled duplications contain protein-coding genes that are mostly involved in reproductive function. The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. The availability of an accurate genome sequence provides the bedrock upon which modern biomedical research is based. Here we describe a high-quality assembly, Build 36, of the mouse genome. This assembly was put together by aligning overlapping individual clones representing parts of the genome, and it provides a more complete picture than previous assemblies, because it adds much rodent-specific sequence that was previously unavailable. The addition of these sequences provides insight into both the genomic architecture and the gene complement of the mouse. In particular, it highlights recent gene duplications and the expansion of certain gene families during rodent evolution. An improved understanding of the mouse genome and thus mouse biology will enhance the utility of the mouse as a model for human disease.
Collapse
|
63
|
Schmidt J, Kirsch S, Rappold GA, Schempp W. Complex evolution of a Y-chromosomal double homeobox 4 (DUX4)-related gene family in hominoids. PLoS One 2009; 4:e5288. [PMID: 19404400 PMCID: PMC2671837 DOI: 10.1371/journal.pone.0005288] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Accepted: 03/24/2009] [Indexed: 12/21/2022] Open
Abstract
The human Y chromosome carries four human Y-chromosomal euchromatin/heterochromatin transition regions, all of which are characterized by the presence of interchromosomal segmental duplications. The Yq11.1/Yq11.21 transition region harbours a peculiar segment composed of an imperfectly organized tandem-repeat structure encoding four members of the double homeobox (DUX) gene family. By comparative fluorescence in situ hybridization (FISH) analysis we have documented the primary appearance of Y-chromosomal DUX genes (DUXY) on the gibbon Y chromosome. The major amplification and dispersal of DUXY paralogs occurred after the gibbon and hominid lineages had diverged. Orthologous DUXY loci of human and chimpanzee show a highly similar structural organization. Sequence alignment survey, phylogenetic reconstruction and recombination detection analyses of human and chimpanzee DUXY genes revealed the existence of all copies in a common ancestor. Comparative analysis of the circumjacent beta-satellites indicated that DUXY genes and beta-satellites evolved in concert. However, evolutionary forces acting on DUXY genes may have induced amino acid sequence differences in the orthologous chimpanzee and human DUXY open reading frames (ORFs). The acquisition of complete ORFs in human copies might relate to evolutionary advantageous functions indicating neo-functionalization. We propose an evolutionary scenario in which an ancestral tandem array DUX gene cassette transposed to the hominoid Y chromosome followed by lineage-specific chromosomal rearrangements paved the way for a species-specific evolution of the Y-chromosomal members of a large highly diverged homeobox gene family.
Collapse
Affiliation(s)
- Julia Schmidt
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
| | - Gudrun A. Rappold
- Institute of Human Genetics, University of Heidelberg, Heidelberg, Germany
| | - Werner Schempp
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
- * E-mail:
| |
Collapse
|
64
|
Crespi B, Summers K, Dorus S. Genomic sister-disorders of neurodevelopment: an evolutionary approach. Evol Appl 2009; 2:81-100. [PMID: 25567849 PMCID: PMC3352408 DOI: 10.1111/j.1752-4571.2008.00056.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2008] [Accepted: 11/26/2008] [Indexed: 02/06/2023] Open
Abstract
Genomic sister-disorders are defined here as diseases mediated by duplications versus deletions of the same region. Such disorders can provide unique information concerning the genomic underpinnings of human neurodevelopment because effects of diametric variation in gene copy number on cognitive and behavioral phenotypes can be inferred. We describe evidence from the literature on deletions versus duplications for the regions underlying the best-known human neurogenetic sister-disorders, including Williams syndrome, Velocardiofacial syndrome, and Smith-Magenis syndrome, as well as the X-chromosomal conditions Klinefelter and Turner syndromes. These data suggest that diametric copy-number alterations can, like diametric alterations to imprinted genes, generate contrasting phenotypes associated with autistic-spectrum and psychotic-spectrum conditions. Genomically based perturbations to the development of the human social brain are thus apparently mediated to a notable degree by effects of variation in gene copy number. We also conducted the first analyses of positive selection for genes in the regions affected by these disorders. We found evidence consistent with adaptive evolution of protein-coding genes, or selective sweeps, for three of the four sets of sister-syndromes analyzed. These studies of selection facilitate identification of candidate genes for the phenotypes observed and lend a novel evolutionary dimension to the analysis of human cognitive architecture and neurogenetic disorders.
Collapse
Affiliation(s)
- Bernard Crespi
- Department of Biosciences, Simon Fraser University Burnaby, BC, Canada
| | - Kyle Summers
- Department of Biology, East Carolina University Greenville, NC, USA
| | - Steve Dorus
- Department of Biology and Biochemistry, University of Bath Bath, UK
| |
Collapse
|
65
|
Abstract
Copy number variation (CNV) is a source of genetic diversity in humans. Numerous CNVs are being identified with various genome analysis platforms, including array comparative genomic hybridization (aCGH), single nucleotide polymorphism (SNP) genotyping platforms, and next-generation sequencing. CNV formation occurs by both recombination-based and replication-based mechanisms and de novo locus-specific mutation rates appear much higher for CNVs than for SNPs. By various molecular mechanisms, including gene dosage, gene disruption, gene fusion, position effects, etc., CNVs can cause Mendelian or sporadic traits, or be associated with complex diseases. However, CNV can also represent benign polymorphic variants. CNVs, especially gene duplication and exon shuffling, can be a predominant mechanism driving gene and genome evolution.
Collapse
Affiliation(s)
- Feng Zhang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | | | | | | |
Collapse
|
66
|
Casci T. CNV evolution revisited. Nat Rev Genet 2008. [DOI: 10.1038/nrg2477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
67
|
Perry GH, Yang F, Marques-Bonet T, Murphy C, Fitzgerald T, Lee AS, Hyland C, Stone AC, Hurles ME, Tyler-Smith C, Eichler EE, Carter NP, Lee C, Redon R. Copy number variation and evolution in humans and chimpanzees. Genome Res 2008; 18:1698-710. [PMID: 18775914 DOI: 10.1101/gr.082016.108] [Citation(s) in RCA: 180] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Copy number variants (CNVs) underlie many aspects of human phenotypic diversity and provide the raw material for gene duplication and gene family expansion. However, our understanding of their evolutionary significance remains limited. We performed comparative genomic hybridization on a single human microarray platform to identify CNVs among the genomes of 30 humans and 30 chimpanzees as well as fixed copy number differences between species. We found that human and chimpanzee CNVs occur in orthologous genomic regions far more often than expected by chance and are strongly associated with the presence of highly homologous intrachromosomal segmental duplications. By adapting population genetic analyses for use with copy number data, we identified functional categories of genes that have likely evolved under purifying or positive selection for copy number changes. In particular, duplications and deletions of genes with inflammatory response and cell proliferation functions may have been fixed by positive selection and involved in the adaptive phenotypic differentiation of humans and chimpanzees.
Collapse
Affiliation(s)
- George H Perry
- School of Human Evolution & Social Change, Arizona State University, Tempe, Arizona 85287, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|