1
|
Cavalieri V, Geraci F, Spinelli G. Diversification of spatiotemporal expression and copy number variation of the echinoid hbox12/pmar1/micro1 multigene family. PLoS One 2017; 12:e0174404. [PMID: 28350855 PMCID: PMC5370098 DOI: 10.1371/journal.pone.0174404] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Accepted: 03/08/2017] [Indexed: 11/26/2022] Open
Abstract
Changes occurring during evolution in the cis-regulatory landscapes of individual members of multigene families might impart diversification in their spatiotemporal expression and function. The archetypal member of the echinoid hbox12/pmar1/micro1 family is hbox12-a, a homeobox-containing gene expressed exclusively by dorsal blastomeres, where it governs the dorsal/ventral gene regulatory network during embryogenesis of the sea urchin Paracentrotus lividus. Here we describe the inventory of the hbox12/pmar1/micro1 genes in P. lividus, highlighting that gene copy number variation occurs across individual sea urchins of the same species. We show that the various hbox12/pmar1/micro1 genes group into three subfamilies according to their spatiotemporal expression, which ranges from broad transcription throughout development to transient expression in either the animal hemisphere or micromeres of the early embryo. Interestingly, the promoter regions of those genes showing comparable expression patterns are highly similar, while differing from those of the other subfamilies. Strikingly, phylogenetic analysis suggests that the hbox12/pmar1/micro1 genes are species-specific, exhibiting extensive divergence in their noncoding, but not in their coding, sequences across three distinct sea urchin species. In spite of this, two micromere-specific genes of P. lividus possess a TCF/LEF-binding motif in a similar position, and their transcription relies on Wnt/β-catenin signaling, similar to the pmar1 and micro1 genes, which in other sea urchin species are involved in micromere specification. Altogether, our findings suggest that the hbox12/pmar1/micro1 gene family evolved rather rapidly, generating paralogs whose cis-regulatory sequences diverged following multiple rounds of duplication from a common ancestor.
Collapse
Affiliation(s)
- Vincenzo Cavalieri
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
- Advanced Technologies Network Center (ATeN), University of Palermo, Viale delle Scienze Edificio 18, Palermo, Italy
- * E-mail: (VC); (GS)
| | - Fabiana Geraci
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
| | - Giovanni Spinelli
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Viale delle Scienze Edificio 16, Palermo, Italy
- * E-mail: (VC); (GS)
| |
Collapse
|
2
|
Extensive copy-number variation of young genes across stickleback populations. PLoS Genet 2014; 10:e1004830. [PMID: 25474574 PMCID: PMC4256280 DOI: 10.1371/journal.pgen.1004830] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 10/16/2014] [Indexed: 12/30/2022] Open
Abstract
Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their potential adaptive benefits, we combine comparative genomics with population genomics analyses to evaluate the distribution and impact of CNVs across natural populations of an eco-genomic model, the three-spined stickleback. With whole genome sequences of 66 individuals from populations inhabiting three distinct habitats, we find that CNVs generally occur at low frequencies and are often only found in one of the 11 populations surveyed. A subset of CNVs, however, displays copy-number differentiation between populations, showing elevated within-population frequencies consistent with local adaptation. By comparing teleost genomes to identify lineage-specific genes and duplications in sticklebacks, we highlight rampant gene content differences among individuals in which over 30% of young duplicate genes are CNVs. These CNV genes are evolving rapidly at the molecular level and are enriched with functional categories associated with environmental interactions, depicting the dynamic early copy-number polymorphic stage of genes during population differentiation. After a locus is duplicated in a genome, individuals from a population instantaneously differ in the number of copies of this locus producing a copy-number variation (CNV). Over time, the joint effects of selection and other evolutionary forces will act to either eliminate the extra genetic copy or retain it. Depending on this evolutionary interplay, young duplications, including newly duplicated genes, can persist for millions of years as CNVs. CNVs may especially be prevalent between populations that have colonized and adapted to disparate environments in which selective pressures differ. Using whole genome sequences from several populations of three-spined sticklebacks that inhabit different environments, we find that a third of young duplicated genes are CNVs. These young CNV genes are enriched with environmental response functions and evolving rapidly at the molecular level, making them promising candidates for a role in the rapid ecological adaptation to novel environments.
Collapse
|
3
|
Abstract
Variation in gene copy number is increasingly recognized as a common, heritable source of inter-individual differences in genomic sequence. The role of copy number variation is well established in the pathogenesis of rare genomic disorders. More recently, germline and somatic copy number variation have been shown to be important pathogenic factors in a range of common diseases, including infectious, autoimmune and neuropsychiatric diseases and cancer. In this review, we describe the range of methods available for measuring copy number variants (CNVs) in individuals and populations, including the limitations of presently available assays, and highlight some key examples of common diseases in which CNVs have been shown clearly to have a pathogenic role. Although there has been major progress in this field in the last 5 years, understanding the full contribution of CNVs to the genetic basis of common diseases will require further studies, with more accurate CNV assays and larger cohorts than have presently been completed.
Collapse
Affiliation(s)
- M Fanciulli
- Imperial College London, Hammersmith Hospital, London, W12 0NN, UK
| | | | | |
Collapse
|
4
|
Abstract
DNA copy number variation (CNV) represents a considerable source of human genetic diversity. Recently,1 a global map of copy number variation in the human genome has been drawn up which reveals not only the ubiquity but also the complexity of this type of variation. Thus, two human genomes may differ by more than 20 Mb and it is likely that the full extent of CNV still remains to be discovered. Nearly 3000 genes are associated with CNV. This high degree of variability with regard to gene copy number between two individuals challenges definitions of normality. Many CNVs are located in regions of complex genomic structure and this currently limits the extent to which these variants can be genotyped by using tagging SNPs. However, some CNVs are already amenable to genome-wide association studies so that their influence on human phenotypic diversity and disease susceptibility may soon be determined.
Collapse
|
5
|
Freeman JL, Perry GH, Feuk L, Redon R, McCarroll SA, Altshuler DM, Aburatani H, Jones KW, Tyler-Smith C, Hurles ME, Carter NP, Scherer SW, Lee C. Copy number variation: new insights in genome diversity. Genome Res 2006; 16:949-61. [PMID: 16809666 DOI: 10.1101/gr.3677206] [Citation(s) in RCA: 545] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
DNA copy number variation has long been associated with specific chromosomal rearrangements and genomic disorders, but its ubiquity in mammalian genomes was not fully realized until recently. Although our understanding of the extent of this variation is still developing, it seems likely that, at least in humans, copy number variants (CNVs) account for a substantial amount of genetic variation. Since many CNVs include genes that result in differential levels of gene expression, CNVs may account for a significant proportion of normal phenotypic variation. Current efforts are directed toward a more comprehensive cataloging and characterization of CNVs that will provide the basis for determining how genomic diversity impacts biological function, evolution, and common human diseases.
Collapse
Affiliation(s)
- Jennifer L Freeman
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
The first wave of information from the analysis of the human genome revealed SNPs to be the main source of genetic and phenotypic human variation. However, the advent of genome-scanning technologies has now uncovered an unexpectedly large extent of what we term 'structural variation' in the human genome. This comprises microscopic and, more commonly, submicroscopic variants, which include deletions, duplications and large-scale copy-number variants - collectively termed copy-number variants or copy-number polymorphisms - as well as insertions, inversions and translocations. Rapidly accumulating evidence indicates that structural variants can comprise millions of nucleotides of heterogeneity within every genome, and are likely to make an important contribution to human diversity and disease susceptibility.
Collapse
Affiliation(s)
- Lars Feuk
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, Department of Molecular and Medical Genetics, University of Toronto, Ontario, Canada
| | | | | |
Collapse
|
7
|
Blanchong CA, Chung EK, Rupert KL, Yang Y, Yang Z, Zhou B, Moulds JM, Yu CY. Genetic, structural and functional diversities of human complement components C4A and C4B and their mouse homologues, Slp and C4. Int Immunopharmacol 2001; 1:365-92. [PMID: 11367523 DOI: 10.1016/s1567-5769(01)00019-4] [Citation(s) in RCA: 116] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The complement protein C4 is a non-enzymatic component of the C3 and C5 convertases and thus essential for the propagation of the classical complement pathway. The covalent binding of C4 to immunoglobulins and immune complexes (IC) also enhances the solubilization of immune aggregates, and the clearance of IC through complement receptor one (CR1) on erythrocytes. Human C4 is the most polymorphic protein of the complement system. In this review, we summarize the current concepts on the 1-2-3 loci model of C4A and C4B genes in the population, factors affecting the expression levels of C4 transcripts and proteins, and the structural, functional and serological diversities of the C4A and C4B proteins. The diversities and polymorphisms of the mouse homologues Slp and C4 proteins are described and contrasted with their human homologues. The human C4 genes are located in the MHC class III region on chromosome 6. Each human C4 gene consists of 41 exons coding for a 5.4-kb transcript. The long gene is 20.6 kb and the short gene is 14.2 kb. In the Caucasian population 55% of the MHC haplotypes have the 2-locus, C4A-C4B configurations and 45% have an unequal number of C4A and C4B genes. Moreover, three-quarters of C4 genes harbor the 6.4 kb endogenous retrovirus HERV-K(C4) in the intron 9 of the long genes. Duplication of a C4 gene always concurs with its adjacent genes RP, CYP21 and TNX, which together form a genetic unit termed an RCCX module. Monomodular, bimodular and trimodular RCCX structures with 1, 2 and 3 complement C4 genes have frequencies of 17%, 69% and 14%, respectively. Partial deficiencies of C4A and C4B, primarily due to the presence of monomodular haplotypes and homo-expression of C4A proteins from bimodular structures, have a combined frequency of 31.6%. Multiple structural isoforms of each C4A and C4B allotype exist in the circulation because of the imperfect and incomplete proteolytic processing of the precursor protein to form the beta-alpha-gamma structures. Immunofixation experiments of C4A and C4B demonstrate > 41 allotypes in the two classes of proteins. A compilation of polymorphic sites from limited C4 sequences revealed the presence of 24 polymophic residues, mostly clustered C-terminal to the thioester bond within the C4d region of the alpha-chain. The covalent binding affinities of the thioester carbonyl group of C4A and C4B appear to be modulated by four isotypic residues at positions 1101, 1102, 1105 and 1106. Site directed mutagenesis experiments revealed that D1106 is responsible for the effective binding of C4A to form amide bonds with immune aggregates or protein antigens, and H1106 of C4B catalyzes the transacylation of the thioester carbonyl group to form ester bonds with carbohydrate antigens. The expression of C4 is inducible or enhanced by gamma-interferon. The liver is the main organ that synthesizes and secretes C4A and C4B to the circulation but there are many extra-hepatic sites producing moderate quantities of C4 for local defense. The plasma protein levels of C4A and C4B are mainly determined by the corresponding gene dosage. However, C4B proteins encoded by monomodular short genes may have relatively higher concentrations than those from long C4A genes. The 5' regulatory sequence of a C4 gene contains a Spl site, three E-boxes but no TATA box. The sequences beyond--1524 nt may be completely different as the C4 genes at RCCX module I have RPI-specific sequences, while those at Modules II, III and IV have TNXA-specific sequences. The remarkable genetic diversity of human C4A and C4B probably promotes the exchange of genetic information to create and maintain the quantitative and qualitative variations of C4A and C4B proteins in the population, as driven by the selection pressure against a great variety of microbes. An undesirable accompanying byproduct of this phenomenon is the inherent deleterious recombinations among the RCCX constituents leading to autoimmune and genetic disorders.
Collapse
Affiliation(s)
- C A Blanchong
- Children's Research Institute, 700 Children's Drive, Columbus, OH 43205-2696, USA
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Fan Q, Uring-Lambert B, Weill B, Gautreau C, Menkes CJ, Delpech M. Complement component C4 deficiencies and gene alterations in patients with systemic lupus erythematosus. EUROPEAN JOURNAL OF IMMUNOGENETICS : OFFICIAL JOURNAL OF THE BRITISH SOCIETY FOR HISTOCOMPATIBILITY AND IMMUNOGENETICS 1993; 20:11-21. [PMID: 8095158 DOI: 10.1111/j.1744-313x.1993.tb00091.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Deficiency of complement component C4 is considered playing a role in the genetic predisposition for systemic lupus erythematosus (SLE). The purpose of this study was to characterize the genomic alterations of the C4 and CYP21 genes in 40 caucasoid patients with SLE by C4 allotyping and by RFLP analysis. Nineteen patients (47.5%) carried C4A null alleles and eight patients (20.0%) C4B null alleles. SLE patients had more frequent C4A null alleles (47.5%) than healthy individuals (20%) (chi 2 = 10.75; P < 0.005). The commonest molecular alteration in the patients with C4A null alleles was a large gene deletion affecting both C4A and CYP21A genes. However, among the patients with C4A null alleles, 16.7% persons had no detectable C4A deletion. The non-expression of C4A gene might be due to defects at various levels of gene expression (i.e. transcription and translation). Among the patients with C4B null alleles, 62.5% persons had no detectable gene lesion, whereas 37.5% showed a C4B deletion including both C4B/CYP21A or C4B/CYP21B genes. Duplication of the C4B gene was not rare in SLE patients, as we found 15.0% of the patients with a heterozygous C4B/CY21A gene duplication. The patients typed as having C4B gene homoduplication (B1,1) demonstrated two long C4B loci, whereas heteroduplication (B1,2) displayed two short loci, therefore the type of C4B gene duplication may be related to the gene length. In conclusion, C4 deficiencies observed in 26 of the 40 SLE patients studied were very heterogeneous. In every case, the gene alteration affected both C4 and CYP21 genes.
Collapse
Affiliation(s)
- Q Fan
- CNRS, ER 17-Institut Cochin de Génétique Moléculaire, Paris, France
| | | | | | | | | | | |
Collapse
|
9
|
Christenson MJ, LaRosa T, Jung M, Bias WB, McLean RH. Hypomorphic C4B* 15 variant of the fourth component of complement. FEBS Lett 1990; 260:183-6. [PMID: 1688812 DOI: 10.1016/0014-5793(90)80099-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
We report a rare 'hypomorphic' C4 allotype detected during routine screening in controls for the Rogers:1 epitope. C4B* 15 was distinguished by having only faint staining when using polyclonal anti-C4 antibody on agarose immunoelectrophoresis (e.g. hypomorphic), having relatively weak hemolytic activity but being strongly reactive with monoclonal antibody to Rodgers 1. TaqI restriction fragment length polymorphism (RFLP) demonstrated that C4B* 15 segregated with 7 kb and 5.4 kb C4 gene fragments and with the haplotype HLA-A2,C-, B50,BW6,DR7,DQ2,DR52,SO7C2(1,15). The 5.4-kb fragment was more intense than the 7.0-kb fragment, suggesting duplication of the 5.4-kb fragment. This hypomorphic C4 allotype (genotype frequency = 0.0088) has diminished expression of C4 epitopes commonly recognized by polyclonal anti-C4 and may be missed by standard phenotyping methods.
Collapse
Affiliation(s)
- M J Christenson
- Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD 21205
| | | | | | | | | |
Collapse
|