1
|
Agarwala S, Ramachandra NB. Role of CNTNAP2 in autism manifestation outlines the regulation of signaling between neurons at the synapse. EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS 2021. [DOI: 10.1186/s43042-021-00138-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Abstract
Background
Autism is characterized by high heritability and a complex genetic mutational landscape with restricted social behavior and impaired social communication. Whole-exome sequencing is a reliable tool to pinpoint variants for unraveling the disease pathophysiology. The present meta-analysis was performed using 222 whole-exome sequences deposited by Simons Simplex Collection (SSC) at the European Nucleotide Archive. This sample cohort was used to identify causal mutations in autism-specific genes to create a mutational landscape focusing on the CNTNAP2 gene.
Results
The authors account for the identification of 15 high confidence genes with 24 variants for autism with Simons Foundation Autism Research Initiative (SFARI) gene scoring. These genes encompass critical autism pathways such as neuron development, synapse complexity, cytoskeleton, and microtubule activation. Among these 15 genes, overlapping variants were present across multiple samples: KMT2C in 167 cases, CNTNAP2 in 192 samples, CACNA1C in 152 cases, and SHANK3 in 124 cases. Pathway analysis identifies clustering and interplay of autism genes—WDFY3, SHANK2, CNTNAP2, HOMER1, SYNGAP1, and ANK2 with CNTNAP2. These genes coincide across autism-relevant pathways, namely abnormal social behavior and intellectual and cognitive impairment. Based on multiple layers of selection criteria, CNTNAP2 was chosen as the master gene for the study. It is an essential gene for autism with speech-language delays, a typical phenotype in most cases under study. It showcases nine variants across multiple samples with one damaging variant, T589P, with a GERP rank score range of 0.065–0.95. This unique variant was present across 86.5% of the samples impairing the epithelial growth factor (EGF) domain. Established microRNA (miRNA) genes hsa-mir-548aq and hsa-mir-548f were mutated within the CNTNAP2 region, adding to the severity. The mutated protein showed reduced stability by 0.25, increased solvent accessibility by 9%, and reduced depth by 0.2, which rendered the protein non-functional. Secondary physical interactors of CNTNAP2 through CNTN2 proteins were mutated in the samples, further intensifying the severity.
Conclusion
CNTNAP2 has been identified as a master gene in autism manifestation responsible for speech-language delay by impairing the EGF protein domain and downstream cascade. The decrease in EGF is correlated with vital autism symptoms, especially language disabilities.
Collapse
|
2
|
Integrated structural and evolutionary analysis reveals common mechanisms underlying adaptive evolution in mammals. Proc Natl Acad Sci U S A 2020; 117:5977-5986. [PMID: 32123117 DOI: 10.1073/pnas.1916786117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Understanding the molecular basis of adaptation to the environment is a central question in evolutionary biology, yet linking detected signatures of positive selection to molecular mechanisms remains challenging. Here we demonstrate that combining sequence-based phylogenetic methods with structural information assists in making such mechanistic interpretations on a genomic scale. Our integrative analysis shows that positively selected sites tend to colocalize on protein structures and that positively selected clusters are found in functionally important regions of proteins, indicating that positive selection can contravene the well-known principle of evolutionary conservation of functionally important regions. This unexpected finding, along with our discovery that positive selection acts on structural clusters, opens previously unexplored strategies for the development of better models of protein evolution. Remarkably, proteins where we detect the strongest evidence of clustering belong to just two functional groups: Components of immune response and metabolic enzymes. This gives a coherent picture of pathogens and xenobiotics as important drivers of adaptive evolution of mammals.
Collapse
|
3
|
Evolutionary Forces and Codon Bias in Different Flavors of Intrinsic Disorder in the Human Proteome. J Mol Evol 2019; 88:164-178. [DOI: 10.1007/s00239-019-09921-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 11/26/2019] [Indexed: 12/22/2022]
|
4
|
Afanasyeva A, Bockwoldt M, Cooney CR, Heiland I, Gossmann TI. Human long intrinsically disordered protein regions are frequent targets of positive selection. Genome Res 2018; 28:975-982. [PMID: 29858274 PMCID: PMC6028134 DOI: 10.1101/gr.232645.117] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 06/01/2018] [Indexed: 12/20/2022]
Abstract
Intrinsically disordered regions occur frequently in proteins and are characterized by a lack of a well-defined three-dimensional structure. Although these regions do not show a higher order of structural organization, they are known to be functionally important. Disordered regions are rapidly evolving, largely attributed to relaxed purifying selection and an increased role of genetic drift. It has also been suggested that positive selection might contribute to their rapid diversification. However, for our own species, it is currently unknown whether positive selection has played a role during the evolution of these protein regions. Here, we address this question by investigating the evolutionary pattern of more than 6600 human proteins with intrinsically disordered regions and their ordered counterparts. Our comparative approach with data from more than 90 mammalian genomes uses a priori knowledge of disordered protein regions, and we show that this increases the power to detect positive selection by an order of magnitude. We can confirm that human intrinsically disordered regions evolve more rapidly, not only within humans but also across the entire mammalian phylogeny. They have, however, experienced substantial evolutionary constraint, hinting at their fundamental functional importance. We find compelling evidence that disordered protein regions are frequent targets of positive selection and estimate that the relative rate of adaptive substitutions differs fourfold between disordered and ordered protein regions in humans. Our results suggest that disordered protein regions are important targets of genetic innovation and that the contribution of positive selection in these regions is more pronounced than in other protein parts.
Collapse
Affiliation(s)
- Arina Afanasyeva
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom.,Institute of Nanobiotechnologies, Peter the Great St. Petersburg Polytechnic University, Saint-Petersburg 195251, Russia.,Petersburg Nuclear Physics Institute, B.P. Konstantinov NRC Kurchatov Institute, Gatchina, Leningrad District 188300, Russia.,National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki City, Osaka 567-0085, Japan
| | - Mathias Bockwoldt
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, 9037 Tromsø, Norway
| | - Christopher R Cooney
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom
| | - Ines Heiland
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, 9037 Tromsø, Norway
| | - Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom
| |
Collapse
|
5
|
Qi F, Frishman D. Melting temperature highlights functionally important RNA structure and sequence elements in yeast mRNA coding regions. Nucleic Acids Res 2017; 45:6109-6118. [PMID: 28335026 PMCID: PMC5449622 DOI: 10.1093/nar/gkx161] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Accepted: 02/24/2017] [Indexed: 11/13/2022] Open
Abstract
Secondary structure elements in the coding regions of mRNAs play an important role in gene expression and regulation, but distinguishing functional from non-functional structures remains challenging. Here we investigate the dependence of sequence–structure relationships in the coding regions on temperature based on the recent PARTE data by Wan et al. Our main finding is that the regions with high and low thermostability (high Tm and low Tm regions) are under evolutionary pressure to preserve RNA secondary structure and primary sequence, respectively. Sequences of low Tm regions display a higher degree of evolutionary conservation compared to high Tm regions. Low Tm regions are under strong synonymous constraint, while high Tm regions are not. These findings imply that high Tm regions contain thermo-stable functionally important RNA structures, which impose relaxed evolutionary constraint on sequence as long as the base-pairing patterns remain intact. By contrast, low thermostability regions contain single-stranded functionally important conserved RNA sequence elements accessible for binding by other molecules. We also find that theoretically predicted structures of paralogous mRNA pairs become more similar with growing temperature, while experimentally measured structures tend to diverge, which implies that the melting pathways of RNA structures cannot be fully captured by current computational approaches.
Collapse
Affiliation(s)
- Fei Qi
- Department of Bioinformatics, Technische Universität München, Wissenschaftzentrum Weihenstephan, Maximus-von-Imhof-Forum 3, D-85354 Freising, Germany
| | - Dmitrij Frishman
- Department of Bioinformatics, Technische Universität München, Wissenschaftzentrum Weihenstephan, Maximus-von-Imhof-Forum 3, D-85354 Freising, Germany.,St Petersburg State Polytechnic University, St Petersburg 195251, Russia
| |
Collapse
|
6
|
Savisaar R, Hurst LD. Both Maintenance and Avoidance of RNA-Binding Protein Interactions Constrain Coding Sequence Evolution. Mol Biol Evol 2017; 34:1110-1126. [PMID: 28138077 PMCID: PMC5400389 DOI: 10.1093/molbev/msx061] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
While the principal force directing coding sequence (CDS) evolution is selection on protein function, to ensure correct gene expression CDSs must also maintain interactions with RNA-binding proteins (RBPs). Understanding how our genes are shaped by these RNA-level pressures is necessary for diagnostics and for improving transgenes. However, the evolutionary impact of the need to maintain RBP interactions remains unresolved. Are coding sequences constrained by the need to specify RBP binding motifs? If so, what proportion of mutations are affected? Might sequence evolution also be constrained by the need not to specify motifs that might attract unwanted binding, for instance because it would interfere with exon definition? Here, we have scanned human CDSs for motifs that have been experimentally determined to be recognized by RBPs. We observe two sets of motifs-those that are enriched over nucleotide-controlled null and those that are depleted. Importantly, the depleted set is enriched for motifs recognized by non-CDS binding RBPs. Supporting the functional relevance of our observations, we find that motifs that are more enriched are also slower-evolving. The net effect of this selection to preserve is a reduction in the over-all rate of synonymous evolution of 2-3% in both primates and rodents. Stronger motif depletion, on the other hand, is associated with stronger selection against motif gain in evolution. The challenge faced by our CDSs is therefore not only one of attracting the right RBPs but also of avoiding the wrong ones, all while also evolving under selection pressures related to protein structure.
Collapse
Affiliation(s)
- Rosina Savisaar
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| |
Collapse
|
7
|
Savisaar R, Hurst LD. Estimating the prevalence of functional exonic splice regulatory information. Hum Genet 2017; 136:1059-1078. [PMID: 28405812 PMCID: PMC5602102 DOI: 10.1007/s00439-017-1798-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 04/04/2017] [Indexed: 12/14/2022]
Abstract
In addition to coding information, human exons contain sequences necessary for correct splicing. These elements are known to be under purifying selection and their disruption can cause disease. However, the density of functional exonic splicing information remains profoundly uncertain. Several groups have experimentally investigated how mutations at different exonic positions affect splicing. They have found splice information to be distributed widely in exons, with one estimate putting the proportion of splicing-relevant nucleotides at >90%. These results suggest that splicing could place a major pressure on exon evolution. However, analyses of sequence conservation have concluded that the need to preserve splice regulatory signals only slightly constrains exon evolution, with a resulting decrease in the average human rate of synonymous evolution of only 1–4%. Why do these two lines of research come to such different conclusions? Among other reasons, we suggest that the methods are measuring different things: one assays the density of sites that affect splicing, the other the density of sites whose effects on splicing are visible to selection. In addition, the experimental methods typically consider short exons, thereby enriching for nucleotides close to the splice junction, such sites being enriched for splice-control elements. By contrast, in part owing to correction for nucleotide composition biases and to the assumption that constraint only operates on exon ends, the conservation-based methods can be overly conservative.
Collapse
Affiliation(s)
- Rosina Savisaar
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.
| | - Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| |
Collapse
|
8
|
Pancsa R, Tompa P. Coding Regions of Intrinsic Disorder Accommodate Parallel Functions. Trends Biochem Sci 2016; 41:898-906. [DOI: 10.1016/j.tibs.2016.08.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2016] [Revised: 08/16/2016] [Accepted: 08/19/2016] [Indexed: 02/01/2023]
|
9
|
Homma K, Noguchi T, Fukuchi S. Codon usage is less optimized in eukaryotic gene segments encoding intrinsically disordered regions than in those encoding structural domains. Nucleic Acids Res 2016; 44:10051-10061. [PMID: 27915289 PMCID: PMC5137448 DOI: 10.1093/nar/gkw899] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Revised: 09/15/2016] [Accepted: 09/29/2016] [Indexed: 12/14/2022] Open
Abstract
Codon usage tends to be optimized in highly expressed genes. A plausible explanation for this phenomenon is that translational accuracy is increased in highly expressed genes with infrequent use of rare codons. Besides structural domains (SDs), eukaryotic proteins generally have intrinsically disordered regions (IDRs) that by themselves do not assume unique three-dimensional structures. As IDRs are free from structural constraint, they can probably accommodate more translational errors than SDs can. Thus, codon usage in IDRs is likely to be less optimized than that in SDs. Codon usage in all the genes of seven eukaryotes was examined in terms of both tRNA adaptation index and codon adaptation index. Different amino acid compositions in different protein regions were taken into account in calculating expected adaptation indices, to which observed indices were compared. Codon usage is less optimized in gene regions encoding IDRs than in those corresponding to SDs. The finding does not depend on whether IDRs are located at the N-terminus, in the middle, or at the C-terminus of proteins. Furthermore, the observation remains unchanged in two different algorithms used to predict IDRs in proteins. The result is consistent with the idea that IDRs tolerate more translational errors than SDs.
Collapse
Affiliation(s)
- Keiichi Homma
- Department of Life Science and Informatics, Maebashi Institute of Technology, 460-1 Kamisadori-machi, Maebashi-shi 371-0816, Japan
| | - Tamotsu Noguchi
- Pharmaceutical Education Research Center, Meiji Pharmaceutical University, 2-522-1 Noshio, Kiyose, Tokyo 204-8588, Japan
| | - Satoshi Fukuchi
- Department of Life Science and Informatics, Maebashi Institute of Technology, 460-1 Kamisadori-machi, Maebashi-shi 371-0816, Japan
| |
Collapse
|
10
|
Tovo-Rodrigues L, Recamonde-Mendoza M, Paixão-Côrtes VR, Bruxel EM, Schuch JB, Friedrich DC, Rohde LA, Hutz MH. The role of protein intrinsic disorder in major psychiatric disorders. Am J Med Genet B Neuropsychiatr Genet 2016; 171:848-60. [PMID: 27184105 DOI: 10.1002/ajmg.b.32455] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 04/22/2016] [Indexed: 01/26/2023]
Abstract
Although new candidate genes for Autism Spectrum Disorder (ASD), Schizophrenia (SCZ), Attention-Deficit/Hyperactivity Disorder (ADHD), and Bipolar Disorder (BD) emerged from genome-wide association studies (GWAS), their underlying molecular mechanisms remain poorly understood. Evidences of the involvement of intrinsically disordered proteins in diseases have grown in the last decade. These proteins lack tridimensional structure under physiological conditions and are involved in important cellular functions such as signaling, recognition and regulation. The aim of the present study was to identify the role and abundance of intrinsically disordered proteins in a set of psychiatric diseases and to test whether diseases are different regarding protein intrinsic disorder. Our hypothesis is that differences across psychiatric illnesses phenotypes and symptoms may arise from differences in intrinsic protein disorder content and properties of each group. A bioinformatics prediction of intrinsic disorder was performed in proteins retrieved based on top findings from GWAS, Copy Number Variation and candidate gene investigations for each disease. This approach revealed that about 80% of studied proteins presented long stretches of disorder. This amount was significantly higher than that observed in general eukaryotic proteins, and those involved in cardiovascular diseases. These results suggest that proteins with intrinsic disorder are a common feature of neurodevelopment and synaptic transmission processes which are potentially involved in the etiology of psychiatric diseases. Moreover, we identified differences between ADHD and ASD when the binary prediction of structure and putative binding sites were compared. These differences may be related to variation in symptom complexity between both diseases. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Luciana Tovo-Rodrigues
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil.,Postgraduate Program in Epidemiology, Federal University of Pelotas, Pelotas, RS, Brazil
| | - Mariana Recamonde-Mendoza
- Experimental and Molecular Cardiovascular Laboratory, Hospital de Clinicas de Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | | | - Estela M Bruxel
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Jaqueline B Schuch
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Deise C Friedrich
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Luis A Rohde
- Child and Adolescent Psychiatric Division, Hospital de Clinicas de Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil.,National Institute of Developmental Psychiatry for Children and Adolescents (INCT-CNPq), Brazil
| | - Mara H Hutz
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| |
Collapse
|
11
|
Annibalini G, Bielli P, De Santi M, Agostini D, Guescini M, Sisti D, Contarelli S, Brandi G, Villarini A, Stocchi V, Sette C, Barbieri E. MIR retroposon exonization promotes evolutionary variability and generates species-specific expression of IGF-1 splice variants. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1859:757-68. [DOI: 10.1016/j.bbagrm.2016.03.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Revised: 03/07/2016] [Accepted: 03/23/2016] [Indexed: 12/18/2022]
|
12
|
Cong Q, Borek D, Otwinowski Z, Grishin NV. Skipper genome sheds light on unique phenotypic traits and phylogeny. BMC Genomics 2015; 16:639. [PMID: 26311350 PMCID: PMC4551732 DOI: 10.1186/s12864-015-1846-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 08/14/2015] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Butterflies and moths are emerging as model organisms in genetics and evolutionary studies. The family Hesperiidae (skippers) was traditionally viewed as a sister to other butterflies based on its moth-like morphology and darting flight habits with fast wing beats. However, DNA studies suggest that the family Papilionidae (swallowtails) may be the sister to other butterflies including skippers. The moth-like features and the controversial position of skippers in Lepidoptera phylogeny make them valuable targets for comparative genomics. RESULTS We obtained the 310 Mb draft genome of the Clouded Skipper (Lerema accius) from a wild-caught specimen using a cost-effective strategy that overcomes the high (1.6 %) heterozygosity problem. Comparative analysis of Lerema accius and the highly heterozygous genome of Papilio glaucus revealed differences in patterns of SNP distribution, but similarities in functions of genes that are enriched in non-synonymous SNPs. Comparison of Lepidoptera genomes revealed possible molecular bases for unique traits of skippers: a duplication of electron transport chain components could result in efficient energy supply for their rapid flight; a diversified family of predicted cellulases might allow them to feed on cellulose-enriched grasses; an expansion of pheromone-binding proteins and enzymes for pheromone synthesis implies a more efficient mate-recognition system, which compensates for the lack of clear visual cues due to the similarities in wing colors and patterns of many species of skippers. Phylogenetic analysis of several Lepidoptera genomes suggested that the position of Hesperiidae remains uncertain as the tree topology varied depending on the evolutionary model. CONCLUSION Completion of the first genome from the family Hesperiidae allowed comparative analyses with other Lepidoptera that revealed potential genetic bases for the unique phenotypic traits of skippers. This work lays the foundation for future experimental studies of skippers and provides a rich dataset for comparative genomics and phylogenetic studies of Lepidoptera.
Collapse
Affiliation(s)
- Qian Cong
- Department of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390-8816, USA.
| | - Dominika Borek
- Department of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390-8816, USA.
| | - Zbyszek Otwinowski
- Department of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390-8816, USA.
| | - Nick V Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390-9050, USA. .,Department of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX, 75390-8816, USA.
| |
Collapse
|
13
|
Kryuchkova-Mostacci N, Robinson-Rechavi M. Tissue-Specific Evolution of Protein Coding Genes in Human and Mouse. PLoS One 2015; 10:e0131673. [PMID: 26121354 PMCID: PMC4488272 DOI: 10.1371/journal.pone.0131673] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 06/04/2015] [Indexed: 12/23/2022] Open
Abstract
Protein-coding genes evolve at different rates, and the influence of different parameters, from gene size to expression level, has been extensively studied. While in yeast gene expression level is the major causal factor of gene evolutionary rate, the situation is more complex in animals. Here we investigate these relations further, especially taking in account gene expression in different organs as well as indirect correlations between parameters. We used RNA-seq data from two large datasets, covering 22 mouse tissues and 27 human tissues. Over all tissues, evolutionary rate only correlates weakly with levels and breadth of expression. The strongest explanatory factors of purifying selection are GC content, expression in many developmental stages, and expression in brain tissues. While the main component of evolutionary rate is purifying selection, we also find tissue-specific patterns for sites under neutral evolution and for positive selection. We observe fast evolution of genes expressed in testis, but also in other tissues, notably liver, which are explained by weak purifying selection rather than by positive selection.
Collapse
Affiliation(s)
- Nadezda Kryuchkova-Mostacci
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
14
|
Bergantino F, Guariniello S, Raucci R, Colonna G, De Luca A, Normanno N, Costantini S. Structure–fluctuation–function relationships of seven pro-angiogenic isoforms of VEGFA, important mediators of tumorigenesis. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2015; 1854:410-25. [DOI: 10.1016/j.bbapap.2015.01.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 01/06/2015] [Accepted: 01/14/2015] [Indexed: 10/24/2022]
|
15
|
Smithers B, Oates ME, Gough J. Splice junctions are constrained by protein disorder. Nucleic Acids Res 2015; 43:4814-22. [PMID: 25934802 PMCID: PMC4446445 DOI: 10.1093/nar/gkv407] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 04/15/2015] [Indexed: 01/23/2023] Open
Abstract
We have discovered that positions of splice junctions in genes are constrained by the tolerance for disorder-promoting amino acids in the translated protein region. It is known that efficient splicing requires nucleotide bias at the splice junction; the preferred usage produces a distribution of amino acids that is disorder-promoting. We observe that efficiency of splicing, as seen in the amino-acid distribution, is not compromised to accommodate globular structure. Thus we infer that it is the positions of splice junctions in the gene that must be under constraint by the local protein environment. Examining exonic splicing enhancers found near the splice junction in the gene, reveals that these (short DNA motifs) are more prevalent in exons that encode disordered protein regions than exons encoding structured regions. Thus we also conclude that local protein features constrain efficient splicing more in structure than in disorder.
Collapse
Affiliation(s)
- Ben Smithers
- Department of Computer Science, University of Bristol, Bristol, BS8 1UB, UK
| | - Matt E Oates
- Department of Computer Science, University of Bristol, Bristol, BS8 1UB, UK
| | - Julian Gough
- Department of Computer Science, University of Bristol, Bristol, BS8 1UB, UK
| |
Collapse
|
16
|
N-terminal region of human chemokine receptor CXCR3: Structural analysis of CXCR3(1–48) by experimental and computational studies. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014; 1844:1868-80. [DOI: 10.1016/j.bbapap.2014.08.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2014] [Revised: 08/06/2014] [Accepted: 08/07/2014] [Indexed: 11/20/2022]
|
17
|
An overview of the sequence features of N- and C-terminal segments of the human chemokine receptors. Cytokine 2014; 70:141-50. [PMID: 25138014 DOI: 10.1016/j.cyto.2014.07.257] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2014] [Revised: 06/21/2014] [Accepted: 07/29/2014] [Indexed: 01/10/2023]
Abstract
Chemokine receptors play a crucial role in the cellular signaling enrolling extracellular ligands chemotactic proteins which recruit immune cells. They possess seven trans-membrane helices, an extracellular N-terminal region with three extracellular hydrophilic loops being important for search and recognition of specific ligand(s), and an intracellular C-terminal region with three intracellular loops that couple G-proteins. Although the functional aspects of the terminal segments of the extra-and intra-cellular G proteins are universally identified, the molecular basis on which they rest are still unclear because they are not definable by means of X-rays due to their high mobility and are not easy to study in the membrane. The purpose of this work is to define which physical-chemical properties of the terminal segments of the human chemokine receptors are at the basis of their functional mechanisms. Therefore, we have evaluated their physical-chemical properties in terms of amino acid composition, local flexibility, disorder propensity, net charge distribution and putative sites of post-translational modifications. Our results support the conclusion that all 19 C-terminal and N-terminal segments of human chemokine receptors are very flexible due to the systematic presence of intrinsic disorder. Although, the purpose of this plasticity clearly appears that of controlling and modulating the binding of ligands, we provide evidence that the overlap of linearly charged stretches, intrinsic disorder and post-translational modification sites, consistently found in these motives, is a necessary feature to exert the function. The role of the intrinsic disorder has been discussed considering the structural information coming from intrinsically disordered model compounds which support the view that the chemokine terminals have to be considered as strong polyampholytes or polyelectrolytes where conformational ensembles and structural transitions between them are modulated by charge fraction variations. Also the role of post-translational modifications has been found coherent with this view because, changing the charge fraction, they guide structural transitions between ensembles. Moreover, we have also considered our results from an evolutionary point of view in order to understand if the features found in humans were also present in other species. Our data evidenced that the structural features of the human terminals of the chemokine receptors were shared and evolutionarily conserved particularly among mammals. This means that the various organisms not only tolerate but select intrinsic disorder for the terminal regions of their receptors, reflecting constraints that point to molecular recognition. In conclusion the terminal segments of chemokine receptors must be considered as strong polyampholytes where the charge fraction variations induced by post-translational modifications are the driving physico-chemical feature able to adapt the conformations of the terminal segments to their functions.
Collapse
|