1
|
Bakhtiar D, Vondraskova K, Pengelly RJ, Chivers M, Kralovicova J, Vorechovsky I. Exonic splicing code and coordination of divalent metals in proteins. Nucleic Acids Res 2024; 52:1090-1106. [PMID: 38055834 PMCID: PMC10853796 DOI: 10.1093/nar/gkad1161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 11/15/2023] [Accepted: 11/17/2023] [Indexed: 12/08/2023] Open
Abstract
Exonic sequences contain both protein-coding and RNA splicing information but the interplay of the protein and splicing code is complex and poorly understood. Here, we have studied traditional and auxiliary splicing codes of human exons that encode residues coordinating two essential divalent metals at the opposite ends of the Irving-Williams series, a universal order of relative stabilities of metal-organic complexes. We show that exons encoding Zn2+-coordinating amino acids are supported much less by the auxiliary splicing motifs than exons coordinating Ca2+. The handicap of the former is compensated by stronger splice sites and uridine-richer polypyrimidine tracts, except for position -3 relative to 3' splice junctions. However, both Ca2+ and Zn2+ exons exhibit close-to-constitutive splicing in multiple tissues, consistent with their critical importance for metalloprotein function and a relatively small fraction of expendable, alternatively spliced exons. These results indicate that constraints imposed by metal coordination spheres on RNA splicing have been efficiently overcome by the plasticity of exon-intron architecture to ensure adequate metalloprotein expression.
Collapse
Affiliation(s)
- Dara Bakhtiar
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| | - Katarina Vondraskova
- Slovak Academy of Sciences, Centre of Biosciences, 840 05 Bratislava, Slovak Republic
| | - Reuben J Pengelly
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| | - Martin Chivers
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| | - Jana Kralovicova
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
- Slovak Academy of Sciences, Centre of Biosciences, 840 05 Bratislava, Slovak Republic
| | - Igor Vorechovsky
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| |
Collapse
|
2
|
Genetic association of KCNJ10 rs1130183 with seizure susceptibility and computational analysis of deleterious non-synonymous SNPs of KCNJ10 gene. Gene 2014; 536:247-53. [DOI: 10.1016/j.gene.2013.12.026] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2013] [Revised: 12/05/2013] [Accepted: 12/11/2013] [Indexed: 11/17/2022]
|
3
|
Computational Approaches and Resources in Single Amino Acid Substitutions Analysis Toward Clinical Research. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2014; 94:365-423. [DOI: 10.1016/b978-0-12-800168-4.00010-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
4
|
Masoodi TA, Alsaif MA, Al Shammari SA, Alhamdan AA. Evaluation and identification of damaged single nucleotide polymorphisms in COL1A1 gene involved in osteoporosis. Arch Med Sci 2013; 9:899-905. [PMID: 24273577 PMCID: PMC3832808 DOI: 10.5114/aoms.2012.28598] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/27/2011] [Revised: 09/26/2011] [Accepted: 12/16/2011] [Indexed: 12/11/2022] Open
Abstract
INTRODUCTION Single-nucleotide polymorphisms (SNPs) are biomarkers for exploring the genetic basis of many complex human diseases. The prediction of SNPs is promising in modern genetic analysis but it is still a great challenge to identify the functional SNPs in a disease-related gene. The computational approach has overcome this challenge and an increase in the successful rate of genetic association studies and reduced cost of genotyping have been achieved. The objective of this study is to identify deleterious non-synonymous SNPs (nsSNPs) associated with the COL1A1 gene. MATERIAL AND METHODS The SNPs were retrieved from the Single Nucleotide Polymorphism Database (dbSNP). Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on proteins were predicted by PolyPhen and SIFT respectively. FASTSNP was used for estimation of risk score. RESULTS Our analysis revealed 247 SNPs as non-synonymous, out of which 5 nsSNPs were found to be least stable by I-Mutant 2.0 with a DDG value of > -1.0. Four nsSNPs, namely rs17853657, rs17857117, rs57377812 and rs1059454, showed a highly deleterious tolerance index score of 0.00 with a change in their physicochemical properties by the SIFT server. Seven nsSNPs, namely rs1059454, rs8179178, rs17853657, rs17857117, rs72656340, rs72656344 and rs72656351, were found to be probably damaging with a PSIC score difference between 2.0 and 3.5 by the PolyPhen server. Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, were found to be highly polymorphic with a risk score of 3-4 with a possible effect of non-conservative change and splicing regulation by FASTSNP. CONCLUSIONS Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, are potential functional polymorphisms that are likely to have a functional impact on the COL1A1 gene.
Collapse
Affiliation(s)
- Tariq Ahmad Masoodi
- Health Care Development for Elderly Research Chair, College of Applied Medical Sciences, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | | | | | | |
Collapse
|
5
|
Masoodi TA, Al Shammari SA, Al-Muammar MN, Alhamdan AA, Talluri VR. Exploration of deleterious single nucleotide polymorphisms in late-onset Alzheimer disease susceptibility genes. Gene 2013; 512:429-37. [DOI: 10.1016/j.gene.2012.08.026] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2012] [Revised: 07/27/2012] [Accepted: 08/17/2012] [Indexed: 02/03/2023]
|
6
|
Quirk ME, Dobrowolski SF, Nelson BE, Coffee B, Singh RH. Utility of phenylalanine hydroxylase genotype for tetrahydrobiopterin responsiveness classification in patients with phenylketonuria. Mol Genet Metab 2012; 107:31-6. [PMID: 22841515 PMCID: PMC4029439 DOI: 10.1016/j.ymgme.2012.07.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/24/2012] [Revised: 07/10/2012] [Accepted: 07/11/2012] [Indexed: 10/28/2022]
Abstract
BACKGROUND A need exists to expand the characterization of tetrahydrobiopterin (BH(4)) responsiveness in patients with phenylketonuria (PKU), beyond simply evaluating change in blood phenylalanine concentrations. The clinical interpretation of BH(4) responsiveness should be evaluated within the context of phenylalanine hydroxylase (PAH) genotype. AIM This investigation seeks to use a modified version of a previously developed PAH genotype severity tool, the assigned value (AV) sum, to assess the molecular basis of responsiveness in a clinical cohort and to explore the tool's ability to differentiate BH(4) responsive groups. METHODS BH(4) response was previously clinically classified in 58 patients with PKU, with three response groups emerging: definitive responders, provisional responders, and non-responders. Provisional responders represented a clinically ambiguous group, with an initial decrease in plasma phenylalanine concentrations, but limited ability to improve dietary phenylalanine tolerance. In this retrospective analysis, mutations in the PAH gene were identified in each patient. PAH genotype was characterized through the AV sum approach, in which each mutation is given an AV of 1, 2, 4, or 8; the sum of both mutations' AV corresponds to genotype severity, with a lower number representing a more severe phenotype. An AV sum cutoff of 2 (indicative of the most severe genotypes) was used to dichotomize patients and predict BH(4) responsiveness. Provisional responders were classified with the definitive responders then the non-responders to see with which group they best aligned. RESULTS In 17/19 definitive responders, at least one mutation was mild or moderate in severity (AV sum>2). In contrast, 7/9 provisional responders carried two severe or null mutations (AV sum=2), suggesting little molecular basis for responsiveness. Non-responders represent a heterogeneous group with 15/25 patients carrying two severe mutations (AV sum=2), 5/25 patients carrying one moderate or mild mutation in combination with a severe or null mutation (AV sum>2), and the remaining five patients carrying an uncharacterized mutation in combination with a severe mutation. Predictive sensitivity of the AV sum was maximized (89.5% vs. 67.9%) with limited detriment to specificity (79.4% vs. 80.0%), by classifying provisional responders with the non-responders rather than with the definitive responders. CONCLUSIONS In our clinical cohort, the AV sum tool was able to identify definitive responders with a high degree of sensitivity. As demonstrated by both the provisional responder group and the substantial number of non-responders with AV sums>2, a potential exists for misclassification when BH(4) response is determined by relying solely on change in plasma phenylalanine concentrations. PAH genotype should be incorporated in the clinical evaluation of BH(4) responsiveness.
Collapse
Affiliation(s)
- Meghan E. Quirk
- Division of Biological and Biomedical Sciences, Nutrition and Health Sciences, Emory University, Atlanta, GA, USA
| | - Steven F. Dobrowolski
- Department of Pathology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | - Bradford Coffee
- Department of Human Genetics, Emory University School of Medicine, Decatur, GA, USA
| | - Rani H. Singh
- Division of Biological and Biomedical Sciences, Nutrition and Health Sciences, Emory University, Atlanta, GA, USA
- Department of Human Genetics, Emory University School of Medicine, Decatur, GA, USA
| |
Collapse
|
7
|
Heintz C, Dobrowolski SF, Andersen HS, Demirkol M, Blau N, Andresen BS. Splicing of phenylalanine hydroxylase (PAH) exon 11 is vulnerable: molecular pathology of mutations in PAH exon 11. Mol Genet Metab 2012; 106:403-11. [PMID: 22698810 DOI: 10.1016/j.ymgme.2012.05.013] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2012] [Revised: 05/20/2012] [Accepted: 05/20/2012] [Indexed: 02/04/2023]
Abstract
In about 20-30% of phenylketonuria (PKU) patients, phenylalanine (Phe) levels can be controlled by cofactor 6R-tetrahydrobiopterin (BH(4)) administration. The phenylalanine hydroxylase (PAH) genotype has a predictive value concerning BH(4)-response and therefore a correct assessment of the mutation molecular pathology is important. Mutations that disturb the splicing of exons (e.g. interplay between splice site strength and regulatory sequences like exon splicing enhancers (ESEs)/exon splicing silencers (ESSs)) may cause different severity of PKU. In this study, we identified PAH exon 11 as a vulnerable exon and used patient derived lymphoblast cell lines and PAH minigenes to study the molecular defect that impacted pre-mRNA processing. We showed that the c.1144T>C and c.1066-3C>T mutations cause exon 11 skipping, while the c.1139C>T mutation is neutral or slightly beneficial. The c.1144T>C mutation resides in a putative splicing enhancer motif and binding by splicing factors SF2/ASF, SRp20 and SRp40 is disturbed. Additional mutations in potential splicing factor binding sites contributed to elucidate the pathogenesis of mutations in PAH exon 11. We suggest that PAH exon 11 is vulnerable due to a weak 3' splice site and that this makes exon 11 inclusion dependent on an ESE spanning position c.1144. Importantly, this implies that other mutations in exon 11 may affect splicing, since splicing is often determined by a fine balance between several positive and negative splicing regulatory elements distributed throughout the exon. Finally, we identified a pseudoexon in intron 11, which would have pathogenic consequences if activated by mutations or improved splicing conditions. Exonic mutations that disrupt splicing are unlikely to facilitate response to BH(4) and may lead to inconsistent genotype-phenotype correlations. Therefore, recognizing such mutations enhances our ability to predict the BH(4)-response.
Collapse
Affiliation(s)
- Caroline Heintz
- Division of Clinical Chemistry and Biochemistry, University Children's Hospital, Zürich, Switzerland
| | | | | | | | | | | |
Collapse
|
8
|
Masoodi TA, Shammari SAA, Al-Muammar MN, Almubrad TM, Alhamdan AA. Screening and structural evaluation of deleterious Non-Synonymous SNPs of ePHA2 gene involved in susceptibility to cataract formation. Bioinformation 2012; 8:562-7. [PMID: 22829731 PMCID: PMC3398778 DOI: 10.6026/97320630008562] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2012] [Accepted: 05/24/2012] [Indexed: 02/02/2023] Open
Abstract
Age-related cataract is clinically and genetically heterogeneous disorder affecting the ocular lens, and the leading cause of vision loss and blindness worldwide. Here we screened nonsynonymous single nucleotide polymorphisms (nsSNPs) of a novel gene, EPHA2 responsible for age related cataracts. The SNPs were retrieved from dbSNP. Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on protein was predicted by PolyPhen and SIFT respectively. FASTSNP was used for functional analysis and estimation of risk score. The functional impact on the EPHA2 protein was evaluated by using SWISSPDB viewer and NOMAD-Ref server. Our analysis revealed 16 SNPs as nonsynonymous out of which 6 nsSNPs, namely rs11543934, rs2291806, rs1058371, rs1058370, rs79100278 and rs113882203 were found to be least stable by I-Mutant 2.0 with DDG value of > -1.0. nsSNPs, namely rs35903225, rs2291806, rs1058372, rs1058370, rs79100278 and rs113882203 showed a highly deleterious tolerance index score of 0.00 by SIFT server. Four nsSNPs namely rs11543934, rs2291806, rs1058370 and rs113882203 were found to be probably damaging with PSIC score of ≥ 2. 0 by Polyp hen server. Three nsSNPs namely, rs11543934, rs2291806 and rs1058370 were found to be highly polymorphic with a risk score of 3-4 with a possible effect of Non-conservative change and splicing regulation by FASTSNP. The total energy and RMSD value was higher for the mutant-type structure compared to the native type structure. We concluded that the nsSNP namely rs2291806 as the potential functional polymorphic that is likely to have functional impact on the EPHA2 gene.
Collapse
Affiliation(s)
- Tariq Ahmad Masoodi
- Health Care Development for Elderly Research Chair, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Sulaiman A Al Shammari
- Health Care Development for Elderly Research Chair, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | - May N Al-Muammar
- Health Care Development for Elderly Research Chair, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Turki M Almubrad
- Department of Optometry, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Adel A Alhamdan
- Health Care Development for Elderly Research Chair, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| |
Collapse
|
9
|
Masoodi TA, Rao Talluri V, Shaik NA, Al-Aama JY, Hasan Q. Functional genomics based prioritization of potential nsSNPs in EPHX1, GSTT1, GSTM1 and GSTP1 genes for breast cancer susceptibility studies. Genomics 2012; 99:330-9. [DOI: 10.1016/j.ygeno.2012.04.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Revised: 04/17/2012] [Accepted: 04/23/2012] [Indexed: 10/28/2022]
|
10
|
Screening and Evaluation of Deleterious SNPs in APOE Gene of Alzheimer's Disease. Neurol Res Int 2012; 2012:480609. [PMID: 22530123 PMCID: PMC3317072 DOI: 10.1155/2012/480609] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2011] [Accepted: 12/09/2011] [Indexed: 11/21/2022] Open
Abstract
Introduction. Apolipoprotein E (APOE) is an important risk factor for Alzheimer's disease (AD) and is present in 30–50% of patients who develop late-onset AD. Several single-nucleotide polymorphisms (SNPs) are present in APOE gene which act as the biomarkers for exploring the genetic basis of this disease. The objective of this study is to identify deleterious nsSNPs associated with APOE gene. Methods. The SNPs were retrieved from dbSNP. Using I-Mutant, protein stability change was calculated. The potentially functional nonsynonymous (ns) SNPs and their effect on protein was predicted by PolyPhen and SIFT, respectively. FASTSNP was used for functional analysis and estimation of risk score. The functional impact on the APOE protein was evaluated by using Swiss PDB viewer and NOMAD-Ref server. Results. Six nsSNPs were found to be least stable by I-Mutant 2.0 with DDG value of >−1.0. Four nsSNPs showed a highly deleterious tolerance index score of 0.00. Nine nsSNPs were found to be probably damaging with position-specific independent counts (PSICs) score of ≥2.0. Seven nsSNPs were found to be highly polymorphic with a risk score of 3-4. The total energies and root-mean-square deviation (RMSD) values were higher for three mutant-type structures compared to the native modeled structure. Conclusion. We concluded that three nsSNPs, namely, rs11542041, rs11542040, and rs11542034, to be potentially functional polymorphic.
Collapse
|
11
|
Sethumadhavan R, Doss CGP, Rajasekaran R. In silico searching for disease-associated functional DNA variants. Methods Mol Biol 2011; 760:239-250. [PMID: 21780001 DOI: 10.1007/978-1-61779-176-5_15] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Experimental analyses of disease-associated DNA variants have provided significant insights into the functional implications of sequence variation. However, such experiment-based approaches for identifying functional DNA variants from a pool with a large number of neutral variants are challenging. Computational biology has the opportunity to play an important role in the identification of functional DNA variants in large-scale genotyping studies, ultimately yielding new drug targets and biomarkers. This chapter outlines in silico methods to predict disease-associated functional DNA variants so that the number of DNA variants screened for association with disease can be reduced to those that are most likely to alter gene function. To explore possible relationships between genetic mutations and phenotypic variation, different computational methods like Sorting Intolerant from Tolerant (SIFT, an evolutionary-based approach), Polymorphism Phenotyping (PolyPhen, a structure-based approach) and PupaSuite are discussed for prioritization of high-risk DNA variants. The PupaSuite tool aims to predict the phenotypic effect of DNA variants on the structure and function of the affected protein as well as the effect of variants in the non-coding regions of the same genes. To further investigate the possible causes of disease at the molecular level, deleterious nonsynonymous variants can be mapped to 3D protein structures. An analysis of solvent accessibility and secondary structure can also be performed to understand the impact of a mutation on protein function and stability. This chapter demonstrates a 'real-world' application of some existing bioinformatics tools for DNA variant analysis.
Collapse
Affiliation(s)
- Rao Sethumadhavan
- School of BioSciences and Technology, Vellore Institute of Technology, 632014, Vellore, Tamil Nadu, India.
| | | | | |
Collapse
|
12
|
Doss CGP, Sethumadhavan R. Investigation on the role of nsSNPs in HNPCC genes--a bioinformatics approach. J Biomed Sci 2009; 16:42. [PMID: 19389263 PMCID: PMC2682794 DOI: 10.1186/1423-0127-16-42] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2008] [Accepted: 04/24/2009] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND A central focus of cancer genetics is the study of mutations that are causally implicated in tumorigenesis. The identification of such causal mutations not only provides insight into cancer biology but also presents anticancer therapeutic targets and diagnostic markers. Missense mutations are nucleotide substitutions that change an amino acid in a protein, the deleterious effects of these mutations are commonly attributed to their impact on primary amino acid sequence and protein structure. METHODS The method to identify functional SNPs from a pool, containing both functional and neutral SNPs is challenging by experimental protocols. To explore possible relationships between genetic mutation and phenotypic variation, we employed different bioinformatics algorithms like Sorting Intolerant from Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), and PupaSuite to predict the impact of these amino acid substitutions on protein activity of mismatch repair (MMR) genes causing hereditary nonpolyposis colorectal cancer (HNPCC). RESULTS SIFT classified 22 of 125 variants (18%) as 'Intolerant." PolyPhen classified 40 of 125 amino acid substitutions (32%) as "Probably or possibly damaging". The PupaSuite predicted the phenotypic effect of SNPs on the structure and function of the affected protein. Based on the PolyPhen scores and availability of three-dimensional structures, structure analysis was carried out with the major mutations that occurred in the native protein coded by MSH2 and MSH6 genes. The amino acid residues in the native and mutant model protein were further analyzed for solvent accessibility and secondary structure to check the stability of the proteins. CONCLUSION Based on this approach, we have shown that four nsSNPs, which were predicted to have functional consequences (MSH2-Y43C, MSH6-Y538S, MSH6-S580L, and MSH6-K854M), were already found to be associated with cancer risk. Our study demonstrates the presence of other deleterious mutations and also endorses with in vivo experimental studies.
Collapse
Affiliation(s)
- C George Priya Doss
- Bioinformatics Division, School of Biotechnology, Chemical and Biomedical Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India.
| | | |
Collapse
|
13
|
Dong C, Qian Z, Jia P, Wang Y, Huang W, Li Y. Gene-centric characteristics of genome-wide association studies. PLoS One 2007; 2:e1262. [PMID: 18060058 PMCID: PMC2092383 DOI: 10.1371/journal.pone.0001262] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2007] [Accepted: 10/12/2007] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The high-throughput genotyping chips have contributed greatly to genome-wide association (GWA) studies to identify novel disease susceptibility single nucleotide polymorphisms (SNPs). The high-density chips are designed using two different SNP selection approaches, the direct gene-centric approach, and the indirect quasi-random SNPs or linkage disequilibrium (LD)-based tagSNPs approaches. Although all these approaches can provide high genome coverage and ascertain variants in genes, it is not clear to which extent these approaches could capture the common genic variants. It is also important to characterize and compare the differences between these approaches. METHODOLOGY/PRINCIPAL FINDINGS In our study, by using both the Phase II HapMap data and the disease variants extracted from OMIM, a gene-centric evaluation was first performed to evaluate the ability of the approaches in capturing the disease variants in Caucasian population. Then the distribution patterns of SNPs were also characterized in genic regions, evolutionarily conserved introns and nongenic regions, ontologies and pathways. The results show that, no mater which SNP selection approach is used, the current high-density SNP chips provide very high coverage in genic regions and can capture most of known common disease variants under HapMap frame. The results also show that the differences between the direct and the indirect approaches are relatively small. Both have similar SNP distribution patterns in these gene-centric characteristics. CONCLUSIONS/SIGNIFICANCE This study suggests that the indirect approaches not only have the advantage of high coverage but also are useful for studies focusing on various functional SNPs either in genes or in the conserved regions that the direct approach supports. The study and the annotation of characteristics will be helpful for designing and analyzing GWA studies that aim to identify genetic risk factors involved in common diseases, especially variants in genes and conserved regions.
Collapse
Affiliation(s)
- Changzheng Dong
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ziliang Qian
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Peilin Jia
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ying Wang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
| | - Wei Huang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Rui Jin Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China
| | - Yixue Li
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Shanghai Center for Bioinformation Technology, Shanghai, China
| |
Collapse
|
14
|
Kusz K, Ginter-Matuszewska B, Ziolkowska K, Spik A, Bierla J, Jedrzejczak P, Latos-Bielenska A, Pawelczyk L, Jaruzelska J. Polymorphisms of the humanPUMILIO2 gene and male sterility. Mol Reprod Dev 2007; 74:795-9. [PMID: 17154300 DOI: 10.1002/mrd.20683] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The highly conserved Pumilio protein plays crucial roles in fertility of many organisms acting as a repressor of translation, and causing infertility when mutated. Although one of two human Pumilio homologs, PUMILIO2 is expressed mainly in the germ line, its role in mammalian germ cell development has not been reported yet. To shed light on the role of PUMILIO2 in development of the human male germ line, we screened this gene for mutations in 137 patients presenting a variety of phenotypes with spermatogenic failure. The first variant, we identified was a single base substitution within intron 15 (IVS15 + 6G > A). This variant was found in three azoospermic males, the second allele being the wild type. However, this variant was also present among fertile males, as frequently as in the patients. Although location of IVS15 + 6G > A substitution in close proximity to the canonical donor splice site GT, indicates that its influence on splicing cannot be excluded, our preliminary cDNA analysis has not revealed evidence of a splicing abnormality of PUMILIO2 pre-mRNA carrying this variant. Nevertheless, this study provides new interesting variant containing a donor splice site variant, which can be relevant for understanding of splicing mechanism of mammalian genes. The second variant, c.774 C > T transversion (Y258Y) in exon 6 was found only in one patient, but an influence on PUMILIO2 function is not obvious. Altogether, this study shows that variation in the PUMILIO2 gene is very low and it seems improbable that mutations of this gene significantly contribute to male infertility in humans.
Collapse
Affiliation(s)
- K Kusz
- Institute of Human Genetics Polish Academy of Sciences, Poznan, Poland
| | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Desviat LR, Pérez B, Bèlanger-Quintana A, Castro M, Aguado C, Sánchez A, García MJ, Martínez-Pardo M, Ugarte M. Tetrahydrobiopterin responsiveness: results of the BH4 loading test in 31 Spanish PKU patients and correlation with their genotype. Mol Genet Metab 2004; 83:157-62. [PMID: 15464430 DOI: 10.1016/j.ymgme.2004.06.007] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2004] [Revised: 06/12/2004] [Accepted: 06/16/2004] [Indexed: 11/25/2022]
Abstract
Tetrahydrobiopterin (BH4) responsiveness in patients with mutations in the phenylalanine hydroxylase (PAH) gene is a recently recognized subtype of hyperphenylalaninemia characterized by a positive BH4 loading test. According to recent estimates, this phenotype may be quite common, suggesting that a large group of individuals may benefit from BH4 substitution, eliminating the need of life-long dietary restrictions. This underscores the importance of identifying BH4-responsive patients in each population, establishing the association with specific PAH mutations. In this work, we describe the results of a pilot study performed with 31 Spanish PAH-deficient patients subjected to a BH4 loading test. Overall, 11/31 (37%) showed a positive response with a 30% decrease in blood Phe levels 8 h after the BH4 challenge, and three additional patients, considered slow responders, showed this decrease only after 12-16 h. We report for the first time a patient homozygous for a splicing mutation with a slow response, suggesting an effect of BH4 supplementation on PAH gene expression. Most of the responsive patients belong to the mild hyperphenylalaninemia (MHP) or mild phenylketonuria phenotypic groups. In MHP patients we report for the first time the results of parallel single Phe doses confirming the utility of these analyses for a better evaluation of the response. Genotype analysis confirms the involvement in the response of specific mutations (D415N, S87R, R176L, E390G, and A309V) present in hemizygous patients, and provide relevant information for the discussion of the potential mechanisms underlying BH4 responsiveness.
Collapse
|
16
|
Stitziel NO, Binkowski TA, Tseng YY, Kasif S, Liang J. topoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association. Nucleic Acids Res 2004; 32:D520-2. [PMID: 14681472 PMCID: PMC308838 DOI: 10.1093/nar/gkh104] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The database of topographic mapping of Single Nucleotide Polymorphism (topoSNP) provides an online resource for analyzing non-synonymous SNPs (nsSNPs) that can be mapped onto known 3D structures of proteins. These include disease- associated nsSNPs derived from the Online Mendelian Inheritance in Man (OMIM) database and other nsSNPs derived from dbSNP, a resource at the National Center for Biotechnology Information that catalogs SNPs. TopoSNP further classifies each nsSNP site into three categories based on their geometric location: those located in a surface pocket or an interior void of the protein, those on a convex region or a shallow depressed region, and those that are completely buried in the interior of the protein structure. These unique geometric descriptions provide more detailed mapping of nsSNPs to protein structures. The current release also includes relative entropy of SNPs calculated from multiple sequence alignment as obtained from the Pfam database (a database of protein families and conserved protein motifs) as well as manually adjusted multiple alignments obtained from ClustalW. These structural and conservational data can be useful for studying whether nsSNPs in coding regions are likely to lead to phenotypic changes. TopoSNP includes an interactive structural visualization web interface, as well as downloadable batch data. The database will be updated at regular intervals and can be accessed at: http://gila.bioengr.uic.edu/snp/toposnp.
Collapse
Affiliation(s)
- Nathan O Stitziel
- Department of Bioengineering, University of Illinois at Chicago, M/C 063, 851 S. Morgan Street, Chicago, IL 60607, USA
| | | | | | | | | |
Collapse
|
17
|
Stitziel NO, Tseng YY, Pervouchine D, Goddeau D, Kasif S, Liang J. Structural location of disease-associated single-nucleotide polymorphisms. J Mol Biol 2003; 327:1021-30. [PMID: 12662927 DOI: 10.1016/s0022-2836(03)00240-7] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Non-synonymous single-nucleotide polymorphism (nsSNP) of genes introduces amino acid changes to proteins, and plays an important role in providing genetic functional diversity. To understand the structural characteristics of disease-associated SNPs, we have mapped a set of nsSNPs derived from the online mendelian inheritance in man (OMIM) database to the structural surfaces of encoded proteins. These nsSNPs are disease-associated or have distinctive phenotypes. As a control dataset, we mapped a set of nsSNPs derived from SNP database dbSNP to the structural surfaces of those encoded proteins. Using the alpha shape method from computational geometry, we examine the geometric locations of the structural sites of these nsSNPs. We classify each nsSNP site into one of three categories of geometric locations: those in a pocket or a void (type P); those on a convex region or a shallow depressed region (type S); and those that are buried completely in the interior (type I). We find that the majority (88%) of disease-associated nsSNPs are located in voids or pockets, and they are infrequently observed in the interior of proteins (3.2% in the data set). We find that nsSNPs mapped from dbSNP are less likely to be located in pockets or voids (68%). We further introduce a novel application of hidden Markov models (HMM) for analyzing sequence homology of SNPs on various geometric sites. For SNPs on surface pocket or void, we find that there is no strong tendency for them to occur on conserved residues. For SNPs buried in the interior, we find that disease-associated mutations are more likely to be conserved. The approach of classifying nsSNPs with alpha shape and HMM developed in this study can be integrated with additional methods to improve the accuracy of predictions of whether a given nsSNP is likely to be disease-associated.
Collapse
Affiliation(s)
- Nathan O Stitziel
- Department of Bioengineering SEO, MC-063, University of Illinois at Chicago, Room 218, 851, S. Morgan Street, Chicago, IL 60607-7052, USA
| | | | | | | | | | | |
Collapse
|
18
|
Anikster Y, Lucero C, Guo J, Huizing M, Shotelersuk V, Bernardini I, McDowell G, Iwata F, Kaiser-Kupfer MI, Jaffe R, Thoene J, Schneider JA, Gahl WA. Ocular nonnephropathic cystinosis: clinical, biochemical, and molecular correlations. Pediatr Res 2000; 47:17-23. [PMID: 10625078 DOI: 10.1203/00006450-200001000-00007] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Ocular nonnephropathic cystinosis, a variant of the classic nephropathic type of cystinosis, is an autosomal recessive lysosomal storage disorder characterized by photophobia due to corneal cystine crystals but absence of renal disease. We determined the molecular basis for ocular cystinosis in four individuals. All had mutations in the cystinosis gene CTNS, indicating that ocular cystinosis is allelic with classic nephropathic cystinosis. The ocular cystinosis patients each had one severe mutation and one mild mutation, the latter consisting of either a 928 G-->A (G197R) mutation or an IVS10-3 C-->G splicing mutation resulting in the insertion of 182 bp of IVS10 into the CTNS mRNA. The mild mutations appear to allow for residual CTNS mRNA production, significant amounts of lysosomal cystine transport, and lower levels of cellular cystine compared with those in nephropathic cystinosis. The lack of kidney involvement in ocular cystinosis may be explained by two different mechanisms. On the one hand (e.g. the G197R mutation), significant residual cystinosin activity may be present in every tissue. On the other hand (e.g. the IVS 10-3 C-->G mutation), substantial cystinosin activity may exist in the kidney because of that tissue's specific expression of factors that promote splicing of a normal CTNS transcript. Each of these mechanisms could result in minimally reduced lysosomal cystine transport in the kidneys.
Collapse
Affiliation(s)
- Y Anikster
- Section on Human Biochemical Genetics, Heritable Disorders Branch, National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Ferranti P, Lilla S, Chianese L, Addeo F. Alternative nonallelic deletion is constitutive of ruminant alpha(s1)-casein. JOURNAL OF PROTEIN CHEMISTRY 1999; 18:595-602. [PMID: 10524777 DOI: 10.1023/a:1020659518748] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Multiple forms of alpha(s1)-casein were identified in the four major ruminant species by structural characterization of the protein fraction. While alpha(s1)-casein phenotypes were constituted by a mixture of at least seven molecular forms in ovine and caprine species, there were only two forms in bovine and water buffalo species. In ovine and caprine forms the main component corresponded to the 199-residue-long form, and the deleted proteins differed from the complete one by the absence of peptides 141-148, 110-117, or Gln78, or a combination of such deletions. The deleted segments corresponded to the sequence regions encoded by exons 13 and 16, and by the first triplet of exon 11 (CAG), suggesting that the occurrence of the short protein forms is due to alternative skipping, as previously demonstrated for some caprine and ovine phenotypes. The alternative deletion of Gln78 in alpha(s1)-casein, the only form common to the milk of all the species examined and located in a sequence region joining the polar phosphorylation cluster and the hydrophobic C-terminal domain of the protein, may play a functional role in the stabilization of the milk micelle structure.
Collapse
Affiliation(s)
- P Ferranti
- Dipartimento di Scienza degli Alimenti, Università degli Studi di Napoli Federico II, Portici, Italy.
| | | | | | | |
Collapse
|
20
|
Vignal L, Lisacek F, Quinqueton J, d'Aubenton-Carafa Y, Thermes C. A multi-agent system simulating human splice site recognition. COMPUTERS & CHEMISTRY 1999; 23:219-31. [PMID: 10404617 DOI: 10.1016/s0097-8485(99)00019-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The present paper describes a method detecting splice sites automatically on the basis of sequence data and models of site/signal recognition supported by experimental evidences. The method is designed to simulate splicing and while doing so, track prediction failures, missing information and possibly test correcting hypotheses. Correlations between nucleotides in the splice site regions and the various elements of the acceptor region are evaluated and combined to assess compensating interactions between elements of the splicing machinery. A scanning model of the acceptor region and a model of interaction between the splicing complexes (exon definition model) are also incorporated in the detection process. Subsets of sites presenting deficiencies of several splice site elements could be identified. Further examination of these sites helps to determine lacking elements and refine models.
Collapse
|