1
|
Shi DM, Dong SS, Zhou HX, Song DQ, Wan JL, Wu WZ. Genomic and transcriptomic profiling reveals key molecules in metastatic potentials and organ-tropisms of hepatocellular carcinoma. Cell Signal 2023; 104:110565. [PMID: 36539000 DOI: 10.1016/j.cellsig.2022.110565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 12/04/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]
Abstract
Metastasis is a landmark event for rapid postsurgical relapse and death of HCC patients. Although distinct genomic and transcriptomic profiling of HCC metastasis had been reported previously, the causal relationships of somatic mutants, mRNA levels and metastatic potentials were difficult to be established in clinic. Therefore, 11 human HCC cell lines and 7 monoclonal derivatives with definite metastatic potentials and tropisms were subjected to whole exome sequencing (WES) and whole transcriptome sequencing (WTS). TP53, MYO5A, ROS1 and ARID2 were the prominent mutants of metastatic drivers in HCC cells. During HCC clonal evaluation, TP53, MYO5A and ROS1 mutations occurred in the early stage, EXT2 and NIN in the late stage. NF1 mutant was unique in lung tropistic cell lines, RNF126 mutant in lymphatic tropistic ones. PER1, LMO2, GAS7, NR4A3 expression levels were positively associated with relapse-free survival (RFS) of HCC patients. The integrative analysis revealed 58 genes exhibited both somatic mutation and dysregulated mRNA levels in high metastatic cells. Altogether, metastatic drivers could accumulate gradually at different stages during HCC progression, some drivers might modulate HCC metastatic potentials and the others regulate metastatic tropisms.
Collapse
Affiliation(s)
- Dong-Min Shi
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Shanghai 200032, China; Department of Medical Oncology, Changzheng Hospital, Shanghai, People's Republic of China
| | - Shuang-Shuang Dong
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Shanghai 200032, China
| | - Hong-Xing Zhou
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Shanghai 200032, China
| | - Dong-Qiang Song
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Shanghai 200032, China
| | - Jin-Liang Wan
- Department of Medical Oncology, Binzhou Medical University Hospital, Binzhou, Shandong 256603, China
| | - Wei-Zhong Wu
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Shanghai 200032, China.
| |
Collapse
|
2
|
Shimada MK, Sanbonmatsu R, Yamaguchi-Kabata Y, Yamasaki C, Suzuki Y, Chakraborty R, Gojobori T, Imanishi T. Selection pressure on human STR loci and its relevance in repeat expansion disease. Mol Genet Genomics 2016; 291:1851-69. [PMID: 27290643 DOI: 10.1007/s00438-016-1219-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2015] [Accepted: 05/21/2016] [Indexed: 12/30/2022]
Abstract
Short Tandem Repeats (STRs) comprise repeats of one to several base pairs. Because of the high mutability due to strand slippage during DNA synthesis, rapid evolutionary change in the number of repeating units directly shapes the range of repeat-number variation according to selection pressure. However, the remaining questions include: Why are STRs causing repeat expansion diseases maintained in the human population; and why are these limited to neurodegenerative diseases? By evaluating the genome-wide selection pressure on STRs using the database we constructed, we identified two different patterns of relationship in repeat-number polymorphisms between DNA and amino-acid sequences, although both patterns are evolutionary consequences of avoiding the formation of harmful long STRs. First, a mixture of degenerate codons is represented in poly-proline (poly-P) repeats. Second, long poly-glutamine (poly-Q) repeats are favored at the protein level; however, at the DNA level, STRs encoding long poly-Qs are frequently divided by synonymous SNPs. Furthermore, significant enrichments of apoptosis and neurodevelopment were biological processes found specifically in genes encoding poly-Qs with repeat polymorphism. This suggests the existence of a specific molecular function for polymorphic and/or long poly-Q stretches. Given that the poly-Qs causing expansion diseases were longer than other poly-Qs, even in healthy subjects, our results indicate that the evolutionary benefits of long and/or polymorphic poly-Q stretches outweigh the risks of long CAG repeats predisposing to pathological hyper-expansions. Molecular pathways in neurodevelopment requiring long and polymorphic poly-Q stretches may provide a clue to understanding why poly-Q expansion diseases are limited to neurodegenerative diseases.
Collapse
Affiliation(s)
- Makoto K Shimada
- Institute for Comprehensive Medical Science, Fujita Health University, 1-98 Dengakugakubo, Kutsukake-cho, Toyoake, Aichi, 470-1192, Japan. .,National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi Koto-ku, Tokyo, 135-0064, Japan. .,Japan Biological Informatics Consortium, 10F TIME24 Building, 2-4-32 Aomi, Koto-ku, Tokyo, 135-8073, Japan.
| | - Ryoko Sanbonmatsu
- Japan Biological Informatics Consortium, 10F TIME24 Building, 2-4-32 Aomi, Koto-ku, Tokyo, 135-8073, Japan
| | - Yumi Yamaguchi-Kabata
- National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi Koto-ku, Tokyo, 135-0064, Japan.,Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, 980-8573, Japan
| | - Chisato Yamasaki
- National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi Koto-ku, Tokyo, 135-0064, Japan.,Japan Biological Informatics Consortium, 10F TIME24 Building, 2-4-32 Aomi, Koto-ku, Tokyo, 135-8073, Japan
| | - Yoshiyuki Suzuki
- Graduate School of Natural Sciences, Nagoya City University, 1 Yamanohata, Mizuho-cho, Mizuho-ku, Nagoya, Aichi, 467-8501, Japan
| | - Ranajit Chakraborty
- Health Science Center, University of North Texas, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Takashi Gojobori
- National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi Koto-ku, Tokyo, 135-0064, Japan.,Computational Bioscience Research Center, King Abdullah University of Science and Technology, Ibn Al-Haytham Building (West), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - Tadashi Imanishi
- National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi Koto-ku, Tokyo, 135-0064, Japan.,Department of Molecular Life Science, Tokai University School of Medicine, 143 Shimokasuya, Isehara, Kanagawa, 259-1193, Japan
| |
Collapse
|
3
|
Nagai Y, Takahashi Y, Imanishi T. VaDE: a manually curated database of reproducible associations between various traits and human genomic polymorphisms. Nucleic Acids Res 2014; 43:D868-72. [PMID: 25361969 PMCID: PMC4383886 DOI: 10.1093/nar/gku1037] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified numerous single nucleotide polymorphisms (SNPs) associated with the development of common diseases. However, it is clear that genetic risk factors of common diseases are heterogeneous among human populations. Therefore, we developed a database of genomic polymorphisms that are reproducibly associated with disease susceptibilities, drug responses and other traits for each human population: 'VarySysDB Disease Edition' (VaDE; http://bmi-tokai.jp/VaDE/). SNP-trait association data were obtained from the National Human Genome Research Institute GWAS (NHGRI GWAS) catalog and RAvariome, and we added detailed information of sample populations by curating original papers. In addition, we collected and curated original papers, and registered the detailed information of SNP-trait associations in VaDE. Then, we evaluated reproducibility of associations in each population by counting the number of significantly associated studies. VaDE provides literature-based SNP-trait association data and functional genomic region annotation for SNP functional research. SNP functional annotation data included experimental data of the ENCODE project, H-InvDB transcripts and the 1000 Genome Project. A user-friendly web interface was developed to assist quick search, easy download and fast swapping among viewers. We believe that our database will contribute to the future establishment of personalized medicine and increase our understanding of genetic factors underlying diseases.
Collapse
Affiliation(s)
- Yoko Nagai
- Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Kanagawa 259-1193, Japan
| | - Yasuko Takahashi
- Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Kanagawa 259-1193, Japan
| | - Tadashi Imanishi
- Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Kanagawa 259-1193, Japan Data Management and Integration Team, Molecular Profiling Research Center for Drug Discovery, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
| |
Collapse
|
4
|
Verschuren JJW, Trompet S, Sampietro ML, Heijmans BT, Koch W, Kastrati A, Houwing-Duistermaat JJ, Slagboom PE, Quax PHA, Jukema JW. Pathway analysis using genome-wide association study data for coronary restenosis--a potential role for the PARVB gene. PLoS One 2013; 8:e70676. [PMID: 23950981 PMCID: PMC3739784 DOI: 10.1371/journal.pone.0070676] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 06/21/2013] [Indexed: 12/20/2022] Open
Abstract
Background Coronary restenosis after percutaneous coronary intervention (PCI) still remains a significant limitation of the procedure. The causative mechanisms of restenosis have not yet been fully identified. The goal of the current study was to perform gene-set analysis of biological pathways related to inflammation, proliferation, vascular function and transcriptional regulation on coronary restenosis to identify novel genes and pathways related to this condition. Methods The GENetic DEterminants of Restenosis (GENDER) databank contains genotypic data of 556,099SNPs of 295 cases with restenosis and 571 matched controls. Fifty-four pathways, related to known restenosis-related processes, were selected. Gene-set analysis was performed using PLINK, GRASS and ALIGATOR software. Pathways with a p<0.01 were fine-mapped and significantly associated SNPs were analyzed in an independent replication cohort. Results Six pathways (cell-extracellular matrix (ECM) interactions pathway, IL2 signaling pathway, IL6 signaling pathway, platelet derived growth factor pathway, vitamin D receptor pathway and the mitochondria pathway) were significantly associated in one or two of the software packages. Two SNPs in the cell-ECM interactions pathway were replicated in an independent restenosis cohort. No replication was obtained for the other pathways. Conclusion With these results we demonstrate a potential role of the cell-ECM interactions pathway in the development of coronary restenosis. These findings contribute to the increasing knowledge of the genetic etiology of restenosis formation and could serve as a hypothesis-generating effort for further functional studies.
Collapse
Affiliation(s)
| | - Stella Trompet
- Department of Cardiology, Leiden University Medical Center, Leiden, The Netherlands
- Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, The Netherlands
| | - M. Lourdes Sampietro
- Department Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- Interuniversity Cardiology Institute of the Netherlands (ICIN), Utrecht, The Netherlands
| | - Bastiaan T. Heijmans
- Molecular Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, Leiden, The Netherlands
| | - Werner Koch
- Deutsches Herzzentrum München, Technische Universität München, Munich, Germany
| | - Adnan Kastrati
- Deutsches Herzzentrum München, Technische Universität München, Munich, Germany
| | | | - P. Eline Slagboom
- Molecular Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, Leiden, The Netherlands
| | - Paul H. A. Quax
- Department of Vascular Surgery, Leiden University Medical Center, Leiden, The Netherlands
| | - J. Wouter Jukema
- Department of Cardiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, Leiden, The Netherlands
- Durrer Center for Cardiogenetic Research, Amsterdam, The Netherlands
- Interuniversity Cardiology Institute of the Netherlands (ICIN), Utrecht, The Netherlands
- * E-mail:
| |
Collapse
|
5
|
Non-homologous end-joining pathway associated with occurrence of myocardial infarction: gene set analysis of genome-wide association study data. PLoS One 2013; 8:e56262. [PMID: 23457540 PMCID: PMC3574159 DOI: 10.1371/journal.pone.0056262] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Accepted: 01/07/2013] [Indexed: 01/28/2023] Open
Abstract
PURPOSE DNA repair deficiencies have been postulated to play a role in the development and progression of cardiovascular disease (CVD). The hypothesis is that DNA damage accumulating with age may induce cell death, which promotes formation of unstable plaques. Defects in DNA repair mechanisms may therefore increase the risk of CVD events. We examined whether the joints effect of common genetic variants in 5 DNA repair pathways may influence the risk of CVD events. METHODS The PLINK set-based test was used to examine the association to myocardial infarction (MI) of the DNA repair pathway in GWAS data of 866 subjects of the GENetic DEterminants of Restenosis (GENDER) study and 5,244 subjects of the PROspective Study of Pravastatin in the Elderly at Risk (PROSPER) study. We included the main DNA repair pathways (base excision repair, nucleotide excision repair, mismatch repair, homologous recombination and non-homologous end-joining (NHEJ)) in the analysis. RESULTS The NHEJ pathway was associated with the occurrence of MI in both GENDER (P = 0.0083) and PROSPER (P = 0.014). This association was mainly driven by genetic variation in the MRE11A gene (PGENDER = 0.0001 and PPROSPER = 0.002). The homologous recombination pathway was associated with MI in GENDER only (P = 0.011), for the other pathways no associations were observed. CONCLUSION This is the first study analyzing the joint effect of common genetic variation in DNA repair pathways and the risk of CVD events, demonstrating an association between the NHEJ pathway and MI in 2 different cohorts.
Collapse
|
6
|
Wijaya E, Frith MC, Horton P, Asai K. Finding protein-coding genes through human polymorphisms. PLoS One 2013; 8:e54210. [PMID: 23349826 PMCID: PMC3551959 DOI: 10.1371/journal.pone.0054210] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Accepted: 12/10/2012] [Indexed: 11/29/2022] Open
Abstract
Human gene catalogs are fundamental to the study of human biology and medicine. But they are all based on open reading frames (ORFs) in a reference genome sequence (with allowance for introns). Individual genomes, however, are polymorphic: their sequences are not identical. There has been much research on how polymorphism affects previously-identified genes, but no research has been done on how it affects gene identification itself. We computationally predict protein-coding genes in a straightforward manner, by finding long ORFs in mRNA sequences aligned to the reference genome. We systematically test the effect of known polymorphisms with this procedure. Polymorphisms can not only disrupt ORFs, they can also create long ORFs that do not exist in the reference sequence. We found 5,737 putative protein-coding genes that do not exist in the reference, whose protein-coding status is supported by homology to known proteins. On average 10% of these genes are located in the genomic regions devoid of annotated genes in 12 other catalogs. Our statistical analysis showed that these ORFs are unlikely to occur by chance.
Collapse
Affiliation(s)
- Edward Wijaya
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Japan.
| | | | | | | |
Collapse
|
7
|
Imanishi T, Nagai Y, Habara T, Yamasaki C, Takeda JI, Mikami S, Bando Y, Tojo H, Nishimura T. Full-length Transcriptome-based H-InvDB Throws a New Light on Chromosome-centric Proteomics. J Proteome Res 2012; 12:62-6. [DOI: 10.1021/pr300861a] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Affiliation(s)
- Tadashi Imanishi
- Biomedicinal Information Research
Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Department of Molecular Life
Science, Division of Basic Medical Science and Molecular Medicine, Tokai University, School of Medicine, Kanagawa, Japan
| | - Yoko Nagai
- Biomedicinal Information Research
Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Takuya Habara
- Biomedicinal Information Research
Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Chisato Yamasaki
- Biomedicinal Information Research
Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Jun-ichi Takeda
- Biomedicinal Information Research
Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | | | | | - Hiromasa Tojo
- Department of Biophysics and Biochemistry, Osaka University, Graduate School of Medicine, Osaka,
Japan
| | - Toshihide Nishimura
- Biosys Technologies, Inc., Tokyo, Japan
- Department of Surgery
I, Tokyo Medical University, Tokyo, Japan
| |
Collapse
|
8
|
Takeda JI, Yamasaki C, Murakami K, Nagai Y, Sera M, Hara Y, Obi N, Habara T, Gojobori T, Imanishi T. H-InvDB in 2013: an omics study platform for human functional gene and transcript discovery. Nucleic Acids Res 2012. [PMID: 23197657 PMCID: PMC3531145 DOI: 10.1093/nar/gks1245] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
H-InvDB (http://www.h-invitational.jp/) is a comprehensive human gene database started in 2004. In the latest version, H-InvDB 8.0, a total of 244 709 human complementary DNA was mapped onto the hg19 reference genome and 43 829 gene loci, including nonprotein-coding ones, were identified. Of these loci, 35 631 were identified as potential protein-coding genes, and 22 898 of these were identical to known genes. In our analysis, 19 309 annotated genes were specific to H-InvDB and not found in RefSeq and Ensembl. In fact, 233 genes of the 19 309 turned out to have protein functions in this version of H-InvDB; they were annotated as unknown protein functions in the previous version. Furthermore, 11 genes were identified as known Mendelian disorder genes. It is advantageous that many biologically functional genes are hidden in the H-InvDB unique genes. As large-scale proteomic projects have been conducted to elucidate the functions of all human proteins, we have enhanced the proteomic information with an advanced protein view and new subdatabase of protein complexes (Protein Complex Database with quality index). We propose that H-InvDB is an important resource for finding novel candidate targets for medical care and drug development.
Collapse
Affiliation(s)
- Jun-Ichi Takeda
- Integrated Database and Systems Biology Team, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, Aomi 2-4-7, Koto-ku, Tokyo 135-0064, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
The relationship between sequence polymorphisms and human disease has been studied mostly in terms of effects of single nucleotide polymorphisms (SNPs) leading to single amino acid substitutions that change protein structure and function. However, less attention has been paid to more drastic sequence polymorphisms which cause premature termination of a protein’s sequence or large changes, insertions, or deletions in the sequence. We have analyzed a large set (n = 512) of insertions and deletions (indels) and single nucleotide polymorphisms causing premature termination of translation in disease-related genes. Prediction of protein-destabilization effects was performed by graphical presentation of the locations of polymorphisms in the protein structure, using the Genomes TO Protein (GTOP) database, and manual annotation with a set of specific criteria. Protein-destabilization was predicted for 44.4% of the nonsense SNPs, 32.4% of the frameshifting indels, and 9.1% of the non-frameshifting indels. A prediction of nonsense-mediated decay allowed to infer which truncated proteins would actually be translated as defective proteins. These cases included the proteins linked to diseases inherited dominantly, suggesting a relation between these diseases and toxic aggregation. Our approach would be useful in identifying potentially aggregation-inducing polymorphisms that may have pathological effects.
Collapse
|
10
|
NovelSNPer: A Fast Tool for the Identification and Characterization of Novel SNPs and InDels. Adv Bioinformatics 2011; 2011:657341. [PMID: 22110502 PMCID: PMC3206323 DOI: 10.1155/2011/657341] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Accepted: 08/11/2011] [Indexed: 02/06/2023] Open
Abstract
Typically, next-generation resequencing projects produce large lists of variants. NovelSNPer is a software
tool that permits fast and efficient processing of such output lists. In a first step, NovelSNPer determines if a variant represents a known variant or a previously unknown variant. In a second step, each variant is classified into one of 15 SNP classes or 19 InDel classes. Beside the classes used by Ensembl, we introduce POTENTIAL_START_GAINED and START_LOST as new functional classes and present a classification scheme for InDels. NovelSNPer is based upon the gene structure information stored in Ensembl. It processes two million SNPs in six hours. The tool can be used online or downloaded.
Collapse
|
11
|
Castro-Chavez F. The rules of variation: amino acid exchange according to the rotating circular genetic code. J Theor Biol 2010; 264:711-21. [PMID: 20371250 PMCID: PMC3130497 DOI: 10.1016/j.jtbi.2010.03.046] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Revised: 03/06/2010] [Accepted: 03/30/2010] [Indexed: 12/11/2022]
Abstract
General guidelines for the molecular basis of functional variation are presented while focused on the rotating circular genetic code and allowable exchanges that make it resistant to genetic diseases under normal conditions. The rules of variation, bioinformatics aids for preventative medicine, are: (1) same position in the four quadrants for hydrophobic codons, (2) same or contiguous position in two quadrants for synonymous or related codons, and (3) same quadrant for equivalent codons. To preserve protein function, amino acid exchange according to the first rule takes into account the positional homology of essential hydrophobic amino acids with every codon with a central uracil in the four quadrants, the second rule includes codons for identical, acidic, or their amidic amino acids present in two quadrants, and the third rule, the smaller, aromatic, stop codons, and basic amino acids, each in proximity within a 90 degree angle. I also define codifying genes and palindromati, CTCGTGCCGAATTCGGCACGAG.
Collapse
|
12
|
Shimada MK, Hayakawa Y, Takeda JI, Gojobori T, Imanishi T. A comprehensive survey of human polymorphisms at conserved splice dinucleotides and its evolutionary relationship with alternative splicing. BMC Evol Biol 2010; 10:122. [PMID: 20433709 PMCID: PMC2882926 DOI: 10.1186/1471-2148-10-122] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Accepted: 04/30/2010] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Alternative splicing (AS) is a key molecular process that endows biological functions with diversity and complexity. Generally, functional redundancy leads to the generation of new functions through relaxation of selective pressure in evolution, as exemplified by duplicated genes. It is also known that alternatively spliced exons (ASEs) are subject to relaxed selective pressure. Within consensus sequences at the splice junctions, the most conserved sites are dinucleotides at both ends of introns (splice dinucleotides). However, a small number of single nucleotide polymorphisms (SNPs) occur at splice dinucleotides. An intriguing question relating to the evolution of AS diversity is whether mutations at splice dinucleotides are maintained as polymorphisms and produce diversity in splice patterns within the human population. We therefore surveyed validated SNPs in the database dbSNP located at splice dinucleotides of all human genes that are defined by the H-Invitational Database. RESULTS We found 212 validated SNPs at splice dinucleotides (sdSNPs); these were confirmed to be consistent with the GT-AG rule at either allele. Moreover, 53 of them were observed to neighbor ASEs (AE dinucleotides). No significant differences were observed between sdSNPs at AE dinucleotides and those at constitutive exons (CE dinucleotides) in SNP properties including average heterozygosity, SNP density, ratio of predicted alleles consistent with the GT-AG rule, and scores of splice sites formed with the predicted allele. We also found that the proportion of non-conserved exons was higher for exons with sdSNPs than for other exons. CONCLUSIONS sdSNPs are found at CE dinucleotides in addition to those at AE dinucleotides, suggesting two possibilities. First, sdSNPs at CE dinucleotides may be robust against sdSNPs because of unknown mechanisms. Second, similar to sdSNPs at AE dinucleotides, those at CE dinucleotides cause differences in AS patterns because of the arbitrariness in the classification of exons into alternative and constitutive type that varies according to the dataset. Taking into account the absence of differences in sdSNP properties between those at AE and CE dinucleotides, the increased proportion of non-conserved exons found in exons flanked by sdSNPs suggests the hypothesis that sdSNPs are maintained at the splice dinucleotides of newly generated exons at which negative selection pressure is relaxed.
Collapse
Affiliation(s)
- Makoto K Shimada
- Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo135-0064, Japan
- Japan Biological Informatics Consortium, 10F TIME24 Building, 2-45 Aomi, Koto-ku, Tokyo 135-0064, Japan
- Institute for Comprehensive Medical Science, Fujita Health University, 1-98 Dengakugakubo, Kutsukake-cho, Toyoake, Aichi 470-1192, Japan
| | - Yosuke Hayakawa
- Japan Biological Informatics Consortium, 10F TIME24 Building, 2-45 Aomi, Koto-ku, Tokyo 135-0064, Japan
- Hitachi Software Engineering Co., Ltd., 1-1-43 Suehirocho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Jun-ichi Takeda
- Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo135-0064, Japan
- Japan Biological Informatics Consortium, 10F TIME24 Building, 2-45 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | - Takashi Gojobori
- Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo135-0064, Japan
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| | - Tadashi Imanishi
- Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo135-0064, Japan
| |
Collapse
|
13
|
Silveira AC, Morrison MA, Ji F, Xu H, Reinecke JB, Adams SM, Arneberg TM, Janssian M, Lee JE, Yuan Y, Schaumberg DA, Kotoula MG, Tsironi EE, Tsiloulis AN, Chatzoulis DZ, Miller JW, Kim IK, Hageman GS, Farrer LA, Haider NB, DeAngelis MM. Convergence of linkage, gene expression and association data demonstrates the influence of the RAR-related orphan receptor alpha (RORA) gene on neovascular AMD: a systems biology based approach. Vision Res 2009; 50:698-715. [PMID: 19786043 DOI: 10.1016/j.visres.2009.09.016] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2009] [Revised: 09/04/2009] [Accepted: 09/18/2009] [Indexed: 12/28/2022]
Abstract
To identify novel genes and pathways associated with AMD, we performed microarray gene expression and linkage analysis which implicated the candidate gene, retinoic acid receptor-related orphan receptor alpha (RORA, 15q). Subsequent genotyping of 159 RORA single nucleotide polymorphisms (SNPs) in a family-based cohort, followed by replication in an unrelated case-control cohort, demonstrated that SNPs and haplotypes located in intron 1 were significantly associated with neovascular AMD risk in both cohorts. This is the first report demonstrating a possible role for RORA, a receptor for cholesterol, in the pathophysiology of AMD. Moreover, we found a significant interaction between RORA and the ARMS2/HTRA1 locus suggesting a novel pathway underlying AMD pathophysiology.
Collapse
Affiliation(s)
- Alexandra C Silveira
- Ocular Molecular Genetics Institute and Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Cookson W, Liang L, Abecasis G, Moffatt M, Lathrop M. Mapping complex disease traits with global gene expression. Nat Rev Genet 2009; 10:184-94. [PMID: 19223927 PMCID: PMC4550035 DOI: 10.1038/nrg2537] [Citation(s) in RCA: 602] [Impact Index Per Article: 40.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Variation in gene expression is an important mechanism underlying susceptibility to complex disease. The simultaneous genome-wide assay of gene expression and genetic variation allows the mapping of the genetic factors that underpin individual differences in quantitative levels of expression (expression QTLs; eQTLs). The availability of systematically generated eQTL information could provide immediate insight into a biological basis for disease associations identified through genome-wide association (GWA) studies, and can help to identify networks of genes involved in disease pathogenesis. Although there are limitations to current eQTL maps, understanding of disease will be enhanced with novel technologies and international efforts that extend to a wide range of new samples and tissues.
Collapse
Affiliation(s)
- William Cookson
- National Heart and Lung Institute, Imperial College London, SW3 6LY, England
| | - Liming Liang
- Center for Statistical Genetics, Dept. of Biostatistics, SPH II, Ann Arbor, MI 48109-2029, USA
| | - Gonçalo Abecasis
- Center for Statistical Genetics, Dept. of Biostatistics, SPH II, Ann Arbor, MI 48109-2029, USA
| | - Miriam Moffatt
- National Heart and Lung Institute, Imperial College London, SW3 6LY, England
| | - Mark Lathrop
- CEA/Centre National de Genotypage, 91057 Evry, France
| |
Collapse
|