1
|
Adamoski D, M Dos Reis L, Mafra ACP, Corrêa-da-Silva F, Moraes-Vieira PMMD, Berindan-Neagoe I, Calin GA, Dias SMG. HuR controls glutaminase RNA metabolism. Nat Commun 2024; 15:5620. [PMID: 38965208 PMCID: PMC11224379 DOI: 10.1038/s41467-024-49874-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 06/21/2024] [Indexed: 07/06/2024] Open
Abstract
Glutaminase (GLS) is directly related to cell growth and tumor progression, making it a target for cancer treatment. The RNA-binding protein HuR (encoded by the ELAVL1 gene) influences mRNA stability and alternative splicing. Overexpression of ELAVL1 is common in several cancers, including breast cancer. Here we show that HuR regulates GLS mRNA alternative splicing and isoform translation/stability in breast cancer. Elevated ELAVL1 expression correlates with high levels of the glutaminase isoforms C (GAC) and kidney-type (KGA), which are associated with poor patient prognosis. Knocking down ELAVL1 reduces KGA and increases GAC levels, enhances glutamine anaplerosis into the TCA cycle, and drives cells towards glutamine dependence. Furthermore, we show that combining chemical inhibition of GLS with ELAVL1 silencing synergistically decreases breast cancer cell growth and invasion. These findings suggest that dual inhibition of GLS and HuR offers a therapeutic strategy for breast cancer treatment.
Collapse
Affiliation(s)
- Douglas Adamoski
- Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Sao Paulo, Brazil
- Graduate Program in Genetics and Molecular Biology, Institute of Biology University of Campinas (UNICAMP), Campinas, Sao Paulo, Brazil
| | - Larissa M Dos Reis
- Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Sao Paulo, Brazil
- Graduate Program in Genetics and Molecular Biology, Institute of Biology University of Campinas (UNICAMP), Campinas, Sao Paulo, Brazil
- Department of Genetics, Evolution, Microbiology, and Immunology, Laboratory of Immunometabolism, Institute of Biology, University of Campinas-UNICAMP, Campinas, SP, Brazil
| | - Ana Carolina Paschoalini Mafra
- Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Sao Paulo, Brazil
- Graduate Program in Genetics and Molecular Biology, Institute of Biology University of Campinas (UNICAMP), Campinas, Sao Paulo, Brazil
- Department of Radiation Oncology, Washington University School of Medicine, S. Louis, MO, USA
| | - Felipe Corrêa-da-Silva
- Graduate Program in Genetics and Molecular Biology, Institute of Biology University of Campinas (UNICAMP), Campinas, Sao Paulo, Brazil
- Department of Genetics, Evolution, Microbiology, and Immunology, Laboratory of Immunometabolism, Institute of Biology, University of Campinas-UNICAMP, Campinas, SP, Brazil
| | - Pedro Manoel Mendes de Moraes-Vieira
- Department of Genetics, Evolution, Microbiology, and Immunology, Laboratory of Immunometabolism, Institute of Biology, University of Campinas-UNICAMP, Campinas, SP, Brazil
| | - Ioana Berindan-Neagoe
- Research Center for Functional Genomics, Biomedicine and Translational Medicine, University of Medicine and Pharmacy "Iuliu-Hatieganu", Cluj-Napoca, Romania
| | - George A Calin
- Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Center for RNA Inference and Non-Coding RNAs, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Sandra Martha Gomes Dias
- Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Sao Paulo, Brazil.
| |
Collapse
|
2
|
Nakai R, Yokota T, Tokunaga M, Takaishi M, Yokomizo T, Sudo T, Shi H, Yasumizu Y, Okuzaki D, Kokubu C, Tanaka S, Takaoka K, Yamanishi A, Yoshida J, Watanabe H, Kondoh G, Horie K, Hosen N, Sano S, Takeda J. A newly identified gene Ahed plays essential roles in murine haematopoiesis. Nat Commun 2024; 15:5090. [PMID: 38918373 PMCID: PMC11199565 DOI: 10.1038/s41467-024-49252-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 05/29/2024] [Indexed: 06/27/2024] Open
Abstract
The development of haematopoiesis involves the coordinated action of numerous genes, some of which are implicated in haematological malignancies. However, the biological function of many genes remains elusive and unknown functional genes are likely to remain to be uncovered. Here, we report a previously uncharacterised gene in haematopoiesis, identified by screening mutant embryonic stem cells. The gene, 'attenuated haematopoietic development (Ahed)', encodes a nuclear protein. Conditional knockout (cKO) of Ahed results in anaemia from embryonic day 14.5 onward, leading to prenatal demise. Transplantation experiments demonstrate the incapacity of Ahed-deficient haematopoietic cells to reconstitute haematopoiesis in vivo. Employing a tamoxifen-inducible cKO model, we further reveal that Ahed deletion impairs the intrinsic capacity of haematopoietic cells in adult mice. Ahed deletion affects various pathways, and published databases present cancer patients with somatic mutations in Ahed. Collectively, our findings underscore the fundamental roles of Ahed in lifelong haematopoiesis, implicating its association with malignancies.
Collapse
Affiliation(s)
- Ritsuko Nakai
- Department of Haematology and Oncology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Takafumi Yokota
- Department of Haematology and Oncology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan.
- Department of Haematology, Osaka International Cancer Institute, Osaka, Osaka, 541-8567, Japan.
| | - Masahiro Tokunaga
- Department of Haematology, Suita Municipal Hospital, Suita, Osaka, 564-0018, Japan
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Mikiro Takaishi
- Department of Dermatology, Kochi Medical School, Kochi University, Nankoku, Kochi, 783-8505, Japan
| | - Tomomasa Yokomizo
- Department of Microscopic and Developmental Anatomy, Tokyo Women's Medical University, Shinjuku-ku, Tokyo, 162-8666, Japan
| | - Takao Sudo
- Department of Haematology and Oncology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
- Department of Haematology, National Hospital Organisation Osaka National Hospital, Osaka, Osaka, 540-0006, Japan
| | - Henyun Shi
- Department of Haematology and Oncology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Yoshiaki Yasumizu
- Department of Experimental Immunology, Immunology Frontier Research Centre, Osaka University, Suita, Osaka, 565-0871, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Daisuke Okuzaki
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Osaka, 565-0871, Japan
- Genome Information Research Centre, Research Institute for Microbial Diseases, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Chikara Kokubu
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Sachiyo Tanaka
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Katsuyoshi Takaoka
- Developmental Genetics Group, Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Ayako Yamanishi
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Junko Yoshida
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
- Department of Physiology II, Nara Medical University, Kashihara, Nara, 634-8521, Japan
| | - Hitomi Watanabe
- Laboratory of Animal Experiments for Regeneration, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Kyoto, 606-8507, Japan
| | - Gen Kondoh
- Laboratory of Animal Experiments for Regeneration, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Kyoto, 606-8507, Japan
| | - Kyoji Horie
- Department of Genome Biology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
- Department of Physiology II, Nara Medical University, Kashihara, Nara, 634-8521, Japan
| | - Naoki Hosen
- Department of Haematology and Oncology, Graduate School of Medicine, Osaka University, Suita, Osaka, 565-0871, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Osaka, 565-0871, Japan
- Laboratory of Cellular Immunotherapy, World Premier International Immunology Frontier Research Centre, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Shigetoshi Sano
- Department of Dermatology, Kochi Medical School, Kochi University, Nankoku, Kochi, 783-8505, Japan
| | - Junji Takeda
- Research Institute for Microbial Diseases, Osaka University, Suita, Osaka, 565-0871, Japan.
| |
Collapse
|
3
|
Kumar R, Duzan J, Drake E, Dawson J. A novel heterozygous frameshift c.277del p.Thr93Leufs*21 mutation in SERPINC1 associated with type 1 antithrombin deficiency. Pediatr Blood Cancer 2024; 71:e30986. [PMID: 38563157 DOI: 10.1002/pbc.30986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 03/11/2024] [Accepted: 03/18/2024] [Indexed: 04/02/2024]
Affiliation(s)
- Riten Kumar
- Dana Farber/Boston Children's Cancer and Blood Disorders Center, Boston, Massachusetts, USA
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
| | - Juliann Duzan
- Dana Farber/Boston Children's Cancer and Blood Disorders Center, Boston, Massachusetts, USA
| | - Emily Drake
- Dana Farber/Boston Children's Cancer and Blood Disorders Center, Boston, Massachusetts, USA
| | - Jennifer Dawson
- Genomic Medicine, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
| |
Collapse
|
4
|
Kundu S, Nunes L, Adler J, Mathot L, Stoimenov I, Sjöblom T. Recurring EPHB1 mutations in human cancers alter receptor signalling and compartmentalisation of colorectal cancer cells. Cell Commun Signal 2023; 21:354. [PMID: 38102712 PMCID: PMC10722860 DOI: 10.1186/s12964-023-01378-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 11/01/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND Ephrin (EPH) receptors have been implicated in tumorigenesis and metastasis, but the functional understanding of mutations observed in human cancers is limited. We previously demonstrated reduced cell compartmentalisation for somatic EPHB1 mutations found in metastatic colorectal cancer cases. We therefore integrated pan-cancer and pan-EPH mutational data to prioritise recurrent EPHB1 mutations for functional studies to understand their contribution to cancer development and metastasis. METHODS Here, 79,151 somatic mutations in 9,898 samples of 33 different tumour types were analysed with a bioinformatic pipeline to find 3D-mutated cluster pairs and hotspot mutations in EPH receptors. From these, 15 recurring EPHB1 mutations were stably expressed in colorectal cancer followed by confocal microscopy based in vitro compartmentalisation assays and phospho-proteome analysis. RESULTS The 3D-protein structure-based bioinformatics analysis resulted in 63% EPHB1 mutants with compartmentalisation phenotypes vs 43% for hotspot mutations. Whereas the ligand-binding domain mutations C61Y, R90C, and R170W, the fibronectin domain mutation R351L, and the kinase domain mutation D762N displayed reduced to strongly compromised cell compartmentalisation, the kinase domain mutations R743W and G821R enhanced this phenotype. While mutants with reduced compartmentalisation also had reduced ligand induced receptor phosphorylation, the enhanced compartmentalisation was not linked to receptor phosphorylation level. Phosphoproteome mapping pinpointed the PI3K pathway and PIK3C2B phosphorylation in cells harbouring mutants with reduced compartmentalisation. CONCLUSIONS This is the first integrative study of pan-cancer EPH receptor mutations followed by in vitro validation, a robust way to identify cancer-causing mutations, uncovering EPHB1 mutation phenotypes and demonstrating the utility of protein structure-based mutation analysis in characterization of novel cancer genes. Video Abstract.
Collapse
Affiliation(s)
- Snehangshu Kundu
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Luís Nunes
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Jeremy Adler
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Lucy Mathot
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ivaylo Stoimenov
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Tobias Sjöblom
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
| |
Collapse
|
5
|
Xie MJ, Cromie GA, Owens K, Timour MS, Tang M, Kutz JN, El-Hattab AW, McLaughlin RN, Dudley AM. Constructing and interpreting a large-scale variant effect map for an ultrarare disease gene: Comprehensive prediction of the functional impact of PSAT1 genotypes. PLoS Genet 2023; 19:e1010972. [PMID: 37812589 PMCID: PMC10561871 DOI: 10.1371/journal.pgen.1010972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 09/13/2023] [Indexed: 10/11/2023] Open
Abstract
Reduced activity of the enzymes encoded by PHGDH, PSAT1, and PSPH causes a set of ultrarare, autosomal recessive diseases known as serine biosynthesis defects. These diseases present in a broad phenotypic spectrum: at the severe end is Neu-Laxova syndrome, in the intermediate range are infantile serine biosynthesis defects with severe neurological manifestations and growth deficiency, and at the mild end is childhood disease with intellectual disability. However, L-serine supplementation, especially if started early, can ameliorate and in some cases even prevent symptoms. Therefore, knowledge of pathogenic variants can improve clinical outcomes. Here, we use a yeast-based assay to individually measure the functional impact of 1,914 SNV-accessible amino acid substitutions in PSAT. Results of our assay agree well with clinical interpretations and protein structure-function relationships, supporting the inclusion of our data as functional evidence as part of the ACMG variant interpretation guidelines. We use existing ClinVar variants, disease alleles reported in the literature and variants present as homozygotes in the primAD database to define assay ranges that could aid clinical variant interpretation for up to 98% of the tested variants. In addition to measuring the functional impact of individual variants in yeast haploid cells, we also assay pairwise combinations of PSAT1 alleles that recapitulate human genotypes, including compound heterozygotes, in yeast diploids. Results from our diploid assay successfully distinguish the genotypes of affected individuals from those of healthy carriers and agree well with disease severity. Finally, we present a linear model that uses individual allele measurements to predict the biallelic function of ~1.8 million allele combinations corresponding to potential human genotypes. Taken together, our work provides an example of how large-scale functional assays in model systems can be powerfully applied to the study of ultrarare diseases.
Collapse
Affiliation(s)
- Michael J. Xie
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
- Molecular Engineering Graduate Program, University of Washington, Seattle, Washington, United States of America
| | - Gareth A. Cromie
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
| | - Katherine Owens
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
- Department of Applied Mathematics, University of Washington, Seattle, Washington, United States of America
| | - Martin S. Timour
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
| | - Michelle Tang
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
| | - J. Nathan Kutz
- Department of Applied Mathematics, University of Washington, Seattle, Washington, United States of America
| | - Ayman W. El-Hattab
- Department of Clinical Sciences, College of Medicine, University of Sharjah, Sharjah, United Arab Emirates
| | | | - Aimée M. Dudley
- Pacific Northwest Research Institute, Seattle, Washington, United States of America
- Molecular Engineering Graduate Program, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
6
|
Dede M, Hart T. Recovering false negatives in CRISPR fitness screens with JLOE. Nucleic Acids Res 2023; 51:1637-1651. [PMID: 36727483 PMCID: PMC9976895 DOI: 10.1093/nar/gkad046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 01/09/2023] [Accepted: 01/16/2023] [Indexed: 02/03/2023] Open
Abstract
It is widely accepted that pooled library CRISPR knockout screens offer greater sensitivity and specificity than prior technologies in detecting genes whose disruption leads to fitness defects, a critical step in identifying candidate cancer targets. However, the assumption that CRISPR screens are saturating has been largely untested. Through integrated analysis of screen data in cancer cell lines generated by the Cancer Dependency Map, we show that a typical CRISPR screen has a ∼20% false negative rate, in addition to library-specific false negatives. Replicability falls sharply as gene expression decreases, while cancer subtype-specific genes within a tissue show distinct profiles compared to false negatives. Cumulative analyses across tissues improves our understanding of core essential genes and suggest only a small number of lineage-specific essential genes, enriched for transcription factors that define pathways of tissue differentiation. To recover false negatives, we introduce a method, Joint Log Odds of Essentiality (JLOE), which builds on our prior work with BAGEL to selectively rescue the false negatives without an increased false discovery rate.
Collapse
Affiliation(s)
- Merve Dede
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Traver Hart
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.,Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
7
|
Xie MJ, Cromie GA, Owens K, Timour MS, Tang M, Kutz JN, El-Hattab AW, McLaughlin RN, Dudley AM. Predicting the functional effect of compound heterozygous genotypes from large scale variant effect maps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.11.523651. [PMID: 36711904 PMCID: PMC9882023 DOI: 10.1101/2023.01.11.523651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Background Pathogenic variants in PHGDH, PSAT1 , and PSPH cause a set of rare, autosomal recessive diseases known as serine biosynthesis defects. Serine biosynthesis defects present in a broad phenotypic spectrum that includes, at the severe end, Neu-Laxova syndrome, a lethal multiple congenital anomaly disease, intermediately in the form of infantile serine biosynthesis defects with severe neurological manifestations and growth deficiency, and at the mild end, as childhood disease with intellectual disability. However, because L-serine supplementation, especially if started early, can ameliorate and in some cases even prevent symptoms, knowledge of pathogenic variants is highly actionable. Methods Recently, our laboratory established a yeast-based assay for human PSAT1 function. We have now applied it at scale to assay the functional impact of 1,914 SNV-accessible amino acid substitutions. In addition to assaying the functional impact of individual variants in yeast haploid cells, we can assay pairwise combinations of PSAT1 alleles that recapitulate human genotypes, including compound heterozygotes, in yeast diploids. Results Results of our assays of individual variants (in haploid yeast cells) agree well with clinical interpretations and protein structure-function relationships, supporting the use of our data as functional evidence under the ACMG interpretation guidelines. Results from our diploid assay successfully distinguish patient genotypes from those of healthy carriers and agree well with disease severity. Finally, we present a linear model that uses individual allele measurements (in haploid yeast cells) to accurately predict the biallelic function (in diploid yeast cells) of ~ 1.8 million allele combinations corresponding to potential human genotypes. Conclusions Taken together, our work provides an example of how large-scale functional assays in model systems can be powerfully applied to the study of a rare disease.
Collapse
|
8
|
Guillaudeux N, Belleannée C, Blanquart S. Identifying genes with conserved splicing structure and orthologous isoforms in human, mouse and dog. BMC Genomics 2022; 23:216. [PMID: 35303798 PMCID: PMC8933948 DOI: 10.1186/s12864-022-08429-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 02/07/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In eukaryote transcriptomes, a significant amount of transcript diversity comes from genes' capacity to generate different transcripts through alternative splicing. Identifying orthologous alternative transcripts across multiple species is of particular interest for genome annotators. However, there is no formal definition of transcript orthology based on the splicing structure conservation. Likewise there is no public dataset benchmark providing groups of orthologous transcripts sharing a conserved splicing structure. RESULTS We introduced a formal definition of splicing structure orthology and we predicted transcript orthologs in human, mouse and dog. Applying a selective strategy, we analyzed 2,167 genes and their 18,109 known transcripts and identified a set of 253 gene orthologs that shared a conserved splicing structure in all three species. We predicted 6,861 transcript CDSs (coding sequence), mainly for dog, an emergent model species. Each predicted transcript was an ortholog of a known transcript: both share the same CDS splicing structure. Evidence for the existence of the predicted CDSs was found in external data. CONCLUSIONS We generated a dataset of 253 gene triplets, structurally conserved and sharing all their CDSs in human, mouse and dog, which correspond to 879 triplets of spliced CDS orthologs. We have released the dataset both as an SQL database and as tabulated files. The data consists of the 879 CDS orthology groups with their detailed splicing structures, and the predicted CDSs, associated with their experimental evidence. The 6,861 predicted CDSs are provided in GTF files. Our data may contribute to compare highly conserved genes across three species, for comparative transcriptomics at the isoform level, or for benchmarking splice aligners and methods focusing on the identification of splicing orthologs. The data is available at https://data-access.cesgo.org/index.php/s/V97GXxOS66NqTkZ .
Collapse
|
9
|
Shirota M, Kinoshita K. Current status and future perspectives of the evaluation of missense variants by using three-dimensional structures of proteins. Biophys Physicobiol 2022; 19:e190023. [PMID: 36071878 PMCID: PMC9402263 DOI: 10.2142/biophysico.bppb-v19.0023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 07/12/2022] [Indexed: 12/01/2022] Open
|
10
|
Tanoli Z, Seemab U, Scherer A, Wennerberg K, Tang J, Vähä-Koskela M. Exploration of databases and methods supporting drug repurposing: a comprehensive survey. Brief Bioinform 2021; 22:1656-1678. [PMID: 32055842 PMCID: PMC7986597 DOI: 10.1093/bib/bbaa003] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 12/09/2019] [Indexed: 02/07/2023] Open
Abstract
Drug development involves a deep understanding of the mechanisms of action and possible side effects of each drug, and sometimes results in the identification of new and unexpected uses for drugs, termed as drug repurposing. Both in case of serendipitous observations and systematic mechanistic explorations, confirmation of new indications for a drug requires hypothesis building around relevant drug-related data, such as molecular targets involved, and patient and cellular responses. These datasets are available in public repositories, but apart from sifting through the sheer amount of data imposing computational bottleneck, a major challenge is the difficulty in selecting which databases to use from an increasingly large number of available databases. The database selection is made harder by the lack of an overview of the types of data offered in each database. In order to alleviate these problems and to guide the end user through the drug repurposing efforts, we provide here a survey of 102 of the most promising and drug-relevant databases reported to date. We summarize the target coverage and types of data available in each database and provide several examples of how multi-database exploration can facilitate drug repurposing.
Collapse
Affiliation(s)
- Ziaurrehman Tanoli
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| | - Umair Seemab
- Haartman Institute, University of Helsinki, Finland
| | - Andreas Scherer
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| | - Krister Wennerberg
- Biotech Research & Innovation Centre (BRIC), University of Copenhagen, Denmark
| | - Jing Tang
- Faculty of medicine, University of Helsinki, Finland
| | - Markus Vähä-Koskela
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| |
Collapse
|
11
|
Zhang X, van Rooij JGJ, Wakabayashi Y, Hwang SJ, Yang Y, Ghanbari M, Bos D, Levy D, Johnson AD, van Meurs JBJ, Kavousi M, Zhu J, O'Donnell CJ. Genome-wide transcriptome study using deep RNA sequencing for myocardial infarction and coronary artery calcification. BMC Med Genomics 2021; 14:45. [PMID: 33568140 PMCID: PMC7874462 DOI: 10.1186/s12920-020-00838-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 11/29/2020] [Indexed: 12/13/2022] Open
Abstract
Background Coronary artery calcification (CAC) is a noninvasive measure of coronary atherosclerosis, the proximal pathophysiology underlying most cases of myocardial infarction (MI). We sought to identify expression signatures of early MI and subclinical atherosclerosis in the Framingham Heart Study (FHS). In this study, we conducted paired-end RNA sequencing on whole blood collected from 198 FHS participants (55 with a history of early MI, 72 with high CAC without prior MI, and 71 controls free of elevated CAC levels or history of MI). We applied DESeq2 to identify coding-genes and long intergenic noncoding RNAs (lincRNAs) differentially expressed in MI and high CAC, respectively, compared with the control. Results On average, 150 million paired-end reads were obtained for each sample. At the false discovery rate (FDR) < 0.1, we found 68 coding genes and 2 lincRNAs that were differentially expressed in early MI versus controls. Among them, 60 coding genes were detectable and thus tested in an independent RNA-Seq data of 807 individuals from the Rotterdam Study, and 8 genes were supported by p value and direction of the effect. Immune response, lipid metabolic process, and interferon regulatory factor were enriched in these 68 genes. By contrast, only 3 coding genes and 1 lincRNA were differentially expressed in high CAC versus controls. APOD, encoding a component of high-density lipoprotein, was significantly downregulated in both early MI (FDR = 0.007) and high CAC (FDR = 0.01) compared with controls. Conclusions We identified transcriptomic signatures of early MI that include differentially expressed protein-coding genes and lincRNAs, suggesting important roles for protein-coding genes and lincRNAs in the pathogenesis of MI.
Collapse
Affiliation(s)
- Xiaoling Zhang
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD, USA. .,The National Heart, Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA. .,Department of Medicine (Biomedical Genetics), Boston University School of Medicine, 72 East Concord Street, Boston, MA, 02118-2526, USA. .,Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA.
| | - Jeroen G J van Rooij
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Yoshiyuki Wakabayashi
- DNA Sequencing and Genomics Core, National Heart, Lung and Blood Institute, Bethesda, MD, USA
| | - Shih-Jen Hwang
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD, USA.,The National Heart, Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Yanqin Yang
- DNA Sequencing and Genomics Core, National Heart, Lung and Blood Institute, Bethesda, MD, USA
| | - Mohsen Ghanbari
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Daniel Bos
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands.,Department of Radiology and Nuclear Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | | | - Daniel Levy
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD, USA.,The National Heart, Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Andrew D Johnson
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD, USA.,The National Heart, Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA
| | - Joyce B J van Meurs
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Maryam Kavousi
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Jun Zhu
- DNA Sequencing and Genomics Core, National Heart, Lung and Blood Institute, Bethesda, MD, USA
| | - Christopher J O'Donnell
- Division of Intramural Research, National Heart, Lung and Blood Institute, Bethesda, MD, USA. .,The National Heart, Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA. .,Cardiology Section, Veteran's Administration Boston Healthcare System, Boston, USA.
| |
Collapse
|
12
|
Guo W, Wang Y, Yang M, Wang Z, Wang Y, Chaurasia S, Wu Z, Zhang M, Yadav GS, Rathod S, Concha-Benavente F, Fernandez C, Li S, Xie W, Ferris RL, Kammula US, Lu B, Yang D. LincRNA-immunity landscape analysis identifies EPIC1 as a regulator of tumor immune evasion and immunotherapy resistance. SCIENCE ADVANCES 2021; 7:eabb3555. [PMID: 33568470 PMCID: PMC7875530 DOI: 10.1126/sciadv.abb3555] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 12/23/2020] [Indexed: 05/11/2023]
Abstract
Through an integrative analysis of the lincRNA expression and tumor immune response in 9,626 tumor samples across 32 cancer types, we developed a lincRNA-based immune response (LIMER) score that can predict the immune cells infiltration and patient prognosis in multiple cancer types. Our analysis also identified tumor-specific lincRNAs, including EPIC1, that potentially regulate tumor immune response in multiple cancer types. Immunocompetent mouse models and in vitro co-culture assays demonstrated that EPIC1 induces tumor immune evasion and resistance to immunotherapy by suppressing tumor cell antigen presentation. Mechanistically, lincRNA EPIC1 interacts with the histone methyltransferase EZH2, leading to the epigenetic silencing of IFNGR1, TAP1/2, ERAP1/2, and MHC-I genes. Genetic and pharmacological inhibition of EZH2 abolish EPIC1's immune-related oncogenic effect and its suppression of interferon-γ signaling. The EPIC1-EZH2 axis emerges as a potential mechanism for tumor immune evasion that can serve as therapeutic targets for immunotherapy.
Collapse
Affiliation(s)
- Weiwei Guo
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yue Wang
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Min Yang
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Zehua Wang
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yifei Wang
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Smriti Chaurasia
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
- Division of Surgical Oncology, Department of Surgery, University of Pittsburgh School of Medicine, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15213, USA
| | - Zhiyuan Wu
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Min Zhang
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Ghanshyam Singh Yadav
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
- Division of Surgical Oncology, Department of Surgery, University of Pittsburgh School of Medicine, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15213, USA
| | - Sanjay Rathod
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Fernando Concha-Benavente
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA 15261, USA
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Christian Fernandez
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Song Li
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Wen Xie
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Robert L Ferris
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA 15261, USA
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Udai S Kammula
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
- Division of Surgical Oncology, Department of Surgery, University of Pittsburgh School of Medicine, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15213, USA
| | - Binfeng Lu
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Da Yang
- Center for Pharmacogenetics, Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA 15261, USA.
- UPMC Hillman Cancer Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15261, USA
| |
Collapse
|
13
|
Zhu L, Yang X, Zhu R, Yu L. Identifying Discriminative Biological Function Features and Rules for Cancer-Related Long Non-coding RNAs. Front Genet 2021; 11:598773. [PMID: 33391350 PMCID: PMC7772407 DOI: 10.3389/fgene.2020.598773] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 11/23/2020] [Indexed: 01/17/2023] Open
Abstract
Cancer has been a major public health problem worldwide for many centuries. Cancer is a complex disease associated with accumulative genetic mutations, epigenetic aberrations, chromosomal instability, and expression alteration. Increasing lines of evidence suggest that many non-coding transcripts, which are termed as non-coding RNAs, have important regulatory roles in cancer. In particular, long non-coding RNAs (lncRNAs) play crucial roles in tumorigenesis. Cancer-related lncRNAs serve as oncogenic factors or tumor suppressors. Although many lncRNAs are identified as potential regulators in tumorigenesis by using traditional experimental methods, they are time consuming and expensive considering the tremendous amount of lncRNAs needed. Thus, effective and fast approaches to recognize tumor-related lncRNAs should be developed. The proposed approach should help us understand not only the mechanisms of lncRNAs that participate in tumorigenesis but also their satisfactory performance in distinguishing cancer-related lncRNAs. In this study, we utilized a decision tree (DT), a type of rule learning algorithm, to investigate cancer-related lncRNAs with functional annotation contents [gene ontology (GO) terms and KEGG pathways] of their co-expressed genes. Cancer-related and other lncRNAs encoded by the key enrichment features of GO and KEGG filtered by feature selection methods were used to build an informative DT, which further induced several decision rules. The rules provided not only a new tool for identifying cancer-related lncRNAs but also connected the lncRNAs and cancers with the combinations of GO terms. Results provided new directions for understanding cancer-related lncRNAs.
Collapse
Affiliation(s)
- Liucun Zhu
- School of Life Sciences, Shanghai University, Shanghai, China
| | - Xin Yang
- School of Life Sciences, Shanghai University, Shanghai, China
| | - Rui Zhu
- School of Life Sciences, Shanghai University, Shanghai, China
| | - Lei Yu
- Department of Medical Oncology, Shanghai Concord Medical Cancer Center, Shanghai, China
| |
Collapse
|
14
|
Peng H, Huang R, Wang K, Wang C, Li B, Guo Y, Li M, Zhang D, Dong H, Chen H, Chen C, Xu Q, Li F, Tian L, Wu J. Development and Validation of an RNA Sequencing Assay for Gene Fusion Detection in Formalin-Fixed, Paraffin-Embedded Tumors. J Mol Diagn 2020; 23:223-233. [PMID: 33271368 DOI: 10.1016/j.jmoldx.2020.11.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 10/10/2020] [Accepted: 11/06/2020] [Indexed: 12/31/2022] Open
Abstract
RNA sequencing (RNA-seq) is a well-validated tool for detecting gene fusions in fresh-frozen tumors; however, RNA-seq is much more challenging to use with formalin-fixed, paraffin-embedded (FFPE) tumor samples. We evaluated the performance of RNA-seq to detect gene fusions in clinical FFPE tumor samples. Our assay identified all 15 spiked-in NTRK fusions from RNA reference material and six known fusions from five cancer cell lines. Limit of detection for the assay was assessed with a series of dilutions of RNA from the cell line H2228. These fusions can be detected when the dilution is down to 10%. Good intra-assay and interassay reproducibility was observed in three specimens. For clinical validation, the assay detected 10 of 12 fusions initially identified by a DNA panel (covering 23 gene fusions) in clinical specimens (83.3% sensitivity), whereas one fusion (MET fusion) was identified in another 34 fusion-negative tumor specimens as determined by the DNA panel (negative prediction value of 94.3%). This MET fusion was confirmed by RT-PCR and Sanger sequencing, which found that this is a false-negative result for the DNA panel. The assay also identified 26 extra fusions not covered by the DNA panel, 20 (76.9%) of which were validated by RT-PCR and Sanger sequencing. Therefore, this RNA assay has reasonable performance and could complement DNA-based next-generation sequencing assays.
Collapse
Affiliation(s)
- Hao Peng
- The First People's Hospital of Yunnan Province, Kunming, China
| | - Rong Huang
- The First People's Hospital of Foshan, Foshan, China
| | - Kui Wang
- Department of Hepatic Surgery (II), Eastern Hepatobiliary Surgery Hospital, Navy Medical University (The Second Military Medical University), Shanghai, China
| | - Cuiyun Wang
- The R&D Center, 3D Medicines Inc., Shanghai, China
| | - Bin Li
- The Bioinformatics Department, 3D Medicines Inc., Shanghai, China
| | - Youbing Guo
- The Bioinformatics Department, 3D Medicines Inc., Shanghai, China
| | - Meng Li
- Cancer Center, The First Affiliated Hospital of Guangzhou University of Traditional Chinese Medicine, Guangzhou, China
| | - Dadong Zhang
- The R&D Center, 3D Medicines Inc., Shanghai, China
| | - Hua Dong
- The Bioinformatics Department, 3D Medicines Inc., Shanghai, China
| | - Hao Chen
- The Bioinformatics Department, 3D Medicines Inc., Shanghai, China
| | - Caifu Chen
- The R&D Center, 3D Medicines Inc., Shanghai, China
| | - Qing Xu
- The R&D Center, 3D Medicines Inc., Shanghai, China
| | - Fugen Li
- The Bioinformatics Department, 3D Medicines Inc., Shanghai, China
| | - Lei Tian
- Department of Thoracic Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.
| | - Jianbing Wu
- Department of Oncology, Jiangxi Key Laboratory of Clinical and Translational Cancer Research, The Second Affiliated Hospital of Nanchang University, Jiangxi, China.
| |
Collapse
|
15
|
Dede M, McLaughlin M, Kim E, Hart T. Multiplex enCas12a screens detect functional buffering among paralogs otherwise masked in monogenic Cas9 knockout screens. Genome Biol 2020; 21:262. [PMID: 33059726 PMCID: PMC7558751 DOI: 10.1186/s13059-020-02173-2] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 09/30/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Pooled library CRISPR/Cas9 knockout screening across hundreds of cell lines has identified genes whose disruption leads to fitness defects, a critical step in identifying candidate cancer targets. However, the number of essential genes detected from these monogenic knockout screens is low compared to the number of constitutively expressed genes in a cell. RESULTS Through a systematic analysis of screen data in cancer cell lines generated by the Cancer Dependency Map, we observe that half of all constitutively expressed genes are never detected in any CRISPR screen and that these never-essentials are highly enriched for paralogs. We investigated functional buffering among approximately 400 candidate paralog pairs using CRISPR/enCas12a dual-gene knockout screening in three cell lines. We observe 24 synthetic lethal paralog pairs that have escaped detection by monogenic knockout screens at stringent thresholds. Nineteen of 24 (79%) synthetic lethal interactions are present in at least two out of three cell lines and 14 of 24 (58%) are present in all three cell lines tested, including alternate subunits of stable protein complexes as well as functionally redundant enzymes. CONCLUSIONS Together, these observations strongly suggest that functionally redundant paralogs represent a targetable set of genetic dependencies that are systematically under-represented among cell-essential genes in monogenic CRISPR-based loss of function screens.
Collapse
Affiliation(s)
- Merve Dede
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Graduate School of Biological Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Megan McLaughlin
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Graduate School of Biological Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Eiru Kim
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Traver Hart
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| |
Collapse
|
16
|
Chen S, Wang G, Zheng X, Ge S, Dai Y, Ping P, Chen X, Liu G, Zhang J, Yang Y, Zhang X, Zhong A, Zhu Y, Chu Q, Huang Y, Zhang Y, Shen C, Yuan Y, Yuan Q, Pei X, Cheng CY, Sun F. Whole-exome sequencing of a large Chinese azoospermia and severe oligospermia cohort identifies novel infertility causative variants and genes. Hum Mol Genet 2020; 29:2451-2459. [PMID: 32469048 DOI: 10.1093/hmg/ddaa101] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 05/17/2020] [Accepted: 05/25/2020] [Indexed: 12/16/2022] Open
Abstract
Abstract
Rare coding variants have been proven to be one of the significant factors contributing to spermatogenic failure in patients with non-obstructive azoospermia (NOA) and severe oligospermia (SO). To delineate the molecular characteristics of idiopathic NOA and SO, we performed whole-exome sequencing of 314 unrelated patients of Chinese Han origin and verified our findings by comparing to 400 fertile controls. We detected six pathogenic/likely pathogenic variants and four variants of unknown significance, in genes known to cause NOA/SO, and 9 of which had not been earlier reported. Additionally, we identified 20 novel NOA candidate genes affecting 25 patients. Among them, five (BRDT, CHD5, MCM9, MLH3 and ZFX) were considered as strong candidates based on the evidence obtained from murine functional studies and human single-cell (sc)RNA-sequencing data. These genetic findings provide insight into the aetiology of human NOA/SO and pave the way for further functional analysis and molecular diagnosis of male infertility.
Collapse
Affiliation(s)
- Shitao Chen
- International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory for Reproductive Medicine, School of Medicine, Shanghai Jiaotong University, Shanghai, 200030, China
| | - Guishuan Wang
- Institute of Reproductive Medicine, School of Medicine, Nantong University, Nantong, 226001, China
| | - Xiaoguo Zheng
- International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory for Reproductive Medicine, School of Medicine, Shanghai Jiaotong University, Shanghai, 200030, China
| | - Shunna Ge
- International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory for Reproductive Medicine, School of Medicine, Shanghai Jiaotong University, Shanghai, 200030, China
| | - Yubing Dai
- International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory for Reproductive Medicine, School of Medicine, Shanghai Jiaotong University, Shanghai, 200030, China
| | - Ping Ping
- Department of Urology, Shanghai Human Sperm Bank, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200120, China
| | - Xiangfeng Chen
- Department of Urology, Shanghai Human Sperm Bank, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200120, China
| | - Guihua Liu
- Department of Andrology, Reproductive Medicine Research Center, the Sixth Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510655, China
| | - Jing Zhang
- Department of Andrology, Reproductive Medicine Research Center, the Sixth Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510655, China
| | - Yang Yang
- Department of Reproduction, Beijing Obstetrics and Gynecology Hospital, Capital Medical University, Beijing, 100026, China
| | - Xinzong Zhang
- Key Laboratory of Male Reproduction and Genetics, National Health and Family Planning Commission, Family Planning Research Institute of Guangdong Province, Guangzhou, 510031, China
| | - An Zhong
- Key Laboratory of Male Reproduction and Genetics, National Health and Family Planning Commission, Family Planning Research Institute of Guangdong Province, Guangzhou, 510031, China
| | - Yongtong Zhu
- Department of Obstetrics and Gynecology, Reproductive Medicine Center, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Qingjun Chu
- Department of Obstetrics and Gynecology, Reproductive Medicine Center, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Yonghan Huang
- The First People's Hospital of Foshan, Sun Yat-sen University, Foshan, 528000, China
| | - Yong Zhang
- Center of Assisted Reproductive Medicine, The Sixth Medical Center of PLA General Hospital, Beijing, 100083, China
| | - Changli Shen
- Reproductive Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, 510120, China
| | - Yiming Yuan
- Peking University First Hospital Andrology Center & Urology Department, Beijing, 100034, China
| | - Qilong Yuan
- Guangdong Province Hospital of Chinese Medicine, Guangzhou, 510140, China
| | - Xiuying Pei
- Key Laboratory of Fertility Preservation and Maintenance of Ministry of Education, School of Basic Medicine, Ningxia Medical University, Yinchuan, 750004, China
| | - C Yan Cheng
- The Mary M. Wohlford Laboratory for Male Contraceptive Research, Center for Biomedical Research, Population Council, New York, 10065, USA
| | - Fei Sun
- International Peace Maternity and Child Health Hospital, Shanghai Key Laboratory for Reproductive Medicine, School of Medicine, Shanghai Jiaotong University, Shanghai, 200030, China
- Institute of Reproductive Medicine, School of Medicine, Nantong University, Nantong, 226001, China
| |
Collapse
|
17
|
van Groen BD, Bi C, Gaedigk R, Staggs VS, Tibboel D, de Wildt SN, Leeder JS. Alternative Splicing of the SLCO1B1 Gene: An Exploratory Analysis of Isoform Diversity in Pediatric Liver. Clin Transl Sci 2020; 13:509-519. [PMID: 31917523 PMCID: PMC7214651 DOI: 10.1111/cts.12733] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Accepted: 10/26/2019] [Indexed: 11/30/2022] Open
Abstract
The hepatic influx transporter OATP1B1 (SLCO1B1) plays an important role in the disposition of endogenous substrates and drugs prescribed to children. Alternative splicing increases the diversity of protein products from > 90% of human genes and may be triggered by developmental signals. As concentrations of several endogenous OATP1B1 substrates change during growth and development, with this exploratory study we investigated age-dependent alternative splicing of SLCO1B1 mRNA in 97 postmortem livers (fetus-adolescents). Twenty-seven splice variants were detected; 10 were confirmed by additional bioinformatic analyses and verified by quantitative polymerase chain reaction, and selected for detailed analysis based on relative abundance, association with age, and overlap with an adjacent gene. Two splice variants code for reference OATP1B1 protein, and eight code for truncated proteins. The expression of eight isoforms was associated with age. We conclude that alternative splicing of SLCO1B1 occurs frequently in children; although the functional consequences remain unknown, the data raise the possibility of a regulatory role for alternative splicing in mediating developmental changes in drug disposition.
Collapse
Affiliation(s)
- Bianca D. van Groen
- Intensive Care and Department of Pediatric SurgeryErasmus MC‐Sophia Children’s HospitalRotterdamThe Netherlands
| | - Chengpeng Bi
- Division of Clinical Pharmacology, Toxicology, & Therapeutic InnovationDepartment of PediatricsChildren's Mercy Kansas CityKansas CityMissouriUSA
| | - Roger Gaedigk
- Division of Clinical Pharmacology, Toxicology, & Therapeutic InnovationDepartment of PediatricsChildren's Mercy Kansas CityKansas CityMissouriUSA
| | - Vincent S. Staggs
- Health Services and Outcomes ResearchChildren's Mercy Kansas CitySchool of MedicineUniversity of Missouri‐KansasKansas CityMissouriUSA
| | - Dick Tibboel
- Intensive Care and Department of Pediatric SurgeryErasmus MC‐Sophia Children’s HospitalRotterdamThe Netherlands
| | - Saskia N. de Wildt
- Intensive Care and Department of Pediatric SurgeryErasmus MC‐Sophia Children’s HospitalRotterdamThe Netherlands
- Department of Pharmacology and ToxicologyRadboud Institute for Health SciencesRadboud University Medical CenterNijmegenThe Netherlands
| | - J. Steven Leeder
- Division of Clinical Pharmacology, Toxicology, & Therapeutic InnovationDepartment of PediatricsChildren's Mercy Kansas CityKansas CityMissouriUSA
| |
Collapse
|
18
|
Formation of human long intergenic non-coding RNA genes, pseudogenes, and protein genes: Ancestral sequences are key players. PLoS One 2020; 15:e0230236. [PMID: 32214344 PMCID: PMC7098633 DOI: 10.1371/journal.pone.0230236] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 02/25/2020] [Indexed: 12/20/2022] Open
Abstract
Pathways leading to formation of non-coding RNA and protein genes are varied and complex. We report finding a conserved repeat sequence present in human and chimpanzee genomes that appears to have originated from a common primate ancestor. This sequence is repeatedly copied in human chromosome 22 (chr22) low copy repeats (LCR22) or segmental duplications and forms twenty-one different genes, which include the human long intergenic non-coding RNA (lincRNA) family FAM230, a newly discovered lincRNA gene family termed conserved long intergenic non-coding RNAs (clincRNA), pseudogene families, as well as the gamma-glutamyltransferase (GGT) protein gene family and the RNA pseudogenes that originate from GGT sequences. Of particular interest are the GGT5 and USP18 protein genes that appear to have formed from an homologous repeat sequence that also forms the clincRNA gene family. The data point to ancestral DNA sequences, conserved through evolution and duplicated in humans by chromosomal repeat sequences that may serve as functional genomic elements in the development of diverse genes.
Collapse
|
19
|
Lal D, May P, Perez-Palma E, Samocha KE, Kosmicki JA, Robinson EB, Møller RS, Krause R, Nürnberg P, Weckhuysen S, De Jonghe P, Guerrini R, Niestroj LM, Du J, Marini C, Ware JS, Kurki M, Gormley P, Tang S, Wu S, Biskup S, Poduri A, Neubauer BA, Koeleman BPC, Helbig KL, Weber YG, Helbig I, Majithia AR, Palotie A, Daly MJ. Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders. Genome Med 2020; 12:28. [PMID: 32183904 PMCID: PMC7079346 DOI: 10.1186/s13073-020-00725-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Accepted: 02/21/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. METHODS Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. RESULTS We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. CONCLUSION This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes.
Collapse
Affiliation(s)
- Dennis Lal
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH, USA.
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA.
- Cologne Center for Genomics, University of Cologne, Cologne, Germany.
- Genomic Medicine Institute, Lerner Research Institute Cleveland Clinic, 9500 Euclid Avenue, Cleveland, OH, 44195, USA.
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6, Avenue du Swing, 4367, Belvaux, Luxembourg.
| | - Eduardo Perez-Palma
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
- Genomic Medicine Institute, Lerner Research Institute Cleveland Clinic, 9500 Euclid Avenue, Cleveland, OH, 44195, USA
| | - Kaitlin E Samocha
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Jack A Kosmicki
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
| | - Elise B Robinson
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Rikke S Møller
- The Danish Epilepsy Centre, Dianalund, Denmark
- Institute for Regional Health research, University of Southern Denmark, Odense, Denmark
| | - Roland Krause
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6, Avenue du Swing, 4367, Belvaux, Luxembourg
| | - Peter Nürnberg
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
- Center for Molecular Medicine Cologne, University of Cologne, Cologne, Germany
- Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne, Cologne, Germany
| | - Sarah Weckhuysen
- Division of Neurology, Antwerp University Hospital, Antwerp, Belgium
- Neurogenetics Group, Center for Molecular Neurology, VIB, Antwerp, Belgium
- Laboratory of Neurogenetics, Institute Born-Bunge, University of Antwerp, Antwerp, Belgium
| | - Peter De Jonghe
- Division of Neurology, Antwerp University Hospital, Antwerp, Belgium
| | - Renzo Guerrini
- Pediatric Neurology and Neuroscience Department, Children's Hospital Anna Meyer, University of Florence, Florence, Italy
| | - Lisa M Niestroj
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
| | - Juliana Du
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
| | - Carla Marini
- Pediatric Neurology and Neuroscience Department, Children's Hospital Anna Meyer, University of Florence, Florence, Italy
| | - James S Ware
- National Heart & Lung Institute and MRC London Institute of Medical Science, Imperial College London, London, UK
| | - Mitja Kurki
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
| | - Padhraig Gormley
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
| | - Sha Tang
- Division of Clinical Genomics, Ambry Genetics, Aliso Viejo, CA, USA
| | - Sitao Wu
- Division of Clinical Genomics, Ambry Genetics, Aliso Viejo, CA, USA
| | - Saskia Biskup
- CeGat and Practice for Human Genetics, Tübingen, Germany
| | - Annapurna Poduri
- Epilepsy Genetics Program, Boston Children's Hospital, Boston, MA, USA
| | - Bernd A Neubauer
- Department of Neuropediatrics UKGM, University of Giessen, Giessen, Germany
| | - Bobby P C Koeleman
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Katherine L Helbig
- Division of Clinical Genomics, Ambry Genetics, Aliso Viejo, CA, USA
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yvonne G Weber
- Department of Neurology and Epileptology, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany
- Department of Epileptology and Neurology, University of Aachen, Aachen, Germany
| | - Ingo Helbig
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, 19104, USA
| | - Amit R Majithia
- Division of Endocrinology, Department of Medicine, University of California, San Diego, CA, USA
| | - Aarno Palotie
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Mark J Daly
- Stanley Center for Psychiatric Research, The Broad Institute of Harvard and M.I.T, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, USA.
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
20
|
Complete Profiling of Methyl-CpG-Binding Domains for Combinations of Cytosine Modifications at CpG Dinucleotides Reveals Differential Read-out in Normal and Rett-Associated States. Sci Rep 2020; 10:4053. [PMID: 32132616 PMCID: PMC7055227 DOI: 10.1038/s41598-020-61030-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 02/03/2020] [Indexed: 11/17/2022] Open
Abstract
5-Methylcytosine (mC) exists in CpG dinucleotides of mammalian DNA and plays key roles in chromatin regulation during development and disease. As a main regulatory pathway, fully methylated CpG are recognized by methyl-CpG-binding domain (MBD) proteins that act in concert with chromatin remodelers, histone deacetylases and methyltransferases to trigger transcriptional downregulation. In turn, MBD mutations can alter CpG binding, and in case of the MBD protein MeCP2 can cause the neurological disorder Rett syndrome (RTT). An additional layer of complexity in CpG recognition is added by ten-eleven-translocation (TET) dioxygenases that oxidize mC to 5-hydroxymethyl-, 5-formyl- and 5-carboxylcytosine, giving rise to fifteen possible combinations of cytosine modifications in the two CpG strands. We report a comprehensive, comparative interaction analysis of the human MBD proteins MeCP2, MBD1, MBD2, MBD3, and MBD4 with all CpG combinations and observe individual preferences of each MBD for distinct combinations. In addition, we profile four MeCP2 RTT mutants and reveal that although interactions to methylated CpGs are similarly affected by the mutations, interactions to oxidized mC combinations are differentially affected. These findings argue for a complex interplay between local TET activity/processivity and CpG recognition by MBDs, with potential consequences for the transcriptional landscape in normal and RTT states.
Collapse
|
21
|
Quality of whole genome sequencing from blood versus saliva derived DNA in cardiac patients. BMC Med Genomics 2020; 13:11. [PMID: 31996208 PMCID: PMC6988365 DOI: 10.1186/s12920-020-0664-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 01/20/2020] [Indexed: 01/03/2023] Open
Abstract
Background Whole-genome sequencing (WGS) is becoming an increasingly important tool for detecting genomic variation. Blood derived DNA is the current standard for WGS for research or clinical purposes but may not always be feasible to acquire. The usability of DNA from saliva for WGS is not known. We compared the quality of WGS between blood versus saliva derived DNA. Methods WGS was performed in DNA from 531 blood and 502 saliva samples (including 5 paired samples) from participants enrolled in a heart disease biorepository. We compared the proportion of sequencing reads that mapped to non-human sources (microbiome), the sequencing coverage, and the yield and concordance of single nucleotide variant (SNV) and copy number variant (CNV) calls between blood and saliva genomes. Results Of 531 blood and 502 saliva samples, 46% saliva DNA failed quality control (QC) requirements for WGS compared to 6% QC failure for blood DNA. An average of 10.7% WGS reads in the saliva samples mapped to the human oral microbiome compared to 0.09% WGS reads in blood samples. However, these reads were readily excluded by excluding reads that did not map to the human reference genome. Sequencing coverage met or exceeded the target sequencing depth of 30x in all the blood samples and 4 of the 5 saliva samples; the fifth saliva sample had an average sequencing depth of 22.6x. Over 95% of SNVs identified in saliva were concordant with those identified in blood across the genome, within all gene coding regions, and within cardiovascular disease-related gene coding regions. Rare SNVs, defined as those with a minor allele frequency of less than 1% in the Genome Aggregation Database, had a lower concordance of 90% between blood and saliva genomes. CNVs had only 76% concordance between blood and saliva samples. Conclusions High quality saliva samples that meet stringent QC criteria can be used for WGS when blood-derived DNA is not available or is not suitable. Saliva DNA provides an acceptable yield of SNV calls but has a lower yield for CNV calls compared to blood DNA.
Collapse
|
22
|
Abstract
Enhancers are cis-acting elements with many sites bound by transcription factors and activate transcription over long distance. Histone modifications are critical for enhancer activity and utilized as hallmarks for the identification of putative enhancers. Monomethylation of histone H3 lysine 4 (H3K4me1) is the mark for enhancer priming; acetylation of histone H3 lysine 27 (H3K27ac) for active enhancers and trimethylation of histone H3 lysine 27 (H3K27me3) for silent enhancers. Recent studies from multiple groups have provided evidence that enhancer reprogramming, especially gain of enhancer activity, is closely related to tumorigenesis and cancer development. In this review, we will summarize the recent discoveries about enhancer regulation and the mechanisms of enhancer reprogramming in tumorigenesis, and discuss the potential application of enhancer manipulation in precision medicine.
Collapse
Affiliation(s)
- Jie Yao
- College of Life Sciences, Frontier Science Center for Immunology and Metabolism, Hubei Key Laboratory of Cell Homeostasis, Hubei Key Laboratory of Developmentally Originated Disease, Hubei Key Laboratory of Enteropathy, Wuhan University, Wuhan, Hubei, China
| | - Ji Chen
- College of Life Sciences, Frontier Science Center for Immunology and Metabolism, Hubei Key Laboratory of Cell Homeostasis, Hubei Key Laboratory of Developmentally Originated Disease, Hubei Key Laboratory of Enteropathy, Wuhan University, Wuhan, Hubei, China
| | - Lian-Yun Li
- College of Life Sciences, Frontier Science Center for Immunology and Metabolism, Hubei Key Laboratory of Cell Homeostasis, Hubei Key Laboratory of Developmentally Originated Disease, Hubei Key Laboratory of Enteropathy, Wuhan University, Wuhan, Hubei, China
| | - Min Wu
- College of Life Sciences, Frontier Science Center for Immunology and Metabolism, Hubei Key Laboratory of Cell Homeostasis, Hubei Key Laboratory of Developmentally Originated Disease, Hubei Key Laboratory of Enteropathy, Wuhan University, Wuhan, Hubei, China
| |
Collapse
|
23
|
Lansing F, Paszkowski-Rogacz M, Schmitt LT, Schneider PM, Rojo Romanos T, Sonntag J, Buchholz F. A heterodimer of evolved designer-recombinases precisely excises a human genomic DNA locus. Nucleic Acids Res 2020; 48:472-485. [PMID: 31745551 PMCID: PMC7107906 DOI: 10.1093/nar/gkz1078] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 10/29/2019] [Accepted: 11/04/2019] [Indexed: 01/04/2023] Open
Abstract
Site-specific recombinases (SSRs) such as the Cre/loxP system are useful genome engineering tools that can be repurposed by altering their DNA-binding specificity. However, SSRs that delete a natural sequence from the human genome have not been reported thus far. Here, we describe the generation of an SSR system that precisely excises a 1.4 kb fragment from the human genome. Through a streamlined process of substrate-linked directed evolution we generated two separate recombinases that, when expressed together, act as a heterodimer to delete a human genomic sequence from chromosome 7. Our data indicates that designer-recombinases can be generated in a manageable timeframe for precision genome editing. A large-scale bioinformatics analysis suggests that around 13% of all human protein-coding genes could be targetable by dual designer-recombinase induced genomic deletion (dDRiGD). We propose that heterospecific designer-recombinases, which work independently of the host DNA repair machinery, represent an efficient and safe alternative to nuclease-based genome editing technologies.
Collapse
Affiliation(s)
- Felix Lansing
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Maciej Paszkowski-Rogacz
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Lukas Theo Schmitt
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Paul Martin Schneider
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Teresa Rojo Romanos
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Jan Sonntag
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| | - Frank Buchholz
- Medical Faculty and University Hospital Carl Gustav Carus, UCC Section Medical Systems Biology, TU Dresden, 01307 Dresden, Germany
| |
Collapse
|
24
|
Evolutionary genomics analysis of human nucleus-encoded mitochondrial genes: implications for the roles of energy production and metabolic pathways in the pathogenesis and pathophysiology of demyelinating diseases. Neurosci Lett 2019; 715:134600. [PMID: 31726178 DOI: 10.1016/j.neulet.2019.134600] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 09/19/2019] [Accepted: 10/28/2019] [Indexed: 02/02/2023]
Abstract
The myelin sheath is a plasma membrane extension that lines nerve fibers to protect, support and insulate neurons. The myelination of axons in vertebrates enables fast, saltatory impulse propagation, and this process relies on organelles, especially on mitochondria to supply energy. Approximately 99% of mitochondrial proteins are encoded in the nucleus. Since mitochondria play a central role in the energy production and metabolic pathways, which are essential for myelinogenesis, studying these nucleus-encoded genes (nMGs) may provide new insights into the roles of energy metabolism in demyelinating diseases. In this work, a multiomics-based approach was employed to 1) construct a 1,740 human nMG subset with mitochondrial localization evidence obtained from the Integrated Mitochondrial Protein Index (IMPI) and MitoCarta databases, 2) conduct an evolutionary genomics analysis across mouse, rat, monkey, chimp, and human models, 3) examine dysmyelination phenotype-related genes (nMG subset genes with oligodendrocyte- and myelin-related phenotypes, OMP-nMGs) in MGI mouse lines and human patients, 4) determine the expression discrepancy of OMP-nMGs in brain tissues of cuprizone-treated mice, multiple sclerosis patients, and normal controls, and 5) conduct literature data mining to explore OMP-nMG-associated disease impacts. By contrasting OMP-nMGs with other genes, OMP-nMGs were found to be more ubiquitously expressed (59.1% vs. 16.1%), disease-associated (67.3% vs. 20.2%), and evolutionarily conserved within the human populations. Our multiomics-based analysis identified 110 OMP-nMGs implicated in energy production and lipid and glycan biosynthesis in the pathogenesis and pathophysiology of demyelinating disorders. Future targeted characterization of OMP-nMGs in abnormal myelination conditions may allow the discovery of novel nMG mediated mechanisms underlying myelinogenesis and related diseases.
Collapse
|
25
|
Zhu M, Gribskov M. MiPepid: MicroPeptide identification tool using machine learning. BMC Bioinformatics 2019; 20:559. [PMID: 31703551 PMCID: PMC6842143 DOI: 10.1186/s12859-019-3033-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 08/16/2019] [Indexed: 12/13/2022] Open
Abstract
Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Electronic supplementary material The online version of this article (10.1186/s12859-019-3033-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mengmeng Zhu
- Department of Statistics, Purdue University, West Lafayette, IN, 47907, USA.,Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Michael Gribskov
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| |
Collapse
|
26
|
Pujar S, O'Leary NA, Farrell CM, Loveland JE, Mudge JM, Wallin C, Girón CG, Diekhans M, Barnes I, Bennett R, Berry AE, Cox E, Davidson C, Goldfarb T, Gonzalez JM, Hunt T, Jackson J, Joardar V, Kay MP, Kodali VK, Martin FJ, McAndrews M, McGarvey KM, Murphy M, Rajput B, Rangwala SH, Riddick LD, Seal RL, Suner MM, Webb D, Zhu S, Aken BL, Bruford EA, Bult CJ, Frankish A, Murphy T, Pruitt KD. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation. Nucleic Acids Res 2019; 46:D221-D228. [PMID: 29126148 PMCID: PMC5753299 DOI: 10.1093/nar/gkx1031] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 10/20/2017] [Indexed: 01/29/2023] Open
Abstract
The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community.
Collapse
Affiliation(s)
- Shashikant Pujar
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Nuala A O'Leary
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Catherine M Farrell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Jane E Loveland
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Craig Wallin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Carlos G Girón
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mark Diekhans
- University of California Santa Cruz Genomics Institute, Santa Cruz, CA 95064, USA
| | - If Barnes
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ruth Bennett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew E Berry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eric Cox
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tamara Goldfarb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Jose M Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John Jackson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Vinita Joardar
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Mike P Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Vamsi K Kodali
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Monica McAndrews
- Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Kelly M McGarvey
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Michael Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Bhanu Rajput
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Sanjida H Rangwala
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Lillian D Riddick
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Ruth L Seal
- HUGO Gene Nomenclature Committee, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Marie-Marthe Suner
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Webb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Sophia Zhu
- Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Bronwen L Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Elspeth A Bruford
- HUGO Gene Nomenclature Committee, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Carol J Bult
- Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Terence Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
27
|
Ishige T, Itoga S, Kawasaki K, Utsuno E, Beppu M, Sawai S, Nishimura M, Ichikawa T, Nomura F, Matsushita K. Evaluation of analytical factors associated with targeted MEFV gene sequencing using long-range PCR/massively parallel sequencing of whole blood DNA for molecular diagnosis of Familial Mediterranean fever. Clin Chim Acta 2019; 495:562-569. [DOI: 10.1016/j.cca.2019.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 05/27/2019] [Accepted: 06/03/2019] [Indexed: 10/26/2022]
|
28
|
Deep sequencing across germline genome-wide association study signals relating to breast cancer events in women receiving aromatase inhibitors for adjuvant therapy of early breast cancer. Pharmacogenet Genomics 2019; 29:183-191. [PMID: 31211741 DOI: 10.1097/fpc.0000000000000382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
OBJECTIVE To identify additional genetic variants beyond those observed in a previous genome-wide association study (GWAS) in women treated on the MA.27 clinical trial in which women were randomized to 5 years of adjuvant therapy with anastrozole or exemestane. PATIENTS AND METHODS We performed a matched case-control study in 234 women who had a recurrence of breast cancer (cases) and 649 women who had not (controls). The analysis was restricted to White women with an estrogen receptor-positive breast cancer. Multiplex PCR-based targeted deep sequencing was performed of the MIR2052HG region on chromosome 8 between positions 75.4 and 75.7, a span of 300 kb, in an attempt to identify additional functional single nucleotide polymorphisms (SNPs). RESULTS A total of 4677 unique variants were identified that had not been identified in the previous GWAS. Clinical Annotation of Variants analysis revealed 10 variants, including eight SNPs and two insertion-deletion mutations with moderate or high impact. However, none of the common and variant regions was significant after adjustment for the most significant SNP (rs13260300) identified in our previous GWAS. We performed haplotype analysis that revealed two regions in which the haplotypes lost significance when adjusted for this prior GWAS SNP and one region with two significant haplotypes (P = 0.046 and 0.031) after adjusting for the GWAS SNP. CONCLUSION We were unable to identify common or rare variant regions that added value to the findings from our previous GWAS. We did find two haplotypes that were significant after adjusting for our top GWAS SNP but these were considered to be of marginal value.
Collapse
|
29
|
Koyanagi Y, Akiyama M, Nishiguchi KM, Momozawa Y, Kamatani Y, Takata S, Inai C, Iwasaki Y, Kumano M, Murakami Y, Omodaka K, Abe T, Komori S, Gao D, Hirakata T, Kurata K, Hosono K, Ueno S, Hotta Y, Murakami A, Terasaki H, Wada Y, Nakazawa T, Ishibashi T, Ikeda Y, Kubo M, Sonoda KH. Genetic characteristics of retinitis pigmentosa in 1204 Japanese patients. J Med Genet 2019; 56:662-670. [PMID: 31213501 DOI: 10.1136/jmedgenet-2018-105691] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 04/21/2019] [Accepted: 05/14/2019] [Indexed: 12/20/2022]
Abstract
BACKGROUND The genetic profile of retinitis pigmentosa (RP) in East Asian populations has not been well characterised. Therefore, we conducted a large-scale sequencing study to investigate the genes and variants causing RP in a Japanese population. METHODS A total of 1209 Japanese patients diagnosed with typical RP were enrolled. We performed deep resequencing of 83 known causative genes of RP using next-generation sequencing. We defined pathogenic variants as those that were putatively deleterious or registered as pathogenic in the Human Gene Mutation Database or ClinVar database and had a minor allele frequency in any ethnic population of ≤0.5% for recessive genes or ≤0.01% for dominant genes as determined using population-based databases. RESULTS We successfully sequenced 1204 patients with RP and determined 200 pathogenic variants in 38 genes as the cause of RP in 356 patients (29.6%). Variants in six genes (EYS, USH2A, RP1L1, RHO, RP1 and RPGR) caused RP in 65.4% (233/356) of those patients. Among autosomal recessive genes, two known founder variants in EYS [p.(Ser1653fs) and p.(Tyr2935*)] and four East Asian-specific variants [p.(Gly2752Arg) in USH2A, p.(Arg658*) in RP1L1, p.(Gly2186Glu) in EYS and p.(Ile535Asn) in PDE6B] and p.(Cys934Trp) in USH2A were found in ≥10 patients. Among autosomal dominant genes, four pathogenic variants [p.(Pro347Leu) in RHO, p.(Arg872fs) in RP1, p.(Arg41Trp) in CRX and p.(Gly381fs) in PRPF31] were found in ≥4 patients, while these variants were unreported or extremely rare in both East Asian and non-East Asian population-based databases. CONCLUSIONS East Asian-specific variants in causative genes were the major causes of RP in the Japanese population.
Collapse
Affiliation(s)
- Yoshito Koyanagi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Masato Akiyama
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Koji M Nishiguchi
- Department of Ophthalmology, Tohoku University Graduate School of Medicine, Sendai, Japan.,Department of Advanced Ophthalmic Medicine, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Kyoto-McGill International Collaborative School in Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Sadaaki Takata
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Chihiro Inai
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Yusuke Iwasaki
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Mikako Kumano
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yusuke Murakami
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Kazuko Omodaka
- Department of Ophthalmology, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Toshiaki Abe
- Division of Clinical Cell Therapy, United Centers for Advanced Research and Translational Medicine (ART), Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Shiori Komori
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Dan Gao
- Department of Ophthalmology, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Toshiaki Hirakata
- Department of Ophthalmology, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Kentaro Kurata
- Department of Ophthalmology, Hamamatsu University School of Medicine, Shizuoka, Japan
| | - Katsuhiro Hosono
- Department of Ophthalmology, Hamamatsu University School of Medicine, Shizuoka, Japan
| | - Shinji Ueno
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Yoshihiro Hotta
- Department of Ophthalmology, Hamamatsu University School of Medicine, Shizuoka, Japan
| | - Akira Murakami
- Department of Ophthalmology, Juntendo University Graduate School of Medicine, Tokyo, Japan
| | - Hiroko Terasaki
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | | | - Toru Nakazawa
- Department of Ophthalmology, Tohoku University Graduate School of Medicine, Sendai, Japan.,Department of Advanced Ophthalmic Medicine, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Tatsuro Ishibashi
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yasuhiro Ikeda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Koh-Hei Sonoda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| |
Collapse
|
30
|
Motegi T, Kochi Y, Matsuda K, Kubo M, Yamamoto K, Momozawa Y. Identification of rare coding variants in TYK2 protective for rheumatoid arthritis in the Japanese population and their effects on cytokine signalling. Ann Rheum Dis 2019; 78:1062-1069. [PMID: 31118190 DOI: 10.1136/annrheumdis-2019-215062] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2019] [Revised: 03/19/2019] [Accepted: 04/18/2019] [Indexed: 02/07/2023]
Abstract
OBJECTIVE Although genome-wide association studies (GWAS) have identified approximately 100 loci for rheumatoid arthritis (RA), the disease mechanisms are not completely understood. We evaluated the pathogenesis of RA by focusing on rare coding variants. METHODS The coding regions of 98 candidate genes identified by GWAS were sequenced in 2294 patients with RA and 4461 controls in Japan. An association analysis was performed using cases and controls for variants, genes and domains of TYK2. Cytokine responses for two associated variants (R231W, rs201917359; and R703W, rs55882956) in TYK2 as well as a previously reported risk variant (P1004A, rs34536443) for multiple autoimmune diseases were evaluated by reporter assays. RESULTS A variant in TYK2 (R703W) showed a suggestive association (p=5.47×10-8, OR=0.48). We observed more accumulation of rare coding variants in controls in TYK2 (p=3.94×10-12, OR=0.56). The four-point-one, ezrin, radixin, moesin (FERM; 2.14×10-3, OR=0.66) and pseudokinase domains (1.63×10-8, OR=0.52) of TYK2 also showed enrichment of variants in controls. R231W in FERM domain especially reduced interleukin (IL)-6 and interferon (IFN)-γ signalling, whereas P1104A in kinase domain reduced IL-12, IL-23 and IFN-α signalling. R703W in pseudokinase domain reduced cytokine signals similarly to P1104A, but the effects were weaker than those of P1104A. CONCLUSIONS The FERM and pseudokinase domains in TYK2 were associated with the risk of RA in the Japanese population. Variants in TYK2 had different effects on cytokine signalling, suggesting that the regulation of selective cytokine signalling is a target for RA treatment.
Collapse
Affiliation(s)
- Tomoki Motegi
- Veterinary Medical Center, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan.,Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yuta Kochi
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Koichi Matsuda
- Laboratory of Clinical Genome Sequencing, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kazuhiko Yamamoto
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| |
Collapse
|
31
|
Kim E, Dede M, Lenoir WF, Wang G, Srinivasan S, Colic M, Hart T. A network of human functional gene interactions from knockout fitness screens in cancer cells. Life Sci Alliance 2019; 2:2/2/e201800278. [PMID: 30979825 PMCID: PMC6464042 DOI: 10.26508/lsa.201800278] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Revised: 04/03/2019] [Accepted: 04/04/2019] [Indexed: 12/12/2022] Open
Abstract
The function of human genes can be strongly inferred from their knockout fitness profiles across hundreds of CRISPR screens, illuminating the modular organization of the cell. Genetic interactions mediate the emergence of phenotype from genotype. The systematic survey of genetic interactions in yeast showed that genes operating in the same biological process have highly correlated genetic interaction profiles, and this observation has been exploited to infer gene function in model organisms. Such assays of digenic perturbations in human cells are also highly informative, but are not scalable, even with CRISPR-mediated methods. As an alternative, we developed an indirect method of deriving functional interactions. We show that genes having correlated knockout fitness profiles across diverse, non-isogenic cell lines are analogous to genes having correlated genetic interaction profiles across isogenic query strains and similarly imply shared biological function. We constructed a network of genes with correlated fitness profiles across 276 high-quality CRISPR knockout screens in cancer cell lines into a “coessentiality network,” with up to 500-fold enrichment for co-functional gene pairs, enabling strong inference of gene function and highlighting the modular organization of the cell.
Collapse
Affiliation(s)
- Eiru Kim
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Merve Dede
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.,UTHealth Graduate School of Biomedical Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Walter F Lenoir
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.,UTHealth Graduate School of Biomedical Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Gang Wang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Sanjana Srinivasan
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.,UTHealth Graduate School of Biomedical Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Medina Colic
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.,UTHealth Graduate School of Biomedical Sciences, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Traver Hart
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA .,Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
32
|
Hao Y, Zhang L, Niu Y, Cai T, Luo J, He S, Zhang B, Zhang D, Qin Y, Yang F, Chen R. SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci. Brief Bioinform 2019; 19:636-643. [PMID: 28137767 DOI: 10.1093/bib/bbx005] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Indexed: 11/12/2022] Open
Abstract
Small proteins is the general term for proteins with length shorter than 100 amino acids. Identification and functional studies of small proteins have advanced rapidly in recent years, and several studies have shown that small proteins play important roles in diverse functions including development, muscle contraction and DNA repair. Identification and characterization of previously unrecognized small proteins may contribute in important ways to cell biology and human health. Current databases are generally somewhat deficient in that they have either not collected small proteins systematically, or contain only predictions of small proteins in a limited number of tissues and species. Here, we present a specifically designed web-accessible database, small proteins database (SmProt, http://bioinfo.ibp.ac.cn/SmProt), which is a database documenting small proteins. The current release of SmProt incorporates 255 010 small proteins computationally or experimentally identified in 291 cell lines/tissues derived from eight popular species. The database provides a variety of data including basic information (sequence, location, gene name, organism, etc.) as well as specific information (experiment, function, disease type, etc.). To facilitate data extraction, SmProt supports multiple search options, including species, genome location, gene name and their aliases, cell lines/tissues, ORF type, gene type, PubMed ID and SmProt ID. SmProt also incorporates a service for the BLAST alignment search and provides a local UCSC Genome Browser. Additionally, SmProt defines a high-confidence set of small proteins and predicts the functions of the small proteins.
Collapse
Affiliation(s)
- Yajing Hao
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Lili Zhang
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yiwei Niu
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Tanxi Cai
- Key Laboratory of Protein and Peptide Pharmaceuticals and Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Jianjun Luo
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Shunmin He
- Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Bao Zhang
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Dejiu Zhang
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yan Qin
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Fuquan Yang
- Key Laboratory of Protein and Peptide Pharmaceuticals and Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Runsheng Chen
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
33
|
Gong Y, Wang K, Xiao SP, Mi P, Li W, Shang Y, Dou F. Overexpressed TTC3 Protein Tends to be Cleaved into Fragments and Form Aggregates in the Nucleus. Neuromolecular Med 2019; 21:85-96. [PMID: 30203323 DOI: 10.1007/s12017-018-8509-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 08/31/2018] [Indexed: 12/01/2022]
Abstract
Human tetratricopeptide repeat domain 3 (TTC3) is a gene on 21q22.2 within the Down syndrome critical region (DSCR). Earlier studies suggest that TTC3 may be an important regulator in individual development, especially in neural development. As an E3 ligase, TTC3 binds to phosphorylated Akt and silence its activity via proteasomal cascade. Several groups also reported the involvement of TTC3 in familial Alzheimer's disease recently. In addition, our previous work shows that TTC3 also regulates the degradation of DNA polymerase gamma and over-expressed TTC3 protein tends to form insoluble aggregates in cells. In this study, we focus on the solubility and intracellular localization of TTC3 protein. Over-expressed TTC3 tends to form insoluble aggregates over time. The proteasome inhibitor MG132 treatment resulted in more TTC3 aggregates in a short period of time. We fused the fluorescent protein to either terminus of the TTC3 protein and found that the intracellular localization of fluorescent signals are different between the N-terminal tagged and C-terminal tagged proteins. Western blotting revealed that the TTC3 protein is cleaved into fragments of different sizes at multiple sites. The N-terminal sub-fragments of TTC3 are prone to from nuclear aggregates and the TTC3 nuclear import is mediated by signals within the N-terminal 1 to 650 residues. Moreover, over-expressed TTC3 induced a considerable degree of cytotoxicity, and its N-terminal sub-fragments are more potent inhibitors of cell proliferation than full-length protein. Considering the prevalent proteostasis dysregulation in neurodegenerative diseases, these findings may relate to the pathology of such diseases.
Collapse
Affiliation(s)
- Yueqing Gong
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, College of Life Sciences, Beijing Normal University, Beijing, China
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
| | - Kun Wang
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, College of Life Sciences, Beijing Normal University, Beijing, China
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
| | - Sheng-Ping Xiao
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, College of Life Sciences, Beijing Normal University, Beijing, China
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
| | - Panying Mi
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, College of Life Sciences, Beijing Normal University, Beijing, China
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
| | - Wanjie Li
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Yu Shang
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Fei Dou
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, College of Life Sciences, Beijing Normal University, Beijing, China.
- Key Laboratory of Cell Proliferation and Regulation Biology, Ministry of Education, College of Life Sciences, Beijing Normal University, Beijing, China.
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China.
| |
Collapse
|
34
|
Pertea M, Shumate A, Pertea G, Varabyou A, Breitwieser FP, Chang YC, Madugundu AK, Pandey A, Salzberg SL. CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise. Genome Biol 2018; 19:208. [PMID: 30486838 PMCID: PMC6260756 DOI: 10.1186/s13059-018-1590-2] [Citation(s) in RCA: 162] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2018] [Accepted: 11/16/2018] [Indexed: 01/06/2023] Open
Abstract
We assembled the sequences from deep RNA sequencing experiments by the Genotype-Tissue Expression (GTEx) project, to create a new catalog of human genes and transcripts, called CHESS. The new database contains 42,611 genes, of which 20,352 are potentially protein-coding and 22,259 are noncoding, and a total of 323,258 transcripts. These include 224 novel protein-coding genes and 116,156 novel transcripts. We detected over 30 million additional transcripts at more than 650,000 genomic loci, nearly all of which are likely nonfunctional, revealing a heretofore unappreciated amount of transcriptional noise in human cells. The CHESS database is available at http://ccb.jhu.edu/chess .
Collapse
Affiliation(s)
- Mihaela Pertea
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Alaina Shumate
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Geo Pertea
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ales Varabyou
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Florian P Breitwieser
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Yu-Chi Chang
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Anil K Madugundu
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Institute of Bioinformatics, International Technology Park, Bangalore, India
- Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Present address: Center for Individualized Medicine and Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Akhilesh Pandey
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Departments of Biological Chemistry, Pathology, Neurology, and Oncology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Present address: Center for Individualized Medicine and Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Steven L Salzberg
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
35
|
Momozawa Y, Iwasaki Y, Parsons MT, Kamatani Y, Takahashi A, Tamura C, Katagiri T, Yoshida T, Nakamura S, Sugano K, Miki Y, Hirata M, Matsuda K, Spurdle AB, Kubo M. Germline pathogenic variants of 11 breast cancer genes in 7,051 Japanese patients and 11,241 controls. Nat Commun 2018; 9:4083. [PMID: 30287823 PMCID: PMC6172276 DOI: 10.1038/s41467-018-06581-8] [Citation(s) in RCA: 137] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2018] [Accepted: 09/12/2018] [Indexed: 12/15/2022] Open
Abstract
Pathogenic variants in highly penetrant genes are useful for the diagnosis, therapy, and surveillance for hereditary breast cancer. Large-scale studies are needed to inform future testing and variant classification processes in Japanese. We performed a case-control association study for variants in coding regions of 11 hereditary breast cancer genes in 7051 unselected breast cancer patients and 11,241 female controls of Japanese ancestry. Here, we identify 244 germline pathogenic variants. Pathogenic variants are found in 5.7% of patients, ranging from 15% in women diagnosed <40 years to 3.2% in patients ≥80 years, with BRCA1/2, explaining two-thirds of pathogenic variants identified at all ages. BRCA1/2, PALB2, and TP53 are significant causative genes. Patients with pathogenic variants in BRCA1/2 or PTEN have significantly younger age at diagnosis. In conclusion, BRCA1/2, PALB2, and TP53 are the major hereditary breast cancer genes, irrespective of age at diagnosis, in Japanese women.
Collapse
Affiliation(s)
- Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan.
| | - Yusuke Iwasaki
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
| | - Michael T Parsons
- Division of Genetics and Population Health, QIMR Berghofer Medical Research Institute, 300 Herston Rd, Herston, Brisbane, QLD, 4006, Australia
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
| | - Atsushi Takahashi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
- Department of Genomic Medicine, Research Institute, National Cerebral and Cardiovascular Center, 5-7-1 Fujishiro-dai, Suita, Osaka, 565-8565, Japan
| | - Chieko Tamura
- FMC Tokyo Clinic, 1-3-2, Iidabashi, Chiyoda-ku, Tokyo, 102-0072, Japan
| | - Toyomasa Katagiri
- Division of Genome Medicine, Institute for Genome Research, Tokushima University, 3-18-15 Kuramoto, Tokushima, 770-8503, Japan
| | - Teruhiko Yoshida
- Department of Genetic Medicine and Services, National Cancer Centre Hospital, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan
| | - Seigo Nakamura
- Division of Breast Surgical Oncology, Department of Surgery, Showa University School of Medicine, 1-5-8 Hatanodai, Shinagawa-ku, Tokyo, 142-8666, Japan
| | - Kokichi Sugano
- Department of Genetic Medicine and Services, National Cancer Centre Hospital, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan
- Oncogene Research Unit/Cancer Prevention Unit, Tochigi Cancer Centre Research Institute, 4-9-13 Yohnan, Tochigi, 320-0834, Japan
| | - Yoshio Miki
- Department of Molecular Genetics, Medical Research Institute, Tokyo Medical and Dental University, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8510, Japan
| | - Makoto Hirata
- Department of Genetic Medicine and Services, National Cancer Centre Hospital, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan
- Laboratory of Genome Technology, Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
| | - Koichi Matsuda
- Graduate School of Frontier Sciences, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
| | - Amanda B Spurdle
- Division of Genetics and Population Health, QIMR Berghofer Medical Research Institute, 300 Herston Rd, Herston, Brisbane, QLD, 4006, Australia
| | - Michiaki Kubo
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan.
| |
Collapse
|
36
|
Duek P, Gateau A, Bairoch A, Lane L. Exploring the Uncharacterized Human Proteome Using neXtProt. J Proteome Res 2018; 17:4211-4226. [DOI: 10.1021/acs.jproteome.8b00537] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
37
|
Yuan F, Lu L, Zhang Y, Wang S, Cai YD. Data mining of the cancer-related lncRNAs GO terms and KEGG pathways by using mRMR method. Math Biosci 2018; 304:1-8. [PMID: 30086268 DOI: 10.1016/j.mbs.2018.08.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 06/15/2018] [Accepted: 08/01/2018] [Indexed: 02/07/2023]
Abstract
LncRNAs plays an important role in the regulation of gene expression. Identification of cancer-related lncRNAs GO terms and KEGG pathways is great helpful for revealing cancer-related functional biological processes. Therefore, in this study, we proposed a computational method to identify novel cancer-related lncRNAs GO terms and KEGG pathways. By using existing lncRNA database and Max-relevance Min-redundancy (mRMR) method, GO terms and KEGG pathways were evaluated based on their importance on distinguishing cancer-related and non-cancer-related lncRNAs. Finally, GO terms and KEGG pathways with high importance were presented and analyzed. Our literature reviewing showed that the top 10 ranked GO terms and pathways were really related to interpretable tumorigenesis according to recent publications.
Collapse
Affiliation(s)
- Fei Yuan
- Department of Science & Technology, Binzhou Medical University Hospital, Binzhou 256603, Shandong, China.
| | - Lin Lu
- Department of Radiology, Columbia University Medical Center, New York 10032, USA.
| | - YuHang Zhang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
| | - ShaoPeng Wang
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| |
Collapse
|
38
|
Momozawa Y, Dmitrieva J, Théâtre E, Deffontaine V, Rahmouni S, Charloteaux B, Crins F, Docampo E, Elansary M, Gori AS, Lecut C, Mariman R, Mni M, Oury C, Altukhov I, Alexeev D, Aulchenko Y, Amininejad L, Bouma G, Hoentjen F, Löwenberg M, Oldenburg B, Pierik MJ, Vander Meulen-de Jong AE, Janneke van der Woude C, Visschedijk MC, Lathrop M, Hugot JP, Weersma RK, De Vos M, Franchimont D, Vermeire S, Kubo M, Louis E, Georges M. IBD risk loci are enriched in multigenic regulatory modules encompassing putative causative genes. Nat Commun 2018; 9:2427. [PMID: 29930244 PMCID: PMC6013502 DOI: 10.1038/s41467-018-04365-8] [Citation(s) in RCA: 125] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 04/24/2018] [Indexed: 02/08/2023] Open
Abstract
GWAS have identified >200 risk loci for Inflammatory Bowel Disease (IBD). The majority of disease associations are known to be driven by regulatory variants. To identify the putative causative genes that are perturbed by these variants, we generate a large transcriptome data set (nine disease-relevant cell types) and identify 23,650 cis-eQTL. We show that these are determined by ∼9720 regulatory modules, of which ∼3000 operate in multiple tissues and ∼970 on multiple genes. We identify regulatory modules that drive the disease association for 63 of the 200 risk loci, and show that these are enriched in multigenic modules. Based on these analyses, we resequence 45 of the corresponding 100 candidate genes in 6600 Crohn disease (CD) cases and 5500 controls, and show with burden tests that they include likely causative genes. Our analyses indicate that ≥10-fold larger sample sizes will be required to demonstrate the causality of individual genes using this approach.
Collapse
Affiliation(s)
- Yukihide Momozawa
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Science, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | - Julia Dmitrieva
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Emilie Théâtre
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Valérie Deffontaine
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Souad Rahmouni
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Benoît Charloteaux
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - François Crins
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Elisa Docampo
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Mahmoud Elansary
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Ann-Stephan Gori
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Christelle Lecut
- Laboratory of Thrombosis and Hemostasis, GIGA-R, University of Liège (B34), 1 Avenue de l'Hôpital, 4000, Liège, Belgium
| | - Rob Mariman
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Myriam Mni
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Cécile Oury
- Laboratory of Thrombosis and Hemostasis, GIGA-R, University of Liège (B34), 1 Avenue de l'Hôpital, 4000, Liège, Belgium
| | - Ilya Altukhov
- Moscow Institute of Physics and Technology, Institutskiy Pereulok 9, Dolgoprudny, 141700, Russian Federation
| | - Dmitry Alexeev
- Novosibirsk State University, Pirogova ave. 2, Novosibirsk, 630090, Russian Federation
| | - Yuri Aulchenko
- PolyOmica, Het Vlaggeschip 61, 's-Hertogenbosch, 5237 PA, The Netherlands
- Institute of Cytology and Genetics SD RAS, Lavrentyeva ave. 10, 630090, Novosibirsk, Russia
- Centre for Global Health Research, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Teviot Place, Edinburgh, EH8 9AG, UK
| | - Leila Amininejad
- Gastroentérologie Médicale, Faculté de Médicine, Université Libre de Bruxelles, Route de Lennik 808, Anderlecht, 1070, Belgium
| | - Gerd Bouma
- Department of Gastroenterology and Hepatology, VU University Medical Centre, Amsterdam, 1081 HV, The Netherlands
| | - Frank Hoentjen
- Department of Gastroenterology and Hepatology, University Medical Centre St. Radboud, Nijmegen, 6525 GA, The Netherlands
| | - Mark Löwenberg
- Department of Gastroenterology and Hepatology, Amsterdam Medical Centre, Amsterdam, 1105 AZ, The Netherlands
| | - Bas Oldenburg
- Department of Gastroenterology and Hepatology, University Medical Centre Utrecht, 3584 cX, Utrecht, The Netherlands
| | - Marieke J Pierik
- Department of Gastroenterology and Hepatology, University Medical Centre Maastricht, Maastricht, 6229 HX, The Netherlands
| | | | - C Janneke van der Woude
- Department of Gastroenterology and Hepatology, Erasmus Medical Centre, Rotterdam, 3015 CE, The Netherlands
| | - Marijn C Visschedijk
- Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Hanzeplein 1, Groningen, 9713 GZ, The Netherlands
| | - Mark Lathrop
- McGill University Centre for Molecular and Computational Genomics, 740 Dr. Penfield Avenue, Montreal, H3A 0G1, QC, Canada
| | - Jean-Pierre Hugot
- UMR 1149 INSERM/Université Paris-Diderot Sorbonne Paris-Cité, Assistance Publique Hôpitaux de Paris, 48 Bd Sérurier, Paris, 75019, France
| | - Rinse K Weersma
- Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Hanzeplein 1, Groningen, 9713 GZ, The Netherlands
| | - Martine De Vos
- Department of Gastroenterology, University Hospital, De Pintelaan 185, Gent, 9000, Belgium
| | - Denis Franchimont
- Gastroentérologie Médicale, Faculté de Médicine, Université Libre de Bruxelles, Route de Lennik 808, Anderlecht, 1070, Belgium
| | - Severine Vermeire
- Translational Research in Gastrointestinal Disorders, Department of Clinical and Experimental Medicine, KU Leuven, UZ Herestraat 49, Leuven, 3000, Belgium
| | - Michiaki Kubo
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Science, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | - Edouard Louis
- CHU-Liège and Unit of Gastroenterology, GIGA-R & Faculty of Medicine, University of Liège, 1 Avenue de l'Hôpital, Liège, 4000, Belgium
| | - Michel Georges
- Unit of Animal Genomics, WELBIO, GIGA-R & Faculty of Veterinary Medicine, University of Liège (B34), 1 Avenue de l'Hôpital, Liège, 4000, Belgium.
| |
Collapse
|
39
|
Raghavan NS, Brickman AM, Andrews H, Manly JJ, Schupf N, Lantigua R, Wolock CJ, Kamalakaran S, Petrovski S, Tosto G, Vardarajan BN, Goldstein DB, Mayeux R. Whole-exome sequencing in 20,197 persons for rare variants in Alzheimer's disease. Ann Clin Transl Neurol 2018; 5:832-842. [PMID: 30009200 PMCID: PMC6043775 DOI: 10.1002/acn3.582] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 04/27/2018] [Indexed: 12/29/2022] Open
Abstract
Objective The genetic bases of Alzheimer's disease remain uncertain. An international effort to fully articulate genetic risks and protective factors is underway with the hope of identifying potential therapeutic targets and preventive strategies. The goal here was to identify and characterize the frequency and impact of rare and ultra-rare variants in Alzheimer's disease, using whole-exome sequencing in 20,197 individuals. Methods We used a gene-based collapsing analysis of loss-of-function ultra-rare variants in a case-control study design with data from the Washington Heights-Inwood Columbia Aging Project, the Alzheimer's Disease Sequencing Project and unrelated individuals from the Institute of Genomic Medicine at Columbia University. Results We identified 19 cases carrying extremely rare SORL1 loss-of-function variants among a collection of 6,965 cases and a single loss-of-function variant among 13,252 controls (P = 2.17 × 10-8; OR: 36.2 [95% CI: 5.8-1493.0]). Age-at-onset was 7 years earlier for patients with SORL1 qualifying variant compared with noncarriers. No other gene attained a study-wide level of statistical significance, but multiple top-ranked genes, including GRID2IP,WDR76 and GRN, were among candidates for follow-up studies. Interpretation This study implicates ultra-rare, loss-of-function variants in SORL1 as a significant genetic risk factor for Alzheimer's disease and provides a comprehensive dataset comparing the burden of rare variation in nearly all human genes in Alzheimer's disease cases and controls. This is the first investigation to establish a genome-wide statistically significant association between multiple extremely rare loss-of-function variants in SORL1 and Alzheimer's disease in a large whole-exome study of unrelated cases and controls.
Collapse
Affiliation(s)
- Neha S Raghavan
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Adam M Brickman
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Howard Andrews
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Psychiatry College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Jennifer J Manly
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Nicole Schupf
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Department of Epidemiology Mailman School of Public Health College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Rafael Lantigua
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Medicine College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Charles J Wolock
- Institute of Genomic Medicine Columbia University The New York Presbyterian Hospital New York New York
| | - Sitharthan Kamalakaran
- Institute of Genomic Medicine Columbia University The New York Presbyterian Hospital New York New York
| | - Slave Petrovski
- Institute of Genomic Medicine Columbia University The New York Presbyterian Hospital New York New York.,AstraZeneca Centre for Genomics Research Precision Medicine and Genomics IMED Biotech Unit AstraZeneca Cambridge CB2 0AA United Kingdom
| | - Giuseppe Tosto
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - Badri N Vardarajan
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Systems Biology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | - David B Goldstein
- Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Medicine College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Institute of Genomic Medicine Columbia University The New York Presbyterian Hospital New York New York
| | - Richard Mayeux
- The Taub Institute for Research on Alzheimer's Disease and the Aging Brain College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Gertrude H. Sergievsky Center College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Neurology College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,Department of Psychiatry College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York.,The Department of Epidemiology Mailman School of Public Health College of Physicians and Surgeons Columbia University The New York Presbyterian Hospital New York New York
| | | |
Collapse
|
40
|
Polimanti R, Gelernter J. ADH1B: From alcoholism, natural selection, and cancer to the human phenome. Am J Med Genet B Neuropsychiatr Genet 2018; 177:113-125. [PMID: 28349588 PMCID: PMC5617762 DOI: 10.1002/ajmg.b.32523] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Accepted: 12/19/2016] [Indexed: 12/18/2022]
Abstract
The ADH1B (Alcohol Dehydrogenase 1B (class I), Beta Polypeptide) gene and its best-known functional alleles, Arg48His (rs1229984, ADH1B*2) and Arg370Cys (rs2066702, ADH1B*3), have been investigated in relation to many phenotypic traits; most frequently including alcohol metabolism and alcohol drinking behaviors, but also human evolution, liver function, cancer, and, recently, the comprehensive human phenome. To understand ADH1B functions and consequences, we provide here a bioinformatic analysis of its gene regulation and molecular functions, literature review of studies focused on this gene, and a discussion regarding future research perspectives. Certain ADH1B alleles have large effects on alcohol metabolism, and this relationship particularly encourages further investigations in relation to alcoholism and alcohol-associated cancer to understand better the mechanisms by which alcohol metabolism contributes to alcohol abuse and carcinogenesis. We also observed that ADH1B has complex mechanisms that regulate its expression across multiple human tissues, and these may be involved in cardiac and metabolic traits. Evolutionary data strongly suggest that the selection signatures at the ADH1B locus are primarily related to effects other than those on alcohol metabolism. This is also supported by the involvement of ADH1B in multiple molecular pathways and by the findings of our recent phenome-wide association study. Accordingly, future studies should also investigate other functions of ADH1B potentially relevant for the human phenome. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Renato Polimanti
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare Center, West Haven, CT, USA
| | - Joel Gelernter
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare Center, West Haven, CT, USA
- Department of Genetics, Yale School of Medicine, West Haven, CT, USA
- Department of Neuroscience, Yale School of Medicine, West Haven, CT, USA
| |
Collapse
|
41
|
Momozawa Y, Akiyama M, Kamatani Y, Arakawa S, Yasuda M, Yoshida S, Oshima Y, Mori R, Tanaka K, Mori K, Inoue S, Terasaki H, Yasuma T, Honda S, Miki A, Inoue M, Fujisawa K, Takahashi K, Yasukawa T, Yanagi Y, Kadonosono K, Sonoda KH, Ishibashi T, Takahashi A, Kubo M. Low-frequency coding variants in CETP and CFB are associated with susceptibility of exudative age-related macular degeneration in the Japanese population. Hum Mol Genet 2018; 25:5027-5034. [PMID: 28173125 DOI: 10.1093/hmg/ddw335] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 09/08/2016] [Accepted: 09/28/2016] [Indexed: 12/21/2022] Open
Abstract
Age-related macular degeneration (AMD) is a major cause of blindness in the elderly. Previous sequencing studies of AMD susceptibility genes have revealed the association of rare coding variants in CFH, CFI, C3 and C9 in European population; however, the impact of rare or low-frequency coding variants on AMD susceptibility in other populations is largely unknown. To identify the role of low-frequency coding variants on exudative AMD susceptibility in a Japanese population, we analysed the association of coding variants of 34 AMD candidate genes in the two-stage design by a multiplex PCR-based target sequencing method. We used a total of 2,886 (1st: 827, 2nd: 2,059) exudative AMD cases including typical AMD, polypoidal choroidal vasculopathy, and retinal angiomatous proliferation and 9,337 (1st: 3,247 2nd: 6,090) controls. Gene-based analysis found a significant association of low-frequency variants (minor allele frequency (MAF) < 0.05) in CETP, C2 and CFB. The association of CETP remained after conditioned with all known genome-wide association study (GWAS) associated variants. In addition, when we included only disruptive variants, enrichment of rare variants (MAF < 0.01) was also observed after conditioned with all GWAS associated variants (P = 1.03 × 10−6, odds ratio (OR) = 2.48). Haplotype and conditional analysis of the C2-CFB-SKIV2L locus showed a low-frequency variant (R74H) in CFB would be individually associated with AMD susceptibility independent of the GWAS associated SNP. These findings highlight the importance of target sequencing to reveal the impact of rare or low-frequency coding variants on disease susceptibility in different ethnic populations.
Collapse
Affiliation(s)
- Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Masato Akiyama
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Satoshi Arakawa
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan.,Japan Community Health care Organization, Kyushu Hospital, Fukuoka, Japan
| | - Miho Yasuda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Shigeo Yoshida
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yuji Oshima
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Ryusaburo Mori
- Division of Ophthalmology, Department of Visual Sciences, Nihon University School of Medicine, Nihon University Hospital, Tokyo, Japan
| | - Koji Tanaka
- Division of Ophthalmology, Department of Visual Sciences, Nihon University School of Medicine, Nihon University Hospital, Tokyo, Japan
| | - Keisuke Mori
- Department of Ophthalmology, Saitama Medical University, Saitama, Japan.,Department of Ophthalmology, International University of Health and Welfare Hospital, Tochigi, Japan
| | - Satoshi Inoue
- Division of Gene Regulation and Signal Transduction, Research Center for Genomic Medicine, Saitama Medical University, Saitama, Japan
| | - Hiroko Terasaki
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Aichi, Japan
| | - Tetsuhiro Yasuma
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Aichi, Japan
| | - Shigeru Honda
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Akiko Miki
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Maiko Inoue
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Kimihiko Fujisawa
- Japan Community Health care Organization, Kyushu Hospital, Fukuoka, Japan
| | - Kanji Takahashi
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Tsutomu Yasukawa
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Yasuo Yanagi
- Department of Ophthalmology and Visual Science, Nagoya City University Graduate School of Medical Sciences, Nagoya, Japan
| | - Kazuaki Kadonosono
- Department of Pediatric Surgery, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Koh-Hei Sonoda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Tatsuro Ishibashi
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Atsushi Takahashi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Michiaki Kubo
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| |
Collapse
|
42
|
Han H, Cho JW, Lee S, Yun A, Kim H, Bae D, Yang S, Kim CY, Lee M, Kim E, Lee S, Kang B, Jeong D, Kim Y, Jeon HN, Jung H, Nam S, Chung M, Kim JH, Lee I. TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions. Nucleic Acids Res 2018; 46:D380-D386. [PMID: 29087512 PMCID: PMC5753191 DOI: 10.1093/nar/gkx1013] [Citation(s) in RCA: 1026] [Impact Index Per Article: 171.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Revised: 10/02/2017] [Accepted: 10/13/2017] [Indexed: 02/07/2023] Open
Abstract
Transcription factors (TFs) are major trans-acting factors in transcriptional regulation. Therefore, elucidating TF-target interactions is a key step toward understanding the regulatory circuitry underlying complex traits such as human diseases. We previously published a reference TF-target interaction database for humans-TRRUST (Transcriptional Regulatory Relationships Unraveled by Sentence-based Text mining)-which was constructed using sentence-based text mining, followed by manual curation. Here, we present TRRUST v2 (www.grnpedia.org/trrust) with a significant improvement from the previous version, including a significantly increased size of the database consisting of 8444 regulatory interactions for 800 TFs in humans. More importantly, TRRUST v2 also contains a database for TF-target interactions in mice, including 6552 TF-target interactions for 828 mouse TFs. TRRUST v2 is also substantially more comprehensive and less biased than other TF-target interaction databases. We also improved the web interface, which now enables prioritization of key TFs for a physiological condition depicted by a set of user-input transcriptional responsive genes. With the significant expansion in the database size and inclusion of the new web tool for TF prioritization, we believe that TRRUST v2 will be a versatile database for the study of the transcriptional regulation involved in human diseases.
Collapse
Affiliation(s)
- Heonjong Han
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Jae-Won Cho
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Sangyoung Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Ayoung Yun
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Hyojin Kim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Dasom Bae
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Sunmo Yang
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Chan Yeong Kim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Muyoung Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Eunbeen Kim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Sungho Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Byunghee Kang
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Dabin Jeong
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Yaeji Kim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Hyeon-Nae Jeon
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Haein Jung
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Sunhwee Nam
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Michael Chung
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| | - Jong-Hoon Kim
- Department of Biotechnology, College of Life Science and Biotechnology, Korea University, Seoul 02841, Korea
| | - Insuk Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea
| |
Collapse
|
43
|
Li J, Shi L, Zhang K, Zhang Y, Hu S, Zhao T, Teng H, Li X, Jiang Y, Ji L, Sun Z. VarCards: an integrated genetic and clinical database for coding variants in the human genome. Nucleic Acids Res 2018; 46:D1039-D1048. [PMID: 29112736 PMCID: PMC5753295 DOI: 10.1093/nar/gkx1039] [Citation(s) in RCA: 141] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 10/16/2017] [Accepted: 10/18/2017] [Indexed: 12/24/2022] Open
Abstract
A growing number of genomic tools and databases were developed to facilitate the interpretation of genomic variants, particularly in coding regions. However, these tools are separately available in different online websites or databases, making it challenging for general clinicians, geneticists and biologists to obtain the first-hand information regarding some particular variants and genes of interest. Starting with coding regions and splice sties, we artificially generated all possible single nucleotide variants (n = 110 154 363) and cataloged all reported insertion and deletions (n = 1 223 370). We then annotated these variants with respect to functional consequences from more than 60 genomic data sources to develop a database, named VarCards (http://varcards.biols.ac.cn/), by which users can conveniently search, browse and annotate the variant- and gene-level implications of given variants, including the following information: (i) functional effects; (ii) functional consequences through different in silico algorithms; (iii) allele frequencies in different populations; (iv) disease- and phenotype-related knowledge; (v) general meaningful gene-level information; and (vi) drug-gene interactions. As a case study, we successfully employed VarCards in interpretation of de novo mutations in autism spectrum disorders. In conclusion, VarCards provides an intuitive interface of necessary information for researchers to prioritize candidate variations and genes.
Collapse
Affiliation(s)
- Jinchen Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410078, China
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
| | - Leisheng Shi
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Kun Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Yi Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Shanshan Hu
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Tingting Zhao
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Huajing Teng
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Xianfeng Li
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Yi Jiang
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
| | - Liying Ji
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Zhongsheng Sun
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
44
|
Darling AL, Liu Y, Oldfield CJ, Uversky VN. Intrinsically Disordered Proteome of Human Membrane-Less Organelles. Proteomics 2017; 18:e1700193. [PMID: 29068531 DOI: 10.1002/pmic.201700193] [Citation(s) in RCA: 123] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 02/10/2017] [Indexed: 11/10/2022]
Abstract
It is recognized now that various proteinaceous membrane-less organelles (PMLOs) are commonly found in cytoplasm, nucleus, and mitochondria of various eukaryotic cells (as well as in the chloroplasts of plant cells). Being different from the "traditional" membrane-encapsulated organelles, such as chloroplasts, endoplasmic reticulum, Golgi apparatus, lysosomes, mitochondria, nucleus, and vacuoles, PMLOs solve the cellular need to facilitate and regulate molecular interactions via reversible and controllable isolation of target molecules in specialized compartments. PMLOs possess liquid-like behavior and are believed to be formed as a result of biological liquid-liquid phase transitions (LLPTs, also known as liquid-liquid phase separation), where an intricate interplay between RNA and intrinsically disordered proteins (IDPs) or hybrid proteins containing ordered domains and intrinsically disordered protein regions (IDPRs) may play an important role. This review analyzes the prevalence of intrinsic disorder in proteins associated with various PMLOs found in human cells and considers some of the functional roles of IDPs/IDPRs in biogenesis of these organelles.
Collapse
Affiliation(s)
- April L Darling
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Yun Liu
- Guangdong Provincial Key Laboratory for Plant Epigenetics, Shenzhen Key Laboratory of Microbial Genetic Engineering, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, Guangdong, P. R. China
| | | | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.,Institute for Biological Instrumentation, Russian Academy of Sciences, Moscow Region, Russia
| |
Collapse
|
45
|
Mori M, Haskell G, Kazi Z, Zhu X, DeArmey SM, Goldstein JL, Bali D, Rehder C, Cirulli ET, Kishnani PS. Sensitivity of whole exome sequencing in detecting infantile- and late-onset Pompe disease. Mol Genet Metab 2017; 122:189-197. [PMID: 29122469 PMCID: PMC5907499 DOI: 10.1016/j.ymgme.2017.10.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 10/13/2017] [Indexed: 12/30/2022]
Abstract
Pompe disease is a metabolic myopathy with a wide spectrum of clinical presentation. The gold-standard diagnostic test is acid alpha-glucosidase assay on skin fibroblasts, muscle or blood. Identification of two GAA pathogenic variants in-trans is confirmatory. Optimal effectiveness of enzyme replacement therapy hinges on early diagnosis, which is challenging in late-onset form of the disease due to non-specific presentation. Next-generation sequencing-based panels effectively facilitate diagnosis, but the sensitivity of whole-exome sequencing (WES) in detecting pathogenic GAA variants remains unknown. We analyzed WES data from 93 patients with confirmed Pompe disease and GAA genotypes based on PCR/Sanger sequencing. After ensuring that the common intronic variant c.-32-13T>G is not filtered out, whole-exome sequencing identified both GAA pathogenic variants in 77/93 (83%) patients. However, one variant was missed in 14/93 (15%), and both variants were missed in 2/93 (2%). One complex indel leading to a severe phenotype was incorrectly called a nonsynonymous substitution c.-32-13T>C due to misalignment. These results demonstrate that WES may fail to diagnose Pompe disease. Clinicians need to be aware of limitations of WES, and consider tests specific to Pompe disease when WES does not provide a diagnosis in patients with proximal myopathy, progressive respiratory failure or other subtle symptoms.
Collapse
Affiliation(s)
- Mari Mori
- Department of Pediatrics, Warren Alpert Medical School, Brown University, Providence, RI, USA; Department of Pediatrics, Duke University Medical Center, Durham, NC, USA
| | - Gloria Haskell
- Department of Pathology, Duke University Medical Center, Durham, NC, USA
| | - Zoheb Kazi
- Department of Pediatrics, Duke University Medical Center, Durham, NC, USA
| | - Xiaolin Zhu
- Institute for Genomic Medicine, Columbia University, New York, NY, USA
| | | | - Jennifer L Goldstein
- Department of Pediatrics, Duke University Medical Center, Durham, NC, USA; Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Deeksha Bali
- Department of Pathology, Duke University Medical Center, Durham, NC, USA
| | - Catherine Rehder
- Department of Pathology, Duke University Medical Center, Durham, NC, USA
| | | | - Priya S Kishnani
- Department of Pediatrics, Duke University Medical Center, Durham, NC, USA.
| |
Collapse
|
46
|
Delcourt V, Staskevicius A, Salzet M, Fournier I, Roucou X. Small Proteins Encoded by Unannotated ORFs are Rising Stars of the Proteome, Confirming Shortcomings in Genome Annotations and Current Vision of an mRNA. Proteomics 2017. [DOI: 10.1002/pmic.201700058] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Vivian Delcourt
- Department of Biochemistry; Université de Sherbrooke; Quebec Canada
- Univ. Lille, INSERM U1192, Laboratoire Protéomique; Réponse Inflammatoire & Spectrométrie de Masse (PRISM); Lille France
- PROTEO, Quebec Network for Research on Protein Function; Structure, and Engineering; Quebec Canada
| | | | - Michel Salzet
- Univ. Lille, INSERM U1192, Laboratoire Protéomique; Réponse Inflammatoire & Spectrométrie de Masse (PRISM); Lille France
| | - Isabelle Fournier
- Univ. Lille, INSERM U1192, Laboratoire Protéomique; Réponse Inflammatoire & Spectrométrie de Masse (PRISM); Lille France
| | - Xavier Roucou
- Department of Biochemistry; Université de Sherbrooke; Quebec Canada
- PROTEO, Quebec Network for Research on Protein Function; Structure, and Engineering; Quebec Canada
| |
Collapse
|
47
|
WDR26 Haploinsufficiency Causes a Recognizable Syndrome of Intellectual Disability, Seizures, Abnormal Gait, and Distinctive Facial Features. Am J Hum Genet 2017; 101:139-148. [PMID: 28686853 DOI: 10.1016/j.ajhg.2017.06.002] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Accepted: 05/30/2017] [Indexed: 11/21/2022] Open
Abstract
We report 15 individuals with de novo pathogenic variants in WDR26. Eleven of the individuals carry loss-of-function mutations, and four harbor missense substitutions. These 15 individuals comprise ten females and five males, and all have intellectual disability with delayed speech, a history of febrile and/or non-febrile seizures, and a wide-based, spastic, and/or stiff-legged gait. These subjects share a set of common facial features that include a prominent maxilla and upper lip that readily reveal the upper gingiva, widely spaced teeth, and a broad nasal tip. Together, these features comprise a recognizable facial phenotype. We compared these features with those of chromosome 1q41q42 microdeletion syndrome, which typically contains WDR26, and noted that clinical features are consistent between the two subsets, suggesting that haploinsufficiency of WDR26 contributes to the pathology of 1q41q42 microdeletion syndrome. Consistent with this, WDR26 loss-of-function single-nucleotide mutations identified in these subjects lead to nonsense-mediated decay with subsequent reduction of RNA expression and protein levels. We derived a structural model of WDR26 and note that missense variants identified in these individuals localize to highly conserved residues of this WD-40-repeat-containing protein. Given that WDR26 mutations have been identified in ∼1 in 2,000 of subjects in our clinical cohorts and that WDR26 might be poorly annotated in exome variant-interpretation pipelines, we would anticipate that this disorder could be more common than currently appreciated.
Collapse
|
48
|
Eppig JT. Mouse Genome Informatics (MGI) Resource: Genetic, Genomic, and Biological Knowledgebase for the Laboratory Mouse. ILAR J 2017; 58:17-41. [PMID: 28838066 PMCID: PMC5886341 DOI: 10.1093/ilar/ilx013] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 03/14/2017] [Accepted: 03/28/2017] [Indexed: 12/13/2022] Open
Abstract
The Mouse Genome Informatics (MGI) Resource supports basic, translational, and computational research by providing high-quality, integrated data on the genetics, genomics, and biology of the laboratory mouse. MGI serves a strategic role for the scientific community in facilitating biomedical, experimental, and computational studies investigating the genetics and processes of diseases and enabling the development and testing of new disease models and therapeutic interventions. This review describes the nexus of the body of growing genetic and biological data and the advances in computer technology in the late 1980s, including the World Wide Web, that together launched the beginnings of MGI. MGI develops and maintains a gold-standard resource that reflects the current state of knowledge, provides semantic and contextual data integration that fosters hypothesis testing, continually develops new and improved tools for searching and analysis, and partners with the scientific community to assure research data needs are met. Here we describe one slice of MGI relating to the development of community-wide large-scale mutagenesis and phenotyping projects and introduce ways to access and use these MGI data. References and links to additional MGI aspects are provided.
Collapse
Affiliation(s)
- Janan T. Eppig
- Janan T. Eppig, PhD, is Professor Emeritus at The Jackson Laboratory in Bar Harbor, Maine
| |
Collapse
|
49
|
Catharina L, Lima CR, Franca A, Guimarães ACR, Alves-Ferreira M, Tuffery P, Derreumaux P, Carels N. A Computational Methodology to Overcome the Challenges Associated With the Search for Specific Enzyme Targets to Develop Drugs Against Leishmania major. Bioinform Biol Insights 2017. [PMID: 28638238 PMCID: PMC5470852 DOI: 10.1177/1177932217712471] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
We present an approach for detecting enzymes that are specific of Leishmania major compared with Homo sapiens and provide targets that may assist research in drug development. This approach is based on traditional techniques of sequence homology comparison by similarity search and Markov modeling; it integrates the characterization of enzymatic functionality, secondary and tertiary protein structures, protein domain architecture, and metabolic environment. From 67 enzymes represented by 42 enzymatic activities classified by AnEnPi (Analogous Enzymes Pipeline) as specific for L major compared with H sapiens, only 40 (23 Enzyme Commission [EC] numbers) could actually be considered as strictly specific of L major and 27 enzymes (19 EC numbers) were disregarded for having ambiguous homologies or analogies with H sapiens. Among the 40 strictly specific enzymes, we identified sterol 24-C-methyltransferase, pyruvate phosphate dikinase, trypanothione synthetase, and RNA-editing ligase as 4 essential enzymes for L major that may serve as targets for drug development.
Collapse
Affiliation(s)
- Larissa Catharina
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Carlyle Ribeiro Lima
- Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique (UPR 9080), Centre National de la Recherche Scientifique (CNRS), Université Paris 7, Paris, France.,Molécules Thérapeutiques in silico (UMR-S 973), Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Alexander Franca
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz (IOC), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Ana Carolina Ramos Guimarães
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz (IOC), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Marcelo Alves-Ferreira
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Pierre Tuffery
- Molécules Thérapeutiques in silico (UMR-S 973), Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Philippe Derreumaux
- Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique (UPR 9080), Centre National de la Recherche Scientifique (CNRS), Université Paris 7, Paris, France
| | - Nicolas Carels
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| |
Collapse
|
50
|
Steward CA, Parker APJ, Minassian BA, Sisodiya SM, Frankish A, Harrow J. Genome annotation for clinical genomic diagnostics: strengths and weaknesses. Genome Med 2017; 9:49. [PMID: 28558813 PMCID: PMC5448149 DOI: 10.1186/s13073-017-0441-1] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identification of genetic disorders through the use of clinical exome sequencing. However, in a considerable number of patients, the genetic basis remains unclear. As clinicians begin to consider whole-genome sequencing, an understanding of the processes and tools involved and the factors to consider in the annotation of the structure and function of genomic elements that might influence variant identification is crucial. Here, we discuss and illustrate the strengths and weaknesses of approaches for the annotation and classification of important elements of protein-coding genes, other genomic elements such as pseudogenes and the non-coding genome, comparative-genomic approaches for inferring gene function, and new technologies for aiding genome annotation, as a practical guide for clinicians when considering pathogenic sequence variation. Complete and accurate annotation of structure and function of genome features has the potential to reduce both false-negative (from missing annotation) and false-positive (from incorrect annotation) errors in causal variant identification in exome and genome sequences. Re-analysis of unsolved cases will be necessary as newer technology improves genome annotation, potentially improving the rate of diagnosis.
Collapse
Affiliation(s)
- Charles A Steward
- Congenica Ltd, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1DR, UK. .,The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | - Berge A Minassian
- Department of Pediatrics (Neurology), University of Texas Southwestern, Dallas, TX, USA.,Program in Genetics and Genome Biology and Department of Paediatrics (Neurology), The Hospital for Sick Children and University of Toronto, Toronto, Canada
| | - Sanjay M Sisodiya
- Department of Clinical and Experimental Epilepsy, UCL Institute of Neurology, London, WC1N 3BG, UK.,Chalfont Centre for Epilepsy, Chesham Lane, Chalfont St Peter, Buckinghamshire, SL9 0RJ, UK
| | - Adam Frankish
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jennifer Harrow
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Illumina Inc, Great Chesterford, Essex, CB10 1XL, UK
| |
Collapse
|