1
|
Iyyappan OR, Manoharan S. Finding Gene Associations by Text Mining and Annotating it with Gene Ontology. Methods Mol Biol 2022; 2496:71-90. [PMID: 35713859 DOI: 10.1007/978-1-0716-2305-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Digitalization of the research articles and their maintenance in a database was the first stage toward the development of biomedical research. With the large amounts of research being published daily, it has created a large gap in accessing all the articles for review to a given problem. To understand any biological process, an insight into the role of each element in the genome is essential. But with this gap in manual curation of literature, there are chances that important biological information may be lost. Hence, text mining plays an important role in bridging this gap and extracting important biological information from the text, finding associations among them and predicting annotations. An annotation may be gene, gene products, gene names, their physical and functional characteristics, and so on. The process of annotations may be classified as structural annotation, functional annotation, and relational annotation. In this chapter, a basic protocol utilizing text mining to extract biological information and predict their functional role based on Gene Ontology is provided.
Collapse
Affiliation(s)
- Oviya Ramalakshmi Iyyappan
- Department of Sciences, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Chennai, Tamilnadu, India.
| | - Sharanya Manoharan
- Department of Bioinformatics, Stella Maris College (Autonomous), Chennai, Tamilnadu, India
| |
Collapse
|
2
|
McCormick EM, Lott MT, Dulik MC, Shen L, Attimonelli M, Vitale O, Karaa A, Bai R, Pineda-Alvarez DE, Singh LN, Stanley CM, Wong S, Bhardwaj A, Merkurjev D, Mao R, Sondheimer N, Zhang S, Procaccio V, Wallace DC, Gai X, Falk MJ. Specifications of the ACMG/AMP standards and guidelines for mitochondrial DNA variant interpretation. Hum Mutat 2020; 41:2028-2057. [PMID: 32906214 DOI: 10.1002/humu.24107] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 08/20/2020] [Accepted: 08/28/2020] [Indexed: 12/12/2022]
Abstract
Mitochondrial DNA (mtDNA) variant pathogenicity interpretation has special considerations given unique features of the mtDNA genome, including maternal inheritance, variant heteroplasmy, threshold effect, absence of splicing, and contextual effects of haplogroups. Currently, there are insufficient standardized criteria for mtDNA variant assessment, which leads to inconsistencies in clinical variant pathogenicity reporting. An international working group of mtDNA experts was assembled within the Mitochondrial Disease Sequence Data Resource Consortium and obtained Expert Panel status from ClinGen. This group reviewed the 2015 American College of Medical Genetics and Association of Molecular Pathology standards and guidelines that are widely used for clinical interpretation of DNA sequence variants and provided further specifications for additional and specific guidance related to mtDNA variant classification. These Expert Panel consensus specifications allow for consistent consideration of the unique aspects of the mtDNA genome that directly influence variant assessment, including addressing mtDNA genome composition and structure, haplogroups and phylogeny, maternal inheritance, heteroplasmy, and functional analyses unique to mtDNA, as well as specifications for utilization of mtDNA genomic databases and computational algorithms.
Collapse
Affiliation(s)
- Elizabeth M McCormick
- Mitochondrial Medicine Frontier Program, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Marie T Lott
- Center for Mitochondrial and Epigenomic Medicine, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Matthew C Dulik
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.,Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Lishuang Shen
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California, USA
| | - Marcella Attimonelli
- Department of Biosciences, Biotechnology, and Biopharmaceutics, University of Bari "A. Moro", Bari, Italy
| | - Ornella Vitale
- Department of Biosciences, Biotechnology, and Biopharmaceutics, University of Bari "A. Moro", Bari, Italy
| | - Amel Karaa
- Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | | | | | - Larry N Singh
- Center for Mitochondrial and Epigenomic Medicine, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Christine M Stanley
- Variantyx, Inc, Framingham, Massachusetts, USA.,QNA Diagnostics, Cambridge, Massachusetts, USA
| | | | - Anshu Bhardwaj
- CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Daria Merkurjev
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California, USA
| | - Rong Mao
- ARUP Institute for Clinical and Experimental Pathology, ARUP Laboratories, Salt Lake City, Utah, USA.,Department of Pathology, University of Utah, Salt Lake City, Utah, USA
| | - Neal Sondheimer
- Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Shiping Zhang
- Center for Mitochondrial and Epigenomic Medicine, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Vincent Procaccio
- Department of Biochemistry and Genetics, MitoVasc Institute, UMR CNRS 6015- INSERM U1083, CHU Angers, Angers, France
| | - Douglas C Wallace
- Center for Mitochondrial and Epigenomic Medicine, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Xiaowu Gai
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California, USA.,Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Marni J Falk
- Mitochondrial Medicine Frontier Program, Division of Human Genetics, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.,Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
3
|
Vitale O, Preste R, Palmisano D, Attimonelli M. A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information. Mol Genet Genomic Med 2019; 8:e1085. [PMID: 31821723 PMCID: PMC7005629 DOI: 10.1002/mgg3.1085] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 10/24/2019] [Accepted: 10/30/2019] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different clinical expressions and phenotypes. Although a large part of the observed variations is fixed in a population and hence expected to be benign, the estimation of the degree of the pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal. METHODS In this scenario, the establishment of standard criteria based on functional studies is required. In this context, a "data and text mining" pipeline is proposed here, developed using the programming language R, capable of extracting information regarding mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database. RESULTS The data mining pipeline has produced a list of 1,073 Pubmed IDs (PMIDs) from which the text mining pipeline has retrieved information on 932 human mitochondrial variants regarding experimental validation and clinical features. CONCLUSIONS The application of the pipeline will contribute to supporting the interpretation of pathogenicity of human mitochondrial variants by facilitating diagnosis to clinicians and researchers faced with this task.
Collapse
Affiliation(s)
- Ornella Vitale
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Bari, Italy
| | - Roberto Preste
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Bari, Italy
| | - Donato Palmisano
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Bari, Italy
| | - Marcella Attimonelli
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Bari, Italy
| |
Collapse
|