1
|
Segal MM, George R, Waltman P, El-Hattab AW, James KN, Stanley V, Gleeson J. Clinician-centric diagnosis of rare genetic diseases: performance of a gene pertinence metric in decision support for clinicians. Orphanet J Rare Dis 2020; 15:191. [PMID: 32698834 PMCID: PMC7374885 DOI: 10.1186/s13023-020-01461-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 07/07/2020] [Indexed: 12/30/2022] Open
Abstract
Background In diagnosis of rare genetic diseases we face a decision as to the degree to which the sequencing lab offers one or more diagnoses based on clinical input provided by the clinician, or the clinician reaches a diagnosis based on the complete set of variants provided by the lab. We tested a software approach to assist the clinician in making the diagnosis based on clinical findings and an annotated genomic variant table, using cases already solved using less automated processes. Results For the 81 cases studied (involving 216 individuals), 70 had genetic abnormalities with phenotypes previously described in the literature, and 11 were not described in the literature at the time of analysis (“discovery genes”). These included cases beyond a trio, including ones with different variants in the same gene. In 100% of cases the abnormality was recognized. Of the 70, the abnormality was ranked #1 in 94% of cases, with an average rank 1.1 for all cases. Large CNVs could be analyzed in an integrated analysis, performed in 24 of the cases. The process is rapid enough to allow for periodic reanalysis of unsolved cases. Conclusions A clinician-friendly environment for clinical correlation can be provided to clinicians who are best positioned to have the clinical information needed for this interpretation.
Collapse
Affiliation(s)
| | - Renee George
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.,Rady Children's Institute for Genomic Medicine, Rady Children's Hospital, San Diego, CA, USA
| | - Peter Waltman
- Rockefeller University, New York, NY, USA.,current address Department of Systems Biology, Columbia University, New York, NY, USA
| | - Ayman W El-Hattab
- Department of Clinical Sciences, College of Medicine, University of Sharjah, Sharjah, United Arab Emirates
| | - Kiely N James
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.,Rady Children's Institute for Genomic Medicine, Rady Children's Hospital, San Diego, CA, USA
| | - Valentina Stanley
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.,Rady Children's Institute for Genomic Medicine, Rady Children's Hospital, San Diego, CA, USA
| | - Joseph Gleeson
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.,Rady Children's Institute for Genomic Medicine, Rady Children's Hospital, San Diego, CA, USA
| |
Collapse
|
2
|
Waltman P, Kacmarczyk T, Bate AR, Kearns DB, Reiss DJ, Eichenberger P, Bonneau R. Multi-species integrative biclustering. Genome Biol 2010; 11:R96. [PMID: 20920250 PMCID: PMC2965388 DOI: 10.1186/gb-2010-11-9-r96] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2010] [Revised: 09/19/2010] [Accepted: 09/29/2010] [Indexed: 12/22/2022] Open
Abstract
We describe an algorithm, multi-species cMonkey, for the simultaneous biclustering of heterogeneous multiple-species data collections and apply the algorithm to a group of bacteria containing Bacillus subtilis, Bacillus anthracis, and Listeria monocytogenes. The algorithm reveals evolutionary insights into the surprisingly high degree of conservation of regulatory modules across these three species and allows data and insights from well-studied organisms to complement the analysis of related but less well studied organisms.
Collapse
Affiliation(s)
- Peter Waltman
- Computer Science Department, Warren Weaver Hall (Room 305), 251 Mercer Street, New York, NY 10012, USA.
| | | | | | | | | | | | | |
Collapse
|
3
|
Waltman P, Blumer A, Kaplan D. FiberID-A technique to identify fibrous protein subclasses. Proteins 2006; 66:127-35. [PMID: 17039548 DOI: 10.1002/prot.21128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Fibrous proteins such as collagen, silk, and elastin play critical biological roles, yet they have been the subject of few projects that use computational techniques to predict either their class or their structure. In this article, we present FiberID, a simple yet effective method for identifying and distinguishing three fibrous protein subclasses from their primary sequences. Using a combination of amino acid composition and fast Fourier measurements, FiberID can classify fibrous proteins belonging to these subclasses with high accuracy by using two standard machine learning techniques (decision trees and Naïve Bayesian classifiers). After presenting our results, we present several fibrous sequences that are regularly misclassified by FiberID as sequences of potential interest for further study. Finally, we analyze the decision trees developed by FiberID for potential insights regarding the structure of these proteins.
Collapse
Affiliation(s)
- Peter Waltman
- Department of Computer Science, Tufts University, Medford, Massachusetts 02155, USA
| | | | | |
Collapse
|
4
|
Abstract
Comprehensive, systematic and integrated data-centric statistical approaches to disease modeling can provide powerful frameworks for understanding disease etiology. Here, one such computational framework based on redescription mining in both its incarnations, static and dynamic, is discussed. The static framework provides bioinformatic tools applicable to multifaceted datasets, containing genetic, transcriptomic, proteomic, and clinical data for diseased patients and normal subjects. The dynamic redescription framework provides systems biology tools to model complex sets of regulatory, metabolic and signaling pathways in the initiation and progression of a disease. As an example, the case of chronic fatigue syndrome (CFS) is considered, which has so far remained intractable and unpredictable in its etiology and nosology. The redescription mining approaches can be applied to the Centers for Disease Control and Prevention's Wichita (KS, USA) dataset, integrating transcriptomic, epidemiological and clinical data, and can also be used to study how pathways in the hypothalamic-pituitary-adrenal axis affect CFS patients.
Collapse
Affiliation(s)
- Peter Waltman
- New York University, Courant Institute of Mathematical Sciences, 715 Broadway, New York, NY 10003, USA.
| | | | | |
Collapse
|
5
|
Abstract
The chemostat is a basic model for competition in an open system and a model for the laboratory bio-reactor (CSTR). Inhibitors in open systems are studied with a view of detoxification in natural systems and of control in bio-reactors. This study allows the amount of resource devoted to inhibitor production to depend on the state of the system. The feasibility of one dependence is provided by quorum sensing. In contrast to the constant allocation case, a much wider set of outcomes is possible including interior, stable rest points and stable limit cycles. These outcomes are important contrasts to competitive exclusion or bistable attractors that are often the outcomes for competitive systems. The model consists of four non-linear ordinary differential equations and computer software is used for most of the stability calculations.
Collapse
Affiliation(s)
- J P Braselton
- Department of Mathematics, Georgia Southern University, Statesboro, GA 30460, USA
| | | |
Collapse
|
6
|
Abstract
The study considers two organisms competing for a nutrient in an open system in the presence of an inhibitor (or toxicant). The inhibitor is input at a constant rate and is lethal to one competitor while being taken up by the other without harm. This is in contrast to previous studies, where the inhibitor decreases the reproductive rate of one of the organisms. The mathematical result of the lethal effect, modeled by a mass action term, is that the system cannot be reduced to a monotone dynamical system of one order lower as is common with chemostat-like problems. The model is described by four non-linear, ordinary differential equations and we seek to describe the asymptotic behavior as a function of the parameters of the system. Several global exclusion results are presented with mathematical proofs. However, in the case of coexistence, oscillatory behavior is possible and the study proceeds with numerical examples. The model is relevant to bioremediation problems in nature and to laboratory bio-reactors.
Collapse
Affiliation(s)
- S B Hsu
- Department of Mathematics, National Tsing Hua University, Hsinchu, Taiwan, ROC
| | | | | |
Collapse
|
7
|
|
8
|
Smith HL, Waltman P. The gradostat: A model of competition along a nutrient gradient. Microb Ecol 1991; 22:207-226. [PMID: 24194337 DOI: 10.1007/bf02540224] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/1990] [Revised: 03/11/1991] [Indexed: 06/02/2023]
Abstract
The general mathematical theory of the gradostat is presented for two competitors. The gradostat provides a mechanism for studying competition along a nutrient gradient. In the two vessel case, the results are complete and the conditions are testable. In then-vessel case, the relevant conditions are stated in terms of the stability modulii of certain matrices and are testable for any specific case.
Collapse
Affiliation(s)
- H L Smith
- Department of Mathematics, Arizona State University, 85287-1804, Tempe, Arizona
| | | |
Collapse
|
9
|
Hoppensteadt F, Lauffenburger DA, Waltman P. Editorial. Microb Ecol 1991; 22:109-110. [PMID: 24194330 DOI: 10.1007/bf02540217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Affiliation(s)
- F Hoppensteadt
- College of Natural Science, Michigan State University, 48824, East Lansing, Michigan, USA
| | | | | |
Collapse
|
10
|
Morgan BJT, Waltman P. Competition Models in Population Biology. Biometrics 1984. [DOI: 10.2307/2531181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
11
|
|
12
|
|
13
|
Hethcote HW, Waltman P. Theoretical determination of optimal treatment schedules for radiation therapy. Radiat Res 1973; 56:150-61. [PMID: 4743722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|