1
|
Pretorius E, Kell DB. A Perspective on How Fibrinaloid Microclots and Platelet Pathology May be Applied in Clinical Investigations. Semin Thromb Hemost 2024; 50:537-551. [PMID: 37748515 PMCID: PMC11105946 DOI: 10.1055/s-0043-1774796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Microscopy imaging has enabled us to establish the presence of fibrin(ogen) amyloid (fibrinaloid) microclots in a range of chronic, inflammatory diseases. Microclots may also be induced by a variety of purified substances, often at very low concentrations. These molecules include bacterial inflammagens, serum amyloid A, and the S1 spike protein of severe acute respiratory syndrome coronavirus 2. Here, we explore which of the properties of these microclots might be used to contribute to differential clinical diagnoses and prognoses of the various diseases with which they may be associated. Such properties include distributions in their size and number before and after the addition of exogenous thrombin, their spectral properties, the diameter of the fibers of which they are made, their resistance to proteolysis by various proteases, their cross-seeding ability, and the concentration dependence of their ability to bind small molecules including fluorogenic amyloid stains. Measuring these microclot parameters, together with microscopy imaging itself, along with methodologies like proteomics and imaging flow cytometry, as well as more conventional assays such as those for cytokines, might open up the possibility of a much finer use of these microclot properties in generative methods for a future where personalized medicine will be standard procedures in all clotting pathology disease diagnoses.
Collapse
Affiliation(s)
- Etheresia Pretorius
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Stellenbosch, Matieland, South Africa
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, United Kingdom
| | - Douglas B. Kell
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Stellenbosch, Matieland, South Africa
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, United Kingdom
- The Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
2
|
Su P, Luo Y, Huang Y, Akhatayeva Z, Xin D, Guo Z, Pan C, Zhang Q, Xu H, Lan X. Short variation of the sheep PDGFD gene is correlated with litter size. Gene X 2022; 844:146797. [PMID: 35985413 DOI: 10.1016/j.gene.2022.146797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 08/02/2022] [Accepted: 08/05/2022] [Indexed: 11/24/2022] Open
Abstract
Platelet-derived growth factor (PDGF) family, exert plays a key role in embryonic development, cell proliferation, cell migration, angiogenesis and reproduction. Related studies about GWAS analyses have found that PDGFD significantly affected deposition of tail fat in sheep, but there are no studies on reproduction in animals. In this study, three breed of sheep were used to find insertion/deletion (indel) fragment polymorphism of PDGFD which including Australian white (AUW) sheep (Meat type, n = 932), Guiqian semi-fine wool (GSFW) sheep (wool type, n = 60) and East Friensian milk (EFM) sheep (dairy type, n = 60). Only a 18-bp variation was polymorphic in the study AUW sheep population and the genotypes of different sheep breed are also specific. Moreover, the association analysis indicated that this variant was associated with litter size of AUW sheep in the first parity (p < 0.05). The litter size of II genotype was significantly lower than other genotypes in the first parity (p < 0.05). We also revealed that the PDGFD gene was relatively conservative in eight species, PDGFD mRNA expression in 832 sheep samples implying this gene was related to reproduction traits. Hence, these finding demonstrated the one-cause multipotency of PDGFD gene. Collectively, these results suggest that this indel can be used as an effective marker for sheep breeding.
Collapse
Affiliation(s)
- Peng Su
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China; Tianjin Aoqun Animal Husbandry Co.Ltd., Tianjin 301607, China.
| | - Yunyun Luo
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| | - Yangming Huang
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China; Tianjin Aoqun Animal Husbandry Co.Ltd., Tianjin 301607, China.
| | - Zhanerke Akhatayeva
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| | - Dongyun Xin
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| | - Zhengang Guo
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| | - Chuanying Pan
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| | - Qingfeng Zhang
- Tianjin Aoqun Sheep Industry Academy Company, Tianjin 300000, China; Tianjin Aoqun Animal Husbandry Co.Ltd., Tianjin 301607, China.
| | - Hongwei Xu
- College of Life Science and Engineering, Northwest Minzu University, Lanzhou 730030, China.
| | - Xianyong Lan
- Key Laboratory of Animal Genetics Breeding and Reproduction of Shanxi Province, College Animal Science and Technology, Northwest A&F University, Yangling, Shanxi 712100, China.
| |
Collapse
|
3
|
Yang CH, Scarpino SV. A Family of Fitness Landscapes Modeled through Gene Regulatory Networks. ENTROPY (BASEL, SWITZERLAND) 2022; 24:622. [PMID: 35626507 PMCID: PMC9141513 DOI: 10.3390/e24050622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 04/11/2022] [Accepted: 04/26/2022] [Indexed: 02/01/2023]
Abstract
Fitness landscapes are a powerful metaphor for understanding the evolution of biological systems. These landscapes describe how genotypes are connected to each other through mutation and related through fitness. Empirical studies of fitness landscapes have increasingly revealed conserved topographical features across diverse taxa, e.g., the accessibility of genotypes and "ruggedness". As a result, theoretical studies are needed to investigate how evolution proceeds on fitness landscapes with such conserved features. Here, we develop and study a model of evolution on fitness landscapes using the lens of Gene Regulatory Networks (GRNs), where the regulatory products are computed from multiple genes and collectively treated as phenotypes. With the assumption that regulation is a binary process, we prove the existence of empirically observed, topographical features such as accessibility and connectivity. We further show that these results hold across arbitrary fitness functions and that a trade-off between accessibility and ruggedness need not exist. Then, using graph theory and a coarse-graining approach, we deduce a mesoscopic structure underlying GRN fitness landscapes where the information necessary to predict a population's evolutionary trajectory is retained with minimal complexity. Using this coarse-graining, we develop a bottom-up algorithm to construct such mesoscopic backbones, which does not require computing the genotype network and is therefore far more efficient than brute-force approaches. Altogether, this work provides mathematical results of high-dimensional fitness landscapes and a path toward connecting theory to empirical studies.
Collapse
Affiliation(s)
- Chia-Hung Yang
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
| | - Samuel V. Scarpino
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
- Physics Department, Northeastern University, Boston, MA 02115, USA
- Roux Institute, Northeastern University, Boston, MA 02115, USA
- Institute for Experiential AI, Northeastern University, Boston, MA 02115, USA
- Santa Fe Institute, Santa Fe, NM 87501, USA
- Vermont Complex Systems Center, University of Vermont, Burlington, VT 05405, USA
| |
Collapse
|
4
|
Baker RL, Leong WF, Welch S, Weinig C. Mapping and Predicting Non-Linear Brassica rapa Growth Phenotypes Based on Bayesian and Frequentist Complex Trait Estimation. G3 (BETHESDA, MD.) 2018; 8:1247-1258. [PMID: 29467188 PMCID: PMC5873914 DOI: 10.1534/g3.117.300350] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Accepted: 02/08/2018] [Indexed: 12/23/2022]
Abstract
Predicting phenotypes based on genotypes and understanding the effects of complex multi-locus traits on plant performance requires a description of the underlying developmental processes, growth trajectories, and their genomic architecture. Using data from Brassica rapa genotypes grown in multiple density settings and seasons, we applied a hierarchical Bayesian Function-Valued Trait (FVT) approach to fit logistic growth curves to leaf phenotypic data (length and width) and characterize leaf development. We found evidence of genetic variation in phenotypic plasticity of rate and duration of leaf growth to growing season. In contrast, the magnitude of the plastic response for maximum leaf size was relatively small, suggesting that growth dynamics vs. final leaf sizes have distinct patterns of environmental sensitivity. Consistent with patterns of phenotypic plasticity, several QTL-by-year interactions were significant for parameters describing leaf growth rates and durations but not leaf size. In comparison to frequentist approaches for estimating leaf FVT, Bayesian trait estimation resulted in more mapped QTL that tended to have greater average LOD scores and to explain a greater proportion of trait variance. We then constructed QTL-based predictive models for leaf growth rate and final size using data from one treatment (uncrowded plants in one growing season). Models successfully predicted non-linear developmental phenotypes for genotypes not used in model construction and, due to a lack of QTL-by-treatment interactions, predicted phenotypes across sites differing in plant density.
Collapse
Affiliation(s)
- R L Baker
- Department of Biology, Miami University, Oxford, OH 45056,
| | - W F Leong
- Department of Agronomy, Kansas State University, Manhattan, KS 66506
| | - S Welch
- Department of Agronomy, Kansas State University, Manhattan, KS 66506
| | - C Weinig
- Department of Molecular Biology and
- Department of Botany, University of Wyoming, Laramie, WY 82071
| |
Collapse
|
5
|
Kell DB. Evolutionary algorithms and synthetic biology for directed evolution: commentary on "on the mapping of genotype to phenotype in evolutionary algorithms" by Peter A. Whigham, Grant Dick, and James Maclaurin. GENETIC PROGRAMMING AND EVOLVABLE MACHINES 2017; 18:373-378. [PMID: 29033669 PMCID: PMC5618731 DOI: 10.1007/s10710-017-9292-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
I rehearse two issues around the commentary of Whigham and colleagues. (1) There really are many more reasons than those given as to why natural evolution cannot reasonably find or select the 'optimal' individual. (2) A series of experimental molecular biology programmes, known generically as directed evolution, can use operators and selection schemes that natural evolution cannot. When developed further using the methods of synthetic biology, there are no operators or schemes for in silico evolution that cannot be applied precisely to directed evolution. The issues raised apply only to natural evolution but not to directed evolution.
Collapse
Affiliation(s)
- Douglas B. Kell
- School of Chemistry, The University of Manchester, 131, Princess St, Manchester, Lancs, M1 7DN UK
- The Manchester Institute of Biotechnology, The University of Manchester, 131, Princess St, Manchester, Lancs, M1 7DN UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals, The University of Manchester, 131, Princess St, Manchester, Lancs, M1 7DN UK
| |
Collapse
|
6
|
Yang H, Li S, Cao H, Zhang C, Cui Y. Predicting disease trait with genomic data: a composite kernel approach. Brief Bioinform 2016; 18:591-601. [DOI: 10.1093/bib/bbw043] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Indexed: 01/17/2023] Open
|
7
|
Currin A, Swainston N, Day PJ, Kell DB. Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem Soc Rev 2015; 44:1172-239. [PMID: 25503938 PMCID: PMC4349129 DOI: 10.1039/c4cs00351a] [Citation(s) in RCA: 251] [Impact Index Per Article: 27.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Indexed: 12/21/2022]
Abstract
The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the 'search space' of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (Kd) and catalytic (kcat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving kcat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the 'best' amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust.
Collapse
Affiliation(s)
- Andrew Currin
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| | - Neil Swainston
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- School of Computer Science , The University of Manchester , Manchester M13 9PL , UK
| | - Philip J. Day
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- Faculty of Medical and Human Sciences , The University of Manchester , Manchester M13 9PT , UK
| | - Douglas B. Kell
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| |
Collapse
|
8
|
Nonequilibrium population dynamics of phenotype conversion of cancer cells. PLoS One 2014; 9:e110714. [PMID: 25438251 PMCID: PMC4249833 DOI: 10.1371/journal.pone.0110714] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2014] [Accepted: 09/20/2014] [Indexed: 11/19/2022] Open
Abstract
Tumorigenesis is a dynamic biological process that involves distinct cancer cell subpopulations proliferating at different rates and interconverting between them. In this paper we proposed a mathematical framework of population dynamics that considers both distinctive growth rates and intercellular transitions between cancer cell populations. Our mathematical framework showed that both growth and transition influence the ratio of cancer cell subpopulations but the latter is more significant. We derived the condition that different cancer cell types can maintain distinctive subpopulations and we also explain why there always exists a stable fixed ratio after cell sorting based on putative surface markers. The cell fraction ratio can be shifted by changing either the growth rates of the subpopulations (Darwinism selection) or by environment-instructed transitions (Lamarckism induction). This insight can help us to understand the dynamics of the heterogeneity of cancer cells and lead us to new strategies to overcome cancer drug resistance.
Collapse
|
9
|
|
10
|
Fukushima A, Kanaya S, Nishida K. Integrated network analysis and effective tools in plant systems biology. FRONTIERS IN PLANT SCIENCE 2014; 5:598. [PMID: 25408696 PMCID: PMC4219401 DOI: 10.3389/fpls.2014.00598] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Accepted: 10/14/2014] [Indexed: 05/18/2023]
Abstract
One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms.
Collapse
Affiliation(s)
- Atsushi Fukushima
- RIKEN Center for Sustainable Resource ScienceTsurumi, Yokohama, Japan
- Japan Science and Technology Agency, National Bioscience Database CenterTokyo, Japan
- *Correspondence: Atsushi Fukushima, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehirocho, Tsurumi, Yokohama 230-0045, Japan e-mail:
| | - Shigehiko Kanaya
- Graduate School of Information Science, Nara Institute of Science and TechnologyNara, Japan
| | - Kozo Nishida
- Japan Science and Technology Agency, National Bioscience Database CenterTokyo, Japan
- Laboratory for Biochemical Simulation, RIKEN Quantitative Biology CenterOsaka, Japan
| |
Collapse
|
11
|
Kell DB. Finding novel pharmaceuticals in the systems biology era using multiple effective drug targets, phenotypic screening and knowledge of transporters: where drug discovery went wrong and how to fix it. FEBS J 2013; 280:5957-80. [PMID: 23552054 DOI: 10.1111/febs.12268] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2013] [Revised: 03/20/2013] [Accepted: 03/26/2013] [Indexed: 12/16/2022]
Abstract
Despite the sequencing of the human genome, the rate of innovative and successful drug discovery in the pharmaceutical industry has continued to decrease. Leaving aside regulatory matters, the fundamental and interlinked intellectual issues proposed to be largely responsible for this are: (a) the move from 'function-first' to 'target-first' methods of screening and drug discovery; (b) the belief that successful drugs should and do interact solely with single, individual targets, despite natural evolution's selection for biochemical networks that are robust to individual parameter changes; (c) an over-reliance on the rule-of-5 to constrain biophysical and chemical properties of drug libraries; (d) the general abandoning of natural products that do not obey the rule-of-5; (e) an incorrect belief that drugs diffuse passively into (and presumably out of) cells across the bilayers portions of membranes, according to their lipophilicity; (f) a widespread failure to recognize the overwhelmingly important role of proteinaceous transporters, as well as their expression profiles, in determining drug distribution in and between different tissues and individual patients; and (g) the general failure to use engineering principles to model biology in parallel with performing 'wet' experiments, such that 'what if?' experiments can be performed in silico to assess the likely success of any strategy. These facts/ideas are illustrated with a reasonably extensive literature review. Success in turning round drug discovery consequently requires: (a) decent systems biology models of human biochemical networks; (b) the use of these (iteratively with experiments) to model how drugs need to interact with multiple targets to have substantive effects on the phenotype; (c) the adoption of polypharmacology and/or cocktails of drugs as a desirable goal in itself; (d) the incorporation of drug transporters into systems biology models, en route to full and multiscale systems biology models that incorporate drug absorption, distribution, metabolism and excretion; (e) a return to 'function-first' or phenotypic screening; and (f) novel methods for inferring modes of action by measuring the properties on system variables at all levels of the 'omes. Such a strategy offers the opportunity of achieving a state where we can hope to predict biological processes and the effect of pharmaceutical agents upon them. Consequently, this should both lower attrition rates and raise the rates of discovery of effective drugs substantially.
Collapse
Affiliation(s)
- Douglas B Kell
- School of Chemistry, The University of Manchester, UK; Manchester Institute of Biotechnology, The University of Manchester, UK
| |
Collapse
|
12
|
Abstract
Comparatively few studies have addressed directly the question of quantifying the benefits to be had from using molecular genetic markers in experimental breeding programmes (e.g. for improved crops and livestock), nor the question of which organisms should be mated with each other to best effect. We argue that this requires in silico modelling, an approach for which there is a large literature in the field of evolutionary computation (EC), but which has not really been applied in this way to experimental breeding programmes. EC seeks to optimise measurable outcomes (phenotypic fitnesses) by optimising in silico the mutation, recombination and selection regimes that are used. We review some of the approaches from EC, and compare experimentally, using a biologically relevant in silico landscape, some algorithms that have knowledge of where they are in the (genotypic) search space (G-algorithms) with some (albeit well-tuned ones) that do not (F-algorithms). For the present kinds of landscapes, F- and G-algorithms were broadly comparable in quality and effectiveness, although we recognise that the G-algorithms were not equipped with any ‘prior knowledge’ of epistatic pathway interactions. This use of algorithms based on machine learning has important implications for the optimisation of experimental breeding programmes in the post-genomic era when we shall potentially have access to the full genome sequence of every organism in a breeding population. The non-proprietary code that we have used is made freely available (via Supplementary information).
Collapse
|
13
|
Khan MW, Alam M. A survey of application: genomics and genetic programming, a new frontier. Genomics 2012; 100:65-71. [PMID: 22683715 DOI: 10.1016/j.ygeno.2012.05.014] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2011] [Revised: 05/22/2012] [Accepted: 05/29/2012] [Indexed: 11/15/2022]
Abstract
The aim of this paper is to provide an introduction to the rapidly developing field of genetic programming (GP). Particular emphasis is placed on the application of GP to genomics. First, the basic methodology of GP is introduced. This is followed by a review of applications in the areas of gene network inference, gene expression data analysis, SNP analysis, epistasis analysis and gene annotation. Finally this paper concluded by suggesting potential avenues of possible future research on genetic programming, opportunities to extend the technique, and areas for possible practical applications.
Collapse
Affiliation(s)
- Mohammad Wahab Khan
- Department of Computer Science, Jamia Millia Islamia, Maulana Mohammad Ali Jauhar Marg, New Delhi 110025, India.
| | | |
Collapse
|
14
|
Abstract
There is a general agreement that the development of metabolomics depends not only on advances in chemical analysis techniques but also on advances in computing and data analysis methods. Metabolomics data usually requires intensive pre-processing, analysis, and mining procedures. Selecting and applying such procedures requires attention to issues including justification, traceability, and reproducibility. We describe a strategy for selecting data mining techniques which takes into consideration the goals of data mining techniques on the one hand, and the goals of metabolomics investigations and the nature of the data on the other. The strategy aims to ensure the validity and soundness of results and promote the achievement of the investigation goals.
Collapse
|
15
|
Fisher P, Noyes H, Kemp S, Stevens R, Brass A. A systematic strategy for the discovery of candidate genes responsible for phenotypic variation. Methods Mol Biol 2009; 573:329-345. [PMID: 19763936 DOI: 10.1007/978-1-60761-247-6_18] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
It is increasingly common to combine genome-wide expression data with quantitative trait mapping data to aid in the search for sequence polymorphisms responsible for phenotypic variation. By joining these complex but different data types at the level of the biological pathway, we can take advantage of existing biological knowledge to systematically identify possible mechanisms of genotype-phenotype interaction. With the development of web services and workflows, this process can be made rapid and systematic. Our methodology was applied to a use case of resistance to African trypanosomiasis in mice. Workflows developed in this investigation, including a guide to loading and executing them with example data, are available at http://www.myexperiment.org/users/43/workflows .
Collapse
Affiliation(s)
- Paul Fisher
- School of Computer Science, University of Manchester, Manchester, UK
| | | | | | | | | |
Collapse
|
16
|
Klingenberg CP. Morphological Integration and Developmental Modularity. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2008. [DOI: 10.1146/annurev.ecolsys.37.091305.110054] [Citation(s) in RCA: 537] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
17
|
Wedge DC, Rowe W, Kell DB, Knowles J. In silico modelling of directed evolution: Implications for experimental design and stepwise evolution. J Theor Biol 2008; 257:131-41. [PMID: 19073195 DOI: 10.1016/j.jtbi.2008.11.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2008] [Revised: 11/03/2008] [Accepted: 11/03/2008] [Indexed: 11/26/2022]
Abstract
We model the process of directed evolution (DE) in silico using genetic algorithms. Making use of the NK fitness landscape model, we analyse the effects of mutation rate, crossover and selection pressure on the performance of DE. A range of values of K, the epistatic interaction of the landscape, are considered, and high- and low-throughput modes of evolution are compared. Our findings suggest that for runs of or around ten generations' duration-as is typical in DE-there is little difference between the way in which DE needs to be configured in the high- and low-throughput regimes, nor across different degrees of landscape epistasis. In all cases, a high selection pressure (but not an extreme one) combined with a moderately high mutation rate works best, while crossover provides some benefit but only on the less rugged landscapes. These genetic algorithms were also compared with a "model-based approach" from the literature, which uses sequential fixing of the problem parameters based on fitting a linear model. Overall, we find that purely evolutionary techniques fare better than do model-based approaches across all but the smoothest landscapes.
Collapse
Affiliation(s)
- David C Wedge
- Manchester Interdisciplinary Biocentre, University of Manchester, 131 Princess Street, Manchester, M1 7ND, UK.
| | | | | | | |
Collapse
|
18
|
Abstract
A false discovery occurs when a researcher concludes that a marker is involved in the etiology of the disease whereas in reality it is not. In genetic studies the risk of false discoveries is very high because only few among the many markers that can be tested will have an effect on the disease. In this article, we argue that it may be best to use methods for controlling false discoveries that would introduce the same ratio of false discoveries divided by all rejected tests into the literature regardless of systematic differences between studies. After a brief discussion of traditional "multiple testing" methods, we show that methods that control the false discovery rate (FDR) may be more suitable to achieve this goal. These FDR methods are therefore discussed in more detail. Instead of merely testing for main effects, it may be important to search for gene-environment/covariate interactions, gene-gene interactions or genetic variants affecting disease subtypes. In the second section, we point out the challenges involved in controlling false discoveries in such searches. The final section discusses the role of replication studies for eliminating false discoveries and the complexities associated with the definition of what constitutes a replication and the design of these studies.
Collapse
Affiliation(s)
- Edwin J C G van den Oord
- Center for Biomarker Research and Personalized Medicine, Medical College of Virginia, Virginia Commonwealth University, Richmond, Virginia 23298-0533, USA.
| |
Collapse
|
19
|
Kell DB, Mendes P. The markup is the model: Reasoning about systems biology models in the Semantic Web era. J Theor Biol 2008; 252:538-43. [DOI: 10.1016/j.jtbi.2007.10.023] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2007] [Revised: 10/19/2007] [Accepted: 10/22/2007] [Indexed: 01/09/2023]
|
20
|
Genetic Programming: An Introduction and Tutorial, with a Survey of Techniques and Applications. STUDIES IN COMPUTATIONAL INTELLIGENCE 2008. [DOI: 10.1007/978-3-540-78293-3_22] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
21
|
Tsai LJ, Hsiao SH, Tsai LM, Lin CY, Tsai JJ, Liou DM, Lan JL. The sodium-dependent glucose cotransporter SLC5A11 as an autoimmune modifier gene in SLE. ACTA ACUST UNITED AC 2007; 71:114-26. [PMID: 18069935 DOI: 10.1111/j.1399-0039.2007.00975.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Genetic studies in several human autoimmune diseases suggest that the pericentromeric region of chromosome 16 might harbor an autoimmune modifier gene. We hypothesized that the sodium-dependent glucose cotransporter gene SLC5A11 is such a gene, and so might interact with immune-related genes. Herein, this hypothesis was tested in a genetic evaluation of the multiple gene effect in systemic lupus erythematosus (SLE). We used the case-control candidate gene association approach. Eight immune-related genes involved in inflammation and autoantibody generation and clear-up [interleukin 1 receptor antagonist (IL1RN), interleukin 1-beta (IL1-beta), tumor necrosis factor-alpha (TNF-alpha), lymphotoxin-alpha (LTA), tumor necrosis factor ligand superfamily, member 6 (TNFSF6), programmed cell death 1 (PDCD1), C2, and complement component 4 (C4)] were selected for study. Frequency of each candidate's genotype and allele between case and control were compared. Results were stratified by reanalyzing genotype data with relevant symptoms. Finally, improved computational data mining was used to analyze the phenotypes in a large data set. In the frequency analysis, only IL1-beta was significantly associated with SLE. Stratification analysis showed a significant association with SLE symptoms between SLC5A11 and the other immune-related genes, with the exceptions of TNFSF6 and C4. SLC5A11 was significantly associated with low C4 (as was TNF-alpha), anti-Smith antibody (anti-Sm) (as was C2), serositis, and alopecia. Finally, SLC5A11 interacted with PDCD1, TNF-alpha, LTA, and C4. After our study, we concluded that SLC5A11 is involved with some immune effects and interacts with immune-related gene(s), consistent with its function as an autoimmune modifier gene. Furthermore, SLC5A11 might induce apoptosis through the TNF-alpha, PDCD1 pathway. The present genotype-phenotype mapping approach should be applicable to genetic study of other complex diseases.
Collapse
Affiliation(s)
- L-J Tsai
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
| | | | | | | | | | | | | |
Collapse
|
22
|
|
23
|
Fisher P, Hedeler C, Wolstencroft K, Hulme H, Noyes H, Kemp S, Stevens R, Brass A. A systematic strategy for large-scale analysis of genotype phenotype correlations: identification of candidate genes involved in African trypanosomiasis. Nucleic Acids Res 2007; 35:5625-33. [PMID: 17709344 PMCID: PMC2018629 DOI: 10.1093/nar/gkm623] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
It is increasingly common to combine Microarray and Quantitative Trait Loci data to aid the search for candidate genes responsible for phenotypic variation. Workflows provide a means of systematically processing these large datasets and also represent a framework for the re-use and the explicit declaration of experimental methods. In this article, we highlight the issues facing the manual analysis of microarray and QTL data for the discovery of candidate genes underlying complex phenotypes. We show how automated approaches provide a systematic means to investigate genotype-phenotype correlations. This methodology was applied to a use case of resistance to African trypanosomiasis in the mouse. Pathways represented in the results identified Daxx as one of the candidate genes within the Tir1 QTL region. Subsequent re-sequencing in Daxx identified a deletion of an amino acid, identified in susceptible mouse strains, in the Daxx-p53 protein-binding region. This supports recent experimental evidence that apoptosis could be playing a role in the trypanosomiasis resistance phenotype. Workflows developed in this investigation, including a guide to loading and executing them with example data, are available at http://workflows.mygrid.org.uk/repository/myGrid/PaulFisher/.
Collapse
Affiliation(s)
- Paul Fisher
- School of Computer Science, Kilburn Building, University of Manchester, Oxford Road, Manchester, UK.
| | | | | | | | | | | | | | | |
Collapse
|
24
|
van den Oord E, McClay J, York T, Murrelle L, Robles J. Genetics and diagnostic refinement. Behav Genet 2006; 37:535-45. [PMID: 17160700 DOI: 10.1007/s10519-006-9135-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2006] [Accepted: 11/20/2006] [Indexed: 10/23/2022]
Abstract
For many psychiatric conditions it is speculated that, rather than being single disease entities, they are a set of several disorders sharing clinical features but having (partly) different underlying causes. The possibility of measuring genetic variation on a large scale has given researchers new hope of identifying these disease subtypes that may differ with respect to prognosis, course, and response to treatment. However, although a considerable number of articles have been published suggesting that we may even be on the verge of making genotype-based diagnoses, the reality is that we do not have a good answer to even the most basic question of how measured genes could best be used to refine diagnostic categories. In this article, we show that for common psychiatric disorders, it may not be possible to simply look for similar genetic profiles in groups of patients. Instead, we propose a model assuming that genotypes affect phenotypes through more or less coherent etiological systems or pathogenic processes and argue that these etiological systems may provide a more fruitful basis for defining disease subtypes. Several examples from the literature that support the face validity of different aspects of our model are given. Finally, we argue that, given our limited knowledge of disease etiology, the use of discovery-oriented techniques requiring extensive data collection and (artificial) intelligent computer searches may be imperative, and discuss the prospect of model-based diagnosis to classify etiologically different disease subtypes.
Collapse
Affiliation(s)
- Edwin van den Oord
- Center for Biomarker Research and Personalized Medicine, Department of Pharmacy, Medical College of Virginia, Virginia Commonwealth University, 980533, Richmond, VA 23298-0533, USA.
| | | | | | | | | |
Collapse
|
25
|
Kell DB. Systems biology, metabolic modelling and metabolomics in drug discovery and development. Drug Discov Today 2006; 11:1085-92. [PMID: 17129827 DOI: 10.1016/j.drudis.2006.10.004] [Citation(s) in RCA: 219] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2006] [Revised: 09/25/2006] [Accepted: 10/09/2006] [Indexed: 01/03/2023]
Abstract
Unlike signalling pathways, metabolic networks are subject to strict stoichiometric constraints. Metabolomics amplifies changes in the proteome, and represents more closely the phenotype of an organism. Recent advances enable the production (and computer-readable encoding as SBML) of metabolic network models reconstructed from genome sequences, as well as experimental measurements of much of the metabolome. There is increasing convergence between the number of human metabolites estimated via genomics ( approximately 3000) and the number measured experimentally. It is thus both timely, and now possible, to bring these two approaches together as an integrated (if distributed) whole to help understand the genesis of metabolic biomarkers, the progress of disease, and the modes of action, efficacy, off-target effects and toxicity of pharmaceutical drugs.
Collapse
Affiliation(s)
- Douglas B Kell
- School of Chemistry, Faraday Building, The University of Manchester. PO Box 88, Manchester, M60 1QD, UK.
| |
Collapse
|
26
|
Hoti F, Sillanpää MJ. Bayesian mapping of genotype x expression interactions in quantitative and qualitative traits. Heredity (Edinb) 2006; 97:4-18. [PMID: 16670709 DOI: 10.1038/sj.hdy.6800817] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
A novel Bayesian gene mapping method, which can simultaneously utilize both molecular marker and gene expression data, is introduced. The approach enables a quantitative or qualitative phenotype to be expressed as a linear combination of the marker genotypes, gene expression levels, and possible genotype x gene expression interactions. The interaction data, given as marker-gene pairs, contains possible in cis and in trans effects obtained from earlier allelic expression studies, genetical genomics studies, biological hypotheses, or known pathways. The method is presented for an inbred line cross design and can be easily generalized to handle other types of populations and designs. The model selection is based on the use of effect-specific variance components combined with Jeffreys' non-informative prior--the method operates by adaptively shrinking marker, expression, and interaction effects toward zero so that non-negligible effects are expected to occur only at very few positions. The estimation of the model parameters and the handling of missing genotype or expression data is performed via Markov chain Monte Carlo sampling. The potential of the method including heritability estimation is presented using simulated examples and novel summary statistics. The method is also applied to a real yeast data set with known pathways.
Collapse
Affiliation(s)
- F Hoti
- Department of Mathematics and Statistics, Rolf Nevanlinna Institute, University of Helsinki, FIN-00014 Helsinki, Finland
| | | |
Collapse
|
27
|
Kell DB. Theodor Bücher Lecture. Metabolomics, modelling and machine learning in systems biology - towards an understanding of the languages of cells. Delivered on 3 July 2005 at the 30th FEBS Congress and the 9th IUBMB conference in Budapest. FEBS J 2006; 273:873-94. [PMID: 16478464 DOI: 10.1111/j.1742-4658.2006.05136.x] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The newly emerging field of systems biology involves a judicious interplay between high-throughput 'wet' experimentation, computational modelling and technology development, coupled to the world of ideas and theory. This interplay involves iterative cycles, such that systems biology is not at all confined to hypothesis-dependent studies, with intelligent, principled, hypothesis-generating studies being of high importance and consequently very far from aimless fishing expeditions. I seek to illustrate each of these facets. Novel technology development in metabolomics can increase substantially the dynamic range and number of metabolites that one can detect, and these can be exploited as disease markers and in the consequent and principled generation of hypotheses that are consistent with the data and achieve this in a value-free manner. Much of classical biochemistry and signalling pathway analysis has concentrated on the analyses of changes in the concentrations of intermediates, with 'local' equations - such as that of Michaelis and Menten v=(Vmax x S)/(S+K m) - that describe individual steps being based solely on the instantaneous values of these concentrations. Recent work using single cells (that are not subject to the intellectually unsupportable averaging of the variable displayed by heterogeneous cells possessing nonlinear kinetics) has led to the recognition that some protein signalling pathways may encode their signals not (just) as concentrations (AM or amplitude-modulated in a radio analogy) but via changes in the dynamics of those concentrations (the signals are FM or frequency-modulated). This contributes in principle to a straightforward solution of the crosstalk problem, leads to a profound reassessment of how to understand the downstream effects of dynamic changes in the concentrations of elements in these pathways, and stresses the role of signal processing (and not merely the intermediates) in biological signalling. It is this signal processing that lies at the heart of understanding the languages of cells. The resolution of many of the modern and postgenomic problems of biochemistry requires the development of a myriad of new technologies (and maybe a new culture), and thus regular input from the physical sciences, engineering, mathematics and computer science. One solution, that we are adopting in the Manchester Interdisciplinary Biocentre (http://www.mib.ac.uk/) and the Manchester Centre for Integrative Systems Biology (http://www.mcisb.org/), is thus to colocate individuals with the necessary combinations of skills. Novel disciplines that require such an integrative approach continue to emerge. These include fields such as chemical genomics, synthetic biology, distributed computational environments for biological data and modelling, single cell diagnostics/bionanotechnology, and computational linguistics/text mining.
Collapse
Affiliation(s)
- Douglas B Kell
- School of Chemistry, Faraday Building, The University of Manchester, UK.
| |
Collapse
|
28
|
Vemuri GN, Aristidou AA. Metabolic engineering in the -omics era: elucidating and modulating regulatory networks. Microbiol Mol Biol Rev 2006; 69:197-216. [PMID: 15944454 PMCID: PMC1197421 DOI: 10.1128/mmbr.69.2.197-216.2005] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The importance of regulatory control in metabolic processes is widely acknowledged, and several enquiries (both local and global) are being made in understanding regulation at various levels of the metabolic hierarchy. The wealth of biological information has enabled identifying the individual components (genes, proteins, and metabolites) of a biological system, and we are now in a position to understand the interactions between these components. Since phenotype is the net result of these interactions, it is immensely important to elucidate them not only for an integrated understanding of physiology, but also for practical applications of using biological systems as cell factories. We present some of the recent "-omics" approaches that have expanded our understanding of regulation at the gene, protein, and metabolite level, followed by analysis of the impact of this progress on the advancement of metabolic engineering. Although this review is by no means exhaustive, we attempt to convey our ideology that combining global information from various levels of metabolic hierarchy is absolutely essential in understanding and subsequently predicting the relationship between changes in gene expression and the resulting phenotype. The ultimate aim of this review is to provide metabolic engineers with an overview of recent advances in complementary aspects of regulation at the gene, protein, and metabolite level and those involved in fundamental research with potential hurdles in the path to implementing their discoveries in practical applications.
Collapse
Affiliation(s)
- Goutham N Vemuri
- Center for Molecular BioEngineering, Drifmier Engineering Center, University of Georgia, Athens, 30605, USA
| | | |
Collapse
|
29
|
Teusink B, Smid EJ. Modelling strategies for the industrial exploitation of lactic acid bacteria. Nat Rev Microbiol 2006; 4:46-56. [PMID: 16357860 DOI: 10.1038/nrmicro1319] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Lactic acid bacteria (LAB) have a long tradition of use in the food industry, and the number and diversity of their applications has increased considerably over the years. Traditionally, process optimization for these applications involved both strain selection and trial and error. More recently, metabolic engineering has emerged as a discipline that focuses on the rational improvement of industrially useful strains. In the post-genomic era, metabolic engineering increasingly benefits from systems biology, an approach that combines mathematical modelling techniques with functional-genomics data to build models for biological interpretation and--ultimately--prediction. In this review, the industrial applications of LAB are mapped onto available global, genome-scale metabolic modelling techniques to evaluate the extent to which functional genomics and systems biology can live up to their industrial promise.
Collapse
Affiliation(s)
- Bas Teusink
- Kluyver Centre for Genomics of Industrial Fermentations.
| | | |
Collapse
|
30
|
Abstract
In a short time, plant metabolomics has gone from being just an ambitious concept to being a rapidly growing, valuable technology applied in the stride to gain a more global picture of the molecular organization of multicellular organisms. The combination of improved analytical capabilities with newly designed, dedicated statistical, bioinformatics and data mining strategies, is beginning to broaden the horizons of our understanding of how plants are organized and how metabolism is both controlled but highly flexible. Metabolomics is predicted to play a significant, if not indispensable role in bridging the phenotype-genotype gap and thus in assisting us in our desire for full genome sequence annotation as part of the quest to link gene to function. Plants are a fabulously rich source of diverse functional biochemicals and metabolomics is also already proving valuable in an applied context. By creating unique opportunities for us to interrogate plant systems and characterize their biochemical composition, metabolomics will greatly assist in identifying and defining much of the still unexploited biodiversity available today.
Collapse
Affiliation(s)
- Robert D Hall
- Plant Research International, Business Unit Bioscience, PO Box 16, 6700 AA Wageningen, the Netherlands.
| |
Collapse
|
31
|
Kell DB, Brown M, Davey HM, Dunn WB, Spasic I, Oliver SG. Metabolic footprinting and systems biology: the medium is the message. Nat Rev Microbiol 2005; 3:557-65. [PMID: 15953932 DOI: 10.1038/nrmicro1177] [Citation(s) in RCA: 261] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
One element of classical systems analysis treats a system as a black or grey box, the inner structure and behaviour of which can be analysed and modelled by varying an internal or external condition, probing it from outside and studying the effect of the variation on the external observables. The result is an understanding of the inner make-up and workings of the system. The equivalent of this in biology is to observe what a cell or system excretes under controlled conditions - the 'metabolic footprint' or exometabolome - as this is readily and accurately measurable. Here, we review the principles, experimental approaches and scientific outcomes that have been obtained with this useful and convenient strategy.
Collapse
Affiliation(s)
- Douglas B Kell
- School of Chemistry, University of Manchester, Faraday Building, PO Box 88, Sackville Street, Manchester M60 1QD, UK.
| | | | | | | | | | | |
Collapse
|
32
|
Kell DB. Metabolomics, machine learning and modelling: towards an understanding of the language of cells. Biochem Soc Trans 2005; 33:520-4. [PMID: 15916555 DOI: 10.1042/bst0330520] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In answering the question ‘Systems Biology – will it work?’ (which it self-evidently has already), it is appropriate to highlight advances in philosophy, in new technique development and in novel findings. In terms of philosophy, we see that systems biology involves an iterative interplay between linked activities – for instance, between theory and experiment, between induction and deduction and between measurements of parameters and variables – with more emphasis than has perhaps been common now being focused on the first in each of these pairs. In technique development, we highlight closed loop machine learning and its use in the optimization of scientific instrumentation, and the ability to effect high-quality and quasi-continuous optical images of cells. This leads to many important and novel findings. In the first case, these may involve new biomarkers for disease, whereas in the second case, we have determined that many biological signals may be frequency-rather than amplitude-encoded. This leads to a very different view of how signalling ‘works’ (equations such as that of Michaelis and Menten which use only amplitudes, i.e. concentrations, are inadequate descriptors), lays emphasis on the signal processing network elements that lie ‘downstream’ of what are traditionally considered the signals, and allows one simply to understand how cross-talk may be avoided between pathways which nevertheless use common signalling elements. The language of cells is much richer than we had supposed, and we are now well placed to decode it.
Collapse
Affiliation(s)
- D B Kell
- School of Chemistry, The University of Manchester, Faraday Building, Sackville Street, P.O. Box 88, Manchester M60 1QD, UK.
| |
Collapse
|
33
|
Goodacre R, Vaidyanathan S, Dunn WB, Harrigan GG, Kell DB. Metabolomics by numbers: acquiring and understanding global metabolite data. Trends Biotechnol 2005; 22:245-52. [PMID: 15109811 DOI: 10.1016/j.tibtech.2004.03.007] [Citation(s) in RCA: 781] [Impact Index Per Article: 41.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Royston Goodacre
- Department of Chemistry, UMIST, P.O. Box 88, Sackville Street, Manchester M60 1QD, UK.
| | | | | | | | | |
Collapse
|
34
|
Abstract
Disappointments in replicating initial findings in gene mapping for complex traits are often attributed to small sample sizes and inadequate techniques to determine the threshold value. This is clearly not the whole truth. More fundamental reasons lie in the inherent heterogeneity related to disease, including genetic heterogeneity, differences in allele frequencies, and context-dependency in genetic architecture. There are also other reasons related to the data collection and analysis. Replication may remain a source of frustration unless more emphasis is put on controlling these sources of heterogeneity between studies.
Collapse
Affiliation(s)
- M J Sillanpää
- Rolf Nevanlinna Institute, Department of Mathematics and Statistics, P.O. Box 68, FIN-00014 University of Helsinki, Finland.
| | | |
Collapse
|
35
|
Twist CR, Winson MK, Rowland JJ, Kell DB. Single-nucleotide polymorphism detection using nanomolar nucleotides and single-molecule fluorescence. Anal Biochem 2004; 327:35-44. [PMID: 15033508 DOI: 10.1016/j.ab.2003.12.023] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2003] [Indexed: 11/23/2022]
Abstract
We have exploited three methods for discriminating single-nucleotide polymorphisms (SNPs) by detecting the incorporation or otherwise of labeled dideoxy nucleotides at the end of a primer chain using single-molecule fluorescence detection methods. Good discrimination of incorporated vs free nucleotide may be obtained in a homogeneous assay (without washing steps) via confocal fluorescence correlation spectroscopy or by polarization anisotropy obtained from confocal fluorescence intensity distribution analysis. Moreover, the ratio of the fluorescence intensities on each polarization channel may be used directly to discriminate the nucleotides incorporated. Each measurement took just a few seconds and was done in microliter volumes with nanomolar concentrations of labeled nucleotides. Since the confocal volumes interrogated are approximately 1fL and the reaction volume could easily be lowered to nanoliters, the possibility of SNP analysis with attomoles of reagents opens up a route to very rapid and inexpensive SNP detection. The method was applied with success to the detections of SNPs that are known to occur in the BRCA1 and CFTR genes.
Collapse
Affiliation(s)
- Charles R Twist
- Institute of Biological Sciences, Cledwyn Building, University of Wales, Aberystwyth SY23 3DD, Wales, UK
| | | | | | | |
Collapse
|
36
|
Allen J, Davey HM, Broadhurst D, Rowland JJ, Oliver SG, Kell DB. Discrimination of modes of action of antifungal substances by use of metabolic footprinting. Appl Environ Microbiol 2004; 70:6157-65. [PMID: 15466562 PMCID: PMC522091 DOI: 10.1128/aem.70.10.6157-6165.2004] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2004] [Accepted: 06/22/2004] [Indexed: 11/20/2022] Open
Abstract
Diploid cells of Saccharomyces cerevisiae were grown under controlled conditions with a Bioscreen instrument, which permitted the essentially continuous registration of their growth via optical density measurements. Some cultures were exposed to concentrations of a number of antifungal substances with different targets or modes of action (sterol biosynthesis, respiratory chain, amino acid synthesis, and the uncoupler). Culture supernatants were taken and analyzed for their "metabolic footprints" by using direct-injection mass spectrometry. Discriminant function analysis and hierarchical cluster analysis allowed these antifungal compounds to be distinguished and classified according to their modes of action. Genetic programming, a rule-evolving machine learning strategy, allowed respiratory inhibitors to be discriminated from others by using just two masses. Metabolic footprinting thus represents a rapid, convenient, and information-rich method for classifying the modes of action of antifungal substances.
Collapse
Affiliation(s)
- Jess Allen
- Department of Biological Sciences, University of Wales, Aberystwyth, UK
| | | | | | | | | | | |
Collapse
|
37
|
Johnson HE, Broadhurst D, Kell DB, Theodorou MK, Merry RJ, Griffith GW. High-throughput metabolic fingerprinting of legume silage fermentations via Fourier transform infrared spectroscopy and chemometrics. Appl Environ Microbiol 2004; 70:1583-92. [PMID: 15006782 PMCID: PMC368363 DOI: 10.1128/aem.70.3.1583-1592.2004] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Silage quality is typically assessed by the measurement of several individual parameters, including pH, lactic acid, acetic acid, bacterial numbers, and protein content. The objective of this study was to use a holistic metabolic fingerprinting approach, combining a high-throughput microtiter plate-based fermentation system with Fourier transform infrared (FT-IR) spectroscopy, to obtain a snapshot of the sample metabolome (typically low-molecular-weight compounds) at a given time. The aim was to study the dynamics of red clover or grass silage fermentations in response to various inoculants incorporating lactic acid bacteria (LAB). The hyperspectral multivariate datasets generated by FT-IR spectroscopy are difficult to interpret visually, so chemometrics methods were used to deconvolute the data. Two-phase principal component-discriminant function analysis allowed discrimination between herbage types and different LAB inoculants and modeling of fermentation dynamics over time. Further analysis of FT-IR spectra by the use of genetic algorithms to identify the underlying biochemical differences between treatments revealed that the amide I and amide II regions (wavenumbers of 1,550 to 1,750 cm(-1)) of the spectra were most frequently selected (reflecting changes in proteins and free amino acids) in comparisons between control and inoculant-treated fermentations. This corresponds to the known importance of rapid fermentation for the efficient conservation of forage proteins.
Collapse
Affiliation(s)
- Helen E Johnson
- Institute of Biological Sciences, University of Wales, Aberystwyth, Ceredigion SY23 3DD, United Kingdom.
| | | | | | | | | | | |
Collapse
|
38
|
Allen J, Davey HM, Broadhurst D, Heald JK, Rowland JJ, Oliver SG, Kell DB. High-throughput classification of yeast mutants for functional genomics using metabolic footprinting. Nat Biotechnol 2003; 21:692-6. [PMID: 12740584 DOI: 10.1038/nbt823] [Citation(s) in RCA: 361] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2002] [Accepted: 02/28/2003] [Indexed: 11/08/2022]
Abstract
Many technologies have been developed to help explain the function of genes discovered by systematic genome sequencing. At present, transcriptome and proteome studies dominate large-scale functional analysis strategies. Yet the metabolome, because it is 'downstream', should show greater effects of genetic or physiological changes and thus should be much closer to the phenotype of the organism. We earlier presented a functional analysis strategy that used metabolic fingerprinting to reveal the phenotype of silent mutations of yeast genes. However, this is difficult to scale up for high-throughput screening. Here we present an alternative that has the required throughput (2 min per sample). This 'metabolic footprinting' approach recognizes the significance of 'overflow metabolism' in appropriate media. Measuring intracellular metabolites is time-consuming and subject to technical difficulties caused by the rapid turnover of intracellular metabolites and the need to quench metabolism and separate metabolites from the extracellular space. We therefore focused instead on direct, noninvasive, mass spectrometric monitoring of extracellular metabolites in spent culture medium. Metabolic footprinting can distinguish between different physiological states of wild-type yeast and between yeast single-gene deletion mutants even from related areas of metabolism. By using appropriate clustering and machine learning techniques, the latter based on genetic programming, we show that metabolic footprinting is an effective method to classify 'unknown' mutants by genetic defect.
Collapse
Affiliation(s)
- Jess Allen
- Institute of Biological Sciences, Cledwyn Building, University of Wales, Aberystwyth, Aberystwyth SY23 3DD, UK
| | | | | | | | | | | | | |
Collapse
|