Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Williams AJ, Harland L, Groth P, Pettifer S, Chichester C, Willighagen EL, Evelo CT, Blomberg N, Ecker G, Goble C, Mons B. Open PHACTS: semantic interoperability for drug discovery. Drug Discov Today 2012;17:1188-98. [PMID: 22683805 DOI: 10.1016/j.drudis.2012.05.016] [Citation(s) in RCA: 172] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Revised: 05/18/2012] [Accepted: 05/31/2012] [Indexed: 01/22/2023]

For:	Williams AJ, Harland L, Groth P, Pettifer S, Chichester C, Willighagen EL, Evelo CT, Blomberg N, Ecker G, Goble C, Mons B. Open PHACTS: semantic interoperability for drug discovery. Drug Discov Today 2012;17:1188-98. [PMID: 22683805 DOI: 10.1016/j.drudis.2012.05.016] [Citation(s) in RCA: 172] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Revised: 05/18/2012] [Accepted: 05/31/2012] [Indexed: 01/22/2023]

Collapse

Number

Cited by Other Article(s)

101

Shimizu K, Nuida K, Arai H, Mitsunari S, Attrapadung N, Hamada M, Tsuda K, Hirokawa T, Sakuma J, Hanaoka G, Asai K. Privacy-preserving search for chemical compound databases. BMC Bioinformatics 2015;16 Suppl 18:S6. [PMID: 26678650 PMCID: PMC4704467 DOI: 10.1186/1471-2105-16-s18-s6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

102

Pharmacophore Models and Pharmacophore-Based Virtual Screening: Concepts and Applications Exemplified on Hydroxysteroid Dehydrogenases. Molecules 2015;20:22799-832. [PMID: 26703541 PMCID: PMC6332202 DOI: 10.3390/molecules201219880] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 12/03/2015] [Accepted: 12/09/2015] [Indexed: 01/06/2023] Open

103

Hofmann-Apitius M, Ball G, Gebel S, Bagewadi S, de Bono B, Schneider R, Page M, Kodamullil AT, Younesi E, Ebeling C, Tegnér J, Canard L. Bioinformatics Mining and Modeling Methods for the Identification of Disease Mechanisms in Neurodegenerative Disorders. Int J Mol Sci 2015;16:29179-206. [PMID: 26690135 PMCID: PMC4691095 DOI: 10.3390/ijms161226148] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Revised: 11/10/2015] [Accepted: 11/12/2015] [Indexed: 12/22/2022] Open

Abstract

Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer´s disease or Parkinson´s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategies-data mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).

Collapse

Affiliation(s)

Martin Hofmann-Apitius Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany. Rheinische Friedrich-Wilhelms-Universitaet Bonn, University of Bonn, Bonn 53113, Germany.
Gordon Ball Unit of Computational Medicine, Center for Molecular Medicine, Department of Medicine, and Unit of Clinical Epidemiology, Karolinska University Hospital, Stockholm SE-171 77, Sweden. Science for Life Laboratories, Karolinska Institutet, Stockholm SE-171 77, Sweden.
Stephan Gebel Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, avenue des Hauts-Fourneaux, Esch-sur-Alzette L-4362, Luxembourg.
Shweta Bagewadi Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Bernard de Bono Institute of Health Informatics, University College London, London NW1 2DA, UK. Auckland Bioengineering Institute, University of Auckland, Symmonds Street, Auckland 1142, New Zealand.
Reinhard Schneider Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, avenue des Hauts-Fourneaux, Esch-sur-Alzette L-4362, Luxembourg.
Matt Page Translational Bioinformatics, UCB Pharma, 216 Bath Rd, Slough SL1 3WE, UK.
Alpha Tom Kodamullil Rheinische Friedrich-Wilhelms-Universitaet Bonn, University of Bonn, Bonn 53113, Germany.
Erfan Younesi Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Christian Ebeling Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Jesper Tegnér Unit of Computational Medicine, Center for Molecular Medicine, Department of Medicine, and Unit of Clinical Epidemiology, Karolinska University Hospital, Stockholm SE-171 77, Sweden. Science for Life Laboratories, Karolinska Institutet, Stockholm SE-171 77, Sweden.
Luc Canard Translational Science Unit, SANOFI Recherche & Développement, 1 Avenue Pierre Brossolette, Chilly-Mazarin Cedex 91385, France.

Collapse

104

Baumeister J, Striffler A. Knowledge-driven systems for episodic decision support. Knowl Based Syst 2015. [DOI: 10.1016/j.knosys.2015.08.008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

105

César-Razquin A, Snijder B, Frappier-Brinton T, Isserlin R, Gyimesi G, Bai X, Reithmeier RA, Hepworth D, Hediger MA, Edwards AM, Superti-Furga G. A Call for Systematic Research on Solute Carriers. Cell 2015;162:478-87. [PMID: 26232220 DOI: 10.1016/j.cell.2015.07.022] [Citation(s) in RCA: 392] [Impact Index Per Article: 43.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Indexed: 01/10/2023]

106

Kutmon M, Riutta A, Nunes N, Hanspers K, Willighagen EL, Bohler A, Mélius J, Waagmeester A, Sinha SR, Miller R, Coort SL, Cirillo E, Smeets B, Evelo CT, Pico AR. WikiPathways: capturing the full diversity of pathway knowledge. Nucleic Acids Res 2015;44:D488-94. [PMID: 26481357 PMCID: PMC4702772 DOI: 10.1093/nar/gkv1024] [Citation(s) in RCA: 298] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 09/28/2015] [Indexed: 12/19/2022] Open

Affiliation(s)

Martina Kutmon Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Anders Riutta Gladstone Institutes, San Francisco, California, CA 94158, USA
Nuno Nunes Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Kristina Hanspers Gladstone Institutes, San Francisco, California, CA 94158, USA
Egon L Willighagen Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Anwesha Bohler Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Jonathan Mélius Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Andra Waagmeester Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Micelio, Antwerp, 2180 Antwerp, Belgium
Sravanthi R Sinha Keshav Memorial Institute of Technology, Hyderabad, Telangana 500029, India
Ryan Miller Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Susan L Coort Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Elisa Cirillo Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Bart Smeets Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Chris T Evelo Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
Alexander R Pico Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands

Collapse

107

Garcia-Serna R, Vidal D, Remez N, Mestres J. Large-Scale Predictive Drug Safety: From Structural Alerts to Biological Mechanisms. Chem Res Toxicol 2015;28:1875-87. [PMID: 26360911 DOI: 10.1021/acs.chemrestox.5b00260] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

108

Nicola G, Berthold MR, Hedrick MP, Gilson MK. Connecting proteins with drug-like compounds: Open source drug discovery workflows with BindingDB and KNIME. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015;2015:bav087. [PMID: 26384374 PMCID: PMC4572361 DOI: 10.1093/database/bav087] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Accepted: 08/17/2015] [Indexed: 12/24/2022]

109

Activity, assay and target data curation and quality in the ChEMBL database. J Comput Aided Mol Des 2015. [PMID: 26201396 PMCID: PMC4607714 DOI: 10.1007/s10822-015-9860-5] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

110

Bolton E. Reporting biological assay screening results for maximum impact. DRUG DISCOVERY TODAY. TECHNOLOGIES 2015;14:31-6. [PMID: 26194585 DOI: 10.1016/j.ddtec.2015.03.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2015] [Revised: 03/18/2015] [Accepted: 03/29/2015] [Indexed: 11/19/2022]

111

Fu G, Batchelor C, Dumontier M, Hastings J, Willighagen E, Bolton E. PubChemRDF: towards the semantic annotation of PubChem compound and substance databases. J Cheminform 2015;7:34. [PMID: 26175801 PMCID: PMC4500850 DOI: 10.1186/s13321-015-0084-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 06/22/2015] [Indexed: 12/02/2022] Open

Abstract

Background

PubChem is an open repository for chemical structures, biological activities and biomedical annotations. Semantic Web technologies are emerging as an increasingly important approach to distribute and integrate scientific data. Exposing PubChem data to Semantic Web services may help enable automated data integration and management, as well as facilitate interoperable web applications.

Description

This work, one of a series covering the PubChemRDF project, describes an approach to translate PubChem Substance and Compound information into Resource Description Framework (RDF) format. Basic examples are provided to demonstrate its use. The aim of this effort is to provide two new primary benefits to researchers in a cost-effective manner. Firstly, we aim to remove the inherent limitations of using the web-based resource PubChem by allowing a researcher to use readily available semantic technologies (namely, RDF triple stores and their corresponding SPARQL query engines) to query and analyze PubChem data on local computing resources. Secondly, this work intends to help improve data sharing, analysis, and integration of PubChem data to resources external to NCBI and across scientific domains, by means of the association of PubChem data to existing ontological frameworks, including CHEMical INFormation ontology, Semanticscience Integrated Ontology, and others.

Conclusions

With the goal of semantically describing information available in the PubChem archive, pre-existing ontological frameworks were used, rather than creating new ones. Semantic relationships between compounds and substances, chemical descriptors associated with compounds and substances, interrelationships between chemicals, as well as provenance and attribute metadata of substances are described.

Electronic supplementary material

The online version of this article (doi:10.1186/s13321-015-0084-4) contains supplementary material, which is available to authorized users.

Collapse

112

Hu XL, Li D, Shao L, Dong X, He XP, Chen GR, Chen D. Triazole-Linked Glycolipids Enhance the Susceptibility of MRSA to β-Lactam Antibiotics. ACS Med Chem Lett 2015;6:793-7. [PMID: 26191368 DOI: 10.1021/acsmedchemlett.5b00142] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Accepted: 06/01/2015] [Indexed: 12/12/2022] Open

113

A large-scale crop protection bioassay data set. Sci Data 2015;2:150032. [PMID: 26175909 PMCID: PMC4493826 DOI: 10.1038/sdata.2015.32] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 06/10/2015] [Indexed: 11/09/2022] Open

114

Richter L, Ecker GF. Medicinal chemistry in the era of big data. DRUG DISCOVERY TODAY. TECHNOLOGIES 2015;14:37-41. [PMID: 26194586 DOI: 10.1016/j.ddtec.2015.06.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 06/02/2015] [Accepted: 06/02/2015] [Indexed: 06/04/2023]

115

Hersey A, Chambers J, Bellis L, Patrícia Bento A, Gaulton A, Overington JP. Chemical databases: curation or integration by user-defined equivalence? DRUG DISCOVERY TODAY. TECHNOLOGIES 2015;14:17-24. [PMID: 26194583 PMCID: PMC6294287 DOI: 10.1016/j.ddtec.2015.01.005] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Revised: 01/15/2015] [Accepted: 01/16/2015] [Indexed: 11/30/2022]

116

Finding the right approach to big data-driven medicinal chemistry. Future Med Chem 2015;7:1213-6. [DOI: 10.4155/fmc.15.58] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

117

Lambrinidis G, Vallianatou T, Tsantili-Kakoulidou A. In vitro, in silico and integrated strategies for the estimation of plasma protein binding. A review. Adv Drug Deliv Rev 2015;86:27-45. [PMID: 25819487 DOI: 10.1016/j.addr.2015.03.011] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Revised: 02/11/2015] [Accepted: 03/20/2015] [Indexed: 12/28/2022]

Abstract

Plasma protein binding (PPB) strongly affects drug distribution and pharmacokinetic behavior with consequences in overall pharmacological action. Extended plasma protein binding may be associated with drug safety issues and several adverse effects, like low clearance, low brain penetration, drug-drug interactions, loss of efficacy, while influencing the fate of enantiomers and diastereoisomers by stereoselective binding within the body. Therefore in holistic drug design approaches, where ADME(T) properties are considered in parallel with target affinity, considerable efforts are focused in early estimation of PPB mainly in regard to human serum albumin (HSA), which is the most abundant and most important plasma protein. The second critical serum protein α1-acid glycoprotein (AGP), although often underscored, plays also an important and complicated role in clinical therapy and thus the last years it has been studied thoroughly too. In the present review, after an overview of the principles of HSA and AGP binding as well as the structure topology of the proteins, the current trends and perspectives in the field of PPB predictions are presented and discussed considering both HSA and AGP binding. Since however for the latter protein systematic studies have started only the last years, the review focuses mainly to HSA. One part of the review highlights the challenge to develop rapid techniques for HSA and AGP binding simulation and their performance in assessment of PPB. The second part focuses on in silico approaches to predict HSA and AGP binding, analyzing and evaluating structure-based and ligand-based methods, as well as combination of both methods in the aim to exploit the different information and overcome the limitations of each individual approach. Ligand-based methods use the Quantitative Structure-Activity Relationships (QSAR) methodology to establish quantitate models for the prediction of binding constants from molecular descriptors, while they provide only indirect information on binding mechanism. Efforts for the establishment of global models, automated workflows and web-based platforms for PPB predictions are presented and discussed. Structure-based methods relying on the crystal structures of drug-protein complexes provide detailed information on the underlying mechanism but are usually restricted to specific compounds. They are useful to identify the specific binding site while they may be important in investigating drug-drug interactions, related to PPB. Moreover, chemometrics or structure-based modeling may be supported by experimental data a promising integrated alternative strategy for ADME(T) properties optimization. In the case of PPB the use of molecular modeling combined with bioanalytical techniques is frequently used for the investigation of AGP binding.

Collapse

118

Karapetyan K, Batchelor C, Sharpe D, Tkachenko V, Williams AJ. The Chemical Validation and Standardization Platform (CVSP): large-scale automated validation of chemical structure datasets. J Cheminform 2015;7:30. [PMID: 26155308 PMCID: PMC4494041 DOI: 10.1186/s13321-015-0072-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2014] [Accepted: 04/28/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

There are presently hundreds of online databases hosting millions of chemical compounds and associated data. As a result of the number of cheminformatics software tools that can be used to produce the data, subtle differences between the various cheminformatics platforms, as well as the naivety of the software users, there are a myriad of issues that can exist with chemical structure representations online. In order to help facilitate validation and standardization of chemical structure datasets from various sources we have delivered a freely available internet-based platform to the community for the processing of chemical compound datasets.

RESULTS

The chemical validation and standardization platform (CVSP) both validates and standardizes chemical structure representations according to sets of systematic rules. The chemical validation algorithms detect issues with submitted molecular representations using pre-defined or user-defined dictionary-based molecular patterns that are chemically suspicious or potentially requiring manual review. Each identified issue is assigned one of three levels of severity - Information, Warning, and Error - in order to conveniently inform the user of the need to browse and review subsets of their data. The validation process includes validation of atoms and bonds (e.g., making aware of query atoms and bonds), valences, and stereo. The standard form of submission of collections of data, the SDF file, allows the user to map the data fields to predefined CVSP fields for the purpose of cross-validating associated SMILES and InChIs with the connection tables contained within the SDF file. This platform has been applied to the analysis of a large number of data sets prepared for deposition to our ChemSpider database and in preparation of data for the Open PHACTS project. In this work we review the results of the automated validation of the DrugBank dataset, a popular drug and drug target database utilized by the community, and ChEMBL 17 data set. CVSP web site is located at http://cvsp.chemspider.com/.

CONCLUSION

A platform for the validation and standardization of chemical structure representations of various formats has been developed and made available to the community to assist and encourage the processing of chemical structure files to produce more homogeneous compound representations for exchange and interchange between online databases. While the CVSP platform is designed with flexibility inherent to the rules that can be used for processing the data we have produced a recommended rule set based on our own experiences with the large data sets such as DrugBank, ChEMBL, and data sets from ChemSpider.

Collapse

119

Warr WA. Many InChIs and quite some feat. J Comput Aided Mol Des 2015;29:681-94. [PMID: 26081259 DOI: 10.1007/s10822-015-9854-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 06/10/2015] [Indexed: 12/14/2022]

120

Alnazzawi N, Thompson P, Batista-Navarro R, Ananiadou S. Using text mining techniques to extract phenotypic information from the PhenoCHF corpus. BMC Med Inform Decis Mak 2015;15 Suppl 2:S3. [PMID: 26099853 PMCID: PMC4474585 DOI: 10.1186/1472-6947-15-s2-s3] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Abstract

Background

Phenotypic information locked away in unstructured narrative text presents significant barriers to information accessibility, both for clinical practitioners and for computerised applications used for clinical research purposes. Text mining (TM) techniques have previously been applied successfully to extract different types of information from text in the biomedical domain. They have the potential to be extended to allow the extraction of information relating to phenotypes from free text.

Methods

To stimulate the development of TM systems that are able to extract phenotypic information from text, we have created a new corpus (PhenoCHF) that is annotated by domain experts with several types of phenotypic information relating to congestive heart failure. To ensure that systems developed using the corpus are robust to multiple text types, it integrates text from heterogeneous sources, i.e., electronic health records (EHRs) and scientific articles from the literature. We have developed several different phenotype extraction methods to demonstrate the utility of the corpus, and tested these methods on a further corpus, i.e., ShARe/CLEF 2013.

Results

Evaluation of our automated methods showed that PhenoCHF can facilitate the training of reliable phenotype extraction systems, which are robust to variations in text type. These results have been reinforced by evaluating our trained systems on the ShARe/CLEF corpus, which contains clinical records of various types. Like other studies within the biomedical domain, we found that solutions based on conditional random fields produced the best results, when coupled with a rich feature set.

Conclusions

PhenoCHF is the first annotated corpus aimed at encoding detailed phenotypic information. The unique heterogeneous composition of the corpus has been shown to be advantageous in the training of systems that can accurately extract phenotypic information from a range of different text types. Although the scope of our annotation is currently limited to a single disease, the promising results achieved can stimulate further work into the extraction of phenotypic information for other diseases. The PhenoCHF annotation guidelines and annotations are publicly available at https://code.google.com/p/phenochf-corpus.

Collapse

121

Ernst P, Siu A, Weikum G. KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences. BMC Bioinformatics 2015;16:157. [PMID: 25971816 PMCID: PMC4448285 DOI: 10.1186/s12859-015-0549-5] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Accepted: 03/25/2015] [Indexed: 12/16/2022] Open

Abstract

BACKGROUND

Biomedical knowledge bases (KB's) have become important assets in life sciences. Prior work on KB construction has three major limitations. First, most biomedical KBs are manually built and curated, and cannot keep up with the rate at which new findings are published. Second, for automatic information extraction (IE), the text genre of choice has been scientific publications, neglecting sources like health portals and online communities. Third, most prior work on IE has focused on the molecular level or chemogenomics only, like protein-protein interactions or gene-drug relationships, or solely address highly specific topics such as drug effects.

RESULTS

We address these three limitations by a versatile and scalable approach to automatic KB construction. Using a small number of seed facts for distant supervision of pattern-based extraction, we harvest a huge number of facts in an automated manner without requiring any explicit training. We extend previous techniques for pattern-based IE with confidence statistics, and we combine this recall-oriented stage with logical reasoning for consistency constraint checking to achieve high precision. To our knowledge, this is the first method that uses consistency checking for biomedical relations. Our approach can be easily extended to incorporate additional relations and constraints. We ran extensive experiments not only for scientific publications, but also for encyclopedic health portals and online communities, creating different KB's based on different configurations. We assess the size and quality of each KB, in terms of number of facts and precision. The best configured KB, KnowLife, contains more than 500,000 facts at a precision of 93% for 13 relations covering genes, organs, diseases, symptoms, treatments, as well as environmental and lifestyle risk factors.

CONCLUSION

KnowLife is a large knowledge base for health and life sciences, automatically constructed from different Web sources. As a unique feature, KnowLife is harvested from different text genres such as scientific publications, health portals, and online communities. Thus, it has the potential to serve as one-stop portal for a wide range of relations and use cases. To showcase the breadth and usefulness, we make the KnowLife KB accessible through the health portal (http://knowlife.mpi-inf.mpg.de).

Collapse

122

Leroux H, Lefort L. Semantic enrichment of longitudinal clinical study data using the CDISC standards and the semantic statistics vocabularies. J Biomed Semantics 2015;6:16. [PMID: 25973166 PMCID: PMC4429421 DOI: 10.1186/s13326-015-0012-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Accepted: 03/05/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

There is an increasing recognition of the need for the data capture phase of clinical studies to be improved and for more effective sharing of clinical data. The Health Care and Life Sciences community has embraced semantic technologies to facilitate the integration of health data from electronic health records, clinical studies and pharmaceutical research. This paper explores the integration of clinical study data exchange standards and semantic statistic vocabularies to deliver clinical data as linked data in a format that is easier to enrich with links to complementary data sources and consume by a broad user base.

METHODS

We propose a Linked Clinical Data Cube (LCDC), which combines the strength of the RDF Data Cube and DDI-RDF vocabulary to enrich clinical data based on the CDISC standards. The CDISC standards provide the mechanisms for the data to be standardised, made more accessible and accountable whereas the RDF Data Cube and DDI-RDF vocabularies provide novel approaches to managing large volumes of heterogeneous linked data resources.

RESULTS

We validate our approach using a large-scale longitudinal clinical study into neurodegenerative diseases. This dataset, comprising more than 1600 variables clustered in 25 different sub-domains, has been fully converted into RDF forming one main data cube and one specialised cube for each sub-domain. One sub-domain, the Medications specialised cube, has been linked to relevant external vocabularies, such as the Australian Medicines Terminology and the ATC DDD taxonomy and DrugBank terminology. This provides new dimensions on which to query the data that promote the exploration of drug-drug and drug-disease interactions.

CONCLUSIONS

This implementation highlights the effectiveness of the association of the semantic statistics vocabularies for the publication of large heterogeneous data sets as linked data and the integration of the semantic statistics vocabularies with the CDISC standards. In particular, it demonstrates the potential of the two vocabularies in overcoming the monolithic nature of the underlying model and improving the navigation and querying of the data from multiple angles to support richer data analysis of clinical study data. The forecasted benefits are more efficient use of clinicians' time and the potential to facilitate cross-study analysis.

Collapse

123

Drug discovery FAQs: workflows for answering multidomain drug discovery questions. Drug Discov Today 2015;20:399-405. [DOI: 10.1016/j.drudis.2014.11.006] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Revised: 10/22/2014] [Accepted: 11/13/2014] [Indexed: 12/26/2022]

124

Clark AM, Williams AJ, Ekins S. Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data. J Cheminform 2015;7:9. [PMID: 25798198 PMCID: PMC4369291 DOI: 10.1186/s13321-015-0057-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Accepted: 02/23/2015] [Indexed: 11/12/2022] Open

Lab notebook entries must target both visualisation by scientists and use by machine learning algorithms

Alex M Clark Molecular Materials Informatics, 1900 St. Jacques #302, Montreal, H3J 2S1, QC Canada
Antony J Williams Royal Society of Chemistry, 904 Tamaras Circle, Wake Forest, NC 27587 USA
Sean Ekins Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC 27526 USA ; Collaborative Drug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame, CA 94010 USA

Collapse

125

Hastings J, Jeliazkova N, Owen G, Tsiliki G, Munteanu CR, Steinbeck C, Willighagen E. eNanoMapper: harnessing ontologies to enable data integration for nanomaterial risk assessment. J Biomed Semantics 2015;6:10. [PMID: 25815161 PMCID: PMC4374589 DOI: 10.1186/s13326-015-0005-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Accepted: 02/27/2015] [Indexed: 11/18/2022] Open

126

Carrió P, López O, Sanz F, Pastor M. eTOXlab, an open source modeling framework for implementing predictive models in production environments. J Cheminform 2015;7:8. [PMID: 25774224 PMCID: PMC4358905 DOI: 10.1186/s13321-015-0058-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2014] [Accepted: 02/24/2015] [Indexed: 11/10/2022] Open

127

Nantasenamat C, Prachayasittikul V. Maximizing computational tools for successful drug discovery. Expert Opin Drug Discov 2015;10:321-9. [PMID: 25693813 DOI: 10.1517/17460441.2015.1016497] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

128

Eijssen L, Evelo C, Kok R, Mons B, Hooft R. The Dutch Techcentre for Life Sciences: Enabling data-intensive life science research in the Netherlands. F1000Res 2015;4:33. [PMID: 26913186 PMCID: PMC4743138 DOI: 10.12688/f1000research.6009.2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/04/2016] [Indexed: 11/20/2022] Open

129

Application of text mining in the biomedical domain. Methods 2015;74:97-106. [PMID: 25641519 DOI: 10.1016/j.ymeth.2015.01.015] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Revised: 01/21/2015] [Accepted: 01/23/2015] [Indexed: 12/12/2022] Open

130

Hoehndorf R, Slater L, Schofield PN, Gkoutos GV. Aber-OWL: a framework for ontology-based data access in biology. BMC Bioinformatics 2015;16:26. [PMID: 25627673 PMCID: PMC4384359 DOI: 10.1186/s12859-015-0456-9] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Accepted: 01/09/2015] [Indexed: 11/10/2022] Open

131

Ontology-based data integration between clinical and research systems. PLoS One 2015;10:e0116656. [PMID: 25588043 PMCID: PMC4294641 DOI: 10.1371/journal.pone.0116656] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 12/06/2014] [Indexed: 12/17/2022] Open

132

Moghadam BT, Alvarsson J, Holm M, Eklund M, Carlsson L, Spjuth O. Scaling predictive modeling in drug development with cloud computing. J Chem Inf Model 2015;55:19-25. [PMID: 25493610 DOI: 10.1021/ci500580y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

133

Machado CM, Rebholz-Schuhmann D, Freitas AT, Couto FM. The semantic web in translational medicine: current applications and future directions. Brief Bioinform 2015;16:89-103. [PMID: 24197933 PMCID: PMC4293377 DOI: 10.1093/bib/bbt079] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Accepted: 10/08/2013] [Indexed: 11/14/2022] Open

134

Ratnam J, Zdrazil B, Digles D, Cuadrado-Rodriguez E, Neefs JM, Tipney H, Siebes R, Waagmeester A, Bradley G, Chau CH, Richter L, Brea J, Evelo CT, Jacoby E, Senger S, Loza MI, Ecker GF, Chichester C. The application of the open pharmacological concepts triple store (open PHACTS) to support drug discovery research. PLoS One 2014;9:e115460. [PMID: 25522365 PMCID: PMC4270790 DOI: 10.1371/journal.pone.0115460] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 10/30/2014] [Indexed: 01/08/2023] Open

135

Rinaldi F, Clematide S, Marques H, Ellendorff T, Romacker M, Rodriguez-Esteban R. OntoGene web services for biomedical text mining. BMC Bioinformatics 2014;15 Suppl 14:S6. [PMID: 25472638 PMCID: PMC4255746 DOI: 10.1186/1471-2105-15-s14-s6] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open

136

Wassermann AM, Lounkine E, Davies JW, Glick M, Camargo LM. The opportunities of mining historical and collective data in drug discovery. Drug Discov Today 2014;20:422-34. [PMID: 25463034 DOI: 10.1016/j.drudis.2014.11.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Revised: 10/21/2014] [Accepted: 11/10/2014] [Indexed: 12/26/2022]

137

Exploiting open data: a new era in pharmacoinformatics. Future Med Chem 2014;6:503-14. [PMID: 24649954 DOI: 10.4155/fmc.14.13] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

138

The eTOX data-sharing project to advance in silico drug-induced toxicity prediction. Int J Mol Sci 2014;15:21136-54. [PMID: 25405742 PMCID: PMC4264217 DOI: 10.3390/ijms151121136] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2014] [Accepted: 10/20/2014] [Indexed: 11/16/2022] Open

139

Hu Y, Bajorath J. Influence of search parameters and criteria on compound selection, promiscuity, and pan assay interference characteristics. J Chem Inf Model 2014;54:3056-66. [PMID: 25329977 DOI: 10.1021/ci5005509] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

140

Chen B, Wang H, Ding Y, Wild D. Semantic Breakthrough in Drug Discovery. ACTA ACUST UNITED AC 2014. [DOI: 10.2200/s00600ed1v01y201409web009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

141

Vuorinen A, Schuster D. Methods for generating and applying pharmacophore models as virtual screening filters and for bioactivity profiling. Methods 2014;71:113-34. [PMID: 25461773 DOI: 10.1016/j.ymeth.2014.10.013] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2014] [Revised: 09/29/2014] [Accepted: 10/14/2014] [Indexed: 01/03/2023] Open

142

Ekins S, Clark AM, Swamidass SJ, Litterman N, Williams AJ. Bigger data, collaborative tools and the future of predictive drug discovery. J Comput Aided Mol Des 2014;28:997-1008. [PMID: 24943138 PMCID: PMC4198464 DOI: 10.1007/s10822-014-9762-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2014] [Accepted: 06/09/2014] [Indexed: 12/31/2022]

143

Butler WE, Atai N, Carter B, Hochberg F. Informatic system for a global tissue-fluid biorepository with a graph theory-oriented graphical user interface. J Extracell Vesicles 2014;3:24247. [PMID: 25317275 PMCID: PMC4172698 DOI: 10.3402/jev.v3.24247] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Revised: 06/13/2014] [Accepted: 06/15/2014] [Indexed: 12/12/2022] Open

144

Hettne KM, Dharuri H, Zhao J, Wolstencroft K, Belhajjame K, Soiland-Reyes S, Mina E, Thompson M, Cruickshank D, Verdes-Montenegro L, Garrido J, de Roure D, Corcho O, Klyne G, van Schouwen R, ‘t Hoen PAC, Bechhofer S, Goble C, Roos M. Structuring research methods and data with the research object model: genomics workflows as a case study. J Biomed Semantics 2014;5:41. [PMID: 25276335 PMCID: PMC4177597 DOI: 10.1186/2041-1480-5-41] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Accepted: 07/29/2014] [Indexed: 12/24/2022] Open

145

Chambers J, Davies M, Gaulton A, Papadatos G, Hersey A, Overington JP. UniChem: extension of InChI-based compound mapping to salt, connectivity and stereochemistry layers. J Cheminform 2014;6:43. [PMID: 25221628 PMCID: PMC4158273 DOI: 10.1186/s13321-014-0043-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Accepted: 09/01/2014] [Indexed: 11/10/2022] Open

146

Korb O, Finn PW, Jones G. The cloud and other new computational methods to improve molecular modelling. Expert Opin Drug Discov 2014;9:1121-31. [DOI: 10.1517/17460441.2014.941800] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

147

Clark AM, Bunin BA, Litterman NK, Schürer SC, Visser U. Fast and accurate semantic annotation of bioassays exploiting a hybrid of machine learning and user confirmation. PeerJ 2014;2:e524. [PMID: 25165633 PMCID: PMC4137659 DOI: 10.7717/peerj.524] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 07/27/2014] [Indexed: 11/29/2022] Open

148

The Royal Society of Chemistry and the delivery of chemistry data repositories for the community. J Comput Aided Mol Des 2014;28:1023-30. [DOI: 10.1007/s10822-014-9784-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 07/25/2014] [Indexed: 10/24/2022]

149

Micropublications: a semantic model for claims, evidence, arguments and annotations in biomedical communications. J Biomed Semantics 2014;5:28. [PMID: 26261718 PMCID: PMC4530550 DOI: 10.1186/2041-1480-5-28] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 06/16/2014] [Indexed: 11/10/2022] Open

150

Zdrazil B, Chichester C, Zander Balderud L, Engkvist O, Gaulton A, Overington JP. Transporter assays and assay ontologies: useful tools for drug discovery. DRUG DISCOVERY TODAY. TECHNOLOGIES 2014;12:e47-e54. [PMID: 25027375 DOI: 10.1016/j.ddtec.2014.03.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Janna Hastings European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
Nina Jeliazkova IdeaConsult Ltd., 4.A.Kanchev str., Sofia, Bulgaria
Gareth Owen European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
Georgia Tsiliki National Technical University of Athens (NTUA), Athens, Greece
Cristian R Munteanu Computer Science Faculty, University of A Coruña, A Coruña, Spain ; Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, Netherlands
Christoph Steinbeck European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
Egon Willighagen Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, Netherlands

Pau Carrió Research Programme on Biomedical Informatics (GRIB), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, IMIM (Hospital del Mar Medical Research Institute), Dr. Aiguader 88, E-08003 Barcelona, Spain
Oriol López Research Programme on Biomedical Informatics (GRIB), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, IMIM (Hospital del Mar Medical Research Institute), Dr. Aiguader 88, E-08003 Barcelona, Spain
Ferran Sanz Research Programme on Biomedical Informatics (GRIB), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, IMIM (Hospital del Mar Medical Research Institute), Dr. Aiguader 88, E-08003 Barcelona, Spain
Manuel Pastor Research Programme on Biomedical Informatics (GRIB), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, IMIM (Hospital del Mar Medical Research Institute), Dr. Aiguader 88, E-08003 Barcelona, Spain

Lars Eijssen Department of Bioinformatics - BiGCaT, Maastricht University, 6229 ER Maastricht, Netherlands
Chris Evelo Department of Bioinformatics - BiGCaT, Maastricht University, 6229 ER Maastricht, Netherlands
Ruben Kok Dutch Techcentre for Life Sciences (Foundation office), Catharijnesingel 54, 3511 GC Utrecht, Netherlands
Barend Mons Dutch Techcentre for Life Sciences (Foundation office), Catharijnesingel 54, 3511 GC Utrecht, Netherlands; Netherlands eScience Center, Science Park 140, 1098 XG Amsterdam, Netherlands; Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, Netherlands
Rob Hooft Dutch Techcentre for Life Sciences (Foundation office), Catharijnesingel 54, 3511 GC Utrecht, Netherlands; Netherlands eScience Center, Science Park 140, 1098 XG Amsterdam, Netherlands

Robert Hoehndorf Computational Bioscience Research Center, King Abdullah University of Science and Technology, 4700 KAUST, Thuwal, 23955-6900, Saudi Arabia. .,Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, 4700 KAUST, Thuwal, 23955-6900, Saudi Arabia.
Luke Slater Computational Bioscience Research Center, King Abdullah University of Science and Technology, 4700 KAUST, Thuwal, 23955-6900, Saudi Arabia. .,Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, 4700 KAUST, Thuwal, 23955-6900, Saudi Arabia. .,Department of Computer Science, Aberystwyth University, Llandinam Building, Aberystwyth, SY23 3DB, UK.
Paul N Schofield Department of Physiology, Development & Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3EG, UK.
Georgios V Gkoutos Department of Computer Science, Aberystwyth University, Llandinam Building, Aberystwyth, SY23 3DB, UK.

Behrooz Torabi Moghadam Department of Pharmaceutical Biosciences, ‡Department of Information Technology, and §Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University , SE-751 24 Uppsala, Sweden
Jonathan Alvarsson
Marcus Holm
Martin Eklund
Lars Carlsson
Ola Spjuth

Chanin Nantasenamat Mahidol University, Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology , 10700 Bangkok , Thailand
Virapong Prachayasittikul

Catia M. Machado *Corresponding author. Catia M. Machado, Departamento de Informática, Faculdade de Ciências, Universidade de Lisboa, Portugal and Instituto de Engenharia de Sistemas e Computadores - Investigação e Desenvolvimento, Universidade de Lisboa, Portugal. E-mail:
Dietrich Rebholz-Schuhmann
Ana T. Freitas
Francisco M. Couto

Joseline Ratnam Universidade de Santiago de Compostela, Grupo BioFarma-USEF, Departamento de Farmacología, Campus Universitario Sur s/n, 15782 Santiago de Compostela, Spain * E-mail:
Barbara Zdrazil University of Vienna, Department of Pharmaceutical Chemistry, Althanstrasse 14, 1090 Vienna, Austria
Daniela Digles University of Vienna, Department of Pharmaceutical Chemistry, Althanstrasse 14, 1090 Vienna, Austria
Emiliano Cuadrado-Rodriguez Universidade de Santiago de Compostela, Grupo BioFarma-USEF, Departamento de Farmacología, Campus Universitario Sur s/n, 15782 Santiago de Compostela, Spain
Jean-Marc Neefs Janssen Research & Development, Turnhoutseweg 30, Beerse, Belgium
Hannah Tipney GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire, SG1 2NY, United Kingdom
Ronald Siebes Vrije Universiteit, Faculty of Sciences, division of Math. and Computer Science, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands
Andra Waagmeester Department of Bioinformatics – BiGCaT, Maastricht University, Maastricht, The Netherlands
Glyn Bradley GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire, SG1 2NY, United Kingdom
Chau Han Chau GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire, SG1 2NY, United Kingdom
Lars Richter University of Vienna, Department of Pharmaceutical Chemistry, Althanstrasse 14, 1090 Vienna, Austria
Jose Brea Universidade de Santiago de Compostela, Grupo BioFarma-USEF, Departamento de Farmacología, Campus Universitario Sur s/n, 15782 Santiago de Compostela, Spain
Chris T. Evelo Department of Bioinformatics – BiGCaT, Maastricht University, Maastricht, The Netherlands
Edgar Jacoby Janssen Research & Development, Turnhoutseweg 30, Beerse, Belgium
Stefan Senger GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire, SG1 2NY, United Kingdom
Maria Isabel Loza Universidade de Santiago de Compostela, Grupo BioFarma-USEF, Departamento de Farmacología, Campus Universitario Sur s/n, 15782 Santiago de Compostela, Spain
Gerhard F. Ecker University of Vienna, Department of Pharmaceutical Chemistry, Althanstrasse 14, 1090 Vienna, Austria
Christine Chichester Swiss Institute of Bioinformatics, CALIPHO Group, CMU – Rue Michel-Servet 1, 1211 Geneva 4, Switzerland

Anne Mai Wassermann In Silico Lead Discovery, Novartis Institutes for Biomedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA.
Eugen Lounkine In Silico Lead Discovery, Novartis Institutes for Biomedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA
John W Davies In Silico Lead Discovery, Novartis Institutes for Biomedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA
Meir Glick In Silico Lead Discovery, Novartis Institutes for Biomedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA
L Miguel Camargo In Silico Lead Discovery, Novartis Institutes for Biomedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA.

Ye Hu Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität , Dahlmannstr. 2, D-53113 Bonn, Germany
Jürgen Bajorath

Anna Vuorinen Institute of Pharmacy/Pharmaceutical Chemistry and Center for Molecular Biosciences Innsbruck - CMBI, University of Innsbruck, Innrain 80/82, 6020 Innsbruck, Austria
Daniela Schuster Institute of Pharmacy/Pharmaceutical Chemistry and Center for Molecular Biosciences Innsbruck - CMBI, University of Innsbruck, Innrain 80/82, 6020 Innsbruck, Austria.

Sean Ekins Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC, 27526, USA,
Alex M Clark
S Joshua Swamidass
Nadia Litterman
Antony J Williams

William E. Butler Neurosurgical Service, Massachusetts General Hospital, Boston, MA, USA Massachusetts General Hospital, Boston, MA, USA
Nadia Atai Neurosurgical Service, Massachusetts General Hospital, Boston, MA, USA Massachusetts General Hospital, Boston, MA, USA Department of Cell Biology and Histology, University of Amsterdam, Amsterdam, The Netherlands
Bob Carter Department of Neurosurgery, University of San Diego Medical School, San Diego, CA, USA
Fred Hochberg Massachusetts General Hospital, Boston, MA, USA

Kristina M Hettne />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Harish Dharuri />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Jun Zhao />Department of Zoology, University of Oxford, Oxford, UK
Katherine Wolstencroft />School of Computer Science, University of Manchester, Manchester, UK />Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
Khalid Belhajjame />School of Computer Science, University of Manchester, Manchester, UK
Stian Soiland-Reyes />School of Computer Science, University of Manchester, Manchester, UK
Eleni Mina />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Mark Thompson />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Don Cruickshank />Department of Zoology, University of Oxford, Oxford, UK
Lourdes Verdes-Montenegro />Instituto de Astrofísica de Andalucía, Granada, Spain
Julian Garrido />Instituto de Astrofísica de Andalucía, Granada, Spain
David de Roure />Department of Zoology, University of Oxford, Oxford, UK
Oscar Corcho />Ontology Engineering Group, Universidad Politécnica de Madrid, Madrid, Spain
Graham Klyne />Department of Zoology, University of Oxford, Oxford, UK
Reinout van Schouwen />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Peter A C ‘t Hoen />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Sean Bechhofer />School of Computer Science, University of Manchester, Manchester, UK
Carole Goble />School of Computer Science, University of Manchester, Manchester, UK
Marco Roos />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands

Jon Chambers ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK
Mark Davies ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK
Anna Gaulton ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK
George Papadatos ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK
Anne Hersey ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK
John P Overington ChEMBL, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton Cambridge, CB10 1SD UK

Alex M Clark Collaborative Drug Discovery, Inc. , Burlingame, CA , USA
Barry A Bunin Collaborative Drug Discovery, Inc. , Burlingame, CA , USA
Nadia K Litterman Collaborative Drug Discovery, Inc. , Burlingame, CA , USA
Stephan C Schürer Center for Computational Science, University of Miami , Miami, FL , USA
Ubbo Visser Center for Computational Science, University of Miami , Miami, FL , USA

Barbara Zdrazil University of Vienna, Division of Drug Design and Medicinal Chemistry, Department of Pharmaceutical Chemistry, Pharmacoinformatics Research Group, Althanstrasse 14, A-1090 Vienna, Austria
Christine Chichester Swiss Institute of Bioinformatics, CALIPHO Group, CMU - Rue Michel-Servet 1, 1211 Geneva 4, Switzerland
Linda Zander Balderud Discovery Sciences, Chemistry Innovation Center, AstraZeneca R&D, Mölndal, Sweden
Ola Engkvist Discovery Sciences, Chemistry Innovation Center, AstraZeneca R&D, Mölndal, Sweden
Anna Gaulton European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
John P Overington European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom