Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Katayama T, Wilkinson MD, Micklem G, Kawashima S, Yamaguchi A, Nakao M, Yamamoto Y, Okamoto S, Oouchida K, Chun HW, Aerts J, Afzal H, Antezana E, Arakawa K, Aranda B, Belleau F, Bolleman J, Bonnal RJ, Chapman B, Cock PJ, Eriksson T, Gordon PM, Goto N, Hayashi K, Horn H, Ishiwata R, Kaminuma E, Kasprzyk A, Kawaji H, Kido N, Kim YJ, Kinjo AR, Konishi F, Kwon KH, Labarga A, Lamprecht AL, Lin Y, Lindenbaum P, McCarthy L, Morita H, Murakami K, Nagao K, Nishida K, Nishimura K, Nishizawa T, Ogishima S, Ono K, Oshita K, Park KJ, Prins P, Saito TL, Samwald M, Satagopam VP, Shigemoto Y, Smith R, Splendiani A, Sugawara H, Taylor J, Vos RA, Withers D, Yamasaki C, Zmasek CM, Kawamoto S, Okubo K, Asai K, Takagi T. The 3rd DBCLS BioHackathon: improving life science data integration with Semantic Web technologies. J Biomed Semantics 2013;4:6. [PMID: 23398680 DOI: 10.1186/2041-1480-4-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 02/05/2013] [Indexed: 01/20/2023] Open

For:	Katayama T, Wilkinson MD, Micklem G, Kawashima S, Yamaguchi A, Nakao M, Yamamoto Y, Okamoto S, Oouchida K, Chun HW, Aerts J, Afzal H, Antezana E, Arakawa K, Aranda B, Belleau F, Bolleman J, Bonnal RJ, Chapman B, Cock PJ, Eriksson T, Gordon PM, Goto N, Hayashi K, Horn H, Ishiwata R, Kaminuma E, Kasprzyk A, Kawaji H, Kido N, Kim YJ, Kinjo AR, Konishi F, Kwon KH, Labarga A, Lamprecht AL, Lin Y, Lindenbaum P, McCarthy L, Morita H, Murakami K, Nagao K, Nishida K, Nishimura K, Nishizawa T, Ogishima S, Ono K, Oshita K, Park KJ, Prins P, Saito TL, Samwald M, Satagopam VP, Shigemoto Y, Smith R, Splendiani A, Sugawara H, Taylor J, Vos RA, Withers D, Yamasaki C, Zmasek CM, Kawamoto S, Okubo K, Asai K, Takagi T. The 3rd DBCLS BioHackathon: improving life science data integration with Semantic Web technologies. J Biomed Semantics 2013;4:6. [PMID: 23398680 DOI: 10.1186/2041-1480-4-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 02/05/2013] [Indexed: 01/20/2023] Open

Number

Cited by Other Article(s)

Vos RA, Katayama T, Mishima H, Kawano S, Kawashima S, Kim JD, Moriya Y, Tokimatsu T, Yamaguchi A, Yamamoto Y, Wu H, Amstutz P, Antezana E, Aoki NP, Arakawa K, Bolleman JT, Bolton E, Bonnal RJP, Bono H, Burger K, Chiba H, Cohen KB, Deutsch EW, Fernández-Breis JT, Fu G, Fujisawa T, Fukushima A, García A, Goto N, Groza T, Hercus C, Hoehndorf R, Itaya K, Juty N, Kawashima T, Kim JH, Kinjo AR, Kotera M, Kozaki K, Kumagai S, Kushida T, Lütteke T, Matsubara M, Miyamoto J, Mohsen A, Mori H, Naito Y, Nakazato T, Nguyen-Xuan J, Nishida K, Nishida N, Nishide H, Ogishima S, Ohta T, Okuda S, Paten B, Perret JL, Prathipati P, Prins P, Queralt-Rosinach N, Shinmachi D, Suzuki S, Tabata T, Takatsuki T, Taylor K, Thompson M, Uchiyama I, Vieira B, Wei CH, Wilkinson M, Yamada I, Yamanaka R, Yoshitake K, Yoshizawa AC, Dumontier M, Kosaki K, Takagi T. BioHackathon 2015: Semantics of data for life sciences and reproducible research. F1000Res 2020;9:136. [PMID: 32308977 PMCID: PMC7141167 DOI: 10.12688/f1000research.18236.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/05/2020] [Indexed: 01/08/2023] Open

Affiliation(s)

Rutger A. Vos Institute of Biology Leiden, Leiden University, Leiden, The Netherlands Naturalis Biodiversity Center, Leiden, The Netherlands
Toshiaki Katayama Database Center for Life Science, Tokyo, Japan
Hiroyuki Mishima Department of Human Genetics, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
Shin Kawano Database Center for Life Science, Tokyo, Japan
Shuichi Kawashima Database Center for Life Science, Tokyo, Japan
Jin-Dong Kim Database Center for Life Science, Tokyo, Japan
Yuki Moriya Database Center for Life Science, Tokyo, Japan
Toshiaki Tokimatsu DDBJ Center, National Institute of Genetics, Mishima, Japan
Atsuko Yamaguchi Database Center for Life Science, Tokyo, Japan
Yasunori Yamamoto Database Center for Life Science, Tokyo, Japan
Hongyan Wu Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Peter Amstutz Curoverse, Somerville, USA
Erick Antezana Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
Nobuyuki P. Aoki Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Kazuharu Arakawa Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Jerven T. Bolleman SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Lausanne, Switzerland
Evan Bolton National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Raoul J. P. Bonnal Istituto Nazionale Genetica Molecolare, Romeo ed Enrica Invernizzi, Milan, Italy
Hidemasa Bono Database Center for Life Science, Tokyo, Japan
Kees Burger Dutch Techcentre for Life Sciences, Utrecht, The Netherlands
Hirokazu Chiba National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Kevin B. Cohen Computational Bioscience Program, University of Colorado School of Medicine, Denver, USA Université Paris-Saclay, LIMSI, CNRS, Paris, France
Eric W. Deutsch Institute for Systems Biology, Seattle, USA
Jesualdo T. Fernández-Breis Universidad de Murcia, IMIB-Arrixaca, Murcia, Spain
Gang Fu National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Takatomo Fujisawa National Institute of Genetics, Mishima, Japan
Atsushi Fukushima RIKEN Center for Sustainable Resource Science, Yokohama, Japan
Alexander García Polytechnic University of Madrid, Madrid, Spain
Naohisa Goto Research Institute for Microbial Diseases, Osaka University, Osaka, Japan
Tudor Groza St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Darlinghurst, Australia Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, Australia
Colin Hercus Novocraft Technologies Sdn. Bhd., Selangor, Malaysia
Robert Hoehndorf Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Kotone Itaya Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Nick Juty European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Takeshi Kawashima National Institute of Genetics, Mishima, Japan
Jee-Hyub Kim European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Akira R. Kinjo Institute for Protein Research, Osaka University, Osaka, Japan
Masaaki Kotera School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Kouji Kozaki The Institute of Scientific and Industrial Research, Osaka University, Osaka, Japan
Sadahiro Kumagai Hitachi Ltd., Tokyo, Japan
Tatsuya Kushida National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan
Thomas Lütteke Institute of Veterinary Physiology and Biochemistry, Justus-Liebig University Giessen, Giessen, Germany Gesellschaft für innovative Personalwirtschaftssysteme mbH (GIP GmbH), Offenbach, Germany
Masaaki Matsubara The Noguchi Institute, Tokyo, Japan
Joe Miyamoto National Cancer Center Japan, Tokyo, Japan
Attayeb Mohsen National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Hiroshi Mori Center for Information Biology, National Institute of Genetics, Mishima, Japan
Yuki Naito Database Center for Life Science, Tokyo, Japan
Takeru Nakazato Database Center for Life Science, Tokyo, Japan
Jeremy Nguyen-Xuan Lawrence Berkeley National Laboratory, Berkeley, USA
Kozo Nishida RIKEN Quantitative Biology Center, Osaka, Japan
Naoki Nishida Department of Systems Science, Osaka University, Osaka, Japan
Hiroyo Nishide National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Soichi Ogishima Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Tazro Ohta Database Center for Life Science, Tokyo, Japan
Shujiro Okuda Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
Benedict Paten UC Santa Cruz Genomics Institute, University of California, Santa Cruz, USA
Jean-Luc Perret INVENesis, Neuchâtel, Switzerland
Philip Prathipati National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Pjotr Prins University Medical Center Utrecht, Utrecht, The Netherlands University of Tennessee Health Science Center, Memphis, USA
Núria Queralt-Rosinach Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
Daisuke Shinmachi Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Shinya Suzuki School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Tsuyosi Tabata Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Terue Takatsuki RIKEN BioResource Center, Ibaraki, Japan
Kieron Taylor European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Mark Thompson Leiden University Medical Center, Leiden, The Netherlands
Ikuo Uchiyama National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Bruno Vieira WurmLab, School of Biological & Chemical Sciences, Queen Mary University of London, London, UK
Chih-Hsuan Wei National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Mark Wilkinson Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
Issaku Yamada The Noguchi Institute, Tokyo, Japan
Ryota Yamanaka Oracle Corporation, Tokyo, Japan
Kazutoshi Yoshitake Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
Akiyasu C. Yoshizawa Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Michel Dumontier Institute of Data Science, Maastricht University, Maastricht, The Netherlands
Kenjiro Kosaki Center for Medical Genetics, Keio University School of Medicine, Tokyo, Japan
Toshihisa Takagi National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan

Collapse

Kawashima S, Katayama T, Hatanaka H, Kushida T, Takagi T. NBDC RDF portal: a comprehensive repository for semantic data in life sciences. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018;2018:5255118. [PMID: 30576482 PMCID: PMC6301334 DOI: 10.1093/database/bay123] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2018] [Accepted: 10/15/2018] [Indexed: 11/28/2022]

Gojobori T, Ikeo K, Katayama Y, Kawabata T, Kinjo AR, Kinoshita K, Kwon Y, Migita O, Mizutani H, Muraoka M, Nagata K, Omori S, Sugawara H, Yamada D, Yura K. VaProS: a database-integration approach for protein/genome information retrieval. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2016;17:69-81. [PMID: 28012137 PMCID: PMC5274651 DOI: 10.1007/s10969-016-9211-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2016] [Accepted: 12/05/2016] [Indexed: 01/01/2023]

Affiliation(s)

Takashi Gojobori Computational Bioscience Research Center, Biological and Environmental Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan
Kazuho Ikeo National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan
Yukie Katayama Graduate School of Agricultural and Life Sciences, University of Tokyo, Bunkyo, Tokyo, 113-8657, Japan
Takeshi Kawabata Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
Akira R Kinjo Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
Kengo Kinoshita Graduate School of Information Sciences, Tohoku University, Miyagi, Sendai, 980-8597, Japan Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Sendai, 980-8573, Japan
Yeondae Kwon Graduate School of Agricultural and Life Sciences, University of Tokyo, Bunkyo, Tokyo, 113-8657, Japan
Ohsuke Migita Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, Setagaya, Tokyo, 157-8535, Japan Department of Pediatrics, St. Marianna University School of Medicine, Miyamae, Kawasaki, 216-8511, Japan
Hisashi Mizutani National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan
Masafumi Muraoka National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan
Koji Nagata Graduate School of Agricultural and Life Sciences, University of Tokyo, Bunkyo, Tokyo, 113-8657, Japan
Satoshi Omori Graduate School of Information Sciences, Tohoku University, Miyagi, Sendai, 980-8597, Japan
Hideaki Sugawara National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan
Daichi Yamada Center for Informational Biology, Ochanomizu University, 2-1-1, Otsuka, Bunkyo, Tokyo, 112-8610, Japan
Kei Yura National Institute of Genetics, Shizuoka, 411-8540, Mishima, Japan. Center for Informational Biology, Ochanomizu University, 2-1-1, Otsuka, Bunkyo, Tokyo, 112-8610, Japan.

Collapse

Kinjo AR, Bekker GJ, Suzuki H, Tsuchiya Y, Kawabata T, Ikegawa Y, Nakamura H. Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures. Nucleic Acids Res 2016;45:D282-D288. [PMID: 27789697 PMCID: PMC5210648 DOI: 10.1093/nar/gkw962] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Revised: 10/06/2016] [Accepted: 10/11/2016] [Indexed: 11/14/2022] Open

Tohsato Y, Ho KHL, Kyoda K, Onami S. SSBD: a database of quantitative data of spatiotemporal dynamics of biological phenomena. Bioinformatics 2016;32:3471-3479. [PMID: 27412095 PMCID: PMC5181557 DOI: 10.1093/bioinformatics/btw417] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 04/15/2016] [Accepted: 06/19/2016] [Indexed: 11/20/2022] Open

Bolleman JT, Mungall CJ, Strozzi F, Baran J, Dumontier M, Bonnal RJP, Buels R, Hoehndorf R, Fujisawa T, Katayama T, Cock PJA. FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation. J Biomed Semantics 2016;7:39. [PMID: 27296299 PMCID: PMC4907002 DOI: 10.1186/s13326-016-0067-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Accepted: 03/17/2016] [Indexed: 11/18/2022] Open

Kawazoe Y, Imai T, Ohe K. A Querying Method over RDF-ized Health Level Seven v2.5 Messages Using Life Science Knowledge Resources. JMIR Med Inform 2016;4:e12. [PMID: 27050304 PMCID: PMC4837294 DOI: 10.2196/medinform.5275] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Revised: 01/11/2016] [Accepted: 02/19/2016] [Indexed: 11/23/2022] Open

Abstract

Background

Health level seven version 2.5 (HL7 v2.5) is a widespread messaging standard for information exchange between clinical information systems. By applying Semantic Web technologies for handling HL7 v2.5 messages, it is possible to integrate large-scale clinical data with life science knowledge resources.

Objective

Showing feasibility of a querying method over large-scale resource description framework (RDF)-ized HL7 v2.5 messages using publicly available drug databases.

Methods

We developed a method to convert HL7 v2.5 messages into the RDF. We also converted five kinds of drug databases into RDF and provided explicit links between the corresponding items among them. With those linked drug data, we then developed a method for query expansion to search the clinical data using semantic information on drug classes along with four types of temporal patterns. For evaluation purpose, medication orders and laboratory test results for a 3-year period at the University of Tokyo Hospital were used, and the query execution times were measured.

Results

Approximately 650 million RDF triples for medication orders and 790 million RDF triples for laboratory test results were converted. Taking three types of query in use cases for detecting adverse events of drugs as an example, we confirmed these queries were represented in SPARQL Protocol and RDF Query Language (SPARQL) using our methods and comparison with conventional query expressions were performed. The measurement results confirm that the query time is feasible and increases logarithmically or linearly with the amount of data and without diverging.

Conclusions

The proposed methods enabled query expressions that separate knowledge resources and clinical data, thereby suggesting the feasibility for improving the usability of clinical data by enhancing the knowledge resources. We also demonstrate that when HL7 v2.5 messages are automatically converted into RDF, searches are still possible through SPARQL without modifying the structure. As such, the proposed method benefits not only our hospitals, but also numerous hospitals that handle HL7 v2.5 messages. Our approach highlights a potential of large-scale data federation techniques to retrieve clinical information, which could be applied as applications of clinical intelligence to improve clinical practices, such as adverse drug event monitoring and cohort selection for a clinical study as well as discovering new knowledge from clinical information.

Collapse

Hofmann-Apitius M, Ball G, Gebel S, Bagewadi S, de Bono B, Schneider R, Page M, Kodamullil AT, Younesi E, Ebeling C, Tegnér J, Canard L. Bioinformatics Mining and Modeling Methods for the Identification of Disease Mechanisms in Neurodegenerative Disorders. Int J Mol Sci 2015;16:29179-206. [PMID: 26690135 PMCID: PMC4691095 DOI: 10.3390/ijms161226148] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Revised: 11/10/2015] [Accepted: 11/12/2015] [Indexed: 12/22/2022] Open

Abstract

Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer´s disease or Parkinson´s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategies-data mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).

Collapse

Affiliation(s)

Martin Hofmann-Apitius Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany. Rheinische Friedrich-Wilhelms-Universitaet Bonn, University of Bonn, Bonn 53113, Germany.
Gordon Ball Unit of Computational Medicine, Center for Molecular Medicine, Department of Medicine, and Unit of Clinical Epidemiology, Karolinska University Hospital, Stockholm SE-171 77, Sweden. Science for Life Laboratories, Karolinska Institutet, Stockholm SE-171 77, Sweden.
Stephan Gebel Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, avenue des Hauts-Fourneaux, Esch-sur-Alzette L-4362, Luxembourg.
Shweta Bagewadi Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Bernard de Bono Institute of Health Informatics, University College London, London NW1 2DA, UK. Auckland Bioengineering Institute, University of Auckland, Symmonds Street, Auckland 1142, New Zealand.
Reinhard Schneider Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, avenue des Hauts-Fourneaux, Esch-sur-Alzette L-4362, Luxembourg.
Matt Page Translational Bioinformatics, UCB Pharma, 216 Bath Rd, Slough SL1 3WE, UK.
Alpha Tom Kodamullil Rheinische Friedrich-Wilhelms-Universitaet Bonn, University of Bonn, Bonn 53113, Germany.
Erfan Younesi Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Christian Ebeling Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Jesper Tegnér Unit of Computational Medicine, Center for Molecular Medicine, Department of Medicine, and Unit of Clinical Epidemiology, Karolinska University Hospital, Stockholm SE-171 77, Sweden. Science for Life Laboratories, Karolinska Institutet, Stockholm SE-171 77, Sweden.
Luc Canard Translational Science Unit, SANOFI Recherche & Développement, 1 Avenue Pierre Brossolette, Chilly-Mazarin Cedex 91385, France.

Collapse

Chiba H, Nishide H, Uchiyama I. Construction of an ortholog database using the semantic web technology for integrative analysis of genomic data. PLoS One 2015;10:e0122802. [PMID: 25875762 PMCID: PMC4395280 DOI: 10.1371/journal.pone.0122802] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2014] [Accepted: 02/13/2015] [Indexed: 12/30/2022] Open

Beneventano D, Bergamaschi S, Sorrentino S, Vincini M, Benedetti F. Semantic annotation of the CEREALAB database by the AGROVOC linked dataset. ECOL INFORM 2015. [DOI: 10.1016/j.ecoinf.2014.07.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Möller S, Afgan E, Banck M, Bonnal RJP, Booth T, Chilton J, Cock PJA, Gumbel M, Harris N, Holland R, Kalaš M, Kaján L, Kibukawa E, Powel DR, Prins P, Quinn J, Sallou O, Strozzi F, Seemann T, Sloggett C, Soiland-Reyes S, Spooner W, Steinbiss S, Tille A, Travis AJ, Guimera R, Katayama T, Chapman BA. Community-driven development for computational biology at Sprints, Hackathons and Codefests. BMC Bioinformatics 2014;15 Suppl 14:S7. [PMID: 25472764 PMCID: PMC4255748 DOI: 10.1186/1471-2105-15-s14-s7] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Abstract

Background

Computational biology comprises a wide range of technologies and approaches. Multiple technologies can be combined to create more powerful workflows if the individuals contributing the data or providing tools for its interpretation can find mutual understanding and consensus. Much conversation and joint investigation are required in order to identify and implement the best approaches.

Traditionally, scientific conferences feature talks presenting novel technologies or insights, followed up by informal discussions during coffee breaks. In multi-institution collaborations, in order to reach agreement on implementation details or to transfer deeper insights in a technology and practical skills, a representative of one group typically visits the other. However, this does not scale well when the number of technologies or research groups is large.

Conferences have responded to this issue by introducing Birds-of-a-Feather (BoF) sessions, which offer an opportunity for individuals with common interests to intensify their interaction. However, parallel BoF sessions often make it hard for participants to join multiple BoFs and find common ground between the different technologies, and BoFs are generally too short to allow time for participants to program together.

Results

This report summarises our experience with computational biology Codefests, Hackathons and Sprints, which are interactive developer meetings. They are structured to reduce the limitations of traditional scientific meetings described above by strengthening the interaction among peers and letting the participants determine the schedule and topics. These meetings are commonly run as loosely scheduled "unconferences" (self-organized identification of participants and topics for meetings) over at least two days, with early introductory talks to welcome and organize contributors, followed by intensive collaborative coding sessions. We summarise some prominent achievements of those meetings and describe differences in how these are organised, how their audience is addressed, and their outreach to their respective communities.

Conclusions

Hackathons, Codefests and Sprints share a stimulating atmosphere that encourages participants to jointly brainstorm and tackle problems of shared interest in a self-driven proactive environment, as well as providing an opportunity for new participants to get involved in collaborative projects.

Collapse

Oshita K, Tomita M, Arakawa K. G-Links: a gene-centric link acquisition service. F1000Res 2014;3:285. [PMID: 26673001 PMCID: PMC4670005 DOI: 10.12688/f1000research.5754.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/16/2015] [Indexed: 01/20/2023] Open

Wu H, Fujiwara T, Yamamoto Y, Bolleman J, Yamaguchi A. BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data. J Biomed Semantics 2014;5:32. [PMID: 25089180 PMCID: PMC4118313 DOI: 10.1186/2041-1480-5-32] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Accepted: 04/27/2014] [Indexed: 12/21/2022] Open

Abstract

Background

Biological databases vary enormously in size and data complexity, from small databases that contain a few million Resource Description Framework (RDF) triples to large databases that contain billions of triples. In this paper, we evaluate whether RDF native stores can be used to meet the needs of a biological database provider. Prior evaluations have used synthetic data with a limited database size. For example, the largest BSBM benchmark uses 1 billion synthetic e-commerce knowledge RDF triples on a single node. However, real world biological data differs from the simple synthetic data much. It is difficult to determine whether the synthetic e-commerce data is efficient enough to represent biological databases. Therefore, for this evaluation, we used five real data sets from biological databases.

Results

We evaluated five triple stores, 4store, Bigdata, Mulgara, Virtuoso, and OWLIM-SE, with five biological data sets, Cell Cycle Ontology, Allie, PDBj, UniProt, and DDBJ, ranging in size from approximately 10 million to 8 billion triples.

For each database, we loaded all the data into our single node and prepared the database for use in a classical data warehouse scenario. Then, we ran a series of SPARQL queries against each endpoint and recorded the execution time and the accuracy of the query response.

Conclusions

Our paper shows that with appropriate configuration Virtuoso and OWLIM-SE can satisfy the basic requirements to load and query biological data less than 8 billion or so on a single node, for the simultaneous access of 64 clients.

OWLIM-SE performs best for databases with approximately 11 million triples; For data sets that contain 94 million and 590 million triples, OWLIM-SE and Virtuoso perform best. They do not show overwhelming advantage over each other; For data over 4 billion Virtuoso works best.

4store performs well on small data sets with limited features when the number of triples is less than 100 million, and our test shows its scalability is poor; Bigdata demonstrates average performance and is a good open source triple store for middle-sized (500 million or so) data set; Mulgara shows a little of fragility.

Collapse

Vos RA, Biserkov JV, Balech B, Beard N, Blissett M, Brenninkmeijer C, van Dooren T, Eades D, Gosline G, Groom QJ, Hamann TD, Hettling H, Hoehndorf R, Holleman A, Hovenkamp P, Kelbert P, King D, Kirkup D, Lammers Y, DeMeulemeester T, Mietchen D, Miller JA, Mounce R, Nicolson N, Page R, Pawlik A, Pereira S, Penev L, Richards K, Sautter G, Shorthouse DP, Tähtinen M, Weiland C, Williams AR, Sierra S. Enriched biodiversity data as a resource and service. Biodivers Data J 2014:e1125. [PMID: 25057255 PMCID: PMC4092319 DOI: 10.3897/bdj.2.e1125] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2014] [Accepted: 06/11/2014] [Indexed: 11/28/2022] Open

Abstract

Background: Recent years have seen a surge in projects that produce large volumes of structured, machine-readable biodiversity data. To make these data amenable to processing by generic, open source “data enrichment” workflows, they are increasingly being represented in a variety of standards-compliant interchange formats. Here, we report on an initiative in which software developers and taxonomists came together to address the challenges and highlight the opportunities in the enrichment of such biodiversity data by engaging in intensive, collaborative software development: The Biodiversity Data Enrichment Hackathon.

Results: The hackathon brought together 37 participants (including developers and taxonomists, i.e. scientific professionals that gather, identify, name and classify species) from 10 countries: Belgium, Bulgaria, Canada, Finland, Germany, Italy, the Netherlands, New Zealand, the UK, and the US. The participants brought expertise in processing structured data, text mining, development of ontologies, digital identification keys, geographic information systems, niche modeling, natural language processing, provenance annotation, semantic integration, taxonomic name resolution, web service interfaces, workflow tools and visualisation. Most use cases and exemplar data were provided by taxonomists.

One goal of the meeting was to facilitate re-use and enhancement of biodiversity knowledge by a broad range of stakeholders, such as taxonomists, systematists, ecologists, niche modelers, informaticians and ontologists. The suggested use cases resulted in nine breakout groups addressing three main themes: i) mobilising heritage biodiversity knowledge; ii) formalising and linking concepts; and iii) addressing interoperability between service platforms. Another goal was to further foster a community of experts in biodiversity informatics and to build human links between research projects and institutions, in response to recent calls to further such integration in this research domain.

Conclusions: Beyond deriving prototype solutions for each use case, areas of inadequacy were discussed and are being pursued further. It was striking how many possible applications for biodiversity data there were and how quickly solutions could be put together when the normal constraints to collaboration were broken down for a week. Conversely, mobilising biodiversity knowledge from their silos in heritage literature and natural history collections will continue to require formalisation of the concepts (and the links between them) that define the research domain, as well as increased interoperability between the software platforms that operate on these concepts.

Collapse

Affiliation(s)

Rutger Aldo Vos Naturalis Biodiversity Center, Leiden, Netherlands
Jordan Valkov Biserkov Pensoft Publishers, Sofia, Bulgaria
Bachir Balech Institute of Biomembranes and Bioenergetics, National Research Council, Bari, Italy
Niall Beard University of Manchester, Manchester, United Kingdom
Matthew Blissett Royal Botanic Gardens, Kew, United Kingdom
Christian Brenninkmeijer University of Manchester, Manchester, United Kingdom
Tom van Dooren Naturalis Biodiversity Center, Leiden, Netherlands
David Eades The Illinois Natural History Survey, Champaign, United States of America
George Gosline Royal Botanic Gardens, Kew, United Kingdom
Quentin John Groom Agentschap Plantentuin Meise, Meise, Belgium
Thomas D Hamann Naturalis Biodiversity Center, Leiden, Netherlands
Hannes Hettling Naturalis Biodiversity Center, Leiden, Netherlands
Robert Hoehndorf Aberystwyth University, Aberystwyth, United Kingdom
Ayco Holleman Naturalis Biodiversity Center, Leiden, Netherlands
Peter Hovenkamp Naturalis Biodiversity Center, Leiden, Netherlands
Patricia Kelbert Botanic Garden and Botanical Museum Berlin-Dahlem, Freie Universität Berlin, Berlin, Germany
David King The Open University, Milton Keynes, United Kingdom
Don Kirkup Royal Botanic Gardens, Kew, United Kingdom
Youri Lammers Naturalis Biodiversity Center, Leiden, Netherlands
Thibaut DeMeulemeester Naturalis Biodiversity Center, Leiden, Netherlands
Daniel Mietchen Museum für Naturkunde, Berlin, Germany
Jeremy A Miller Naturalis Biodiversity Center, Leiden, Netherlands
Ross Mounce University of Bath, Bath, United Kingdom
Nicola Nicolson Royal Botanic Gardens, Kew, United Kingdom
Rod Page University Of Glasgow, Glasgow, United Kingdom
Aleksandra Pawlik Software Sustainability Institute, Manchester, United Kingdom
Serrano Pereira Naturalis Biodiversity Center, Leiden, Netherlands
Lyubomir Penev Pensoft Publishers, Sofia, Bulgaria
Kevin Richards Biodiversity Informatics Consultant, Christchurch, New Zealand
Guido Sautter Plazi, Bern, Switzerland
David Peter Shorthouse Université de Montréal Biodiversity Centre, Montréal, Canada
Marko Tähtinen University of Eastern Finland, Espoo, Finland
Claus Weiland Biodiversity and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, Frankfurt, Germany
Alan R Williams University of Manchester, Manchester, United Kingdom
Soraya Sierra Naturalis Biodiversity Center, Leiden, Netherlands

Collapse

Katayama T, Wilkinson MD, Aoki-Kinoshita KF, Kawashima S, Yamamoto Y, Yamaguchi A, Okamoto S, Kawano S, Kim JD, Wang Y, Wu H, Kano Y, Ono H, Bono H, Kocbek S, Aerts J, Akune Y, Antezana E, Arakawa K, Aranda B, Baran J, Bolleman J, Bonnal RJ, Buttigieg PL, Campbell MP, Chen YA, Chiba H, Cock PJ, Cohen KB, Constantin A, Duck G, Dumontier M, Fujisawa T, Fujiwara T, Goto N, Hoehndorf R, Igarashi Y, Itaya H, Ito M, Iwasaki W, Kalaš M, Katoda T, Kim T, Kokubu A, Komiyama Y, Kotera M, Laibe C, Lapp H, Lütteke T, Marshall MS, Mori T, Mori H, Morita M, Murakami K, Nakao M, Narimatsu H, Nishide H, Nishimura Y, Nystrom-Persson J, Ogishima S, Okamura Y, Okuda S, Oshita K, Packer NH, Prins P, Ranzinger R, Rocca-Serra P, Sansone S, Sawaki H, Shin SH, Splendiani A, Strozzi F, Tadaka S, Toukach P, Uchiyama I, Umezaki M, Vos R, Whetzel PL, Yamada I, Yamasaki C, Yamashita R, York WS, Zmasek CM, Kawamoto S, Takagi T. BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains. J Biomed Semantics 2014;5:5. [PMID: 24495517 PMCID: PMC3978116 DOI: 10.1186/2041-1480-5-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Accepted: 11/26/2013] [Indexed: 01/24/2023] Open

Jupp S, Malone J, Bolleman J, Brandizi M, Davies M, Garcia L, Gaulton A, Gehant S, Laibe C, Redaschi N, Wimalaratne SM, Martin M, Le Novère N, Parkinson H, Birney E, Jenkinson AM. The EBI RDF platform: linked open data for the life sciences. Bioinformatics 2014;30:1338-9. [PMID: 24413672 PMCID: PMC3998127 DOI: 10.1093/bioinformatics/btt765] [Citation(s) in RCA: 117] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

Fujisawa T, Okamoto S, Katayama T, Nakao M, Yoshimura H, Kajiya-Kanegae H, Yamamoto S, Yano C, Yanaka Y, Maita H, Kaneko T, Tabata S, Nakamura Y. CyanoBase and RhizoBase: databases of manually curated annotations for cyanobacterial and rhizobial genomes. Nucleic Acids Res 2013;42:D666-70. [PMID: 24275496 PMCID: PMC3965071 DOI: 10.1093/nar/gkt1145] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open