Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sarntivijai S, Ade AS, Athey BD, States DJ. A bioinformatics analysis of the cell line nomenclature. ACTA ACUST UNITED AC 2008;24:2760-6. [PMID: 18849319 DOI: 10.1093/bioinformatics/btn502] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

For:	Sarntivijai S, Ade AS, Athey BD, States DJ. A bioinformatics analysis of the cell line nomenclature. ACTA ACUST UNITED AC 2008;24:2760-6. [PMID: 18849319 DOI: 10.1093/bioinformatics/btn502] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Number

Cited by Other Article(s)

Li L, Li H, Ishdorj TO, Zheng C, Su Y. MDNNSyn: A Multi-Modal Deep Learning Framework for Drug Synergy Prediction. IEEE J Biomed Health Inform 2024;28:6225-6236. [PMID: 38954565 DOI: 10.1109/jbhi.2024.3421916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]

Minimum Information and Quality Standards for Conducting, Reporting, and Organizing In Vitro Research. Handb Exp Pharmacol 2020;257:177-196. [PMID: 31628600 DOI: 10.1007/164_2019_284] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Korch C, Varella-Garcia M. Tackling the Human Cell Line and Tissue Misidentification Problem Is Needed for Reproducible Biomedical Research. ACTA ACUST UNITED AC 2018. [DOI: 10.1016/j.yamp.2018.07.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Jeong I, Yu N, Jang I, Jun Y, Kim MS, Choi J, Lee B, Lee S. GEMiCCL: mining genotype and expression data of cancer cell lines with elaborate visualization. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018;2018:4991663. [PMID: 29726944 PMCID: PMC5932466 DOI: 10.1093/database/bay041] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 04/05/2018] [Indexed: 12/21/2022]

Ong E, Sarntivijai S, Jupp S, Parkinson H, He Y. Comparison, alignment, and synchronization of cell line information between CLO and EFO. BMC Bioinformatics 2017;18:557. [PMID: 29322915 PMCID: PMC5763470 DOI: 10.1186/s12859-017-1979-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Kafkas Ş, Sarntivijai S, Hoehndorf R. Usage of cell nomenclature in biomedical literature. BMC Bioinformatics 2017;18:561. [PMID: 29322912 PMCID: PMC5763300 DOI: 10.1186/s12859-017-1978-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Reid YA. Best practices for naming, receiving, and managing cells in culture. In Vitro Cell Dev Biol Anim 2017;53:761-774. [PMID: 28986713 DOI: 10.1007/s11626-017-0199-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 09/05/2017] [Indexed: 12/26/2022]

Yu M, Selvaraj SK, Liang-Chu MMY, Aghajani S, Busse M, Yuan J, Lee G, Peale F, Klijn C, Bourgon R, Kaminker JS, Neve RM. A resource for cell line authentication, annotation and quality control. Nature 2015;520:307-11. [PMID: 25877200 DOI: 10.1038/nature14397] [Citation(s) in RCA: 286] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2014] [Accepted: 03/09/2015] [Indexed: 01/25/2023]

Vempati UD, Przydzial MJ, Chung C, Abeyruwan S, Mir A, Sakurai K, Visser U, Lemmon VP, Schürer SC. Formalization, annotation and analysis of diverse drug and probe screening assay datasets using the BioAssay Ontology (BAO). PLoS One 2012;7:e49198. [PMID: 23155465 PMCID: PMC3498356 DOI: 10.1371/journal.pone.0049198] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2012] [Accepted: 10/04/2012] [Indexed: 11/30/2022] Open

Abstract

Huge amounts of high-throughput screening (HTS) data for probe and drug development projects are being generated in the pharmaceutical industry and more recently in the public sector. The resulting experimental datasets are increasingly being disseminated via publically accessible repositories. However, existing repositories lack sufficient metadata to describe the experiments and are often difficult to navigate by non-experts. The lack of standardized descriptions and semantics of biological assays and screening results hinder targeted data retrieval, integration, aggregation, and analyses across different HTS datasets, for example to infer mechanisms of action of small molecule perturbagens. To address these limitations, we created the BioAssay Ontology (BAO). BAO has been developed with a focus on data integration and analysis enabling the classification of assays and screening results by concepts that relate to format, assay design, technology, target, and endpoint. Previously, we reported on the higher-level design of BAO and on the semantic querying capabilities offered by the ontology-indexed triple store of HTS data. Here, we report on our detailed design, annotation pipeline, substantially enlarged annotation knowledgebase, and analysis results. We used BAO to annotate assays from the largest public HTS data repository, PubChem, and demonstrate its utility to categorize and analyze diverse HTS results from numerous experiments. BAO is publically available from the NCBO BioPortal at http://bioportal.bioontology.org/ontologies/1533. BAO provides controlled terminology and uniform scope to report probe and drug discovery screening assays and results. BAO leverages description logic to formalize the domain knowledge and facilitate the semantic integration with diverse other resources. As a consequence, BAO offers the potential to infer new knowledge from a corpus of assay results, for example molecular mechanisms of action of perturbagens.

Collapse

Ganzinger M, He S, Breuhahn K, Knaup P. On the ontology based representation of cell lines. PLoS One 2012;7:e48584. [PMID: 23144907 PMCID: PMC3492450 DOI: 10.1371/journal.pone.0048584] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Accepted: 09/26/2012] [Indexed: 11/23/2022] Open

Chen H, Yu T, Chen JY. Semantic Web meets Integrative Biology: a survey. Brief Bioinform 2012;14:109-25. [PMID: 22492191 DOI: 10.1093/bib/bbs014] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Harmston N, Filsell W, Stumpf MPH. Which species is it? Species-driven gene name disambiguation using random walks over a mixture of adjacency matrices. Bioinformatics 2011;28:254-60. [PMID: 22135416 DOI: 10.1093/bioinformatics/btr640] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Athey BD, Cavalcoli JD, Jagadish HV, Omenn GS, Mirel B, Kretzler M, Burant C, Isokpehi RD, DeLisi C. The NIH National Center for Integrative Biomedical Informatics (NCIBI). J Am Med Inform Assoc 2011;19:166-70. [PMID: 22101971 DOI: 10.1136/amiajnl-2011-000552] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Lu Z, Kao HY, Wei CH, Huang M, Liu J, Kuo CJ, Hsu CN, Tsai RTH, Dai HJ, Okazaki N, Cho HC, Gerner M, Solt I, Agarwal S, Liu F, Vishnyakova D, Ruch P, Romacker M, Rinaldi F, Bhattacharya S, Srinivasan P, Liu H, Torii M, Matos S, Campos D, Verspoor K, Livingston KM, Wilbur WJ. The gene normalization task in BioCreative III. BMC Bioinformatics 2011;12 Suppl 8:S2. [PMID: 22151901 PMCID: PMC3269937 DOI: 10.1186/1471-2105-12-s8-s2] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

We report the Gene Normalization (GN) challenge in BioCreative III where participating teams were asked to return a ranked list of identifiers of the genes detected in full-text articles. For training, 32 fully and 500 partially annotated articles were prepared. A total of 507 articles were selected as the test set. Due to the high annotation cost, it was not feasible to obtain gold-standard human annotations for all test articles. Instead, we developed an Expectation Maximization (EM) algorithm approach for choosing a small number of test articles for manual annotation that were most capable of differentiating team performance. Moreover, the same algorithm was subsequently used for inferring ground truth based solely on team submissions. We report team performance on both gold standard and inferred ground truth using a newly proposed metric called Threshold Average Precision (TAP-k).

RESULTS

We received a total of 37 runs from 14 different teams for the task. When evaluated using the gold-standard annotations of the 50 articles, the highest TAP-k scores were 0.3297 (k=5), 0.3538 (k=10), and 0.3535 (k=20), respectively. Higher TAP-k scores of 0.4916 (k=5, 10, 20) were observed when evaluated using the inferred ground truth over the full test set. When combining team results using machine learning, the best composite system achieved TAP-k scores of 0.3707 (k=5), 0.4311 (k=10), and 0.4477 (k=20) on the gold standard, representing improvements of 12.4%, 21.8%, and 26.6% over the best team results, respectively.

CONCLUSIONS

By using full text and being species non-specific, the GN task in BioCreative III has moved closer to a real literature curation task than similar tasks in the past and presents additional challenges for the text mining community, as revealed in the overall team results. By evaluating teams using the gold standard, we show that the EM algorithm allows team submissions to be differentiated while keeping the manual annotation effort feasible. Using the inferred ground truth we show measures of comparative performance between teams. Finally, by comparing team rankings on gold standard vs. inferred ground truth, we further demonstrate that the inferred ground truth is as effective as the gold standard for detecting good team performance.

Collapse

Affiliation(s)

Zhiyong Lu National Center for Biotechnology Information (NCBI), 8600 Rockville Pike, Bethesda, Maryland 20894, USA
Hung-Yu Kao Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, R.O.C
Chih-Hsuan Wei Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, R.O.C
Minlie Huang Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Jingchen Liu Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Cheng-Ju Kuo Institute of Information Science, Academia Sinica, Taipei 115, Taiwan
Chun-Nan Hsu Institute of Information Science, Academia Sinica, Taipei 115, Taiwan Information Science Institute, University of Southern California, Marina del Rey, California, USA
Richard Tzong-Han Tsai Department of Computer Science and Engineering, Yuan Ze University, Chung-Li, Taiwan, R.O.C
Hong-Jie Dai Department of Computer Science, National Tsing-Hua University, Hsinchu, Taiwan, R.O.C Institute of Information Science, Academic Sinica, Taipei, Taiwan, R.O.C
Naoaki Okazaki Interfaculty Initiative in Information Studies, University of Tokyo, Japan
Han-Cheol Cho Graduate School of Information Science and Technology, University of Tokyo, Japan
Martin Gerner Faculty of Life Sciences, University of Manchester, Manchester, M13 9PT, UK
Illes Solt Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, 1117 Budapest, Hungary
Shashank Agarwal Medical Informatics, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Feifan Liu Medical Informatics, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Dina Vishnyakova BiTem Group, Division of Medical Information Sciences, University of Geneva, Switzerland
Patrick Ruch BiTeM Group, Information Science Department, University of Applied Science, Geneva, Switzerland
Martin Romacker NITAS/TMS, Text Mining Services, Novartis AG, Switzerland
Fabio Rinaldi Institute of Computational Linguistics, University of Zurich, Zurich, Switzerland
Sanmitra Bhattacharya Department of Computer Science, The University of Iowa, Iowa City, Iowa 52242, USA
Padmini Srinivasan Department of Computer Science, The University of Iowa, Iowa City, Iowa 52242, USA
Hongfang Liu Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, MN 55905 USA
Manabu Torii Lab of Text Intelligence in Biomedicine, Georgetown University Medical Center, 4000 Reservoir Rd., NW, Washington, DC 20057 USA
Sergio Matos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
David Campos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
Karin Verspoor Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado, USA
Kevin M Livingston Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado, USA
W John Wilbur National Center for Biotechnology Information (NCBI), 8600 Rockville Pike, Bethesda, Maryland 20894, USA

Collapse

BioAssay Ontology (BAO): a semantic description of bioassays and high-throughput screening results. BMC Bioinformatics 2011;12:257. [PMID: 21702939 PMCID: PMC3149580 DOI: 10.1186/1471-2105-12-257] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 06/24/2011] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

High-throughput screening (HTS) is one of the main strategies to identify novel entry points for the development of small molecule chemical probes and drugs and is now commonly accessible to public sector research. Large amounts of data generated in HTS campaigns are submitted to public repositories such as PubChem, which is growing at an exponential rate. The diversity and quantity of available HTS assays and screening results pose enormous challenges to organizing, standardizing, integrating, and analyzing the datasets and thus to maximize the scientific and ultimately the public health impact of the huge investments made to implement public sector HTS capabilities. Novel approaches to organize, standardize and access HTS data are required to address these challenges.

RESULTS

We developed the first ontology to describe HTS experiments and screening results using expressive description logic. The BioAssay Ontology (BAO) serves as a foundation for the standardization of HTS assays and data and as a semantic knowledge model. In this paper we show important examples of formalizing HTS domain knowledge and we point out the advantages of this approach. The ontology is available online at the NCBO bioportal http://bioportal.bioontology.org/ontologies/44531.

CONCLUSIONS

After a large manual curation effort, we loaded BAO-mapped data triples into a RDF database store and used a reasoner in several case studies to demonstrate the benefits of formalized domain knowledge representation in BAO. The examples illustrate semantic querying capabilities where BAO enables the retrieval of inferred search results that are relevant to a given query, but are not explicitly defined. BAO thus opens new functionality for annotating, querying, and analyzing HTS datasets and the potential for discovering new knowledge by means of inference.

Collapse

Rinaldi F, Kaljurand K, Sætre R. Terminological resources for text mining over biomedical scientific literature. Artif Intell Med 2011;52:107-14. [PMID: 21652190 DOI: 10.1016/j.artmed.2011.04.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Revised: 04/18/2011] [Accepted: 04/18/2011] [Indexed: 11/30/2022]

Ozgür A, Xiang Z, Radev DR, He Y. Mining of vaccine-associated IFN-γ gene interaction networks using the Vaccine Ontology. J Biomed Semantics 2011;2 Suppl 2:S8. [PMID: 21624163 PMCID: PMC3102897 DOI: 10.1186/2041-1480-2-s2-s8] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Harmston N, Filsell W, Stumpf MPH. What the papers say: text mining for genomics and systems biology. Hum Genomics 2010;5:17-29. [PMID: 21106487 PMCID: PMC3500154 DOI: 10.1186/1479-7364-5-1-17] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2010] [Accepted: 08/06/2010] [Indexed: 12/11/2022] Open

Abstract

Keeping up with the rapidly growing literature has become virtually impossible for most scientists. This can have dire consequences. First, we may waste research time and resources on reinventing the wheel simply because we can no longer maintain a reliable grasp on the published literature. Second, and perhaps more detrimental, judicious (or serendipitous) combination of knowledge from different scientific disciplines, which would require following disparate and distinct research literatures, is rapidly becoming impossible for even the most ardent readers of research publications. Text mining - the automated extraction of information from (electronically) published sources - could potentially fulfil an important role - but only if we know how to harness its strengths and overcome its weaknesses. As we do not expect that the rate at which scientific results are published will decrease, text mining tools are now becoming essential in order to cope with, and derive maximum benefit from, this information explosion. In genomics, this is particularly pressing as more and more rare disease-causing variants are found and need to be understood. Not being conversant with this technology may put scientists and biomedical regulators at a severe disadvantage. In this review, we introduce the basic concepts underlying modern text mining and its applications in genomics and systems biology. We hope that this review will serve three purposes: (i) to provide a timely and useful overview of the current status of this field, including a survey of present challenges; (ii) to enable researchers to decide how and when to apply text mining tools in their own research; and (iii) to highlight how the research communities in genomics and systems biology can help to make text mining from biomedical abstracts and texts more straightforward.

Collapse

Shin YC, Shin SY, So I, Kwon D, Jeon JH. TRIP Database: a manually curated database of protein-protein interactions for mammalian TRP channels. Nucleic Acids Res 2010;39:D356-61. [PMID: 20851834 PMCID: PMC3013757 DOI: 10.1093/nar/gkq814] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Gerner M, Nenadic G, Bergman CM. LINNAEUS: a species name identification system for biomedical literature. BMC Bioinformatics 2010;11:85. [PMID: 20149233 PMCID: PMC2836304 DOI: 10.1186/1471-2105-11-85] [Citation(s) in RCA: 159] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Accepted: 02/11/2010] [Indexed: 11/25/2022] Open

Using Existing Biomedical Resources to Detect and Ground Terms in Biomedical Literature. Artif Intell Med 2009. [DOI: 10.1007/978-3-642-02976-9_32] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]