Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wijaya SH, Husnawati H, Afendi FM, Batubara I, Darusman LK, Altaf-Ul-Amin M, Sato T, Ono N, Sugiura T, Kanaya S. Supervised clustering based on DPClusO: prediction of plant-disease relations using Jamu formulas of KNApSAcK database. Biomed Res Int 2014;2014:831751. [PMID: 24804251 DOI: 10.1155/2014/831751] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2013] [Accepted: 02/18/2014] [Indexed: 02/06/2023]

For:	Wijaya SH, Husnawati H, Afendi FM, Batubara I, Darusman LK, Altaf-Ul-Amin M, Sato T, Ono N, Sugiura T, Kanaya S. Supervised clustering based on DPClusO: prediction of plant-disease relations using Jamu formulas of KNApSAcK database. Biomed Res Int 2014;2014:831751. [PMID: 24804251 DOI: 10.1155/2014/831751] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2013] [Accepted: 02/18/2014] [Indexed: 02/06/2023]

Number

Cited by Other Article(s)

Abdullah-Zawawi MR, Govender N, Karim MB, Altaf-Ul-Amin M, Kanaya S, Mohamed-Hussein ZA. Chemoinformatics-driven classification of Angiosperms using sulfur-containing compounds and machine learning algorithm. PLANT METHODS 2022;18:118. [PMID: 36335358 PMCID: PMC9636760 DOI: 10.1186/s13007-022-00951-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 10/14/2022] [Indexed: 06/16/2023]

Abstract

BACKGROUND

Phytochemicals or secondary metabolites are low molecular weight organic compounds with little function in plant growth and development. Nevertheless, the metabolite diversity govern not only the phenetics of an organism but may also inform the evolutionary pattern and adaptation of green plants to the changing environment. Plant chemoinformatics analyzes the chemical system of natural products using computational tools and robust mathematical algorithms. It has been a powerful approach for species-level differentiation and is widely employed for species classifications and reinforcement of previous classifications.

RESULTS

This study attempts to classify Angiosperms using plant sulfur-containing compound (SCC) or sulphated compound information. The SCC dataset of 692 plant species were collected from the comprehensive species-metabolite relationship family (KNApSAck) database. The structural similarity score of metabolite pairs under all possible combinations (plant species-metabolite) were determined and metabolite pairs with a Tanimoto coefficient value > 0.85 were selected for clustering using machine learning algorithm. Metabolite clustering showed association between the similar structural metabolite clusters and metabolite content among the plant species. Phylogenetic tree construction of Angiosperms displayed three major clades, of which, clade 1 and clade 2 represented the eudicots only, and clade 3, a mixture of both eudicots and monocots. The SCC-based construction of Angiosperm phylogeny is a subset of the existing monocot-dicot classification. The majority of eudicots present in clade 1 and 2 were represented by glucosinolate compounds. These clades with SCC may have been a mixture of ancestral species whilst the combinatorial presence of monocot-dicot in clade 3 suggests sulphated-chemical structure diversification in the event of adaptation during evolutionary change.

CONCLUSIONS

Sulphated chemoinformatics informs classification of Angiosperms via machine learning technique.

Collapse

Wijaya SH, Afendi FM, Batubara I, Huang M, Ono N, Kanaya S, Altaf-Ul-Amin M. Identification of Targeted Proteins by Jamu Formulas for Different Efficacies Using Machine Learning Approach. Life (Basel) 2021;11:866. [PMID: 34440610 PMCID: PMC8398944 DOI: 10.3390/life11080866] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/12/2021] [Accepted: 08/18/2021] [Indexed: 11/23/2022] Open

Hossain SF, Huang M, Ono N, Morita A, Kanaya S, Altaf-Ul-Amin M. Development of a biomarker database toward performing disease classification and finding disease interrelations. Database (Oxford) 2021;2021:baab011. [PMID: 33705530 PMCID: PMC7951048 DOI: 10.1093/database/baab011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 02/19/2021] [Accepted: 02/25/2021] [Indexed: 12/11/2022]

A cloud based knowledge discovery framework, for medicinal plants from PubMed literature. INFORMATICS IN MEDICINE UNLOCKED 2019. [DOI: 10.1016/j.imu.2019.100226] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Behera NK, Mahalakshmi G. A cloud based knowledge discovery framework, for medicinal plants from PubMed literature. INFORMATICS IN MEDICINE UNLOCKED 2019. [DOI: 10.1016/j.imu.2018.04.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open

Suparmi S, Widiastuti D, Wesseling S, Rietjens IMCM. Natural occurrence of genotoxic and carcinogenic alkenylbenzenes in Indonesian jamu and evaluation of consumer risks. Food Chem Toxicol 2018;118:53-67. [PMID: 29727721 DOI: 10.1016/j.fct.2018.04.059] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 04/24/2018] [Accepted: 04/25/2018] [Indexed: 12/15/2022]

Wijaya SH, Batubara I, Nishioka T, Altaf-Ul-Amin M, Kanaya S. Metabolomic Studies of Indonesian Jamu Medicines: Prediction of Jamu Efficacy and Identification of Important Metabolites. Mol Inform 2017;36. [PMID: 28682479 DOI: 10.1002/minf.201700050] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Accepted: 06/22/2017] [Indexed: 12/15/2022]

Wijaya SH, Afendi FM, Batubara I, Darusman LK, Altaf-Ul-Amin M, Kanaya S. Finding an appropriate equation to measure similarity between binary vectors: case studies on Indonesian and Japanese herbal medicines. BMC Bioinformatics 2016;17:520. [PMID: 27927171 PMCID: PMC5142342 DOI: 10.1186/s12859-016-1392-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2016] [Accepted: 11/29/2016] [Indexed: 12/30/2022] Open

Abstract

Background

The binary similarity and dissimilarity measures have critical roles in the processing of data consisting of binary vectors in various fields including bioinformatics and chemometrics. These metrics express the similarity and dissimilarity values between two binary vectors in terms of the positive matches, absence mismatches or negative matches. To our knowledge, there is no published work presenting a systematic way of finding an appropriate equation to measure binary similarity that performs well for certain data type or application. A proper method to select a suitable binary similarity or dissimilarity measure is needed to obtain better classification results.

Results

In this study, we proposed a novel approach to select binary similarity and dissimilarity measures. We collected 79 binary similarity and dissimilarity equations by extensive literature search and implemented those equations as an R package called bmeasures. We applied these metrics to quantify the similarity and dissimilarity between herbal medicine formulas belonging to the Indonesian Jamu and Japanese Kampo separately. We assessed the capability of binary equations to classify herbal medicine pairs into match and mismatch efficacies based on their similarity or dissimilarity coefficients using the Receiver Operating Characteristic (ROC) curve analysis. According to the area under the ROC curve results, we found Indonesian Jamu and Japanese Kampo datasets obtained different ranking of binary similarity and dissimilarity measures. Out of all the equations, the Forbes-2 similarity and the Variant of Correlation similarity measures are recommended for studying the relationship between Jamu formulas and Kampo formulas, respectively.

Conclusions

The selection of binary similarity and dissimilarity measures for multivariate analysis is data dependent. The proposed method can be used to find the most suitable binary similarity and dissimilarity equation wisely for a particular data. Our finding suggests that all four types of matching quantities in the Operational Taxonomic Unit (OTU) table are important to calculate the similarity and dissimilarity coefficients between herbal medicine formulas. Also, the binary similarity and dissimilarity measures that include the negative match quantity d achieve better capability to separate herbal medicine pairs compared to equations that exclude d.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1392-z) contains supplementary material, which is available to authorized users.

Collapse

Utilization of KNApSAcK Family Databases for Developing Herbal Medicine Systems. JOURNAL OF COMPUTER AIDED CHEMISTRY 2016. [DOI: 10.2751/jcac.17.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Development and mining of a volatile organic compound database. BIOMED RESEARCH INTERNATIONAL 2015;2015:139254. [PMID: 26495281 PMCID: PMC4606137 DOI: 10.1155/2015/139254] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Accepted: 06/14/2015] [Indexed: 12/16/2022]