1
|
Malwe AS, Sharma VK. Application of artificial intelligence approaches to predict the metabolism of xenobiotic molecules by human gut microbiome. Front Microbiol 2023; 14:1254073. [PMID: 38116528 PMCID: PMC10728657 DOI: 10.3389/fmicb.2023.1254073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 10/12/2023] [Indexed: 12/21/2023] Open
Abstract
A highly complex, diverse, and dense community of more than 1,000 different gut bacterial species constitutes the human gut microbiome that harbours vast metabolic capabilities encoded by more than 300,000 bacterial enzymes to metabolise complex polysaccharides, orally administered drugs/xenobiotics, nutraceuticals, or prebiotics. One of the implications of gut microbiome mediated biotransformation is the metabolism of xenobiotics such as medicinal drugs, which lead to alteration in their pharmacological properties, loss of drug efficacy, bioavailability, may generate toxic byproducts and sometimes also help in conversion of a prodrug into its active metabolite. Given the diversity of gut microbiome and the complex interplay of the metabolic enzymes and their diverse substrates, the traditional experimental methods have limited ability to identify the gut bacterial species involved in such biotransformation, and to study the bacterial species-metabolite interactions in gut. In this scenario, computational approaches such as machine learning-based tools presents unprecedented opportunities and ability to predict the gut bacteria and enzymes that can potentially metabolise a candidate drug. Here, we have reviewed the need to identify the gut microbiome-based metabolism of xenobiotics and have provided comprehensive information on the available methods, tools, and databases to address it along with their scope and limitations.
Collapse
Affiliation(s)
| | - Vineet K. Sharma
- MetaBioSys Lab, Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal, India
| |
Collapse
|
2
|
Huang J, Vong CM, Chen CLP, Zhou Y. Accurate and Efficient Large-Scale Multi-Label Learning With Reduced Feature Broad Learning System Using Label Correlation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:10240-10253. [PMID: 35436203 DOI: 10.1109/tnnls.2022.3165299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Multi-label learning for large-scale data is a grand challenge because of a large number of labels with a complex data structure. Hence, the existing large-scale multi-label methods either have unsatisfactory classification performance or are extremely time-consuming for training utilizing a massive amount of data. A broad learning system (BLS), a flat network with the advantages of succinct structures, is appropriate for addressing large-scale tasks. However, existing BLS models are not directly applicable for large-scale multi-label learning due to the large and complex label space. In this work, a novel multi-label classifier based on BLS (called BLS-MLL) is proposed with two new mechanisms: kernel-based feature reduction module and correlation-based label thresholding. The kernel-based feature reduction module contains three layers, namely, the feature mapping layer, enhancement nodes layer, and feature reduction layer. The feature mapping layer employs elastic network regularization to solve the randomness of features in order to improve performance. In the enhancement nodes layer, the kernel method is applied for high-dimensional nonlinear conversion to achieve high efficiency. The newly constructed feature reduction layer is used to further significantly improve both the training efficiency and accuracy when facing high-dimensionality with abundant or noisy information embedded in large-scale data. The correlation-based label thresholding enables BLS-MLL to generate a label-thresholding function for effective conversion of the final decision values to logical outputs, thus, improving the classification performance. Finally, experimental comparisons among six state-of-the-art multi-label classifiers on ten datasets demonstrate the effectiveness of the proposed BLS-MLL. The results of the classification performance show that BLS-MLL outperforms the compared algorithms in 86% of cases with better training efficiency in 90% of cases.
Collapse
|
3
|
Azad M, Moshkov M. Applications of Depth Minimization of Decision Trees Containing Hypotheses for Multiple-Value Decision Tables. ENTROPY (BASEL, SWITZERLAND) 2023; 25:e25040547. [PMID: 37190335 PMCID: PMC10137443 DOI: 10.3390/e25040547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 02/28/2023] [Accepted: 03/07/2023] [Indexed: 05/17/2023]
Abstract
In this research, we consider decision trees that incorporate standard queries with one feature per query as well as hypotheses consisting of all features' values. These decision trees are used to represent knowledge and are comparable to those investigated in exact learning, in which membership queries and equivalence queries are used. As an application, we look into the issue of creating decision trees for two cases: the sorting of a sequence that contains equal elements and multiple-value decision tables which are modified from UCI Machine Learning Repository. We contrast the efficiency of several forms of optimal (considering the parameter depth) decision trees with hypotheses for the aforementioned applications. We also investigate the efficiency of decision trees built by dynamic programming and by an entropy-based greedy method. We discovered that the greedy algorithm produces very similar results compared to the results of dynamic programming algorithms. Therefore, since the dynamic programming algorithms take a long time, we may readily apply the greedy algorithms.
Collapse
Affiliation(s)
- Mohammad Azad
- College of Computer and Information Sciences, Jouf University, Sakaka 72441, Saudi Arabia
| | - Mikhail Moshkov
- Computer, Electrical and Mathematical Sciences & Engineering Division and Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
| |
Collapse
|
4
|
Zhang Y, Lian H, Yang G, Zhao S, Ni P, Chen H, Li C. Inaccurate-Supervised Learning With Generative Adversarial Nets. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:1522-1536. [PMID: 34464286 DOI: 10.1109/tcyb.2021.3104848] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Inaccurate-supervised learning (ISL) is a weakly supervised learning framework for imprecise annotation, which is derived from some specific popular learning frameworks, mainly including partial label learning (PLL), partial multilabel learning (PML), and multiview PML (MVPML). While PLL, PML, and MVPML are each solved as independent models through different methods and no general framework can currently be applied to these frameworks, most existing methods for solving them were designed based on traditional machine-learning techniques, such as logistic regression, KNN, SVM, decision tree. Prior to this study, there was no single general framework that used adversarial networks to solve ISL problems. To narrow this gap, this study proposed an adversarial network structure to solve ISL problems, called ISL with generative adversarial nets (ISL-GANs). In ISL-GAN, fake samples, which are quite similar to real samples, gradually promote the Discriminator to disambiguate the noise labels of real samples. We also provide theoretical analyses for ISL-GAN in effectively handling ISL data. In this article, we propose a general framework to solve PLL, PML, and MVPML, while in the published conference version, we adopt the specific framework, which is a special case of the general one, to solve the PLL problem. Finally, the effectiveness is demonstrated through extensive experiments on various imprecise annotation learning tasks, including PLL, PML, and MVPML.
Collapse
|
5
|
Kumar S, Ahmadi N, Rastogi R. Multi-label learning with missing labels using sparse global structure for label-specific features. APPL INTELL 2023. [DOI: 10.1007/s10489-022-04439-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
6
|
Rastogi R, Mortaza S. Imbalance multi-label data learning with label specific features. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.09.085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
|
7
|
Han M, Wu H, Chen Z, Li M, Zhang X. A survey of multi-label classification based on supervised and semi-supervised learning. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01658-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
8
|
Dual projection learning with adaptive graph smoothing for multi-label classification. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04200-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
9
|
Saini K, Ramanathan V. Predicting odor from molecular structure: a multi-label classification approach. Sci Rep 2022; 12:13863. [PMID: 35974078 PMCID: PMC9381526 DOI: 10.1038/s41598-022-18086-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Accepted: 08/04/2022] [Indexed: 11/23/2022] Open
Abstract
Decoding the factors behind odor perception has long been a challenge in the field of human neuroscience, olfactory research, perfumery, psychology, biology and chemistry. The new wave of data-driven and machine learning approaches to predicting molecular properties are a growing area of research interest and provide for significant improvement over conventional statistical methods. We look at these approaches in the context of predicting molecular odor, specifically focusing on multi-label classification strategies employed for the same. Namely binary relevance, classifier chains, and random forests adapted to deal with such a task. This challenge, termed quantitative structure–odor relationship, remains an unsolved task in the field of sensory perception in machine learning, and we hope to emulate the results achieved in the field of vision and auditory perception in olfaction over time.
Collapse
Affiliation(s)
- Kushagra Saini
- Department of Chemical Engineering, Indian Institute of Technology (Banaras Hindu University, Varanasi, U.P., 221005, India
| | - Venkatnarayan Ramanathan
- Department of Chemistry, Indian Institute of Technology (Banaras Hindu University), Varanasi, U.P., 221005, India.
| |
Collapse
|
10
|
Wang Y, Guan Y, Wang B, Li X. Learning with partial multi-labeled data by leveraging low-rank constraint and decomposition. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03989-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
11
|
Chen Z, Li S, Ye L, Zhang H. Multi-label classification of legal text based on label embedding and capsule network. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03455-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
12
|
Multilabel Text Classification Algorithm Based on Fusion of Two-Stream Transformer. ELECTRONICS 2022. [DOI: 10.3390/electronics11142138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Existing multilabel text classification methods rely on a complex manual design to mine label correlation, which has the risk of overfitting and ignores the relationship between text and labels. To solve the above problems, this paper proposes a multilabel text classification algorithm based on a transformer encoder–decoder, which can adaptively extract the dependency relationship between different labels and text. First, text representation learning is carried out through word embedding and a bidirectional long short-term memory network. Second, the global relationship of the text is modeled by the transformer encoder, and then the multilabel query is adaptively learned by the transformer decoder. Last, a weighted fusion strategy under the supervision of multiple loss functions is proposed to further improve the classification performance. The experimental results on the AAPD and RCV1-V2 datasets show that compared with the existing methods, the proposed algorithm achieves better classification results. The optimal micro-F1 reaches 73.4% and 87.8%, respectively, demonstrating the effectiveness of the proposed algorithm.
Collapse
|
13
|
Discriminatory Label-specific Weights for Multi-label Learning with Missing Labels. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-10945-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
14
|
Kumar S, Rastogi R. Low rank label subspace transformation for multi-label learning with missing labels. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.03.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
15
|
Law A, Ghosh A. Multi-Label Classification Using Binary Tree of Classifiers. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2022. [DOI: 10.1109/tetci.2021.3075717] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Anwesha Law
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, West Bengal, India
| | - Ashish Ghosh
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, West Bengal, India
| |
Collapse
|
16
|
Abstract
AbstractSingle-label classification technology has difficulty meeting the needs of text classification, and multi-label text classification has become an important research issue in natural language processing (NLP). Extracting semantic features from different levels and granularities of text is a basic and key task in multi-label text classification research. A topic model is an effective method for the automatic organization and induction of text information. It can reveal the latent semantics of documents and analyze the topics contained in massive information. Therefore, this paper proposes a multi-label text classification method based on tALBERT-CNN: an LDA topic model and ALBERT model are used to obtain the topic vector and semantic context vector of each word (document), a certain fusion mechanism is adopted to obtain in-depth topic and semantic representations of the document, and the multi-label features of the text are extracted through the TextCNN model to train a multi-label classifier. The experimental results obtained on standard datasets show that the proposed method can extract multi-label features from documents, and its performance is better than that of the existing state-of-the-art multi-label text classification algorithms.
Collapse
|
17
|
Henrique Sousa Mello L, Varejão FM, Rodrigues AL. An experimental framework for evaluating loss minimization in multi‐label classification via stochastic process. Comput Intell 2021. [DOI: 10.1111/coin.12491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | - Flávio M. Varejão
- Department of Informatics Federal University of Espírito Santo Vitoria Brazil
| | | |
Collapse
|
18
|
Kang HS, Jun CH. Mutual information-based multi-output tree learning algorithm. INTELL DATA ANAL 2021. [DOI: 10.3233/ida-205367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A tree model with low time complexity can support the application of artificial intelligence to industrial systems. Variable selection based tree learning algorithms are more time efficient than existing Classification and Regression Tree (CART) algorithms. To our best knowledge, there is no attempt to deal with categorical input variable in variable selection based multi-output tree learning. Also, in the case of multi-output regression tree, a conventional variable selection based algorithm is not suitable to large datasets. We propose a mutual information-based multi-output tree learning algorithm that consists of variable selection and split optimization. The proposed method discretizes each variable based on k-means into 2–4 clusters and selects the variable for splitting based on the discretized variables using mutual information. This variable selection component has relatively low time complexity and can be applied regardless of output dimension and types. The proposed split optimization component is more efficient than an exhaustive search. The performance of the proposed tree learning algorithm is similar to or better than that of a multi-output version of CART algorithm on a specific dataset. In addition, with a large dataset, the time complexity of the proposed algorithm is significantly reduced compared to a CART algorithm.
Collapse
Affiliation(s)
- Hyun-Seok Kang
- Technical Research Laboratories, POSCO, Pohang, Korea
- Graduate Institute of Ferrous Technology, Pohang University of Science and Technology (POSTECH), Pohang, Korea
| | - Chi-Hyuck Jun
- Department of Industrial and Management Engineering, Pohang University of Science and Technology (POSTECH), Pohang, Korea
| |
Collapse
|
19
|
Tan Y, Sun D, Shi Y, Gao L, Gao Q, Lu Y. Bi-directional mapping for multi-label learning of label-specific features. APPL INTELL 2021. [DOI: 10.1007/s10489-021-02868-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
20
|
Villa‐Blanco C, Larrañaga P, Bielza C. Multidimensional continuous time Bayesian network classifiers. INT J INTELL SYST 2021. [DOI: 10.1002/int.22611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Carlos Villa‐Blanco
- Computational Intelligence Group, Departamento de Inteligencia Artificial Universidad Politécnica de Madrid Boadilla del Monte Madrid Spain
| | - Pedro Larrañaga
- Computational Intelligence Group, Departamento de Inteligencia Artificial Universidad Politécnica de Madrid Boadilla del Monte Madrid Spain
| | - Concha Bielza
- Computational Intelligence Group, Departamento de Inteligencia Artificial Universidad Politécnica de Madrid Boadilla del Monte Madrid Spain
| |
Collapse
|
21
|
Aljedani N, Alotaibi R, Taileb M. HMATC: Hierarchical multi-label Arabic text classification model using machine learning. EGYPTIAN INFORMATICS JOURNAL 2021. [DOI: 10.1016/j.eij.2020.08.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
22
|
Zhang Q, Zhong G, Dong J. A Graph-based Semi-supervised Multi-label Learning Method Based on Label Correlation Consistency. Cognit Comput 2021. [DOI: 10.1007/s12559-021-09912-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
23
|
Roseberry M, Krawczyk B, Djenouri Y, Cano A. Self-adjusting k nearest neighbors for continual learning from multi-label drifting data streams. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.02.032] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
24
|
Xia Y, Chen K, Yang Y. Multi-label classification with weighted classifier selection and stacked ensemble. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2020.06.017] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
25
|
A novel binary many-objective feature selection algorithm for multi-label data classification. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-021-01291-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
26
|
|
27
|
Pereira RB, Plastino A, Zadrozny B, Merschmann LH. A lazy feature selection method for multi-label classification. INTELL DATA ANAL 2021. [DOI: 10.3233/ida-194878] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In many important application domains, such as text categorization, biomolecular analysis, scene or video classification and medical diagnosis, instances are naturally associated with more than one class label, giving rise to multi-label classification problems. This has led, in recent years, to a substantial amount of research in multi-label classification. More specifically, feature selection methods have been developed to allow the identification of relevant and informative features for multi-label classification. This work presents a new feature selection method based on the lazy feature selection paradigm and specific for the multi-label context. Experimental results show that the proposed technique is competitive when compared to multi-label feature selection techniques currently used in the literature, and is clearly more scalable, in a scenario where there is an increasing amount of data.
Collapse
|
28
|
Liu N, Wang Q, Ren J. Label-Embedding Bi-directional Attentive Model for Multi-label Text Classification. Neural Process Lett 2021. [DOI: 10.1007/s11063-020-10411-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
29
|
Chen YN, Weng W, Wu SX, Chen BH, Fan YL, Liu JH. An efficient stacking model with label selection for multi-label classification. APPL INTELL 2021. [DOI: 10.1007/s10489-020-01807-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
30
|
|
31
|
|
32
|
Dong H, Sun J, Sun X, Ding R. A many-objective feature selection for multi-label classification. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106456] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
33
|
Yao Y, Li Y, Ye Y, Li X. MLCE: A Multi-Label Crotch Ensemble Method for Multi-Label Classification. INT J PATTERN RECOGN 2020. [DOI: 10.1142/s021800142151006x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Multi-label classification addresses the problem that each instance is associated with multiple labels simultaneously. In this paper, we propose a multi-label crotch ensemble (MLCE) model for multi-label classification, which takes label correlations into consideration. In MLCE, a multi-label cluster tree is first constructed. Then, we incorporate all multi-label crotch predictors of the tree into a classifier, where the multi-label crotch predictor is the crotch formed by an inner node of the tree and its children. Finally, a flexible weighted voting scheme is designed to produce the classification output. We perform experiments on 11 benchmark datasets. Experimental results clearly demonstrate the MLCE significantly outperforms six well-established multi-label classification approaches, in terms of the widely used evaluation metrics.
Collapse
Affiliation(s)
- Yuan Yao
- Department of Computer Science, Harbin Institute of Technology, Shenzhen 518055, P. R. China
| | - Yan Li
- School of Computer Engineering, Shenzhen Polytechnic, Shenzhen 518055, P. R. China
| | - Yunming Ye
- Department of Computer Science, Harbin Institute of Technology, Shenzhen 518055, P. R. China
| | - Xutao Li
- Department of Computer Science, Harbin Institute of Technology, Shenzhen 518055, P. R. China
| |
Collapse
|
34
|
Yang H, Jiao SJ, Yin FD. Multilabel Image Classification Based Fresh Concrete Mix Proportion Monitoring Using Improved Convolutional Neural Network. SENSORS (BASEL, SWITZERLAND) 2020; 20:E4638. [PMID: 32824719 PMCID: PMC7472402 DOI: 10.3390/s20164638] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 08/13/2020] [Accepted: 08/14/2020] [Indexed: 11/16/2022]
Abstract
Proper and accurate mix proportion is deemed to be crucial for the concrete in service to implement its structural functions in a specific environment and structure. Neither existing testing methods nor previous studies have, to date, addressed the problem of real-time and full-scale monitoring of fresh concrete mix proportion during manufacturing. Green manufacturing and safety construction are hindered by such defects. In this study, a state-of-the-art method based on improved convolutional neural network multilabel image classification is presented for mix proportion monitoring. Elaborately planned, uniformly distributed, widely covered and high-quality images of concrete mixtures were collected as dataset during experiments. Four convolutional neural networks were improved or fine-tuned based on two solutions for multilabel image classification problems, since original networks are tailored for single-label multiclassification tasks, but mix proportions are determined by multiple parameters. Various metrices for effectiveness evaluation of training and testing all indicated that four improved network models showed outstanding learning and generalization ability during training and testing. The best-performing one was embedded into executable application and equipped with hardware facilities to establish fresh concrete mix proportion monitoring system. Such system was deployed to terminals and united with mechanical and weighing sensors to establish integrated intelligent sensing system. Fresh concrete mix proportion real-time and full-scale monitoring and inaccurate mix proportion sensing and warning could be achieved simply by taking pictures and feeding pictures into such sensing system instead of conducting experiments in laboratory after specimen retention.
Collapse
Affiliation(s)
- Han Yang
- Department of Civil Engineering, College of Engineering, Ocean University of China, Qingdao 266100, China; (S.-J.J.); (F.-D.Y.)
| | - Shuang-Jian Jiao
- Department of Civil Engineering, College of Engineering, Ocean University of China, Qingdao 266100, China; (S.-J.J.); (F.-D.Y.)
| | - Feng-De Yin
- Department of Civil Engineering, College of Engineering, Ocean University of China, Qingdao 266100, China; (S.-J.J.); (F.-D.Y.)
| |
Collapse
|
35
|
Shi Y, Yang W, Thung KH, Wang H, Gao Y, Pan Y, Zhang L, Shen D. Learning-Based Computer-Aided Prescription Model for Parkinson's Disease: A Data-Driven Perspective. IEEE J Biomed Health Inform 2020; 25:3258-3269. [PMID: 32750966 DOI: 10.1109/jbhi.2020.3010946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
In this article, we study a novel problem: "automatic prescription recommendation for PD patients." To realize this goal, we first build a dataset by collecting 1) symptoms of PD patients, and 2) their prescription drug provided by neurologists. Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug. Finally, for the new coming patients, we could recommend (predict) suitable prescription drug on their observed symptoms by our prescription model. From the methodology part, our proposed model, namely Prescription viA Learning lAtent Symptoms (PALAS), could recommend prescription using the multi-modality representation of the data. In PALAS, a latent symptom space is learned to better model the relationship between symptoms and prescription drug, as there is a large semantic gap between them. Moreover, we present an efficient alternating optimization method for PALAS. We evaluated our method using the data collected from 136 PD patients at Nanjing Brain Hospital, which can be regarded as a large dataset in PD research community. The experimental results demonstrate the effectiveness and clinical potential of our method in this recommendation task, if compared with other competing methods.
Collapse
|
36
|
Improving the $$\epsilon $$-approximate algorithm for Probabilistic Classifier Chains. Knowl Inf Syst 2020. [DOI: 10.1007/s10115-020-01436-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
37
|
Wang X, Yang Y, Xu Y, Chen Q, Wang H, Gao H. Predicting hypoglycemic drugs of type 2 diabetes based on weighted rank support vector machine. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.105868] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
38
|
A multi-label text classification method via dynamic semantic representation model and deep neural network. APPL INTELL 2020. [DOI: 10.1007/s10489-020-01680-w] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
39
|
Weng W, Chen YN, Chen CL, Wu SX, Liu JH. Non-sparse label specific features selection for multi-label classification. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.10.016] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
40
|
Huang SJ, Gao W, Zhou ZH. Fast Multi-Instance Multi-Label Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2019; 41:2614-2627. [PMID: 30072313 DOI: 10.1109/tpami.2018.2861732] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In many real-world tasks, particularly those involving data objects with complicated semantics such as images and texts, one object can be represented by multiple instances and simultaneously be associated with multiple labels. Such tasks can be formulated as multi-instance multi-label learning (MIML) problems, and have been extensively studied during the past few years. Existing MIML approaches have been found useful in many applications; however, most of them can only handle moderate-sized data. To efficiently handle large data sets, in this paper we propose the MIMLfast approach, which first constructs a low-dimensional subspace shared by all labels, and then trains label specific linear models to optimize approximated ranking loss via stochastic gradient descent. Although the MIML problem is complicated, MIMLfast is able to achieve excellent performance by exploiting label relations with shared space and discovering sub-concepts for complicated labels. Experiments show that the performance of MIMLfast is highly competitive to state-of-the-art techniques, whereas its time cost is much less. Moreover, our approach is able to identify the most representative instance for each label, and thus providing a chance to understand the relation between input patterns and output label semantics.
Collapse
|
41
|
Ma J, Chow TWS. Topic-Based Algorithm for Multilabel Learning With Missing Labels. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:2138-2152. [PMID: 30442616 DOI: 10.1109/tnnls.2018.2874434] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In multilabel learning (MLL), each instance can be assigned by several concepts simultaneously from a class dictionary. Usually, labels in the class dictionary have semantic correlations and semantic hierarchy. Instances can be categorized into different topics. Each topic has its own label candidates, and some topics have overlapped label candidates. In this paper, we propose a novel MLL method to deal with missing labels. The proposed algorithm can recover the label matrix according to local, topic-wise, and global semantic properties. Specifically, in the global level, label consistency, label-wise semantic correlations, and semantic hierarchy are exploited; in the local level, label importance and instance-wise semantic correlations in each topic are extracted; and in the topic level, label importance similarities and instance-wise semantic similarities between topics are mined. The experimental results on five image data sets in different applications demonstrate the effectiveness of the proposed approach.
Collapse
|
42
|
Azad-Manjiri M, Amiri A, Saleh Sedghpour A. ML-SLSTSVM: a new structural least square twin support vector machine for multi-label learning. Pattern Anal Appl 2019. [DOI: 10.1007/s10044-019-00779-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
43
|
Moral-García S, Mantas CJ, Castellano JG, Abellán J. Ensemble of classifier chains and Credal C4.5 for solving multi-label classification. PROGRESS IN ARTIFICIAL INTELLIGENCE 2019. [DOI: 10.1007/s13748-018-00171-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
44
|
Jiménez J, Sabbadin D, Cuzzolin A, Martínez-Rosell G, Gora J, Manchester J, Duca J, De Fabritiis G. PathwayMap: Molecular Pathway Association with Self-Normalizing Neural Networks. J Chem Inf Model 2019; 59:1172-1181. [PMID: 30586501 DOI: 10.1021/acs.jcim.8b00711] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Drug discovery suffers from high attrition because compounds initially deemed as promising can later show ineffectiveness or toxicity resulting from a poor understanding of their activity profile. In this work, we describe a deep self-normalizing neural network model for the prediction of molecular pathway association and evaluate its performance, showing an AUC ranging from 0.69 to 0.91 on a set of compounds extracted from ChEMBL and from 0.81 to 0.83 on an external data set provided by Novartis. We finally discuss the applicability of the proposed model in the domain of lead discovery. A usable application is available via PlayMolecule.org .
Collapse
Affiliation(s)
- José Jiménez
- Computational Science Laboratory , Universitat Pompeu Fabra , Barcelona Biomedical Research Park (PRBB), Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain
| | - Davide Sabbadin
- Computational Science Laboratory , Universitat Pompeu Fabra , Barcelona Biomedical Research Park (PRBB), Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain
| | - Alberto Cuzzolin
- Acellera , Barcelona Biomedical Research Park (PRBB) , Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain
| | - Gerard Martínez-Rosell
- Acellera , Barcelona Biomedical Research Park (PRBB) , Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain
| | - Jacob Gora
- Global Discovery Chemistry , Novartis Institutes for Biomedical Research , 250 Massachusetts Avenue , Cambridge , Massachusetts 02139 , United States.,Department of Mathematics and Computer Science , Freie Universität Berlin , Takustr. 9 , 14195 Berlin , Germany
| | - John Manchester
- Global Discovery Chemistry , Novartis Institutes for Biomedical Research , 250 Massachusetts Avenue , Cambridge , Massachusetts 02139 , United States
| | - José Duca
- Global Discovery Chemistry , Novartis Institutes for Biomedical Research , 250 Massachusetts Avenue , Cambridge , Massachusetts 02139 , United States
| | - Gianni De Fabritiis
- Computational Science Laboratory , Universitat Pompeu Fabra , Barcelona Biomedical Research Park (PRBB), Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain.,Acellera , Barcelona Biomedical Research Park (PRBB) , Carrer del Dr. Aiguader 88 , 08003 , Barcelona , Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA) , Passeig Lluis Companys 23 , 08010 Barcelona , Spain
| |
Collapse
|
45
|
A snapshot on nonstandard supervised learning problems: taxonomy, relationships, problem transformations and algorithm adaptations. PROGRESS IN ARTIFICIAL INTELLIGENCE 2018. [DOI: 10.1007/s13748-018-00167-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
46
|
Liu Y, Luo Y, Zhu Y, Liu Y, Li X. Secure multi-label data classification in cloud by additionally homomorphic encryption. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.07.054] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
47
|
Sun Z, Guo Z, Liu C, Jiang M, Wang X. Fast multi-label SVM training based on approximate extreme points. INTELL DATA ANAL 2018. [DOI: 10.3233/ida-173525] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Zhongwei Sun
- Department of Computer Science, Qingdao University of Technology, Qingdao, Shandong, China
- Science and Information College, Qingdao Agricultural University, Qingdao, Shandong, China
- College of Information Science and Engineering, Ocean University of China, Qingdao, Shandong, China
| | - Zhongwen Guo
- College of Information Science and Engineering, Ocean University of China, Qingdao, Shandong, China
| | - Chao Liu
- College of Information Science and Engineering, Ocean University of China, Qingdao, Shandong, China
| | - Mingxing Jiang
- College of Information Science and Engineering, Ocean University of China, Qingdao, Shandong, China
| | - Xi Wang
- College of Information Science and Engineering, Ocean University of China, Qingdao, Shandong, China
| |
Collapse
|
48
|
Huang M, Han H, Wang H, Li L, Zhang Y, Bhatti UA. A Clinical Decision Support Framework for Heterogeneous Data Sources. IEEE J Biomed Health Inform 2018; 22:1824-1833. [PMID: 29994279 DOI: 10.1109/jbhi.2018.2846626] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
To keep pace with the developments in medical informatics, health medical data is being collected continually. But, owing to the diversity of its categories and sources, medical data has become so complicated in many hospitals that it now needs a clinical decision support (CDS) system for its management. To effectively utilize the accumulating health data, we propose a CDS framework that can integrate heterogeneous health data from different sources such as laboratory test results, basic information of patients, and health records into a consolidated representation of features of all patients. Using the electronic health medical data so created, multilabel classification was employed to recommend a list of diseases and thus assist physicians in diagnosing or treating their patients' health issues more efficiently. Once the physician diagnoses the disease of a patient, the next step is to consider the likely complications of that disease, which can lead to more diseases. Previous studies reveal that correlations do exist among some diseases. Considering these correlations, a k-nearest neighbors algorithm is improved for multilabel learning by using correlations among labels (CML-kNN). The CML- kNN algorithm first exploits the dependence between every two labels to update the origin label matrix and then performs multilabel learning to estimate the probabilities of labels by using the integrated features. Finally, it recommends the top N diseases to the physicians. Experimental results on real health medical data establish the effectiveness and practicability of the proposed CDS framework.
Collapse
|
49
|
|
50
|
Humpire-Mamani GE, Setio AAA, van Ginneken B, Jacobs C. Efficient organ localization using multi-label convolutional neural networks in thorax-abdomen CT scans. Phys Med Biol 2018; 63:085003. [PMID: 29512516 DOI: 10.1088/1361-6560/aab4b3] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Automatic localization of organs and other structures in medical images is an important preprocessing step that can improve and speed up other algorithms such as organ segmentation, lesion detection, and registration. This work presents an efficient method for simultaneous localization of multiple structures in 3D thorax-abdomen CT scans. Our approach predicts the location of multiple structures using a single multi-label convolutional neural network for each orthogonal view. Each network takes extra slices around the current slice as input to provide extra context. A sigmoid layer is used to perform multi-label classification. The output of the three networks is subsequently combined to compute a 3D bounding box for each structure. We used our approach to locate 11 structures of interest. The neural network was trained and evaluated on a large set of 1884 thorax-abdomen CT scans from patients undergoing oncological workup. Reference bounding boxes were annotated by human observers. The performance of our method was evaluated by computing the wall distance to the reference bounding boxes. The bounding boxes annotated by the first human observer were used as the reference standard for the test set. Using the best configuration, we obtained an average wall distance of [Formula: see text] mm in the test set. The second human observer achieved [Formula: see text] mm. For all structures, the results were better than those reported in previously published studies. In conclusion, we proposed an efficient method for the accurate localization of multiple organs. Our method uses multiple slices as input to provide more context around the slice under analysis, and we have shown that this improves performance. This method can easily be adapted to handle more organs.
Collapse
Affiliation(s)
- Gabriel Efrain Humpire-Mamani
- Department of Radiology and Nuclear Medicine, Diagnostic Image Analysis Group, Radboud University Medical Center, Nijmegen, Netherlands
| | | | | | | |
Collapse
|