Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen Z, Li J, Wei L. A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue. Artif Intell Med 2007;41:161-75. [PMID: 17851055 DOI: 10.1016/j.artmed.2007.07.008] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Received: 11/30/2006] [Revised: 07/31/2007] [Accepted: 07/31/2007] [Indexed: 10/22/2022]

Cited by Other Article(s)

Jia X, Wang T, Zhu H. Advancing Computational Toxicology by Interpretable Machine Learning. Environ Sci Technol 2023;57:17690-17706. [PMID: 37224004 PMCID: PMC10666545 DOI: 10.1021/acs.est.3c00653] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Received: 01/25/2023] [Revised: 05/05/2023] [Accepted: 05/05/2023] [Indexed: 05/26/2023]

He J, Li J, Leung K. Dynamic structural analysis-based epitope prediction of Exendin-4 in aqueous solution. Phys Rev E 2023;108:024403. [PMID: 37723773 DOI: 10.1103/physreve.108.024403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2023] [Accepted: 07/22/2023] [Indexed: 09/20/2023]

Huang P, Feng Z, Shu X, Wu A, Wang Z, Hu T, Cao Y, Tu Y, Li Z. A bibliometric and visual analysis of publications on artificial intelligence in colorectal cancer (2002-2022). Front Oncol 2023;13:1077539. [PMID: 36824138 PMCID: PMC9941644 DOI: 10.3389/fonc.2023.1077539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2022] [Accepted: 01/27/2023] [Indexed: 02/10/2023]

Abstract

Background

Colorectal cancer (CRC) has the third-highest incidence and second-highest mortality rate of all cancers worldwide. Early diagnosis and screening of CRC have been the focus of research in this field. With the continuous development of artificial intelligence (AI) technology, AI has advantages in many aspects of CRC, such as adenoma screening, genetic testing, and prediction of tumor metastasis.

Objective

This study uses bibliometrics to analyze research in AI in CRC, summarize the field's history and current status of research, and predict future research directions.

Method

We searched the SCIE database for all literature on CRC and AI. The documents span the period 2002-2022. We used bibliometrics to analyze the data of these papers, such as authors, countries, institutions, and references. Co-authorship, co-citation, and co-occurrence analysis were the main methods of analysis. CiteSpace, VOSviewer, and SCImago Graphica were used to visualize the results.

Result

This study selected 1,531 articles on AI in CRC. China published the most articles in this field (580). The U.S. had the highest-quality publications as measured by citations, with an average of 46.13 citations per article. Mori Y and Ding K were the two authors with the highest number of articles. Scientific Reports, Cancers, and Frontiers in Oncology are the journals that have published the most articles in this field. Institutions from China occupy the top 9 positions among the institutions with the most publications. We found that research on AI in this field mainly focuses on colonoscopy-assisted diagnosis, imaging histology, and pathology examination.

Conclusion

AI in CRC is currently in the development stage with good prospects. AI is currently widely used in colonoscopy, imageomics, and pathology. However, the scope of AI applications is still limited, and there is a lack of inter-institutional collaboration. The pervasiveness of AI technology is the main direction of future development in this field.
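
As an illustration of the co-occurrence analysis named in the Method paragraph of this abstract, the following minimal Python sketch counts how often two keywords appear on the same paper. The keyword lists are hypothetical toy data, not records from the study, and the function name `cooccurrence_counts` is an assumption made for this example. Dedicated tools such as CiteSpace and VOSviewer do far more than this.

```python
from collections import Counter
from itertools import combinations

# Hypothetical keyword lists, one per paper (toy data, not from the study).
records = [
    ["colorectal cancer", "deep learning", "colonoscopy"],
    ["colorectal cancer", "radiomics", "deep learning"],
    ["colonoscopy", "deep learning"],
]


def cooccurrence_counts(keyword_lists):
    """Count how many papers mention each unordered pair of keywords."""
    counts = Counter()
    for keywords in keyword_lists:
        # Deduplicate and sort so each pair has one canonical ordering.
        for pair in combinations(sorted(set(keywords)), 2):
            counts[pair] += 1
    return counts


pair_counts = cooccurrence_counts(records)
for pair, n in pair_counts.most_common():
    print(pair, n)
```

Pairs with high counts would become the heavy edges of a keyword co-occurrence map. Co-authorship counts can be sketched the same way by passing author lists instead of keyword lists.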

Affiliation(s)

Pan Huang: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Zongfeng Feng: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Xufeng Shu: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Ahao Wu: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Zhonghao Wang: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Tengcheng Hu: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Yi Cao: Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Yi Tu (corresponding author): Department of Pathology, The First Affiliated Hospital of Nanchang University, Nanchang, China
Zhengrong Li (corresponding author): Department of General Surgery, First Affiliated Hospital of Nanchang University, Nanchang, China; Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China

Human-in-the-loop machine learning: a state of the art. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10246-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 11/25/2022]

Lombardo T, Duquesnoy M, El-Bouysidy H, Årén F, Gallo-Bueno A, Jørgensen PB, Bhowmik A, Demortière A, Ayerbe E, Alcaide F, Reynaud M, Carrasco J, Grimaud A, Zhang C, Vegge T, Johansson P, Franco AA. Artificial Intelligence Applied to Battery Research: Hype or Reality? Chem Rev 2021;122:10899-10969. [PMID: 34529918 PMCID: PMC9227745 DOI: 10.1021/acs.chemrev.1c00108] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Indexed: 02/01/2023]

Affiliation(s)

Teo Lombardo: Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
Marc Duquesnoy: Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
Hassna El-Bouysidy: Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
Fabian Årén: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
Alfonso Gallo-Bueno: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
Peter Bjørn Jørgensen: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
Arghya Bhowmik: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
Arnaud Demortière: Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
Elixabete Ayerbe: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; CIDETEC, Basque Research and Technology Alliance (BRTA), Po. Miramón 196, 20014 Donostia-San Sebastián, Spain
Francisco Alcaide: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; CIDETEC, Basque Research and Technology Alliance (BRTA), Po. Miramón 196, 20014 Donostia-San Sebastián, Spain
Marine Reynaud: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
Javier Carrasco: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
Alexis Grimaud: Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; UMR CNRS 8260 "Chimie du Solide et Energie", Collège de France, 11 Place Marcelin Berthelot, 75231 Paris Cedex 05, France; Sorbonne Universités - UPMC Univ Paris 06, 4 Place Jussieu, F-75005 Paris, France
Chao Zhang: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Chemistry - Ångström Laboratory, Box 538, 75121 Uppsala, Sweden
Tejs Vegge: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
Patrik Johansson: ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
Alejandro A Franco: Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France; Institut Universitaire de France, 103 Boulevard Saint Michel, 75005 Paris, France

Xue H, Song Y, Xu HM. Multiple indefinite kernel learning for feature selection. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2019.105272] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 10/25/2022]

Yan M, Wang X, Wang B, Chang M, Muhammad I. Bearing remaining useful life prediction using support vector machine and hybrid degradation tracking model. ISA Trans 2020;98:471-482. [PMID: 31492470 DOI: 10.1016/j.isatra.2019.08.058] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Received: 03/18/2019] [Revised: 08/24/2019] [Accepted: 08/28/2019] [Indexed: 06/10/2023]

Abstract

The rolling element bearing is one of the critical components in rotating machines, and its running state determines the machinery's Remaining Useful Life (RUL). Estimating impending failure and predicting the RUL of bearings helps schedule maintenance and avoid abrupt shutdowns. This paper presents a novel method for bearing RUL prediction that evaluates the degradation stage of bearings through dimensionless measurements and obtains the optimal RUL prediction through a hybrid degradation tracing model in the degradation stage. Two new measurements reflect the vibration intensity of bearings relative to the normal vibration value. They can eliminate individual differences between bearings, improve sensitivity to incipient bearing defects, and reduce fluctuation. Moreover, they help detect the time to start prediction and set a dimensionless failure threshold. An SVM classifier is used to assess the degradation stage of a bearing; it shows high classification accuracy because of its excellent generalization ability and mathematical foundation. As input, the fitted measurements based on the generalized degradation model are used to train the SVM classifier. As output, five degradation stages are defined. In the prediction process, however, actual measurements are used as inputs. According to the classification results, a hybrid degradation tracing model is used to obtain the optimal RUL prediction by tracking the degradation process of bearings. The proposed method is validated on the public IMS and PRONOSTIA bearing datasets, and its performance is compared with other methods on the PRONOSTIA bearing datasets. The results show that the proposed approach is an effective way to predict the RUL of bearings within the prescribed error range. Because the proposed measurements are dimensionless, this method can be applied under different operating conditions.

Frey LJ, Talbert DA. Artificial Intelligence Pipeline to Bridge the Gap between Bench Researchers and Clinical Researchers in Precision Medicine. Med One 2020;5:10.20900/mo20200001. [PMID: 33511289 PMCID: PMC7839064 DOI: 10.20900/mo20200001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]

Non-convex approximation based l0-norm multiple indefinite kernel feature selection. Appl Intell 2020. [DOI: 10.1007/s10489-018-01407-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/26/2022]

Bremer Hinckel BC, Marlais T, Airs S, Bhattacharyya T, Imamura H, Dujardin JC, El-Safi S, Singh OP, Sundar S, Falconar AK, Andersson B, Litvinov S, Miles MA, Mertens P. Refining wet lab experiments with in silico searches: A rational quest for diagnostic peptides in visceral leishmaniasis. PLoS Negl Trop Dis 2019;13:e0007353. [PMID: 31059497 PMCID: PMC6522066 DOI: 10.1371/journal.pntd.0007353] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 06/28/2018] [Revised: 05/16/2019] [Accepted: 04/01/2019] [Indexed: 11/19/2022]

Abstract

Background

The search for diagnostic biomarkers has been profiting from a growing number of high quality sequenced genomes and freely available bioinformatic tools. These can be combined with wet lab experiments for a rational search. Improved, point-of-care diagnostic tests for visceral leishmaniasis (VL), early case detection and surveillance are required. Previous investigations demonstrated the potential of IgG1 as a biomarker for monitoring clinical status in rapid diagnostic tests (RDTs), although using a crude lysate antigen (CLA) as capturing antigen. Replacing the CLA by specific antigens would lead to more robust RDTs.

Methodology

Immunoblots revealed L. donovani protein bands detected by IgG1 from VL patients. Upon confident identification of these antigens by mass spectrometry (MS), we searched for evidence of constitutive protein expression and presence of antigenic domains or high accessibility to B-cells. Selected candidates had their linear epitopes mapped with in silico algorithms. Multiple high-scoring predicted epitopes from the shortlisted proteins were screened in peptide arrays. The most promising candidate was tested in RDT prototypes using VL and nonendemic healthy control (NEHC) patient sera.

Results

Over 90% of the proteins identified from the immunoblots did not satisfy the selection criteria and were excluded from the downstream epitope mapping. Screening of predicted epitope peptides from the shortlisted proteins identified the most reactive, for which the sensitivity for IgG1 was 84% (95% CI 60-97%) with Sudanese VL sera on RDT prototypes. None of the sera from NEHCs were positive.

Conclusion

We employed in silico searches to reduce drastically the output of wet lab experiments, focusing on promising candidates containing selected protein features. By predicting epitopes in silico we screened a large number of peptides using arrays, identifying the most promising one, for which IgG1 sensitivity and specificity, with limited sample size, supported this proof of concept strategy for diagnostics discovery, which can be applied to the development of more robust IgG1 RDTs for monitoring clinical status in VL.

Visceral leishmaniasis (VL) is a neglected tropical disease caused by protozoan parasites of the Leishmania donovani complex. Without treatment, VL is fatal. Although diagnostic techniques, mainly based on the detection of anti-Leishmania antibodies are available, invasive procedures such as microscopy from spleen or bone marrow aspirates are still required for the diagnosis of seronegative VL suspects, for the detection of recurrent cases and to confirm cure after successful treatment. Previous investigations showed the potential of IgG1 as a biomarker of post-chemotherapeutic relapse for VL in rapid diagnostic tests (RDTs) sensitised with crude lysate antigen (CLA). Here we employed in silico tools to search for desired protein features in a large number of L. donovani antigens detected by human IgG1 in western blots. We then employed prediction algorithms to profile epitopes from the shortlisted proteins. We screened a panel of high-scoring peptides in a high-throughput manner using arrays, with low reagent consumption. The most reactive peptide was adapted to RDTs, showing promising results of both sensitivity and specificity. This peptide has the potential of replacing the CLAs in IgG1 RDTs. Thus we believe that in silico tools can be used to optimise wet lab experiments for a rational search of biomarkers.

Maniruzzaman M, Jahanur Rahman M, Ahammed B, Abedin MM, Suri HS, Biswas M, El-Baz A, Bangeas P, Tsoulfas G, Suri JS. Statistical characterization and classification of colon microarray gene expression data using multiple machine learning paradigms. Comput Methods Programs Biomed 2019;176:173-193. [PMID: 31200905 DOI: 10.1016/j.cmpb.2019.04.008] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Received: 12/09/2018] [Revised: 02/28/2019] [Accepted: 04/08/2019] [Indexed: 02/08/2023]

Gene selection from large-scale gene expression data based on fuzzy interactive multi-objective binary optimization for medical diagnosis. Biocybern Biomed Eng 2018. [DOI: 10.1016/j.bbe.2018.02.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Indexed: 11/20/2022]

Banjar H, Adelson D, Brown F, Chaudhri N. Intelligent Techniques Using Molecular Data Analysis in Leukaemia: An Opportunity for Personalized Medicine Support System. Biomed Res Int 2017;2017:3587309. [PMID: 28812013 PMCID: PMC5547708 DOI: 10.1155/2017/3587309] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 04/05/2017] [Revised: 06/12/2017] [Accepted: 06/15/2017] [Indexed: 12/05/2022]

Kuo RJ, Huang SBL, Zulvia FE, Liao TW. Artificial bee colony-based support vector machines with feature selection and parameter optimization for rule extraction. Knowl Inf Syst 2017. [DOI: 10.1007/s10115-017-1083-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Indexed: 10/19/2022]

Du W, Cao Z, Song T, Li Y, Liang Y. A feature selection method based on multiple kernel learning with expression profiles of different types. BioData Min 2017;10:4. [PMID: 28184251 PMCID: PMC5288949 DOI: 10.1186/s13040-017-0124-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 11/05/2015] [Accepted: 01/11/2017] [Indexed: 11/28/2022]

Abstract

Background

With the development of high-throughput technology, researchers can acquire large amounts of expression data of different types from several public databases. Because most of these datasets have a small number of samples and hundreds or thousands of features, extracting informative features from expression data effectively and robustly with feature selection techniques is both challenging and crucial. So far, many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods are limited to measuring performance on a single type of expression data by classification accuracy or error rate.

Results

In this article, we propose a hybrid feature selection method based on Multiple Kernel Learning (MKL) and evaluate the performance on expression datasets of different types. Firstly, the relevance between features and classifying samples is measured by using the optimizing function of MKL. In this step, an iterative gradient descent process is used to perform the optimization both on the parameters of Support Vector Machine (SVM) and kernel confidence. Then, a set of relevant features is selected by sorting the optimizing function of each feature. Furthermore, we apply an embedded scheme of forward selection to detect the compact feature subsets from the relevant feature set.

Conclusions

We not only compare the classification accuracy with other methods, but also compare the stability, similarity and consistency of different algorithms. The proposed method has a satisfactory capability of feature selection for analysing expression datasets of different types using different performance measurements.
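
The "score each feature with a kernel-based criterion, then sort" step described in the Results paragraph can be illustrated with a much simpler stand-in. The sketch below is not the authors' MKL optimization: it swaps in kernel-target alignment with a centered linear kernel per feature, and the data, labels, and function names are hypothetical choices made only for this example.

```python
import math

# Hypothetical toy data: 6 samples x 3 features, labels in {-1, +1}.
X = [
    [0.5, 2.0, 1.0],
    [0.4, 1.8, -0.5],
    [0.6, 2.2, 0.8],
    [0.5, -2.0, 0.2],
    [0.4, -1.8, -0.9],
    [0.6, -2.2, 0.4],
]
y = [1, 1, 1, -1, -1, -1]


def linear_kernel(values):
    """Gram matrix of a single mean-centered feature under a linear kernel."""
    mean = sum(values) / len(values)
    centered = [v - mean for v in values]
    return [[a * b for b in centered] for a in centered]


def alignment(K, labels):
    """Kernel-target alignment <K, yy^T>_F / (||K||_F * ||yy^T||_F)."""
    n = len(labels)
    num = sum(K[i][j] * labels[i] * labels[j] for i in range(n) for j in range(n))
    k_norm = math.sqrt(sum(K[i][j] ** 2 for i in range(n) for j in range(n)))
    if k_norm == 0.0:
        return 0.0
    return num / (k_norm * n)  # ||yy^T||_F equals n for labels in {-1, +1}


def rank_features(X, labels):
    """Return feature indices sorted from most to least label-aligned."""
    scores = []
    for f in range(len(X[0])):
        column = [row[f] for row in X]
        scores.append((alignment(linear_kernel(column), labels), f))
    return [f for _, f in sorted(scores, key=lambda t: (-t[0], t[1]))]


ranking = rank_features(X, y)
print(ranking)
```

In this toy data the second feature separates the classes almost perfectly, so it ranks first, while the first feature carries no label information and ranks last. A compact subset could then be chosen from the top of the ranking, for example by forward selection as the abstract describes.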

Electronic supplementary material

The online version of this article (doi:10.1186/s13040-017-0124-x) contains supplementary material, which is available to authorized users.

Potocnakova L, Bhide M, Pulzova LB. An Introduction to B-Cell Epitope Mapping and In Silico Epitope Prediction. J Immunol Res 2016;2016:6760830. [PMID: 28127568 PMCID: PMC5227168 DOI: 10.1155/2016/6760830] [Citation(s) in RCA: 198] [Impact Index Per Article: 24.8] [Received: 09/14/2016] [Revised: 11/21/2016] [Accepted: 12/13/2016] [Indexed: 01/09/2023]

Zhongxin W, Gang S, Jing Z, Jia Z. Feature Selection Algorithm Based on Mutual Information and Lasso for Microarray Data. Open Biotechnol J 2016. [DOI: 10.2174/1874070701610010278] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Indexed: 11/22/2022]

Zhang Z, Ma H, Fu H, Zhang C. Scene-free multi-class weather classification on single images. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.05.015] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Indexed: 10/21/2022]

Niño-Sandoval TC, Guevara Perez SV, González FA, Jaque RA, Infante-Contreras C. An automatic method for skeletal patterns classification using craniomaxillary variables on a Colombian population. Forensic Sci Int 2015;261:159.e1-6. [PMID: 26782070 DOI: 10.1016/j.forsciint.2015.12.025] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Received: 07/08/2015] [Revised: 12/02/2015] [Accepted: 12/15/2015] [Indexed: 10/22/2022]

Abstract

BACKGROUND

The mandibular bone is an important part of forensic facial reconstruction, and it may be lost in skeletonized remains. For this reason, it is necessary to facilitate the identification process by simulating the mandibular position through craniomaxillary measures alone. Different modeling techniques have been applied to this task, but they only contemplate a straight facial profile that belongs to skeletal pattern Class I; the 24.5% corresponding to the Colombian skeletal patterns Class II and III is not taken into account. In addition, craniofacial measures do not follow a parametric trend or a normal distribution.

OBJECTIVE

The aim of this study was to employ an automatic non-parametric method, Support Vector Machines, to classify skeletal patterns through craniomaxillary variables, in order to simulate the natural mandibular position on a contemporary Colombian sample.

MATERIALS AND METHODS

Lateral cephalograms (229) of Colombian young adults of both sexes were collected. Landmark coordinates protocols were used to create craniomaxillary variables. A Support Vector Machine with a linear kernel classifier model was trained on a subset of the available data and evaluated over the remaining samples. The weights of the model were used to select the 10 best variables for classification accuracy.

RESULTS

An accuracy of 74.51% was obtained, defined by Pr-A-N, N-Pr-A, A-N-Pr, A-Te-Pr, A-Pr-Rhi, Rhi-A-Pr, Pr-A-Te, Te-Pr-A, Zm-A-Pr and PNS-A-Pr angles. The Class Precision and the Class Recall showed a correct distinction of the Class II from the Class III and vice versa.

CONCLUSIONS

Support Vector Machines created an important model of classification of skeletal patterns using craniomaxillary variables that are not commonly used in the literature and could be applicable to the 24.5% of the contemporary Colombian sample.
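
The variable-selection step described in the Materials and Methods paragraph, ranking inputs by the weights of a trained linear SVM, can be sketched as follows. This is a minimal illustration under stated assumptions and not the authors' pipeline: the data are hypothetical, the variable names `angle_a`, `noise_b` and `noise_c` are invented, and the trainer is a plain full-batch subgradient descent on the L2-regularized hinge loss.

```python
# Hypothetical toy data: 8 samples x 3 variables, labels in {-1, +1}.
names = ["angle_a", "noise_b", "noise_c"]
X = [
    [2.0, 0.1, -0.2],
    [1.5, -0.3, 0.1],
    [1.8, 0.2, 0.3],
    [2.2, -0.1, -0.1],
    [-2.0, 0.2, -0.1],
    [-1.7, -0.2, 0.2],
    [-1.9, 0.1, 0.1],
    [-2.1, -0.1, -0.3],
]
y = [1, 1, 1, 1, -1, -1, -1, -1]


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Fit weights w and bias b by subgradient descent on the hinge loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the L2 penalty
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # only margin violators contribute
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def top_k_features(w, names, k):
    """Return the k variable names with the largest absolute weights."""
    order = sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)
    return [names[j] for j in order[:k]]


w, b = train_linear_svm(X, y)
ranked = top_k_features(w, names, 2)
print(ranked)
```

Ranking by absolute weight is only meaningful when the variables are on comparable scales, so real measurements would normally be standardized first. The study kept the 10 best variables; the sketch keeps 2 because the toy data has only 3.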

Banwait JK, Bastola DR. Contribution of bioinformatics prediction in microRNA-based cancer therapeutics. Adv Drug Deliv Rev 2015;81:94-103. [PMID: 25450261 DOI: 10.1016/j.addr.2014.10.030] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Received: 06/16/2014] [Revised: 10/13/2014] [Accepted: 10/30/2014] [Indexed: 12/15/2022]

Farquad M, Ravi V, Raju SB. Churn prediction using comprehensible support vector machine: An analytical CRM application. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2014.01.031] [Citation(s) in RCA: 94] [Impact Index Per Article: 9.4] [Indexed: 11/28/2022]

Luque-Baena RM, Urda D, Subirats JL, Franco L, Jerez JM. Application of genetic algorithms and constructive neural networks for the analysis of microarray cancer data. Theor Biol Med Model 2014;11 Suppl 1:S7. [PMID: 25077572 PMCID: PMC4108856 DOI: 10.1186/1742-4682-11-s1-s7] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Indexed: 11/10/2022]

Abstract

Background

Extracting relevant information from microarray data is a very complex task due to the characteristics of the data sets, as they comprise a large number of features while few samples are generally available. In this sense, feature selection is a very important aspect of the analysis helping in the tasks of identifying relevant genes and also for maximizing predictive information.

Methods

Due to its simplicity and speed, Stepwise Forward Selection (SFS) is a widely used feature selection technique. In this work, we carry out a comparative study of SFS and Genetic Algorithms (GA) as general frameworks for the analysis of microarray data, with the aim of identifying groups of genes with high predictive capability and biological relevance. Six standard and machine learning-based techniques (Linear Discriminant Analysis (LDA), Support Vector Machines (SVM), Naive Bayes (NB), C-MANTEC Constructive Neural Network, K-Nearest Neighbors (kNN) and Multilayer perceptron (MLP)) are used within both frameworks on six freely available public datasets for the task of predicting cancer outcome.

Results

Better cancer outcome prediction results were obtained using the GA framework, though it should be noted that this approach, in comparison to the SFS one, leads to a larger selection set and uses a large number of comparisons between genetic profiles, and is thus computationally more intensive. The GA framework also yielded a set of genes that can be considered more biologically relevant. Regarding the different classifiers used, standard feedforward neural networks (MLP), LDA and SVM led to similar and best results, while C-MANTEC and kNN followed closely but with lower accuracy. Further, C-MANTEC, MLP and LDA yielded a more limited set of genes than SVM, NB and kNN, and C-MANTEC in particular proved to be the most robust classifier with respect to changes in the parameter settings.

Conclusions

This study shows that, if prediction accuracy is the objective, the GA-based approach leads to better results than the SFS approach, independently of the classifier used. Regarding classifiers, although C-MANTEC did not achieve the best overall results, its performance was competitive and very robust with respect to the parameters of the algorithm, so it can be considered a candidate technique for future studies.

Huang YH. A note on hyper ellipse method for classifying biological and medical data. Comput Biol Med 2013;43:1978-86. [DOI: 10.1016/j.compbiomed.2013.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2011] [Revised: 08/12/2013] [Accepted: 08/15/2013] [Indexed: 11/28/2022]

Ercan S, Kayakutlu G. Patent value analysis using support vector machines. Soft comput 2013. [DOI: 10.1007/s00500-013-1059-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Chen ZY, Fan ZP. Parallel multiple kernel learning: a hybrid alternating direction method of multipliers. Knowl Inf Syst 2013. [DOI: 10.1007/s10115-013-0655-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Chen ZY, Fan ZP. Dynamic customer lifetime value prediction using longitudinal data: An improved multiple kernel SVR approach. Knowl Based Syst 2013. [DOI: 10.1016/j.knosys.2013.01.022] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Wu X, Zhu X, He Y, Arslan AN. PMBC: pattern mining from biological sequences with wildcard constraints. Comput Biol Med 2013;43:481-92. [PMID: 23566394 DOI: 10.1016/j.compbiomed.2013.02.006] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Revised: 02/05/2013] [Accepted: 02/07/2013] [Indexed: 11/25/2022]

Abstract

Patterns/subsequences frequently appearing in sequences provide essential knowledge for domain experts, such as molecular biologists, to discover rules or patterns hidden behind the data. Due to the inherently complex nature of biological data, patterns rarely reproduce and repeat themselves exactly, but rather appear in a slightly different form at each occurrence. A gap constraint (also referred to as a wildcard; in this paper, a character that can be substituted for any character predefined in an alphabet) provides flexibility for users to capture useful patterns even if their appearances vary in the sequences. In order to find patterns, existing tools require users to explicitly specify gap constraints beforehand. In reality, it is often nontrivial or time-consuming for users to provide proper gap constraint values. In addition, a change made to the gap values may give completely different results and require a separate, time-consuming re-mining procedure. Therefore, it is desirable to find patterns automatically and efficiently without involving user-specified gap requirements. In this paper, we study the problem of frequent pattern mining without user-specified gap constraints and propose PMBC (namely Pattern Mining from Biological sequences with wildcard Constraints) to solve the problem. Given a sequence and a support threshold value (i.e. pattern frequency threshold), PMBC intends to discover all subsequences with support values equal to or greater than the given threshold value. The frequent subsequences then form patterns later on. Two heuristic methods (one-way vs. two-way scans) are proposed to discover frequent subsequences and estimate their frequency in the sequences. Experimental results on both synthetic and real-world DNA sequences demonstrate the performance of both methods for frequent pattern mining and pattern frequency estimation.

Zhu P, Hu Q. Rule extraction from support vector machines based on consistent region covering reduction. Knowl Based Syst 2013. [DOI: 10.1016/j.knosys.2012.12.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Zhao X, Deng W, Shi Y. Feature Selection with Attributes Clustering by Maximal Information Coefficient. Procedia Comput Sci 2013. [DOI: 10.1016/j.procs.2013.05.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Chen ZY, Fan ZP. Distributed customer behavior prediction using multiplex data: A collaborative MK-SVM approach. Knowl Based Syst 2012. [DOI: 10.1016/j.knosys.2012.04.023] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Ling Y, Cao Q, Zhang H. Credit scoring using multi-kernel support vector machine and chaos particle swarm optimization. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2012. [DOI: 10.1142/s1469026812500198] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Evolution strategy based adaptive Lq penalty support vector machines with Gauss kernel for credit risk analysis. Appl Soft Comput 2012. [DOI: 10.1016/j.asoc.2012.04.011] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Fuzzy rules extraction from support vector machines for multi-class classification. Neural Comput Appl 2012. [DOI: 10.1007/s00521-012-1048-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Accurate Prediction of Coronary Artery Disease Using Reliable Diagnosis System. J Med Syst 2012;36:3353-73. [DOI: 10.1007/s10916-012-9828-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Accepted: 01/30/2012] [Indexed: 10/14/2022]

Florido JP, Pomares H, Rojas I. Generating balanced learning and test sets for function approximation problems. Int J Neural Syst 2011;21:247-63. [PMID: 21656926 DOI: 10.1142/s0129065711002791] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Petri T, Küfner R, Zimmer R. Experiment specific expression patterns. J Comput Biol 2011;18:1423-35. [PMID: 21919744 DOI: 10.1089/cmb.2011.0159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Saei AA, Omidi Y. A glance at DNA microarray technology and applications. BIOIMPACTS : BI 2011;1:75-86. [PMID: 23678411 DOI: 10.5681/bi.2011.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Revised: 07/13/2011] [Accepted: 07/20/2011] [Indexed: 01/06/2023]

Wei L, Chen Z, Li J. Evolution strategies based adaptive Lp LS-SVM. Inf Sci (N Y) 2011. [DOI: 10.1016/j.ins.2011.02.029] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Barakat N, Bradley AP. Rule extraction from support vector machines: A review. Neurocomputing 2010. [DOI: 10.1016/j.neucom.2010.02.016] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Hu Q, Pan W, An S, Ma P, Wei J. An efficient gene selection technique for cancer recognition based on neighborhood mutual information. INT J MACH LEARN CYB 2010. [DOI: 10.1007/s13042-010-0008-6] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Barakat N, Bradley AP, Barakat MNH. Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus. IEEE Trans Inf Technol Biomed 2010;14:1114-20. [DOI: 10.1109/titb.2009.2039485] [Citation(s) in RCA: 177] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Guan P, Huang D, He M, Zhou B. Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method. JOURNAL OF EXPERIMENTAL & CLINICAL CANCER RESEARCH : CR 2009;28:103. [PMID: 19615083 PMCID: PMC2719616 DOI: 10.1186/1756-9966-28-103] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2009] [Accepted: 07/18/2009] [Indexed: 01/13/2023]

Alladi SM, P SS, Ravi V, Murthy US. Colon cancer prediction with genetic profiles using intelligent techniques. Bioinformation 2008;3:130-3. [PMID: 19238250 PMCID: PMC2639687 DOI: 10.6026/97320630003130] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2008] [Revised: 08/28/2008] [Accepted: 09/13/2008] [Indexed: 11/23/2022]

Wood SJ, Pantelis C, Velakoulis D, Yücel M, Fornito A, McGorry PD. Progressive changes in the development toward schizophrenia: studies in subjects at increased symptomatic risk. Schizophr Bull 2008;34:322-9. [PMID: 18199631 PMCID: PMC2632412 DOI: 10.1093/schbul/sbm149] [Citation(s) in RCA: 148] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Peng Y, Zhang X. Integrative data mining in systems biology: from text to network mining. Artif Intell Med 2007;41:83-6. [PMID: 17888638 DOI: 10.1016/j.artmed.2007.08.001] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]