Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Chang CC, Lin CJ. Training nu-support vector classifiers: theory and algorithms. Neural Comput 2001;13:2119-47. [PMID: 11516360 DOI: 10.1162/089976601750399335] [Citation(s) in RCA: 335] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Cited by Other Article(s)

Pike R, Sechopoulos I, Fei B. A minimum spanning forest based classification method for dedicated breast CT images. Med Phys 2015;42:6190-202. [PMID: 26520712 DOI: 10.1118/1.4931958] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open

Wild-type APC predicts poor prognosis in microsatellite-stable proximal colon cancer. Br J Cancer 2015;113:979-88. [PMID: 26305864 PMCID: PMC4578087 DOI: 10.1038/bjc.2015.296] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2015] [Revised: 07/08/2015] [Accepted: 07/20/2015] [Indexed: 12/14/2022] Open

Cheng CC, Lu CF, Hsieh TY, Lin YJ, Taur JS, Chen YF. Design of a Computer-Assisted System to Automatically Detect Cell Types Using ANA IIF Images for the Diagnosis of Autoimmune Diseases. J Med Syst 2015;39:314. [PMID: 26289629 DOI: 10.1007/s10916-015-0314-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2015] [Accepted: 08/04/2015] [Indexed: 10/23/2022]

Li Y, Oommen BJ, Ngom A, Rueda L. Pattern classification using a new border identification paradigm: The nearest border technique. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.01.030] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Tao J, Wang S, Hu W. Minimum class spread constrained support vector machine. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.09.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

New one-class classifiers based on the origin separation approach. Pattern Recognit Lett 2015. [DOI: 10.1016/j.patrec.2014.11.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Learning in Reproducing Kernel Hilbert Spaces. Mach Learn 2015. [DOI: 10.1016/b978-0-12-801522-3.00011-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Emblem KE, Pinho MC, Zöllner FG, Due-Tonnessen P, Hald JK, Schad LR, Meling TR, Rapalino O, Bjornerud A. A generic support vector machine model for preoperative glioma survival associations. Radiology 2014;275:228-34. [PMID: 25486589 DOI: 10.1148/radiol.14140770] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

PURPOSE

To develop a generic support vector machine (SVM) model by using magnetic resonance (MR) imaging-based blood volume distribution data for preoperative glioma survival associations and to prospectively evaluate the diagnostic effectiveness of this model in autonomous patient data.

MATERIALS AND METHODS

Institutional and regional medical ethics committees approved the study, and all patients signed a consent form. Two hundred thirty-five preoperative adult patients from two institutions with a subsequent histologically confirmed diagnosis of glioma after surgery were included retrospectively. An SVM learning technique was applied to MR imaging-based whole-tumor relative cerebral blood volume (rCBV) histograms. SVM models with the highest diagnostic accuracy for 6-month and 1-, 2-, and 3-year survival associations were trained on 101 patients from the first institution. With Cox survival analysis, the diagnostic effectiveness of the SVM models was tested on independent data from 134 patients at the second institution. Results were adjusted for known survival predictors, including patient age, tumor size, neurologic status, and postsurgery treatment, and were compared with survival associations from an expert reader.

RESULTS

Compared with total qualitative assessment by an expert reader, the whole-tumor rCBV-based SVM model was the strongest parameter associated with 6-month and 1-, 2-, and 3-year survival in the independent patient data (area under the receiver operating characteristic curve, 0.794-0.851; hazard ratio, 5.4-21.2).

DISCUSSION

Machine learning by means of SVM in combination with whole-tumor rCBV histogram analysis can be used to identify early patient survival in aggressive gliomas. The SVM model returned higher diagnostic accuracy values than an expert reader, and the model appears to be insensitive to patient, observer, and institutional variations.

Weitsman G, Lawler K, Kelleher MT, Barrett JE, Barber PR, Shamil E, Festy F, Patel G, Fruhwirth GO, Huang L, Tullis ID, Woodman N, Ofo E, Ameer-Beg SM, Irshad S, Condeelis J, Gillett CE, Ellis PA, Vojnovic B, Coolen AC, Ng T. Imaging tumour heterogeneity of the consequences of a PKCα-substrate interaction in breast cancer patients. Biochem Soc Trans 2014;42:1498-505. [PMID: 25399560 PMCID: PMC4259014 DOI: 10.1042/bst20140165] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Affiliation(s)

Gregory Weitsman Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K
Katherine Lawler Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K Department of Mathematics, King’s College London, Strand Campus, London WC2R 2LS, U.K
Muireann T. Kelleher Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K Department of Medical Oncology, St George’s NHS Trust, London SW17 0QT, U.K
James E. Barrett Department of Mathematics, King’s College London, Strand Campus, London WC2R 2LS, U.K
Paul R. Barber Gray Institute for Radiation Oncology & Biology, University of Oxford, Old Road Campus Research Building, Roosevelt Drive, Oxford OX3 7DQ, U.K
Eamon Shamil Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K
Frederic Festy Biomaterials, Biomimetics and Biophotonics Division, King’s College London Dental Institute, London SE1 9RT, U.K
Gargi Patel Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K Department of Medical Oncology, Guy’s and St. Thomas Foundation Trust, London SE1 9RT, U.K
Gilbert O. Fruhwirth Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K Division of Imaging Science and Biomedical Engineering, King’s College London, London SE1 7EH, U.K
Lufei Huang Gray Institute for Radiation Oncology & Biology, University of Oxford, Old Road Campus Research Building, Roosevelt Drive, Oxford OX3 7DQ, U.K
Iain D.C. Tullis Gray Institute for Radiation Oncology & Biology, University of Oxford, Old Road Campus Research Building, Roosevelt Drive, Oxford OX3 7DQ, U.K
Natalie Woodman Guy’s & St. Thomas’ Breast Tissue & Data Bank, King’s College London, Guy’s Hospital, London SE1 9RT, U.K
Enyinnaya Ofo Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K
Simon M. Ameer-Beg Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K
Sheeba Irshad Breakthrough Breast Cancer Research Unit, Department of Research Oncology, Guy’s Hospital King’s College London School of Medicine, London, SE1 9RT, U.K
John Condeelis Tumor Microenvironment and Metastasis Program, Albert Einstein Cancer Center, New York, NY 10461, U.S.A
Cheryl E. Gillett Guy’s & St. Thomas’ Breast Tissue & Data Bank, King’s College London, Guy’s Hospital, London SE1 9RT, U.K
Paul A. Ellis Department of Medical Oncology, Guy’s and St. Thomas Foundation Trust, London SE1 9RT, U.K
Borivoj Vojnovic Gray Institute for Radiation Oncology & Biology, University of Oxford, Old Road Campus Research Building, Roosevelt Drive, Oxford OX3 7DQ, U.K Randall Division of Cell & Molecular Biophysics, King’s College London, London, U.K
Anthony C.C. Coolen Department of Mathematics, King’s College London, Strand Campus, London WC2R 2LS, U.K
Tony Ng Richard Dimbleby Department of Cancer Research, Randall Division & Division of Cancer Studies, Kings College London, Guy’s Medical School Campus, London SE1 1UL, U.K Breakthrough Breast Cancer Research Unit, Department of Research Oncology, Guy’s Hospital King’s College London School of Medicine, London, SE1 9RT, U.K UCL Cancer Institute, Paul O’Gorman Building, University College London, London WC1E 6DD, U.K

Asymmetric ν-tube support vector regression. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2014.03.016] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Kadyrova NO, Pavlova LV. Statistical analysis of big data: an approach based on support vector machines for classification and regression problems. Biophysics (Nagoya-shi) 2014. [DOI: 10.1134/s0006350914030105] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Chen YF, Huang PC, Lin KC, Lin HH, Wang LE, Cheng CC, Chen TP, Chan YK, Chiang JY. Semi-automatic segmentation and classification of Pap smear cells. IEEE J Biomed Health Inform 2014;18:94-108. [PMID: 24403407 DOI: 10.1109/jbhi.2013.2250984] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Yang F, Xu YY, Wang ST, Shen HB. Image-based classification of protein subcellular location patterns in human reproductive tissue by ensemble learning global and local features. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2013.10.034] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Wei X, Ai J, Deng Y, Guan X, Johnson DR, Ang CY, Zhang C, Perkins EJ. Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles. BMC Genomics 2014;15:248. [PMID: 24678894 PMCID: PMC4051169 DOI: 10.1186/1471-2164-15-248] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2013] [Accepted: 03/11/2014] [Indexed: 11/29/2022] Open

Abstract

Background

High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mechanisms of action.

Results

In this study, in vitro cultures of primary rat hepatocytes were exposed to 105 chemicals and vehicle controls, representing 14 compound classes. We comprehensively compared various normalization of gene expression profiles, feature selection and classification algorithms for the classification of these 105 chemicals into 14 compound classes. We found that normalization had little effect on the averaged classification accuracy. Two support vector machine (SVM) methods, LibSVM and sequential minimal optimization, had better classification performance than other methods. SVM recursive feature selection (SVM-RFE) had the highest overfitting rate when an independent dataset was used for a prediction. Therefore, we developed a new feature selection algorithm called gradient method that had a relatively high training classification as well as prediction accuracy with the lowest overfitting rate of the methods tested. Analysis of biomarkers that distinguished the 14 classes of compounds identified a group of genes principally involved in cell cycle function that were significantly downregulated by metal and inflammatory compounds, but were induced by anti-microbial, cancer related drugs, pesticides, and PXR mediators.

Conclusions

Our results indicate that using microarrays and a supervised machine learning approach to predict chemical toxicants, their potential toxicity and mechanisms of action is practical and efficient. Choosing the right feature and classification algorithms for this multiple category classification and prediction is critical.

Sankar M, Nieminen K, Ragni L, Xenarios I, Hardtke CS. Automated quantitative histology reveals vascular morphodynamics during Arabidopsis hypocotyl secondary growth. eLife 2014;3:e01567. [PMID: 24520159 PMCID: PMC3917233 DOI: 10.7554/elife.01567] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract

Among various advantages, their small size makes model organisms preferred subjects of investigation. Yet, even in model systems detailed analysis of numerous developmental processes at cellular level is severely hampered by their scale. For instance, secondary growth of Arabidopsis hypocotyls creates a radial pattern of highly specialized tissues that comprises several thousand cells starting from a few dozen. This dynamic process is difficult to follow because of its scale and because it can only be investigated invasively, precluding comprehensive understanding of the cell proliferation, differentiation, and patterning events involved. To overcome such limitation, we established an automated quantitative histology approach. We acquired hypocotyl cross-sections from tiled high-resolution images and extracted their information content using custom high-throughput image processing and segmentation. Coupled with automated cell type recognition through machine learning, we could establish a cellular resolution atlas that reveals vascular morphodynamics during secondary growth, for example equidistant phloem pole formation.

DOI:http://dx.doi.org/10.7554/eLife.01567.001

Our understanding of the living world has been advanced greatly by studies of ‘model organisms’, such as mice, zebrafish, and fruit flies. Studying these creatures has been crucial to uncovering the genes that control how our bodies develop and grow, and also to discover the genetic basis of diseases such as cancer.

Thale cress (or Arabidopsis thaliana, to give its formal name) is the model organism of choice for many plant biologists. This tiny weed has been widely studied because it can complete its lifecycle, from seed to seed, in about 6 weeks, and because its relatively small genome simplifies the search for genes that control specific traits. However, as with other much-studied model systems, understanding the changes that underpin the development of some of the more complex tissues in Arabidopsis has been severely hampered by the sheer number of cells involved.

After it has emerged from the seed, the plant’s first stem will develop from a few dozen cells in width to several thousand cells with highly specialized tissues arranged in a complex pattern of concentric circles. Although this stem thickening process represents a major developmental change in many plants—from Arabidopsis to oak trees—it has been under-researched. This is partly because it involves so many different cells, and also because it can only be observed in thin sections cut out of the plant’s stem.

Now Sankar, Nieminen, Ragni et al. have developed a novel approach, termed ‘automated quantitative histology’, to overcome these problems. This strategy involves ‘teaching’ a computer to automatically recognize different plant cells and to measure their important features in high-resolution images of tissue sections. The resulting ‘map’ of the developing stem—which required over 800 hr of computing time to complete—reveals the changes to cells and tissues as they develop that allow the transport of water, sugars and nutrients between the above- and below-ground organs. Sankar, Nieminen, Ragni et al. suggest that their novel approach could, in the future, also be applied to study the development of other tissues and organisms, including animals.

DOI:http://dx.doi.org/10.7554/eLife.01567.002

Application of Hybrid Functional Groups to Predict ATP Binding Proteins. ACTA ACUST UNITED AC 2014;2014:581245. [PMID: 24729962 PMCID: PMC3980875 DOI: 10.1155/2014/581245] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Jändel M. Biologically relevant neural network architectures for support vector machines. Neural Netw 2014;49:39-50. [DOI: 10.1016/j.neunet.2013.09.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Revised: 06/05/2013] [Accepted: 09/18/2013] [Indexed: 10/26/2022]

Emblem KE, Due-Tonnessen P, Hald JK, Bjornerud A, Pinho MC, Scheie D, Schad LR, Meling TR, Zoellner FG. Machine learning in preoperative glioma MRI: Survival associations by perfusion-based support vector machine outperforms traditional MRI. J Magn Reson Imaging 2013;40:47-54. [DOI: 10.1002/jmri.24390] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2012] [Accepted: 07/12/2013] [Indexed: 11/12/2022] Open

Noh E, Herzmann G, Curran T, de Sa VR. Using single-trial EEG to predict and analyze subsequent memory. Neuroimage 2013;84:712-23. [PMID: 24064073 DOI: 10.1016/j.neuroimage.2013.09.028] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Revised: 08/28/2013] [Accepted: 09/13/2013] [Indexed: 11/27/2022] Open

Sheng VS. Feasibility and finite convergence analysis for accurate on-line ν-support vector machine. IEEE Trans Neural Netw Learn Syst 2013;24:1304-1315. [PMID: 24808569 DOI: 10.1109/tnnls.2013.2250300] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Xu X, Tsang IW, Xu D. Soft margin multiple kernel learning. IEEE Trans Neural Netw Learn Syst 2013;24:749-761. [PMID: 24808425 DOI: 10.1109/tnnls.2012.2237183] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Abstract

Multiple kernel learning (MKL) has been proposed for kernel methods by learning the optimal kernel from a set of predefined base kernels. However, the traditional L1MKL method often achieves worse results than the simplest method using the average of base kernels (i.e., average kernel) in some practical applications. In order to improve the effectiveness of MKL, this paper presents a novel soft margin perspective for MKL. Specifically, we introduce an additional slack variable called kernel slack variable to each quadratic constraint of MKL, which corresponds to one support vector machine model using a single base kernel. We first show that L1MKL can be deemed as hard margin MKL, and then we propose a novel soft margin framework for MKL. Three commonly used loss functions, including the hinge loss, the square hinge loss, and the square loss, can be readily incorporated into this framework, leading to the new soft margin MKL objective functions. Many existing MKL methods can be shown as special cases under our soft margin framework. For example, the hinge loss soft margin MKL leads to a new box constraint for kernel combination coefficients. Using different hyper-parameter values for this formulation, we can inherently bridge the method using average kernel, L1MKL, and the hinge loss soft margin MKL. The square hinge loss soft margin MKL unifies the family of elastic net constraint/regularizer based approaches; and the square loss soft margin MKL incorporates L2MKL naturally. Moreover, we also develop efficient algorithms for solving both the hinge loss and square hinge loss soft margin MKL. Comprehensive experimental studies for various MKL algorithms on several benchmark data sets and two real world applications, including video action recognition and event recognition demonstrate that our proposed algorithms can efficiently achieve an effective yet sparse solution for MKL.
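As a rough illustration of the kernel-combination idea in this abstract, the sketch below forms a weighted sum of base Gram matrices and checks whether a weight vector satisfies a box constraint. The function names, the toy matrices, and the specific constraint form (weights summing to 1, each between 0 and an upper bound `theta`) are assumptions made for illustration; the abstract states only that a box constraint on the combination coefficients arises, and this is not the authors' algorithm.

```python
def combine_kernels(kernels, weights):
    """Weighted sum of square Gram matrices given as nested lists."""
    n = len(kernels[0])
    return [
        [sum(w * K[i][j] for w, K in zip(weights, kernels)) for j in range(n)]
        for i in range(n)
    ]


def is_feasible(weights, theta, tol=1e-9):
    """Assumed constraint: weights sum to 1 and each lies in [0, theta]."""
    sums_to_one = abs(sum(weights) - 1.0) <= tol
    in_box = all(-tol <= w <= theta + tol for w in weights)
    return sums_to_one and in_box


# Two toy base kernels on two samples (hypothetical values).
K1 = [[1.0, 0.0], [0.0, 1.0]]
K2 = [[1.0, 1.0], [1.0, 1.0]]

# Uniform weights give the "average kernel" baseline.
average_kernel = combine_kernels([K1, K2], [0.5, 0.5])

# With theta = 1/M only uniform weights are feasible; with theta = 1
# any point on the simplex is allowed, including a single kernel.
uniform_ok = is_feasible([0.5, 0.5], theta=0.5)
single_tight = is_feasible([1.0, 0.0], theta=0.5)
single_loose = is_feasible([1.0, 0.0], theta=1.0)
```

Under this assumed constraint, tightening `theta` toward 1/M pushes the solution toward the average kernel, while loosening it to 1 recovers an unconstrained simplex.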

Song L, Langfelder P, Horvath S. Random generalized linear model: a highly accurate and interpretable ensemble predictor. BMC Bioinformatics 2013;14:5. [PMID: 23323760 PMCID: PMC3645958 DOI: 10.1186/1471-2105-14-5] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 01/03/2013] [Indexed: 01/13/2023] Open

Abstract

BACKGROUND

Ensemble predictors such as the random forest are known to have superior accuracy but their black-box predictions are difficult to interpret. In contrast, a generalized linear model (GLM) is very interpretable especially when forward feature selection is used to construct the model. However, forward feature selection tends to overfit the data and leads to low predictive accuracy. Therefore, it remains an important research goal to combine the advantages of ensemble predictors (high accuracy) with the advantages of forward regression modeling (interpretability). To address this goal several articles have explored GLM based ensemble predictors. Since limited evaluations suggested that these ensemble predictors were less accurate than alternative predictors, they have found little attention in the literature.

RESULTS

Comprehensive evaluations involving hundreds of genomic data sets, the UCI machine learning benchmark data, and simulations are used to give GLM based ensemble predictors a new and careful look. A novel bootstrap aggregated (bagged) GLM predictor that incorporates several elements of randomness and instability (random subspace method, optional interaction terms, forward variable selection) often outperforms a host of alternative prediction methods including random forests and penalized regression models (ridge regression, elastic net, lasso). This random generalized linear model (RGLM) predictor provides variable importance measures that can be used to define a "thinned" ensemble predictor (involving few features) that retains excellent predictive accuracy.

CONCLUSION

RGLM is a state of the art predictor that shares the advantages of a random forest (excellent predictive accuracy, feature importance measures, out-of-bag estimates of accuracy) with those of a forward selected generalized linear model (interpretability). These methods are implemented in the freely available R software package randomGLM.

Takeda A, Mitsugi H, Kanamori T. A unified classification model based on robust optimization. Neural Comput 2013;25:759-804. [PMID: 23272917 DOI: 10.1162/neco_a_00412] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Proximal parametric-margin support vector classifier and its applications. Neural Comput Appl 2012. [DOI: 10.1007/s00521-012-1278-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Bai B, Wang Y, Yang C. Predicting atrial fibrillation inducibility in a canine model by multi-threshold spectra of the recurrence complex network. Med Eng Phys 2012;35:668-75. [PMID: 22925583 DOI: 10.1016/j.medengphy.2012.07.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2011] [Revised: 07/19/2012] [Accepted: 07/21/2012] [Indexed: 10/28/2022]

Kim J, Yi GS. PKMiner: a database for exploring type II polyketide synthases. BMC Microbiol 2012;12:169. [PMID: 22871112 PMCID: PMC3462128 DOI: 10.1186/1471-2180-12-169] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2012] [Accepted: 08/02/2012] [Indexed: 11/17/2022] Open

Abstract

Background

Bacterial aromatic polyketides are a pharmacologically important group of natural products synthesized by type II polyketide synthases (type II PKSs) in actinobacteria. Isolation of novel aromatic polyketides from microbial sources is currently impeded because of the lack of knowledge about prolific taxa for polyketide synthesis and the difficulties in finding and optimizing target microorganisms. Comprehensive analysis of type II PKSs and the prediction of possible polyketide chemotypes in various actinobacterial genomes will thus enable the discovery or synthesis of novel polyketides in the most plausible microorganisms.

Description

We performed a comprehensive computational analysis of type II PKSs and their gene clusters in actinobacterial genomes. By identifying type II PKS subclasses from the sequence analysis of 280 known type II PKSs, we developed highly accurate domain classifiers for these subclasses and derived prediction rules for aromatic polyketide chemotypes generated by different combinations of type II PKS domains. Using 319 available actinobacterial genomes, we predicted 231 type II PKSs from 40 PKS gene clusters in 25 actinobacterial genomes, and polyketide chemotypes corresponding to 22 novel PKS gene clusters in 16 genomes. These results showed that the microorganisms capable of producing aromatic polyketides are specifically distributed within a certain suborder of Actinomycetales such as Catenulisporineae, Frankineae, Micrococcineae, Micromonosporineae, Pseudonocardineae, Streptomycineae, and Streptosporangineae.

Conclusions

We could identify the novel candidates of type II PKS gene clusters and their polyketide chemotypes in actinobacterial genomes by comprehensive analysis of type II PKSs and prediction of aromatic polyketides. The genome analysis results indicated that the specific suborders in actinomycetes could be used as prolific taxa for polyketide synthesis. The chemotype-prediction rules with the suggested type II PKS modules derived using this resource can be used further for microbial engineering to produce various aromatic polyketides. All these resources, together with the results of the analysis, are organized into an easy-to-use database PKMiner, which is accessible at the following URL: http://pks.kaist.ac.kr/pkminer. We believe that this web-based tool would be useful for research in the discovery of novel bacterial aromatic polyketides.

Gu B, Wang JD, Zheng GS, Yu YC. Regularization path for ν-support vector classification. IEEE Trans Neural Netw Learn Syst 2012;23:800-811. [PMID: 24806128 DOI: 10.1109/tnnls.2012.2183644] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Doktorski L. Properties of the solution of L2-Support Vector Machine as a function of regularization parameter. Pattern Recognit Image Anal 2012. [DOI: 10.1134/s1054661812010129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Gu B, Wang JD, Yu YC, Zheng GS, Huang YF, Xu T. Accurate on-line ν-support vector learning. Neural Netw 2012;27:51-9. [DOI: 10.1016/j.neunet.2011.10.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2009] [Revised: 10/06/2011] [Accepted: 10/14/2011] [Indexed: 11/25/2022]

Li L, Wang B, Meroueh SO. Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries. J Chem Inf Model 2011;51:2132-8. [PMID: 21728360 PMCID: PMC3209528 DOI: 10.1021/ci200078f] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Chen YF, Hsu KC, Lin PT, Hsu DF, Kristal BS, Yang JM. LigSeeSVM: ligand-based virtual screening using support vector machines and data fusion. Int J Comput Biol Drug Des 2011;4:274-89. [PMID: 21778560 DOI: 10.1504/ijcbdd.2011.041415] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

Capriotti E, Altman RB. A new disease-specific machine learning approach for the prediction of cancer-causing missense variants. Genomics 2011;98:310-7. [PMID: 21763417 DOI: 10.1016/j.ygeno.2011.06.010] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2011] [Revised: 06/26/2011] [Accepted: 06/28/2011] [Indexed: 12/20/2022]

Capriotti E, Altman RB. Improving the prediction of disease-related variants using protein three-dimensional structure. BMC Bioinformatics 2011;12 Suppl 4:S3. [PMID: 21992054 PMCID: PMC3194195 DOI: 10.1186/1471-2105-12-s4-s3] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

Background

Single Nucleotide Polymorphisms (SNPs) are an important source of human genome variability. Non-synonymous SNPs occurring in coding regions result in single amino acid polymorphisms (SAPs) that may affect protein function and lead to pathology. Several methods attempt to estimate the impact of SAPs using different sources of information. Although sequence-based predictors have shown good performance, the quality of these predictions can be further improved by introducing new features derived from three-dimensional protein structures.

Results

In this paper, we present a structure-based machine learning approach for predicting disease-related SAPs. We trained a Support Vector Machine (SVM) on a set of 3,342 disease-related mutations and 1,644 neutral polymorphisms from 784 protein chains. We use SVM input features derived from the protein’s sequence, structure, and function. After dataset balancing, the structure-based method (SVM-3D) reaches an overall accuracy of 85%, a correlation coefficient of 0.70, and an area under the receiver operating characteristic curve (AUC) of 0.92. Compared with a similar sequence-based predictor, SVM-3D increases the overall accuracy and AUC by 3% and the correlation coefficient by 0.06. The robustness of this improvement has been tested on different datasets, and in all cases SVM-3D performs better than previously developed methods, even when compared with PolyPhen2, which explicitly takes protein structure information as input.

Conclusion

This work demonstrates that structural information can increase the accuracy of identifying disease-related SAPs. Our results also quantify the magnitude of improvement on a large dataset. This improvement agrees with previously observed results, where structure information enhanced the prediction of protein stability changes upon mutation. Although the limited structural information in the Protein Data Bank restricts the application and performance of our structure-based method, we expect that SVM-3D will achieve higher accuracy as more structural data become available.

Chen M, Guan J, Liu H. Enabling fast brain-computer interaction by single-trial extraction of visual evoked potentials. J Med Syst 2011;35:1323-31. [PMID: 21681514 DOI: 10.1007/s10916-011-9696-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2010] [Accepted: 03/29/2011] [Indexed: 11/28/2022]

Doktorski L. L2-SVM: Dependence on the regularization parameter. PATTERN RECOGNITION AND IMAGE ANALYSIS 2011. [DOI: 10.1134/s1054661811020258] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Kamath SD, Ray S, Mahato KK. Photoacoustic spectroscopy of ovarian normal, benign, and malignant tissues: a pilot study. JOURNAL OF BIOMEDICAL OPTICS 2011;16:067001. [PMID: 21721822 DOI: 10.1117/1.3583573] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Ko D, Windle B. Enriching for correct prediction of biological processes using a combination of diverse classifiers. BMC Bioinformatics 2011;12:189. [PMID: 21605426 PMCID: PMC3121646 DOI: 10.1186/1471-2105-12-189] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Accepted: 05/23/2011] [Indexed: 11/20/2022] Open

Abstract

Background

Machine learning models (classifiers) for classifying genes to biological processes each have their own characteristics in which genes can be classified and to which biological processes. No single learning model is qualitatively superior to any other model, and overall precision for each model tends to be low. The classification results of the classifiers can be complementary and synergistic, suggesting the benefit of combining algorithms, but the prediction probability outputs of different learning models are often neither comparable nor compatible for combining. A means to compare outputs regardless of the model and data used, and to combine the results into an improved comprehensive model, is needed.

Results

Gene expression patterns from NCI's panel of 60 cell lines were used to train a Random Forest, a Support Vector Machine and a Neural Network model, plus two over-sampled models for classifying genes to biological processes. Each model produced unique characteristics in the classification results. We introduce the Precision Index measure (PIN) from the maximum posterior probability that allows assessing, comparing and combining multiple classifiers. The class specific precision measure (PIC) is introduced and used to select a subset of predictions across all classes and all classifiers with high precision. We developed a single classifier that combines the PINs from these five models in prediction and found that the PIN Combined Classifier (PINCom) significantly increased the number of correctly predicted genes over any single classifier. The PINCom applied to test genes that were not used in training also showed substantial improvement over any single model.

Conclusions

This paper introduces novel and effective ways of assessing predictions by their precision and recall, plus a method that combines several machine learning models and capitalizes on synergy and complementation in class selection, resulting in higher precision and recall. Different machine learning models yielded incongruent results, which were successfully combined into one superior model using the PIN measure we developed. Validation of the boosted predictions for gene functions showed the genes to be accurately predicted.

Chang CC, Lin CJ. LIBSVM. ACM T INTEL SYST TEC 2011. [DOI: 10.1145/1961189.1961199] [Citation(s) in RCA: 8788] [Impact Index Per Article: 676.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Senapedis WT, Kennedy CJ, Boyle PM, Silver PA. Whole genome siRNA cell-based screen links mitochondria to Akt signaling network through uncoupling of electron transport chain. Mol Biol Cell 2011;22:1791-805. [PMID: 21460183 PMCID: PMC3093329 DOI: 10.1091/mbc.e10-10-0854] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Baralis E, Bruno G, Fiori A. Measuring gene similarity by means of the classification distance. Knowl Inf Syst 2011. [DOI: 10.1007/s10115-010-0374-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Jändel M. Natural evolution of neural support vector machines. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2011;718:193-207. [PMID: 21744220 DOI: 10.1007/978-1-4614-0164-3_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Cevikalp H, Triggs B, Yavuz HS, Küçük Y, Küçük M, Barkana A. Large margin classifiers based on affine hulls. Neurocomputing 2010. [DOI: 10.1016/j.neucom.2010.06.018] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Davenport MA, Baraniuk RG, Scott CD. Tuning support vector machines for minimax and Neyman-Pearson classification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2010;32:1888-1898. [PMID: 20724764 DOI: 10.1109/tpami.2010.29] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

Konstantinopoulos PA, Spentzos D, Karlan BY, Taniguchi T, Fountzilas E, Francoeur N, Levine DA, Cannistra SA. Gene expression profile of BRCAness that correlates with responsiveness to chemotherapy and with outcome in patients with epithelial ovarian cancer. J Clin Oncol 2010;28:3555-61. [PMID: 20547991 DOI: 10.1200/jco.2009.27.5719] [Citation(s) in RCA: 363] [Impact Index Per Article: 25.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open

Bai O, Lin P, Huang D, Fei DY, Floeter MK. Towards a user-friendly brain-computer interface: initial tests in ALS and PLS patients. Clin Neurophysiol 2010;121:1293-303. [PMID: 20347612 DOI: 10.1016/j.clinph.2010.02.157] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2009] [Revised: 02/02/2010] [Accepted: 02/25/2010] [Indexed: 10/19/2022]

Abstract

OBJECTIVE

Patients usually require long-term training for effective EEG-based brain-computer interface (BCI) control due to fatigue caused by the demands for focused attention during prolonged BCI operation. We intended to develop a user-friendly BCI requiring minimal training and less mental load.

METHODS

BCI performance was tested in three patients with amyotrophic lateral sclerosis (ALS) and three patients with primary lateral sclerosis (PLS), none of whom had previous BCI experience. All patients performed binary control of cursor movement. One ALS patient and one PLS patient performed four-directional cursor control in a two-dimensional domain under a BCI paradigm associated with human natural motor behavior using motor execution and motor imagery. Subjects practiced for 5-10 min and then participated in a multi-session study of either binary control or four-directional control, including an online BCI game, over 1.5-2 h in a single visit.

RESULTS

Event-related desynchronization and event-related synchronization in the beta band were observed in all patients during the production of voluntary movement, either by motor execution or motor imagery. Online binary control of cursor movement was achieved with an average accuracy of about 82.1 ± 8.2% with motor execution and about 80% with motor imagery, whereas offline accuracy after optimization reached 91.4 ± 3.4% with motor execution and 83.3 ± 8.9% with motor imagery. In addition, four-directional cursor control was achieved with an accuracy of 50-60% with motor execution and motor imagery.

CONCLUSION

Patients with ALS or PLS may achieve BCI control without extended training, and fatigue might be reduced during operation of a BCI associated with human natural motor behavior.

SIGNIFICANCE

The development of a user-friendly BCI will promote practical BCI applications in paralyzed patients.

Jändel M. A neural support vector machine. Neural Netw 2010;23:607-13. [PMID: 20092978 DOI: 10.1016/j.neunet.2010.01.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2008] [Revised: 10/04/2009] [Accepted: 01/02/2010] [Indexed: 12/01/2022]

A method to sparsify the solution of support vector regression. Neural Comput Appl 2009. [DOI: 10.1007/s00521-009-0255-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Calabrese R, Capriotti E, Fariselli P, Martelli PL, Casadio R. Functional annotations improve the predictive score of human disease-related mutations in proteins. Hum Mutat 2009;30:1237-44. [PMID: 19514061 DOI: 10.1002/humu.21047] [Citation(s) in RCA: 455] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Chen B, Johnson M. Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM). BMC Bioinformatics 2009;10 Suppl 11:S15. [PMID: 19811680 PMCID: PMC3226186 DOI: 10.1186/1471-2105-10-s11-s15] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Huang D, Lin P, Fei DY, Chen X, Bai O. Decoding human motor activity from EEG single trials for a discrete two-dimensional cursor control. J Neural Eng 2009;6:046005. [PMID: 19556679 DOI: 10.1088/1741-2560/6/4/046005] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]