1
|
Park M, Shin SJ. Fraud detection support vector machines with a functional predictor: application to defective wafer detection problem. KOREAN JOURNAL OF APPLIED STATISTICS 2022. [DOI: 10.5351/kjas.2022.35.5.593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
2
|
A model-free soft classification with a functional predictor. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2019. [DOI: 10.29220/csam.2019.26.6.635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
3
|
Decker JT, Hall MS, Blaisdell RB, Schwark K, Jeruss JS, Shea LD. Dynamic microRNA activity identifies therapeutic targets in trastuzumab-resistant HER2 + breast cancer. Biotechnol Bioeng 2018; 115:2613-2623. [PMID: 29981261 DOI: 10.1002/bit.26791] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 06/28/2018] [Accepted: 07/02/2018] [Indexed: 02/06/2023]
Abstract
MicroRNAs (miRNAs) are implicated in numerous physiologic and pathologic processes, such as the development of resistance to chemotherapy. Determining the role of miRNAs in these processes is often accomplished through measuring miRNA abundance by polymerase chain reaction, sequencing, or microarrays. We have developed a system for the large-scale monitoring of dynamic miRNA activity and have applied this system to identify the contribution miRNA activity to the development of trastuzumab resistance in a cell model of HER2+ breast cancer. MiRNA activity measurements identified significantly different activity levels between BT474 cells (HER2 + breast cancer) and BT474R cells (HER2 + breast cancer cells selected for resistance to trastuzumab). We created a library of 32 miRNA reporter constructs, which were delivered by lentiviral transduction into cells, and miRNA activity was quantified by bioluminescence imaging. Upon treatment with the bioimmune therapy, trastuzumab, the activity of 11 miRNAs were significantly altered in parental BT474 cells, and 20 miRNAs had significantly altered activity in the therapy-resistant BT474R cell line. A combination of statistical, network and classification analysis was applied to the dynamic data, which identified miR-21 as a controlling factor in trastuzumab response. Our data suggested downregulation of miR-21 activity was associated with resistance, which was confirmed in an additional HER2 + breast cancer cell line, SKBR3. Collectively, the dynamic miRNA activity measurements and analysis provided a system to identify new potential therapeutic targets in treatment-resistant cancers.
Collapse
Affiliation(s)
- Joseph T Decker
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan
| | - Matthew S Hall
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan
| | - Rachel B Blaisdell
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan
| | - Kallen Schwark
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan
| | - Jacqueline S Jeruss
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan.,Department of Surgery, University of Michigan, Ann Arbor, Michigan
| | - Lonnie D Shea
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan
| |
Collapse
|
4
|
Affiliation(s)
- Hyojoong Kim
- Department of Industrial and Systems Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
| | - Heeyoung Kim
- Department of Industrial and Systems Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
| |
Collapse
|
5
|
Application and interpretation of functional data analysis techniques to differential scanning calorimetry data from lupus patients. PLoS One 2017; 12:e0186232. [PMID: 29121669 PMCID: PMC5679774 DOI: 10.1371/journal.pone.0186232] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 09/27/2017] [Indexed: 11/19/2022] Open
Abstract
Background DSC is used to determine thermally-induced conformational changes of biomolecules within a blood plasma sample. Recent research has indicated that DSC curves (or thermograms) may have different characteristics based on disease status and, thus, may be useful as a monitoring and diagnostic tool for some diseases. Since thermograms are curves measured over a range of temperature values, they are considered functional data. In this paper we apply functional data analysis techniques to analyze differential scanning calorimetry (DSC) data from individuals from the Lupus Family Registry and Repository (LFRR). The aim was to assess the effect of lupus disease status as well as additional covariates on the thermogram profiles, and use FD analysis methods to create models for classifying lupus vs. control patients on the basis of the thermogram curves. Methods Thermograms were collected for 300 lupus patients and 300 controls without lupus who were matched with diseased individuals based on sex, race, and age. First, functional regression with a functional response (DSC) and categorical predictor (disease status) was used to determine how thermogram curve structure varied according to disease status and other covariates including sex, race, and year of birth. Next, functional logistic regression with disease status as the response and functional principal component analysis (FPCA) scores as the predictors was used to model the effect of thermogram structure on disease status prediction. The prediction accuracy for patients with Osteoarthritis and Rheumatoid Arthritis but without Lupus was also calculated to determine the ability of the classifier to differentiate between Lupus and other diseases. Data were divided 1000 times into separate 2/3 training and 1/3 test data for evaluation of predictions. Finally, derivatives of thermogram curves were included in the models to determine whether they aided in prediction of disease status. Results Functional regression with thermogram as a functional response and disease status as predictor showed a clear separation in thermogram curve structure between cases and controls. The logistic regression model with FPCA scores as the predictors gave the most accurate results with a mean 79.22% correct classification rate with a mean sensitivity = 79.70%, and specificity = 81.48%. The model correctly classified OA and RA patients without Lupus as controls at a rate of 75.92% on average with a mean sensitivity = 79.70% and specificity = 77.6%. Regression models including FPCA scores for derivative curves did not perform as well, nor did regression models including covariates. Conclusion Changes in thermograms observed in the disease state likely reflect covalent modifications of plasma proteins or changes in large protein-protein interacting networks resulting in the stabilization of plasma proteins towards thermal denaturation. By relating functional principal components from thermograms to disease status, our Functional Principal Component Analysis model provides results that are more easily interpretable compared to prior studies. Further, the model could also potentially be coupled with other biomarkers to improve diagnostic classification for lupus.
Collapse
|
6
|
|
7
|
|
8
|
Liu S, Maharaj EA, Inder B. Polarization of forecast densities: A new approach to time series classification. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2013.10.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
9
|
A hypothesis test using bias-adjusted AR estimators for classifying time series in small samples. Comput Stat Data Anal 2013. [DOI: 10.1016/j.csda.2012.11.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
10
|
Ullah S, Finch CF. Applications of functional data analysis: A systematic review. BMC Med Res Methodol 2013; 13:43. [PMID: 23510439 PMCID: PMC3626842 DOI: 10.1186/1471-2288-13-43] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 03/04/2013] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Functional data analysis (FDA) is increasingly being used to better analyze, model and predict time series data. Key aspects of FDA include the choice of smoothing technique, data reduction, adjustment for clustering, functional linear modeling and forecasting methods. METHODS A systematic review using 11 electronic databases was conducted to identify FDA application studies published in the peer-review literature during 1995-2010. Papers reporting methodological considerations only were excluded, as were non-English articles. RESULTS In total, 84 FDA application articles were identified; 75.0% of the reviewed articles have been published since 2005. Application of FDA has appeared in a large number of publications across various fields of sciences; the majority is related to biomedicine applications (21.4%). Overall, 72 studies (85.7%) provided information about the type of smoothing techniques used, with B-spline smoothing (29.8%) being the most popular. Functional principal component analysis (FPCA) for extracting information from functional data was reported in 51 (60.7%) studies. One-quarter (25.0%) of the published studies used functional linear models to describe relationships between explanatory and outcome variables and only 8.3% used FDA for forecasting time series data. CONCLUSIONS Despite its clear benefits for analyzing time series data, full appreciation of the key features and value of FDA have been limited to date, though the applications show its relevance to many public health and biomedical problems. Wider application of FDA to all studies involving correlated measurements should allow better modeling of, and predictions from, such data in the future especially as FDA makes no a priori age and time effects assumptions.
Collapse
Affiliation(s)
- Shahid Ullah
- Flinders Centre for Epidemiology and Biostatistics, School of Medicine, Faculty of Health Sciences, Flinders University, Adelaide, SA, 5001, Australia
| | - Caroline F Finch
- Centre for Healthy and Safe Sports (CHASS), University of Ballarat, SMB Campus, Ballarat, VIC, 3353, Australia
| |
Collapse
|
11
|
Seoud L, Dansereau J, Labelle H, Cheriet F. Non invasive clinical assessment of trunk deformities associated with scoliosis. IEEE J Biomed Health Inform 2012; 17:392-401. [PMID: 23047883 DOI: 10.1109/titb.2012.2222425] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Besides the spinal deformity, scoliosis modifies notably the general appearance of the trunk resulting in trunk rotation, imbalance and asymmetries which constitutes patients' major concern. Existing classifications of scoliosis, based on the type of spinal curve as depicted on radiographs, are currently used to guide treatment strategies. Unfortunately, even though a perfect correction of the spinal curve is achieved, some trunk deformities remain, making patients dissatisfied with the treatment received. The purpose of this study is to identify possible shape patterns of trunk surface deformity associated with scoliosis. First, trunk surface is represented by a multivariate functional trunk shape descriptor based on 3D clinical measurements computed on cross sections of the trunk. Then, the classical formulation of hierarchical clustering is adapted to the case of multivariate functional data and applied to a set of 236 trunk surface 3D reconstructions. The highest internal validity is obtained when considering 11 clusters that explain up to 65% of the variance in our dataset. Our clustering result shows a concordance with the radiographic classification of spinal curves in 68% of the cases. As opposed to radiographic evaluation, the trunk descriptor is three-dimensional and its functional nature offers a compact and elegant description of not only the type, but also the severity and extent of the trunk surface deformity along the trunk length. In future work, new management strategies based on the resulting trunk shape patterns could be thought of in order to improve the esthetic outcome after treatment, and thus patients satisfaction.
Collapse
|
12
|
Multilevel analysis of trunk surface measurements for noninvasive assessment of scoliosis deformities. Spine (Phila Pa 1976) 2012; 37:E1045-53. [PMID: 22472809 DOI: 10.1097/brs.0b013e3182575938] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
STUDY DESIGN Reliability study. OBJECTIVES To assess between-acquisition reliability of new multilevel trunk cross sections measurements, in order to define what is a real change when comparing 2 trunk surface acquisitions of a same patient, before and after surgery or throughout the clinical monitoring. SUMMARY OF BACKGROUND DATA Several cross-sectional surface measurements have been proposed in the literature for noninvasive assessment of trunk deformity in patients with adolescent idiopathic scoliosis (AIS). However, only the maximum values along the trunk are evaluated and used for monitoring progression and assessing treatment outcome. METHODS Back surface rotation (BSR), trunk rotation (TR), and coronal and sagittal trunk deviation are computed on 300 cross sections of the trunk. Each set of 300 measures is represented as a single functional data, using a set of basis functions. To evaluate between-acquisition variability at all trunk levels, a test-retest reliability study is conducted on 35 patients with AIS. A functional correlation analysis is also carried out to evaluate any redundancy between the measurements. RESULTS Each set of 300 measures was successfully described using only 10 basis functions. The test-retest reliability of the functional measurements is good to very good all over the trunk, except above the shoulders level. The typical errors of measurement are between 1.20° and 2.2° for the rotational measures and between 2 and 6 mm for deviation measures. There is a very strong correlation between BSR and TR all over the trunk, a moderate correlation between coronal trunk deviation and both BSR and TR, and no correlation between sagittal trunk deviation and any other measurement. CONCLUSION This novel representation of trunk surface measurements allows for a global assessment of trunk surface deformity. Multilevel trunk measurements provide a broader perspective of the trunk deformity and allow a reliable multilevel monitoring during clinical follow-up of patients with AIS and a reliable assessment of the esthetic outcome after surgery.
Collapse
|
13
|
HIV-1 CRF01_AE coreceptor usage prediction using kernel methods based logistic model trees. Comput Biol Med 2012; 42:885-9. [PMID: 22824642 DOI: 10.1016/j.compbiomed.2012.06.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2011] [Revised: 05/26/2012] [Accepted: 06/24/2012] [Indexed: 11/20/2022]
Abstract
The determination of HIV-1 coreceptor usage plays a major role in HIV treatment. Since Maraviroc has been used in a treatment for patients those exclusively harbor R5-tropic strains, the efficient performance of classifying HIV-1 coreceptor usage can help choose the most advantaged HIV treatment. In general, HIV-1 variants are classified as R5-tropic and X4-tropic or dual/mixed tropic based on their coreceptor usages. The classification of the coreceptor usage has been developed by using the various computational methods or genotypic algorithms based on V3 amino acid sequences. Most genotypic tools have been designed based on a data set of the HIV-1 subtype B that seemed to be reliable only for this subtype. However, the performance of these tools decreases in non-B subtypes. In this study, the support vector machine (SVM) method has been used to classify the HIV-1 coreceptor. To develop an efficient SVM classifier, we present a feature selector using the logistic model tree (LMT) method to select the most relevant positions from the V3 amino acid sequences. Our approach achieves as high as 97.8% accuracy, 97.7% specificity, and 97.9% sensitivity measured by ten-fold cross-validation on 273 sequences.
Collapse
|
14
|
Douzal-Chouakria A, Diallo A, Giroud F. A random-periods model for the comparison of a metrics efficiency to classify cell-cycle expressed genes. Pattern Recognit Lett 2010. [DOI: 10.1016/j.patrec.2010.05.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
15
|
Allison DB, Visscher PM, Rosa GJ, Amos CI. Statistical genetics & statistical genomics: Where biology, epistemology, statistics, and computation collide. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2009.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
16
|
Shim J, Sohn I, Kim S, Lee JW, Green PE, Hwang C. Selecting marker genes for cancer classification using supervised weighted kernel clustering and the support vector machine. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.04.028] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
17
|
Sohn I, Shim J, Hwang C, Kim S, Lee JW. Informative transcription factor selection using support vector machine-based generalized approximate cross validation criteria. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
18
|
Douzal-Chouakria A, Diallo A, Giroud F. Adaptive clustering for time series: Application for identifying cell cycle expressed genes. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.11.031] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
19
|
Nielsen JD, Rumí R, Salmerón A. Supervised classification using probabilistic decision graphs. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.11.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
20
|
|