2501
|
Wang X, Simon R. Microarray-based cancer prediction using single genes. BMC Bioinformatics 2011; 12:391. [PMID: 21982331 PMCID: PMC3228540 DOI: 10.1186/1471-2105-12-391] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 10/07/2011] [Indexed: 11/23/2022] Open
Abstract
Background Although numerous methods of using microarray data analysis for cancer classification have been proposed, most utilize many genes to achieve accurate classification. This can hamper interpretability of the models and ease of translation to other assay platforms. We explored the use of single genes to construct classification models. We first identified the genes with the most powerful univariate class discrimination ability and then constructed simple classification rules for class prediction using the single genes. Results We applied our model development algorithm to eleven cancer gene expression datasets and compared classification accuracy to that for standard methods including Diagonal Linear Discriminant Analysis, k-Nearest Neighbor, Support Vector Machine and Random Forest. The single gene classifiers provided classification accuracy comparable to or better than those obtained by existing methods in most cases. We analyzed the factors that determined when simple single gene classification is effective and when more complex modeling is warranted. Conclusions For most of the datasets examined, the single-gene classification methods appear to work as well as more standard methods, suggesting that simple models could perform well in microarray-based cancer prediction.
Collapse
Affiliation(s)
- Xiaosheng Wang
- Biometric Research Branch, National Cancer Institute, National Institutes of Health, Rockville, MD 20852, USA
| | | |
Collapse
|
2502
|
Hyung SW, Lee MY, Yu JH, Shin B, Jung HJ, Park JM, Han W, Lee KM, Moon HG, Zhang H, Aebersold R, Hwang D, Lee SW, Yu MH, Noh DY. A serum protein profile predictive of the resistance to neoadjuvant chemotherapy in advanced breast cancers. Mol Cell Proteomics 2011; 10:M111.011023. [PMID: 21799047 PMCID: PMC3205875 DOI: 10.1074/mcp.m111.011023] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2011] [Revised: 07/20/2011] [Indexed: 11/06/2022] Open
Abstract
Prediction of the responses to neoadjuvant chemotherapy (NACT) can improve the treatment of patients with advanced breast cancer. Genes and proteins predictive of chemoresistance have been extensively studied in breast cancer tissues. However, noninvasive serum biomarkers capable of such prediction have been rarely exploited. Here, we performed profiling of N-glycosylated proteins in serum from fifteen advanced breast cancer patients (ten patients sensitive to and five patients resistant to NACT) to discover serum biomarkers of chemoresistance using a label-free liquid chromatography-tandem MS method. By performing a series of statistical analyses of the proteomic data, we selected thirteen biomarker candidates and tested their differential serum levels by Western blotting in 13 independent samples (eight patients sensitive to and five patients resistant to NACT). Among the candidates, we then selected the final set of six potential serum biomarkers (AHSG, APOB, C3, C9, CP, and ORM1) whose differential expression was confirmed in the independent samples. Finally, we demonstrated that a multivariate classification model using the six proteins could predict responses to NACT and further predict relapse-free survival of patients. In summary, global N-glycoproteome profile in serum revealed a protein pattern predictive of the responses to NACT, which can be further validated in large clinical studies.
Collapse
Affiliation(s)
- Seok-Won Hyung
- From the ‡Department of Chemistry, Korea University, Seoul, Republic of Korea
| | - Min Young Lee
- §School of Interdisciplinary Bioscience and Bioengineering, POSTECH, Pohang, Republic of Korea
- ¶Functional Proteomics Center, Korea Institute of Science and Technology, Seoul, Republic of Korea
| | - Jong-Han Yu
- ‖Department of Surgery, College of Medicine, Seoul National University, Seoul, Republic of Korea
- **Cancer Research Institute, College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Byunghee Shin
- ¶Functional Proteomics Center, Korea Institute of Science and Technology, Seoul, Republic of Korea
| | - Hee-Jung Jung
- From the ‡Department of Chemistry, Korea University, Seoul, Republic of Korea
| | - Jong-Moon Park
- From the ‡Department of Chemistry, Korea University, Seoul, Republic of Korea
| | - Wonshik Han
- ‖Department of Surgery, College of Medicine, Seoul National University, Seoul, Republic of Korea
- **Cancer Research Institute, College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Kyung-Min Lee
- **Cancer Research Institute, College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Hyeong-Gon Moon
- ‖Department of Surgery, College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Hui Zhang
- Department of Pathology, Clinical Chemistry Division 1550 Orleans Street Cancer Research Building II, Room 3M-03 Baltimore, MD 21231, USA
| | - Ruedi Aebersold
- ‡‡The Institute for Systems Biology, Seattle, WA
- §§Institute for Molecular Systems Biology, ETH and Faculty of Natural Science, University of Zurich, Zurich, Switzerland
| | - Daehee Hwang
- §School of Interdisciplinary Bioscience and Bioengineering, POSTECH, Pohang, Republic of Korea
- ¶¶Division of Integrative Biosciences and Biotechnology, POSTECH, Pohang, Republic of Korea; and
- ‖‖Department of Chemical Eng. POSTECH, Pohang, Republic of Korea
| | - Sang-Won Lee
- From the ‡Department of Chemistry, Korea University, Seoul, Republic of Korea
| | - Myeong-Hee Yu
- ¶Functional Proteomics Center, Korea Institute of Science and Technology, Seoul, Republic of Korea
| | - Dong-Young Noh
- ‖Department of Surgery, College of Medicine, Seoul National University, Seoul, Republic of Korea
- **Cancer Research Institute, College of Medicine, Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
2503
|
|
2504
|
Van Ness PH, Fried TR, Gill TM. Mixed Methods for the Interpretation of Longitudinal Gerontologic Data: Insights From Philosophical Hermeneutics. JOURNAL OF MIXED METHODS RESEARCH 2011; 5:293-308. [PMID: 22582035 PMCID: PMC3347468 DOI: 10.1177/1558689811412973] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
This article's main objective is to demonstrate that data analysis, including quantitative data analysis, is a process of interpretation involving basic hermeneutic principles that philosophers have identified in the interpretive process as applied to other, mainly literary, creations. Such principles include a version of the hermeneutic circle, an insistence on interpretive presuppositions, and a resistance to reducing the discovery of truth to the application of inductive methods. The importance of interpretation becomes especially evident when qualitative and quantitative methods are combined in a single clinical research project and when the data being analyzed are longitudinal. Study objectives will be accomplished by showing that three major hermeneutic principles make practical methodological contributions to an insightful, illustrative mixed methods analysis of a qualitative study of changes in functional disability over time embedded in the Precipitating Events Project-a major longitudinal, quantitative study of functional disability among older persons. Mixed methods, especially as shaped by hermeneutic insights such as the importance of empathetic understanding, are potentially valuable resources for scientific investigations of the experience of aging: a practical aim of this article is to articulate and demonstrate this contention.
Collapse
Affiliation(s)
| | - Terri R. Fried
- Yale University, New Haven, CT, USA
- Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
| | | |
Collapse
|
2505
|
Zeiss CJ, Ward JM, Allore HG. Designing phenotyping studies for genetically engineered mice. Vet Pathol 2011; 49:24-31. [PMID: 21930803 DOI: 10.1177/0300985811417247] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
A phenotyping study records physiologic or morphologic changes in an experimental animal resulting from an intervention. In mice, this intervention is most frequently genetic, but it may be any type of experimental manipulation. Accurate representation of the human condition under study is essential if the model is to yield useful conclusions. In this review, general approaches to the design of phenotyping studies are considered. These approaches take into account major sources of reduced model validity, such as unexpected phenotypic variation in mice, evolutionary divergence between mice and humans, unanticipated sources of variation, and common design errors. As poor design is the most common reason why studies fail to yield enduring results, emphasis is placed on reduction of bias, sampling, controlled study design, and appropriate statistical analysis.
Collapse
Affiliation(s)
- C J Zeiss
- Section of Comparative Medicine, Yale University School of Medicine, TAC N230, New Haven, CT 06520, USA.
| | | | | |
Collapse
|
2506
|
Yang Y, Cheng G. Comment. J Am Stat Assoc 2011. [DOI: 10.1198/jasa.2011.tm11323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
2507
|
Brunner G, Nambi V, Yang E, Kumar A, Virani SS, Kougias P, Shah D, Lumsden A, Ballantyne CM, Morrisett JD. Automatic quantification of muscle volumes in magnetic resonance imaging scans of the lower extremities. Magn Reson Imaging 2011; 29:1065-75. [PMID: 21855242 DOI: 10.1016/j.mri.2011.02.033] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2010] [Revised: 12/21/2010] [Accepted: 02/20/2011] [Indexed: 11/30/2022]
Abstract
Muscle volume measurements are essential for an array of diseases ranging from peripheral arterial disease, muscular dystrophies, neurological conditions to sport injuries and aging. In the clinical setting, muscle volume is not routinely measured due to the lack of standardized ways for its repeatable quantification. In this paper, we present magnetic resonance muscle quantification (MRMQ), a method for the automatic quantification of thigh muscle volume in magnetic resonance imaging (MRI) scans. MRMQ integrates a thigh segmentation and nonuniform image gradient correction step, followed by feature extraction and classification. The classification step leverages prior probabilities, introducing prior knowledge to a maximum a posteriori classifier. MRMQ was validated on 344 slices taken from 60 MRI scans. Experiments for the fully automatic detection of muscle volume in MRI scans demonstrated an averaged accuracy, sensitivity and specificity for leave-one-out cross-validation of 88.3%, 93.6% and 87.2%, respectively.
Collapse
Affiliation(s)
- Gerd Brunner
- Division of Atherosclerosis and Vascular Medicine, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
2508
|
Capizzi G, Masarotto G. A Least Angle Regression Control Chart for Multidimensional Data. Technometrics 2011. [DOI: 10.1198/tech.2011.10027] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
2509
|
Abstract
Many common human diseases and complex traits are highly heritable and influenced by multiple genetic and environmental factors. Although genome-wide association studies (GWAS) have successfully identified many disease-associated variants, these genetic variants explain only a small proportion of the heritability of most complex diseases. Genetic interactions (gene-gene and gene-environment) substantially contribute to complex traits and diseases and could be one of the main sources of the missing heritability. This paper provides an overview of the available statistical methods and related computer software for identifying genetic interactions in animal and plant experimental crosses and human genetic association studies. The main discussion falls under the three broad issues in statistical analysis of genetic interactions: the definition, detection and interpretation of genetic interactions. Recently developed methods based on modern techniques for high-dimensional data are reviewed, including penalized likelihood approaches and hierarchical models; the relationships between these methods are also discussed. I conclude this review by highlighting some areas of future research.
Collapse
|
2510
|
|
2511
|
Chang AY, Piette EW, Foering KP, Tenhave TR, Okawa J, Werth VP. Response to antimalarial agents in cutaneous lupus erythematosus: a prospective analysis. ACTA ACUST UNITED AC 2011; 147:1261-7. [PMID: 21768444 DOI: 10.1001/archdermatol.2011.191] [Citation(s) in RCA: 100] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
OBJECTIVE To demonstrate response to antimalarial agents in patients with cutaneous lupus erythematosus (CLE) using activity scores from the Cutaneous Lupus Erythematosus Disease Area and Severity Index, a validated outcome measure. DESIGN Prospective, longitudinal cohort study. SETTING University cutaneous autoimmune disease clinic. PARTICIPANTS A total of 128 patients with CLE who presented from January 2007 to July 2010 and had at least 2 visits with activity scores. INTERVENTION Administration of antimalarial agents. MAIN OUTCOME MEASURES Response was defined by a 4-point or 20% decrease in activity score. Response to initiation was determined by the difference between the scores before treatment and at the first visit at least 2 months after treatment. Response to continuation was determined by the difference between the scores at the first visit and the most recent visit while undergoing treatment. RESULTS Of 11 patients who initiated treatment with hydroxychloroquine, 55% were responders (n = 6), showing a decrease in median (interquartile range [IQR]) activity score from 8.0 (3.5-13.0) to 3.0 (1.8-7.3) (P = .03). Of 15 patients for whom hydroxychloroquine failed, 67% were responders to initiation of hydroxychloroquine-quinacrine therapy (n = 10), showing a decrease in median (IQR) activity score from 6.0 (4.8-8.3) to 3.0 (0.75-5.0) (P = .004). Nine of 21 patients who continued hydroxychloroquine treatment (43%), and 9 of 21 patients who continued hydroxychloroquine-quinacrine (43%) were responders, showing a decrease in median (IQR) activity score from 6.0 (1.5-9.5) to 1.0 (0.0-4.5) (P = .01) and 8.5 (4.25-17.5) to 5.0 (0.5-11.5) (P = .01), respectively. CONCLUSIONS The use of quinacrine with hydroxychloroquine is associated with response in patients for whom hydroxychloroquine monotherapy fails. Further reduction in disease activity can be associated with continuation of treatment with antimalarial agents.
Collapse
Affiliation(s)
- Aileen Y Chang
- Philadelphia Veterans Affairs Medical Center, Philadelphia, Pennsylvania, USA
| | | | | | | | | | | |
Collapse
|
2512
|
Li X, Coyle D, Maguire L, McGinnity TM, Benali H. A model selection method for nonlinear system identification based FMRI effective connectivity analysis. IEEE TRANSACTIONS ON MEDICAL IMAGING 2011; 30:1365-1380. [PMID: 21335308 DOI: 10.1109/tmi.2011.2116034] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
In this paper a model selection algorithm for a nonlinear system identification method is proposed to study functional magnetic resonance imaging (fMRI) effective connectivity. Unlike most other methods, this method does not need a pre-defined structure/model for effective connectivity analysis. Instead, it relies on selecting significant nonlinear or linear covariates for the differential equations to describe the mapping relationship between brain output (fMRI response) and input (experiment design). These covariates, as well as their coefficients, are estimated based on a least angle regression (LARS) method. In the implementation of the LARS method, Akaike's information criterion corrected (AICc) algorithm and the leave-one-out (LOO) cross-validation method were employed and compared for model selection. Simulation comparison between the dynamic causal model (DCM), nonlinear identification method, and model selection method for modelling the single-input-single-output (SISO) and multiple-input multiple-output (MIMO) systems were conducted. Results show that the LARS model selection method is faster than DCM and achieves a compact and economic nonlinear model simultaneously. To verify the efficacy of the proposed approach, an analysis of the dorsal and ventral visual pathway networks was carried out based on three real datasets. The results show that LARS can be used for model selection in an fMRI effective connectivity study with phase-encoded, standard block, and random block designs. It is also shown that the LOO cross-validation method for nonlinear model selection has less residual sum squares than the AICc algorithm for the study.
Collapse
Affiliation(s)
- Xingfeng Li
- INSERM, UPMC Université Paris 06, UMR_S 678, Laboratoire d’Imagerie Fonctionnelle, Paris Cedex 13, France.
| | | | | | | | | |
Collapse
|
2513
|
|
2514
|
|
2515
|
|
2516
|
Zhao Z. Nonparametric model validations for hidden Markov models with applications in financial econometrics. JOURNAL OF ECONOMETRICS 2011; 162:225-239. [PMID: 21750601 PMCID: PMC3132196 DOI: 10.1016/j.jeconom.2011.01.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
We address the nonparametric model validation problem for hidden Markov models with partially observable variables and hidden states. We achieve this goal by constructing a nonparametric simultaneous confidence envelope for transition density function of the observable variables and checking whether the parametric density estimate is contained within such an envelope. Our specification test procedure is motivated by a functional connection between the transition density of the observable variables and the Markov transition kernel of the hidden states. Our approach is applicable for continuous time diffusion models, stochastic volatility models, nonlinear time series models, and models with market microstructure noise.
Collapse
Affiliation(s)
- Zhibiao Zhao
- Department of Statistics, Penn State University, University Park, PA 16802, United States
| |
Collapse
|
2517
|
|
2518
|
Liang Y, Kelemen A. Sequential Support Vector Regression with Embedded Entropy for SNP Selection and Disease Classification. Stat Anal Data Min 2011; 4:301-312. [PMID: 21666834 DOI: 10.1002/sam.10110] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Comprehensive evaluation of common genetic variations through association of SNP structure with common diseases on the genome-wide scale is currently a hot area in human genome research. For less costly and faster diagnostics, advanced computational approaches are needed to select the minimum SNPs with the highest prediction accuracy for common complex diseases. In this paper, we present a sequential support vector regression model with embedded entropy algorithm to deal with the redundancy for the selection of the SNPs that have best prediction performance of diseases. We implemented our proposed method for both SNP selection and disease classification, and applied it to simulation data sets and two real disease data sets. Results show that on the average, our proposed method outperforms the well known methods of Support Vector Machine Recursive Feature Elimination, logistic regression, CART, and logic regression based SNP selections for disease classification.
Collapse
Affiliation(s)
- Yulan Liang
- Department of Family and Community Health, University of Maryland, Baltimore 655 W. Lombard Street, Baltimore, MD 21201-1579
| | | |
Collapse
|
2519
|
Pruzek RM. Introduction to the Special Issue on Propensity Score Methods in Behavioral Research. MULTIVARIATE BEHAVIORAL RESEARCH 2011; 46:389-398. [PMID: 26735882 DOI: 10.1080/00273171.2011.576618] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
This issue includes six articles that present logic, methods, and models for causal analyses of observational data, in particular those based on propensity score (PS) methods. The articles include a general introduction to propensity score analysis (PSA), uses of PSA in mediation studies, issues involved in choosing covariates, challenges that often arise in PSA applications, hierarchical data issues and models, and an application in an educational testing context. In this editorial I briefly summarize each article and make a few recommendations that relate to future applications in this field: the first pertains to how propensity score (PS) work could profit by connecting it with stronger forms of randomized experiments, not just simple randomization; the second to how and why graphical methods could be used to greater advantage in PSA studies; then why it might be helpful to reconsider the meaning of the term "treatments" in observational studies and why conventional usage might be modified; and finally, to the distinction between retrospective and prospective approaches to observational study design, noting the advantages, when feasible, of the latter approach.
Collapse
|
2520
|
Marra G, Radice R. Estimation of a semiparametric recursive bivariate probit model in the presence of endogeneity. CAN J STAT 2011. [DOI: 10.1002/cjs.10100] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
2521
|
Shalizi CR, Thomas AC. Homophily and Contagion Are Generically Confounded in Observational Social Network Studies. SOCIOLOGICAL METHODS & RESEARCH 2011; 40:211-239. [PMID: 22523436 PMCID: PMC3328971 DOI: 10.1177/0049124111404820] [Citation(s) in RCA: 247] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
The authors consider processes on social networks that can potentially involve three factors: homophily, or the formation of social ties due to matching individual traits; social contagion, also known as social influence; and the causal effect of an individual's covariates on his or her behavior or other measurable responses. The authors show that generically, all of these are confounded with each other. Distinguishing them from one another requires strong assumptions on the parametrization of the social process or on the adequacy of the covariates used (or both). In particular the authors demonstrate, with simple examples, that asymmetries in regression coefficients cannot identify causal effects and that very simple models of imitation (a form of social contagion) can produce substantial correlations between an individual's enduring traits and his or her choices, even when there is no intrinsic affinity between them. The authors also suggest some possible constructive responses to these results.
Collapse
|
2522
|
Prediction of body mass index in mice using dense molecular markers and a regularized neural network. Genet Res (Camb) 2011; 93:189-201. [PMID: 21481292 DOI: 10.1017/s0016672310000662] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Bayesian regularization of artificial neural networks (BRANNs) were used to predict body mass index (BMI) in mice using single nucleotide polymorphism (SNP) markers. Data from 1896 animals with both phenotypic and genotypic (12 320 loci) information were used for the analysis. Missing genotypes were imputed based on estimated allelic frequencies, with no attempt to reconstruct haplotypes based on family information or linkage disequilibrium between markers. A feed-forward multilayer perceptron network consisting of a single output layer and one hidden layer was used. Training of the neural network was done using the Bayesian regularized backpropagation algorithm. When the number of neurons in the hidden layer was increased, the number of effective parameters, γ, increased up to a point and stabilized thereafter. A model with five neurons in the hidden layer produced a value of γ that saturated the data. In terms of predictive ability, a network with five neurons in the hidden layer attained the smallest error and highest correlation in the test data although differences among networks were negligible. Using inherent weight information of BRANN with different number of neurons in the hidden layer, it was observed that 17 SNPs had a larger impact on the network, indicating their possible relevance in prediction of BMI. It is concluded that BRANN may be at least as useful as other methods for high-dimensional genome-enabled prediction, with the advantage of its potential ability of capturing non-linear relationships, which may be useful in the study of quantitative traits under complex gene action.
Collapse
|
2523
|
Morin AJS, Maïano C, Marsh HW, Janosz M, Nagengast B. The Longitudinal Interplay of Adolescents' Self-Esteem and Body Image: A Conditional Autoregressive Latent Trajectory Analysis. MULTIVARIATE BEHAVIORAL RESEARCH 2011; 46:157-201. [PMID: 26741327 DOI: 10.1080/00273171.2010.546731] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Self-esteem and body image are central to coping successfully with the developmental challenges of adolescence. However, the current knowledge surrounding self-esteem and body image is fraught with controversy. This study attempts to clarify some of them by addressing three questions: (1) Are the intraindividual developmental trajectories of self-esteem and body image stable across adolescence? (2) What is the direction of the relations between body image and self-esteem over time? (3) What is the role of gender, ethnicity, and pubertal development on those trajectories? This study relies on Autoregressive Latent Trajectory analyses based on data from a 4-year, 6-wave, prospective longitudinal study of 1,001 adolescents. Self-esteem and body image levels remained high and stable over time, although body image levels also tended to increase slightly. The results show that levels of self-esteem were positively influenced by levels of body image. However, these effects remained small and most of the observed associations were cross-sectional. Finally, the effects of pubertal development on body image and self-esteem levels were mostly limited to non-Caucasian females who appeared to benefit from more advanced pubertal development. Conversely, Caucasian females presented the lowest self-esteem and body image levels of all, although for them more advanced pubertal development levels were associated with a slight rise in body image over time.
Collapse
Affiliation(s)
- Alexandre J S Morin
- a Department of Psychology , University of Sherbrooke and Center for Educational Research (University of Western Sydney)
| | - Christophe Maïano
- b Institute of Movement Sciences Etienne-Jules Marey (UMR 6233), University of Aix-Marseille II
| | - Herbert W Marsh
- c Department of Education , University of Oxford and Center for Educational Research (University of Western Sydney)
| | - Michel Janosz
- d School Environment Research Group, School of Psychoeducation, University of Montreal
| | | |
Collapse
|
2524
|
Zhang R, Phoa FKH, Mukerjee R, Xu H. A trigonometric approach to quaternary code designs with application to one-eighth and one-sixteenth fractions. Ann Stat 2011. [DOI: 10.1214/10-aos815] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
2525
|
Nogueira RC, Costa AMM, Silva IDCG, Carvalho CV, Maganhin C, Baracat EC, Soares JM. Influence of the CYP17 polymorphism on vasomotor symptoms in postmenopausal women: a pilot study. Climacteric 2011; 14:537-43. [DOI: 10.3109/13697137.2010.548668] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
2526
|
Koehler ML, Bondell HD, Tzeng JY. Evaluating haplotype effects in case-control studies via penalized-likelihood approaches: prospective or retrospective analysis? Genet Epidemiol 2011; 34:892-911. [PMID: 21104891 DOI: 10.1002/gepi.20545] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Penalized likelihood methods have become increasingly popular in recent years for evaluating haplotype-phenotype association in case-control studies. Although a retrospective likelihood is dictated by the sampling scheme, these penalized methods are typically built on prospective likelihoods due to their modeling simplicity and computational feasibility. It has been well documented that for unpenalized methods, prospective analyses of case-control data can be valid but less efficient than their retrospective counterparts when testing for association, and result in substantial bias when estimating the haplotype effects. For penalized methods, which combine effect estimation and testing in one step, the impact of using a prospective likelihood is not clear. In this work, we examine the consequences of ignoring the sampling scheme for haplotype-based penalized likelihood methods. Our results suggest that the impact of prospective analyses depends on (1) the underlying genetic mode and (2) the genetic model adopted in the analysis. When the correct genetic model is used, the difference between the two analyses is negligible for additive and slight for dominant haplotype effects. For recessive haplotype effects, the more appropriate retrospective likelihood clearly outperforms the prospective likelihood. If an additive model is incorrectly used, as the true underlying genetic mode is unknown a priori, both retrospective and prospective penalized methods suffer from a sizeable power loss and increase in bias. The impact of using the incorrect genetic model is much bigger on retrospective analyses than prospective analyses, and results in comparable performances for both methods. An application of these methods to the Genetic Analysis Workshop 15 rheumatoid arthritis data is provided.
Collapse
Affiliation(s)
- Megan L Koehler
- Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | | | |
Collapse
|
2527
|
An attempt to use ectoparasites as tags for habitat occupancy by small mammalian hosts in central Europe: effects of host gender, parasite taxon and season. Parasitology 2011; 138:609-18. [PMID: 21320388 DOI: 10.1017/s0031182011000102] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
OBJECTIVE We used data on fleas and gamasid mites parasitic on 8 species of small mammals to test whether (a) species composition of ectoparasite infracommunities may be used to predict host habitat occupancy and (b) the accuracy of this prediction differs between ectoparasite taxa, host genders and seasons. METHODS We used a Random Forests algorithm that is based on the methodology of classification trees. RESULTS The accuracy of prediction of habitat occupancy was relatively low and varied substantially among host species. The combined rate of the correct prediction of host habitat occupancy from data on ectoparasites was significantly higher than 50%, albeit being relatively low. The accuracy of prediction (a) did not differ between male and female hosts when it was based on species composition of fleas in summer or of mites in summer and winter, (b) was significantly higher in male hosts than in female hosts when the winter data on fleas were used and (c) was significantly higher for flea than mite assemblages. The effect of season was found in mites but not in fleas with the accuracy of prediction being significantly higher in summer than in winter assemblages. CONCLUSIONS Ectoparasites appeared to be not especially useful as biological markers for distinguishing host populations in different habitats in temperate zones.
Collapse
|
2528
|
Blood profile of proteins and steroid hormones predicts weight change after weight loss with interactions of dietary protein level and glycemic index. PLoS One 2011; 6:e16773. [PMID: 21340022 PMCID: PMC3038864 DOI: 10.1371/journal.pone.0016773] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2010] [Accepted: 12/30/2010] [Indexed: 11/19/2022] Open
Abstract
Background Weight regain after weight loss is common. In the Diogenes dietary intervention study, high protein and low glycemic index (GI) diet improved weight maintenance. Objective To identify blood predictors for weight change after weight loss following the dietary intervention within the Diogenes study. Design Blood samples were collected at baseline and after 8-week low caloric diet-induced weight loss from 48 women who continued to lose weight and 48 women who regained weight during subsequent 6-month dietary intervention period with 4 diets varying in protein and GI levels. Thirty-one proteins and 3 steroid hormones were measured. Results Angiotensin I converting enzyme (ACE) was the most important predictor. Its greater reduction during the 8-week weight loss was related to continued weight loss during the subsequent 6 months, identified by both Logistic Regression and Random Forests analyses. The prediction power of ACE was influenced by immunoproteins, particularly fibrinogen. Leptin, luteinizing hormone and some immunoproteins showed interactions with dietary protein level, while interleukin 8 showed interaction with GI level on the prediction of weight maintenance. A predictor panel of 15 variables enabled an optimal classification by Random Forests with an error rate of 24±1%. A logistic regression model with independent variables from 9 blood analytes had a prediction accuracy of 92%. Conclusions A selected panel of blood proteins/steroids can predict the weight change after weight loss. ACE may play an important role in weight maintenance. The interactions of blood factors with dietary components are important for personalized dietary advice after weight loss. Registration ClinicalTrials.gov NCT00390637
Collapse
|
2529
|
Casanova R, Espeland MA, Goveas JS, Davatzikos C, Gaussoin SA, Maldjian JA, Brunner RL, Kuller LH, Johnson KC, Mysiw WJ, Wagner B, Resnick SM. Application of machine learning methods to describe the effects of conjugated equine estrogens therapy on region-specific brain volumes. Magn Reson Imaging 2011; 29:546-53. [PMID: 21292420 DOI: 10.1016/j.mri.2010.12.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2010] [Revised: 10/18/2010] [Accepted: 12/02/2010] [Indexed: 11/25/2022]
Abstract
Use of conjugated equine estrogens (CEE) has been linked to smaller regional brain volumes in women aged ≥65 years; however, it is unknown whether this results in a broad-based characteristic pattern of effects. Structural magnetic resonance imaging was used to assess regional volumes of normal tissue and ischemic lesions among 513 women who had been enrolled in a randomized clinical trial of CEE therapy for an average of 6.6 years, beginning at ages 65-80 years. A multivariate pattern analysis, based on a machine learning technique that combined Random Forest and logistic regression with L(1) penalty, was applied to identify patterns among regional volumes associated with therapy and whether patterns discriminate between treatment groups. The multivariate pattern analysis detected smaller regional volumes of normal tissue within the limbic and temporal lobes among women that had been assigned to CEE therapy. Mean decrements ranged as high as 7% in the left entorhinal cortex and 5% in the left perirhinal cortex, which exceeded the effect sizes reported previously in frontal lobe and hippocampus. Overall accuracy of classification based on these patterns, however, was projected to be only 54.5%. Prescription of CEE therapy for an average of 6.6 years is associated with lower regional brain volumes, but it does not induce a characteristic spatial pattern of changes in brain volumes of sufficient magnitude to discriminate users and nonusers.
Collapse
Affiliation(s)
- Ramon Casanova
- Department of Biostatistical Sciences, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2530
|
A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies. BMC Bioinformatics 2011; 12:16. [PMID: 21226914 PMCID: PMC3033325 DOI: 10.1186/1471-2105-12-16] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2010] [Accepted: 01/12/2011] [Indexed: 11/10/2022] Open
Abstract
Background Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1 million genetic markers) and its complexity make the statistical analysis a challenging task. Results We present an accurate modeling of dependences between genetic markers, based on a forest of hierarchical latent class models which is a particular class of probabilistic graphical models. This model offers an adapted framework to deal with the fuzzy nature of linkage disequilibrium blocks. In addition, the data dimensionality can be reduced through the latent variables of the model which synthesize the information borne by genetic markers. In order to tackle the learning of both forest structure and probability distributions, a generic algorithm has been proposed. A first implementation of our algorithm has been shown to be tractable on benchmarks describing 105 variables for 2000 individuals. Conclusions The forest of hierarchical latent class models offers several advantages for genome-wide association studies: accurate modeling of linkage disequilibrium, flexible data dimensionality reduction and biological meaning borne by latent variables.
Collapse
|
2531
|
Commenges D, Jolly D, Drylewicz J, Putter H, Thiébaut R. Inference in HIV dynamics models via hierarchical likelihood. Comput Stat Data Anal 2011. [DOI: 10.1016/j.csda.2010.05.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
2532
|
van de Geer S, Bühlmann P, Zhou S. The adaptive and the thresholded Lasso for potentially misspecified models (and a lower bound for the Lasso). Electron J Stat 2011. [DOI: 10.1214/11-ejs624] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
2533
|
Chen X, Wang M, Zhang H. The use of classification trees for bioinformatics. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY 2011; 1:55-63. [PMID: 22523608 PMCID: PMC3329156 DOI: 10.1002/widm.14] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Classification trees are non-parametric statistical learning methods that incorporate feature selection and interactions, possess intuitive interpretability, are efficient, and have high prediction accuracy when used in ensembles. This paper provides a brief introduction to the classification tree-based methods, a review of the recent developments, and a survey of the applications in bioinformatics and statistical genetics.
Collapse
|
2534
|
Hoffman S, Podgurski A. Improving health care outcomes through personalized comparisons of treatment effectiveness based on electronic health records. THE JOURNAL OF LAW, MEDICINE & ETHICS : A JOURNAL OF THE AMERICAN SOCIETY OF LAW, MEDICINE & ETHICS 2011; 39:425-436. [PMID: 21871040 DOI: 10.1111/j.1748-720x.2011.00612.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Comparative effectiveness research (CER) is one of the Patient Protection and Affordable Care Act's significant initiatives that aims to improve treatment outcomes and lower health care costs. This article takes CER a step further and suggests a novel clinical application for it. The article proposes the development of a national framework to enable physicians to rapidly perform, through a computerized service, medically sound personalized comparisons of the effectiveness of possible treatments for patients' conditions. A treatment comparison for a given patient would be based on data from electronic health records of a cohort of clinically similar patients who received the treatments previously and whose outcomes were recorded. This framework has unique potential to simultaneously improve the quality of health care, reduce its cost, and alleviate public concerns about rationing and "one size fits all" medicine.
Collapse
|
2535
|
|
2536
|
Gage TB, Fang F, O'Neill EK, DiRienzo AG. Racial disparities in infant mortality: what has birth weight got to do with it and how large is it? BMC Pregnancy Childbirth 2010; 10:86. [PMID: 21189146 PMCID: PMC3025864 DOI: 10.1186/1471-2393-10-86] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2010] [Accepted: 12/28/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND It has been hypothesized that birth weight is not on the causal pathway to infant mortality, at least among "normal" births (i.e. those located in the central part of the birth weight distribution), and that US racial disparities (African American versus European American) may be underestimated. Here these hypotheses are tested by examining the role of birth weight on racial disparities in infant mortality. METHODS A two-component Covariate Density Defined mixture of logistic regressions model is used to decompose racial disparities, 1) into disparities due to "normal" versus "compromised" components of the birth cohort, and 2) further decompose these components into indirect effects, which are associated with birth weight, versus direct effects, which are independent of birth weight. RESULTS The results indicate that a direct effect is responsible for the racial disparity in mortality among "normal" births. No indirect effect of birth weight is observed despite significant disparities in birth weight. Among "compromised" births, an indirect effect is responsible for the disparity, which is consistent with disparities in birth weight. However, there is also a direct effect among "compromised" births that reduces the racial disparity in mortality. This direct effect is responsible for the "pediatric paradox" and maybe due to differential fetal loss. Model-based adjustment for this effect indicates that racial disparities corrected for fetal loss could be as high as 3 or 4 fold. This estimate is higher than the observed racial disparities in infant mortality (2.1 for both sexes). CONCLUSIONS The results support the hypothesis that birth weight is not on the causal pathway to infant mortality among "normal" births, although birth weight could play a role among "compromised" births. The overall size of the US racial disparities in infant mortality maybe considerably underestimated in the observed data possibly due to racial disparities in fetal loss.
Collapse
Affiliation(s)
- Timothy B Gage
- Department of Anthropology, University at Albany, State University of New York, 1400 Washington Ave., Albany, NY 12222, USA.
| | | | | | | |
Collapse
|
2537
|
Ayers KL, Cordell HJ. SNP selection in genome-wide and candidate gene studies via penalized logistic regression. Genet Epidemiol 2010; 34:879-91. [PMID: 21104890 PMCID: PMC3410531 DOI: 10.1002/gepi.20543] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2010] [Revised: 08/25/2010] [Accepted: 09/10/2010] [Indexed: 11/07/2022]
Abstract
Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true pertinent predictors. Here we explore the performance of penalization in selecting SNPs as predictors in genetic association studies. The strength of the penalty can be chosen either to select a good predictive model (via methods such as computationally expensive cross validation), through maximum likelihood-based model selection criterion (such as the BIC), or to select a model that controls for type I error, as done here. We have investigated the performance of several penalized logistic regression approaches, simulating data under a variety of disease locus effect size and linkage disequilibrium patterns. We compared several penalties, including the elastic net, ridge, Lasso, MCP and the normal-exponential-γ shrinkage prior implemented in the hyperlasso software, to standard single locus analysis and simple forward stepwise regression. We examined how markers enter the model as penalties and P-value thresholds are varied, and report the sensitivity and specificity of each of the methods. Results show that penalized methods outperform single marker analysis, with the main difference being that penalized methods allow the simultaneous inclusion of a number of markers, and generally do not allow correlated variables to enter the model, producing a sparse model in which most of the identified explanatory markers are accounted for.
Collapse
Affiliation(s)
- Kristin L Ayers
- Institute of Human Genetics, Central Parkway, Newcastle upon Tyne, United Kingdom.
| | | |
Collapse
|
2538
|
Heisey DM, Osnas EE, Cross PC, Joly DO, Langenberg JA, Miller MW. Rejoinder: sifting through model space. Ecology 2010. [DOI: 10.1890/10-0894.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- Dennis M. Heisey
- USGS, National Wildlife Health Center, Madison, Wisconsin 53711 USA
| | - Erik E. Osnas
- Department of Forest and Wildlife Ecology, University of Wisconsin, 1630 Linden Drive, Madison, Wisconsin 52706 USA
| | - Paul C. Cross
- USGS, Northern Rocky Mountain Science Center, Bozeman, Montana 59717 USA
| | - Damien O. Joly
- Global Health Programs, Wildlife Conservation Society, 1008 Beverly Drive, Nanaimo, British Columbia V9S 2S4 Canada
| | - Julia A. Langenberg
- Wisconsin Department of Natural Resources, 101 South Webster Street, Madison, Wisconsin 53703 USA
| | - Michael W. Miller
- Colorado Division of Wildlife, Wildlife Research Center, 317 West Prospect Road, Fort Collins, Colorado 80526-2097 USA
| |
Collapse
|
2539
|
Abstract
We systematically mistreat psychological phenomena, both logically and clinically. This article explores three contentions: that the dominant discourse in modern cognitive, affective, and clinical neuroscience assumes that we know how psychology/biology causation works when we do not; that there are serious intellectual, clinical, and policy costs to pretending we do know; and that crucial scientific and clinical progress will be stymied as long as we frame psychology, biology, and their relationship in currently dominant ways. The arguments are developed with emphasis on misguided attempts to localize psychological function via neuroimaging, misunderstandings about the role of genetics in psychopathology, and untoward constraints on health-care policy and clinical service delivery. A particular challenge, articulated but not resolved in this article, is determining what constitutes adequate explanation in the relationship between psychology and biology.
Collapse
Affiliation(s)
- Gregory A Miller
- Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, IL, and Zukunfstkolleg, University of Konstanz, Konstanz, Germany
| |
Collapse
|
2540
|
|
2541
|
Affiliation(s)
- Piotr Fryzlewicz
- Department of Statistics, London School of Economics, University of London, London WC2A 2AE, UK
| |
Collapse
|
2542
|
Carstensen L, Sandelin A, Winther O, Hansen NR. Multivariate Hawkes process models of the occurrence of regulatory elements. BMC Bioinformatics 2010; 11:456. [PMID: 20828413 PMCID: PMC2949889 DOI: 10.1186/1471-2105-11-456] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2010] [Accepted: 09/09/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A central question in molecular biology is how transcriptional regulatory elements (TREs) act in combination. Recent high-throughput data provide us with the location of multiple regulatory regions for multiple regulators, and thus with the possibility of analyzing the multivariate distribution of the occurrences of these TREs along the genome. RESULTS We present a model of TRE occurrences known as the Hawkes process. We illustrate the use of this model by analyzing two different publically available data sets. We are able to model, in detail, how the occurrence of one TRE is affected by the occurrences of others, and we can test a range of natural hypotheses about the dependencies among the TRE occurrences. In contrast to earlier efforts, pre-processing steps such as clustering or binning are not needed, and we thus retain information about the dependencies among the TREs that is otherwise lost. For each of the two data sets we provide two results: first, a qualitative description of the dependencies among the occurrences of the TREs, and second, quantitative results on the favored or avoided distances between the different TREs. CONCLUSIONS The Hawkes process is a novel way of modeling the joint occurrences of multiple TREs along the genome that is capable of providing new insights into dependencies among elements involved in transcriptional regulation. The method is available as an R package from http://www.math.ku.dk/~richard/ppstat/.
Collapse
Affiliation(s)
- Lisbeth Carstensen
- Department of Mathematical Sciences, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen Ø, Denmark
| | | | | | | |
Collapse
|
2543
|
Cattaert T, Calle ML, Dudek SM, Mahachie John JM, Van Lishout F, Urrea V, Ritchie MD, Van Steen K. Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise. Ann Hum Genet 2010; 75:78-89. [PMID: 21158747 DOI: 10.1111/j.1469-1809.2010.00604.x] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Analyzing the combined effects of genes and/or environmental factors on the development of complex diseases is a great challenge from both the statistical and computational perspective, even using a relatively small number of genetic and nongenetic exposures. Several data-mining methods have been proposed for interaction analysis, among them, the Multifactor Dimensionality Reduction Method (MDR) has proven its utility in a variety of theoretical and practical settings. Model-Based Multifactor Dimensionality Reduction (MB-MDR), a relatively new MDR-based technique that is able to unify the best of both nonparametric and parametric worlds, was developed to address some of the remaining concerns that go along with an MDR analysis. These include the restriction to univariate, dichotomous traits, the absence of flexible ways to adjust for lower order effects and important confounders, and the difficulty in highlighting epistatic effects when too many multilocus genotype cells are pooled into two new genotype groups. We investigate the empirical power of MB-MDR to detect gene-gene interactions in the absence of any noise and in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Power is generally higher for MB-MDR than for MDR, in particular in the presence of genetic heterogeneity, phenocopy, or low minor allele frequencies.
Collapse
Affiliation(s)
- Tom Cattaert
- Montefiore Institute, University of Liege, Belgium
| | | | | | | | | | | | | | | |
Collapse
|
2544
|
Lee S, Huang JZ, Hu J. Sparse logistic principal components analysis for binary data. Ann Appl Stat 2010. [DOI: 10.1214/10-aoas327] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
2545
|
McElduff F, Cortina-Borja M, Chan SK, Wade A. When t-tests or Wilcoxon-Mann-Whitney tests won't do. ADVANCES IN PHYSIOLOGY EDUCATION 2010; 34:128-33. [PMID: 20826766 DOI: 10.1152/advan.00017.2010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
t-Tests are widely used by researchers to compare the average values of a numeric outcome between two groups. If there are doubts about the suitability of the data for the requirements of a t-test, most notably the distribution being non-normal, the Wilcoxon-Mann-Whitney test may be used instead. However, although often applied, both tests may be invalid when discrete and/or extremely skew data are analyzed. In medicine, extremely skewed data having an excess of zeroes are often observed, representing a numeric outcome that does not occur for a large percentage of cases (so is often zero) but which also sometimes takes relatively large values. For data such as this, application of the t-test or Wilcoxon-Mann-Whitney test could lead researchers to draw incorrect conclusions. A valid alternative is regression modeling to quantify the characteristics of the data. The increased availability of software has simplified the application of these more complex statistical analyses and hence facilitates researchers to use them. In this article, we illustrate the methodology applied to a comparison of cyst counts taken from control and steroid-treated fetal mouse kidneys.
Collapse
Affiliation(s)
- Fiona McElduff
- Medical Research Council Centre of Epidemiology for Child Health, Institute of Child Health, University College London, London, United Kingdom.
| | | | | | | |
Collapse
|
2546
|
Wang Z, Liu T, Lin Z, Hegarty J, Koltun WA, Wu R. A general model for multilocus epistatic interactions in case-control studies. PLoS One 2010; 5:e11384. [PMID: 20814428 PMCID: PMC2909900 DOI: 10.1371/journal.pone.0011384] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Accepted: 05/31/2010] [Indexed: 12/16/2022] Open
Abstract
Background Epistasis, i.e., the interaction of alleles at different loci, is thought to play a central role in the formation and progression of complex diseases. The complexity of disease expression should arise from a complex network of epistatic interactions involving multiple genes. Methodology We develop a general model for testing high-order epistatic interactions for a complex disease in a case-control study. We incorporate the quantitative genetic theory of high-order epistasis into the setting of cases and controls sampled from a natural population. The new model allows the identification and testing of epistasis and its various genetic components. Conclusions Simulation studies were used to examine the power and false positive rates of the model under different sampling strategies. The model was used to detect epistasis in a case-control study of inflammatory bowel disease, in which five SNPs at a candidate gene were typed, leading to the identification of a significant three-locus epistasis.
Collapse
Affiliation(s)
- Zhong Wang
- Center for Statistical Genetics, Pennsylvania State University, Hershey, Pennsylvania, United States of America
- Pennsylvania State Cancer Institute, Pennsylvania State University, Hershey, Pennsylvania, United States of America
| | - Tian Liu
- Human Genetics Group, Genome Institute of Singapore, Singapore, Singapore
| | - Zhenwu Lin
- Department of Surgery, Pennsylvania State University, Hershey, Pennsylvania, United States of America
| | - John Hegarty
- Department of Surgery, Pennsylvania State University, Hershey, Pennsylvania, United States of America
| | - Walter A. Koltun
- Department of Surgery, Pennsylvania State University, Hershey, Pennsylvania, United States of America
| | - Rongling Wu
- Center for Statistical Genetics, Pennsylvania State University, Hershey, Pennsylvania, United States of America
- Pennsylvania State Cancer Institute, Pennsylvania State University, Hershey, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
2547
|
Causal inference in perception. Trends Cogn Sci 2010; 14:425-32. [PMID: 20705502 DOI: 10.1016/j.tics.2010.07.001] [Citation(s) in RCA: 207] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Revised: 06/30/2010] [Accepted: 07/01/2010] [Indexed: 11/21/2022]
Abstract
Until recently, the question of how the brain performs causal inference has been studied primarily in the context of cognitive reasoning. However, this problem is at least equally crucial in perceptual processing. At any given moment, the perceptual system receives multiple sensory signals within and across modalities and, for example, has to determine the source of each of these signals. Recently, a growing number of studies from various fields of cognitive science have started to address this question and have converged to very similar computational models. Therefore, it seems that a common computational strategy, which is highly consistent with a normative model of causal inference, is exploited by the perceptual system in a variety of domains.
Collapse
|
2548
|
|
2549
|
|
2550
|
|