Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chi EC, Zhou H, Chen GK, Del Vecchyo DO, Lange K. Genotype imputation via matrix completion. Genome Res 2012;23:509-18. [PMID: 23233546 PMCID: PMC3589539 DOI: 10.1101/gr.145821.112] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

For:	Chi EC, Zhou H, Chen GK, Del Vecchyo DO, Lange K. Genotype imputation via matrix completion. Genome Res 2012;23:509-18. [PMID: 23233546 PMCID: PMC3589539 DOI: 10.1101/gr.145821.112] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Number

Cited by Other Article(s)

Li S, Cheng L, Zhang T, Zhao H, Li J. Graph-guided Bayesian matrix completion for ocean sound speed field reconstruction. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;153:689. [PMID: 36732248 DOI: 10.1121/10.0017064] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 01/09/2023] [Indexed: 06/18/2023]

Ye F, Cho H, Rouayheb SE. Mechanisms for Hiding Sensitive Genotypes with Information-Theoretic Privacy. IEEE TRANSACTIONS ON INFORMATION THEORY 2022;68:4090-4105. [PMID: 37283781 PMCID: PMC10243750 DOI: 10.1109/tit.2022.3156276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Sherpa S, Kebaïli C, Rioux D, Guéguen M, Renaud J, Després L. Population decline at distribution margins: Assessing extinction risk in the last glacial relictual but still functional metapopulation of a European butterfly. DIVERS DISTRIB 2021. [DOI: 10.1111/ddi.13460] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open

Fan M, Zhang Y, Fu Z, Xu M, Wang S, Xie S, Gao X, Wang Y, Li L. A deep matrix completion method for imputing missing histological data in breast cancer by integrating DCE-MRI radiomics. Med Phys 2021;48:7685-7697. [PMID: 34724248 DOI: 10.1002/mp.15316] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 02/01/2023] Open

Abstract

PURPOSE

Clinical indicators of histological information are important for breast cancer treatment and operational decision making, but these histological data suffer from frequent missing values due to various experimental/clinical reasons. The limited amount of histological information from breast cancer samples impedes the accuracy of data imputation. The purpose of this study was to impute missing histological data, including Ki-67 expression level, luminal A subtype, and histological grade, by integrating tumor radiomics.

METHODS

To this end, a deep matrix completion (DMC) method was proposed for imputing missing histological data using nonmissing features composed of histological and tumor radiomics (termed radiohistological features). DMC finds a latent nonlinear association between radiohistological features across all samples and samples for all the features. Radiomic features of morphologic, statistical, and texture were extracted from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) inside the tumor. Experiments on missing histological data imputation were performed with a variable number of features and missing data rates. The performance of the DMC method was compared with those of the nonnegative matrix factorization (NMF) and collaborative filtering (MCF)-based data imputation methods. The area under the curve (AUC) was used to assess the performance of missing histological data imputation.

RESULTS

By integrating radiomics from DCE-MRI, the DMC method showed significantly better performance in terms of AUC than that using only histological data. Additionally, DMC using 120 radiomic features showed an optimal prediction performance (AUC = 0.793), which was better than the NMF (AUC = 0.756) and MCF methods (AUC = 0.706; corrected p = 0.001). The DMC method consistently performed better than the NMF and MCF methods with a variable number of radiomic features and missing data rates.

CONCLUSIONS

DMC improves imputation performance by integrating tumor histological and radiomics data. This study transforms latent imaging-scale patterns for interactions with molecular-scale histological information and is promising in the tumor characterization and management of patients.

Collapse

Wu H, Wang X, Chu M, Xiang R, Zhou K. FRMC: a fast and robust method for the imputation of scRNA-seq data. RNA Biol 2021;18:172-181. [PMID: 34459719 DOI: 10.1080/15476286.2021.1960688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open

Chu BB, Sobel EM, Wasiolek R, Ko S, Sinsheimer JS, Zhou H, Lange K. A fast Data-Driven method for genotype imputation, phasing, and local ancestry inference: MendelImpute.jl. Bioinformatics 2021;37:4756-4763. [PMID: 34289008 PMCID: PMC8665755 DOI: 10.1093/bioinformatics/btab489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/18/2021] [Accepted: 07/19/2021] [Indexed: 11/12/2022] Open

Gain C, François O. LEA 3: Factor models in population genetics and ecological genomics with R. Mol Ecol Resour 2021;21:2738-2748. [PMID: 33638893 DOI: 10.1111/1755-0998.13366] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 01/21/2021] [Accepted: 02/23/2021] [Indexed: 12/12/2022]

Hasan MK, Alam MA, Roy S, Dutta A, Jawad MT, Das S. Missing value imputation affects the performance of machine learning: A review and analysis of the literature (2010–2021). INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100799] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Lee D, Oh J, Yu H. OCam: Out-of-core coordinate descent algorithm for matrix completion. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2019.09.077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Sherpa S, Blum MGB, Després L. Cold adaptation in the Asian tiger mosquito's native range precedes its invasion success in temperate regions. Evolution 2019;73:1793-1808. [DOI: 10.1111/evo.13801] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 06/06/2019] [Accepted: 06/14/2019] [Indexed: 12/25/2022]

Chi EC, Li T. Matrix completion from a computational statistics perspective. ACTA ACUST UNITED AC 2019. [DOI: 10.1002/wics.1469] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Zhou H, Sinsheimer JS, Bates DM, Chu BB, German CA, Ji SS, Keys KL, Kim J, Ko S, Mosher GD, Papp JC, Sobel EM, Zhai J, Zhou JJ, Lange K. OPENMENDEL: a cooperative programming project for statistical genetics. Hum Genet 2019;139:61-71. [PMID: 30915546 DOI: 10.1007/s00439-019-02001-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 03/15/2019] [Indexed: 01/06/2023]

Carpentier A, Klopp O, Löffler M, Nickl R. Adaptive confidence sets for matrix completion. BERNOULLI 2018. [DOI: 10.3150/17-bej933] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Sherpa S, Rioux D, Goindin D, Fouque F, François O, Després L. At the Origin of a Worldwide Invasion: Unraveling the Genetic Makeup of the Caribbean Bridgehead Populations of the Dengue Vector Aedes aegypti. Genome Biol Evol 2018;10:56-71. [PMID: 29267872 PMCID: PMC5758905 DOI: 10.1093/gbe/evx267] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/15/2017] [Indexed: 12/21/2022] Open

Chi EC, Hu L, Saibaba AK, Rao AUK. Going Off the Grid: Iterative Model Selection for Biclustered Matrix Completion. J Comput Graph Stat 2018. [DOI: 10.1080/10618600.2018.1482763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]

Louzoun Y, Alter I, Gragert L, Albrecht M, Maiers M. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection. Immunogenetics 2017;70:279-292. [DOI: 10.1007/s00251-017-1040-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 10/23/2017] [Indexed: 11/24/2022]

Li C, Zhou H. svt: Singular Value Thresholding in MATLAB. J Stat Softw 2017;81. [PMID: 32523475 DOI: 10.18637/jss.v081.c02] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open

Genotype Imputation Methods and Their Effects on Genomic Predictions in Cattle. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/s40362-017-0041-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Zhu L, Guo WL, Lu C, Huang DS. Collaborative Completion of Transcription Factor Binding Profiles via Local Sensitive Unified Embedding. IEEE Trans Nanobioscience 2016;15:946-958. [PMID: 27845669 DOI: 10.1109/tnb.2016.2625823] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Abstract

Although the newly available ChIP-seq data provides immense opportunities for comparative study of regulatory activities across different biological conditions, due to cost, time or sample material availability, it is not always possible for researchers to obtain binding profiles for every protein in every sample of interest, which considerably limits the power of integrative studies. Recently, by leveraging related information from measured data, Ernst et al. proposed ChromImpute for predicting additional ChIP-seq and other types of datasets, it is demonstrated that the imputed signal tracks accurately approximate the experimentally measured signals, and thereby could potentially enhance the power of integrative analysis. Despite the success of ChromImpute, in this paper, we reexamine its learning process, and show that its performance may degrade substantially and sometimes may even fail to output a prediction when the available data is scarce. This limitation could hurt its applicability to important predictive tasks, such as the imputation of TF binding data. To alleviate this problem, we propose a novel method called Local Sensitive Unified Embedding (LSUE) for imputing new ChIP-seq datasets. In LSUE, the ChIP-seq data compendium are fused together by mapping proteins, samples, and genomic positions simultaneously into the Euclidean space, thereby making their underling associations directly evaluable using simple calculations. In contrast to ChromImpute which mainly makes use of the local correlations between available datasets, LSUE can better estimate the overall data structure by formulating the representation learning of all involved entities as a single unified optimization problem. Meanwhile, a novel form of local sensitive low rank regularization is also proposed to further improve the performance of LSUE. Experimental evaluations on the ENCODE TF ChIP-seq data illustrate the performance of the proposed model. The code of LSUE is available at https://github.com/ekffar/LSUE.

Collapse

SparRec: An effective matrix completion framework of missing data imputation for GWAS. Sci Rep 2016;6:35534. [PMID: 27762341 PMCID: PMC5071878 DOI: 10.1038/srep35534] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 09/30/2016] [Indexed: 11/08/2022] Open

Cai T, Cai TT, Zhang A. Structured Matrix Completion with Applications to Genomic Data Integration. J Am Stat Assoc 2016;111:621-633. [PMID: 28042188 PMCID: PMC5198844 DOI: 10.1080/01621459.2015.1021005] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Revised: 01/01/2015] [Indexed: 10/23/2022]

Hodos RA, Kidd BA, Khader S, Readhead BP, Dudley JT. In silico methods for drug repurposing and pharmacology. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2016;8:186-210. [PMID: 27080087 PMCID: PMC4845762 DOI: 10.1002/wsbm.1337] [Citation(s) in RCA: 179] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 02/08/2016] [Accepted: 02/11/2016] [Indexed: 12/18/2022]

Imputing Genotypes in Biallelic Populations from Low-Coverage Sequence Data. Genetics 2015;202:487-95. [PMID: 26715670 DOI: 10.1534/genetics.115.182071] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 12/16/2015] [Indexed: 12/31/2022] Open

Wu TT, Lange K. Matrix Completion Discriminant Analysis. Comput Stat Data Anal 2015;92:115-125. [PMID: 26549920 PMCID: PMC4634674 DOI: 10.1016/j.csda.2015.06.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Wang Y, Wylie T, Stothard P, Lin G. Whole genome SNP genotype piecemeal imputation. BMC Bioinformatics 2015;16:340. [PMID: 26498158 PMCID: PMC4619096 DOI: 10.1186/s12859-015-0770-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Accepted: 10/09/2015] [Indexed: 11/10/2022] Open

Abstract

Background

Despite ongoing reductions in the cost of sequencing technologies, whole genome SNP genotype imputation is often used as an alternative for obtaining abundant SNP genotypes for genome wide association studies. Several existing genotype imputation methods can be efficient for this purpose, while achieving various levels of imputation accuracy. Recent empirical results have shown that the two-step imputation may improve accuracy by imputing the low density genotyped study animals to a medium density array first and then to the target density. We are interested in building a series of staircase arrays that lead the low density array to the high density array or even the whole genome, such that genotype imputation along these staircases can achieve the highest accuracy.

Results

For genotype imputation from a lower density to a higher density, we first show how to select untyped SNPs to construct a medium density array. Subsequently, we determine for each selected SNP those untyped SNPs to be imputed in the add-one two-step imputation, and lastly how the clusters of imputed genotype are pieced together as the final imputation result. We design extensive empirical experiments using several hundred sequenced and genotyped animals to demonstrate that our novel two-step piecemeal imputation always achieves an improvement compared to the one-step imputation by the state-of-the-art methods Beagle and FImpute. Using the two-step piecemeal imputation, we present some preliminary success on whole genome SNP genotype imputation for genotyped animals via a series of staircase arrays.

Conclusions

From a low SNP density to the whole genome, intermediate pseudo-arrays can be computationally constructed by selecting the most informative SNPs for untyped SNP genotype imputation. Such pseudo-array staircases are able to impute more accurately than the classic one-step imputation.

Collapse

Cahsai A, Anagnostopoulos C, Triantafillou P. Scalable Data Quality for Big Data: The Pythia Framework for Handling Missing Values. BIG DATA 2015;3:159-172. [PMID: 27442958 DOI: 10.1089/big.2015.0002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Abstract

Solving the missing-value (MV) problem with small estimation errors in large-scale data environments is a notoriously resource-demanding task. The most widely used MV imputation approaches are computationally expensive because they explicitly depend on the volume and the dimension of the data. Moreover, as datasets and their user community continuously grow, the problem can only be exacerbated. In an attempt to deal with such a problem, in our previous work, we introduced a novel framework coined Pythia, which employs a number of distributed data nodes (cohorts), each of which contains a partition of the original dataset. To perform MV imputation, the Pythia, based on specific machine and statistical learning structures (signatures), selects the most appropriate subset of cohorts to perform locally a missing value substitution algorithm (MVA). This selection relies on the principle that particular subset of cohorts maintains the most relevant partition of the dataset. In addition to this, as Pythia uses only part of the dataset for imputation and accesses different cohorts in parallel, it improves efficiency, scalability, and accuracy compared to a single machine (coined Godzilla), which uses the entire massive dataset to compute imputation requests. Although this article is an extension of our previous work, we particularly investigate the robustness of the Pythia framework and show that the Pythia is independent from any MVA and signature construction algorithms. In order to facilitate our research, we considered two well-known MVAs (namely K-nearest neighbor and expectation-maximization imputation algorithms), as well as two machine and neural computational learning signature construction algorithms based on adaptive vector quantization and competitive learning. We prove comprehensive experiments to assess the performance of the Pythia against Godzilla and showcase the benefits stemmed from this framework.

Collapse

Tiesinga P, Bakker R, Hill S, Bjaalie JG. Feeding the human brain model. Curr Opin Neurobiol 2015;32:107-14. [DOI: 10.1016/j.conb.2015.02.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 02/06/2015] [Accepted: 02/06/2015] [Indexed: 10/23/2022]

Singer M, Pachter L. Controlling for conservation in genome-wide DNA methylation studies. BMC Genomics 2015;16:420. [PMID: 26024968 PMCID: PMC4448855 DOI: 10.1186/s12864-015-1604-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Accepted: 05/01/2015] [Indexed: 11/10/2022] Open

Chen W, Schaid DJ. PedBLIMP: extending linear predictors to impute genotypes in pedigrees. Genet Epidemiol 2014;38:531-41. [PMID: 25044249 DOI: 10.1002/gepi.21838] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2014] [Revised: 05/15/2014] [Accepted: 05/19/2014] [Indexed: 12/13/2022]

Pasaniuc B, Zaitlen N, Shi H, Bhatia G, Gusev A, Pickrell J, Hirschhorn J, Strachan DP, Patterson N, Price AL. Fast and accurate imputation of summary statistics enhances evidence of functional enrichment. ACTA ACUST UNITED AC 2014;30:2906-14. [PMID: 24990607 DOI: 10.1093/bioinformatics/btu416] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Abstract

MOTIVATION

Imputation using external reference panels (e.g. 1000 Genomes) is a widely used approach for increasing power in genome-wide association studies and meta-analysis. Existing hidden Markov models (HMM)-based imputation approaches require individual-level genotypes. Here, we develop a new method for Gaussian imputation from summary association statistics, a type of data that is becoming widely available.

RESULTS

In simulations using 1000 Genomes (1000G) data, this method recovers 84% (54%) of the effective sample size for common (>5%) and low-frequency (1-5%) variants [increasing to 87% (60%) when summary linkage disequilibrium information is available from target samples] versus the gold standard of 89% (67%) for HMM-based imputation, which cannot be applied to summary statistics. Our approach accounts for the limited sample size of the reference panel, a crucial step to eliminate false-positive associations, and it is computationally very fast. As an empirical demonstration, we apply our method to seven case-control phenotypes from the Wellcome Trust Case Control Consortium (WTCCC) data and a study of height in the British 1958 birth cohort (1958BC). Gaussian imputation from summary statistics recovers 95% (105%) of the effective sample size (as quantified by the ratio of [Formula: see text] association statistics) compared with HMM-based imputation from individual-level genotypes at the 227 (176) published single nucleotide polymorphisms (SNPs) in the WTCCC (1958BC height) data. In addition, for publicly available summary statistics from large meta-analyses of four lipid traits, we publicly release imputed summary statistics at 1000G SNPs, which could not have been obtained using previously published methods, and demonstrate their accuracy by masking subsets of the data. We show that 1000G imputation using our approach increases the magnitude and statistical evidence of enrichment at genic versus non-genic loci for these traits, as compared with an analysis without 1000G imputation. Thus, imputation of summary statistics will be a valuable tool in future functional enrichment analyses.

AVAILABILITY AND IMPLEMENTATION

Publicly available software package available at http://bogdan.bioinformatics.ucla.edu/software/.

CONTACT

bpasaniuc@mednet.ucla.edu or aprice@hsph.harvard.edu

SUPPLEMENTARY INFORMATION

Supplementary materials are available at Bioinformatics online.

Collapse

Affiliation(s)

Bogdan Pasaniuc Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Noah Zaitlen Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Huwenbo Shi Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Gaurav Bhatia Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Har
Alexander Gusev Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Har
Joseph Pickrell Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Joel Hirschhorn Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
David P Strachan Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Nick Patterson Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK
Alkes L Price Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, 02115, Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA, 02115, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, 02142, Department of Genetics Harvard Medical School, Boston, MA, 02115 and Division of Population Health Sciences and Education, St George's, University of London, UK Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, 90024, Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90024, Department of Medicine, Lung Biology Center, University of California San Francisco, San Francisco, 94143, Program in Genetic Epidemiology and Statistical Genetics, Har

Collapse

Lange K, Chi EC, Zhou H. A Brief Survey of Modern Optimization for Statisticians. Int Stat Rev 2014;82:46-70. [PMID: 25242858 PMCID: PMC4166522 DOI: 10.1111/insr.12022] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2012] [Accepted: 04/20/2013] [Indexed: 11/30/2022]

Lange K, Chi EC, Zhou H. Rejoinder. Int Stat Rev 2014. [DOI: 10.1111/insr.12030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Zhang L, Pei YF, Fu X, Lin Y, Wang YP, Deng HW. FISH: fast and accurate diploid genotype imputation via segmental hidden Markov model. ACTA ACUST UNITED AC 2014;30:1876-83. [PMID: 24618466 DOI: 10.1093/bioinformatics/btu143] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Affiliation(s)

Lei Zhang School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, ChinaSchool of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China
Yu-Fang Pei School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, ChinaSchool of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China
Xiaoying Fu School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China
Yong Lin School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China
Yu-Ping Wang School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China
Hong-Wen Deng School of Public Health, Xi'an Jiaotong University, Shaanxi, China, Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, USA and Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai, China

Collapse

Lange K, Papp JC, Sinsheimer JS, Sobel EM. Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION 2014;1:279-300. [PMID: 24955378 PMCID: PMC4062304 DOI: 10.1146/annurev-statistics-022513-115638] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Zhang L, Choi HJ, Estrada K, Leo PJ, Li J, Pei YF, Zhang Y, Lin Y, Shen H, Liu YZ, Liu Y, Zhao Y, Zhang JG, Tian Q, Wang YP, Han Y, Ran S, Hai R, Zhu XZ, Wu S, Yan H, Liu X, Yang TL, Guo Y, Zhang F, Guo YF, Chen Y, Chen X, Tan L, Zhang L, Deng FY, Deng H, Rivadeneira F, Duncan EL, Lee JY, Han BG, Cho NH, Nicholson GC, McCloskey E, Eastell R, Prince RL, Eisman JA, Jones G, Reid IR, Sambrook PN, Dennison EM, Danoy P, Yerges-Armstrong LM, Streeten EA, Hu T, Xiang S, Papasian CJ, Brown MA, Shin CS, Uitterlinden AG, Deng HW. Multistage genome-wide association meta-analyses identified two new loci for bone mineral density. Hum Mol Genet 2013;23:1923-33. [PMID: 24249740 DOI: 10.1093/hmg/ddt575] [Citation(s) in RCA: 116] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open

Lange K, Papp JC, Sinsheimer JS, Sripracha R, Zhou H, Sobel EM. Mendel: the Swiss army knife of genetic analysis programs. ACTA ACUST UNITED AC 2013;29:1568-70. [PMID: 23610370 DOI: 10.1093/bioinformatics/btt187] [Citation(s) in RCA: 93] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]