Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Shen A, Fu H, He K, Jiang H. False Discovery Rate Control in Cancer Biomarker Selection Using Knockoffs. Cancers (Basel) 2019;11:E744. [PMID: 31146393 DOI: 10.3390/cancers11060744] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 05/23/2019] [Indexed: 11/17/2022] Open

For:	Shen A, Fu H, He K, Jiang H. False Discovery Rate Control in Cancer Biomarker Selection Using Knockoffs. Cancers (Basel) 2019;11:E744. [PMID: 31146393 DOI: 10.3390/cancers11060744] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 05/23/2019] [Indexed: 11/17/2022] Open

Number

Cited by Other Article(s)

Wang YL, Liu C, Yang YY, Zhang L, Guo X, Niu C, Zhang NP, Ding J, Wu J. Dynamic changes of gut microbiota in mouse models of metabolic dysfunction-associated steatohepatitis and its transition to hepatocellular carcinoma. FASEB J 2024;38:e23766. [PMID: 38967214 DOI: 10.1096/fj.202400573rr] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 06/07/2024] [Accepted: 06/13/2024] [Indexed: 07/06/2024]

Affiliation(s)

Yu-Li Wang Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Chang Liu Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Yong-Yu Yang Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Li Zhang Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Xiao Guo Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Chen Niu Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China
Ning-Ping Zhang Department of Gastroenterology and Hepatology, Zhongshan Hospital of Fudan University, Shanghai, China Shanghai Institute of Liver Diseases, Fudan University Shanghai Medical College, Shanghai, China
Jia Ding Department of Gastroenterology, Shanghai Jing'an District Central Hospital, Fudan University, Shanghai, China
Jian Wu Department of Medical Microbiology and Parasitology, MOE/NHC/CAMS Key Laboratory of Medical Molecular Virology, School of Basic Medical Sciences, Fudan University Shanghai Medical College, Shanghai, China Department of Gastroenterology and Hepatology, Zhongshan Hospital of Fudan University, Shanghai, China Shanghai Institute of Liver Diseases, Fudan University Shanghai Medical College, Shanghai, China

Collapse

Farzad N, Enninful A, Bao S, Zhang D, Deng Y, Fan R. Spatially resolved epigenome sequencing via Tn5 transposition and deterministic DNA barcoding in tissue. Nat Protoc 2024:10.1038/s41596-024-01013-y. [PMID: 38943021 DOI: 10.1038/s41596-024-01013-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 04/11/2024] [Indexed: 06/30/2024]

Abstract

Spatial epigenetic mapping of tissues enables the study of gene regulation programs and cellular functions with the dependency on their local tissue environment. Here we outline a complete procedure for two spatial epigenomic profiling methods: spatially resolved genome-wide profiling of histone modifications using in situ cleavage under targets and tagmentation (CUT&Tag) chemistry (spatial-CUT&Tag) and transposase-accessible chromatin sequencing (spatial-ATAC-sequencing) for chromatin accessibility. Both assays utilize in-tissue Tn5 transposition to recognize genomic DNA loci followed by microfluidic deterministic barcoding to incorporate spatial address codes. Furthermore, these two methods do not necessitate prior knowledge of the transcription or epigenetic markers for a given tissue or cell type but permit genome-wide unbiased profiling pixel-by-pixel at the 10 μm pixel size level and single-base resolution. To support the widespread adaptation of these methods, details are provided in five general steps: (1) sample preparation; (2) Tn5 transposition in spatial-ATAC-sequencing or antibody-controlled pA-Tn5 tagmentation in CUT&Tag; (3) library preparation; (4) next-generation sequencing; and (5) data analysis using our customed pipelines available at: https://github.com/dyxmvp/Spatial_ATAC-seq and https://github.com/dyxmvp/spatial-CUT-Tag . The whole procedure can be completed on four samples in 2-3 days. Familiarity with basic molecular biology and bioinformatics skills with access to a high-performance computing environment are required. A rudimentary understanding of pathology and specimen sectioning, as well as deterministic barcoding in tissue-specific skills (e.g., design of a multiparameter barcode panel and creation of microfluidic devices), are also advantageous. In this protocol, we mainly focus on spatial profiling of tissue region-specific epigenetic landscapes in mouse embryos and mouse brains using spatial-ATAC-sequencing and spatial-CUT&Tag, but these methods can be used for other species with no need for species-specific probe design.

Collapse

Hlongwane R, Ramaboa KKKM, Mongwe W. Enhancing credit scoring accuracy with a comprehensive evaluation of alternative data. PLoS One 2024;19:e0303566. [PMID: 38771812 PMCID: PMC11108212 DOI: 10.1371/journal.pone.0303566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 04/27/2024] [Indexed: 05/23/2024] Open

Fu H, Nicolet D, Mrózek K, Stone RM, Eisfeld A, Byrd JC, Archer KJ. Controlled variable selection in Weibull mixture cure models for high-dimensional data. Stat Med 2022;41:4340-4366. [PMID: 35792553 PMCID: PMC9545322 DOI: 10.1002/sim.9513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 06/14/2022] [Accepted: 06/19/2022] [Indexed: 12/03/2022]

Wang J, Liang H, Zhang Q, Ma S. Replicability in cancer omics data analysis: measures and empirical explorations. Brief Bioinform 2022;23:bbac304. [PMID: 35876281 PMCID: PMC9487717 DOI: 10.1093/bib/bbac304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 06/30/2022] [Accepted: 07/06/2022] [Indexed: 02/05/2023] Open

Li S, Sesia M, Romano Y, Candès E, Sabatti C. Searching for robust associations with a multi-environment knockoff filter. Biometrika 2022;109:611-629. [PMID: 38633763 PMCID: PMC11022501 DOI: 10.1093/biomet/asab055] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024] Open

Li Y, Dai R, Gwon Y, Rennard SI, Make BJ, Foer D, Strand MJ, Austin E, Young KA, Hokanson JE, Pratte KA, Conway R, Kinney GL. Identifying Individual Medications Affecting Pulmonary Outcomes When Multiple Medications are Present. Clin Epidemiol 2022;14:731-735. [PMID: 35677475 PMCID: PMC9167843 DOI: 10.2147/clep.s364692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 05/19/2022] [Indexed: 11/25/2022] Open

Liu M, Katsevich E, Janson L, Ramdas A. Fast and powerful conditional randomization testing via distillation. Biometrika 2021. [DOI: 10.1093/biomet/asab039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract Summary We consider the problem of conditional independence testing: given a response $Y$ and covariates $(X,Z)$, we test the null hypothesis that $Y {\perp\!\!\!\perp} X \mid Z$. The conditional randomization test was recently proposed as a way to use distributional information about $X\mid Z$ to exactly and nonasymptotically control Type-I error using any test statistic in any dimensionality without assuming anything about $Y\mid (X,Z)$. This flexibility, in principle, allows one to derive powerful test statistics from complex prediction algorithms while maintaining statistical validity. Yet the direct use of such advanced test statistics in the conditional randomization test is prohibitively computationally expensive, especially with multiple testing, due to the requirement to recompute the test statistic many times on resampled data. We propose the distilled conditional randomization test, a novel approach to using state-of-the-art machine learning algorithms in the conditional randomization test while drastically reducing the number of times those algorithms need to be run, thereby taking advantage of their power and the conditional randomization test’s statistical guarantees without suffering the usual computational expense. In addition to distillation, we propose a number of other tricks, like screening and recycling computations, to further speed up the conditional randomization test without sacrificing its high power and exact validity. Indeed, we show in simulations that all our proposals combined lead to a test that has similar power to most powerful existing conditional randomization test implementations, but requires orders of magnitude less computation, making it a practical tool even for large datasets. We demonstrate these benefits on a breast cancer dataset by identifying biomarkers related to cancer stage. Collapse

Chia C, Sesia M, Ho CS, Jeffrey SS, Dionne J, Candes EJ, Howe RT. Interpretable Classification of Bacterial Raman Spectra with Knockoff Wavelets. IEEE J Biomed Health Inform 2021;26:740-748. [PMID: 34232897 DOI: 10.1109/jbhi.2021.3094873] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Jiang T, Li Y, Motsinger-Reif AA. Knockoff boosted tree for model-free variable selection. Bioinformatics 2021;37:976-983. [PMID: 32966559 DOI: 10.1093/bioinformatics/btaa770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Revised: 08/17/2020] [Accepted: 09/09/2020] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

The recently proposed knockoff filter is a general framework for controlling the false discovery rate (FDR) when performing variable selection. This powerful new approach generates a 'knockoff' of each variable tested for exact FDR control. Imitation variables that mimic the correlation structure found within the original variables serve as negative controls for statistical inference. Current applications of knockoff methods use linear regression models and conduct variable selection only for variables existing in model functions. Here, we extend the use of knockoffs for machine learning with boosted trees, which are successful and widely used in problems where no prior knowledge of model function is required. However, currently available importance scores in tree models are insufficient for variable selection with FDR control.

RESULTS

We propose a novel strategy for conducting variable selection without prior model topology knowledge using the knockoff method with boosted tree models. We extend the current knockoff method to model-free variable selection through the use of tree-based models. Additionally, we propose and evaluate two new sampling methods for generating knockoffs, namely the sparse covariance and principal component knockoff methods. We test and compare these methods with the original knockoff method regarding their ability to control type I errors and power. In simulation tests, we compare the properties and performance of importance test statistics of tree models. The results include different combinations of knockoffs and importance test statistics. We consider scenarios that include main-effect, interaction, exponential and second-order models while assuming the true model structures are unknown. We apply our algorithm for tumor purity estimation and tumor classification using Cancer Genome Atlas (TCGA) gene expression data. Our results show improved discrimination between difficult-to-discriminate cancer types.

AVAILABILITY AND IMPLEMENTATION

The proposed algorithm is included in the KOBT package, which is available at https://cran.r-project.org/web/packages/KOBT/index.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Fu H, Archer KJ. High-dimensional variable selection for ordinal outcomes with error control. Brief Bioinform 2020;22:334-345. [PMID: 32031572 DOI: 10.1093/bib/bbaa007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 01/06/2020] [Indexed: 12/24/2022] Open

Applications of Bioinformatics in Cancer. Cancers (Basel) 2019;11:cancers11111630. [PMID: 31652939 PMCID: PMC6893424 DOI: 10.3390/cancers11111630] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 10/23/2019] [Indexed: 01/02/2023] Open