1
|
Wang A, Tian P, Zhang YD. TWAS-GKF: a novel method for causal gene identification in transcriptome-wide association studies with knockoff inference. Bioinformatics 2024; 40:btae502. [PMID: 39189955 PMCID: PMC11361808 DOI: 10.1093/bioinformatics/btae502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 07/02/2024] [Accepted: 08/24/2024] [Indexed: 08/28/2024] Open
Abstract
MOTIVATION Transcriptome-wide association study (TWAS) aims to identify trait-associated genes regulated by significant variants to explore the underlying biological mechanisms at a tissue-specific level. Despite the advancement of current TWAS methods to cover diverse traits, traditional approaches still face two main challenges: (i) the lack of methods that can guarantee finite-sample false discovery rate (FDR) control in identifying trait-associated genes; and (ii) the requirement for individual-level data, which is often inaccessible. RESULTS To address this challenge, we propose a powerful knockoff inference method termed TWAS-GKF to identify candidate trait-associated genes with a guaranteed finite-sample FDR control. TWAS-GKF introduces the main idea of Ghostknockoff inference to generate knockoff variables using only summary statistics instead of individual-level data. In extensive studies, we demonstrate that TWAS-GKF successfully controls the finite-sample FDR under a pre-specified FDR level across all settings. We further apply TWAS-GKF to identify genes in brain cerebellum tissue from the Genotype-Tissue Expression (GTEx) v8 project associated with schizophrenia (SCZ) from the Psychiatric Genomics Consortium (PGC), and genes in liver tissue related to low-density lipoprotein cholesterol (LDL-C) from the UK Biobank, respectively. The results reveal that the majority of the identified genes are validated by Open Targets Validation Platform. AVAILABILITY AND IMPLEMENTATION The R package TWAS.GKF is publicly available at https://github.com/AnqiWang2021/TWAS.GKF.
Collapse
Affiliation(s)
- Anqi Wang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, 999077, China
| | - Peixin Tian
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, 999077, China
| | - Yan Dora Zhang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, 999077, China
| |
Collapse
|
2
|
Yan G, Hua SH, Li JJ. Categorization of 31 computational methods to detect spatially variable genes from spatially resolved transcriptomics data. ARXIV 2024:arXiv:2405.18779v2. [PMID: 38855546 PMCID: PMC11160866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
In the analysis of spatially resolved transcriptomics data, detecting spatially variable genes (SVGs) is crucial. Numerous computational methods exist, but varying SVG definitions and methodologies lead to incomparable results. We review 31 state-of-the-art methods, categorizing SVGs into three types: overall, cell-type-specific, and spatial-domain-marker SVGs. Our review explains the intuitions underlying these methods, summarizes their applications, and categorizes the hypothesis tests they use in the trade-off between generality and specificity for SVG detection. We discuss challenges in SVG detection and propose future directions for improvement. Our review offers insights for method developers and users, advocating for category-specific benchmarking.
Collapse
Affiliation(s)
- Guanao Yan
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
| | - Shuo Harper Hua
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
- Department of Human Genetics, University of California, Los Angeles, CA 90095-7088
- Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772
- Radcliffe Institute for Advanced Study, Harvard University, Cambridge, MA 02138
| |
Collapse
|
3
|
Chen YE, Ge X, Woyshner K, McDermott M, Manousopoulou A, Ficarro SB, Marto JA, Li K, Wang LD, Li JJ. APIR: Aggregating Universal Proteomics Database Search Algorithms for Peptide Identification with FDR Control. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae042. [PMID: 39198030 DOI: 10.1093/gpbjnl/qzae042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 02/26/2024] [Accepted: 03/11/2024] [Indexed: 09/01/2024]
Abstract
Advances in mass spectrometry (MS) have enabled high-throughput analysis of proteomes in biological systems. The state-of-the-art MS data analysis relies on database search algorithms to quantify proteins by identifying peptide-spectrum matches (PSMs), which convert mass spectra to peptide sequences. Different database search algorithms use distinct search strategies and thus may identify unique PSMs. However, no existing approaches can aggregate all user-specified database search algorithms with a guaranteed increase in the number of identified peptides and a control on the false discovery rate (FDR). To fill in this gap, we proposed a statistical framework, Aggregation of Peptide Identification Results (APIR), that is universally compatible with all database search algorithms. Notably, under an FDR threshold, APIR is guaranteed to identify at least as many, if not more, peptides as individual database search algorithms do. Evaluation of APIR on a complex proteomics standard dataset showed that APIR outpowers individual database search algorithms and empirically controls the FDR. Real data studies showed that APIR can identify disease-related proteins and post-translational modifications missed by some individual database search algorithms. The APIR framework is easily extendable to aggregating discoveries made by multiple algorithms in other high-throughput biomedical data analysis, e.g., differential gene expression analysis on RNA sequencing data. The APIR R package is available at https://github.com/yiling0210/APIR.
Collapse
Affiliation(s)
- Yiling Elaine Chen
- Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
| | - Xinzhou Ge
- Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
| | - Kyla Woyshner
- Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - MeiLu McDermott
- Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Antigoni Manousopoulou
- Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Scott B Ficarro
- Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA
| | - Jarrod A Marto
- Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA
| | - Kexin Li
- Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
| | - Leo David Wang
- Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
- Department of Pediatrics, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Jingyi Jessica Li
- Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California, Los Angeles, CA 90095, USA
- Department of Computational Medicine, University of California, Los Angeles, CA 90095, USA
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
4
|
Lv F, Zhan Y, Feng H, Sun W, Yin C, Han Y, Shao Y, Xue W, Jiang S, Ma Y, Hu H, Wei J, Yan Y, Lin M. Integrated Hfq-interacting RNAome and transcriptomic analysis reveals complex regulatory networks of nitrogen fixation in root-associated Pseudomonas stutzeri A1501. mSphere 2024; 9:e0076223. [PMID: 38747590 PMCID: PMC11332353 DOI: 10.1128/msphere.00762-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Accepted: 04/10/2024] [Indexed: 06/26/2024] Open
Abstract
The RNA chaperone Hfq acts as a global regulator of numerous biological processes, such as carbon/nitrogen metabolism and environmental adaptation in plant-associated diazotrophs; however, its target RNAs and the mechanisms underlying nitrogen fixation remain largely unknown. Here, we used enhanced UV cross-linking immunoprecipitation coupled with high-throughput sequencing to identify hundreds of Hfq-binding RNAs probably involved in nitrogen fixation, carbon substrate utilization, biofilm formation, and other functions. Collectively, these processes endow strain A1501 with the requisite capabilities to thrive in the highly competitive rhizosphere. Our findings revealed a previously uncharted landscape of Hfq target genes. Notable among these is nifM, encoding an isomerase necessary for nitrogenase reductase solubility; amtB, encoding an ammonium transporter; oprB, encoding a carbohydrate porin; and cheZ, encoding a chemotaxis protein. Furthermore, we identified more than 100 genes of unknown function, which expands the potential direct regulatory targets of Hfq in diazotrophs. Our data showed that Hfq directly interacts with the mRNA of regulatory proteins (RsmA, AlgU, and NifA), regulatory ncRNA RsmY, and other potential targets, thus revealing the mechanistic links in nitrogen fixation and other metabolic pathways. IMPORTANCE Numerous experimental approaches often face challenges in distinguishing between direct and indirect effects of Hfq-mediated regulation. New technologies based on high-throughput sequencing are increasingly providing insight into the global regulation of Hfq in gene expression. Here, enhanced UV cross-linking immunoprecipitation coupled with high-throughput sequencing was employed to identify the Hfq-binding sites and potential targets in the root-associated Pseudomonas stutzeri A1501 and identify hundreds of novel Hfq-binding RNAs that are predicted to be involved in metabolism, environmental adaptation, and nitrogen fixation. In particular, we have shown Hfq interactions with various regulatory proteins' mRNA and their potential targets at the posttranscriptional level. This study not only enhances our understanding of Hfq regulation but, importantly, also provides a framework for addressing integrated regulatory network underlying root-associated nitrogen fixation.
Collapse
Affiliation(s)
- Fanyang Lv
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yuhua Zhan
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Haichao Feng
- College of Agriculture, Henan University, Kaifeng, Henan, China
| | - Wenyue Sun
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Changyan Yin
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yueyue Han
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yahui Shao
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Wei Xue
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Shanshan Jiang
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yiyuan Ma
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Haonan Hu
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jinfeng Wei
- College of Agriculture, Henan University, Kaifeng, Henan, China
| | - Yongliang Yan
- Biotechnology Research Institute/National Key Laboratory of Agricultural Microbiology, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Min Lin
- College of Agriculture, Henan University, Kaifeng, Henan, China
| |
Collapse
|
5
|
Yang R, Chu H, Yue H, Mishina Y, Zhang Z, Liu H, Li B. BMP signaling maintains auricular chondrocyte identity and prevents microtia development by inhibiting protein kinase A. eLife 2024; 12:RP91883. [PMID: 38690987 PMCID: PMC11062634 DOI: 10.7554/elife.91883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024] Open
Abstract
Elastic cartilage constitutes a major component of the external ear, which functions to guide sound to the middle and inner ears. Defects in auricle development cause congenital microtia, which affects hearing and appearance in patients. Mutations in several genes have been implicated in microtia development, yet, the pathogenesis of this disorder remains incompletely understood. Here, we show that Prrx1 genetically marks auricular chondrocytes in adult mice. Interestingly, BMP-Smad1/5/9 signaling in chondrocytes is increasingly activated from the proximal to distal segments of the ear, which is associated with a decrease in chondrocyte regenerative activity. Ablation of Bmpr1a in auricular chondrocytes led to chondrocyte atrophy and microtia development at the distal part. Transcriptome analysis revealed that Bmpr1a deficiency caused a switch from the chondrogenic program to the osteogenic program, accompanied by enhanced protein kinase A activation, likely through increased expression of Adcy5/8. Inhibition of PKA blocked chondrocyte-to-osteoblast transformation and microtia development. Moreover, analysis of single-cell RNA-seq of human microtia samples uncovered enriched gene expression in the PKA pathway and chondrocyte-to-osteoblast transformation process. These findings suggest that auricle cartilage is actively maintained by BMP signaling, which maintains chondrocyte identity by suppressing osteogenic differentiation.
Collapse
Affiliation(s)
- Ruichen Yang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong UniversityShanghaiChina
| | - Hongshang Chu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong UniversityShanghaiChina
| | - Hua Yue
- Department of Osteoporosis and Bone Diseases, Shanghai Clinical Research Center of Bone Disease, Shanghai Jiao Tong University Affiliated Sixth People's HospitalShanghaiChina
| | - Yuji Mishina
- Department of Biologic and Materials & Prosthodontics, University of Michigan School of DentistryAnn ArborUnited States
| | - Zhenlin Zhang
- Department of Osteoporosis and Bone Diseases, Shanghai Clinical Research Center of Bone Disease, Shanghai Jiao Tong University Affiliated Sixth People's HospitalShanghaiChina
| | - Huijuan Liu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong UniversityShanghaiChina
| | - Baojie Li
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong UniversityShanghaiChina
- Shanghai Institute of Stem Cell Research and Clinical TranslationShanghaiChina
| |
Collapse
|
6
|
Afshar Y, Yin O, Jeong A, Martinez G, Kim J, Ma F, Jang C, Tabatabaei S, You S, Tseng HR, Zhu Y, Krakow D. Placenta accreta spectrum disorder at single-cell resolution: a loss of boundary limits in the decidua and endothelium. Am J Obstet Gynecol 2024; 230:443.e1-443.e18. [PMID: 38296740 DOI: 10.1016/j.ajog.2023.10.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 09/25/2023] [Accepted: 10/01/2023] [Indexed: 02/02/2024]
Abstract
BACKGROUND Placenta accreta spectrum disorders are associated with severe maternal morbidity and mortality. Placenta accreta spectrum disorders involve excessive adherence of the placenta preventing separation at birth. Traditionally, this condition has been attributed to excessive trophoblast invasion; however, an alternative view is a fundamental defect in decidual biology. OBJECTIVE This study aimed to gain insights into the understanding of placenta accreta spectrum disorder by using single-cell and spatially resolved transcriptomics to characterize cellular heterogeneity at the maternal-fetal interface in placenta accreta spectrum disorders. STUDY DESIGN To assess cellular heterogeneity and the function of cell types, single-cell RNA sequencing and spatially resolved transcriptomics were used. A total of 12 placentas were included, 6 placentas with placenta accreta spectrum disorder and 6 controls. For each placenta with placenta accreta spectrum disorder, multiple biopsies were taken at the following sites: placenta accreta spectrum adherent and nonadherent sites in the same placenta. Of note, 2 platforms were used to generate libraries: the 10× Chromium and NanoString GeoMX Digital Spatial Profiler for single-cell and spatially resolved transcriptomes, respectively. Differential gene expression analysis was performed using a suite of bioinformatic tools (Seurat and GeoMxTools R packages). Correction for multiple testing was performed using Clipper. In situ hybridization was performed with RNAscope, and immunohistochemistry was used to assess protein expression. RESULTS In creating a placenta accreta cell atlas, there were dramatic difference in the transcriptional profile by site of biopsy between placenta accreta spectrum and controls. Most of the differences were noted at the site of adherence; however, differences existed within the placenta between the adherent and nonadherent site of the same placenta in placenta accreta. Among all cell types, the endothelial-stromal populations exhibited the greatest difference in gene expression, driven by changes in collagen genes, namely collagen type III alpha 1 chain (COL3A1), growth factors, epidermal growth factor-like protein 6 (EGFL6), and hepatocyte growth factor (HGF), and angiogenesis-related genes, namely delta-like noncanonical Notch ligand 1 (DLK1) and platelet endothelial cell adhesion molecule-1 (PECAM1). Intraplacental tropism (adherent versus non-adherent sites in the same placenta) was driven by differences in endothelial-stromal cells with notable differences in bone morphogenic protein 5 (BMP5) and osteopontin (SPP1) in the adherent vs nonadherent site of placenta accreta spectrum. CONCLUSION Placenta accreta spectrum disorders were characterized at single-cell resolution to gain insight into the pathophysiology of the disease. An atlas of the placenta at single cell resolution in accreta allows for understanding in the biology of the intimate maternal and fetal interaction. The contributions of stromal and endothelial cells were demonstrated through alterations in the extracellular matrix, growth factors, and angiogenesis. Transcriptional and protein changes in the stroma of placenta accreta spectrum shift the etiologic explanation away from "invasive trophoblast" to "loss of boundary limits" in the decidua. Gene targets identified in this study may be used to refine diagnostic assays in early pregnancy, track disease progression over time, and inform therapeutic discoveries.
Collapse
Affiliation(s)
- Yalda Afshar
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA.
| | - Ophelia Yin
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA; Division of Maternal-Fetal Medicine, Department of Obstetrics, Gynecology, and Reproductive Sciences, University of California, San Francisco, San Francisco, CA
| | - Anhyo Jeong
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA
| | - Guadalupe Martinez
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA
| | - Jina Kim
- Department of Urology, Cedars-Sinai Medical Center, Los Angeles, CA
| | - Feiyang Ma
- Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, Los Angeles, CA
| | - Christine Jang
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA
| | - Sarah Tabatabaei
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA
| | - Sungyong You
- Department of Urology, Cedars-Sinai Medical Center, Los Angeles, CA; Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA
| | - Hsian-Rong Tseng
- Department of Molecular and Medical Pharmacology, California NanoSystems Institute, Crump Institute for Molecular Imaging, Los Angeles, CA
| | - Yazhen Zhu
- Department of Molecular and Medical Pharmacology, California NanoSystems Institute, Crump Institute for Molecular Imaging, Los Angeles, CA; Department of Pathology, University of California, Los Angeles, Los Angeles, CA
| | - Deborah Krakow
- Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA; Departments of Orthopedic Surgery and Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA
| |
Collapse
|
7
|
Panthum T, Ariyaraphong N, Wongloet W, Wattanadilokchatkun P, Laopichienpong N, Rasoarahona R, Singchat W, Ahmad SF, Kraichak E, Muangmai N, Duengkae P, Fukuda Y, Banks S, Temsiripong Y, Ezaz T, Srikulnath K. Preserving Pure Siamese Crocodile Populations: A Comprehensive Approach Using Multi-Genetic Tools. BIOLOGY 2023; 12:1428. [PMID: 37998027 PMCID: PMC10669835 DOI: 10.3390/biology12111428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/04/2023] [Accepted: 11/10/2023] [Indexed: 11/25/2023]
Abstract
Hybrids between the critically endangered Siamese crocodile (Crocodylus siamensis) and least-concern saltwater crocodile (C. porosus) in captive populations represent a serious challenge for conservation and reintroduction programs due to the impact of anthropogenic activities. A previous study used microsatellite and mitochondrial DNA data to establish the criteria for identifying species and their hybrids; however, the results may have been influenced by biased allelic frequencies and genetic drift within the examined population. To overcome these limitations and identify the true signals of selection, alternative DNA markers and a diverse set of populations should be employed. Therefore, this study used DArT sequencing to identify genome-wide single nucleotide polymorphisms (SNPs) in both species and confirm the genetic scenario of the parental species and their hybrids. A population of saltwater crocodiles from Australia was used to compare the distribution of species-diagnostic SNPs. Different analytical approaches were compared to diagnose the level of hybridization when an admixture was present, wherein three individuals had potential backcrossing. Approximately 17.00-26.00% of loci were conserved between the Siamese and saltwater crocodile genomes. Species-diagnostic SNP loci for Siamese and saltwater crocodiles were identified as 8051 loci and 1288 loci, respectively. To validate the species-diagnostic SNP loci, a PCR-based approach was used by selecting 20 SNP loci for PCR primer design, among which 3 loci were successfully able to differentiate the actual species and different hybridization levels. Mitochondrial and nuclear genetic information, including microsatellite genotyping and species-diagnostic DNA markers, were combined as a novel method that can compensate for the limitations of each method. This method enables conservation prioritization before release into the wild, thereby ensuring sustainable genetic integrity for long-term species survival through reintroduction and management programs.
Collapse
Affiliation(s)
- Thitipong Panthum
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Nattakan Ariyaraphong
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Wongsathit Wongloet
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Pish Wattanadilokchatkun
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Nararat Laopichienpong
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
| | - Ryan Rasoarahona
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
| | - Worapong Singchat
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Syed Farhan Ahmad
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
| | - Ekaphan Kraichak
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Department of Botany, Kasetsart University, 50 Ngamwongwan, Bangkok 10900, Thailand
| | - Narongrit Muangmai
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Department of Fishery Biology, Faculty of Fisheries, Kasetsart University, 50 Ngamwongwan, Bangkok 10900, Thailand
| | - Prateep Duengkae
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
| | - Yusuke Fukuda
- Department of Environment, Parks and Water Security, Northern Territory Government, Darwin, NT 0830, Australia;
| | - Sam Banks
- Research Institute for the Environment and Livelihoods, College of Engineering, IT and the Environment, Charles Darwin University, Darwin, NT 0909, Australia;
| | | | - Tariq Ezaz
- Centre for Conservation Ecology and Genomics, Institute for Applied Ecology, Faculty of Science and Technology, University of Canberra, Bruce, ACT 2617, Australia;
| | - Kornsorn Srikulnath
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand; (T.P.); (N.A.); (W.W.); (P.W.); (N.L.); (R.R.); (W.S.); (S.F.A.); (E.K.); (N.M.); (P.D.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok 10900, Thailand
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, 50 Ngamwongwan, Bangkok 10900, Thailand
| |
Collapse
|
8
|
Zhou Y, Sun H, Rapiejko AR, Vargas-Blanco DA, Martini MC, Chase MR, Joubran SR, Davis AB, Dainis JP, Kelly JM, Ioerger TR, Roberts LA, Fortune SM, Shell SS. Mycobacterial RNase E cleaves with a distinct sequence preference and controls the degradation rates of most Mycolicibacterium smegmatis mRNAs. J Biol Chem 2023; 299:105312. [PMID: 37802316 PMCID: PMC10641625 DOI: 10.1016/j.jbc.2023.105312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 08/29/2023] [Accepted: 09/29/2023] [Indexed: 10/08/2023] Open
Abstract
The mechanisms and regulation of RNA degradation in mycobacteria have been subject to increased interest following the identification of interplay between RNA metabolism and drug resistance. Mycobacteria encode multiple ribonucleases predicted to participate in mRNA degradation and/or processing of stable RNAs. RNase E is hypothesized to play a major role in mRNA degradation because of its essentiality in mycobacteria and its role in mRNA degradation in gram-negative bacteria. Here, we defined the impact of RNase E on mRNA degradation rates transcriptome-wide in the nonpathogenic model Mycolicibacterium smegmatis. RNase E played a rate-limiting role in degradation of the transcripts encoded by at least 89% of protein-coding genes, with leadered transcripts often being more affected by RNase E repression than leaderless transcripts. There was an apparent global slowing of transcription in response to knockdown of RNase E, suggesting that M. smegmatis regulates transcription in responses to changes in mRNA degradation. This compensation was incomplete, as the abundance of most transcripts increased upon RNase E knockdown. We assessed the sequence preferences for cleavage by RNase E transcriptome-wide in M. smegmatis and Mycobacterium tuberculosis and found a consistent bias for cleavage in C-rich regions. Purified RNase E had a clear preference for cleavage immediately upstream of cytidines, distinct from the sequence preferences of RNase E in gram-negative bacteria. We furthermore report a high-resolution map of mRNA cleavage sites in M. tuberculosis, which occur primarily within the RNase E-preferred sequence context, confirming that RNase E has a broad impact on the M. tuberculosis transcriptome.
Collapse
Affiliation(s)
- Ying Zhou
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Huaming Sun
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Abigail R Rapiejko
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Diego A Vargas-Blanco
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Maria Carla Martini
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Michael R Chase
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Samantha R Joubran
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Alexa B Davis
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Joseph P Dainis
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Jessica M Kelly
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Thomas R Ioerger
- Department of Computer Science & Engineering, Texas A&M University, College Station, Texas, USA
| | - Louis A Roberts
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | - Sarah M Fortune
- Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Scarlet S Shell
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, Massachusetts, USA.
| |
Collapse
|
9
|
Zhou HJ, Ge X, Li JJ. ClipperQTL: ultrafast and powerful eGene identification method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.28.555191. [PMID: 37693523 PMCID: PMC10491229 DOI: 10.1101/2023.08.28.555191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
A central task in expression quantitative trait locus (eQTL) analysis is to identify cis-eGenes (henceforth "eGenes"), i.e., genes whose expression levels are regulated by at least one local genetic variant. Among the existing eGene identification methods, FastQTL is considered the gold standard but is computationally expensive as it requires thousands of permutations for each gene. Alternative methods such as eigenMT and TreeQTL have lower power than FastQTL. In this work, we propose ClipperQTL, which reduces the number of permutations needed from thousands to 20 for data sets with large sample sizes (> 450) by using the contrastive strategy developed in Clipper; for data sets with smaller sample sizes, it uses the same permutation-based approach as FastQTL. We show that ClipperQTL performs as well as FastQTL and runs about 500 times faster if the contrastive strategy is used and 50 times faster if the conventional permutation-based approach is used. The R package ClipperQTL is available at https://github.com/heatherjzhou/ClipperQTL.
Collapse
Affiliation(s)
- Heather J. Zhou
- Department of Statistics and Data Science, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Xinzhou Ge
- Department of Statistics and Data Science, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Current address: Department of Statistics, Oregon State University, Corvallis, OR 97330, USA
| | - Jingyi Jessica Li
- Department of Statistics and Data Science, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
10
|
Song D, Li K, Ge X, Li JJ. ClusterDE: a post-clustering differential expression (DE) method robust to false-positive inflation caused by double dipping. RESEARCH SQUARE 2023:rs.3.rs-3211191. [PMID: 37577698 PMCID: PMC10418557 DOI: 10.21203/rs.3.rs-3211191/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
In typical single-cell RNA-seq (scRNA-seq) data analysis, a clustering algorithm is applied to find putative cell types as clusters, and then a statistical differential expression (DE) test is employed to identify the differentially expressed (DE) genes between the cell clusters. However, this common procedure uses the same data twice, an issue known as "double dipping": the same data is used twice to define cell clusters as potential cell types and DE genes as potential cell-type marker genes, leading to false-positive cell-type marker genes even when the cell clusters are spurious. To overcome this challenge, we propose ClusterDE, a post-clustering DE method for controlling the false discovery rate (FDR) of identified DE genes regardless of clustering quality, which can work as an add-on to popular pipelines such as Seurat. The core idea of ClusterDE is to generate real-data-based synthetic null data containing only one cluster, as contrast to the real data, for evaluating the whole procedure of clustering followed by a DE test. Using comprehensive simulation and real data analysis, we show that ClusterDE has not only solid FDR control but also the ability to identify cell-type marker genes as top DE genes and distinguish them from housekeeping genes. ClusterDE is fast, transparent, and adaptive to a wide range of clustering algorithms and DE tests. Besides scRNA-seq data, ClusterDE is generally applicable to post-clustering DE analysis, including single-cell multi-omics data analysis.
Collapse
Affiliation(s)
- Dongyuan Song
- Bioinformatics Interdepartmental Ph.D. Program, University of California, Los Angeles, CA 90095-7246
| | - Kexin Li
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
| | - Xinzhou Ge
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
| | - Jingyi Jessica Li
- Bioinformatics Interdepartmental Ph.D. Program, University of California, Los Angeles, CA 90095-7246
- Department of Statistics, University of California, Los Angeles, CA 90095-1554
- Department of Human Genetics, University of California, Los Angeles, CA 90095-7088
- Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772
- Radcliffe Institute for Advanced Study, Harvard University, Cambridge, MA 02138
| |
Collapse
|
11
|
Li JJ. How the Monty Hall problem is similar to the false discovery rate in high-throughput data analysis. Nat Biotechnol 2023; 41:754-755. [PMID: 37198440 PMCID: PMC11078861 DOI: 10.1038/s41587-023-01794-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Affiliation(s)
- Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, Los Angeles, CA, USA.
- Radcliffe Institute of Advanced Study, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
12
|
Yang L, Chen X, Lee C, Shi J, Lawrence EB, Zhang L, Li Y, Gao N, Jung SY, Creighton CJ, Li JJ, Cui Y, Arimura S, Lei Y, Li W, Shen L. Functional characterization of age-dependent p16 epimutation reveals biological drivers and therapeutic targets for colorectal cancer. J Exp Clin Cancer Res 2023; 42:113. [PMID: 37143122 PMCID: PMC10157929 DOI: 10.1186/s13046-023-02689-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 04/27/2023] [Indexed: 05/06/2023] Open
Abstract
BACKGROUND Methylation of the p16 promoter resulting in epigenetic gene silencing-known as p16 epimutation-is frequently found in human colorectal cancer and is also common in normal-appearing colonic mucosa of aging individuals. Thus, to improve clinical care of colorectal cancer (CRC) patients, we explored the role of age-related p16 epimutation in intestinal tumorigenesis. METHODS We established a mouse model that replicates two common genetic and epigenetic events observed in human CRCs: Apc mutation and p16 epimutation. We conducted long-term survival and histological analysis of tumor development and progression. Colonic epithelial cells and tumors were collected from mice and analyzed by RNA sequencing (RNA-seq), quantitative PCR, and flow cytometry. We performed single-cell RNA sequencing (scRNA-seq) to characterize tumor-infiltrating immune cells throughout tumor progression. We tested whether anti-PD-L1 immunotherapy affects overall survival of tumor-bearing mice and whether inhibition of both epigenetic regulation and immune checkpoint is more efficacious. RESULTS Mice carrying combined Apc mutation and p16 epimutation had significantly shortened survival and increased tumor growth compared to those with Apc mutation only. Intriguingly, colon tumors with p16 epimutation exhibited an activated interferon pathway, increased expression of programmed death-ligand 1 (Pdl1), and enhanced infiltration of immune cells. scRNA-seq further revealed the presence of Foxp3+ Tregs and γδT17 cells, which contribute to an immunosuppressive tumor microenvironment (TME). Furthermore, we showed that a combined therapy using an inhibitor of DNA methylation and a PD-L1 immune checkpoint inhibitor is more effective for improving survival in tumor-bearing mice than blockade of either pathway alone. CONCLUSIONS Our study demonstrated that age-dependent p16 epimutation creates a permissive microenvironment for malignant transformation of polyps to colon cancer. Our findings provide a mechanistic rationale for future targeted therapy in patients with p16 epimutation.
Collapse
Affiliation(s)
- Li Yang
- USDA Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, TX, Houston, USA
| | - Xiaomin Chen
- USDA Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, TX, Houston, USA
| | - Christy Lee
- Department of Statistics, University of California, Los Angeles, CA, USA
| | - Jiejun Shi
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, CA, USA
- Present address: Department of General Surgery, Shanghai Tongji Hospital, School of Life Sciences and Technology, Tongji University, Shanghai, China
| | - Emily B Lawrence
- USDA Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, TX, Houston, USA
| | - Lanjing Zhang
- Department of Pathology, Princeton Medical Center, Plainsboro, NJ, USA
- Department of Chemical Biology, Earnest Mario School of Pharmacy, Rutgers University, Piscataway, NJ, USA
| | - Yumei Li
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Nan Gao
- Department of Biological Sciences, Rutgers University, Newark, NJ, USA
| | - Sung Yun Jung
- Department of Biochemistry, Baylor College of Medicine, Houston, TX, USA
| | - Chad J Creighton
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Medicine and Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA, USA
| | - Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, CA, USA
| | - Sumimasa Arimura
- Department of Medicine and Section of Gastroenterology and Hepatology, Baylor College of Medicine, Houston, TX, USA
| | - Yunping Lei
- Center for Precision Environmental Health, Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, USA
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, CA, USA
| | - Lanlan Shen
- USDA Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, TX, Houston, USA.
| |
Collapse
|
13
|
Chen S, Li Z, Liu L, Wen Y. The systematic comparison between Gaussian mirror and Model-X knockoff models. Sci Rep 2023; 13:5478. [PMID: 37015993 PMCID: PMC10073103 DOI: 10.1038/s41598-023-32605-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 03/30/2023] [Indexed: 04/06/2023] Open
Abstract
While the high-dimensional biological data have provided unprecedented data resources for the identification of biomarkers, consensus is still lacking on how to best analyze them. The recently developed Gaussian mirror (GM) and Model-X (MX) knockoff-based methods have much related model assumptions, which makes them appealing for the detection of new biomarkers. However, there are no guidelines for their practical use. In this research, we systematically compared the performance of MX-based and GM methods, where the impacts of the distribution of explanatory variables, their relatedness and the signal-to-noise ratio were evaluated. MX with knockoff generated using the second-order approximates (MX-SO) has the best performance as compared to other MX-based methods. MX-SO and GM have similar levels of power and computational speed under most of the simulations, but GM is more robust in the control of false discovery rate (FDR). In particular, MX-SO can only control the FDR well when there are weak correlations among explanatory variables and the sample size is at least moderate. On the contrary, GM can have the desired FDR as long as explanatory variables are not highly correlated. We further used GM and MX-based methods to detect biomarkers that are associated with the Alzheimer's disease-related PET-imaging trait and the Parkinson's disease-related T-tau of cerebrospinal fluid. We found that MX-based and GM methods are both powerful for the analysis of big biological data. Although genes selected from MX-based methods are more similar as compared to those from the GM method, both MX-based and GM methods can identify the well-known disease-associated genes for each disease. While MX-based methods can have a slightly higher power than that of the GM method, it is less robust, especially for data with small sample sizes, unknown distributions, and high correlations.
Collapse
Affiliation(s)
- Shuai Chen
- Department of Health Statistics, School of Public Health, Shanxi Medical University, No 56 Xinjian South Road, Yingze District, Taiyuan, Shanxi Province, China
| | - Ziqi Li
- Department of Health Statistics, School of Public Health, Shanxi Medical University, No 56 Xinjian South Road, Yingze District, Taiyuan, Shanxi Province, China
| | - Long Liu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, No 56 Xinjian South Road, Yingze District, Taiyuan, Shanxi Province, China.
| | - Yalu Wen
- Department of Health Statistics, School of Public Health, Shanxi Medical University, No 56 Xinjian South Road, Yingze District, Taiyuan, Shanxi Province, China.
- Department of Statistics, University of Auckland, 38 Princes Street, Auckland Central, Auckland, New Zealand, 1010.
| |
Collapse
|
14
|
Päll T, Luidalepp H, Tenson T, Maiväli Ü. A field-wide assessment of differential expression profiling by high-throughput sequencing reveals widespread bias. PLoS Biol 2023; 21:e3002007. [PMID: 36862747 PMCID: PMC10013925 DOI: 10.1371/journal.pbio.3002007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 03/14/2023] [Accepted: 01/20/2023] [Indexed: 03/03/2023] Open
Abstract
We assess inferential quality in the field of differential expression profiling by high-throughput sequencing (HT-seq) based on analysis of datasets submitted from 2008 to 2020 to the NCBI GEO data repository. We take advantage of the parallel differential expression testing over thousands of genes, whereby each experiment leads to a large set of p-values, the distribution of which can indicate the validity of assumptions behind the test. From a well-behaved p-value set π0, the fraction of genes that are not differentially expressed can be estimated. We found that only 25% of experiments resulted in theoretically expected p-value histogram shapes, although there is a marked improvement over time. Uniform p-value histogram shapes, indicative of <100 actual effects, were extremely few. Furthermore, although many HT-seq workflows assume that most genes are not differentially expressed, 37% of experiments have π0-s of less than 0.5, as if most genes changed their expression level. Most HT-seq experiments have very small sample sizes and are expected to be underpowered. Nevertheless, the estimated π0-s do not have the expected association with N, suggesting widespread problems of experiments with controlling false discovery rate (FDR). Both the fractions of different p-value histogram types and the π0 values are strongly associated with the differential expression analysis program used by the original authors. While we could double the proportion of theoretically expected p-value distributions by removing low-count features from the analysis, this treatment did not remove the association with the analysis program. Taken together, our results indicate widespread bias in the differential expression profiling field and the unreliability of statistical methods used to analyze HT-seq data.
Collapse
Affiliation(s)
- Taavi Päll
- Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, Estonia
| | | | - Tanel Tenson
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Ülo Maiväli
- Institute of Technology, University of Tartu, Tartu, Estonia
- * E-mail:
| |
Collapse
|
15
|
Qin H, Sheng W, Weng J, Li G, Chen Y, Zhu Y, Wang Q, Chen Y, Yang Q, Yu F, Zeng H, Xiong A. Identification and verification of m7G-Related genes as biomarkers for prognosis of sarcoma. Front Genet 2023; 14:1101683. [PMID: 36816047 PMCID: PMC9935680 DOI: 10.3389/fgene.2023.1101683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 01/26/2023] [Indexed: 02/05/2023] Open
Abstract
Background: Increasing evidence indicates a crucial role for N7-methylguanosine (m7G) methylation modification in human disease development, particularly cancer, and aberrant m7G levels are closely associated with tumorigenesis and progression via regulation of the expression of multiple oncogenes and tumor suppressor genes. However, the role of m7G in sarcomas (SARC) has not been adequately evaluated. Materials and methods: Transcriptome and clinical data were gathered from the TCGA database for this study. Normal and SARC groups were compared for the expression of m7G-related genes (m7GRGs). The expression of m7GRGs was verified using real-time quantitative PCR (RT-qPCR) in SARC cell lines. Then, differentially expressed genes (DEGs) were identified between high and low m7GRGs expression groups in SARC samples, and GO enrichment and KEGG pathways were evaluated. Next, prognostic values of m7GRGs were evaluated by Cox regression analysis. Subsequently, a prognostic model was constructed using m7GRGs with good prognostic values by Lasso regression analysis. Besides, the relationships between prognostic m7GRGs and immune infiltration, clinical features, cuproptosis-related genes, and antitumor drugs were investigated in patients with SARC. Finally, a ceRNA regulatory network based on m7GRGs was constructed. Results: The expression of ten m7GRGs was higher in the SARC group than in the control group. DEGs across groups with high and low m7GRGs expression were enriched for adhesion sites and cGMP-PKG. Besides, we constructed a prognostic model that consists of EIF4A1, EIF4G3, NCBP1, and WDR4 m7GRGs for predicting the survival likelihood of sarcoma patients. And the elevated expression of these four prognostic m7GRGs was substantially associated with poor prognosis and elevated expression in SARC cell lines. Moreover, we discovered that these four m7GRGs expressions were negatively correlated with CD4+ T cell levels, dendritic cell level and tumor purity, and positively correlated with tumor mutational burden, microsatellite instability, drug sensitivity and cuproptosis-related genes in patients with sarcomas. Then, a triple regulatory network of mRNA, miRNA, and lncRNA was established. Conclusion: The current study identified EIF4A1, EIF4G3, NCBP1, and WDR4 as prognostic genes for SARC that are associated with m7G.These findings extend our knowledge of m7G methylation in SARC and may guide the development of innovative treatment options.
Collapse
Affiliation(s)
- Haotian Qin
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Weibei Sheng
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Jian Weng
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Guoqing Li
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Yingqi Chen
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Yuanchao Zhu
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Qichang Wang
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Yixiao Chen
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China
| | - Qi Yang
- Department of Medical Ultrasound, Peking University Shenzhen Hospital, Shenzhen, China
| | - Fei Yu
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China,*Correspondence: Fei Yu, ; Hui Zeng, ; Ao Xiong,
| | - Hui Zeng
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China,*Correspondence: Fei Yu, ; Hui Zeng, ; Ao Xiong,
| | - Ao Xiong
- National & Local Joint Engineering Research Center of Orthopaedic Biomaterials, Peking University Shenzhen Hospital, Shenzhen, China,Department of Bone & Joint Surgery, Peking University Shenzhen Hospital, Shenzhen, China,*Correspondence: Fei Yu, ; Hui Zeng, ; Ao Xiong,
| |
Collapse
|
16
|
Etourneau L, Burger T. Challenging Targets or Describing Mismatches? A Comment on Common Decoy Distribution by Madej et al. J Proteome Res 2022; 21:2840-2845. [PMID: 36305797 DOI: 10.1021/acs.jproteome.2c00279] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
In their recent article, Madej et al. (Madej, D.; Wu, L.; Lam, H.Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics. J. Proteome Res.2022, 21 (2), 339-348) proposed an original way to solve the recurrent issue of controlling for the false discovery rate (FDR) in peptide-spectrum-match (PSM) validation. Briefly, they proposed to derive a single precise distribution of decoy matches termed the Common Decoy Distribution (CDD) and to use it to control for FDR during a target-only search. Conceptually, this approach is appealing as it takes the best of two worlds, i.e., decoy-based approaches (which leverage a large-scale collection of empirical mismatches) and decoy-free approaches (which are not subject to the randomness of decoy generation while sparing an additional database search). Interestingly, CDD also corresponds to a middle-of-the-road approach in statistics with respect to the two main families of FDR control procedures: Although historically based on estimating the false-positive distribution, FDR control has recently been demonstrated to be possible thanks to competition between the original variables (in proteomics, target sequences) and their fictional counterparts (in proteomics, decoys). Discriminating between these two theoretical trends is of prime importance for computational proteomics. In addition to highlighting why proteomics was a source of inspiration for theoretical biostatistics, it provides practical insights into the improvements that can be made to FDR control methods used in proteomics, including CDD.
Collapse
Affiliation(s)
- Lucas Etourneau
- Univ. Grenoble Alpes, CNRS, CEA, Inserm, ProFI, FR2048Grenoble, France
| | - Thomas Burger
- Univ. Grenoble Alpes, CNRS, CEA, Inserm, ProFI, FR2048Grenoble, France
| |
Collapse
|
17
|
Wang D, Dai J, Suo C, Wang S, Zhang Y, Chen X. Molecular subtyping of esophageal squamous cell carcinoma by large-scale transcriptional profiling: Characterization, therapeutic targets, and prognostic value. Front Genet 2022; 13:1033214. [DOI: 10.3389/fgene.2022.1033214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 10/28/2022] [Indexed: 11/11/2022] Open
Abstract
The tumor heterogeneity of the transcriptional profiles is independent of genetic variation. Several studies have successfully identified esophageal squamous cell carcinoma (ESCC) subtypes based on the somatic mutation profile and copy number variations on the genome. However, transcriptome-based classification is limited. In this study, we classified 141 patients with ESCC into three subtypes (Subtype 1, Subtype 2, and Subtype 3) via tumor sample gene expression profiling. Differential gene expression (DGE) analysis of paired tumor and normal samples for each subtype revealed significant difference among subtypes. Moreover, the degree of change in the expression levels of most genes gradually increased from Subtype 1 to Subtype 3. Gene set enrichment analysis (GSEA) identified the representative pathways in each subtype: Subtype 1, abnormal Wnt signaling pathway activation; Subtype 2, inhibition of glycogen metabolism; and Subtype 3, downregulation of neutrophil degranulation process. Weighted gene co-expression network analysis (WGCNA) was used to elucidate the finer regulation of biological pathways and discover hub genes. Subsequently, nine hub genes (CORO1A, CD180, SASH3, CD52, CD300A, CD14, DUSP1, KIF14, and MCM2) were validated to be associated with survival in ESCC based on the RNA sequencing (RNA-seq) data from The Cancer Genome Atlas (TCGA) database. The clustering analysis of ESCC granted better understanding of the molecular characteristics of ESCC and led to the discover of new potential therapeutic targets that may contribute to the clinical treatment of ESCC.
Collapse
|
18
|
Kim WK, Olson AW, Mi J, Wang J, Lee DH, Le V, Hiroto A, Aldahl J, Nenninger CH, Buckley AJ, Cardiff R, You S, Sun Z. Aberrant androgen action in prostatic progenitor cells induces oncogenesis and tumor development through IGF1 and Wnt axes. Nat Commun 2022; 13:4364. [PMID: 35902588 PMCID: PMC9334353 DOI: 10.1038/s41467-022-32119-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 07/18/2022] [Indexed: 12/26/2022] Open
Abstract
Androgen/androgen receptor (AR) signaling pathways are essential for prostate tumorigenesis. However, the fundamental mechanisms underlying the AR functioning as a tumor promoter in inducing prostatic oncogenesis still remain elusive. Here, we demonstrate that a subpopulation of prostatic Osr1 (odd skipped-related 1)-lineage cells functions as tumor progenitors in prostate tumorigenesis. Single cell transcriptomic analyses reveal that aberrant AR activation in these cells elevates insulin-like growth factor 1 (IGF1) signaling pathways and initiates oncogenic transformation. Elevating IGF1 signaling further cumulates Wnt/β-catenin pathways in transformed cells to promote prostate tumor development. Correlations between altered androgen, IGF1, and Wnt/β-catenin signaling are also identified in human prostate cancer samples, uncovering a dynamic regulatory loop initiated by the AR through prostate cancer development. Co-inhibition of androgen and Wnt-signaling pathways significantly represses the growth of AR-positive tumor cells in both ex-vivo and in-vivo, implicating co-targeting therapeutic strategies for these pathways to treat advanced prostate cancer.
Collapse
Affiliation(s)
- Won Kyung Kim
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Adam W Olson
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Jiaqi Mi
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Jinhui Wang
- Integrative Genomics Core, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Dong-Hoon Lee
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Vien Le
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Alex Hiroto
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Joseph Aldahl
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Christian H Nenninger
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Alyssa J Buckley
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Robert Cardiff
- Center for Comparative Medicine, University of California at Davis, Davis, CA, USA
| | - Sungyong You
- Division of Cancer Biology and Therapeutics, Departments of Surgery, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Zijie Sun
- Department of Cancer Biology, Cancer Center and Beckman Research Institute, City of Hope, Duarte, CA, USA.
| |
Collapse
|
19
|
Rao X, Cao H, Yu Q, Ou X, Deng R, Huang J. NEAT1/MALAT1/XIST/PKD--Hsa-Mir-101-3p--DLGAP5 Axis as a Novel Diagnostic and Prognostic Biomarker Associated With Immune Cell Infiltration in Bladder Cancer. Front Genet 2022; 13:892535. [PMID: 35873473 PMCID: PMC9305813 DOI: 10.3389/fgene.2022.892535] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 06/16/2022] [Indexed: 12/24/2022] Open
Abstract
Background: The clinical value of the biomarkers of bladder cancer (BC) is limited due to their low sensitivity or specificity. As a biomarker, DLG associated protein 5 (DLGAP5) is a potential cell cycle regulator in cancer cell carcinogenesis. However, its functional part in BC remains unclear. Therefore, this study aims to identify DLGAP5 expression in BC and its potential diagnostic and prognostic values. Eventually, it predicts the possible RNA regulatory pathways of BC.Methods: Data on DLGAP5 expression levels in BC and normal bladder tissues were obtained from The Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) databases. The receiver operating characteristic (ROC), Kaplan–Meier survival curves, and the univariate and multivariate Cox regression analysis determined the diagnostic and prognostic values of DLGAP5 in BC patients. Finally, the StarBase predicted the target RNAs and constructed networks using Cytoscape.Results: DLGAP5 expression was significantly upregulated in BC tissue, verified by the TCGA (p < 0.001), GSE3167, GSE7476, and GSE65635 datasets (p < 0.01). BC patients with increased DLGAP5 had poor overall survival (OS) (p = 0.01), disease specific survival (DSS) (p = 0.006) and progress free interval (DFI) (p = 0.007). The area under the ROC curve (AUC) was 0.913. The multivariate Cox analysis identified that lymphovascular invasion (p = 0.007) and DLGAP5 (p = 0.002) were independent prognostic factors.Conclusion: Increased DLGAP5 expression was closely associated with a poor prognosis in BC patients. In this case, DLGAP5 might be a diagnostic and prognostic biomarker for BC. DLGAP5 expression might be regulated by NEAT1/MALAT1/XIST/PKD--Hsa-mir-101-3p pathways.
Collapse
Affiliation(s)
- Xiaosheng Rao
- Department of Urology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Haiyan Cao
- Department of Nephrology, Tianjin Medical University General Hospital, Tianjin, China
| | - Qingfeng Yu
- Department of Urology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Xiuyu Ou
- Department of Urology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Ruiqi Deng
- Department of Urology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Jinkun Huang
- Department of Urology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
- *Correspondence: Jinkun Huang,
| |
Collapse
|
20
|
Fancello L, Burger T. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics. Genome Biol 2022; 23:132. [PMID: 35725496 PMCID: PMC9208142 DOI: 10.1186/s13059-022-02701-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 06/09/2022] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Proteogenomics aims to identify variant or unknown proteins in bottom-up proteomics, by searching transcriptome- or genome-derived custom protein databases. However, empirical observations reveal that these large proteogenomic databases produce lower-sensitivity peptide identifications. Various strategies have been proposed to avoid this, including the generation of reduced transcriptome-informed protein databases, which only contain proteins whose transcripts are detected in the sample-matched transcriptome. These were found to increase peptide identification sensitivity. Here, we present a detailed evaluation of this approach. RESULTS We establish that the increased sensitivity in peptide identification is in fact a statistical artifact, directly resulting from the limited capability of target-decoy competition to accurately model incorrect target matches when using excessively small databases. As anti-conservative false discovery rates (FDRs) are likely to hamper the robustness of the resulting biological conclusions, we advocate for alternative FDR control methods that are less sensitive to database size. Nevertheless, reduced transcriptome-informed databases are useful, as they reduce the ambiguity of protein identifications, yielding fewer shared peptides. Furthermore, searching the reference database and subsequently filtering proteins whose transcripts are not expressed reduces protein identification ambiguity to a similar extent, but is more transparent and reproducible. CONCLUSIONS In summary, using transcriptome information is an interesting strategy that has not been promoted for the right reasons. While the increase in peptide identifications from searching reduced transcriptome-informed databases is an artifact caused by the use of an FDR control method unsuitable to excessively small databases, transcriptome information can reduce the ambiguity of protein identifications.
Collapse
Affiliation(s)
- Laura Fancello
- CNRS, CEA, Inserm, BioSanté U1292, Profi FR2048, Université Grenoble Alpes, Grenoble, France
| | - Thomas Burger
- CNRS, CEA, Inserm, BioSanté U1292, Profi FR2048, Université Grenoble Alpes, Grenoble, France.
| |
Collapse
|
21
|
Li Y, Ge X, Peng F, Li W, Li JJ. Exaggerated false positives by popular differential expression methods when analyzing human population samples. Genome Biol 2022; 23:79. [PMID: 35292087 PMCID: PMC8922736 DOI: 10.1186/s13059-022-02648-4] [Citation(s) in RCA: 94] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 03/07/2022] [Indexed: 12/05/2022] Open
Abstract
When identifying differentially expressed genes between two conditions using human population RNA-seq samples, we found a phenomenon by permutation analysis: two popular bioinformatics methods, DESeq2 and edgeR, have unexpectedly high false discovery rates. Expanding the analysis to limma-voom, NOISeq, dearseq, and Wilcoxon rank-sum test, we found that FDR control is often failed except for the Wilcoxon rank-sum test. Particularly, the actual FDRs of DESeq2 and edgeR sometimes exceed 20% when the target FDR is 5%. Based on these results, for population-level RNA-seq studies with large sample sizes, we recommend the Wilcoxon rank-sum test.
Collapse
Affiliation(s)
- Yumei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA
| | - Xinzhou Ge
- Department of Statistics, University of California, Los Angeles, CA, 90095, USA
| | - Fanglue Peng
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA.
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA, 90095, USA.
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA, 90095, USA.
- Department of Human Genetics, University of California, Los Angeles, CA, 90095, USA.
- Department of Computational Medicine, University of California, Los Angeles, CA, 90095, USA.
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA.
| |
Collapse
|
22
|
Li R, Lu C, Li X, Chen X, Huang G, Wen Z, Li H, Tao L, Hu Y, Zhao Z, Chen Z, Lai Y. A Four-MicroRNA Panel in Serum as a Potential Biomarker for Screening Renal Cell Carcinoma. Front Genet 2022; 13:897827. [PMID: 35938021 PMCID: PMC9355293 DOI: 10.3389/fgene.2022.897827] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 06/23/2022] [Indexed: 02/05/2023] Open
Abstract
Background: Renal cell carcinoma (RCC) has been a major health problem and is one of the most malignant tumors around the world. Serum microRNA (miRNA) profiles previously have been reported as non-invasive biomarkers in cancer screening. The aim of this study was to explore serum miRNAs as potential biomarkers for screening RCC. Methods: A three-phase study was conducted to explore serum miRNAs as potential biomarkers for screening RCC. In the screening phase, 12 candidate miRNAs related to RCC were selected for further study by the ENCORI database with 517 RCC patients and 71 NCs. A total of 220 participants [108 RCC patients and 112 normal controls (NCs)] were enrolled for training and validation. The dysregulated candidate miRNAs were further confirmed with 30 RCC patients and 30 NCs in the training phase and with 78 RCC patients and 82 NCs in the validation phase. Receiver operating characteristic (ROC) curves and the area under the ROC curve (AUC) were used for assessing the diagnostic value of miRNAs. Bioinformatic analysis and survival analysis were also included in our study. Results: Compared to NCs, six miRNAs (miR-18a-5p, miR-138-5p, miR-141-3p, miR-181b-5p, miR-200a-3p, and miR-363-3p) in serum were significantly dysregulated in RCC patients. A four-miRNA panel was built by combining these candidate miRNAs to improve the diagnostic value with AUC = 0.908. ABCG1 and RNASET2, considered potential target genes of the four-miRNA panel, may play a significant role in the development of RCC. Conclusion: A four-miRNA panel in serum was identified for RCC screening in our study. The four--miRNA panel has a great potential to be a non-invasive biomarker for RCC screening.
Collapse
Affiliation(s)
- Rongkang Li
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- The Fifth Clinical Medical College of Anhui Medical University, Hefei, China
| | - Chong Lu
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- The Fifth Clinical Medical College of Anhui Medical University, Hefei, China
| | - Xinji Li
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- Shantou University Medical College, Shantou, China
| | - Xuan Chen
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- Shantou University Medical College, Shantou, China
| | - Guocheng Huang
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- Shantou University Medical College, Shantou, China
| | - Zhenyu Wen
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- Shantou University Medical College, Shantou, China
| | - Hang Li
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
| | - Lingzhi Tao
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
| | - Yimin Hu
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
| | - Zhengping Zhao
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
| | - Zebo Chen
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- *Correspondence: Zebo Chen, ; Yongqing Lai,
| | - Yongqing Lai
- Department of Urology, Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Clinical College of Anhui Medical University, Shenzhen, China
- The Fifth Clinical Medical College of Anhui Medical University, Hefei, China
- *Correspondence: Zebo Chen, ; Yongqing Lai,
| |
Collapse
|
23
|
Zhang J, Wang Z, Lv H, Li G. Identification and Validation of Potential Candidate Genes of Colorectal Cancer in Response to Fusobacterium nucleatum Infection. Front Genet 2021; 12:690990. [PMID: 34650590 PMCID: PMC8508782 DOI: 10.3389/fgene.2021.690990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 09/13/2021] [Indexed: 11/13/2022] Open
Abstract
Objective: Recent investigations revealed the relationship between Fusobacterium nucleatum (Fn) infection and colorectal cancer (CRC). However, how the host genes changes contribute to CRC in response to Fn infection remains largely unknown. Materials and methods: In the present study, we aimed to comprehensively analyze microarray data obtained from a Caco-2 infection cell model using integrated bioinformatics analysis and further identify and validate potential candidate genes in Fn-infected Caco-2 cells and CRC specimens. Results: We identified 10 hub genes potentially involved in Fn induced tumor initiation and progression. Furthermore, we demonstrated that the expression of centrosomal protein of 55 kDa (CEP55) is significantly higher in Fn-infected Caco-2 cells. Knocking down of CEP55 could arrest the cell cycle progression and induce apoptosis in Fn-infected Caco-2 cells. The expression of CEP55 was positively correlated with the Fn amount in Fn-infected CRC patients, and these patients with high CEP55expression had an obviously poorer differentiation, worse metastasis and decreased cumulative survival rate. Conclusion: CEP55 plays an important role in Fn-infected colon cancer cell growth and cell cycle progression and could be used as a new diagnostic and prognostic biomarker for Fn-infected CRC.
Collapse
Affiliation(s)
- Jiangguo Zhang
- Department of Gastroenterology, Shenzhen Shekou People's Hospital, Shenzhen, China
| | - Zhimo Wang
- Department of Gastroenterology, Shenzhen Nanshan People's Hospital, Shenzhen, China
| | - Hong Lv
- Department of Gastroenterology, Shenzhen Shekou People's Hospital, Shenzhen, China
| | - Guojun Li
- Department of Liver Disease, Shenzhen Third People's Hospital, Shenzhen, China
| |
Collapse
|