1
|
Schmitt AD, Sikkink K, Ahmed AA, Melnyk S, Reid D, Van Meter L, Guest EM, Lansdon LA, Pastinen T, Pushel I, Yoo B, Farooqi MS. Evaluation of Hi-C sequencing for the detection of gene fusions in hematologic and solid pediatric cancer samples. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.10.24306838. [PMID: 38765974 PMCID: PMC11100933 DOI: 10.1101/2024.05.10.24306838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
HiC sequencing is a DNA-based next-generation sequencing method that preserves the 3D conformation of the genome and has shown promise in detecting genomic rearrangements in translational research studies. To evaluate HiC as a potential clinical diagnostic platform, analytical concordance with routine laboratory testing was assessed using primary pediatric leukemia and sarcoma specimens previously positive for clinically significant genomic rearrangements. Archived specimen types tested included viable and nonviable frozen leukemic cells, as well as formalin-fixed paraffin-embedded (FFPE) tumor tissues. Initially, pediatric acute myeloid leukemia (AML) and alveolar rhabdomyosarcoma (A-RMS) specimens with known genomic rearrangements were subjected to HiC analysis to assess analytical concordance. Subsequently, a discovery cohort consisting of AML and acute lymphoblastic leukemia (ALL) cases with no known genomic rearrangements based on prior clinical diagnostic testing were evaluated to determine whether HiC could detect rearrangements. Using a standard sequencing depth of 50 million raw read-pairs per sample, or approximately 5X raw genomic coverage, 100% concordance was observed between HiC and previous clinical cytogenetic and molecular testing. In the discovery cohort, a clinically relevant gene fusion was detected in 45% of leukemia cases (5/11). This study demonstrates the value of HiC sequencing to medical diagnostic testing as it identified several clinically significant rearrangements, including those that might have been missed by current clinical testing workflows. Key points HiC sequencing is a DNA-based next-generation sequencing method that preserves the 3D conformation of the genome, facilitating detection of genomic rearrangements.HiC was 100% concordant with clinical diagnostic testing workflows for detecting clinically significant genomic rearrangements in pediatric leukemia and rhabdomyosarcoma specimens.HiC detected clinically significant genomic rearrangements not previously detected by prior clinical cytogenetic and molecular testing.HiC performed well with archived non-viable and viable frozen leukemic cell samples, as well as archived formalin-fixed paraffin-embedded tumor tissue specimens.
Collapse
|
2
|
Yu Y, Gao R, Luo J. LcDel: deletion variation detection based on clustering and long reads. Front Genet 2024; 15:1404415. [PMID: 38798694 PMCID: PMC11116628 DOI: 10.3389/fgene.2024.1404415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 04/25/2024] [Indexed: 05/29/2024] Open
Abstract
Motivation: Genomic structural variation refers to chromosomal level variations such as genome rearrangement or insertion/deletion, which typically involve larger DNA fragments compared to single nucleotide variations. Deletion is a common type of structural variants in the genome, which may lead to mangy diseases, so the detection of deletions can help to gain insights into the pathogenesis of diseases and provide accurate information for disease diagnosis, treatment, and prevention. Many tools exist for deletion variant detection, but they are still inadequate in some aspects, and most of them ignore the presence of chimeric variants in clustering, resulting in less precise clustering results. Results: In this paper, we present LcDel, which can detect deletion variation based on clustering and long reads. LcDel first finds the candidate deletion sites and then performs the first clustering step using two clustering methods (sliding window-based and coverage-based, respectively) based on the length of the deletion. After that, LcDel immediately uses the second clustering by hierarchical clustering to determine the location and length of the deletion. LcDel is benchmarked against some other structural variation detection tools on multiple datasets, and the results show that LcDel has better detection performance for deletion. The source code is available in https://github.com/cyq1314woaini/LcDel.
Collapse
Affiliation(s)
| | | | - Junwei Luo
- School of Software, Henan Polytechnic University, Jiaozuo, China
| |
Collapse
|
3
|
Xie T, Danieli-Mackay A, Buccarelli M, Barbieri M, Papadionysiou I, D'Alessandris QG, Robens C, Übelmesser N, Vinchure OS, Lauretti L, Fotia G, Schwarz RF, Wang X, Ricci-Vitiani L, Gopalakrishnan J, Pallini R, Papantonis A. Pervasive structural heterogeneity rewires glioblastoma chromosomes to sustain patient-specific transcriptional programs. Nat Commun 2024; 15:3905. [PMID: 38724522 PMCID: PMC11082206 DOI: 10.1038/s41467-024-48053-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 04/18/2024] [Indexed: 05/12/2024] Open
Abstract
Glioblastoma multiforme (GBM) encompasses brain malignancies marked by phenotypic and transcriptional heterogeneity thought to render these tumors aggressive, resistant to therapy, and inevitably recurrent. However, little is known about how the spatial organization of GBM genomes underlies this heterogeneity and its effects. Here, we compile a cohort of 28 patient-derived glioblastoma stem cell-like lines (GSCs) known to reflect the properties of their tumor-of-origin; six of these were primary-relapse tumor pairs from the same patient. We generate and analyze 5 kbp-resolution chromosome conformation capture (Hi-C) data from all GSCs to systematically map thousands of standalone and complex structural variants (SVs) and the multitude of neoloops arising as a result. By combining Hi-C, histone modification, and gene expression data with chromatin folding simulations, we explain how the pervasive, uneven, and idiosyncratic occurrence of neoloops sustains tumor-specific transcriptional programs via the formation of new enhancer-promoter contacts. We also show how even moderately recurrent neoloops can relate to patient-specific vulnerabilities. Together, our data provide a resource for dissecting GBM biology and heterogeneity, as well as for informing therapeutic approaches.
Collapse
Affiliation(s)
- Ting Xie
- Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany
| | - Adi Danieli-Mackay
- Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany
| | - Mariachiara Buccarelli
- Department of Oncology and Molecular Medicine, Istituto Superiore di Sanità, Rome, Italy
| | - Mariano Barbieri
- Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany
| | | | - Q Giorgio D'Alessandris
- Department of Neuroscience, Catholic University School of Medicine, Rome, Italy
- Department of Neuroscience, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma, Italy
| | - Claudia Robens
- Institute for Computational Cancer Biology (ICCB), Center for Integrated Oncology (CIO), Cancer Research Center Cologne Essen (CCCE), University of Cologne, Cologne, Germany
| | - Nadine Übelmesser
- Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany
| | - Omkar Suhas Vinchure
- Institute of Human Genetics, University Hospital and Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Liverana Lauretti
- Department of Neuroscience, Catholic University School of Medicine, Rome, Italy
| | - Giorgio Fotia
- Centre for Advanced Studies, Research and Development in Sardinia (CRS4), Pula, Italy
| | - Roland F Schwarz
- Institute for Computational Cancer Biology (ICCB), Center for Integrated Oncology (CIO), Cancer Research Center Cologne Essen (CCCE), University of Cologne, Cologne, Germany
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin, Germany
| | - Xiaotao Wang
- Institute of Reproduction and Development, Fudan University, Shanghai, China
- Research Units of Embryo Original Diseases, Chinese Academy of Medical Sciences, Shanghai, China
| | - Lucia Ricci-Vitiani
- Department of Oncology and Molecular Medicine, Istituto Superiore di Sanità, Rome, Italy
| | - Jay Gopalakrishnan
- Institute of Human Genetics, University Hospital and Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
- Institute of Human Genetics, Jena University Hospital and Friedrich Schiller University of Jena, Jena, Germany
| | - Roberto Pallini
- Department of Neuroscience, Catholic University School of Medicine, Rome, Italy.
| | - Argyris Papantonis
- Institute of Pathology, University Medical Center Göttingen, Göttingen, Germany.
| |
Collapse
|
4
|
Chang L, Xie Y, Taylor B, Wang Z, Sun J, Tan TR, Bejar R, Chen CC, Furnari FB, Hu M, Ren B. Droplet Hi-C for Fast and Scalable Profiling of Chromatin Architecture in Single Cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.18.590148. [PMID: 38712075 PMCID: PMC11071305 DOI: 10.1101/2024.04.18.590148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Comprehensive analysis of chromatin architecture is crucial for understanding the gene regulatory programs during development and in disease pathogenesis, yet current methods often inadequately address the unique challenges presented by analysis of heterogeneous tissue samples. Here, we introduce Droplet Hi-C, which employs a commercial microfluidic device for high-throughput, single-cell chromatin conformation profiling in droplets. Using Droplet Hi-C, we mapped the chromatin architecture at single-cell resolution from the mouse cortex and analyzed gene regulatory programs in major cortical cell types. Additionally, we used this technique to detect copy number variation (CNV), structural variations (SVs) and extrachromosomal DNA (ecDNA) in cancer cells, revealing clonal dynamics and other oncogenic events during treatment. We further refined this technique to allow for joint profiling of chromatin architecture and transcriptome in single cells, facilitating a more comprehensive exploration of the links between chromatin architecture and gene expression in both normal tissues and tumors. Thus, Droplet Hi-C not only addresses critical gaps in chromatin analysis of heterogeneous tissues but also emerges as a versatile tool enhancing our understanding of gene regulation in health and disease.
Collapse
Affiliation(s)
- Lei Chang
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Yang Xie
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Biomedical Sciences Graduate Program, University of California, San Diego, La Jolla, CA, USA
| | - Brett Taylor
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Medical Scientist Training Program, University of California, San Diego, La Jolla, CA, USA
| | - Zhaoning Wang
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Jiachen Sun
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
- Department of Systems Biology and Bioinformatics, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Tuyet R. Tan
- Moores Cancer Center, UC San Diego, La Jolla, CA, USA
| | - Rafael Bejar
- Moores Cancer Center, UC San Diego, La Jolla, CA, USA
| | - Clark C. Chen
- Department of Neurosurgery, University of Minnesota, Minneapolis, MN, USA
| | - Frank B. Furnari
- Department of Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Center for Epigenomics, Institute for Genomic Medicine, Moores Cancer Center, University of California, San Diego, School of Medicine, La Jolla, CA, USA
| |
Collapse
|
5
|
Yu A, Yesilkanal AE, Thakur A, Wang F, Yang Y, Phillips W, Wu X, Muir A, He X, Spitz F, Yang L. HYENA detects oncogenes activated by distal enhancers in cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.01.09.523321. [PMID: 38076958 PMCID: PMC10705271 DOI: 10.1101/2023.01.09.523321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Somatic structural variations (SVs) in cancer can shuffle DNA content in the genome, relocate regulatory elements, and alter genome organization. Enhancer hijacking occurs when SVs relocate distal enhancers to activate proto-oncogenes. However, most enhancer hijacking studies have only focused on protein-coding genes. Here, we develop a computational algorithm "HYENA" to identify candidate oncogenes (both protein-coding and non-coding) activated by enhancer hijacking based on tumor whole-genome and transcriptome sequencing data. HYENA detects genes whose elevated expression is associated with somatic SVs by using a rank-based regression model. We systematically analyze 1,146 tumors across 25 types of adult tumors and identify a total of 108 candidate oncogenes including many non-coding genes. A long non-coding RNA TOB1-AS1 is activated by various types of SVs in 10% of pancreatic cancers through altered 3-dimensional genome structure. We find that high expression of TOB1-AS1 can promote cell invasion and metastasis. Our study highlights the contribution of genetic alterations in non-coding regions to tumorigenesis and tumor progression.
Collapse
Affiliation(s)
- Anqi Yu
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
| | - Ali E. Yesilkanal
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
| | - Ashish Thakur
- Department of Human Genetics, University of Chicago, Chicago IL, USA
| | - Fan Wang
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
| | - Yang Yang
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
| | - William Phillips
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
| | - Xiaoyang Wu
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
- University of Chicago Comprehensive Cancer Center, Chicago, IL, USA
| | - Alexander Muir
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
- University of Chicago Comprehensive Cancer Center, Chicago, IL, USA
| | - Xin He
- Department of Human Genetics, University of Chicago, Chicago IL, USA
| | - Francois Spitz
- Department of Human Genetics, University of Chicago, Chicago IL, USA
| | - Lixing Yang
- Ben May Department for Cancer Research, University of Chicago, Chicago IL, USA
- Department of Human Genetics, University of Chicago, Chicago IL, USA
- University of Chicago Comprehensive Cancer Center, Chicago, IL, USA
| |
Collapse
|
6
|
Yoon I, Kim U, Song Y, Park T, Lee DS. 3C methods in cancer research: recent advances and future prospects. Exp Mol Med 2024; 56:788-798. [PMID: 38658701 PMCID: PMC11059347 DOI: 10.1038/s12276-024-01236-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Revised: 03/15/2024] [Accepted: 03/25/2024] [Indexed: 04/26/2024] Open
Abstract
In recent years, Hi-C technology has revolutionized cancer research by elucidating the mystery of three-dimensional chromatin organization and its role in gene regulation. This paper explored the impact of Hi-C advancements on cancer research by delving into high-resolution techniques, such as chromatin loops, structural variants, haplotype phasing, and extrachromosomal DNA (ecDNA). Distant regulatory elements interact with their target genes through chromatin loops. Structural variants contribute to the development and progression of cancer. Haplotype phasing is crucial for understanding allele-specific genomic rearrangements and somatic clonal evolution in cancer. The role of ecDNA in driving oncogene amplification and drug resistance in cancer cells has also been revealed. These innovations offer a deeper understanding of cancer biology and the potential for personalized therapies. Despite these advancements, challenges, such as the accurate mapping of repetitive sequences and precise identification of structural variants, persist. Integrating Hi-C with multiomics data is key to overcoming these challenges and comprehensively understanding complex cancer genomes. Thus, Hi-C is a powerful tool for guiding precision medicine in cancer research and treatment.
Collapse
Affiliation(s)
- Insoo Yoon
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Uijin Kim
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Yousuk Song
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Taesoo Park
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Dong-Sung Lee
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea.
| |
Collapse
|
7
|
Mathur R, Wang Q, Schupp PG, Nikolic A, Hilz S, Hong C, Grishanina NR, Kwok D, Stevers NO, Jin Q, Youngblood MW, Stasiak LA, Hou Y, Wang J, Yamaguchi TN, Lafontaine M, Shai A, Smirnov IV, Solomon DA, Chang SM, Hervey-Jumper SL, Berger MS, Lupo JM, Okada H, Phillips JJ, Boutros PC, Gallo M, Oldham MC, Yue F, Costello JF. Glioblastoma evolution and heterogeneity from a 3D whole-tumor perspective. Cell 2024; 187:446-463.e16. [PMID: 38242087 PMCID: PMC10832360 DOI: 10.1016/j.cell.2023.12.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 10/03/2023] [Accepted: 12/06/2023] [Indexed: 01/21/2024]
Abstract
Treatment failure for the lethal brain tumor glioblastoma (GBM) is attributed to intratumoral heterogeneity and tumor evolution. We utilized 3D neuronavigation during surgical resection to acquire samples representing the whole tumor mapped by 3D spatial coordinates. Integrative tissue and single-cell analysis revealed sources of genomic, epigenomic, and microenvironmental intratumoral heterogeneity and their spatial patterning. By distinguishing tumor-wide molecular features from those with regional specificity, we inferred GBM evolutionary trajectories from neurodevelopmental lineage origins and initiating events such as chromothripsis to emergence of genetic subclones and spatially restricted activation of differential tumor and microenvironmental programs in the core, periphery, and contrast-enhancing regions. Our work depicts GBM evolution and heterogeneity from a 3D whole-tumor perspective, highlights potential therapeutic targets that might circumvent heterogeneity-related failures, and establishes an interactive platform enabling 360° visualization and analysis of 3D spatial patterns for user-selected genes, programs, and other features across whole GBM tumors.
Collapse
Affiliation(s)
- Radhika Mathur
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Qixuan Wang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Patrick G Schupp
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Ana Nikolic
- Department of Biochemistry & Molecular Biology, University of Calgary, Calgary, AB
| | - Stephanie Hilz
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Chibo Hong
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Nadia R Grishanina
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Darwin Kwok
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Nicholas O Stevers
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Qiushi Jin
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Mark W Youngblood
- Department of Neurological Surgery, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Lena Ann Stasiak
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Ye Hou
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Juan Wang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Takafumi N Yamaguchi
- Department of Human Genetics, University of California, Los Angeles, Los Angees, CA, USA
| | - Marisa Lafontaine
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Anny Shai
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Ivan V Smirnov
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - David A Solomon
- Department of Pathology, University of California San Francisco, San Francisco, CA, USA
| | - Susan M Chang
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Shawn L Hervey-Jumper
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Mitchel S Berger
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Janine M Lupo
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Hideho Okada
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Joanna J Phillips
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Paul C Boutros
- Department of Human Genetics, University of California, Los Angeles, Los Angees, CA, USA
| | - Marco Gallo
- Department of Biochemistry & Molecular Biology, University of Calgary, Calgary, AB; Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital, Houston, TX, USA
| | - Michael C Oldham
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA
| | - Feng Yue
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA; Robert H. Lurie Comprehensive Cancer Center, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA.
| | - Joseph F Costello
- Department of Neurological Surgery, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
8
|
Hua D, Gu M, Zhang X, Du Y, Xie H, Qi L, Du X, Bai Z, Zhu X, Tian D. DiffDomain enables identification of structurally reorganized topologically associating domains. Nat Commun 2024; 15:502. [PMID: 38218905 PMCID: PMC10787792 DOI: 10.1038/s41467-024-44782-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 01/02/2024] [Indexed: 01/15/2024] Open
Abstract
Topologically associating domains (TADs) are critical structural units in three-dimensional genome organization of mammalian genome. Dynamic reorganizations of TADs between health and disease states are associated with essential genome functions. However, computational methods for identifying reorganized TADs are still in the early stages of development. Here, we present DiffDomain, an algorithm leveraging high-dimensional random matrix theory to identify structurally reorganized TADs using high-throughput chromosome conformation capture (Hi-C) contact maps. Method comparison using multiple real Hi-C datasets reveals that DiffDomain outperforms alternative methods for false positive rates, true positive rates, and identifying a new subtype of reorganized TADs. Applying DiffDomain to Hi-C data from different cell types and disease states demonstrates its biological relevance. Identified reorganized TADs are associated with structural variations and epigenomic changes such as changes in CTCF binding sites. By applying to a single-cell Hi-C data from mouse neuronal development, DiffDomain can identify reorganized TADs between cell types with reasonable reproducibility using pseudo-bulk Hi-C data from as few as 100 cells per condition. Moreover, DiffDomain reveals differential cell-to-population variability and heterogeneous cell-to-cell variability in TADs. Therefore, DiffDomain is a statistically sound method for better comparative analysis of TADs using both Hi-C and single-cell Hi-C data.
Collapse
Affiliation(s)
- Dunming Hua
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Ming Gu
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Xiao Zhang
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Yanyi Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Hangcheng Xie
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Li Qi
- Chongqing Municipal Center for Disease Control and Prevention, Chongqing, 400042, China
| | - Xiangjun Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Zhidong Bai
- KLASMOE & School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, 130024, China
| | - Xiaopeng Zhu
- MyCellome LLC., Allison Park, PA, 15101, USA
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Dechao Tian
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China.
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| |
Collapse
|
9
|
Erdmann-Pham DD, Batra SS, Turkalo TK, Durbin J, Blanchette M, Yeh I, Shain H, Bastian BC, Song YS, Rokhsar DS, Hockemeyer D. Tracing cancer evolution and heterogeneity using Hi-C. Nat Commun 2023; 14:7111. [PMID: 37932252 PMCID: PMC10628133 DOI: 10.1038/s41467-023-42651-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Accepted: 10/09/2023] [Indexed: 11/08/2023] Open
Abstract
Chromosomal rearrangements can initiate and drive cancer progression, yet it has been challenging to evaluate their impact, especially in genetically heterogeneous solid cancers. To address this problem we developed HiDENSEC, a new computational framework for analyzing chromatin conformation capture in heterogeneous samples that can infer somatic copy number alterations, characterize large-scale chromosomal rearrangements, and estimate cancer cell fractions. After validating HiDENSEC with in silico and in vitro controls, we used it to characterize chromosome-scale evolution during melanoma progression in formalin-fixed tumor samples from three patients. The resulting comprehensive annotation of the genomic events includes copy number neutral translocations that disrupt tumor suppressor genes such as NF1, whole chromosome arm exchanges that result in loss of CDKN2A, and whole-arm copy-number neutral loss of homozygosity involving PTEN. These findings show that large-scale chromosomal rearrangements occur throughout cancer evolution and that characterizing these events yields insights into drivers of melanoma progression.
Collapse
Affiliation(s)
- Dan Daniel Erdmann-Pham
- Department of Mathematics, University of California, Berkeley, CA, 94720, USA
- Department of Statistics, Stanford University, Stanford, CA, 94305, USA
| | - Sanjit Singh Batra
- Computer Science Division, University of California, Berkeley, CA, 94720, USA
| | - Timothy K Turkalo
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
| | - James Durbin
- Dovetail Genomics, Enterprise Way, Scotts Valley, CA, 95066, USA
| | - Marco Blanchette
- Dovetail Genomics, Enterprise Way, Scotts Valley, CA, 95066, USA
| | - Iwei Yeh
- Department of Dermatology and Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, 94143, USA
- Department of Pathology, University of California, San Francisco, CA, 94143, USA
| | - Hunter Shain
- Department of Dermatology and Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, 94143, USA
| | - Boris C Bastian
- Department of Dermatology and Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, 94143, USA
- Department of Pathology, University of California, San Francisco, CA, 94143, USA
| | - Yun S Song
- Computer Science Division, University of California, Berkeley, CA, 94720, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, 94158, USA.
- Department of Statistics, University of California, Berkeley, CA, 94720, USA.
| | - Daniel S Rokhsar
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, 94158, USA.
- Innovative Genomics Institute, University of California, Berkeley, CA, 94720, USA.
- Okinawa Institute for Science and Technology, Tancha, Okinawa, Japan.
| | - Dirk Hockemeyer
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, 94158, USA.
- Innovative Genomics Institute, University of California, Berkeley, CA, 94720, USA.
| |
Collapse
|
10
|
龚 海, 麻 付, 张 晓. [Advances in methods and applications of single-cell Hi-C data analysis]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2023; 40:1033-1039. [PMID: 37879935 PMCID: PMC10600426 DOI: 10.7507/1001-5515.202303046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 08/29/2023] [Indexed: 10/27/2023]
Abstract
Chromatin three-dimensional genome structure plays a key role in cell function and gene regulation. Single-cell Hi-C techniques can capture genomic structure information at the cellular level, which provides an opportunity to study changes in genomic structure between different cell types. Recently, some excellent computational methods have been developed for single-cell Hi-C data analysis. In this paper, the available methods for single-cell Hi-C data analysis were first reviewed, including preprocessing of single-cell Hi-C data, multi-scale structure recognition based on single-cell Hi-C data, bulk-like Hi-C contact matrix generation based on single-cell Hi-C data sets, pseudo-time series analysis, and cell classification. Then the application of single-cell Hi-C data in cell differentiation and structural variation was described. Finally, the future development direction of single-cell Hi-C data analysis was also prospected.
Collapse
Affiliation(s)
- 海燕 龚
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 付强 麻
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 晓彤 张
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| |
Collapse
|
11
|
Dileep V, Boix CA, Mathys H, Marco A, Welch GM, Meharena HS, Loon A, Jeloka R, Peng Z, Bennett DA, Kellis M, Tsai LH. Neuronal DNA double-strand breaks lead to genome structural variations and 3D genome disruption in neurodegeneration. Cell 2023; 186:4404-4421.e20. [PMID: 37774679 PMCID: PMC10697236 DOI: 10.1016/j.cell.2023.08.038] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 04/02/2023] [Accepted: 08/29/2023] [Indexed: 10/01/2023]
Abstract
Persistent DNA double-strand breaks (DSBs) in neurons are an early pathological hallmark of neurodegenerative diseases including Alzheimer's disease (AD), with the potential to disrupt genome integrity. We used single-nucleus RNA-seq in human postmortem prefrontal cortex samples and found that excitatory neurons in AD were enriched for somatic mosaic gene fusions. Gene fusions were particularly enriched in excitatory neurons with DNA damage repair and senescence gene signatures. In addition, somatic genome structural variations and gene fusions were enriched in neurons burdened with DSBs in the CK-p25 mouse model of neurodegeneration. Neurons enriched for DSBs also had elevated levels of cohesin along with progressive multiscale disruption of the 3D genome organization aligned with transcriptional changes in synaptic, neuronal development, and histone genes. Overall, this study demonstrates the disruption of genome stability and the 3D genome organization by DSBs in neurons as pathological steps in the progression of neurodegenerative diseases.
Collapse
Affiliation(s)
- Vishnu Dileep
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Carles A Boix
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Hansruedi Mathys
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Asaf Marco
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Gwyneth M Welch
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Hiruy S Meharena
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Anjanet Loon
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Ritika Jeloka
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Zhuyu Peng
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | - Manolis Kellis
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Li-Huei Tsai
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
12
|
Raffo A, Paulsen J. The shape of chromatin: insights from computational recognition of geometric patterns in Hi-C data. Brief Bioinform 2023; 24:bbad302. [PMID: 37646128 PMCID: PMC10516369 DOI: 10.1093/bib/bbad302] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 07/05/2023] [Accepted: 08/03/2023] [Indexed: 09/01/2023] Open
Abstract
The three-dimensional organization of chromatin plays a crucial role in gene regulation and cellular processes like deoxyribonucleic acid (DNA) transcription, replication and repair. Hi-C and related techniques provide detailed views of spatial proximities within the nucleus. However, data analysis is challenging partially due to a lack of well-defined, underpinning mathematical frameworks. Recently, recognizing and analyzing geometric patterns in Hi-C data has emerged as a powerful approach. This review provides a summary of algorithms for automatic recognition and analysis of geometric patterns in Hi-C data and their correspondence with chromatin structure. We classify existing algorithms on the basis of the data representation and pattern recognition paradigm they make use of. Finally, we outline some of the challenges ahead and promising future directions.
Collapse
Affiliation(s)
- Andrea Raffo
- Department of Biosciences, University of Oslo, 0316 Oslo, Norway
| | - Jonas Paulsen
- Department of Biosciences, University of Oslo, 0316 Oslo, Norway
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0316 Oslo, Norway
| |
Collapse
|
13
|
Halawani R, Buchert M, Chen YPP. Deep learning exploration of single-cell and spatially resolved cancer transcriptomics to unravel tumour heterogeneity. Comput Biol Med 2023; 164:107274. [PMID: 37506451 DOI: 10.1016/j.compbiomed.2023.107274] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 07/03/2023] [Accepted: 07/16/2023] [Indexed: 07/30/2023]
Abstract
Tumour heterogeneity is one of the critical confounding aspects in decoding tumour growth. Malignant cells display variations in their gene transcription profiles and mutation spectra even when originating from a single progenitor cell. Single-cell and spatial transcriptomics sequencing have recently emerged as key technologies for unravelling tumour heterogeneity. Single-cell sequencing promotes individual cell-type identification through transcriptome-wide gene expression measurements of each cell. Spatial transcriptomics facilitates identification of cell-cell interactions and the structural organization of heterogeneous cells within a tumour tissue through associating spatial RNA abundance of cells at distinct spots in the tissue section. However, extracting features and analyzing single-cell and spatial transcriptomics data poses challenges. Single-cell transcriptome data is extremely noisy and its sparse nature and dropouts can lead to misinterpretation of gene expression and the misclassification of cell types. Deep learning predictive power can overcome data challenges, provide high-resolution analysis and enhance precision oncology applications that involve early cancer prognosis, diagnosis, patient survival estimation and anti-cancer therapy planning. In this paper, we provide a background to and review of the recent progress of deep learning frameworks to investigate tumour heterogeneity using both single-cell and spatial transcriptomics data types.
Collapse
Affiliation(s)
- Raid Halawani
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, Australia
| | - Michael Buchert
- School of Cancer Medicine, La Trobe University, Melbourne, Victoria, Australia; Olivia Newton-John Cancer Research Institute, Melbourne, Victoria, Australia
| | - Yi-Ping Phoebe Chen
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, Australia.
| |
Collapse
|
14
|
Abstract
DNA sequencing has revolutionized medicine over recent decades. However, analysis of large structural variation and repetitive DNA, a hallmark of human genomes, has been limited by short-read technology, with read lengths of 100-300 bp. Long-read sequencing (LRS) permits routine sequencing of human DNA fragments tens to hundreds of kilobase pairs in size, using both real-time sequencing by synthesis and nanopore-based direct electronic sequencing. LRS permits analysis of large structural variation and haplotypic phasing in human genomes and has enabled the discovery and characterization of rare pathogenic structural variants and repeat expansions. It has also recently enabled the assembly of a complete, gapless human genome that includes previously intractable regions, such as highly repetitive centromeres and homologous acrocentric short arms. With the addition of protocols for targeted enrichment, direct epigenetic DNA modification detection, and long-range chromatin profiling, LRS promises to launch a new era of understanding of genetic diversity and pathogenic mutations in human populations.
Collapse
Affiliation(s)
- Peter E Warburton
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert P Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; ,
- Center for Advanced Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
15
|
Fu Y, Wang X, Yue F. EagleC Explorer: A desktop application for interactively detecting and visualizing SVs and enhancer hijacking on Hi-C contact maps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.07.552228. [PMID: 37609202 PMCID: PMC10441372 DOI: 10.1101/2023.08.07.552228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
It has been shown that Hi-C can be used as a powerful tool to detect structural variations (SVs) and enhancer hijacking events. However, there has been no existing programs that can directly visualize and detect such events on a personal computer, which hinders the broad adaption of the technology for intuitive discovery in cancer studies. Here, we introduce the EagleC Explorer, a desktop software that is specifically designed for exploring Hi-C and other chromatin contact data in cancer genomes. EagleC Explorer has a set of unique features, including 1) conveniently visualizing global and local Hi-C data; 2) interactively detecting SVs on a Hi-C map for any user-selected region on screen within seconds, using a deep-learning model; 3) reconstructing local Hi-C map surrounding user-provided SVs and generating publication-quality figures; 4) detecting enhancer hijacking events for any user-suggested regions on screen. In addition, EagleC Explorer can also incorporate other genomic tracks such as RNA-Seq or ChIP-Seq to facilitate scientists for integrative data analysis and making novel discoveries.
Collapse
|
16
|
Botten GA, Zhang Y, Dudnyk K, Kim YJ, Liu X, Sanders JT, Imanci A, Droin N, Cao H, Kaphle P, Dickerson KE, Kumar KR, Chen M, Chen W, Solary E, Ly P, Zhou J, Xu J. Structural variation cooperates with permissive chromatin to control enhancer hijacking-mediated oncogenic transcription. Blood 2023; 142:336-351. [PMID: 36947815 PMCID: PMC10447518 DOI: 10.1182/blood.2022017555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 02/24/2023] [Accepted: 03/10/2023] [Indexed: 03/24/2023] Open
Abstract
Structural variants (SVs) involving enhancer hijacking can rewire chromatin topologies to cause oncogene activation in human cancers, including hematologic malignancies; however, because of the lack of tools to assess their effects on gene regulation and chromatin organization, the molecular determinants for the functional output of enhancer hijacking remain poorly understood. Here, we developed a multimodal approach to integrate genome sequencing, chromosome conformation, chromatin state, and transcriptomic alteration for quantitative analysis of transcriptional effects and structural reorganization imposed by SVs in leukemic genomes. We identified known and new pathogenic SVs, including recurrent t(5;14) translocations that cause the hijacking of BCL11B enhancers for the allele-specific activation of TLX3 in a subtype of pediatric leukemia. Epigenetic perturbation of SV-hijacked BCL11B enhancers impairs TLX3 transcription, which are required for the growth of t(5;14) leukemia cells. By CRISPR engineering of patient-derived t(5;14) in isogenic leukemia cells, we uncovered a new mechanism whereby the transcriptional output of SV-induced BCL11B enhancer hijacking is dependent on the loss of DNA hypermethylation at the TLX3 promoter. Our results highlight the importance of the cooperation between genetic alteration and permissive chromatin as a critical determinant of SV-mediated oncogene activation, with implications for understanding aberrant gene transcription after epigenetic therapies in patients with leukemia. Hence, leveraging the interdependency of genetic alteration on chromatin variation may provide new opportunities to reprogram gene regulation as targeted interventions in human disease.
Collapse
Affiliation(s)
- Giovanni A. Botten
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Yuannyu Zhang
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pathology, Center of Excellence for Leukemia Studies, St. Jude Children’s Research Hospital, Memphis, TN
| | - Kseniia Dudnyk
- Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX
| | - Yoon Jung Kim
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Xin Liu
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Jacob T. Sanders
- Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX
| | - Aygun Imanci
- Université Paris-Saclay, INSERM U1287, Gustave Roussy Cancer Center, Villejuif, France
| | - Nathalie Droin
- Université Paris-Saclay, INSERM U1287, Gustave Roussy Cancer Center, Villejuif, France
| | - Hui Cao
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pathology, Center of Excellence for Leukemia Studies, St. Jude Children’s Research Hospital, Memphis, TN
| | - Pranita Kaphle
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Kathryn E. Dickerson
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Kirthi R. Kumar
- Medical City Dallas, Medical City Children’s Hospital, Dallas, TX
| | - Mingyi Chen
- Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX
| | - Weina Chen
- Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX
| | - Eric Solary
- Université Paris-Saclay, INSERM U1287, Gustave Roussy Cancer Center, Villejuif, France
| | - Peter Ly
- Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX
| | - Jian Zhou
- Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX
| | - Jian Xu
- Children’s Medical Center Research Institute, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX
- Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX
- Department of Pathology, Center of Excellence for Leukemia Studies, St. Jude Children’s Research Hospital, Memphis, TN
| |
Collapse
|
17
|
Stephenson-Gussinye A, Furlan-Magaril M. Chromosome conformation capture technologies as tools to detect structural variations and their repercussion in chromatin 3D configuration. Front Cell Dev Biol 2023; 11:1219968. [PMID: 37457299 PMCID: PMC10346842 DOI: 10.3389/fcell.2023.1219968] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 06/09/2023] [Indexed: 07/18/2023] Open
Abstract
3D genome organization regulates gene expression in different physiological and pathological contexts. Characterization of chromatin structure at different scales has provided information about how the genome organizes in the nuclear space, from chromosome territories, compartments of euchromatin and heterochromatin, topologically associated domains to punctual chromatin loops between genomic regulatory elements and gene promoters. In recent years, chromosome conformation capture technologies have also been used to characterize structural variations (SVs) de novo in pathological conditions. The study of SVs in cancer, has brought information about transcriptional misregulation that relates directly to the incidence and prognosis of the disease. For example, gene fusions have been discovered arising from chromosomal translocations that upregulate oncogenes expression, and other types of SVs have been described that alter large genomic regions encompassing many genes. However, studying SVs in 2D cannot capture all their regulatory implications in the genome. Recently, several bioinformatic tools have been developed to identify and classify SVs from chromosome conformation capture data and clarify how they impact chromatin structure in 3D, resulting in transcriptional misregulation. Here, we review recent literature concerning bioinformatic tools to characterize SVs from chromosome conformation capture technologies and exemplify their vast potential to rebuild the 3D landscape of genomes in cancer. The study of SVs from the 3D perspective can produce essential information about drivers, molecular targets, and disease evolution.
Collapse
|
18
|
Shinde SS, Sharma A, Vijay N. Decoding the fibromelanosis locus complex chromosomal rearrangement of black-bone chicken: genetic differentiation, selective sweeps and protein-coding changes in Kadaknath chicken. Front Genet 2023; 14:1180658. [PMID: 37424723 PMCID: PMC10325862 DOI: 10.3389/fgene.2023.1180658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/05/2023] [Indexed: 07/11/2023] Open
Abstract
Black-bone chicken (BBC) meat is popular for its distinctive taste and texture. A complex chromosomal rearrangement at the fibromelanosis (Fm) locus on the 20th chromosome results in increased endothelin-3 (EDN3) gene expression and is responsible for melanin hyperpigmentation in BBC. We use public long-read sequencing data of the Silkie breed to resolve high-confidence haplotypes at the Fm locus spanning both Dup1 and Dup2 regions and establish that the Fm_2 scenario is correct of the three possible scenarios of the complex chromosomal rearrangement. The relationship between Chinese and Korean BBC breeds with Kadaknath native to India is underexplored. Our data from whole-genome re-sequencing establish that all BBC breeds, including Kadaknath, share the complex chromosomal rearrangement junctions at the fibromelanosis (Fm) locus. We also identify two Fm locus proximal regions (∼70 Kb and ∼300 Kb) with signatures of selection unique to Kadaknath. These regions harbor several genes with protein-coding changes, with the bactericidal/permeability-increasing-protein-like gene having two Kadaknath-specific changes within protein domains. Our results indicate that protein-coding changes in the bactericidal/permeability-increasing-protein-like gene hitchhiked with the Fm locus in Kadaknath due to close physical linkage. Identifying this Fm locus proximal selective sweep sheds light on the genetic distinctiveness of Kadaknath compared to other BBC.
Collapse
Affiliation(s)
| | | | - Nagarjun Vijay
- Computational Evolutionary Genomics Lab, Department of Biological Sciences, IISER Bhopal, Bhauri, Madhya Pradesh, India
| |
Collapse
|
19
|
Xu M, Chen Z, Zheng J, Zhao Q, Yuan Z. Artificial Intelligence-Aided Optical Imaging for Cancer Theranostics. Semin Cancer Biol 2023:S1044-579X(23)00094-9. [PMID: 37302519 DOI: 10.1016/j.semcancer.2023.06.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 06/08/2023] [Accepted: 06/08/2023] [Indexed: 06/13/2023]
Abstract
The use of artificial intelligence (AI) to assist biomedical imaging have demonstrated its high accuracy and high efficiency in medical decision-making for individualized cancer medicine. In particular, optical imaging methods are able to visualize both the structural and functional information of tumors tissues with high contrast, low cost, and noninvasive property. However, no systematic work has been performed to inspect the recent advances on AI-aided optical imaging for cancer theranostics. In this review, we demonstrated how AI can guide optical imaging methods to improve the accuracy on tumor detection, automated analysis and prediction of its histopathological section, its monitoring during treatment, and its prognosis by using computer vision, deep learning and natural language processing. By contrast, the optical imaging techniques involved mainly consisted of various tomography and microscopy imaging methods such as optical endoscopy imaging, optical coherence tomography, photoacoustic imaging, diffuse optical tomography, optical microscopy imaging, Raman imaging, and fluorescent imaging. Meanwhile, existing problems, possible challenges and future prospects for AI-aided optical imaging protocol for cancer theranostics were also discussed. It is expected that the present work can open a new avenue for precision oncology by using AI and optical imaging tools.
Collapse
Affiliation(s)
- Mengze Xu
- Center for Cognition and Neuroergonomics, State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Zhuhai, China; Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Cognitive and Brain Sciences, University of Macau, Macau SAR, China
| | - Zhiyi Chen
- Institute of Medical Imaging, Hengyang Medical School, University of South China, Hengyang, China
| | - Junxiao Zheng
- Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Cognitive and Brain Sciences, University of Macau, Macau SAR, China
| | - Qi Zhao
- Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Zhen Yuan
- Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Cognitive and Brain Sciences, University of Macau, Macau SAR, China.
| |
Collapse
|
20
|
Wang X, Yue F. HiCLift: a fast and efficient tool for converting chromatin interaction data between genome assemblies. Bioinformatics 2023; 39:btad389. [PMID: 37335863 PMCID: PMC10313346 DOI: 10.1093/bioinformatics/btad389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 05/30/2023] [Accepted: 06/15/2023] [Indexed: 06/21/2023] Open
Abstract
MOTIVATION With the continuous effort to improve the quality of human reference genome and the generation of more and more personal genomes, the conversion of genomic coordinates between genome assemblies is critical in many integrative and comparative studies. While tools have been developed for such task for linear genome signals such as ChIP-Seq, no tool exists to convert genome assemblies for chromatin interaction data, despite the importance of three-dimensional genome organization in gene regulation and disease. RESULTS Here, we present HiCLift, a fast and efficient tool that can convert the genomic coordinates of chromatin contacts such as Hi-C and Micro-C from one assembly to another, including the latest T2T-CHM13 genome. Comparing with the strategy of directly remapping raw reads to a different genome, HiCLift runs on average 42 times faster (hours vs. days), while outputs nearly identical contact matrices. More importantly, as HiCLift does not need to remap the raw reads, it can directly convert human patient sample data, where the raw sequencing reads are sometimes hard to acquire or not available. AVAILABILITY AND IMPLEMENTATION HiCLift is publicly available at https://github.com/XiaoTaoWang/HiCLift.
Collapse
Affiliation(s)
- Xiaotao Wang
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, United States
| | - Feng Yue
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, United States
- Robert H. Lurie Comprehensive Cancer Center of Northwestern University, Chicago, IL 60611, United States
| |
Collapse
|
21
|
Kresoja KP, Unterhuber M, Wachter R, Thiele H, Lurz P. A cardiologist's guide to machine learning in cardiovascular disease prognosis prediction. Basic Res Cardiol 2023; 118:10. [PMID: 36939941 PMCID: PMC10027799 DOI: 10.1007/s00395-023-00982-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 02/21/2023] [Accepted: 02/26/2023] [Indexed: 03/21/2023]
Abstract
A modern-day physician is faced with a vast abundance of clinical and scientific data, by far surpassing the capabilities of the human mind. Until the last decade, advances in data availability have not been accompanied by analytical approaches. The advent of machine learning (ML) algorithms might improve the interpretation of complex data and should help to translate the near endless amount of data into clinical decision-making. ML has become part of our everyday practice and might even further change modern-day medicine. It is important to acknowledge the role of ML in prognosis prediction of cardiovascular disease. The present review aims on preparing the modern physician and researcher for the challenges that ML might bring, explaining basic concepts but also caveats that might arise when using these methods. Further, a brief overview of current established classical and emerging concepts of ML disease prediction in the fields of omics, imaging and basic science is presented.
Collapse
Affiliation(s)
- Karl-Patrik Kresoja
- Department of Internal Medicine/Cardiology, Heart Center Leipzig at University of Leipzig, Struempellstr. 39, 04289, Leipzig, Germany
- Leipzig Heart Institute, Leipzig Heart Science at Heart Center Leipzig, Leipzig, Germany
| | - Matthias Unterhuber
- Department of Internal Medicine/Cardiology, Heart Center Leipzig at University of Leipzig, Struempellstr. 39, 04289, Leipzig, Germany
- Leipzig Heart Institute, Leipzig Heart Science at Heart Center Leipzig, Leipzig, Germany
| | - Rolf Wachter
- Department of Cardiology, University Hospital Leipzig, Leipzig, Germany
- Clinic for Cardiology and Pneumology, University Medicine Göttingen, Göttingen, Germany
- German Cardiovascular Research Center (DZHK), Partner Site Göttingen, Göttingen, Germany
| | - Holger Thiele
- Department of Internal Medicine/Cardiology, Heart Center Leipzig at University of Leipzig, Struempellstr. 39, 04289, Leipzig, Germany.
- Leipzig Heart Institute, Leipzig Heart Science at Heart Center Leipzig, Leipzig, Germany.
| | - Philipp Lurz
- Department of Internal Medicine/Cardiology, Heart Center Leipzig at University of Leipzig, Struempellstr. 39, 04289, Leipzig, Germany.
- Leipzig Heart Institute, Leipzig Heart Science at Heart Center Leipzig, Leipzig, Germany.
| |
Collapse
|
22
|
Ye Y, Zhang S, Gao L, Zhu Y, Zhang J. Deciphering Hierarchical Chromatin Domains and Preference of Genomic Position Forming Boundaries in Single Mouse Embryonic Stem Cells. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2205162. [PMID: 36658736 PMCID: PMC10015865 DOI: 10.1002/advs.202205162] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/15/2022] [Indexed: 06/17/2023]
Abstract
The exploration of single-cell 3D genome maps reveals that chromatin domains are indeed physical structures presenting in single cells, and domain boundaries vary from cell to cell. However, systematic analysis of the association between regulatory factor binding and elements and the formation of chromatin domains in single cells has not yet emerged. To this end, a hierarchical chromatin domain structure identification algorithm (named as HiCS) is first developed from individual single-cell Hi-C maps, with superior performance in both accuracy and efficiency. The results suggest that in addition to the known CTCF-cohesin complex, Polycomb, TrxG, pluripotent protein families, and other multiple factors also contribute to shaping chromatin domain boundaries in single embryonic stem cells. Different cooperation patterns of these regulatory factors drive genomic position categories with differential preferences forming boundaries, and the most extensive six types of retrotransposons are differentially distributed in these genomic position categories with preferential localization. The above results suggest that these different retrotransposons within genomic regions interplay with regulatory factors navigating the preference of genomic positions forming boundaries, driving the formation of higher-order chromatin structures, and thus regulating cell functions in single mouse embryonic stem cells.
Collapse
Affiliation(s)
- Yusen Ye
- School of Computer Science and TechnologyXidian UniversityXi'anShaanxi710071P. R. China
| | - Shihua Zhang
- NCMISCEMSRCSDSAcademy of Mathematics and Systems ScienceChinese Academy of SciencesBeijing100190P. R. China
- School of Mathematical SciencesUniversity of Chinese Academy of SciencesBeijing100049P. R. China
- Center for Excellence in Animal Evolution and GeneticsChinese Academy of SciencesKunming650223P. R. China
| | - Lin Gao
- School of Computer Science and TechnologyXidian UniversityXi'anShaanxi710071P. R. China
| | - Yuqing Zhu
- Center for Stem Cell and Translational MedicineSchool of Life SciencesAnhui UniversityHefeiAnhui230601P. R. China
| | - Jin Zhang
- Center for Stem Cell and Regenerative MedicineDepartment of Basic Medical Sciences, and Bone Marrow Transplantation Center of the First Affiliated HospitalZhejiang University School of MedicineHangzhouZhejiang310003P. R. China
- Zhejiang Laboratory for Systems and Precision MedicineZhejiang University Medical CenterHangzhouZhejiang311121P. R. China
- Institute of HematologyZhejiang UniversityHangzhouZhejiang310058P. R. China
- Center of Gene/Cell Engineering and Genome MedicineHangzhouZhejiang310058P. R. China
| |
Collapse
|
23
|
Wang H, Wang LS, Schellenberg G, Lee WP. The role of structural variations in Alzheimer's disease and other neurodegenerative diseases. Front Aging Neurosci 2023; 14:1073905. [PMID: 36846102 PMCID: PMC9944073 DOI: 10.3389/fnagi.2022.1073905] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 12/31/2022] [Indexed: 02/10/2023] Open
Abstract
Dozens of single nucleotide polymorphisms (SNPs) related to Alzheimer's disease (AD) have been discovered by large scale genome-wide association studies (GWASs). However, only a small portion of the genetic component of AD can be explained by SNPs observed from GWAS. Structural variation (SV) can be a major contributor to the missing heritability of AD; while SV in AD remains largely unexplored as the accurate detection of SVs from the widely used array-based and short-read technology are still far from perfect. Here, we briefly summarized the strengths and weaknesses of available SV detection methods. We reviewed the current landscape of SV analysis in AD and SVs that have been found associated with AD. Particularly, the importance of currently less explored SVs, including insertions, inversions, short tandem repeats, and transposable elements in neurodegenerative diseases were highlighted.
Collapse
Affiliation(s)
- Hui Wang
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
- Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Li-San Wang
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
- Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Gerard Schellenberg
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
- Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Wan-Ping Lee
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
- Penn Neurodegeneration Genomics Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
24
|
Maslova A, Plotnikov V, Nuriddinov M, Gridina M, Fishman V, Krasikova A. Hi-C analysis of genomic contacts revealed karyotype abnormalities in chicken HD3 cell line. BMC Genomics 2023; 24:66. [PMID: 36750787 PMCID: PMC9906895 DOI: 10.1186/s12864-023-09158-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 01/31/2023] [Indexed: 02/09/2023] Open
Abstract
BACKGROUND Karyotype abnormalities are frequent in immortalized continuous cell lines either transformed or derived from primary tumors. Chromosomal rearrangements can cause dramatic changes in gene expression and affect cellular phenotype and behavior during in vitro culture. Structural variations of chromosomes in many continuous mammalian cell lines are well documented, but chromosome aberrations in cell lines from other vertebrate models often remain understudied. The chicken LSCC-HD3 cell line (HD3), generated from erythroid precursors, was used as an avian model for erythroid differentiation and lineage-specific gene expression. However, karyotype abnormalities in the HD3 cell line were not assessed. In the present study, we applied high-throughput chromosome conformation capture to analyze 3D genome organization and to detect chromosome rearrangements in the HD3 cell line. RESULTS We obtained Hi-C maps of genomic interactions for the HD3 cell line and compared A/B compartments and topologically associating domains between HD3 and several other cell types. By analysis of contact patterns in the Hi-C maps of HD3 cells, we identified more than 25 interchromosomal translocations of regions ≥ 200 kb on both micro- and macrochromosomes. We classified most of the observed translocations as unbalanced, leading to the formation of heteromorphic chromosomes. In many cases of microchromosome rearrangements, an entire microchromosome together with other macro- and microchromosomes participated in the emergence of a derivative chromosome, resembling "chromosomal fusions'' between acrocentric microchromosomes. Intrachromosomal inversions, deletions and duplications were also detected in HD3 cells. Several of the identified simple and complex chromosomal rearrangements, such as between GGA2 and GGA1qter; GGA5, GGA4p and GGA7p; GGA4q, GGA6 and GGA19; and duplication of the sex chromosome GGAW, were confirmed by FISH. CONCLUSIONS In the erythroid progenitor HD3 cell line, in contrast to mature and immature erythrocytes, the genome is organized into distinct topologically associating domains. The HD3 cell line has a severely rearranged karyotype with most of the chromosomes engaged in translocations and can be used in studies of genome structure-function relationships. Hi-C proved to be a reliable tool for simultaneous assessment of the spatial genome organization and chromosomal aberrations in karyotypes of birds with a large number of microchromosomes.
Collapse
Affiliation(s)
- A. Maslova
- grid.15447.330000 0001 2289 6897Saint Petersburg State University, Saint Petersburg, Russia
| | - V. Plotnikov
- grid.15447.330000 0001 2289 6897Saint Petersburg State University, Saint Petersburg, Russia
| | - M. Nuriddinov
- grid.418953.2Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - M. Gridina
- grid.418953.2Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - V. Fishman
- grid.418953.2Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - A. Krasikova
- grid.15447.330000 0001 2289 6897Saint Petersburg State University, Saint Petersburg, Russia
| |
Collapse
|
25
|
Li J, Gao L, Ye Y. HiSV: A control-free method for structural variation detection from Hi-C data. PLoS Comput Biol 2023; 19:e1010760. [PMID: 36608109 DOI: 10.1371/journal.pcbi.1010760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 11/24/2022] [Indexed: 01/07/2023] Open
Abstract
Structural variations (SVs) play an essential role in the evolution of human genomes and are associated with cancer genetics and rare disease. High-throughput chromosome capture (Hi-C) technology probed all genome-wide crosslinked chromatin to study the spatial architecture of chromosomes. Hi-C read pairs can span megabases, making the technology useful for detecting large-scale SVs. So far, the identification of SVs from Hi-C data is still in the early stages with only a few methods available. Especially, no algorithm has been developed that can detect SVs without control samples. Therefore, we developed HiSV (Hi-C for Structural Variation), a control-free method for identifying large-scale SVs from a Hi-C sample. Inspired by the single image saliency detection model, HiSV constructed a saliency map of interaction frequencies and extracted saliency segments as large-scale SVs. By evaluating both simulated and real data, HiSV not only detected all variant types, but also achieved a higher level of accuracy and sensitivity than existing methods. Moreover, our results on cancer cell lines showed that HiSV effectively detected eight complex SV events and identified two novel SVs of key factors associated with cancer development. Finally, we found that integrating the result of HiSV helped the WGS method to identify a total number of 94 novel SVs in two cancer cell lines.
Collapse
Affiliation(s)
- Junping Li
- Department of Computer Science, School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi, China
| | - Lin Gao
- Department of Computer Science, School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi, China
| | - Yusen Ye
- Department of Computer Science, School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi, China
| |
Collapse
|
26
|
Comparison of Long-Read Methods for Sequencing and Assembly of Lepidopteran Pest Genomes. Int J Mol Sci 2022; 24:ijms24010649. [PMID: 36614092 PMCID: PMC9820851 DOI: 10.3390/ijms24010649] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/15/2022] [Accepted: 12/24/2022] [Indexed: 01/03/2023] Open
Abstract
Lepidopteran species are mostly pests, causing serious annual economic losses. High-quality genome sequencing and assembly uncover the genetic foundation of pest occurrence and provide guidance for pest control measures. Long-read sequencing technology and assembly algorithm advances have improved the ability to timeously produce high-quality genomes. Lepidoptera includes a wide variety of insects with high genetic diversity and heterozygosity. Therefore, the selection of an appropriate sequencing and assembly strategy to obtain high-quality genomic information is urgently needed. This research used silkworm as a model to test genome sequencing and assembly through high-coverage datasets by de novo assemblies. We report the first nearly complete telomere-to-telomere reference genome of silkworm Bombyx mori (P50T strain) produced by Pacific Biosciences (PacBio) HiFi sequencing, and highly contiguous and complete genome assemblies of two other silkworm strains by Oxford Nanopore Technologies (ONT) or PacBio continuous long-reads (CLR) that were unrepresented in the public database. Assembly quality was evaluated by use of BUSCO, Inspector, and EagleC. It is necessary to choose an appropriate assembler for draft genome construction, especially for low-depth datasets. For PacBio CLR and ONT sequencing, NextDenovo is superior. For PacBio HiFi sequencing, hifiasm is better. Quality assessment is essential for genome assembly and can provide better and more accurate results. For chromosome-level high-quality genome construction, we recommend using 3D-DNA with EagleC evaluation. Our study references how to obtain and evaluate high-quality genome assemblies, and is a resource for biological control, comparative genomics, and evolutionary studies of Lepidopteran pests and related species.
Collapse
|
27
|
Chromatin structure in cancer. BMC Mol Cell Biol 2022; 23:35. [PMID: 35902807 PMCID: PMC9331575 DOI: 10.1186/s12860-022-00433-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 07/14/2022] [Indexed: 11/10/2022] Open
Abstract
In the past decade, we have seen the emergence of sequence-based methods to understand chromosome organization. With the confluence of in situ approaches to capture information on looping, topological domains, and larger chromatin compartments, understanding chromatin-driven disease is becoming feasible. Excitingly, recent advances in single molecule imaging with capacity to reconstruct “bulk-cell” features of chromosome conformation have revealed cell-to-cell chromatin structural variation. The fundamental question motivating our analysis of the literature is, can altered chromatin structure drive tumorigenesis? As our community learns more about rare disease, including low mutational frequency cancers, understanding “chromatin-driven” pathology will illuminate the regulatory structures of the genome. We describe recent insights into altered genome architecture in human cancer, highlighting multiple pathways toward disruptions of chromatin structure, including structural variation, noncoding mutations, metabolism, and de novo mutations to architectural regulators themselves. Our analysis of the literature reveals that deregulation of genome structure is characteristic in distinct classes of chromatin-driven tumors. As we begin to integrate the findings from single cell imaging studies and chromatin structural sequencing, we will be able to understand the diversity of cells within a common diagnosis, and begin to define structure–function relationships of the misfolded genome.
Collapse
|
28
|
Pokrovac I, Pezer Ž. Recent advances and current challenges in population genomics of structural variation in animals and plants. Front Genet 2022; 13:1060898. [PMID: 36523759 PMCID: PMC9745067 DOI: 10.3389/fgene.2022.1060898] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 11/15/2022] [Indexed: 05/02/2024] Open
Abstract
The field of population genomics has seen a surge of studies on genomic structural variation over the past two decades. These studies witnessed that structural variation is taxonomically ubiquitous and represent a dominant form of genetic variation within species. Recent advances in technology, especially the development of long-read sequencing platforms, have enabled the discovery of structural variants (SVs) in previously inaccessible genomic regions which unlocked additional structural variation for population studies and revealed that more SVs contribute to evolution than previously perceived. An increasing number of studies suggest that SVs of all types and sizes may have a large effect on phenotype and consequently major impact on rapid adaptation, population divergence, and speciation. However, the functional effect of the vast majority of SVs is unknown and the field generally lacks evidence on the phenotypic consequences of most SVs that are suggested to have adaptive potential. Non-human genomes are heavily under-represented in population-scale studies of SVs. We argue that more research on other species is needed to objectively estimate the contribution of SVs to evolution. We discuss technical challenges associated with SV detection and outline the most recent advances towards more representative reference genomes, which opens a new era in population-scale studies of structural variation.
Collapse
Affiliation(s)
| | - Željka Pezer
- Laboratory for Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|