Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang Z, Cao R, Taylor K, Briley A, Caldwell C, Cheng J. The properties of genome conformation and spatial gene interaction and regulation networks of normal and malignant human cell types. PLoS One 2013;8:e58793. [PMID: 23536826 DOI: 10.1371/journal.pone.0058793] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2012] [Accepted: 02/06/2013] [Indexed: 01/01/2023] Open

For:	Wang Z, Cao R, Taylor K, Briley A, Caldwell C, Cheng J. The properties of genome conformation and spatial gene interaction and regulation networks of normal and malignant human cell types. PLoS One 2013;8:e58793. [PMID: 23536826 DOI: 10.1371/journal.pone.0058793] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2012] [Accepted: 02/06/2013] [Indexed: 01/01/2023] Open

Number

Cited by Other Article(s)

Lainscsek X, Taher L. ENT3C: an entropy-based similarity measure for Hi-C and micro-C derived contact matrices. NAR Genom Bioinform 2024;6:lqae076. [PMID: 38962256 PMCID: PMC11217677 DOI: 10.1093/nargab/lqae076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 06/05/2024] [Accepted: 06/27/2024] [Indexed: 07/05/2024] Open

Sun Y, Xu X, Lin L, Xu K, Zheng Y, Ren C, Tao H, Wang X, Zhao H, Tu W, Bai X, Wang J, Huang Q, Li Y, Chen H, Li H, Bo X. A graph neural network-based interpretable framework reveals a novel DNA fragility-associated chromatin structural unit. Genome Biol 2023;24:90. [PMID: 37095580 PMCID: PMC10124043 DOI: 10.1186/s13059-023-02916-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 03/22/2023] [Indexed: 04/26/2023] Open

Guha S, Mitra MK. Multivalent binding proteins can drive collapse and reswelling of chromatin in confinement. SOFT MATTER 2022;19:153-163. [PMID: 36484149 DOI: 10.1039/d2sm00612j] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]

Xu Z, Lee DS, Chandran S, Le VT, Bump R, Yasis J, Dallarda S, Marcotte S, Clock B, Haghani N, Cho CY, Akdemir K, Tyndale S, Futreal PA, McVicker G, Wahl GM, Dixon JR. Structural variants drive context-dependent oncogene activation in cancer. Nature 2022;612:564-572. [PMID: 36477537 PMCID: PMC9810360 DOI: 10.1038/s41586-022-05504-4] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 11/01/2022] [Indexed: 12/12/2022]

Affiliation(s)

Zhichao Xu Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA,5These authors contributed equally
Dong-Sung Lee Department of Life Sciences, University of Seoul, Seoul, South Korea,5These authors contributed equally
Sahaana Chandran Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Victoria T. Le Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Rosalind Bump Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Jean Yasis Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Sofia Dallarda Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Samantha Marcotte Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Benjamin Clock Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Nicholas Haghani Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Chae Yun Cho Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Kadir Akdemir Department of Genomic Medicine; UT MD Anderson Cancer Center; Houston, TX, 77030; USA
Selene Tyndale Integrative Biology Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
P. Andrew Futreal Department of Genomic Medicine; UT MD Anderson Cancer Center; Houston, TX, 77030; USA
Graham McVicker Integrative Biology Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Geoffrey M. Wahl Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
Jesse R. Dixon Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA,*Correspondence:

Collapse

Zhao C, Liu T, Wang Z. Functional Similarities of Protein-Coding Genes in Topologically Associating Domains and Spatially-Proximate Genomic Regions. Genes (Basel) 2022;13:genes13030480. [PMID: 35328034 PMCID: PMC8951421 DOI: 10.3390/genes13030480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2022] [Revised: 02/26/2022] [Accepted: 03/05/2022] [Indexed: 02/01/2023] Open

Hunt C, Montgomery S, Berkenpas JW, Sigafoos N, Oakley JC, Espinosa J, Justice N, Kishaba K, Hippe K, Si D, Hou J, Ding H, Cao R. Recent Progress of Machine Learning in Gene Therapy. Curr Gene Ther 2021;22:132-143. [PMID: 34161210 DOI: 10.2174/1566523221666210622164133] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 03/15/2021] [Accepted: 04/02/2021] [Indexed: 11/22/2022]

Li T, Li R, Dong X, Shi L, Lin M, Peng T, Wu P, Liu Y, Li X, He X, Han X, Kang B, Wang Y, Liu Z, Chen Q, Shen Y, Feng M, Wang X, Wu D, Wang J, Li C. Integrative Analysis of Genome, 3D Genome, and Transcriptome Alterations of Clinical Lung Cancer Samples. GENOMICS PROTEOMICS & BIOINFORMATICS 2021;19:741-753. [PMID: 34116262 PMCID: PMC9170781 DOI: 10.1016/j.gpb.2020.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Revised: 03/28/2020] [Accepted: 06/11/2020] [Indexed: 10/31/2022]

Affiliation(s)

Tingting Li Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China; State Key Laboratory of Proteomics, National Center of Biomedical Analysis, Institute of Basic Medical Sciences, Beijing 100850, China
Ruifeng Li Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Xuan Dong BGI-Shenzhen, Shenzhen 518083, China; China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
Lin Shi Zhongshan Hospital Institute of Clinical Science, Fudan University, Shanghai Institute of Clinical Bioinformatics, Shanghai 200433, China; Fudan University Center for Clinical Bioinformatics, Shanghai 200433, China
Miao Lin Department of Thoracic Surgery, Zhongshan Hospital of Fudan University, Shanghai 200032, China
Ting Peng Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Pengze Wu Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Yuting Liu Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Xiaoting Li Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China; School of Life Sciences, Tsinghua University, Beijing 100084, China
Xuheng He BGI-Shenzhen, Shenzhen 518083, China; China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
Xu Han BGI-Shenzhen, Shenzhen 518083, China; China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
Bin Kang BGI-Shenzhen, Shenzhen 518083, China; China National GeneBank, BGI-Shenzhen, Shenzhen 518083, China
Yinan Wang Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Zhiheng Liu Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Qing Chen Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China
Yue Shen BGI-Shenzhen, Shenzhen 518083, China; BGI-Qingdao, Qingdao 266426, China; Shenzhen Engineering Laboratory for Innovative Molecular Diagnostics, BGI-Shenzhen, Shenzhen 518083, China
Mingxiang Feng Department of Thoracic Surgery, Zhongshan Hospital of Fudan University, Shanghai 200032, China
Xiangdong Wang Zhongshan Hospital Institute of Clinical Science, Fudan University, Shanghai Institute of Clinical Bioinformatics, Shanghai 200433, China; Fudan University Center for Clinical Bioinformatics, Shanghai 200433, China
Duojiao Wu Zhongshan Hospital Institute of Clinical Science, Fudan University, Shanghai Institute of Clinical Bioinformatics, Shanghai 200433, China.
Jian Wang iCarbonX, Shenzhen 518053, China; Digital Life Research Institute, Shenzhen 518110, China.
Cheng Li Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking University, Beijing 100871, China.

Collapse

Liu L, Zhang LR, Dao FY, Yang YC, Lin H. A computational framework for identifying the transcription factors involved in enhancer-promoter loop formation. MOLECULAR THERAPY. NUCLEIC ACIDS 2020;23:347-354. [PMID: 33425492 PMCID: PMC7779541 DOI: 10.1016/j.omtn.2020.11.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 11/11/2020] [Indexed: 12/30/2022]

Liu T, Wang Z. normGAM: an R package to remove systematic biases in genome architecture mapping data. BMC Genomics 2019;20:1006. [PMID: 31888469 PMCID: PMC6936146 DOI: 10.1186/s12864-019-6331-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The genome architecture mapping (GAM) technique can capture genome-wide chromatin interactions. However, besides the known systematic biases in the raw GAM data, we have found a new type of systematic bias. It is necessary to develop and evaluate effective normalization methods to remove all systematic biases in the raw GAM data.

RESULTS

We have detected a new type of systematic bias, the fragment length bias, in the genome architecture mapping (GAM) data, which is significantly different from the bias of window detection frequency previously mentioned in the paper introducing the GAM method but is similar to the bias of distances between restriction sites existing in raw Hi-C data. We have found that the normalization method (a normalized variant of the linkage disequilibrium) used in the GAM paper is not able to effectively eliminate the new fragment length bias at 1 Mb resolution (slightly better at 30 kb resolution). We have developed an R package named normGAM for eliminating the new fragment length bias together with the other three biases existing in raw GAM data, which are the biases related to window detection frequency, mappability, and GC content. Five normalization methods have been implemented and included in the R package including Knight-Ruiz 2-norm (KR2, newly designed by us), normalized linkage disequilibrium (NLD), vanilla coverage (VC), sequential component normalization (SCN), and iterative correction and eigenvector decomposition (ICE).

CONCLUSIONS

Based on our evaluations, the five normalization methods can eliminate the four biases existing in raw GAM data, with VC and KR2 performing better than the others. We have observed that the KR2-normalized GAM data have a higher correlation with the KR-normalized Hi-C data on the same cell samples indicating that the KR-related methods are better than the others for keeping the consistency between the GAM and Hi-C experiments. Compared with the raw GAM data, the normalized GAM data are more consistent with the normalized distances from the fluorescence in situ hybridization (FISH) experiments. The source code of normGAM can be freely downloaded from http://dna.cs.miami.edu/normGAM/.

Collapse

Liu T, Wang Z. Exploring the 2D and 3D structural properties of topologically associating domains. BMC Bioinformatics 2019;20:592. [PMID: 31787081 PMCID: PMC6886161 DOI: 10.1186/s12859-019-3083-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

Abstract

BACKGROUND

Topologically associating domains (TADs) are genomic regions with varying lengths. The interactions within TADs are more frequent than those between different TADs. TADs or sub-TADs are considered the structural and functional units of the mammalian genomes. Although TADs are important for understanding how genomes function, we have limited knowledge about their 3D structural properties.

RESULTS

In this study, we designed and benchmarked three metrics for capturing the three-dimensional and two-dimensional structural signatures of TADs, which can help better understand TADs' structural properties and the relationships between structural properties and genetic and epigenetic features. The first metric for capturing 3D structural properties is radius of gyration, which in this study is used to measure the spatial compactness of TADs. The mass value of each DNA bead in a 3D structure is novelly defined as one or more genetic or epigenetic feature(s). The second metric is folding degree. The last metric is exponent parameter, which is used to capture the 2D structural properties based on TADs' Hi-C contact matrices. In general, we observed significant correlations between the three metrics and the genetic and epigenetic features. We made the same observations when using H3K4me3, transcription start sites, and RNA polymerase II to represent the mass value in the modified radius-of-gyration metric. Moreover, we have found that the TADs in the clusters of depleted chromatin states apparently correspond to smaller exponent parameters and larger radius of gyrations. In addition, a new objective function of multidimensional scaling for modelling chromatin or TADs 3D structures was designed and benchmarked, which can handle the DNA bead-pairs with zero Hi-C contact values.

CONCLUSIONS

The web server for reconstructing chromatin 3D structures using multiple different objective functions and the related source code are publicly available at http://dna.cs.miami.edu/3DChrom/.

Collapse

Perrakis A, Bita CE, Arhondakis S, Krokida A, Mekkaoui K, Denic D, Blazakis KN, Kaloudas D, Kalaitzis P. Suppression of a Prolyl 4 Hydroxylase Results in Delayed Abscission of Overripe Tomato Fruits. FRONTIERS IN PLANT SCIENCE 2019;10:348. [PMID: 30984217 PMCID: PMC6447859 DOI: 10.3389/fpls.2019.00348] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 03/07/2019] [Indexed: 05/03/2023]

Liu T, Wang Z. Reconstructing high-resolution chromosome three-dimensional structures by Hi-C complex networks. BMC Bioinformatics 2018;19:496. [PMID: 30591009 PMCID: PMC6309071 DOI: 10.1186/s12859-018-2464-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Abstract

BACKGROUND

Hi-C data have been widely used to reconstruct chromosomal three-dimensional (3D) structures. One of the key limitations of Hi-C is the unclear relationship between spatial distance and the number of Hi-C contacts. Many methods used a fixed parameter when converting the number of Hi-C contacts to wish distances. However, a single parameter cannot properly explain the relationship between wish distances and genomic distances or the locations of topologically associating domains (TADs).

RESULTS

We have addressed one of the key issues of using Hi-C data, that is, the unclear relationship between spatial distances and the number of Hi-C contacts, which is crucial to understand significant biological functions, such as the enhancer-promoter interactions. Specifically, we developed a new method to infer this converting parameter and pairwise Euclidean distances based on the topology of the Hi-C complex network (HiCNet). The inferred distances were modeled by clustering coefficient and multiple other types of constraints. We found that our inferred distances between bead-pairs within the same TAD were apparently smaller than those distances between bead-pairs from different TADs. Our inferred distances had a higher correlation with fluorescence in situ hybridization (FISH) data, fitted the localization patterns of Xist transcripts on DNA, and better matched 156 pairs of protein-enabled long-range chromatin interactions detected by ChIA-PET. Using the inferred distances and another round of optimization, we further reconstructed 40 kb high-resolution 3D chromosomal structures of mouse male ES cells. The high-resolution structures successfully illustrate TADs and DNA loops (peaks in Hi-C contact heatmaps) that usually indicate enhancer-promoter interactions.

CONCLUSIONS

We developed a novel method to infer the wish distances between DNA bead-pairs from Hi-C contacts. High-resolution 3D structures of chromosomes were built based on the newly-inferred wish distances. This whole process has been implemented as a tool named HiCNet, which is publicly available at http://dna.cs.miami.edu/HiCNet/ .

Collapse

Dixon JR, Xu J, Dileep V, Zhan Y, Song F, Le VT, Yardımcı GG, Chakraborty A, Bann DV, Wang Y, Clark R, Zhang L, Yang H, Liu T, Iyyanki S, An L, Pool C, Sasaki T, Rivera-Mulia JC, Ozadam H, Lajoie BR, Kaul R, Buckley M, Lee K, Diegel M, Pezic D, Ernst C, Hadjur S, Odom DT, Stamatoyannopoulos JA, Broach JR, Hardison RC, Ay F, Noble WS, Dekker J, Gilbert DM, Yue F. Integrative detection and analysis of structural variation in cancer genomes. Nat Genet 2018;50:1388-1398. [PMID: 30202056 PMCID: PMC6301019 DOI: 10.1038/s41588-018-0195-8] [Citation(s) in RCA: 217] [Impact Index Per Article: 36.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 07/16/2018] [Indexed: 01/19/2023]

Affiliation(s)

Jesse R Dixon Salk Institute for Biological Studies, La Jolla, CA, USA.
Jie Xu Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Vishnu Dileep Department of Biological Science, Florida State University, Tallahassee, FL, USA
Ye Zhan Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA, USA
Fan Song Bioinformatics and Genomics Program, The Pennsylvania State University, University Park, State College, PA, USA
Victoria T Le Salk Institute for Biological Studies, La Jolla, CA, USA
Galip Gürkan Yardımcı Department of Genome Sciences, University of Washington, Seattle, WA, USA
Abhijit Chakraborty La Jolla Institute for Allergy and Immunology, La Jolla, CA, USA
Darrin V Bann Division of Otolaryngology, Head & Neck Surgery, Milton S. Hershey Medical Center, Hershey, PA, USA
Yanli Wang Bioinformatics and Genomics Program, The Pennsylvania State University, University Park, State College, PA, USA
Royden Clark Penn State College of Medicine, Informatics and Technology, Hershey, PA, USA
Lijun Zhang Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Hongbo Yang Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Tingting Liu Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Sriranga Iyyanki Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Lin An Bioinformatics and Genomics Program, The Pennsylvania State University, University Park, State College, PA, USA
Christopher Pool Division of Otolaryngology, Head & Neck Surgery, Milton S. Hershey Medical Center, Hershey, PA, USA
Takayo Sasaki Department of Biological Science, Florida State University, Tallahassee, FL, USA
Juan Carlos Rivera-Mulia Department of Biological Science, Florida State University, Tallahassee, FL, USA
Hakan Ozadam Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA, USA
Bryan R Lajoie Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA, USA
Rajinder Kaul Altius institute for Biomedical Sciences, Seattle, WA, USA
Michael Buckley Altius institute for Biomedical Sciences, Seattle, WA, USA
Kristen Lee Altius institute for Biomedical Sciences, Seattle, WA, USA
Morgan Diegel Altius institute for Biomedical Sciences, Seattle, WA, USA
Dubravka Pezic Research Department of Cancer Biology, Cancer Institute, University College London, London, UK
Christina Ernst Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
Suzana Hadjur Research Department of Cancer Biology, Cancer Institute, University College London, London, UK
Duncan T Odom Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK German Cancer Research Center (DKFZ), Division Signaling and Functional Genomics, Heidelberg, Germany
John A Stamatoyannopoulos Altius institute for Biomedical Sciences, Seattle, WA, USA
James R Broach Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA
Ross C Hardison Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, State College, PA, USA
Ferhat Ay La Jolla Institute for Allergy and Immunology, La Jolla, CA, USA. School of Medicine, University of California San Diego, La Jolla, CA, USA.
William Stafford Noble Department of Genome Sciences, University of Washington, Seattle, WA, USA.
Job Dekker Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA, USA. Howard Hughes Medical Institute, Chevy Chase, MD, USA.
David M Gilbert Department of Biological Science, Florida State University, Tallahassee, FL, USA.
Feng Yue Department of Biochemistry and Molecular Biology, College of Medicine, The Pennsylvania State University, Hershey, PA, USA. Bioinformatics and Genomics Program, The Pennsylvania State University, University Park, State College, PA, USA.

Collapse

Diament A, Tuller T. Modeling three-dimensional genomic organization in evolution and pathogenesis. Semin Cell Dev Biol 2018;90:78-93. [PMID: 30030143 DOI: 10.1016/j.semcdb.2018.07.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/08/2018] [Indexed: 12/17/2022]

Oluwadare O, Zhang Y, Cheng J. A maximum likelihood algorithm for reconstructing 3D structures of human chromosomes from chromosomal contact data. BMC Genomics 2018;19:161. [PMID: 29471801 PMCID: PMC5824572 DOI: 10.1186/s12864-018-4546-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 02/13/2018] [Indexed: 01/07/2023] Open

Abstract

Background

The development of chromosomal conformation capture techniques, particularly, the Hi-C technique, has made the analysis and study of the spatial conformation of a genome an important topic in bioinformatics and computational biology. Aided by high-throughput next generation sequencing techniques, the Hi-C technique can generate genome-wide, large-scale intra- and inter-chromosomal interaction data capable of describing in details the spatial interactions within a genome. These data can be used to reconstruct 3D structures of chromosomes that can be used to study DNA replication, gene regulation, genome interaction, genome folding, and genome function.

Results

Here, we introduce a maximum likelihood algorithm called 3DMax to construct the 3D structure of a chromosome from Hi-C data. 3DMax employs a maximum likelihood approach to infer the 3D structures of a chromosome, while automatically re-estimating the conversion factor (α) for converting Interaction Frequency (IF) to distance. Our results show that the models generated by 3DMax from a simulated Hi-C dataset match the true models better than most of the existing methods. 3DMax is more robust to structural variability and noise. Compared on a real Hi-C dataset, 3DMax constructs chromosomal models that fit the data better than most methods, and it is faster than all other methods. The models reconstructed by 3DMax were consistent with fluorescent in situ hybridization (FISH) experiments and existing knowledge about the organization of human chromosomes, such as chromosome compartmentalization.

Conclusions

3DMax is an effective approach to reconstructing 3D chromosomal models. The results, and the models generated for the simulated and real Hi-C datasets are available here: http://sysbio.rnet.missouri.edu/bdm_download/3DMax/. The source code is available here: https://github.com/BDM-Lab/3DMax. A short video demonstrating how to use 3DMax can be found here: https://youtu.be/ehQUFWoHwfo.

Collapse

Jia R, Chai P, Zhang H, Fan X. Novel insights into chromosomal conformations in cancer. Mol Cancer 2017;16:173. [PMID: 29149895 PMCID: PMC5693495 DOI: 10.1186/s12943-017-0741-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 11/06/2017] [Indexed: 12/20/2022] Open

Oluwadare O, Cheng J. ClusterTAD: an unsupervised machine learning approach to detecting topologically associated domains of chromosomes from Hi-C data. BMC Bioinformatics 2017;18:480. [PMID: 29137603 PMCID: PMC5686814 DOI: 10.1186/s12859-017-1931-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Accepted: 11/06/2017] [Indexed: 11/10/2022] Open

Flyamer IM, Gassler J, Imakaev M, Brandão HB, Ulianov SV, Abdennur N, Razin SV, Mirny LA, Tachibana-Konwalski K. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature 2017;544:110-114. [PMID: 28355183 PMCID: PMC5639698 DOI: 10.1038/nature21711] [Citation(s) in RCA: 486] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2016] [Accepted: 02/14/2017] [Indexed: 12/15/2022]

Cagnone G, Sirard MA. The embryonic stress response to in vitro culture: insight from genomic analysis. Reproduction 2016;152:R247-R261. [DOI: 10.1530/rep-16-0391] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 09/05/2016] [Indexed: 12/18/2022]

Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks. Sci Rep 2016;6:19598. [PMID: 26797014 PMCID: PMC4726425 DOI: 10.1038/srep19598] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Accepted: 12/14/2015] [Indexed: 11/09/2022] Open

Cao R, Cheng J. Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks. Methods 2016;93:84-91. [PMID: 26370280 PMCID: PMC4894840 DOI: 10.1016/j.ymeth.2015.09.011] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 09/03/2015] [Accepted: 09/10/2015] [Indexed: 11/30/2022] Open

Cao R, Cheng J. Deciphering the association between gene function and spatial gene-gene interactions in 3D human genome conformation. BMC Genomics 2015;16:880. [PMID: 26511362 PMCID: PMC4625479 DOI: 10.1186/s12864-015-2093-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Accepted: 10/15/2015] [Indexed: 01/17/2023] Open

Abstract

BACKGROUND

A number of factors have been investigated in the context of gene function prediction and analysis, such as sequence identity, gene expressions, and gene co-evolution. However, three-dimensional (3D) conformation of the genome has not been tapped to analyse gene function, probably largely due to lack of genome conformation data until recently.

METHODS

We construct the genome-wide spatial gene-gene interaction networks for three different human B-cells or cell lines from their chromosomal contact data generated by the Hi-C chromosome conformation capturing technique. The G-SESAME and Fast-SemSim are used to calculate function similarity between interacted / non-interacted genes. The Gene Ontology statistics computed from the gene-gene interaction networks is used for gene function prediction.

RESULTS

We compare the function similarity of gene pairs that do not spatially interact and that have interactions. We find that genes that have strong spatial interactions tend to have highly similar function in terms of biological process, molecular function and cellular component of the Gene Ontology. And even though the level of gene-gene interactions generally have no or weak correlation with either sequential genomic distance or sequence identity between genes, the interacted genes with high function similarity tend to have stronger interactions, somewhat shorter genomic distance and significantly higher sequence identity. And combining genomic distance or sequence identity with spatial gene-gene interaction information informs gene-gene function similarity much better than using either one of them alone, suggesting gene-gene interaction information is largely complementary with genomic distance and sequence identity in the context of gene function analysis. We develop and evaluate a new gene function prediction method based on gene-gene interacting networks, which can predict gene function well for a large number of human genes.

CONCLUSIONS

In this work, we demonstrate that the spatial conformation of the human genome is relevant to gene function similarity and is useful for gene function prediction.

Collapse

Nowotny J, Ahmed S, Xu L, Oluwadare O, Chen H, Hensley N, Trieu T, Cao R, Cheng J. Iterative reconstruction of three-dimensional models of human chromosomes from chromosomal contact data. BMC Bioinformatics 2015;16:338. [PMID: 26493399 PMCID: PMC4619219 DOI: 10.1186/s12859-015-0772-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2015] [Accepted: 10/13/2015] [Indexed: 11/10/2022] Open

Abstract

Background

The entire collection of genetic information resides within the chromosomes, which themselves reside within almost every cell nucleus of eukaryotic organisms. Each individual chromosome is found to have its own preferred three-dimensional (3D) structure independent of the other chromosomes. The structure of each chromosome plays vital roles in controlling certain genome operations, including gene interaction and gene regulation. As a result, knowing the structure of chromosomes assists in the understanding of how the genome functions. Fortunately, the 3D structure of chromosomes proves possible to construct through computational methods via contact data recorded from the chromosome. We developed a unique computational approach based on optimization procedures known as adaptation, simulated annealing, and genetic algorithm to construct 3D models of human chromosomes, using chromosomal contact data.

Results

Our models were evaluated using a percentage-based scoring function. Analysis of the scores of the final 3D models demonstrated their effective construction from our computational approach. Specifically, the models resulting from our approach yielded an average score of 80.41 %, with a high of 91 %, across models for all chromosomes of a normal human B-cell. Comparisons made with other methods affirmed the effectiveness of our strategy. Particularly, juxtaposition with models generated through the publicly available method Markov chain Monte Carlo 5C (MCMC5C) illustrated the outperformance of our approach, as seen through a higher average score for all chromosomes. Our methodology was further validated using two consistency checking techniques known as convergence testing and robustness checking, which both proved successful.

Conclusions

The pursuit of constructing accurate 3D chromosomal structures is fueled by the benefits revealed by the findings as well as any possible future areas of study that arise. This motivation has led to the development of our computational methodology. The implementation of our approach proved effective in constructing 3D chromosome models and proved consistent with, and more effective than, some other methods thereby achieving our goal of creating a tool to help advance certain research efforts. The source code, test data, test results, and documentation of our method, Gen3D, are available at our sourceforge site at: http://sourceforge.net/projects/gen3d/.

Collapse

Bergeron KF, Cardinal T, Touré AM, Béland M, Raiwet DL, Silversides DW, Pilon N. Male-biased aganglionic megacolon in the TashT mouse line due to perturbation of silencer elements in a large gene desert of chromosome 10. PLoS Genet 2015;11:e1005093. [PMID: 25786024 PMCID: PMC4364714 DOI: 10.1371/journal.pgen.1005093] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Accepted: 02/23/2015] [Indexed: 01/13/2023] Open

Abstract

Neural crest cells (NCC) are a transient migratory cell population that generates diverse cell types such as neurons and glia of the enteric nervous system (ENS). Via an insertional mutation screen for loci affecting NCC development in mice, we identified one line—named TashT—that displays a partially penetrant aganglionic megacolon phenotype in a strong male-biased manner. Interestingly, this phenotype is highly reminiscent of human Hirschsprung’s disease, a neurocristopathy with a still unexplained male sex bias. In contrast to the megacolon phenotype, colonic aganglionosis is almost fully penetrant in homozygous TashT animals. The sex bias in megacolon expressivity can be explained by the fact that the male ENS ends, on average, around a “tipping point” of minimal colonic ganglionosis while the female ENS ends, on average, just beyond it. Detailed analysis of embryonic intestines revealed that aganglionosis in homozygous TashT animals is due to slower migration of enteric NCC. The TashT insertional mutation is localized in a gene desert containing multiple highly conserved elements that exhibit repressive activity in reporter assays. RNAseq analyses and 3C assays revealed that the TashT insertion results, at least in part, in NCC-specific relief of repression of the uncharacterized gene Fam162b; an outcome independently confirmed via transient transgenesis. The transcriptional signature of enteric NCC from homozygous TashT embryos is also characterized by the deregulation of genes encoding members of the most important signaling pathways for ENS formation—Gdnf/Ret and Edn3/Ednrb—and, intriguingly, the downregulation of specific subsets of X-linked genes. In conclusion, this study not only allowed the identification of Fam162b coding and regulatory sequences as novel candidate loci for Hirschsprung’s disease but also provides important new insights into its male sex bias.

Hirschsprung’s disease (also known as aganglionic megacolon) is a severe congenital defect of the enteric nervous system (ENS) resulting in complete failure to pass stools. It is characterized by the absence of neural ganglia (aganglionosis) in the distal gut due to incomplete colonization of the embryonic intestines by neural crest cells (NCC), the ENS precursors. Hirschsprung’s disease has an incidence of 1 in 5000 newborns and a 4:1 male sex bias. Although many genes have been associated with this complex genetic disease, most of its heritability as well as its male sex bias remain unexplained. Here, we describe an insertional mutant mouse line (“TashT”) in which virtually all homozygotes display colonic aganglionosis due to defective migration of enteric NCC, but in which only a subset of homozygotes develops megacolon. Surprisingly, this group is almost exclusively male. The TashT ENS defect stems, at least in part, from the disruption of long-range interactions between evolutionarily conserved elements with silencer activity and Fam162b, resulting in NCC-specific upregulation of this uncharacterized protein coding gene. Global analysis of gene expression further revealed that several hundreds of genes are significantly deregulated in TashT enteric NCC. Interestingly, this dataset includes multiple X-linked candidate genes potentially underlying the male sex bias. Taken together, our data pave the way for a clearer understanding of the intriguing male sex bias of Hirschsprung’s disease.

Collapse

Merelli I, Tordini F, Drocco M, Aldinucci M, Liò P, Milanesi L. Integrating multi-omic features exploiting Chromosome Conformation Capture data. Front Genet 2015;6:40. [PMID: 25717338 PMCID: PMC4324155 DOI: 10.3389/fgene.2015.00040] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 01/27/2015] [Indexed: 02/02/2023] Open

Abstract

The representation, integration, and interpretation of omic data is a complex task, in particular considering the huge amount of information that is daily produced in molecular biology laboratories all around the world. The reason is that sequencing data regarding expression profiles, methylation patterns, and chromatin domains is difficult to harmonize in a systems biology view, since genome browsers only allow coordinate-based representations, discarding functional clusters created by the spatial conformation of the DNA in the nucleus. In this context, recent progresses in high throughput molecular biology techniques and bioinformatics have provided insights into chromatin interactions on a larger scale and offer a formidable support for the interpretation of multi-omic data. In particular, a novel sequencing technique called Chromosome Conformation Capture allows the analysis of the chromosome organization in the cell’s natural state. While performed genome wide, this technique is usually called Hi–C. Inspired by service applications such as Google Maps, we developed NuChart, an R package that integrates Hi–C data to describe the chromosomal neighborhood starting from the information about gene positions, with the possibility of mapping on the achieved graphs genomic features such as methylation patterns and histone modifications, along with expression profiles. In this paper we show the importance of the NuChart application for the integration of multi-omic data in a systems biology fashion, with particular interest in cytogenetic applications of these techniques. Moreover, we demonstrate how the integration of multi-omic data can provide useful information in understanding why genes are in certain specific positions inside the nucleus and how epigenetic patterns correlate with their expression.

Collapse

Trieu T, Cheng J. Large-scale reconstruction of 3D structures of human chromosomes from chromosomal contact data. Nucleic Acids Res 2014;42:e52. [PMID: 24465004 PMCID: PMC3985632 DOI: 10.1093/nar/gkt1411] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Hoang SA, Bekiranov S. The network architecture of the Saccharomyces cerevisiae genome. PLoS One 2013;8:e81972. [PMID: 24349163 PMCID: PMC3857230 DOI: 10.1371/journal.pone.0081972] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2013] [Accepted: 10/18/2013] [Indexed: 11/19/2022] Open

Hu M, Deng K, Qin Z, Liu JS. Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data. QUANTITATIVE BIOLOGY 2013;1:156-174. [PMID: 26124977 DOI: 10.1007/s40484-013-0016-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]