1
|
Liu T, Zhu H, Wang Z. Learning Micro-C from Hi-C with diffusion models. PLoS Comput Biol 2024; 20:e1012136. [PMID: 38758956 PMCID: PMC11139321 DOI: 10.1371/journal.pcbi.1012136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Revised: 05/30/2024] [Accepted: 05/05/2024] [Indexed: 05/19/2024] Open
Abstract
In the last few years, Micro-C has shown itself as an improved alternative to Hi-C. It replaced the restriction enzymes in Hi-C assays with micrococcal nuclease (MNase), resulting in capturing nucleosome resolution chromatin interactions. The signal-to-noise improvement of Micro-C allows it to detect more chromatin loops than high-resolution Hi-C. However, compared with massive Hi-C datasets available in the literature, there are only a limited number of Micro-C datasets. To take full advantage of these Hi-C datasets, we present HiC2MicroC, a computational method learning and then predicting Micro-C from Hi-C based on the denoising diffusion probabilistic models (DDPM). We trained our DDPM and other regression models in human foreskin fibroblast (HFFc6) cell line and evaluated these methods in six different cell types at 5-kb and 1-kb resolution. Our evaluations demonstrate that both HiC2MicroC and regression methods can markedly improve Hi-C towards Micro-C, and our DDPM-based HiC2MicroC outperforms regression in various terms. First, HiC2MicroC successfully recovers most of the Micro-C loops even those not detected in Hi-C maps. Second, a majority of the HiC2MicroC-recovered loops anchor CTCF binding sites in a convergent orientation. Third, HiC2MicroC loops share genomic and epigenetic properties with Micro-C loops, including linking promoters and enhancers, and their anchors are enriched for structural proteins (CTCF and cohesin) and histone modifications. Lastly, we find our recovered loops are also consistent with the loops identified from promoter capture Micro-C (PCMicro-C) and Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET). Overall, HiC2MicroC is an effective tool for further studying Hi-C data with Micro-C as a template. HiC2MicroC is publicly available at https://github.com/zwang-bioinformatics/HiC2MicroC/.
Collapse
Affiliation(s)
- Tong Liu
- Department of Computer Science, University of Miami, Coral Gables, Florida, United States of America
| | - Hao Zhu
- Department of Computer Science, University of Miami, Coral Gables, Florida, United States of America
| | - Zheng Wang
- Department of Computer Science, University of Miami, Coral Gables, Florida, United States of America
| |
Collapse
|
2
|
Wang F, Alinejad‐Rokny H, Lin J, Gao T, Chen X, Zheng Z, Meng L, Li X, Wong K. A Lightweight Framework For Chromatin Loop Detection at the Single-Cell Level. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2303502. [PMID: 37816141 PMCID: PMC10667817 DOI: 10.1002/advs.202303502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/10/2023] [Indexed: 10/12/2023]
Abstract
Single-cell Hi-C (scHi-C) has made it possible to analyze chromatin organization at the single-cell level. However, scHi-C experiments generate inherently sparse data, which poses a challenge for loop calling methods. The existing approach performs significance tests across the imputed dense contact maps, leading to substantial computational overhead and loss of information at the single-cell level. To overcome this limitation, a lightweight framework called scGSLoop is proposed, which sets a new paradigm for scHi-C loop calling by adapting the training and inferencing strategies of graph-based deep learning to leverage the sequence features and 1D positional information of genomic loci. With this framework, sparsity is no longer a challenge, but rather an advantage that the model leverages to achieve unprecedented computational efficiency. Compared to existing methods, scGSLoop makes more accurate predictions and is able to identify more loops that have the potential to play regulatory roles in genome functioning. Moreover, scGSLoop preserves single-cell information by identifying a distinct group of loops for each individual cell, which not only enables an understanding of the variability of chromatin looping states between cells, but also allows scGSLoop to be extended for the investigation of multi-connected hubs and their underlying mechanisms.
Collapse
Affiliation(s)
- Fuzhou Wang
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Hamid Alinejad‐Rokny
- BioMedical Machine Learning Lab, Graduate School of Biomedical EngineeringUniversity of New South WalesSydney2052Australia
| | - Jiecong Lin
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General HospitalDepartment of PathologyHarvard Medical SchoolBostonMA02129USA
- Department of Computer ScienceThe University of Hong KongPok Fu LamHong Kong SAR
| | - Tingxiao Gao
- Department of Medical Biophysics, Faculty of MedicineUniversity of TorontoTorontoOntarioM5G1L7Canada
| | - Xingjian Chen
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Zetian Zheng
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Lingkuan Meng
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Xiangtao Li
- School of Artificial IntelligenceJilin UniversityChangchun130012China
| | - Ka‐Chun Wong
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| |
Collapse
|
3
|
Senapati S, Irshad IU, Sharma AK, Kumar H. Fundamental insights into the correlation between chromosome configuration and transcription. Phys Biol 2023; 20:051002. [PMID: 37467757 DOI: 10.1088/1478-3975/ace8e5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 07/19/2023] [Indexed: 07/21/2023]
Abstract
Eukaryotic chromosomes exhibit a hierarchical organization that spans a spectrum of length scales, ranging from sub-regions known as loops, which typically comprise hundreds of base pairs, to much larger chromosome territories that can encompass a few mega base pairs. Chromosome conformation capture experiments that involve high-throughput sequencing methods combined with microscopy techniques have enabled a new understanding of inter- and intra-chromosomal interactions with unprecedented details. This information also provides mechanistic insights on the relationship between genome architecture and gene expression. In this article, we review the recent findings on three-dimensional interactions among chromosomes at the compartment, topologically associating domain, and loop levels and the impact of these interactions on the transcription process. We also discuss current understanding of various biophysical processes involved in multi-layer structural organization of chromosomes. Then, we discuss the relationships between gene expression and genome structure from perturbative genome-wide association studies. Furthermore, for a better understanding of how chromosome architecture and function are linked, we emphasize the role of epigenetic modifications in the regulation of gene expression. Such an understanding of the relationship between genome architecture and gene expression can provide a new perspective on the range of potential future discoveries and therapeutic research.
Collapse
Affiliation(s)
- Swayamshree Senapati
- School of Basic Sciences, Indian Institute of Technology, Bhubaneswar, Argul, Odisha 752050, India
| | - Inayat Ullah Irshad
- Department of Physics, Indian Institute of Technology, Jammu, Jammu 181221, India
| | - Ajeet K Sharma
- Department of Physics, Indian Institute of Technology, Jammu, Jammu 181221, India
- Department of Biosciences and Bioengineering, Indian Institute of Technology Jammu, Jammu 181221, India
| | - Hemant Kumar
- School of Basic Sciences, Indian Institute of Technology, Bhubaneswar, Argul, Odisha 752050, India
| |
Collapse
|
4
|
Dekker J, Alber F, Aufmkolk S, Beliveau BJ, Bruneau BG, Belmont AS, Bintu L, Boettiger A, Calandrelli R, Disteche CM, Gilbert DM, Gregor T, Hansen AS, Huang B, Huangfu D, Kalhor R, Leslie CS, Li W, Li Y, Ma J, Noble WS, Park PJ, Phillips-Cremins JE, Pollard KS, Rafelski SM, Ren B, Ruan Y, Shav-Tal Y, Shen Y, Shendure J, Shu X, Strambio-De-Castillia C, Vertii A, Zhang H, Zhong S. Spatial and temporal organization of the genome: Current state and future aims of the 4D nucleome project. Mol Cell 2023; 83:2624-2640. [PMID: 37419111 PMCID: PMC10528254 DOI: 10.1016/j.molcel.2023.06.018] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/10/2023] [Accepted: 06/12/2023] [Indexed: 07/09/2023]
Abstract
The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time. We summarize progress by the consortium and highlight the development of technologies for (1) mapping genome folding and identifying roles of nuclear components and bodies, proteins, and RNA, (2) characterizing nuclear organization with time or single-cell resolution, and (3) imaging of nuclear organization. With these tools, the consortium has provided over 2,000 public datasets. Integrative computational models based on these data are starting to reveal connections between genome structure and function. We then present a forward-looking perspective and outline current aims to (1) delineate dynamics of nuclear architecture at different timescales, from minutes to weeks as cells differentiate, in populations and in single cells, (2) characterize cis-determinants and trans-modulators of genome organization, (3) test functional consequences of changes in cis- and trans-regulators, and (4) develop predictive models of genome structure and function.
Collapse
Affiliation(s)
- Job Dekker
- University of Massachusetts Chan Medical School, Boston, MA, USA; Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Frank Alber
- University of California, Los Angeles, Los Angeles, CA, USA
| | | | | | - Benoit G Bruneau
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | - Bo Huang
- University of California, San Francisco, San Francisco, CA, USA
| | - Danwei Huangfu
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Reza Kalhor
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Wenbo Li
- University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Yun Li
- University of North Carolina, Gillings School of Global Public Health, Chapel Hill, NC, USA
| | - Jian Ma
- Carnegie Mellon University, Pittsburgh, PA, USA
| | | | | | | | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA; Chan Zuckerberg Biohub, San Francisco, San Francisco, CA, USA
| | | | - Bing Ren
- University of California, San Diego, La Jolla, CA, USA
| | - Yijun Ruan
- Zhejiang University, Hangzhou, Zhejiang, China
| | | | - Yin Shen
- University of California, San Francisco, San Francisco, CA, USA
| | | | - Xiaokun Shu
- University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | - Sheng Zhong
- University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
5
|
Li K, Zhang P, Wang Z, Shen W, Sun W, Xu J, Wen Z, Li L. iEnhance: a multi-scale spatial projection encoding network for enhancing chromatin interaction data resolution. Brief Bioinform 2023; 24:bbad245. [PMID: 37381618 DOI: 10.1093/bib/bbad245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/06/2023] [Accepted: 06/12/2023] [Indexed: 06/30/2023] Open
Abstract
Although sequencing-based high-throughput chromatin interaction data are widely used to uncover genome-wide three-dimensional chromatin architecture, their sparseness and high signal-noise-ratio greatly restrict the precision of the obtained structural elements. To improve data quality, we here present iEnhance (chromatin interaction data resolution enhancement), a multi-scale spatial projection and encoding network, to predict high-resolution chromatin interaction matrices from low-resolution and noisy input data. Specifically, iEnhance projects the input data into matrix spaces to extract multi-scale global and local feature sets, then hierarchically fused these features by attention mechanism. After that, dense channel encoding and residual channel decoding are used to effectively infer robust chromatin interaction maps. iEnhance outperforms state-of-the-art Hi-C resolution enhancement tools in both visual and quantitative evaluation. Comprehensive analysis shows that unlike other tools, iEnhance can recover both short-range structural elements and long-range interaction patterns precisely. More importantly, iEnhance can be transferred to data enhancement of other tissues or cell lines of unknown resolution. Furthermore, iEnhance performs robustly in enhancement of diverse chromatin interaction data including those from single-cell Hi-C and Micro-C experiments.
Collapse
Affiliation(s)
- Kai Li
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Ping Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Zilin Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wei Shen
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Weicheng Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Jinsheng Xu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Zi Wen
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Li Li
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
6
|
Satam H, Joshi K, Mangrolia U, Waghoo S, Zaidi G, Rawool S, Thakare RP, Banday S, Mishra AK, Das G, Malonia SK. Next-Generation Sequencing Technology: Current Trends and Advancements. BIOLOGY 2023; 12:997. [PMID: 37508427 PMCID: PMC10376292 DOI: 10.3390/biology12070997] [Citation(s) in RCA: 50] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 07/09/2023] [Accepted: 07/11/2023] [Indexed: 07/30/2023]
Abstract
The advent of next-generation sequencing (NGS) has brought about a paradigm shift in genomics research, offering unparalleled capabilities for analyzing DNA and RNA molecules in a high-throughput and cost-effective manner. This transformative technology has swiftly propelled genomics advancements across diverse domains. NGS allows for the rapid sequencing of millions of DNA fragments simultaneously, providing comprehensive insights into genome structure, genetic variations, gene expression profiles, and epigenetic modifications. The versatility of NGS platforms has expanded the scope of genomics research, facilitating studies on rare genetic diseases, cancer genomics, microbiome analysis, infectious diseases, and population genetics. Moreover, NGS has enabled the development of targeted therapies, precision medicine approaches, and improved diagnostic methods. This review provides an insightful overview of the current trends and recent advancements in NGS technology, highlighting its potential impact on diverse areas of genomic research. Moreover, the review delves into the challenges encountered and future directions of NGS technology, including endeavors to enhance the accuracy and sensitivity of sequencing data, the development of novel algorithms for data analysis, and the pursuit of more efficient, scalable, and cost-effective solutions that lie ahead.
Collapse
Affiliation(s)
- Heena Satam
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Kandarp Joshi
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Upasana Mangrolia
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Sanober Waghoo
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Gulnaz Zaidi
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Shravani Rawool
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Ritesh P. Thakare
- Department of Molecular Cell and Cancer Biology, UMass Chan Medical School, Worcester, MA 01605, USA; (R.P.T.); (S.B.); (A.K.M.)
| | - Shahid Banday
- Department of Molecular Cell and Cancer Biology, UMass Chan Medical School, Worcester, MA 01605, USA; (R.P.T.); (S.B.); (A.K.M.)
| | - Alok K. Mishra
- Department of Molecular Cell and Cancer Biology, UMass Chan Medical School, Worcester, MA 01605, USA; (R.P.T.); (S.B.); (A.K.M.)
| | - Gautam Das
- miBiome Therapeutics, Mumbai 400102, India; (H.S.); (K.J.); (U.M.); (S.W.); (G.Z.); (S.R.)
| | - Sunil K. Malonia
- Department of Molecular Cell and Cancer Biology, UMass Chan Medical School, Worcester, MA 01605, USA; (R.P.T.); (S.B.); (A.K.M.)
| |
Collapse
|
7
|
Zhang Z, Feng F, Qiu Y, Liu J. A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome. Nucleic Acids Res 2023; 51:5931-5947. [PMID: 37224527 PMCID: PMC10325920 DOI: 10.1093/nar/gkad436] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 03/31/2023] [Accepted: 05/09/2023] [Indexed: 05/26/2023] Open
Abstract
Many deep learning approaches have been proposed to predict epigenetic profiles, chromatin organization, and transcription activity. While these approaches achieve satisfactory performance in predicting one modality from another, the learned representations are not generalizable across predictive tasks or across cell types. In this paper, we propose a deep learning approach named EPCOT which employs a pre-training and fine-tuning framework, and is able to accurately and comprehensively predict multiple modalities including epigenome, chromatin organization, transcriptome, and enhancer activity for new cell types, by only requiring cell-type specific chromatin accessibility profiles. Many of these predicted modalities, such as Micro-C and ChIA-PET, are quite expensive to get in practice, and the in silico prediction from EPCOT should be quite helpful. Furthermore, this pre-training and fine-tuning framework allows EPCOT to identify generic representations generalizable across different predictive tasks. Interpreting EPCOT models also provides biological insights including mapping between different genomic modalities, identifying TF sequence binding patterns, and analyzing cell-type specific TF impacts on enhancer activity.
Collapse
Affiliation(s)
- Zhenhao Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA
| | - Fan Feng
- Department of Computational Medicine and Bioinformatics, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA
| | - Yiyang Qiu
- Department of Computer Science and Engineering, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA
| | - Jie Liu
- Department of Computational Medicine and Bioinformatics, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA
- Department of Computer Science and Engineering, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA
| |
Collapse
|
8
|
Bobbitt JR, Seachrist DD, Keri RA. Chromatin Organization and Transcriptional Programming of Breast Cancer Cell Identity. Endocrinology 2023; 164:bqad100. [PMID: 37394919 PMCID: PMC10370366 DOI: 10.1210/endocr/bqad100] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 06/26/2023] [Accepted: 06/28/2023] [Indexed: 07/04/2023]
Abstract
The advent of sequencing technologies for assessing chromosome conformations has provided a wealth of information on the organization of the 3-dimensional genome and its role in cancer progression. It is now known that changes in chromatin folding and accessibility can promote aberrant activation or repression of transcriptional programs that can drive tumorigenesis and progression in diverse cancers. This includes breast cancer, which comprises several distinct subtypes defined by their unique transcriptomes that dictate treatment response and patient outcomes. Of these, basal-like breast cancer is an aggressive subtype controlled by a pluripotency-enforcing transcriptome. Meanwhile, the more differentiated luminal subtype of breast cancer is driven by an estrogen receptor-dominated transcriptome that underlies its responsiveness to antihormone therapies and conveys improved patient outcomes. Despite the clear differences in molecular signatures, the genesis of each subtype from normal mammary epithelial cells remains unclear. Recent technical advances have revealed key distinctions in chromatin folding and organization between subtypes that could underlie their transcriptomic and, hence, phenotypic differences. These studies also suggest that proteins controlling particular chromatin states may be useful targets for treating aggressive disease. In this review, we explore the current state of understanding of chromatin architecture in breast cancer subtypes and its potential role in defining their phenotypic characteristics.
Collapse
Affiliation(s)
- Jessica R Bobbitt
- Department of Cancer Biology, Cleveland Clinic Lerner Research Institute, Cleveland, OH 44195, USA
- Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| | - Darcie D Seachrist
- Department of Cancer Biology, Cleveland Clinic Lerner Research Institute, Cleveland, OH 44195, USA
| | - Ruth A Keri
- Department of Cancer Biology, Cleveland Clinic Lerner Research Institute, Cleveland, OH 44195, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| |
Collapse
|
9
|
Wang F, Zhang B, Wen D, Liu R, Yao X, Chen Z, Mu R, Pei H, Liu M, Song B, Lu L. Chromosome-scale genome assembly of Camellia sinensis combined with multi-omics provides insights into its responses to infestation with green leafhoppers. FRONTIERS IN PLANT SCIENCE 2022; 13:1004387. [PMID: 36212364 PMCID: PMC9539759 DOI: 10.3389/fpls.2022.1004387] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 08/25/2022] [Indexed: 06/16/2023]
Abstract
The tea plant (Camellia sinensis) is an important economic crop, which is becoming increasingly popular worldwide, and is now planted in more than 50 countries. Tea green leafhopper is one of the major pests in tea plantations, which can significantly reduce the yield and quality of tea during the growth of plant. In this study, we report a genome assembly for DuyunMaojian tea plants using a combination of Oxford Nanopore Technology PromethION™ with high-throughput chromosome conformation capture technology and used multi-omics to study how the tea plant responds to infestation with tea green leafhoppers. The final genome was 3.08 Gb. A total of 2.97 Gb of the genome was mapped to 15 pseudo-chromosomes, and 2.79 Gb of them could confirm the order and direction. The contig N50, scaffold N50 and GC content were 723.7 kb, 207.72 Mb and 38.54%, respectively. There were 2.67 Gb (86.77%) repetitive sequences, 34,896 protein-coding genes, 104 miRNAs, 261 rRNA, 669 tRNA, and 6,502 pseudogenes. A comparative genomics analysis showed that DuyunMaojian was the most closely related to Shuchazao and Yunkang 10, followed by DASZ and tea-oil tree. The multi-omics results indicated that phenylpropanoid biosynthesis, α-linolenic acid metabolism, flavonoid biosynthesis and 50 differentially expressed genes, particularly peroxidase, played important roles in response to infestation with tea green leafhoppers (Empoasca vitis Göthe). This study on the tea tree is highly significant for its role in illustrating the evolution of its genome and discovering how the tea plant responds to infestation with tea green leafhoppers will contribute to a theoretical foundation to breed tea plants resistant to insects that will ultimately result in an increase in the yield and quality of tea.
Collapse
Affiliation(s)
- Fen Wang
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
- The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in Mountainous Region (Ministry of Education), Guiyang, China
| | - Baohui Zhang
- The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in Mountainous Region (Ministry of Education), Guiyang, China
- Horticulture Institute (Guizhou Horticultural Engineering Technology Research Center), Guizhou Academy of Agricultural Sciences, Guiyang, China
| | - Di Wen
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
| | - Rong Liu
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
| | - Xinzhuan Yao
- The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in Mountainous Region (Ministry of Education), Guiyang, China
- College of Tea Science, Guizhou University, Guiyang, China
| | - Zhi Chen
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
| | - Ren Mu
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
| | - Huimin Pei
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
| | - Min Liu
- Biomarker Technologies Corporation, Beijing, China
| | - Baoxing Song
- The Department of Life Science and Agriculture, Qiannan Normal College for Nationalities, Duyun, China
- Peking University Institute of Advanced Agricultural Sciences, Weifang, China
| | - Litang Lu
- The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in Mountainous Region (Ministry of Education), Guiyang, China
- College of Tea Science, Guizhou University, Guiyang, China
| |
Collapse
|
10
|
Kong S, Lu Y, Tan S, Li R, Gao Y, Li K, Zhang Y. Nucleosome-Omics: A Perspective on the Epigenetic Code and 3D Genome Landscape. Genes (Basel) 2022; 13:genes13071114. [PMID: 35885897 PMCID: PMC9323251 DOI: 10.3390/genes13071114] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 06/14/2022] [Accepted: 06/17/2022] [Indexed: 12/04/2022] Open
Abstract
Genetic information is loaded on chromatin, which involves DNA sequence arrangement and the epigenetic landscape. The epigenetic information including DNA methylation, nucleosome positioning, histone modification, 3D chromatin conformation, and so on, has a crucial impact on gene transcriptional regulation. Out of them, nucleosomes, as basal chromatin structural units, play an important central role in epigenetic code. With the discovery of nucleosomes, various nucleosome-level technologies have been developed and applied, pushing epigenetics to a new climax. As the underlying methodology, next-generation sequencing technology has emerged and allowed scientists to understand the epigenetic landscape at a genome-wide level. Combining with NGS, nucleosome-omics (or nucleosomics) provides a fresh perspective on the epigenetic code and 3D genome landscape. Here, we summarized and discussed research progress in technology development and application of nucleosome-omics. We foresee the future directions of epigenetic development at the nucleosome level.
Collapse
|