1
|
Yu H, Bhat JA, Li C, Zhao B, Bu M, Zhang Z, Guo T, Feng X. Identification of superior and rare haplotypes to optimize branch number in soybean. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:93. [PMID: 38570354 PMCID: PMC10991007 DOI: 10.1007/s00122-024-04596-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 03/07/2024] [Indexed: 04/05/2024]
Abstract
KEY MESSAGE Using the integrated approach in the present study, we identified eleven significant SNPs, seven stable QTLs and 20 candidate genes associated with branch number in soybean. Branch number is a key yield-related quantitative trait that directly affects the number of pods and seeds per soybean plant. In this study, an integrated approach with a genome-wide association study (GWAS) and haplotype and candidate gene analyses was used to determine the detailed genetic basis of branch number across a diverse set of soybean accessions. The GWAS revealed a total of eleven SNPs significantly associated with branch number across three environments using the five GWAS models. Based on the consistency of the SNP detection in multiple GWAS models and environments, seven genomic regions within the physical distance of ± 202.4 kb were delineated as stable QTLs. Of these QTLs, six QTLs were novel, viz., qBN7, qBN13, qBN16, qBN18, qBN19 and qBN20, whereas the remaining one, viz., qBN12, has been previously reported. Moreover, 11 haplotype blocks, viz., Hap4, Hap7, Hap12, Hap13A, Hap13B, Hap16, Hap17, Hap18, Hap19A, Hap19B and Hap20, were identified on nine different chromosomes. Haplotype allele number across the identified haplotype blocks varies from two to five, and different branch number phenotype is regulated by these alleles ranging from the lowest to highest through intermediate branching. Furthermore, 20 genes were identified underlying the genomic region of ± 202.4 kb of the identified SNPs as putative candidates; and six of them showed significant differential expression patterns among the soybean cultivars possessing contrasting branch number, which might be the potential candidates regulating branch number in soybean. The findings of this study can assist the soybean breeding programs for developing cultivars with desirable branch numbers.
Collapse
Affiliation(s)
- Hui Yu
- Key Laboratory of Soybean Molecular Design Breeding, State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
- Zhejiang Lab, Hangzhou, 310012, China
| | | | - Candong Li
- Jiamusi Branch Academy of Heilongjiang Academy of Agricultural Sciences, Jiamusi, 154007, China
| | - Beifang Zhao
- Key Laboratory of Soybean Molecular Design Breeding, State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
| | - Moran Bu
- Key Laboratory of Soybean Molecular Design Breeding, State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 101408, China
| | - Zhirui Zhang
- Key Laboratory of Soybean Molecular Design Breeding, State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
| | - Tai Guo
- Jiamusi Branch Academy of Heilongjiang Academy of Agricultural Sciences, Jiamusi, 154007, China
| | - Xianzhong Feng
- Key Laboratory of Soybean Molecular Design Breeding, State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China.
- Zhejiang Lab, Hangzhou, 310012, China.
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 101408, China.
| |
Collapse
|
2
|
Liu J, Zhong X. Population epigenetics: DNA methylation in the plant omics era. PLANT PHYSIOLOGY 2024; 194:2039-2048. [PMID: 38366882 PMCID: PMC10980424 DOI: 10.1093/plphys/kiae089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 01/22/2024] [Accepted: 01/22/2024] [Indexed: 02/18/2024]
Abstract
DNA methylation plays an important role in many biological processes. The mechanisms underlying the establishment and maintenance of DNA methylation are well understood thanks to decades of research using DNA methylation mutants, primarily in Arabidopsis (Arabidopsis thaliana) accession Col-0. Recent genome-wide association studies (GWASs) using the methylomes of natural accessions have uncovered a complex and distinct genetic basis of variation in DNA methylation at the population level. Sequencing following bisulfite treatment has served as an excellent method for quantifying DNA methylation. Unlike studies focusing on specific accessions with reference genomes, population-scale methylome research often requires an additional round of sequencing beyond obtaining genome assemblies or genetic variations from whole-genome sequencing data, which can be cost prohibitive. Here, we provide an overview of recently developed bisulfite-free methods for quantifying methylation and cost-effective approaches for the simultaneous detection of genetic and epigenetic information. We also discuss the plasticity of DNA methylation in a specific Arabidopsis accession, the contribution of DNA methylation to plant adaptation, and the genetic determinants of variation in DNA methylation in natural populations. The recently developed technology and knowledge will greatly benefit future studies in population epigenomes.
Collapse
Affiliation(s)
- Jie Liu
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Xuehua Zhong
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| |
Collapse
|
3
|
Jiang J, Xu YC, Zhang ZQ, Chen JF, Niu XM, Hou XH, Li XT, Wang L, Zhang YE, Ge S, Guo YL. Forces driving transposable element load variation during Arabidopsis range expansion. THE PLANT CELL 2024; 36:840-862. [PMID: 38036296 PMCID: PMC10980350 DOI: 10.1093/plcell/koad296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 10/25/2023] [Accepted: 11/06/2023] [Indexed: 12/02/2023]
Abstract
Genetic load refers to the accumulated and potentially life-threatening deleterious mutations in populations. Understanding the mechanisms underlying genetic load variation of transposable element (TE) insertion, a major large-effect mutation, during range expansion is an intriguing question in biology. Here, we used 1,115 global natural accessions of Arabidopsis (Arabidopsis thaliana) to study the driving forces of TE load variation during its range expansion. TE load increased with range expansion, especially in the recently established Yangtze River basin population. Effective population size, which explains 62.0% of the variance in TE load, high transposition rate, and selective sweeps contributed to TE accumulation in the expanded populations. We genetically mapped and identified multiple candidate causal genes and TEs, and revealed the genetic architecture of TE load variation. Overall, this study reveals the variation in TE genetic load during Arabidopsis expansion and highlights the causes of TE load variation from the perspectives of both population genetics and quantitative genetics.
Collapse
Affiliation(s)
- Juan Jiang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yong-Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Zhi-Qin Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jia-Fu Chen
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiao-Min Niu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Xing-Hui Hou
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Xin-Tong Li
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Li Wang
- Agricultural Synthetic Biology Center, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
| | - Yong E Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents & Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Song Ge
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ya-Long Guo
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
4
|
Contreras-Garrido A, Galanti D, Movilli A, Becker C, Bossdorf O, Drost HG, Weigel D. Transposon dynamics in the emerging oilseed crop Thlaspi arvense. PLoS Genet 2024; 20:e1011141. [PMID: 38295109 PMCID: PMC10881000 DOI: 10.1371/journal.pgen.1011141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 02/21/2024] [Accepted: 01/17/2024] [Indexed: 02/02/2024] Open
Abstract
Genome evolution is partly driven by the mobility of transposable elements (TEs) which often leads to deleterious effects, but their activity can also facilitate genetic novelty and catalyze local adaptation. We explored how the intraspecific diversity of TE polymorphisms might contribute to the broad geographic success and adaptive capacity of the emerging oil crop Thlaspi arvense (field pennycress). We classified the TE inventory based on a high-quality genome assembly, estimated the age of retrotransposon TE families and comprehensively assessed their mobilization potential. A survey of 280 accessions from 12 regions across the Northern hemisphere allowed us to quantify over 90,000 TE insertion polymorphisms (TIPs). Their distribution mirrored the genetic differentiation as measured by single nucleotide polymorphisms (SNPs). The number and types of mobile TE families vary substantially across populations, but there are also shared patterns common to all accessions. Ty3/Athila elements are the main drivers of TE diversity in T. arvense populations, while a single Ty1/Alesia lineage might be particularly important for transcriptome divergence. The number of retrotransposon TIPs is associated with variation at genes related to epigenetic regulation, including an apparent knockout mutation in BROMODOMAIN AND ATPase DOMAIN-CONTAINING PROTEIN 1 (BRAT1), while DNA transposons are associated with variation at the HSP19 heat shock protein gene. We propose that the high rate of mobilization activity can be harnessed for targeted gene expression diversification, which may ultimately present a toolbox for the potential use of transposition in breeding and domestication of T. arvense.
Collapse
Affiliation(s)
| | - Dario Galanti
- Plant Evolutionary Ecology, University of Tübingen, Tübingen, Germany
| | - Andrea Movilli
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Claude Becker
- LMU Biocenter, Faculty of Biology, Ludwig Maximilians University Munich, Martinsried, Germany
| | - Oliver Bossdorf
- Plant Evolutionary Ecology, University of Tübingen, Tübingen, Germany
| | - Hajk-Georg Drost
- Computational Biology Group, Max Planck Institute for Biology Tübingen,Tübingen, Germany
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| |
Collapse
|
5
|
Baduel P, Sasaki E. The genetic basis of epigenetic variation and its consequences for adaptation. CURRENT OPINION IN PLANT BIOLOGY 2023; 75:102409. [PMID: 37451221 DOI: 10.1016/j.pbi.2023.102409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 05/28/2023] [Accepted: 06/01/2023] [Indexed: 07/18/2023]
Abstract
Recent population genomic studies in plants have shed new light on natural epigenetic variation by identifying key genetic determinants, "trans modifiers," that influence epigenetic states genome-wide and their interplay with environmental factors. Here, we review this progress by focusing on the epigenetic control of transposition and life-cycle transitions to highlight the ecological consequences of this genetic architecture and its evolutionary significance. This knowledge provides new opportunities to address long-standing questions about the establishment of environment-associated epigenetic variation and its relevance in adaptation.
Collapse
Affiliation(s)
- Pierre Baduel
- Institut de Biologie de l'École Normale Supérieure (IBENS), ENS, PSL University, CNRS, 46 rue d'Ulm, Paris 75005, France
| | - Eriko Sasaki
- Department of Biology, Faculty of Science, Kyushu University, 744 Motooka Nishi-ku, Fukuoka 819-0395, Japan.
| |
Collapse
|
6
|
Martins LM, Law JA. Moving targets: Mechanisms regulating siRNA production and DNA methylation during plant development. CURRENT OPINION IN PLANT BIOLOGY 2023; 75:102435. [PMID: 37598540 PMCID: PMC10581331 DOI: 10.1016/j.pbi.2023.102435] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 06/29/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023]
Abstract
DNA methylation is a conserved modification that must be precisely regulated during development to facilitate its roles in silencing transposable elements and regulating gene expression. In plants, DNA methylation changes during reproduction are widely documented and, in many cases, the underlying mechanisms are well understood. In somatic tissues, the diversity of methylation patterns are only recently emerging but they are often associated with the RNA-directed DNA methylation (RdDM) pathway. Here, we discuss advances in our understanding of how the locus-specific targeting and tissue-specific expression of RdDM proteins regulate methylation patterns, how the targeting of methylation at loci with imperfect homology expands the purview of RdDM, and how natural variation within RdDM factors impacts DNA methylation patterns.
Collapse
Affiliation(s)
- Laura M Martins
- Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, 92037, USA
| | - Julie A Law
- Plant Molecular and Cellular Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, 92037, USA; Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
7
|
Pisupati R, Nizhynska V, Mollá Morales A, Nordborg M. On the causes of gene-body methylation variation in Arabidopsis thaliana. PLoS Genet 2023; 19:e1010728. [PMID: 37141384 DOI: 10.1371/journal.pgen.1010728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 05/16/2023] [Accepted: 03/31/2023] [Indexed: 05/06/2023] Open
Abstract
Gene-body methylation (gbM) refers to sparse CG methylation of coding regions, which is especially prominent in evolutionarily conserved house-keeping genes. It is found in both plants and animals, but is directly and stably (epigenetically) inherited over multiple generations in the former. Studies in Arabidopsis thaliana have demonstrated that plants originating from different parts of the world exhibit genome-wide differences in gbM, which could reflect direct selection on gbM, but which could also reflect an epigenetic memory of ancestral genetic and/or environmental factors. Here we look for evidence of such factors in F2 plants resulting from a cross between a southern Swedish line with low gbM and a northern Swedish line with high gbM, grown at two different temperatures. Using bisulfite-sequencing data with nucleotide-level resolution on hundreds of individuals, we confirm that CG sites are either methylated (nearly 100% methylation across sampled cells) or unmethylated (approximately 0% methylation across sampled cells), and show that the higher level of gbM in the northern line is due to more sites being methylated. Furthermore, methylation variants almost always show Mendelian segregation, consistent with their being directly and stably inherited through meiosis. To explore how the differences between the parental lines could have arisen, we focused on somatic deviations from the inherited state, distinguishing between gains (relative to the inherited 0% methylation) and losses (relative to the inherited 100% methylation) at each site in the F2 generation. We demonstrate that deviations predominantly affect sites that differ between the parental lines, consistent with these sites being more mutable. Gains and losses behave very differently in terms of the genomic distribution, and are influenced by the local chromatin state. We find clear evidence for different trans-acting genetic polymorphism affecting gains and losses, with those affecting gains showing strong environmental interactions (G×E). Direct effects of the environment were minimal. In conclusion, we show that genetic and environmental factors can change gbM at a cellular level, and hypothesize that these factors can also lead to transgenerational differences between individuals via the inclusion of such changes in the zygote. If true, this could explain genographic pattern of gbM with selection, and would cast doubt on estimates of epimutation rates from inbred lines in constant environments.
Collapse
Affiliation(s)
- Rahul Pisupati
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna BioCenter (VBC), Vienna, Austria
- Vienna Graduate School of Population Genetics, Institut für Populationsgenetik, Vetmeduni, Vienna, Austria
| | - Viktoria Nizhynska
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna BioCenter (VBC), Vienna, Austria
| | - Almudena Mollá Morales
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna BioCenter (VBC), Vienna, Austria
| | - Magnus Nordborg
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna BioCenter (VBC), Vienna, Austria
| |
Collapse
|
8
|
Srikant T, Yuan W, Berendzen KW, Contreras-Garrido A, Drost HG, Schwab R, Weigel D. Canalization of genome-wide transcriptional activity in Arabidopsis thaliana accessions by MET1-dependent CG methylation. Genome Biol 2022; 23:263. [PMID: 36539836 PMCID: PMC9768921 DOI: 10.1186/s13059-022-02833-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Despite its conserved role on gene expression and transposable element (TE) silencing, genome-wide CG methylation differs substantially between wild Arabidopsis thaliana accessions. RESULTS To test our hypothesis that global reduction of CG methylation would reduce epigenomic, transcriptomic, and phenotypic diversity in A. thaliana accessions, we knock out MET1, which is required for CG methylation, in 18 early-flowering accessions. Homozygous met1 mutants in all accessions suffer from common developmental defects such as dwarfism and delayed flowering, in addition to accession-specific abnormalities in rosette leaf architecture, silique morphology, and fertility. Integrated analysis of genome-wide methylation, chromatin accessibility, and transcriptomes confirms that MET1 inactivation greatly reduces CG methylation and alters chromatin accessibility at thousands of loci. While the effects on TE activation are similarly drastic in all accessions, the quantitative effects on non-TE genes vary greatly. The global expression profiles of accessions become considerably more divergent from each other after genome-wide removal of CG methylation, although a few genes with diverse expression profiles across wild-type accessions tend to become more similar in mutants. Most differentially expressed genes do not exhibit altered chromatin accessibility or CG methylation in cis, suggesting that absence of MET1 can have profound indirect effects on gene expression and that these effects vary substantially between accessions. CONCLUSIONS Systematic analysis of MET1 requirement in different A. thaliana accessions reveals a dual role for CG methylation: for many genes, CG methylation appears to canalize expression levels, with methylation masking regulatory divergence. However, for a smaller subset of genes, CG methylation increases expression diversity beyond genetically encoded differences.
Collapse
Affiliation(s)
- Thanvi Srikant
- grid.419580.10000 0001 0942 1125Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany ,grid.5801.c0000 0001 2156 2780Present address: Institute of Molecular Plant Biology, Department of Biology, ETH Zürich, Zürich, Switzerland
| | - Wei Yuan
- grid.419580.10000 0001 0942 1125Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Kenneth Wayne Berendzen
- grid.10392.390000 0001 2190 1447Plant Transformation and Flow Cytometry Facility, ZMBP, University of Tübingen, Tübingen, Germany
| | - Adrián Contreras-Garrido
- grid.419580.10000 0001 0942 1125Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Hajk-Georg Drost
- grid.419580.10000 0001 0942 1125Computational Biology Group, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Rebecca Schwab
- grid.419580.10000 0001 0942 1125Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Detlef Weigel
- grid.419580.10000 0001 0942 1125Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| |
Collapse
|
9
|
Hüther P, Hagmann J, Nunn A, Kakoulidou I, Pisupati R, Langenberger D, Weigel D, Johannes F, Schultheiss SJ, Becker C. MethylScore, a pipeline for accurate and context-aware identification of differentially methylated regions from population-scale plant whole-genome bisulfite sequencing data. QUANTITATIVE PLANT BIOLOGY 2022; 3:e19. [PMID: 37077980 PMCID: PMC10095865 DOI: 10.1017/qpb.2022.14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 07/14/2022] [Accepted: 07/15/2022] [Indexed: 05/03/2023]
Abstract
Whole-genome bisulfite sequencing (WGBS) is the standard method for profiling DNA methylation at single-nucleotide resolution. Different tools have been developed to extract differentially methylated regions (DMRs), often built upon assumptions from mammalian data. Here, we present MethylScore, a pipeline to analyse WGBS data and to account for the substantially more complex and variable nature of plant DNA methylation. MethylScore uses an unsupervised machine learning approach to segment the genome by classification into states of high and low methylation. It processes data from genomic alignments to DMR output and is designed to be usable by novice and expert users alike. We show how MethylScore can identify DMRs from hundreds of samples and how its data-driven approach can stratify associated samples without prior information. We identify DMRs in the A. thaliana 1,001 Genomes dataset to unveil known and unknown genotype-epigenotype associations .
Collapse
Affiliation(s)
- Patrick Hüther
- Gregor Mendel Institute of Molecular Plant Biology GmbH, Austrian Academy of Sciences, Vienna BioCenter (VBC), 1030 Vienna, Austria
- LMU Biocenter, Faculty of Biology, Ludwig-Maximilians-University Munich, 82152 Martinsried, Germany
| | | | - Adam Nunn
- ecSeq Bioinformatics GmbH, 04103 Leipzig, Germany
- Department of Computer Science, Leipzig University, 04107 Leipzig, Germany
| | - Ioanna Kakoulidou
- Department of Plant Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Rahul Pisupati
- Gregor Mendel Institute of Molecular Plant Biology GmbH, Austrian Academy of Sciences, Vienna BioCenter (VBC), 1030 Vienna, Austria
| | | | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology, 72076 Tübingen, Germany
| | - Frank Johannes
- Department of Plant Sciences, Technical University of Munich, 85354 Freising, Germany
- Institute for Advanced Study, Technical University of Munich, 85748 Garching, Germany
| | | | - Claude Becker
- Gregor Mendel Institute of Molecular Plant Biology GmbH, Austrian Academy of Sciences, Vienna BioCenter (VBC), 1030 Vienna, Austria
- LMU Biocenter, Faculty of Biology, Ludwig-Maximilians-University Munich, 82152 Martinsried, Germany
| |
Collapse
|