1
|
Mardoc E, Sow MD, Déjean S, Salse J. Genomic data integration tutorial, a plant case study. BMC Genomics 2024; 25:66. [PMID: 38233804 PMCID: PMC10792847 DOI: 10.1186/s12864-023-09833-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 11/22/2023] [Indexed: 01/19/2024] Open
Abstract
BACKGROUND The ongoing evolution of the Next Generation Sequencing (NGS) technologies has led to the production of genomic data on a massive scale. While tools for genomic data integration and analysis are becoming increasingly available, the conceptual and analytical complexities still represent a great challenge in many biological contexts. RESULTS To address this issue, we describe a six-steps tutorial for the best practices in genomic data integration, consisting of (1) designing a data matrix; (2) formulating a specific biological question toward data description, selection and prediction; (3) selecting a tool adapted to the targeted questions; (4) preprocessing of the data; (5) conducting preliminary analysis, and finally (6) executing genomic data integration. CONCLUSION The tutorial has been tested and demonstrated on publicly available genomic data generated from poplar (Populus L.), a woody plant model. We also developed a new graphical output for the unsupervised multi-block analysis, cimDiablo_v2, available at https://forgemia.inra.fr/umr-gdec/omics-integration-on-poplar , and allowing the selection of master drivers in genomic data variation and interplay.
Collapse
Affiliation(s)
- Emile Mardoc
- UCA-INRAE UMR 1095 Genetics, Diversity and Ecophysiology of Cereals (GDEC), 5 Chemin de Beaulieu, 63000, Clermont-Ferrand, France
| | - Mamadou Dia Sow
- UCA-INRAE UMR 1095 Genetics, Diversity and Ecophysiology of Cereals (GDEC), 5 Chemin de Beaulieu, 63000, Clermont-Ferrand, France
| | - Sébastien Déjean
- Institut de Mathématiques de Toulouse, UMR 5219, Université de Toulouse, CNRS, Université Paul Sabatier, Toulouse, France
| | - Jérôme Salse
- UCA-INRAE UMR 1095 Genetics, Diversity and Ecophysiology of Cereals (GDEC), 5 Chemin de Beaulieu, 63000, Clermont-Ferrand, France.
| |
Collapse
|
2
|
Guemri J, Pierre-Jean M, Brohard S, Oussada N, Horgues C, Bonnet E, Mauger F, Deleuze JF. Methylated ccfDNA from plasma biomarkers of Alzheimer's disease using targeted bisulfite sequencing. Epigenomics 2022; 14:451-468. [PMID: 35416052 DOI: 10.2217/epi-2021-0491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Aim: Noninvasive biomarkers such as methylated ccfDNA from plasma could help to support the diagnosis of Alzheimer's disease (AD). Methods: A targeted sequencing protocol was developed to identify candidate biomarkers of AD in methylated ccfDNA extracted from plasma. Results: The authors identified differentially methylated CpGs, regions of which were the same as those identified in previous AD studies. Specifically, a differentially methylated CpG of the LHX2 gene previously identified in a plasma study of AD was replicated in the study. The MBP and DUSP22 regions have been identified in other brain studies of AD and in the authors' study. Conclusion: Although these biomarkers must be validated in other cohorts, methylated ccfDNA could be a relevant noninvasive biomarker in AD.
Collapse
Affiliation(s)
- Julien Guemri
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Morgane Pierre-Jean
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Solène Brohard
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Nouara Oussada
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Caroline Horgues
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Eric Bonnet
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Florence Mauger
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| | - Jean-François Deleuze
- Université Paris-Saclay, Commissariat à l'Energie Atomique et aux Energies Alternatives, Centre National de Recherche en Génomique Humaine, Evry, 91057, France
| |
Collapse
|
3
|
Dugé de Bernonville T, Daviaud C, Chaparro C, Tost J, Maury S. From Methylome to Integrative Analysis of Tissue Specificity. Methods Mol Biol 2022; 2505:223-240. [PMID: 35732948 DOI: 10.1007/978-1-0716-2349-7_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
DNA methylation is the most studied epigenetic mark in both plants and animals. The gold standard for assaying genome-wide DNA methylation at single-base resolution is whole-genome bisulfite sequencing (WGBS). Here, we describe an improved procedure for WGBS and original bioinformatic workflows applied to unravel tissue-specific variations of the methylome in relation to gene expression and accumulation of secondary metabolites in the medicinal plant Catharanthus roseus.
Collapse
Affiliation(s)
- Thomas Dugé de Bernonville
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, France
- Limagrain, Centre de Recherches de Chappes, Route d'Ennezat, Chappes, France
| | - Christian Daviaud
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie François Jacob, Université Paris Saclay, Evry, France
| | - Cristian Chaparro
- UMR5244 IHPE, Université Montpellier, CNRS, IFREMER, Université Perpignan, Perpignan, France
| | - Jörg Tost
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie François Jacob, Université Paris Saclay, Evry, France
| | - Stéphane Maury
- EA1207 USC1328 Laboratoire de Biologie des Ligneux et des Grandes Cultures, INRAe, Université d'Orléans, Orléans, France.
| |
Collapse
|
4
|
Tost J. Current and Emerging Technologies for the Analysis of the Genome-Wide and Locus-Specific DNA Methylation Patterns. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1389:395-469. [DOI: 10.1007/978-3-031-11454-0_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
5
|
Sow MD, Le Gac AL, Fichot R, Lanciano S, Delaunay A, Le Jan I, Lesage-Descauses MC, Citerne S, Caius J, Brunaud V, Soubigou-Taconnat L, Cochard H, Segura V, Chaparro C, Grunau C, Daviaud C, Tost J, Brignolas F, Strauss SH, Mirouze M, Maury S. RNAi suppression of DNA methylation affects the drought stress response and genome integrity in transgenic poplar. THE NEW PHYTOLOGIST 2021; 232:80-97. [PMID: 34128549 DOI: 10.1111/nph.17555] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 06/08/2021] [Indexed: 05/27/2023]
Abstract
Trees are long-lived organisms that continuously adapt to their environments, a process in which epigenetic mechanisms are likely to play a key role. Via downregulation of the chromatin remodeler DECREASED IN DNA METHYLATION 1 (DDM1) in poplar (Populus tremula × Populus alba) RNAi lines, we examined how DNA methylation coordinates genomic and physiological responses to moderate water deficit. We compared the growth and drought response of two RNAi-ddm1 lines to wild-type (WT) trees under well-watered and water deficit/rewatering conditions, and analyzed their methylomes, transcriptomes, mobilomes and phytohormone contents in the shoot apical meristem. The RNAi-ddm1 lines were more tolerant to drought-induced cavitation but did not differ in height or stem diameter growth. About 5000 differentially methylated regions were consistently detected in both RNAi-ddm1 lines, colocalizing with 910 genes and 89 active transposable elements. Under water deficit conditions, 136 differentially expressed genes were found, including many involved in phytohormone pathways; changes in phytohormone concentrations were also detected. Finally, the combination of hypomethylation and drought led to the mobility of two transposable elements. Our findings suggest major roles for DNA methylation in regulation of genes involved in hormone-related stress responses, and the maintenance of genome integrity through repression of transposable elements.
Collapse
Affiliation(s)
- Mamadou D Sow
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | - Anne-Laure Le Gac
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | - Régis Fichot
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | - Sophie Lanciano
- IRD, UMR 232 DIADE, Université de Montpellier, Montpellier, 34090, France
- Laboratory of Plant Genome and Development, Université de Perpignan, Perpignan, 66860, France
| | - Alain Delaunay
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | - Isabelle Le Jan
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | | | - Sylvie Citerne
- Institut Jean-Pierre Bourgin, INRAE, AgroParisTech, Université Paris-Saclay, Versailles, 78000, France
| | - Jose Caius
- Institute of Plant Sciences Paris-Saclay (IPS2), Université Paris-Saclay, CNRS, INRAE, Université Evry, Orsay, 91405, France
| | - Véronique Brunaud
- Institute of Plant Sciences Paris-Saclay (IPS2), Université Paris-Saclay, CNRS, INRAE, Université Evry, Orsay, 91405, France
| | - Ludivine Soubigou-Taconnat
- Institute of Plant Sciences Paris-Saclay (IPS2), Université Paris-Saclay, CNRS, INRAE, Université Evry, Orsay, 91405, France
| | - Hervé Cochard
- Université Clermont Auvergne, INRAE, PIAF, Clermont-Ferrand, 63000, France
| | - Vincent Segura
- BioForA, INRAE, ONF, UMR 0588, Orléans, 45075, France
- UMR AGAP Institut, Université Montpellier, CIRAD, INRAE, Institut Montpellier SupAgro, UMR 1334, Montpellier, F-34398, France
| | | | - Christoph Grunau
- UMR 5244, IHPE, Université de Perpignan, Perpignan, 66100, France
| | - Christian Daviaud
- Laboratory for Epigenetics and Environment Centre National de Recherche en Génomique Humaine, CEA- Institut de Biologie Francois Jacob, Université Paris-Saclay, Evry, 91057, France
| | - Jörg Tost
- Laboratory for Epigenetics and Environment Centre National de Recherche en Génomique Humaine, CEA- Institut de Biologie Francois Jacob, Université Paris-Saclay, Evry, 91057, France
| | - Franck Brignolas
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| | - Steven H Strauss
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, OR, 97331-5752, USA
| | - Marie Mirouze
- IRD, UMR 232 DIADE, Université de Montpellier, Montpellier, 34090, France
- Laboratory of Plant Genome and Development, Université de Perpignan, Perpignan, 66860, France
| | - Stéphane Maury
- LBLGC, INRAE, Université d'Orléans, EA 1207 USC 1328, Orléans, 45067, France
| |
Collapse
|
6
|
Dugé de Bernonville T, Maury S, Delaunay A, Daviaud C, Chaparro C, Tost J, O’Connor SE, Courdavault V. Developmental Methylome of the Medicinal Plant Catharanthus roseus Unravels the Tissue-Specific Control of the Monoterpene Indole Alkaloid Pathway by DNA Methylation. Int J Mol Sci 2020; 21:E6028. [PMID: 32825765 PMCID: PMC7503379 DOI: 10.3390/ijms21176028] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 08/06/2020] [Accepted: 08/18/2020] [Indexed: 02/07/2023] Open
Abstract
Catharanthus roseus produces a wide spectrum of monoterpene indole alkaloids (MIAs). MIA biosynthesis requires a tightly coordinated pathway involving more than 30 enzymatic steps that are spatio-temporally and environmentally regulated so that some MIAs specifically accumulate in restricted plant parts. The first regulatory layer involves a complex network of transcription factors from the basic Helix Loop Helix (bHLH) or AP2 families. In the present manuscript, we investigated whether an additional epigenetic layer could control the organ-, developmental- and environmental-specificity of MIA accumulation. We used Whole-Genome Bisulfite Sequencing (WGBS) together with RNA-seq to identify differentially methylated and expressed genes among nine samples reflecting different plant organs and experimental conditions. Tissue specific gene expression was associated with specific methylation signatures depending on cytosine contexts and gene parts. Some genes encoding key enzymatic steps from the MIA pathway were found to be simultaneously differentially expressed and methylated in agreement with the corresponding MIA accumulation. In addition, we found that transcription factors were strikingly concerned by DNA methylation variations. Altogether, our integrative analysis supports an epigenetic regulation of specialized metabolisms in plants and more likely targeting transcription factors which in turn may control the expression of enzyme-encoding genes.
Collapse
Affiliation(s)
- Thomas Dugé de Bernonville
- Faculté des Sciences et Techniques, Université de Tours, EA2106 Biomolécules et Biotechnologies Végétales, F-37200 Tours, France;
| | - Stéphane Maury
- INRA, EA1207 USC1328 Laboratoire de Biologie des Ligneux et des Grandes Cultures, Université d’Orléans, F-45067 Orléans, France;
| | - Alain Delaunay
- INRA, EA1207 USC1328 Laboratoire de Biologie des Ligneux et des Grandes Cultures, Université d’Orléans, F-45067 Orléans, France;
| | - Christian Daviaud
- Laboratoire Epigénétique et Environnement, LEE, Centre National de Recherche en Génomique Humaine, Institut de Biologie François Jacob, F-92265 Evry, France; (C.D.); (J.T.)
| | - Cristian Chaparro
- CNRS, IFREMER, UMR5244 Interactions Hôtes-Pathogènes-Environnments, Université de Montpellier, Université de Perpignan Via Domitia, F-66860 Perpignan, France;
| | - Jörg Tost
- Laboratoire Epigénétique et Environnement, LEE, Centre National de Recherche en Génomique Humaine, Institut de Biologie François Jacob, F-92265 Evry, France; (C.D.); (J.T.)
| | - Sarah Ellen O’Connor
- Max Planck Institute for Chemical Ecology, Department of Natural Product Biosynthesis, 07745 Jena, Germany;
| | - Vincent Courdavault
- Faculté des Sciences et Techniques, Université de Tours, EA2106 Biomolécules et Biotechnologies Végétales, F-37200 Tours, France;
| |
Collapse
|
7
|
Comparison of commercially available whole-genome sequencing kits for variant detection in circulating cell-free DNA. Sci Rep 2020; 10:6190. [PMID: 32277101 PMCID: PMC7148341 DOI: 10.1038/s41598-020-63102-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 03/19/2020] [Indexed: 12/13/2022] Open
Abstract
Circulating cell-free DNA (ccfDNA) has great potential for non-invasive diagnosis, prognosis and monitoring treatment of disease. However, a sensitive and specific whole-genome sequencing (WGS) method is required to identify novel genetic variations (i.e., SNVs, CNVs and INDELS) on ccfDNA that can be used as clinical biomarkers. In this article, five WGS methods were compared: ThruPLEX Plasma-seq, QIAseq cfDNA All-in-One, NEXTFLEX Cell Free DNA-seq, Accel-NGS 2 S PCR FREE DNA and Accel-NGS 2 S PLUS DNA. The Accel PCR-free kit did not produce enough material for sequencing. The other kits had significant common number of SNVs, INDELs and CNVs and showed similar results for SNVs and CNVs. The detection of variants and genomic signatures depends more upon the type of plasma sample rather than the WGS method used. Accel detected several variants not observed by the other kits. ThruPLEX seemed to identify more low-abundant SNVs and SNV signatures were similar to signatures observed with the QIAseq kit. Accel and NEXTFLEX had similar CNV and SNV signatures. These results demonstrate the importance of establishing a standardized workflow for identifying non-invasive candidate biomarkers. Moreover, the combination of variants discovered in ccfDNA using WGS has the potential to identify enrichment pathways, while the analysis of signatures could identify new subgroups of patients.
Collapse
|
8
|
Kernaleguen M, Daviaud C, Shen Y, Bonnet E, Renault V, Deleuze JF, Mauger F, Tost J. Whole-Genome Bisulfite Sequencing for the Analysis of Genome-Wide DNA Methylation and Hydroxymethylation Patterns at Single-Nucleotide Resolution. Methods Mol Biol 2018. [PMID: 29524144 DOI: 10.1007/978-1-4939-7774-1_18] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The analysis of genome-wide epigenomic alterations including DNA methylation and hydroxymethylation has become a subject of intensive research for many biological and disease-associated investigations. Whole-genome bisulfite sequencing (WGBS) using next-generation sequencing technologies is currently considered as the gold standard for a comprehensive and quantitative analysis of DNA methylation throughout the genome. However, bisulfite conversion does not allow distinguishing between cytosine methylation and hydroxymethylation requiring an additional chemical or enzymatic step to identify hydroxymethylated cytosines. Here we provide two detailed protocols based on commercial kits for the preparation of sequencing libraries for the comprehensive whole-genome analysis of DNA methylation and/or hydroxymethylation. If only DNA methylation is of interest, sequencing libraries can be constructed from limited amounts of input DNA by ligation of methylated adaptors to the fragmented DNA prior to bisulfite conversion. For samples with significant levels of hydroxymethylation such as stem cells or brain tissue, we describe the protocol of oxidative bisulfite sequencing (OxBs-seq), which in its current version uses a post-bisulfite adaptor tagging (PBAT) approach. Two methylomes need to be generated: a classic methylome following bisulfite conversion and analyzing both methylated and hydroxymethylated cytosines and a methylome analyzing only methylated cytosines, respectively. We also provide a step-by-step description of the data analysis using publicly available bioinformatic tools. The described protocols have been successfully applied to different human samples and yield robust and reproducible results.
Collapse
Affiliation(s)
- Magali Kernaleguen
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France
| | - Christian Daviaud
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France
| | - Yimin Shen
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France
| | - Eric Bonnet
- Laboratory for Bio-analysis, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France
| | - Victor Renault
- Laboratory for Bioinformatics, Fondation Jean Dausset - CEPH, Paris, France
| | - Jean-François Deleuze
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France.,Laboratory for Bioinformatics, Fondation Jean Dausset - CEPH, Paris, France
| | - Florence Mauger
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France
| | - Jörg Tost
- Laboratory for Epigenetics and Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie Francois Jacob, Evry, France.
| |
Collapse
|