1
Aliaga B, Bulla I, Mouahid G, Duval D, Grunau C. Universality of the DNA methylation codes in Eucaryotes. Sci Rep 2019; 9:173. [PMID: 30655579 PMCID: PMC6336885 DOI: 10.1038/s41598-018-37407-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Received: 01/09/2018] [Accepted: 10/24/2018] [Indexed: 12/26/2022] Open
Abstract
Genetics and epigenetics are tightly linked classes of heritable information. The question arises whether epigenetics provides just a set of environment-dependent instructions, or whether it is an integral part of an inheritance system. We argued that in the latter case the epigenetic code should share the universality of the genetic code. We focused on DNA methylation. Since the availability of DNA methylation data is biased towards model organisms, we developed a method that uses kernel density estimations of CpG observed/expected ratios to infer DNA methylation types in any genome. We show here that our method allows for robust prediction of mosaic and full gene body methylation with a positive predictive value (PPV) of 1 and 0.87, respectively. We used this prediction to complement experimental data, and applied hierarchical clustering to identify methylation types in ~150 eucaryotic species covering different body plans, reproduction types and living conditions. Our analysis indicates that there are only four gene body methylation types. These types do not follow phylogeny (i.e. phylogenetically distant clades can have identical methylation types) but they are consistent within clades. We conclude that the gene body DNA methylation codes have a universality similar to that of the genetic code and should consequently be considered part of the inheritance system.
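The CpG observed/expected ratio and the kernel density step named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the normalization (#CpG × L) / (#C × #G) is one commonly used definition and is assumed here, the Gaussian kernel, the bandwidth of 0.05, and the per-gene ratios are all hypothetical, and counting local maxima is only a simple stand-in for telling a unimodal distribution from a bimodal one.

```python
import math


def cpg_oe(seq: str) -> float:
    """CpG observed/expected ratio, using (#CpG * L) / (#C * #G) (assumed definition)."""
    s = seq.upper()
    c, g, cpg = s.count("C"), s.count("G"), s.count("CG")
    if c == 0 or g == 0:
        return 0.0  # ratio undefined without both C and G; reported as 0 here
    return cpg * len(s) / (c * g)


def gaussian_kde(values, grid, bandwidth):
    """Evaluate a Gaussian kernel density estimate of `values` on `grid`."""
    norm = 1.0 / (len(values) * bandwidth * math.sqrt(2.0 * math.pi))
    return [
        norm * sum(math.exp(-0.5 * ((x - v) / bandwidth) ** 2) for v in values)
        for x in grid
    ]


def count_modes(density):
    """Count strict local maxima of a sampled density curve."""
    return sum(
        1
        for i in range(1, len(density) - 1)
        if density[i - 1] < density[i] > density[i + 1]
    )


# Hypothetical per-gene ratios: one CpG-depleted group, one near expectation.
ratios = [0.28, 0.30, 0.32, 0.98, 1.00, 1.02]
grid = [i * 0.01 for i in range(151)]  # 0.00 .. 1.50
density = gaussian_kde(ratios, grid, bandwidth=0.05)
print(cpg_oe("CGCG"), count_modes(density))
```

In this toy input the density has two separated peaks, the kind of shape that would suggest two gene populations with different CpG depletion; real data would need a principled bandwidth choice and far more genes.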
Affiliation(s)
- Benoît Aliaga
- University Perpignan Via Domitia, IHPE UMR 5244, CNRS, IFREMER, University Montpellier, F-66860, Perpignan, France
- Ingo Bulla
- University Perpignan Via Domitia, IHPE UMR 5244, CNRS, IFREMER, University Montpellier, F-66860, Perpignan, France
- Institute for Mathematics and Informatics, University of Greifswald, Greifswald, Germany
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
- Gabriel Mouahid
- University Perpignan Via Domitia, IHPE UMR 5244, CNRS, IFREMER, University Montpellier, F-66860, Perpignan, France
- David Duval
- University Perpignan Via Domitia, IHPE UMR 5244, CNRS, IFREMER, University Montpellier, F-66860, Perpignan, France
- Christoph Grunau
- University Perpignan Via Domitia, IHPE UMR 5244, CNRS, IFREMER, University Montpellier, F-66860, Perpignan, France
2
Keller B, Manzanares C, Jara C, Lobaton JD, Studer B, Raatz B. Fine-mapping of a major QTL controlling angular leaf spot resistance in common bean (Phaseolus vulgaris L.). Theor Appl Genet 2015; 128:813-26. [PMID: 25740562 PMCID: PMC4544502 DOI: 10.1007/s00122-015-2472-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 07/28/2014] [Accepted: 01/31/2015] [Indexed: 05/07/2023]
Abstract
Key message: A major QTL for angular leaf spot resistance in the common bean accession G5686 was fine-mapped to a region containing 36 candidate genes. Markers have been developed for marker-assisted selection. Common bean (Phaseolus vulgaris L.) is an important grain legume and an essential protein source for human nutrition in developing countries. Angular leaf spot (ALS) caused by the pathogen Pseudocercospora griseola (Sacc.) Crous and U. Braun is responsible for severe yield losses of up to 80%. Breeding for resistant cultivars is the most ecological and economical means to control ALS and is particularly important for yield stability in low-input agriculture. Here, we report on a fine-mapping approach of a major quantitative trait locus (QTL) ALS4.1(GS, UC) for ALS resistance in a mapping population derived from the resistant genotype G5686 and the susceptible cultivar Sprite. 180 F3 individuals of the mapping population were evaluated for ALS resistance and genotyped with 22 markers distributed over 11 genome regions colocating with previously reported QTL for ALS resistance. Multiple QTL analysis identified three QTL regions, including one major QTL on chromosome Pv04 at 43.7 Mbp explaining over 75% of the observed variation for ALS resistance. Additional evaluation of 153 F4, 89 BC1F2 and 139 F4/F5/BC1F3 descendants with markers in the region of the major QTL delimited the region to 418 kbp harboring 36 candidate genes. Among these, 11 serine/threonine protein kinases arranged in a repetitive array constitute promising candidate genes for controlling ALS resistance. Single nucleotide polymorphism markers cosegregating with the major QTL for ALS resistance have been developed and constitute the basis for marker-assisted introgression of ALS resistance into advanced breeding germplasm of common bean.
Affiliation(s)
- Beat Keller
- Forage Crop Genetics, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
- Chloe Manzanares
- Forage Crop Genetics, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
- Carlos Jara
- Agrobiodiversity Research Area, Bean Program, CIAT Cali-Palmira, A. A. 6713, Cali, Colombia
- Juan David Lobaton
- Agrobiodiversity Research Area, Bean Program, CIAT Cali-Palmira, A. A. 6713, Cali, Colombia
- Bruno Studer
- Forage Crop Genetics, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
- Bodo Raatz
- Agrobiodiversity Research Area, Bean Program, CIAT Cali-Palmira, A. A. 6713, Cali, Colombia
3
Peng FY, Weselake RJ. Genome-wide identification and analysis of the B3 superfamily of transcription factors in Brassicaceae and major crop plants. Theor Appl Genet 2013; 126:1305-19. [PMID: 23377560 DOI: 10.1007/s00122-013-2054-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Received: 06/28/2012] [Accepted: 01/09/2013] [Indexed: 05/04/2023]
Abstract
The plant-specific B3 superfamily of transcription factors has diverse functions in plant growth and development. Using a genome-wide domain analysis, we identified 92, 187, 58, 90, 81, 55, and 77 B3 transcription factor genes in the sequenced genomes of Arabidopsis, Brassica rapa, castor bean (Ricinus communis), cocoa (Theobroma cacao), soybean (Glycine max), maize (Zea mays), and rice (Oryza sativa), respectively. The B3 superfamily has substantially expanded during evolution in eudicots, particularly in Brassicaceae, as compared to the monocots in the analysis. We observed domain duplication in some of these B3 proteins, forming more complex domain architectures than currently understood. We found that the length of B3 domains varies widely, which may affect the exact number of α-helices and β-sheets in their core structure and possibly has functional implications. Analysis of public microarray data indicated that most of the B3 gene pairs encoding Arabidopsis-rice orthologs are preferentially expressed in different tissues, suggesting different roles in these two species. Using ESTs in crops, we identified many B3 genes preferentially expressed in reproductive tissues. In a sequence-based quantitative trait loci analysis in rice and maize, we found many B3 genes associated with traits such as grain yield, seed weight and number, and protein content. Our results provide a framework for future studies into the function of B3 genes in different phases of plant development, especially those related to traits in major crops.
Affiliation(s)
- Fred Y Peng
- Agricultural Lipid Biotechnology Program, Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada
4
Pérot P, Mugnier N, Montgiraud C, Gimenez J, Jaillard M, Bonnaud B, Mallet F. Microarray-based sketches of the HERV transcriptome landscape. PLoS One 2012; 7:e40194. [PMID: 22761958 PMCID: PMC3386233 DOI: 10.1371/journal.pone.0040194] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Received: 02/02/2012] [Accepted: 06/02/2012] [Indexed: 12/15/2022] Open
Abstract
Human endogenous retroviruses (HERVs) are spread throughout the genome and their long terminal repeats (LTRs) constitute a wide collection of putative regulatory sequences. Phylogenetic similarities and the profusion of integration sites, two inherent characteristics of transposable elements, make it difficult to study individual locus expression in a large-scale approach, and historically apart from some placental and testis-regulated elements, it was generally accepted that HERVs are silent due to epigenetic control. Herein, we have introduced a generic method aiming to optimally characterize individual loci associated with 25-mer probes by minimizing cross-hybridization risks. We therefore set up a microarray dedicated to a collection of 5,573 HERVs that can reasonably be assigned to a unique genomic position. We obtained a first view of the HERV transcriptome by using a composite panel of 40 normal and 39 tumor samples. The experiment showed that almost one third of the HERV repertoire is indeed transcribed. The HERV transcriptome follows tropism rules, is sensitive to the state of differentiation and, unexpectedly, seems not to correlate with the age of the HERV families. The probeset definition within the U3 and U5 regions was used to assign a function to some LTRs (i.e. promoter or polyA) and revealed that (i) autonomous active LTRs are broadly subjected to operational determinism (ii) the cellular gene density is substantially higher in the surrounding environment of active LTRs compared to silent LTRs and (iii) the configuration of neighboring cellular genes differs between active and silent LTRs, showing an approximately 8 kb zone upstream of promoter LTRs characterized by a drastic reduction in sense cellular genes. These gathered observations are discussed in terms of virus/host adaptive strategies, and together with the methods and tools developed for this purpose, this work paves the way for further HERV transcriptome projects.
Affiliation(s)
- Philippe Pérot
- Joint Unit Hospices Civils de Lyon, bioMérieux, Cancer Biomarkers Research Group, Centre Hospitalier Lyon Sud, Lyon, France
- Nathalie Mugnier
- BioMérieux, Data and Knowledge Laboratory, Marcy l’Etoile, France
- Cécile Montgiraud
- Joint Unit Hospices Civils de Lyon, bioMérieux, Cancer Biomarkers Research Group, Centre Hospitalier Lyon Sud, Lyon, France
- Juliette Gimenez
- Joint Unit Hospices Civils de Lyon, bioMérieux, Cancer Biomarkers Research Group, Centre Hospitalier Lyon Sud, Lyon, France
- Magali Jaillard
- BioMérieux, Data and Knowledge Laboratory, Marcy l’Etoile, France
- Bertrand Bonnaud
- BioMérieux, Data and Knowledge Laboratory, Marcy l’Etoile, France
- François Mallet
- Joint Unit Hospices Civils de Lyon, bioMérieux, Cancer Biomarkers Research Group, Centre Hospitalier Lyon Sud, Lyon, France
5
Identification of OASL d, a splice variant of human OASL, with antiviral activity. Int J Biochem Cell Biol 2012; 44:1133-8. [PMID: 22531715 DOI: 10.1016/j.biocel.2012.04.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 10/20/2011] [Revised: 03/16/2012] [Accepted: 04/04/2012] [Indexed: 11/21/2022]
Abstract
The 2',5'-oligoadenylate synthetases (OASs) are IFN-induced antiviral proteins and are upregulated by infection with viral and some bacterial pathogens. There are at least 2 transcripts of approximately 1.8 and 2.0 kb in interferon-beta treated samples that are recognized by a probe for human OASL in a Northern blot assay. By RT-PCR amplification we have isolated a previously undescribed splice variant of human OASL, named OASL d. The new variant is derived from deletion of exons 4 and 5 and encodes a protein of 384 aa residues that shares the N-terminal 219 aa residues with OASL a. Sequence analysis indicates that OASL d also contains the entire ubiquitin-like domain identified in human OASL a. As detected by immunoblotting, OASL d was strongly induced by IFNγ in THP-1 monocytic cells and by interferon-beta in A549 epithelial cells. Ectopic expression of OASL a or OASL d, but not OASL b that shares the N-terminus with OASL a and d, partially inhibited EV71 and VSV infection. No effect against HSV-2 infection was observed. Therefore, OASL d is a novel isoform of human OASL that possesses antiviral activity against RNA viruses.
6
Cantacessi C, Campbell BE, Jex AR, Young ND, Hall RS, Ranganathan S, Gasser RB. Bioinformatics meets parasitology. Parasite Immunol 2012; 34:265-75. [DOI: 10.1111/j.1365-3024.2011.01304.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Indexed: 01/25/2023]
7
Fernández-Pozo N, Canales J, Guerrero-Fernández D, Villalobos DP, Díaz-Moreno SM, Bautista R, Flores-Monterroso A, Guevara MÁ, Perdiguero P, Collada C, Cervera MT, Soto A, Ordás R, Cantón FR, Avila C, Cánovas FM, Claros MG. EuroPineDB: a high-coverage web database for maritime pine transcriptome. BMC Genomics 2011; 12:366. [PMID: 21762488 PMCID: PMC3152544 DOI: 10.1186/1471-2164-12-366] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Received: 05/26/2011] [Accepted: 07/15/2011] [Indexed: 11/30/2022] Open
Abstract
Background: Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases. Description: EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided.
Conclusions: The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome.
Affiliation(s)
- Noé Fernández-Pozo
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Campus de Teatinos s/n, Universidad de Málaga, 29071 Málaga, Spain
8
Understanding Vegetative Desiccation Tolerance Using Integrated Functional Genomics Approaches Within a Comparative Evolutionary Framework. Plant Desiccation Tolerance 2011. [DOI: 10.1007/978-3-642-19106-0_15] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Indexed: 01/09/2023]
9
Lee BW, Chu IS, Kim NS, Lee JH, Kim SY, Kim WK, Lee SH. Bioinformatics Resources of the Korean Bioinformation Center (KOBIC). Genomics Inform 2010. [DOI: 10.5808/gi.2010.8.4.165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022] Open
10
Abstract
Background: The general method used to determine the function of newly discovered proteins is to transfer annotations from well-characterized homologous proteins. The process of selecting homologous proteins can largely be classified into sequence-based and domain-based approaches. Domain-based methods have several advantages over sequence-based methods for identifying distant homology and homology among proteins with multiple domains. However, these methods are challenged by large families defined by 'promiscuous' (or 'mobile') domains. Results: Here we present a measure of domain architecture similarity, called Weighed Domain Architecture Comparison (WDAC), which can be used to identify homologs of multidomain proteins. To distinguish these promiscuous domains from conventional protein domains, we assigned a weight score to each Pfam domain extracted from RefSeq proteins, based on its abundance and versatility. To measure the similarity of two domain architectures, cosine similarity (a similarity measure used in information retrieval) is used. We combined sequence similarity with domain architecture comparisons to identify proteins belonging to the same domain architecture. Using human and nematode proteomes, we compared WDAC with an unweighted domain architecture method (DAC) to evaluate the effectiveness of domain weight scores. We found that WDAC is better at identifying homology among multidomain proteins. Conclusion: Our analysis indicates that considering domain weight scores in domain architecture comparisons improves protein homology identification. We developed a web-based server to allow users to compare their proteins with protein domain architectures.
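The idea of comparing weighted domain architectures by cosine similarity can be sketched as follows. This is an illustration under stated assumptions, not the WDAC scoring scheme itself: the abstract does not give the weight formula, so the weights, the domain names, and the choice of a count-times-weight vector are all hypothetical.

```python
import math
from collections import Counter


def weighted_vector(architecture, weights):
    """Map a domain architecture (list of domain names) to a weighted count vector.

    Domains missing from `weights` default to 1.0 (assumption for this sketch).
    """
    counts = Counter(architecture)
    return {d: n * weights.get(d, 1.0) for d, n in counts.items()}


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(w * vec_b.get(d, 0.0) for d, w in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty architecture is treated as sharing nothing
    return dot / (norm_a * norm_b)


# Hypothetical weights: the 'promiscuous' domain gets a low score.
weights = {"kinase": 1.0, "promiscuous": 0.5}
protein_a = ["kinase", "promiscuous"]
protein_b = ["kinase"]
protein_c = ["promiscuous"]

vec_a = weighted_vector(protein_a, weights)
sim_ab = cosine_similarity(vec_a, weighted_vector(protein_b, weights))
sim_ac = cosine_similarity(vec_a, weighted_vector(protein_c, weights))
print(sim_ab > sim_ac)
```

With every weight set to 1.0, proteins b and c would score identically against protein a; down-weighting the promiscuous domain makes the shared kinase domain count for more, which is the effect the abstract attributes to domain weight scores.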
Affiliation(s)
- Byungwook Lee
- Korean BioInformation Center, KRIBB, Daejeon 305-806, Korea.