1
|
Hubert JN, Iannuccelli N, Cabau C, Jacomet E, Billon Y, Serre RF, Vandecasteele C, Donnadieu C, Demars J. Detection of DNA methylation signatures through the lens of genomic imprinting. Sci Rep 2024; 14:1694. [PMID: 38242932 PMCID: PMC10798973 DOI: 10.1038/s41598-024-52114-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 01/14/2024] [Indexed: 01/21/2024] Open
Abstract
Genomic imprinting represents an original model of epigenetic regulation resulting in a parent-of-origin expression. Despite the critical role of imprinted genes in mammalian growth, metabolism and neuronal function, there is no molecular tool specifically targeting them for a systematic evaluation. We show here that enzymatic methyl-seq consistently outperforms the bisulfite-based standard in capturing 165 candidate regions for genomic imprinting in the pig. This highlights the potential for a turnkey, fully customizable and reliable capture tool of genomic regions regulated by cytosine methylation in any population of interest. For the field of genomic imprinting, it opens up the possibility of detecting multilocus imprinting variations across the genome, with implications for basic research, agrigenomics and clinical practice.
Collapse
Affiliation(s)
- Jean-Noël Hubert
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31326, Castanet-Tolosan, France
| | | | - Cédric Cabau
- Sigenae, GenPhySE, Université de Toulouse, INRAE, ENVT, 31326, Castanet-Tolosan, France
| | - Eva Jacomet
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31326, Castanet-Tolosan, France
- ENVT, 31326, Castanet-Tolosan, France
| | | | - Rémy-Félix Serre
- INRAE, GeT-PlaGe, Genotoul, 31326, Castanet-Tolosan, France
- Qualyse, Le Treuil, INRAE, 19000, Tulle, France
| | | | | | - Julie Demars
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31326, Castanet-Tolosan, France.
| |
Collapse
|
2
|
Parey E, Louis A, Montfort J, Bouchez O, Roques C, Iampietro C, Lluch J, Castinel A, Donnadieu C, Desvignes T, Floi Bucao C, Jouanno E, Wen M, Mejri S, Dirks R, Jansen H, Henkel C, Chen WJ, Zahm M, Cabau C, Klopp C, Thompson AW, Robinson-Rechavi M, Braasch I, Lecointre G, Bobe J, Postlethwait JH, Berthelot C, Roest Crollius H, Guiguen Y. Genome structures resolve the early diversification of teleost fishes. Science 2023; 379:572-575. [PMID: 36758078 DOI: 10.1126/science.abq4257] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Abstract
Accurate species phylogenies are a prerequisite for all evolutionary research. Teleosts are the largest and most diversified group of extant vertebrates, but relationships among their three oldest extant lineages remain unresolved. On the basis of seven high-quality new genome assemblies in Elopomorpha (tarpons, eels), we revisited the topology of the deepest branches of the teleost phylogeny using independent gene sequence and chromosomal rearrangement phylogenomic approaches. These analyses converged to a single scenario that unambiguously places the Elopomorpha and Osteoglossomorpha (arapaima, elephantnose fish) in a monophyletic sister group to all other teleosts, i.e., the Clupeocephala lineage (zebrafish, medaka). This finding resolves more than 50 years of controversy on the evolutionary relationships of these lineages and highlights the power of combining different levels of genome-wide information to solve complex phylogenies.
Collapse
Affiliation(s)
- Elise Parey
- Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- INRAE, LPGP, Rennes, France
| | - Alexandra Louis
- Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | | | | | - Céline Roques
- GeT-PlaGe, INRAE, Genotoul, Castanet-Tolosan, France
| | | | - Jerome Lluch
- GeT-PlaGe, INRAE, Genotoul, Castanet-Tolosan, France
| | | | | | - Thomas Desvignes
- Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | - Christabel Floi Bucao
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Ming Wen
- INRAE, LPGP, Rennes, France
- State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Sahar Mejri
- Florida Atlantic University, Harbor Branch Oceanographic Institute, Fort Pierce, FL, USA
| | - Ron Dirks
- Future Genomics Technologies, Leiden, Netherlands
| | - Hans Jansen
- Future Genomics Technologies, Leiden, Netherlands
| | - Christiaan Henkel
- Institute of Biology, University of Leiden, Leiden, Netherlands
- Faculty of Veterinary Medicine, Norwegian University of Life Sciences, Ås, Norway
| | - Wei-Jen Chen
- Institute of Oceanography, National Taiwan University, Taipei 10617, Taiwan
| | - Margot Zahm
- Sigenae, GenPhySE, INRAE, ENVT, Université de Toulouse, Castanet Tolosan, France
| | - Cédric Cabau
- Sigenae, GenPhySE, INRAE, ENVT, Université de Toulouse, Castanet Tolosan, France
| | - Christophe Klopp
- Sigenae, GenPhySE, INRAE, ENVT, Université de Toulouse, Castanet Tolosan, France
- Sigenae, Genotoul Bioinfo, MIAT UR875, INRAE, Castanet Tolosan, France
| | - Andrew W Thompson
- Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, USA
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI, USA
| | | | - Ingo Braasch
- Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, USA
| | - Guillaume Lecointre
- Institut Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, SU, EPHE, UA, Paris, France
| | | | | | - Camille Berthelot
- Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Institut Pasteur, Université Paris Cité, CNRS UMR 3525, INSERM UA12, Comparative Functional Genomics group, F-75015 Paris, France
| | - Hugues Roest Crollius
- Institut de Biologie de l'ENS (IBENS), Département de Biologie, École Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | | |
Collapse
|
3
|
Iannuccelli N, Sarry J, Billon Y, Aymard P, Helies V, Cabau C, Donnadieu C, Demars J. Rabbit targeted genomic sequences after heterologous hybridization using human exome. BMC Res Notes 2022; 15:282. [PMID: 35986355 PMCID: PMC9389681 DOI: 10.1186/s13104-022-06162-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 07/27/2022] [Indexed: 11/16/2022] Open
Abstract
Causal mutations for major genes that underlie a broad range of morphological traits are often located within exons of genes that then affect protein functions. Non-model organism genetic studies are not easy to perform due to the lack of genome-wide molecular tools such as SNP genotyping array. Genotyping-By-Sequencing (GBS) methods offer an alternative. Consequently, we used this approach that is focused on the exome to target and identify major genes in rabbit populations. Data description We used a heterologous enrichment method before sequencing, allowing us to capture the rabbit exome using the marketed human panel since mammal protein coding genes are well conserved across the phylogenic tree of species. This targeted strategy was performed on 52 French rabbits from 5 different French strains (Californian, New-Zealand, Castor, Chinchilla and Laghmere). We generated 3.4 billion sequencing reads and approximately 29–140 million of reads per DNA sample. The expected exome coverage per sample ranged between 118 and 566X. The present dataset could be useful for the scientific community working on rabbit species in order to (i) improve the annotation of the rabbit reference genome Oryctolagus cuniculus (OryCun2.0), (ii) enrich the characterization of polymorphisms segregating in rabbits and (iii) evaluate the genetic biodiversity in different rabbit strains. Raw sequences were deposited in the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory- European Bioinformatics Institute (EMBL-EBI) data portal under bioproject accession number PRJEB37917.
Collapse
|
4
|
Jasonowicz AJ, Simeon A, Zahm M, Cabau C, Klopp C, Roques C, Iampietro C, Lluch J, Donnadieu C, Parrinello H, Drinan DP, Hauser L, Guiguen Y, Planas JV. Generation of a chromosome‐level genome assembly for Pacific halibut (
Hippoglossus stenolepis
) and characterization of its sex‐determining genomic region. Mol Ecol Resour 2022; 22:2685-2700. [PMID: 35569134 PMCID: PMC9541706 DOI: 10.1111/1755-0998.13641] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 04/22/2022] [Accepted: 05/11/2022] [Indexed: 12/01/2022]
Abstract
The Pacific halibut (Hippoglossus stenolepis) is a key species in the North Pacific Ocean and Bering Sea ecosystems, where it also supports important fisheries. However, the lack of genomic resources limits our understanding of evolutionary, environmental and anthropogenic forces affecting key life history characteristics of Pacific halibut and prevents the application of genomic tools in fisheries management and conservation efforts. In the present study, we report on the first generation of a high‐quality chromosome‐level assembly of the Pacific halibut genome, with an estimated size of 602 Mb, 24 chromosome‐length scaffolds that contain 99.8% of the assembly and a N50 scaffold length of 27.3 Mb. In the first application of this important resource, we conducted genome‐wide analyses of sex‐specific genetic variation by pool sequencing and characterized a potential sex‐determining region in chromosome 9 with a high density of female‐specific SNPs. Within this region, we identified the bmpr1ba gene as a potential candidate for master sex‐determining (MSD) gene. bmpr1ba is a member of the TGF‐β family that in teleosts has provided the largest number of MSD genes, including a paralogue of this gene in Atlantic herring. The genome assembly constitutes an essential resource for future studies on Pacific halibut population structure and dynamics, evolutionary history and responses to environmental and anthropogenic influences. Furthermore, the genomic location of the sex‐determining region in Pacific halibut has been identified and a putative candidate MSD gene has been proposed, providing further support for the rapid evolution of sex‐determining mechanisms in teleost fish.
Collapse
Affiliation(s)
| | - Anna Simeon
- International Pacific Halibut Commission Seattle, WA 98199 USA
- Present address: School of Aquatic and Fishery Science University of Washington Seattle WA
| | - Margot Zahm
- SIGENAE, Bioinfo Genotoul, UMIAT, INRAE Castanet‐Tolosan France
| | - Cédric Cabau
- SIGENAE, GenPhySE Université de Toulouse INRAE, ENVT, 31326 Castanet‐Tolosan France
| | | | - Céline Roques
- INRAE, GeT‐PlaGe, Genotoul, 31326 Castanet‐Tolosan France
| | | | - Jérôme Lluch
- INRAE, GeT‐PlaGe, Genotoul, 31326 Castanet‐Tolosan France
| | | | - Hugues Parrinello
- MGX‐Montpellier GenomiX, Univ. Montpellier, CNRS, INSERM Montpellier France
| | - Daniel P. Drinan
- School of Aquatic and Fishery Science University of Washington Seattle, WA 98105 USA
| | - Lorenz Hauser
- School of Aquatic and Fishery Science University of Washington Seattle, WA 98105 USA
| | | | - Josep V. Planas
- International Pacific Halibut Commission Seattle, WA 98199 USA
| |
Collapse
|
5
|
Wen M, Pan Q, Jouanno E, Montfort J, Zahm M, Cabau C, Klopp C, Iampietro C, Roques C, Bouchez O, Castinel A, Donnadieu C, Parrinello H, Poncet C, Belmonte E, Gautier V, Avarre J, Dugue R, Gustiano R, Hà TTT, Campet M, Sriphairoj K, Ribolli J, de Almeida FL, Desvignes T, Postlethwait JH, Floi Bucao C, Robinson‐Rechavi M, Bobe J, Herpin A, Guiguen Y. An ancient truncated duplication of the anti‐Mullerian hormone receptor type 2 gene is a potential conserved master sex determinant in the Pangasiidae catfish family. Mol Ecol Resour 2022; 22:2411-2428. [PMID: 35429227 PMCID: PMC9555307 DOI: 10.1111/1755-0998.13620] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/29/2022] [Accepted: 04/11/2022] [Indexed: 11/30/2022]
Abstract
The evolution of sex determination (SD) in teleosts is amazingly dynamic, as reflected by the variety of different master sex-determining genes identified. Pangasiids are economically important catfishes in South Asian countries, but little is known about their SD system. Here, we generated novel genomic resources for 12 Pangasiids and characterized their SD system. Based on a Pangasianodon hypophthalmus chromosome-scale genome assembly, we identified an anti-Müllerian hormone receptor type Ⅱ gene (amhr2) duplication, which was further characterized as being sex-linked in males and expressed only in testes. These results point to a Y chromosome male-specific duplication (amhr2by) of the autosomal amhr2a. Sequence annotation revealed that the P. hypophthalmus Amhr2by is truncated in its N-terminal domain, lacking the cysteine-rich extracellular part of the receptor that is crucial for ligand binding, suggesting a potential route for its neofunctionalization. Reference-guided assembly of 11 additional Pangasiids, along with sex-linkage studies, revealed that this truncated amhr2by duplication is a male-specific conserved gene in Pangasiids. Reconstructions of the amhr2 phylogeny suggested that amhr2by arose from an ancient duplication/insertion event at the root of the Siluroidei radiation that is dated to ~100 million years ago. Together these results bring multiple lines of evidence supporting that amhr2by is an ancient and conserved master sex-determining gene in Pangasiids, a finding that highlights the recurrent use of the transforming growth factor β pathway, which is often used for the recruitment of teleost master SD genes, and provides another empirical case towards firther understanding of dynamics of SD systems.
Collapse
Affiliation(s)
- Ming Wen
- State Key Laboratory of Developmental Biology of Freshwater Fish College of Life Science Hunan Normal University Changsha China
- INRAE LPGP 35000 Rennes France
| | - Qiaowei Pan
- INRAE LPGP 35000 Rennes France
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
| | | | | | - Margot Zahm
- Plate‐forme bio‐informatique Genotoul Mathématiques et Informatique Appliquées de Toulouse INRAE Castanet Tolosan France
| | - Cédric Cabau
- SIGENAE, GenPhySE Université de Toulouse INRAE ENVT Castanet Tolosan France
| | - Christophe Klopp
- Plate‐forme bio‐informatique Genotoul Mathématiques et Informatique Appliquées de Toulouse INRAE Castanet Tolosan France
- SIGENAE, GenPhySE Université de Toulouse INRAE ENVT Castanet Tolosan France
| | | | - Céline Roques
- INRAE, US 1426, GeT‐PlaGe Genotoul, Castanet‐Tolosan France
| | | | | | | | - Hugues Parrinello
- Montpellier GenomiX (MGX), c/o Institut de Génomique Fonctionnelle 141 rue de la Cardonille 34094 Montpellier Cedex France
| | - Charles Poncet
- GDEC Gentyane INRAE Université Clermont Auvergne Clermont‐Ferrand France
| | - Elodie Belmonte
- GDEC Gentyane INRAE Université Clermont Auvergne Clermont‐Ferrand France
| | - Véronique Gautier
- GDEC Gentyane INRAE Université Clermont Auvergne Clermont‐Ferrand France
| | | | - Remi Dugue
- ISEM Univ Montpellier CNRS IRD Montpellier France
| | - Rudhy Gustiano
- Research Institute of Freshwater Fisheries (CRIFI‐RIFF) Instalasi Penelitian Perikanan Air Tawar Jalan Ragunan‐Pasar Minggu P.O. Box 7220/jkspm Jakarta 12540 Indonesia
| | - Trần Thị Thúy Hà
- Research Institute for Aquaculture No.1. Dinh Bang Tu Son, Bac Ninh Viet Nam
| | | | - Kednapat Sriphairoj
- Faculty of Natural Resources and Agro‐Industry Kasetsart University Chalermphrakiat Sakon Nakhon Province Campus Sakon Nakhon Thailand
| | - Josiane Ribolli
- Laboratório de Biologia e Cultivo de Peixes de Água Doce Universidade Federal de Santa Catarina Florianópolis SC Brasil
| | | | - Thomas Desvignes
- Institute of Neuroscience University of Oregon Eugene OR 97403 USA
| | | | - Christabel Floi Bucao
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
- SIB Swiss Institute of Bioinformatics 1015 Lausanne Switzerland
| | - Marc Robinson‐Rechavi
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
- SIB Swiss Institute of Bioinformatics 1015 Lausanne Switzerland
| | | | | | | |
Collapse
|
6
|
Adolfi MC, Du K, Kneitz S, Cabau C, Zahm M, Klopp C, Feron R, Paixão RV, Varela ES, de Almeida FL, de Oliveira MA, Nóbrega RH, Lopez-Roques C, Iampietro C, Lluch J, Kloas W, Wuertz S, Schaefer F, Stöck M, Guiguen Y, Schartl M. A duplicated copy of id2b is an unusual sex-determining candidate gene on the Y chromosome of arapaima (Arapaima gigas). Sci Rep 2021; 11:21544. [PMID: 34732792 PMCID: PMC8566520 DOI: 10.1038/s41598-021-01066-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 10/21/2021] [Indexed: 12/19/2022] Open
Abstract
Arapaima gigas is one of the largest freshwater fish species of high ecological and economic importance. Overfishing and habitat destruction are severe threats to the remaining wild populations. By incorporating a chromosomal Hi-C contact map, we improved the arapaima genome assembly to chromosome-level, revealing an unexpected high degree of chromosome rearrangements during evolution of the bonytongues (Osteoglossiformes). Combining this new assembly with pool-sequencing of male and female genomes, we identified id2bbY, a duplicated copy of the inhibitor of DNA binding 2b (id2b) gene on the Y chromosome as candidate male sex-determining gene. A PCR-test for id2bbY was developed, demonstrating that this gene is a reliable male-specific marker for genotyping. Expression analyses showed that this gene is expressed in juvenile male gonads. Its paralog, id2ba, exhibits a male-biased expression in immature gonads. Transcriptome analyses and protein structure predictions confirm id2bbY as a prime candidate for the master sex-determiner. Acting through the TGFβ signaling pathway, id2bbY from arapaima would provide the first evidence for a link of this family of transcriptional regulators to sex determination. Our study broadens our current understanding about the evolution of sex determination genetic networks and provide a tool for improving arapaima aquaculture for commercial and conservation purposes.
Collapse
Affiliation(s)
- Mateus C Adolfi
- Developmental Biochemistry, Biocenter, University of Wuerzburg, Am Hubland, 97074, Wuerzburg, Germany.
| | - Kang Du
- Developmental Biochemistry, Biocenter, University of Wuerzburg, Am Hubland, 97074, Wuerzburg, Germany
- The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, TX, 78666, USA
| | - Susanne Kneitz
- Biochemistry and Cell Biology, Biocenter, University of Wuerzburg, Am Hubland, 97074, Wuerzburg, Germany
| | - Cédric Cabau
- Sigenae, GenPhySE, INRAE, ENVT, Université de Toulouse, Castanet Tolosan, France
| | - Margot Zahm
- Sigenae, GenPhySE, INRAE, ENVT, Université de Toulouse, Castanet Tolosan, France
| | - Christophe Klopp
- MIAT, INRA, Université de Toulouse, Chemin de Borde Rouge, 31326, Castanet-Tolosan Cedex, France
| | - Romain Feron
- INRAE, LPGP, Rennes, France
- Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
| | | | | | | | - Marcos A de Oliveira
- Reproductive and Molecular Biology Group, Department of Morphology, Institute of Biosciences, UNESP, Botucatu, Brazil
| | - Rafael H Nóbrega
- Reproductive and Molecular Biology Group, Department of Morphology, Institute of Biosciences, UNESP, Botucatu, Brazil
| | | | | | - Jérôme Lluch
- GeT-PlaGe, INRAE, Genotoul, Castanet-Tolosan, France
| | - Werner Kloas
- Leibniz-Institute of Freshwater Ecology and Inland Fisheries, IGB, Müggelseedamm 301 & 310, 12587, Berlin, Germany
| | - Sven Wuertz
- Leibniz-Institute of Freshwater Ecology and Inland Fisheries, IGB, Müggelseedamm 301 & 310, 12587, Berlin, Germany
| | - Fabian Schaefer
- Leibniz-Institute of Freshwater Ecology and Inland Fisheries, IGB, Müggelseedamm 301 & 310, 12587, Berlin, Germany
| | - Matthias Stöck
- Leibniz-Institute of Freshwater Ecology and Inland Fisheries, IGB, Müggelseedamm 301 & 310, 12587, Berlin, Germany
- Amphibian Research Center, Hiroshima University, Higashi-Hiroshima, 739-8526, Japan
| | | | - Manfred Schartl
- Developmental Biochemistry, Biocenter, University of Wuerzburg, Am Hubland, 97074, Wuerzburg, Germany
- The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, TX, 78666, USA
- Comprehensive Cancer Center Mainfranken, University Hospital, 97080, Würzburg, Germany
| |
Collapse
|
7
|
Klopp C, Cabau C, Greif G, Lasalle A, Di Landro S, Vizziano-Cantonnet D. Siberian sturgeon multi-tissue reference transcriptome database. Database (Oxford) 2020; 2020:6006229. [PMID: 33238003 PMCID: PMC7687680 DOI: 10.1093/database/baaa082] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 08/25/2020] [Accepted: 09/01/2020] [Indexed: 11/13/2022]
Abstract
Motivation: Siberian sturgeon is a long lived and late maturing fish farmed for caviar production in 50 countries. Functional genomics enable to find genes of interest for fish farming. In the absence of a reference genome, a reference transcriptome is very useful for sequencing based functional studies. Results: We present here a high-quality transcriptome assembly database built using RNA-seq reads coming from brain, pituitary, gonadal, liver, stomach, kidney, anterior kidney, heart, embryonic and pre-larval tissues. It will facilitate crucial research on topics such as puberty, reproduction, growth, food intake and immunology. This database represents a major contribution to the publicly available sturgeon transcriptome reference datasets. Availability: The database is publicly available at http://siberiansturgeontissuedb.sigenae.org Supplementary information: Supplementary data are available at Database online.
Collapse
Affiliation(s)
- Christophe Klopp
- SIGENAE, Genotoul Bioinfo, MIAT UR875, INRAe, Chemin de Borde-Rouge - Auzeville BP 52627, 31326 CASTANET-TOLOSAN CEDEX, France
| | - Cédric Cabau
- SIGENAE, GenPhySE, Université de Toulouse, INRAe, ENVT, Chemin de Borde-Rouge - Auzeville BP 52627, 31326 CASTANET-TOLOSAN CEDEX, France
| | - Gonzalo Greif
- Laboratorio de Interacción Hospedero-Patógeno/Unidad de Biología Molecular, Instituto Pasteur de Montevideo, Mataojo 2020, Montevideo 11400, Uruguay
| | - André Lasalle
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo 11 400, Uruguay
| | - Santiago Di Landro
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo 11 400, Uruguay
| | - Denise Vizziano-Cantonnet
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo 11 400, Uruguay
| |
Collapse
|
8
|
Parey E, Louis A, Cabau C, Guiguen Y, Roest Crollius H, Berthelot C. Synteny-Guided Resolution of Gene Trees Clarifies the Functional Impact of Whole-Genome Duplications. Mol Biol Evol 2020; 37:3324-3337. [DOI: 10.1093/molbev/msaa149] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Abstract
Whole-genome duplications (WGDs) have major impacts on the evolution of species, as they produce new gene copies contributing substantially to adaptation, isolation, phenotypic robustness, and evolvability. They result in large, complex gene families with recurrent gene losses in descendant species that sequence-based phylogenetic methods fail to reconstruct accurately. As a result, orthologs and paralogs are difficult to identify reliably in WGD-descended species, which hinders the exploration of functional consequences of WGDs. Here, we present Synteny-guided CORrection of Paralogies and Orthologies (SCORPiOs), a novel method to reconstruct gene phylogenies in the context of a known WGD event. WGDs generate large duplicated syntenic regions, which SCORPiOs systematically leverages as a complement to sequence evolution to infer the evolutionary history of genes. We applied SCORPiOs to the 320-My-old WGD at the origin of teleost fish. We find that almost one in four teleost gene phylogenies in the Ensembl database (3,394) are inconsistent with their syntenic contexts. For 70% of these gene families (2,387), we were able to propose an improved phylogenetic tree consistent with both the molecular substitution distances and the local syntenic information. We show that these synteny-guided phylogenies are more congruent with the species tree, with sequence evolution and with expected expression conservation patterns than those produced by state-of-the-art methods. Finally, we show that synteny-guided gene trees emphasize contributions of WGD paralogs to evolutionary innovations in the teleost clade.
Collapse
Affiliation(s)
- Elise Parey
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | - Alexandra Louis
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | - Cédric Cabau
- SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, Castanet Tolosan, France
| | | | - Hugues Roest Crollius
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | - Camille Berthelot
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| |
Collapse
|
9
|
Klopp C, Cabau C, Lasalle A, Di Landro S, Greif G, Vizziano-Cantonnet D. Multi-tissue Siberian sturgeon RNA sequencing data. Data Brief 2020; 31:105820. [PMID: 32596425 PMCID: PMC7306608 DOI: 10.1016/j.dib.2020.105820] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 05/14/2020] [Accepted: 06/02/2020] [Indexed: 10/28/2022] Open
Abstract
Siberian sturgeon, Acipenser baerii, is a commercially valuable fish for flesh and caviar production and a threatened species. We produced transcriptomic data for ten tissues with relevance to puberty, reproduction, early development, growth and food intake. The data includes RNA-Seq read sets of brain, pituitary, anterior-kidney, kidney, stomach, liver, heart, embryonic, pre-larval, and immature gonad sequences. Tissues were collected from sex differentiated fish (17 to 42 months of age, 66 to 85 cm) RNA was extracted and sequenced. Our purpose is to facilitate fundamental studies of sturgeon physiology to wild and aquaculture populations management.
Collapse
Affiliation(s)
- Christophe Klopp
- INRAE, SIGENAE, MIAT UR875, F-31326, Castanet Tolosan, France, 'SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Cédric Cabau
- Laboratorio de Interacción Hospedero-Patógeno/Unidad de Biología Molecular, Instituto Pasteur de Montevideo, Uruguay
| | - André Lasalle
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo, 11400, Uruguay
| | - Santiago Di Landro
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo, 11400, Uruguay
| | - Gonzalo Greif
- Laboratorio de Interacción Hospedero-Patógeno/Unidad de Biología Molecular, Instituto Pasteur de Montevideo, Uruguay
| | - Denise Vizziano-Cantonnet
- Laboratorio de Fisiología de la Reproducción y Ecología de Peces, Instituto de Biología, Facultad de Ciencias, Universidad de la República Oriental del Uruguay, Iguá 4225, Montevideo, 11400, Uruguay
| |
Collapse
|
10
|
Feron R, Zahm M, Cabau C, Klopp C, Roques C, Bouchez O, Eché C, Valière S, Donnadieu C, Haffray P, Bestin A, Morvezen R, Acloque H, Euclide PT, Wen M, Jouano E, Schartl M, Postlethwait JH, Schraidt C, Christie MR, Larson WA, Herpin A, Guiguen Y. Characterization of a Y-specific duplication/insertion of the anti-Mullerian hormone type II receptor gene based on a chromosome-scale genome assembly of yellow perch, Perca flavescens. Mol Ecol Resour 2020; 20:531-543. [PMID: 31903688 PMCID: PMC7050324 DOI: 10.1111/1755-0998.13133] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Revised: 12/20/2019] [Accepted: 12/24/2019] [Indexed: 12/12/2022]
Abstract
Yellow perch, Perca flavescens, is an ecologically and economically important species native to a large portion of the northern United States and southern Canada and is also a promising candidate species for aquaculture. However, no yellow perch reference genome has been available to facilitate improvements in both fisheries and aquaculture management practices. By combining Oxford Nanopore Technologies long-reads, 10X Genomics Illumina short linked reads and a chromosome contact map produced with Hi-C, we generated a high-continuity chromosome-scale yellow perch genome assembly of 877.4 Mb. It contains, in agreement with the known diploid chromosome yellow perch count, 24 chromosome-size scaffolds covering 98.8% of the complete assembly (N50 = 37.4 Mb, L50 = 11). We also provide a first characterization of the yellow perch sex determination locus that contains a male-specific duplicate of the anti-Mullerian hormone type II receptor gene (amhr2by) inserted at the proximal end of the Y chromosome (chromosome 9). Using this sex-specific information, we developed a simple PCR genotyping assay which accurately differentiates XY genetic males (amhr2by+ ) from XX genetic females (amhr2by- ). Our high-quality genome assembly is an important genomic resource for future studies on yellow perch ecology, toxicology, fisheries and aquaculture research. In addition, characterization of the amhr2by gene as a candidate sex-determining gene in yellow perch provides a new example of the recurrent implication of the transforming growth factor beta pathway in fish sex determination, and highlights gene duplication as an important genomic mechanism for the emergence of new master sex determination genes.
Collapse
Affiliation(s)
- Romain Feron
- INRAE, UR 1037 Fish Physiology and Genomics, F-35000 Rennes, France
- Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Margot Zahm
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRAE, Castanet Tolosan, France
| | - Cédric Cabau
- SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, Castanet Tolosan, France
| | - Christophe Klopp
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRAE, Castanet Tolosan, France
- SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, Castanet Tolosan, France
| | - Céline Roques
- INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France
| | - Olivier Bouchez
- INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France
| | - Camille Eché
- INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France
| | - Sophie Valière
- INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France
| | | | - Pierrick Haffray
- SYSAAF, French poultry and aquaculture breeders, 35042, Rennes Cedex, France
| | - Anastasia Bestin
- SYSAAF, French poultry and aquaculture breeders, 35042, Rennes Cedex, France
| | - Romain Morvezen
- SYSAAF, French poultry and aquaculture breeders, 35042, Rennes Cedex, France
| | - Hervé Acloque
- GenPhySE, Université de Toulouse, INRAE, INPT, ENVT, Castanet-Tolosan, France
| | - Peter T. Euclide
- Wisconsin Cooperative Fishery Research Unit, University of Wisconsin-Stevens Point, 800 Reserve St., Stevens Point, WI 54481, USA
| | - Ming Wen
- INRAE, UR 1037 Fish Physiology and Genomics, F-35000 Rennes, France
- State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Elodie Jouano
- INRAE, UR 1037 Fish Physiology and Genomics, F-35000 Rennes, France
| | - Manfred Schartl
- Developmental Biochemistry, Biozentrum, University of Würzburg, Würzburg, Germany and The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, USA
| | | | - Claire Schraidt
- Department of Forestry and Natural Resources, Purdue University; 715 W. State St., West Lafayette, Indiana 47907-2054 USA
| | - Mark R. Christie
- Department of Forestry and Natural Resources, Purdue University; 715 W. State St., West Lafayette, Indiana 47907-2054 USA
- Department of Biological Sciences, Purdue University; 915 W. State St., West Lafayette, Indiana 47907-2054 USA
| | - Wesley A. Larson
- U.S. Geological Survey Wisconsin Cooperative Fishery Research Unit, University of Wisconsin-Stevens Point, 800 Reserve St., Stevens Point, WI 54481, USA
| | - Amaury Herpin
- INRAE, UR 1037 Fish Physiology and Genomics, F-35000 Rennes, France
| | - Yann Guiguen
- INRAE, UR 1037 Fish Physiology and Genomics, F-35000 Rennes, France
| |
Collapse
|
11
|
Foissac S, Djebali S, Munyard K, Vialaneix N, Rau A, Muret K, Esquerré D, Zytnicki M, Derrien T, Bardou P, Blanc F, Cabau C, Crisci E, Dhorne-Pollet S, Drouet F, Faraut T, Gonzalez I, Goubil A, Lacroix-Lamandé S, Laurent F, Marthey S, Marti-Marimon M, Momal-Leisenring R, Mompart F, Quéré P, Robelin D, Cristobal MS, Tosser-Klopp G, Vincent-Naulleau S, Fabre S, der Laan MHPV, Klopp C, Tixier-Boichard M, Acloque H, Lagarrigue S, Giuffra E. Multi-species annotation of transcriptome and chromatin structure in domesticated animals. BMC Biol 2019; 17:108. [PMID: 31884969 PMCID: PMC6936065 DOI: 10.1186/s12915-019-0726-5] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Accepted: 11/19/2019] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Comparative genomics studies are central in identifying the coding and non-coding elements associated with complex traits, and the functional annotation of genomes is a critical step to decipher the genotype-to-phenotype relationships in livestock animals. As part of the Functional Annotation of Animal Genomes (FAANG) action, the FR-AgENCODE project aimed to create reference functional maps of domesticated animals by profiling the landscape of transcription (RNA-seq), chromatin accessibility (ATAC-seq) and conformation (Hi-C) in species representing ruminants (cattle, goat), monogastrics (pig) and birds (chicken), using three target samples related to metabolism (liver) and immunity (CD4+ and CD8+ T cells). RESULTS RNA-seq assays considerably extended the available catalog of annotated transcripts and identified differentially expressed genes with unknown function, including new syntenic lncRNAs. ATAC-seq highlighted an enrichment for transcription factor binding sites in differentially accessible regions of the chromatin. Comparative analyses revealed a core set of conserved regulatory regions across species. Topologically associating domains (TADs) and epigenetic A/B compartments annotated from Hi-C data were consistent with RNA-seq and ATAC-seq data. Multi-species comparisons showed that conserved TAD boundaries had stronger insulation properties than species-specific ones and that the genomic distribution of orthologous genes in A/B compartments was significantly conserved across species. CONCLUSIONS We report the first multi-species and multi-assay genome annotation results obtained by a FAANG project. Beyond the generation of reference annotations and the confirmation of previous findings on model animals, the integrative analysis of data from multiple assays and species sheds a new light on the multi-scale selective pressure shaping genome organization from birds to mammals. Overall, these results emphasize the value of FAANG for research on domesticated animals and reinforces the importance of future meta-analyses of the reference datasets being generated by this community on different species.
Collapse
Affiliation(s)
- Sylvain Foissac
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Sarah Djebali
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Kylie Munyard
- Curtin University, School of Pharmacy & Biomedical Sciences, CHIRI Biosciences, Perth, 24105 Australia
| | - Nathalie Vialaneix
- MIAT, Université de Toulouse, INRA, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Andrea Rau
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | - Kevin Muret
- PEGASE, Agrocampus-Ouest, INRA, Saint-Gilles Cedex, F-35590 France
| | - Diane Esquerré
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
- INRA, US1426, GeT-PlaGe, Genotoul, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Matthias Zytnicki
- MIAT, Université de Toulouse, INRA, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - Philippe Bardou
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Fany Blanc
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | - Cédric Cabau
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Elisa Crisci
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC 27607 USA
| | - Sophie Dhorne-Pollet
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | | | - Thomas Faraut
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Ignacio Gonzalez
- MIAT, Université de Toulouse, INRA, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Adeline Goubil
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | | | | | - Sylvain Marthey
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | - Maria Marti-Marimon
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - Florence Mompart
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - David Robelin
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Magali San Cristobal
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | - Gwenola Tosser-Klopp
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - Stéphane Fabre
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - Christophe Klopp
- MIAT, Université de Toulouse, INRA, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
| | | | - Hervé Acloque
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Chemin de Borde Rouge, Castanet-Tolosan Cedex, F-31326 France
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| | | | - Elisabetta Giuffra
- GABI, AgroParisTech, INRA, Université Paris Saclay, Jouy-en-Josas, F-78350 France
| |
Collapse
|
12
|
Juanchich A, Hennequet-Antier C, Cabau C, Le Bihan-Duval E, Duclos MJ, Mignon-Grasteau S, Narcy A. Functional genomics of the digestive tract in broilers. BMC Genomics 2018; 19:928. [PMID: 30545300 PMCID: PMC6293548 DOI: 10.1186/s12864-018-5344-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 11/30/2018] [Indexed: 01/09/2023] Open
Abstract
Background The sustainability of poultry farming relies on the development of more efficient and autonomous production systems in terms of feed supply. This implies a better integration of adaptive traits in breeding programs, including digestive efficiency, in order to favor the use of a wider variety of feedstuffs. The aim of the project was to improve the understanding of genes involved in digestive functions by characterizing the transcriptome of different sections of the digestive tract: the junction between the proventriculus and the gizzard, the gizzard, the gastroduodenal junction, and the jejunum. Results Total RNA from the four tissues were sequenced on a HiSeq2500 for six 23-day-old chickens from a second generation (F2) cross between two lines that were divergent for their digestive efficiency (D+/D-). Bioinformatics and biostatistics analyses of the RNA-seq data showed a total of 11,040 differentially expressed transcripts between the four tissues. In total, seven clusters of genes with markedly different expression profiles were identified. Functional analysis on gene groups was performed using “Gene Ontology” and semantic similarity. It showed a significant enrichment of body immune defenses in the jejunum, and an enrichment of transcriptional activity in the gizzard. Moreover, an interesting enrichment for neurohormonal control of muscle contraction was found for the two gizzard’s junctions. Conclusion This analysis allows us to draw the first molecular portrait of the different sections of the digestive tract, which will serve as a basis for future studies on the genetic and physiological control of the response of the animal to feed variations. Electronic supplementary material The online version of this article (10.1186/s12864-018-5344-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Cédric Cabau
- SIGENAE, GenPhySE, Université de Toulouse, INRA, INPT, ENV, Castanet Tolosan, France
| | | | | | | | - Agnès Narcy
- BOA, INRA, Université de Tours, 37380, Nouzilly, France
| |
Collapse
|
13
|
Cabau C, Escudié F, Djari A, Guiguen Y, Bobe J, Klopp C. Compacting and correcting Trinity and Oases RNA-Seq de novo assemblies. PeerJ 2017; 5:e2988. [PMID: 28224052 PMCID: PMC5316280 DOI: 10.7717/peerj.2988] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 01/10/2017] [Indexed: 11/23/2022] Open
Abstract
Background De novo transcriptome assembly of short reads is now a common step in expression analysis of organisms lacking a reference genome sequence. Several software packages are available to perform this task. Even if their results are of good quality it is still possible to improve them in several ways including redundancy reduction or error correction. Trinity and Oases are two commonly used de novo transcriptome assemblers. The contig sets they produce are of good quality. Still, their compaction (number of contigs needed to represent the transcriptome) and their quality (chimera and nucleotide error rates) can be improved. Results We built a de novo RNA-Seq Assembly Pipeline (DRAP) which wraps these two assemblers (Trinity and Oases) in order to improve their results regarding the above-mentioned criteria. DRAP reduces from 1.3 to 15 fold the number of resulting contigs of the assemblies depending on the read set and the assembler used. This article presents seven assembly comparisons showing in some cases drastic improvements when using DRAP. DRAP does not significantly impair assembly quality metrics such are read realignment rate or protein reconstruction counts. Conclusion Transcriptome assembly is a challenging computational task even if good solutions are already available to end-users, these solutions can still be improved while conserving the overall representation and quality of the assembly. The de novo RNA-Seq Assembly Pipeline (DRAP) is an easy to use software package to produce compact and corrected transcript set. DRAP is free, open-source and available under GPL V3 license at http://www.sigenae.org/drap.
Collapse
Affiliation(s)
- Cédric Cabau
- SIGENAE, GenPhySE, Université de Toulouse, INRA, INPT, ENV, Castanet Tolosan, France
| | - Frédéric Escudié
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet Tolosan, France
| | - Anis Djari
- Laboratoire Génomique et Biotechnologie du Fruit, UMR990 INRA/INP-ENSAT, Auzeville, France
| | - Yann Guiguen
- UR1037 Fish Physiology and Genomics, INRA, Rennes, France
| | - Julien Bobe
- UR1037 Fish Physiology and Genomics, INRA, Rennes, France
| | - Christophe Klopp
- SIGENAE, GenPhySE, Université de Toulouse, INRA, INPT, ENV, Castanet Tolosan, France.,Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet Tolosan, France
| |
Collapse
|
14
|
Pasquier J, Cabau C, Nguyen T, Jouanno E, Severac D, Braasch I, Journot L, Pontarotti P, Klopp C, Postlethwait JH, Guiguen Y, Bobe J. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database. BMC Genomics 2016; 17:368. [PMID: 27189481 PMCID: PMC4870732 DOI: 10.1186/s12864-016-2709-z] [Citation(s) in RCA: 194] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2015] [Accepted: 05/05/2016] [Indexed: 12/25/2022] Open
Abstract
With more than 30,000 species, ray-finned fish represent approximately half of vertebrates. The evolution of ray-finned fish was impacted by several whole genome duplication (WGD) events including a teleost-specific WGD event (TGD) that occurred at the root of the teleost lineage about 350 million years ago (Mya) and more recent WGD events in salmonids, carps, suckers and others. In plants and animals, WGD events are associated with adaptive radiations and evolutionary innovations. WGD-spurred innovation may be especially relevant in the case of teleost fish, which colonized a wide diversity of habitats on earth, including many extreme environments. Fish biodiversity, the use of fish models for human medicine and ecological studies, and the importance of fish in human nutrition, fuel an important need for the characterization of gene expression repertoires and corresponding evolutionary histories of ray-finned fish genes. To this aim, we performed transcriptome analyses and developed the PhyloFish database to provide (i) de novo assembled gene repertoires in 23 different ray-finned fish species including two holosteans (i.e. a group that diverged from teleosts before TGD) and 21 teleosts (including six salmonids), and (ii) gene expression levels in ten different tissues and organs (and embryos for many) in the same species. This resource was generated using a common deep RNA sequencing protocol to obtain the most exhaustive gene repertoire possible in each species that allows between-species comparisons to study the evolution of gene expression in different lineages. The PhyloFish database described here can be accessed and searched using RNAbrowse, a simple and efficient solution to give access to RNA-seq de novo assembled transcripts.
Collapse
Affiliation(s)
- Jeremy Pasquier
- INRA, Laboratoire de Physiologie et Génomique des poissons, Campus de Beaulieu, F-35042, Rennes cedex, France
| | - Cédric Cabau
- INRA, SIGENAE, GenPhySE, F-31326, Castanet-Tolosan, France
| | - Thaovi Nguyen
- INRA, Laboratoire de Physiologie et Génomique des poissons, Campus de Beaulieu, F-35042, Rennes cedex, France
| | - Elodie Jouanno
- INRA, Laboratoire de Physiologie et Génomique des poissons, Campus de Beaulieu, F-35042, Rennes cedex, France
| | - Dany Severac
- CNRS, MGX-Montpellier GenomiX, Montpellier, France
| | - Ingo Braasch
- Institute of Neuroscience, University of Oregon, Eugene, 97403-1254, OR, USA.,Department of Integrative Biology, Michigan State University, East Lansing, 48824, MI, USA
| | | | - Pierre Pontarotti
- Aix-Marseille Université, CNRS, Centrale Marseille, I2M, UMR7373, FR 4213 - FR, Eccorev 3098, équipe EBM, 13331, Marseille, France
| | | | - John H Postlethwait
- Institute of Neuroscience, University of Oregon, Eugene, 97403-1254, OR, USA
| | - Yann Guiguen
- INRA, Laboratoire de Physiologie et Génomique des poissons, Campus de Beaulieu, F-35042, Rennes cedex, France
| | - Julien Bobe
- INRA, Laboratoire de Physiologie et Génomique des poissons, Campus de Beaulieu, F-35042, Rennes cedex, France.
| |
Collapse
|
15
|
Tosser-Klopp G, Bardou P, Bouchez O, Cabau C, Crooijmans R, Dong Y, Donnadieu-Tonon C, Eggen A, Heuven HCM, Jamli S, Jiken AJ, Klopp C, Lawley CT, McEwan J, Martin P, Moreno CR, Mulsant P, Nabihoudine I, Pailhoux E, Palhière I, Rupp R, Sarry J, Sayre BL, Tircazes A, Wang J, Wang W, Zhang W. Correction: Design and Characterization of a 52K SNP Chip for Goats. PLoS One 2016; 11:e0152632. [PMID: 27011020 PMCID: PMC4807118 DOI: 10.1371/journal.pone.0152632] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
16
|
Smedley D, Haider S, Durinck S, Pandini L, Provero P, Allen J, Arnaiz O, Awedh MH, Baldock R, Barbiera G, Bardou P, Beck T, Blake A, Bonierbale M, Brookes AJ, Bucci G, Buetti I, Burge S, Cabau C, Carlson JW, Chelala C, Chrysostomou C, Cittaro D, Collin O, Cordova R, Cutts RJ, Dassi E, Di Genova A, Djari A, Esposito A, Estrella H, Eyras E, Fernandez-Banet J, Forbes S, Free RC, Fujisawa T, Gadaleta E, Garcia-Manteiga JM, Goodstein D, Gray K, Guerra-Assunção JA, Haggarty B, Han DJ, Han BW, Harris T, Harshbarger J, Hastings RK, Hayes RD, Hoede C, Hu S, Hu ZL, Hutchins L, Kan Z, Kawaji H, Keliet A, Kerhornou A, Kim S, Kinsella R, Klopp C, Kong L, Lawson D, Lazarevic D, Lee JH, Letellier T, Li CY, Lio P, Liu CJ, Luo J, Maass A, Mariette J, Maurel T, Merella S, Mohamed AM, Moreews F, Nabihoudine I, Ndegwa N, Noirot C, Perez-Llamas C, Primig M, Quattrone A, Quesneville H, Rambaldi D, Reecy J, Riba M, Rosanoff S, Saddiq AA, Salas E, Sallou O, Shepherd R, Simon R, Sperling L, Spooner W, Staines DM, Steinbach D, Stone K, Stupka E, Teague JW, Dayem Ullah AZ, Wang J, Ware D, Wong-Erasmus M, Youens-Clark K, Zadissa A, Zhang SJ, Kasprzyk A. The BioMart community portal: an innovative alternative to large, centralized data repositories. Nucleic Acids Res 2015; 43:W589-98. [PMID: 25897122 PMCID: PMC4489294 DOI: 10.1093/nar/gkv350] [Citation(s) in RCA: 491] [Impact Index Per Article: 54.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 04/02/2015] [Indexed: 01/17/2023] Open
Abstract
The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
Collapse
Affiliation(s)
- Damian Smedley
- Wellcome Trust Sanger Institute, Welcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Syed Haider
- The Weatherall Institute Of Molecular Medicine, University of Oxford, Oxford, OX3 9DS, UK
| | - Steffen Durinck
- Genentech, Inc. 1 DNA Way South San Francisco, CA 94080, USA
| | - Luca Pandini
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Paolo Provero
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy Dept of Molecular Biotechnology and Health Sciences University of Turin, Italy
| | - James Allen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Olivier Arnaiz
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris Sud, 1 avenue de la terrasse, 91198 Gif sur Yvette, France
| | - Mohammad Hamza Awedh
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Richard Baldock
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Giulia Barbiera
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | | | - Tim Beck
- Department of Genetics, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | - Andrew Blake
- MRC Harwell, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK
| | | | - Anthony J Brookes
- Department of Genetics, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | - Gabriele Bucci
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Iwan Buetti
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Sarah Burge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | | | | | - Claude Chelala
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | | | - Davide Cittaro
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | | | - Raul Cordova
- International Potato Center (CIP), Lima, 1558, Peru
| | - Rosalind J Cutts
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Erik Dassi
- Laboratory of Translational Genomics, Centre for Integrative Biology, University of Trento, Trento, Italy
| | - Alex Di Genova
- Center for Mathematical Modeling and Center for Genome Regulation, University of Chile, Beauchef 851, 7th floor, Chile
| | - Anis Djari
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | | | | | - Eduardo Eyras
- Catalan Institute for Research and Advanced Studies (ICREA), Passeig Lluis Companys 23, E-08010 Barcelona, Spain Universitat Pompeu Fabra, Dr Aiguader 88 E-08003 Barcelona, Spain
| | | | - Simon Forbes
- Wellcome Trust Sanger Institute, Welcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Robert C Free
- Department of Genetics, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | | | - Emanuela Gadaleta
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Jose M Garcia-Manteiga
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - David Goodstein
- Department of Energy, Joint Genome Institute, Walnut Creek, USA
| | - Kristian Gray
- HUGO Gene Nomenclature Committee (HGNC), European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - José Afonso Guerra-Assunção
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Bernard Haggarty
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Dong-Jin Han
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul 151-742, Republic of Korea Department of Molecular Medicine and Biopharmaceutical Sciences, Seoul National University, Seoul 151-742, Republic of Korea
| | - Byung Woo Han
- Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University, Seoul 151-742, Republic of Korea Information Center for Bio-pharmacological Network, Seoul National University, Suwon 443-270, Republic of Korea
| | - Todd Harris
- Ontario Institute for Cancer Research, Toronto, M5G 0A3, Canada
| | - Jayson Harshbarger
- RIKEN Center for Life Science Technologies (CLST), Division of Genomic Technologies (DGT), Kanagawa, 230-0045, Japan
| | - Robert K Hastings
- Department of Genetics, University of Leicester, University Road, Leicester, LE1 7RH, UK
| | - Richard D Hayes
- Department of Energy, Joint Genome Institute, Walnut Creek, USA
| | - Claire Hoede
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | - Shen Hu
- School of Dentistry and Dental Research Institute, University of California Los Angeles (UCLA), Los Angeles, CA 90095-1668, USA
| | | | - Lucie Hutchins
- Mouse Genomic Informatics Group, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Zhengyan Kan
- Oncology Computational Biology, Pfizer, La Jolla, USA
| | - Hideya Kawaji
- RIKEN Center for Life Science Technologies (CLST), Division of Genomic Technologies (DGT), Kanagawa, 230-0045, Japan RIKEN Preventive Medicine and Diagnosis Innovation Program, Saitama 351-0198, Japan
| | - Aminah Keliet
- INRA URGI Centre de Versailles, bâtiment 18 Route de Saint Cyr 78026 Versailles, France
| | - Arnaud Kerhornou
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sunghoon Kim
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul 151-742, Republic of Korea Department of Molecular Medicine and Biopharmaceutical Sciences, Seoul National University, Seoul 151-742, Republic of Korea
| | - Rhoda Kinsella
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Christophe Klopp
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | - Lei Kong
- Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, 100871, P.R. China
| | - Daniel Lawson
- VectorBase, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Dejan Lazarevic
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Ji-Hyun Lee
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul 151-742, Republic of Korea Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University, Seoul 151-742, Republic of Korea Information Center for Bio-pharmacological Network, Seoul National University, Suwon 443-270, Republic of Korea
| | - Thomas Letellier
- INRA URGI Centre de Versailles, bâtiment 18 Route de Saint Cyr 78026 Versailles, France
| | - Chuan-Yun Li
- Institute of Molecular Medicine, Peking University, Beijing, China
| | - Pietro Lio
- Computer Laboratory, University of Cambridge, Cambridge, CB3 0FD, UK
| | - Chu-Jun Liu
- Institute of Molecular Medicine, Peking University, Beijing, China
| | - Jie Luo
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Alejandro Maass
- Center for Mathematical Modeling and Center for Genome Regulation, University of Chile, Beauchef 851, 7th floor, Chile Department of Mathematical Engineering, University of Chile, Av. Beauchef 851, 5th floor, Santiago, Chile
| | - Jerome Mariette
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | - Thomas Maurel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stefania Merella
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Azza Mostafa Mohamed
- Departament of Biochemistry, Faculty of Science for Girls, King Abdulaziz University, Jeddah, Saudi Arabia
| | | | - Ibounyamine Nabihoudine
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | - Nelson Ndegwa
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, PO Box 281, 17177 Stockholm, Sweden
| | - Céline Noirot
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRA, Castanet-Tolosan, France
| | | | - Michael Primig
- Inserm U1085 IRSET, University of Rennes 1, 35042 Rennes, France
| | - Alessandro Quattrone
- Laboratory of Translational Genomics, Centre for Integrative Biology, University of Trento, Trento, Italy
| | - Hadi Quesneville
- INRA URGI Centre de Versailles, bâtiment 18 Route de Saint Cyr 78026 Versailles, France
| | - Davide Rambaldi
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | | | - Michela Riba
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Steven Rosanoff
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Amna Ali Saddiq
- Department of Biological Sciences, Faculty of Science for Girls, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Elisa Salas
- International Potato Center (CIP), Lima, 1558, Peru
| | | | - Rebecca Shepherd
- Wellcome Trust Sanger Institute, Welcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | | | - Linda Sperling
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris Sud, 1 avenue de la terrasse, 91198 Gif sur Yvette, France
| | - William Spooner
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA Eagle Genomics Ltd., Babraham Research Campus, Cambridge, CB22 3AT, UK
| | - Daniel M Staines
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Delphine Steinbach
- INRA URGI Centre de Versailles, bâtiment 18 Route de Saint Cyr 78026 Versailles, France
| | - Kevin Stone
- Mouse Genomic Informatics Group, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | - Elia Stupka
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy
| | - Jon W Teague
- Wellcome Trust Sanger Institute, Welcome Trust Genome Campus, Hinxton, CB10 1SD, UK
| | - Abu Z Dayem Ullah
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK
| | - Jun Wang
- Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, 100871, P.R. China
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Marie Wong-Erasmus
- Human Longevity, Inc. 10835 Road to the Cure 140 San Diego, CA 92121, USA
| | - Ken Youens-Clark
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Amonida Zadissa
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Shi-Jian Zhang
- Institute of Molecular Medicine, Peking University, Beijing, China
| | - Arek Kasprzyk
- Center for Translational Genomics and Bioinformatics San Raffaele Scientific Institute, Via Olgettina 58, 20132 Milan, Italy Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
17
|
Elzaiat M, Jouneau L, Thépot D, Klopp C, Allais-Bonnet A, Cabau C, André M, Chaffaux S, Cribiu EP, Pailhoux E, Pannetier M. High-throughput sequencing analyses of XX genital ridges lacking FOXL2 reveal DMRT1 up-regulation before SOX9 expression during the sex-reversal process in goats. Biol Reprod 2014; 91:153. [PMID: 25395674 DOI: 10.1095/biolreprod.114.122796] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022] Open
Abstract
FOXL2 loss of function in goats leads to the early transdifferentiation of ovaries into testes, then to the full sex reversal of XX homozygous mutants. By contrast, Foxl2 loss of function in mice induces an arrest of follicle formation after birth, followed by complete female sterility. In order to understand the molecular role of FOXL2 during ovarian differentiation in the goat species, putative FOXL2 target genes were determined at the earliest stage of gonadal sex-specific differentiation by comparing the mRNA profiles of XX gonads expressing the FOXL2 protein or not. Of these 163 deregulated genes, around two-thirds corresponded to testicular genes that were up-regulated when FOXL2 was absent, and only 19 represented female-associated genes, down-regulated in the absence of FOXL2. FOXL2 should therefore be viewed as an antitestis gene rather than as a female-promoting gene. In particular, the key testis-determining gene DMRT1 was found to be up-regulated ahead of SOX9, thus suggesting in goats that SOX9 primary up-regulation may require DMRT1. Overall, our results equated to FOXL2 being an antitestis gene, allowing us to propose an alternative model for the sex-determination process in goats that differs slightly from that demonstrated in mice.
Collapse
Affiliation(s)
- Maëva Elzaiat
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| | - Luc Jouneau
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| | - Dominique Thépot
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| | | | | | - Cédric Cabau
- INRA, Sigenae GenPhySE (Génétique, Physiologie et Systèmes d'Elevage), Castanet-Tolosan, France
| | - Marjolaine André
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| | - Stéphane Chaffaux
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, Jouy-en-Josas, France
| | - Edmond-Paul Cribiu
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, Jouy-en-Josas, France
| | - Eric Pailhoux
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| | - Maëlle Pannetier
- INRA, UMR 1198, Biologie du Développement et Reproduction, Jouy-en-Josas, France
| |
Collapse
|
18
|
Mariette J, Noirot C, Nabihoudine I, Bardou P, Hoede C, Djari A, Cabau C, Klopp C. RNAbrowse: RNA-Seq de novo assembly results browser. PLoS One 2014; 9:e96821. [PMID: 24823498 PMCID: PMC4019526 DOI: 10.1371/journal.pone.0096821] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Accepted: 04/11/2014] [Indexed: 11/18/2022] Open
Abstract
Transcriptome analysis based on a de novo assembly of next generation RNA sequences is now performed routinely in many laboratories. The generated results, including contig sequences, quantification figures, functional annotations and variation discovery outputs are usually bulky and quite diverse. This article presents a user oriented storage and visualisation environment permitting to explore the data in a top-down manner, going from general graphical views to all possible details. The software package is based on biomart, easy to install and populate with local data. The software package is available under the GNU General Public License (GPL) at http://bioinfo.genotoul.fr/RNAbrowse.
Collapse
Affiliation(s)
- Jérôme Mariette
- Plate-forme bio-informatique Genotoul/Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
- * E-mail:
| | - Céline Noirot
- Plate-forme bio-informatique Genotoul/Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
| | - Ibounyamine Nabihoudine
- Plate-forme bio-informatique Genotoul/Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
| | - Philippe Bardou
- Plate-forme SIGENAE/Génétique Cellulaire, INRA, Castanet-Tolosan, France
| | - Claire Hoede
- Plate-forme bio-informatique Genotoul/Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
| | - Anis Djari
- Plate-forme SIGENAE/Génétique Cellulaire, INRA, Castanet-Tolosan, France
| | - Cédric Cabau
- Plate-forme SIGENAE/Génétique Cellulaire, INRA, Castanet-Tolosan, France
| | - Christophe Klopp
- Plate-forme bio-informatique Genotoul/Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
- Plate-forme SIGENAE/Génétique Cellulaire, INRA, Castanet-Tolosan, France
| |
Collapse
|
19
|
Osteil P, Tapponnier Y, Markossian S, Godet M, Schmaltz-Panneau B, Jouneau L, Cabau C, Joly T, Blachère T, Gócza E, Bernat A, Yerle M, Acloque H, Hidot S, Bosze Z, Duranthon V, Savatier P, Afanassieff M. Induced pluripotent stem cells derived from rabbits exhibit some characteristics of naïve pluripotency. Biol Open 2013; 2:613-28. [PMID: 23789112 PMCID: PMC3683164 DOI: 10.1242/bio.20134242] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2013] [Accepted: 04/02/2013] [Indexed: 12/12/2022] Open
Abstract
Not much is known about the molecular and functional features of pluripotent stem cells (PSCs) in rabbits. To address this, we derived and characterized 2 types of rabbit PSCs from the same breed of New Zealand White rabbits: 4 lines of embryonic stem cells (rbESCs), and 3 lines of induced PSCs (rbiPSCs) that were obtained by reprogramming adult skin fibroblasts. All cell lines required fibroblast growth factor 2 for their growth and proliferation. All rbESC lines showed molecular and functional properties typically associated with primed pluripotency. The cell cycle of rbESCs had a prolonged G1 phase and a DNA damage checkpoint before entry into the S phase, which are the 2 features typically associated with the somatic cell cycle. In contrast, the rbiPSC lines exhibited some characteristics of naïve pluripotency, including resistance to single-cell dissociation by trypsin, robust activity of the distal enhancer of the mouse Oct4 gene, and expression of naïve pluripotency-specific genes, as defined in rodents. According to gene expression profiles, rbiPSCs were closer to the rabbit inner cell mass (ICM) than rbESCs. Furthermore, rbiPSCs were capable of colonizing the ICM after aggregation with morulas. Therefore, we propose that rbiPSCs self-renew in an intermediate state between naïve and primed pluripotency, which represents a key step toward the generation of bona fide naïve PSC lines in rabbits.
Collapse
Affiliation(s)
- Pierre Osteil
- INSERM, U846, Stem Cell and Brain Institute , 18 Avenue du Doyen Jean Lépine, F-69500 Bron , France ; Stem Cell and Brain Institute , F-69500 Bron , France ; Université de Lyon , F-69100 Villeurbanne , France ; INRA, USC1361, F-69500 Bron , France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Osteil P, Tapponnier Y, Markossian S, Godet M, Schmaltz-Panneau B, Jouneau L, Cabau C, Joly T, Blachère T, Gócza E, Bernat A, Yerle M, Acloque H, Hidot S, Bosze Z, Duranthon V, Savatier P, Afanassieff M. Induced pluripotent stem cells derived from rabbits exhibit some characteristics of naïve pluripotency. Biol Open 2013. [PMID: 23789112 DOI: 10.1242/bio.20134242.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2022] Open
Abstract
Not much is known about the molecular and functional features of pluripotent stem cells (PSCs) in rabbits. To address this, we derived and characterized 2 types of rabbit PSCs from the same breed of New Zealand White rabbits: 4 lines of embryonic stem cells (rbESCs), and 3 lines of induced PSCs (rbiPSCs) that were obtained by reprogramming adult skin fibroblasts. All cell lines required fibroblast growth factor 2 for their growth and proliferation. All rbESC lines showed molecular and functional properties typically associated with primed pluripotency. The cell cycle of rbESCs had a prolonged G1 phase and a DNA damage checkpoint before entry into the S phase, which are the 2 features typically associated with the somatic cell cycle. In contrast, the rbiPSC lines exhibited some characteristics of naïve pluripotency, including resistance to single-cell dissociation by trypsin, robust activity of the distal enhancer of the mouse Oct4 gene, and expression of naïve pluripotency-specific genes, as defined in rodents. According to gene expression profiles, rbiPSCs were closer to the rabbit inner cell mass (ICM) than rbESCs. Furthermore, rbiPSCs were capable of colonizing the ICM after aggregation with morulas. Therefore, we propose that rbiPSCs self-renew in an intermediate state between naïve and primed pluripotency, which represents a key step toward the generation of bona fide naïve PSC lines in rabbits.
Collapse
Affiliation(s)
- Pierre Osteil
- INSERM, U846, Stem Cell and Brain Institute , 18 Avenue du Doyen Jean Lépine, F-69500 Bron , France ; Stem Cell and Brain Institute , F-69500 Bron , France ; Université de Lyon , F-69100 Villeurbanne , France ; INRA, USC1361, F-69500 Bron , France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Frésard L, Leroux S, Dehais P, Servin B, Gilbert H, Bouchez O, Klopp C, Cabau C, Vignoles F, Feve K, Ricros A, Gourichon D, Diot C, Richard S, Leterrier C, Beaumont C, Vignal A, Minvielle F, Pitel F. Fine mapping of complex traits in non-model species: using next generation sequencing and advanced intercross lines in Japanese quail. BMC Genomics 2012; 13:551. [PMID: 23066875 PMCID: PMC3534603 DOI: 10.1186/1471-2164-13-551] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2012] [Accepted: 10/08/2012] [Indexed: 11/16/2022] Open
Abstract
Background As for other non-model species, genetic analyses in quail will benefit greatly from a higher marker density, now attainable thanks to the evolution of sequencing and genotyping technologies. Our objective was to obtain the first genome wide panel of Japanese quail SNP (Single Nucleotide Polymorphism) and to use it for the fine mapping of a QTL for a fear-related behaviour, namely tonic immobility, previously localized on Coturnix japonica chromosome 1. To this aim, two reduced representations of the genome were analysed through high-throughput 454 sequencing: AFLP (Amplified Fragment Length Polymorphism) fragments as representatives of genomic DNA, and EST (Expressed Sequence Tag) as representatives of the transcriptome. Results The sequencing runs produced 399,189 and 1,106,762 sequence reads from cDNA and genomic fragments, respectively. They covered over 434 Mb of sequence in total and allowed us to detect 17,433 putative SNP. Among them, 384 were used to genotype two Advanced Intercross Lines (AIL) obtained from three quail lines differing for duration of tonic immobility. Despite the absence of genotyping for founder individuals in the analysis, the previously identified candidate region on chromosome 1 was refined and led to the identification of a candidate gene. Conclusions These data confirm the efficiency of transcript and AFLP-sequencing for SNP discovery in a non-model species, and its application to the fine mapping of a complex trait. Our results reveal a significant association of duration of tonic immobility with a genomic region comprising the DMD (dystrophin) gene. Further characterization of this candidate gene is needed to decipher its putative role in tonic immobility in Coturnix.
Collapse
Affiliation(s)
- Laure Frésard
- INRA, UMR444 Laboratoire de Génétique Cellulaire, Castanet-Tolosan, F-31326, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Bourin M, Gautron J, Berges M, Hennequet-Antier C, Cabau C, Nys Y, Réhault-Godbert S. Transcriptomic profiling of proteases and antiproteases in the liver of sexually mature hens in relation to vitellogenesis. BMC Genomics 2012; 13:457. [PMID: 22950364 PMCID: PMC3495648 DOI: 10.1186/1471-2164-13-457] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 08/22/2012] [Indexed: 11/15/2022] Open
Abstract
Background Most egg yolk precursors are synthesized by the liver, secreted into the blood and transferred into oocytes, to provide nutrients and bioactive molecules for the avian embryo. Three hundred and sixteen distinct proteins have been identified in egg yolk. These include 37 proteases and antiproteases, which are likely to play a role in the formation of the yolk (vitellogenesis), as regulators of protein metabolism. We used a transcriptomic approach to define the protease and antiprotease genes specifically expressed in the hen liver in relation to vitellogenesis by comparing sexually mature and pre-laying chickens showing different steroid milieu. Results Using a 20 K chicken oligoarray, a total of 582 genes were shown to be over-expressed in the liver of sexually mature hens (1.2 to 67 fold-differences). Eight of the top ten over-expressed genes are known components of the egg yolk or perivitelline membrane. This list of 582 genes contains 12 proteases and 3 antiproteases. We found that “uncharacterized protein LOC419301/similar to porin” (GeneID:419301), an antiprotease and “cathepsin E-A-like/similar to nothepsin” (GeneID:417848), a protease, were the only over-expressed candidates (21-fold and 35-fold difference, respectively) that are present in the egg yolk. Additionally, we showed the 4-fold over-expression of “ovochymase-2/similar to oviductin” (GeneID:769290), a vitelline membrane-specific protease. Conclusions Our approach revealed that three proteases and antiproteases are likely to participate in the formation of the yolk. The role of the other 12 proteases and antiproteases which are over-expressed in our model remains unclear. At least 1/3 of proteases and antiproteases identified in egg yolk and vitelline membrane proteomes are expressed similarly in the liver regardless of the maturity of hens, and have been initially identified as regulators of haemostasis and inflammatory events. The lack of effect of sex steroids on these genes expressed in the liver but the products of which are found in the yolk suggests that these may be passively incorporated into the yolk rather than actively produced for that purpose. These results raise the question of the biological significance of egg yolk proteases and antiproteases, and more generally of all minor proteins that have been identified in egg yolk.
Collapse
Affiliation(s)
- Marie Bourin
- INRA, SIGENAE, UR83 Recherches Avicoles, 37380, F-37380 Nouzilly, France
| | | | | | | | | | | | | |
Collapse
|
23
|
Boussaha M, Guyomard R, Cabau C, Esquerré D, Quillet E. Development and characterisation of an expressed sequence tags (EST)-derived single nucleotide polymorphisms (SNPs) resource in rainbow trout. BMC Genomics 2012; 13:238. [PMID: 22694767 PMCID: PMC3536561 DOI: 10.1186/1471-2164-13-238] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2012] [Accepted: 06/13/2012] [Indexed: 12/29/2022] Open
Abstract
Background There is considerable interest in developing high-throughput genotyping with single nucleotide polymorphisms (SNPs) for the identification of genes affecting important ecological or economical traits. SNPs are evenly distributed throughout the genome and are likely to be functionally relevant. In rainbow trout, in silico screening of EST databases represents an attractive approach for de novo SNP identification. Nevertheless, EST sequencing errors and assembly of EST paralogous sequences can lead to the identification of false positive SNPs which renders the reliability of EST-derived SNPs relatively low. Further validation of EST-derived SNPs is therefore required. The objective of this work was to assess the quality of and to validate a large number of rainbow trout EST-derived SNPs. Results A panel of 1,152 EST-derived SNPs was selected from the INRA Sigenae SNP database and was genotyped in standard and double haploid individuals from several populations using the Illumina GoldenGate BeadXpress assay. High-quality genotyping data were obtained for 958 SNPs representing a genotyping success rate of 83.2 %, out of which, 350 SNPs (36.5 %) were polymorphic in at least one population and were designated as true SNPs. They also proved to be a potential tool to investigate genetic diversity of the species, as the set of SNP successfully sorted individuals into three main groups using STRUCTURE software. Functional annotations revealed 28 non-synonymous SNPs, out of which four substitutions were predicted to affect protein functions. A subset of 223 true SNPs were polymorphic in the two INRA mapping reference families and were integrated into the INRA microsatellite-based linkage map. Conclusions Our results represent the first study of EST-derived SNPs validation in rainbow trout, a species whose genome sequences is not yet available. We designed several specific filters in order to improve the genotyping yield. Nevertheless, our selection criteria should be further improved in order to reduce the observed high rate of false positive SNPs which results from the occurrence of whole genome duplications.
Collapse
Affiliation(s)
- Mekki Boussaha
- INRA, UMR 1313 Génétique Animale et Biologie Intégrative, 78350, Jouy-en-Josas, France.
| | | | | | | | | |
Collapse
|
24
|
Genini S, Badaoui B, Sclep G, Bishop SC, Waddington D, Pinard van der Laan MH, Klopp C, Cabau C, Seyfert HM, Petzl W, Jensen K, Glass EJ, de Greeff A, Smith HE, Smits MA, Olsaker I, Boman GM, Pisoni G, Moroni P, Castiglioni B, Cremonesi P, Del Corvo M, Foulon E, Foucras G, Rupp R, Giuffra E. Strengthening insights into host responses to mastitis infection in ruminants by combining heterogeneous microarray data sources. BMC Genomics 2011; 12:225. [PMID: 21569310 PMCID: PMC3118214 DOI: 10.1186/1471-2164-12-225] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2010] [Accepted: 05/11/2011] [Indexed: 12/30/2022] Open
Abstract
Background Gene expression profiling studies of mastitis in ruminants have provided key but fragmented knowledge for the understanding of the disease. A systematic combination of different expression profiling studies via meta-analysis techniques has the potential to test the extensibility of conclusions based on single studies. Using the program Pointillist, we performed meta-analysis of transcription-profiling data from six independent studies of infections with mammary gland pathogens, including samples from cattle challenged in vivo with S. aureus, E. coli, and S. uberis, samples from goats challenged in vivo with S. aureus, as well as cattle macrophages and ovine dendritic cells infected in vitro with S. aureus. We combined different time points from those studies, testing different responses to mastitis infection: overall (common signature), early stage, late stage, and cattle-specific. Results Ingenuity Pathway Analysis of affected genes showed that the four meta-analysis combinations share biological functions and pathways (e.g. protein ubiquitination and polyamine regulation) which are intrinsic to the general disease response. In the overall response, pathways related to immune response and inflammation, as well as biological functions related to lipid metabolism were altered. This latter observation is consistent with the milk fat content depression commonly observed during mastitis infection. Complementarities between early and late stage responses were found, with a prominence of metabolic and stress signals in the early stage and of the immune response related to the lipid metabolism in the late stage; both mechanisms apparently modulated by few genes, including XBP1 and SREBF1. The cattle-specific response was characterized by alteration of the immune response and by modification of lipid metabolism. Comparison of E. coli and S. aureus infections in cattle in vivo revealed that affected genes showing opposite regulation had the same altered biological functions and provided evidence that E. coli caused a stronger host response. Conclusions This meta-analysis approach reinforces previous findings but also reveals several novel themes, including the involvement of genes, biological functions, and pathways that were not identified in individual studies. As such, it provides an interesting proof of principle for future studies combining information from diverse heterogeneous sources.
Collapse
Affiliation(s)
- Sem Genini
- Parco Tecnologico Padano - CERSA, Via Einstein, 26900 Lodi, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Abstract
Evolutionary analyses of biological data are becoming a prerequisite in many fields of biology. At a time of high-throughput data analysis, phylogenetics is often a necessary complementary tool for biologists to understand, compare and identify the functions of sequences. But available bioinformatics tools are frequently not easy for non-specialists to use. We developed PhyleasProg (http://phyleasprog.inra.fr), a user-friendly web server as a turnkey tool dedicated to evolutionary analyses. PhyleasProg can help biologists with little experience in evolutionary methodologies by analysing their data in a simple and robust way, using methods corresponding to robust standards. Via a very intuitive web interface, users only need to enter a list of Ensembl protein IDs and a list of species as inputs. After dynamic computations, users have access to phylogenetic trees, positive/purifying selection data (on site and branch-site models), with a display of these results on the protein sequence and on a 3D structure model, and the synteny environment of related genes. This connection between different domains of phylogenetics opens the way to new biological analyses for the discovery of the function and structure of proteins.
Collapse
Affiliation(s)
- Joël Busset
- INRA, UMR85, Physiologie de la Reproduction et des Comportements, F-37380 Nouzilly, France
| | | | | | | |
Collapse
|
26
|
Jonchère V, Réhault-Godbert S, Hennequet-Antier C, Cabau C, Sibut V, Cogburn LA, Nys Y, Gautron J. Gene expression profiling to identify eggshell proteins involved in physical defense of the chicken egg. BMC Genomics 2010; 11:57. [PMID: 20092629 PMCID: PMC2827412 DOI: 10.1186/1471-2164-11-57] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2009] [Accepted: 01/21/2010] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND As uricoletic animals, chickens produce cleidoic eggs, which are self-contained bacteria-resistant biological packages for extra-uterine development of the chick embryo. The eggshell constitutes a natural physical barrier against bacterial penetration if it forms correctly and remains intact. The eggshell's remarkable mechanical properties are due to interactions among mineral components and the organic matrix proteins. The purpose of our study was to identify novel eggshell proteins by examining the transcriptome of the uterus during calcification of the eggshell. An extensive bioinformatic analysis on genes over-expressed in the uterus allowed us to identify novel eggshell proteins that contribute to the egg's natural defenses. RESULTS Our 14 K Del-Mar Chicken Integrated Systems microarray was used for transcriptional profiling in the hen's uterus during eggshell deposition. A total of 605 transcripts were over-expressed in the uterus compared with the magnum or white isthmus across a wide range of abundance (1.1- to 79.4-fold difference). The 605 highly-expressed uterine transcripts correspond to 469 unique genes, which encode 437 different proteins. Gene Ontology (GO) analysis was used for interpretation of protein function. The most over-represented GO terms are related to genes encoding ion transport proteins, which provide eggshell mineral precursors. Signal peptide sequence was found for 54 putative proteins secreted by the uterus during eggshell formation. Many functional proteins are involved in calcium binding or biomineralization--prerequisites for interacting with the mineral phase during eggshell fabrication. While another large group of proteins could be involved in proper folding of the eggshell matrix. Many secreted uterine proteins possess antibacterial properties, which would protect the egg against microbial invasion. A final group includes proteases and protease inhibitors that regulate protein activity in the acellular uterine fluid where eggshell formation takes place. CONCLUSIONS Our original study provides the first detailed description of the chicken uterus transcriptome during formation of the eggshell. We have discovered a cache of about 600 functional genes and identified a large number of encoded proteins secreted into uterine fluid for fabrication of the eggshell and chemical protection of the egg. Some of these uterine genes could prove useful as biological markers for genetic improvement of phenotypic traits (i.e., egg and eggshell quality).
Collapse
Affiliation(s)
| | | | | | - Cédric Cabau
- INRA, UR83 Recherches Avicoles, F-37380 Nouzilly, France
| | - Vonick Sibut
- INRA, UR83 Recherches Avicoles, F-37380 Nouzilly, France
- Institut Technique Avicole, F-37380 Nouzilly, France
| | - Larry A Cogburn
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19717 USA
| | - Yves Nys
- INRA, UR83 Recherches Avicoles, F-37380 Nouzilly, France
| | - Joel Gautron
- INRA, UR83 Recherches Avicoles, F-37380 Nouzilly, France
| |
Collapse
|
27
|
Léandri RD, Archilla C, Bui LC, Peynot N, Liu Z, Cabau C, Chastellier A, Renard JP, Duranthon V. Revealing the dynamics of gene expression during embryonic genome activation and first differentiation in the rabbit embryo with a dedicated array screening. Physiol Genomics 2008; 36:98-113. [PMID: 19001509 DOI: 10.1152/physiolgenomics.90310.2008] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Early mammalian development is characterized by extensive changes in nuclear functions that result from epigenetic modifications of the newly formed embryonic genome. While the first embryonic cells are totipotent, this status spans only a few cell cycles. At the blastocyst stage, the embryo already contains differentiated trophectoderm cells and pluripotent inner cell mass cells. Concomitantly, the embryonic genome becomes progressively transcriptionally active. During this unique period of development, the gene expression pattern has been mainly characterized in the mouse, in which embryonic genome activation (EGA) spans a single cell cycle after abrupt epigenetic modifications. To further characterize this period, we chose to analyze it in the rabbit, in which, as in most mammals, EGA is more progressive and occurs closer to the first cell differentiation events. In this species, for which no transcriptomic arrays were available, we focused on genes expressed at EGA and first differentiation and established a 2,000-gene dedicated cDNA array. Screening this with pre-EGA, early post-EGA, and blastocyst embryos divided genes into seven clusters of expression according to their regulation during this period and revealed their dynamics of expression during EGA and first differentiation. Our results point to transient properties of embryo transcriptome at EGA, due not only to the transition between maternal and embryonic transcripts but also to the transient expression of a subset of embryonic genes whose functions remained largely uncharacterized. They also provide a first view of the functional consequences of the changes in gene expression program.
Collapse
Affiliation(s)
- R D Léandri
- INRA UMR 1198, Laboratoire de Biologie du Développement et Reproduction, Centre de Ressources Biologiques pour la Génomique des Animaux d'Elevages et d'Intérêt Economique, Jouy-en-Josas, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Baillet A, Mandon-Pépin B, Cabau C, Poumerol E, Pailhoux E, Cotinot C. Identification of transcripts involved in meiosis and follicle formation during ovine ovary development. BMC Genomics 2008; 9:436. [PMID: 18811939 PMCID: PMC2566313 DOI: 10.1186/1471-2164-9-436] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2008] [Accepted: 09/23/2008] [Indexed: 12/02/2022] Open
Abstract
Background The key steps in germ cell survival during ovarian development are the entry into meiosis of oogonies and the formation of primordial follicles, which then determine the reproductive lifespan of the ovary. In sheep, these steps occur during fetal life, between 55 and 80 days of gestation, respectively. The aim of this study was to identify differentially expressed ovarian genes during prophase I meiosis and early folliculogenesis in sheep. Results In order to elucidate the molecular events associated with early ovarian differentiation, we generated two ovary stage-specific subtracted cDNA libraries using SSH. Large-scale sequencing of these SSH libraries identified 6,080 ESTs representing 2,535 contigs. Clustering and assembly of these ESTs resulted in a total of 2,101 unique sequences depicted in 1,305 singleton (62.11%) and 796 contigs (37.9%) ESTs (clusters). BLASTX evaluation indicated that 99% of the ESTs were homologous to various known genes/proteins in a broad range of organisms, especially ovine, bovine and human species. The remaining 1% which exhibited any homology to known gene sequences was considered as novel. Detailed study of the expression patterns of some of these genes using RT-PCR revealed new promising candidates for ovary differentiation genes in sheep. Conclusion We showed that the SSH approach was relevant to determining new mammalian genes which might be involved in oogenesis and early follicle development, and enabled the discovery of new potential oocyte and granulosa cell markers for future studies. These genes may have significant implications regarding our understanding of ovarian function in molecular terms, and for the development of innovative strategies to both promote and control fertility.
Collapse
Affiliation(s)
- Adrienne Baillet
- INRA, ENVA, UMR 1198 Biologie du Développement et Reproduction, F-78350 Jouy en Josas, France.
| | | | | | | | | | | |
Collapse
|
29
|
Degrelle SA, Campion E, Cabau C, Piumi F, Reinaud P, Richard C, Renard JP, Hue I. Molecular evidence for a critical period in mural trophoblast development in bovine blastocysts. Dev Biol 2005; 288:448-60. [PMID: 16289134 DOI: 10.1016/j.ydbio.2005.09.043] [Citation(s) in RCA: 159] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2005] [Revised: 09/20/2005] [Accepted: 09/22/2005] [Indexed: 01/12/2023]
Abstract
Embryonic and extra-embryonic lineages are separated at the blastocyst stage in the mouse at the onset of implantation but well ahead of implantation in most mammals. To provide information on the development of the trophoblast lineage in late-implanting bovine embryos, we combined the use of molecular markers defining embryonic and extra-embryonic lineages in the mouse with a transcriptomic approach dedicated to the early steps of the elongation process, a characteristic feature of blastocyst development in ruminants. In this study, we present molecular evidence for differences between the cow and the mouse in the programming of trophoblast differentiation. This different programming encompasses: (i) the expression of epiblast specifying genes (Oct-4, Nanog) in bovine trophoblast cells at the onset of elongation, (ii) the transcription of proliferation markers in early elongating blastocysts, (iii) the early detection of trophoblast-specific transcripts related to extra-embryonic tissue's differentiation (Hand1, Ets2, IFN-tau) and (iv) the identification of a new transcript (c12) which displays a reciprocal pattern to that of Oct-4 and Nanog genes in the embryonic cells and for which no equivalent has thus far been found in the mouse. Altogether, these results tended to show that early elongation is a critical transition in bovine trophoblast development.
Collapse
Affiliation(s)
- Séverine A Degrelle
- UMR INRA/ENVA/CNRS Biologie du Développement et de la Reproduction, 78352 Jouy-en-Josas cedex, France
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Fang G, Ho C, Qiu Y, Cubas V, Yu Z, Cabau C, Cheung F, Moszer I, Danchin A. Specialized microbial databases for inductive exploration of microbial genome sequences. BMC Genomics 2005; 6:14. [PMID: 15698474 PMCID: PMC549560 DOI: 10.1186/1471-2164-6-14] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2004] [Accepted: 02/07/2005] [Indexed: 11/10/2022] Open
Abstract
Background The enormous amount of genome sequence data asks for user-oriented databases to manage sequences and annotations. Queries must include search tools permitting function identification through exploration of related objects. Methods The GenoList package for collecting and mining microbial genome databases has been rewritten using MySQL as the database management system. Functions that were not available in MySQL, such as nested subquery, have been implemented. Results Inductive reasoning in the study of genomes starts from "islands of knowledge", centered around genes with some known background. With this concept of "neighborhood" in mind, a modified version of the GenoList structure has been used for organizing sequence data from prokaryotic genomes of particular interest in China. GenoChore , a set of 17 specialized end-user-oriented microbial databases (including one instance of Microsporidia, Encephalitozoon cuniculi, a member of Eukarya) has been made publicly available. These databases allow the user to browse genome sequence and annotation data using standard queries. In addition they provide a weekly update of searches against the world-wide protein sequences data libraries, allowing one to monitor annotation updates on genes of interest. Finally, they allow users to search for patterns in DNA or protein sequences, taking into account a clustering of genes into formal operons, as well as providing extra facilities to query sequences using predefined sequence patterns. Conclusion This growing set of specialized microbial databases organize data created by the first Chinese bacterial genome programs (ThermaList, Thermoanaerobacter tencongensis, LeptoList, with two different genomes of Leptospira interrogans and SepiList, Staphylococcus epidermidis) associated to related organisms for comparison.
Collapse
Affiliation(s)
- Gang Fang
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
- Unité de Génétique des Génomes Bactériens, CNRS URA2171, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
| | - Christine Ho
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Yaowu Qiu
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Virginie Cubas
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Zhou Yu
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Cédric Cabau
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Frankie Cheung
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
| | - Ivan Moszer
- Plate-forme Intégration et Analyse Génomiques, Genopole, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
- Unité de Génétique des Génomes Bactériens, CNRS URA2171, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
| | - Antoine Danchin
- HKU-Pasteur Research Centre, Dexter HC Man Building, 8, Sassoon Road, Pokfulam, Hong Kong, China
- Unité de Génétique des Génomes Bactériens, CNRS URA2171, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
| |
Collapse
|
31
|
Lahbib-Mansais Y, Tosser-Klopp G, Leroux S, Cabau C, Karsenty E, Milan D, Barillot E, Yerle M, Hatey F, Gellin J. Contribution to high-resolution mapping in pigs with 101 type I markers and progress in comparative map between humans and pigs. Mamm Genome 2003; 14:275-88. [PMID: 12682780 DOI: 10.1007/s00335-002-2236-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2002] [Accepted: 12/09/2002] [Indexed: 10/27/2022]
Abstract
In the frame of the European program GenetPig, we localized on the Pig map 105 coding sequences (type I markers) from different origins, using INRA-University of Minnesota porcine Radiation Hybrid Panel (IMpRH, 101 markers) and somatic cell hybrid panel (SCHP, 93 markers, of which only four were not also mapped using IMpRH). Thus, we contributed to the improvement of the porcine high-resolution map, and we complemented the integration between the RH and cytogenetic maps. IMpRH tools allowed us to map 101 new markers relatively to reference markers of the first generation radiation hybrid map. Ninety out of 101 markers are linked to an already mapped marker with a LOD score greater than 4.8. Seventy-eight markers were informative for comparative mapping. Comparison of marker positions on the RH map with those obtained on the cytogenetic map or those expected by Human-Pig comparative map data suggested to us to be cautious with markers linked with a LOD lower than 6. These results allowed us to specify chromosomal fragments well conserved between humans and pigs and also to suggest new correspondences (Sscr1-Hsap3, Sscr9-Hsap9, Sscr13-Hsap11, Sscr15-Hsap6) confirmed by FISH on pig chromosomes. We examined in more detail the comparative map between Hsap12 and Sscr5 considering gene order, which suggests that rearrangements have occurred within the conserved synteny.
Collapse
Affiliation(s)
- Yvette Lahbib-Mansais
- INRA, Laboratoire de Génétique Cellulaire, BP27, 31326 Castanet-Tolosan Cedex, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Genêt C, Renard C, Cabau C, Rogel-Gaillard C, Gellin J, Milan D. In the QTL region surrounding porcine MHC, gene order is conserved with human genome. Mamm Genome 2001; 12:246-9. [PMID: 11252175 DOI: 10.1007/s003350010261] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2000] [Accepted: 10/27/2000] [Indexed: 10/28/2022]
Abstract
On the porcine genome, the region surrounding the Major Histocompatibility Complex, also called Swine Leukocyte Antigens (SLA), is of particular interest not only owing to itq role in the control of immune response, but also because of its influence on many traits such as growth, fatness, and meat quality. To help in the identification of responsible genes, detailed comparative maps of the MHC region in mammalian species and powerful mapping tools allowing accurate ordering of genes and markers in this region are needed. In this report, we describe the use of the recently developed IMpRH radiation hybrid panel, to construct a higher density radiation hybrid map of swine Sscr 7p-q12, containing 23 additional loci. Our results show that the gene order is conserved between the two MHC-containing regions, even if an inversion is observed above the QTL region in the region containing DEK, SCA1, and EDN1 genes. The framework map produced shows that the IMpRH panel permits the ordering of genes and markers in the three MHC classes and would thus allow accurate localization of ESTs and candidate genes.
Collapse
Affiliation(s)
- C Genêt
- Laboratoire de Génétique Cellulaire, INRA, Castanet-Tolosan, France
| | | | | | | | | | | |
Collapse
|
33
|
Milan D, Hawken R, Cabau C, Leroux S, Genet C, Lahbib Y, Tosser G, Robic A, Hatey F, Alexander L, Beattie C, Schook L, Yerle M, Gellin J. IMpRH server: an RH mapping server available on the Web. Bioinformatics 2000; 16:558-9. [PMID: 10980153 DOI: 10.1093/bioinformatics/16.6.558] [Citation(s) in RCA: 192] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SUMMARY The INRA-Minnesota Porcine Radiation Hybrid (IMpRH) Server provides both a mapping tool (IMpRH mapping tool) and a database (IMpRH database) of officially submitted results. The mapping tool permits the mapping of a new marker relatively to markers previously mapped on the IMpRH panel. The IMpRH database is the official database for submission of new results and queries. The database not only permits the sharing of public data but also semi-private and private data.
Collapse
Affiliation(s)
- D Milan
- Laboratoire de Génétique Cellulaire, INRA, BP27, 31326 Castanet-Tolosan, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|