1
|
Zhimulev I, Vatolina T, Levitsky V, Tsukanov A. Developmental and Housekeeping Genes: Two Types of Genetic Organization in the Drosophila Genome. Int J Mol Sci 2024; 25:4068. [PMID: 38612878 PMCID: PMC11012173 DOI: 10.3390/ijms25074068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 04/01/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
We developed a procedure for locating genes on Drosophila melanogaster polytene chromosomes and described three types of chromosome structures (gray bands, black bands, and interbands), which differed markedly in morphological and genetic properties. This was reached through the use of our original methods of molecular and genetic analysis, electron microscopy, and bioinformatics data processing. Analysis of the genome-wide distribution of these properties led us to a bioinformatics model of the Drosophila genome organization, in which the genome was divided into two groups of genes. One was constituted by 65, in which the genome was divided into two groups, 62 genes that are expressed in most cell types during life cycle and perform basic cellular functions (the so-called "housekeeping genes"). The other one was made up of 3162 genes that are expressed only at particular stages of development ("developmental genes"). These two groups of genes are so different that we may state that the genome has two types of genetic organization. Different are the timings of their expression, chromatin packaging levels, the composition of activating and deactivating proteins, the sizes of these genes, the lengths of their introns, the organization of the promoter regions of the genes, the locations of origin recognition complexes (ORCs), and DNA replication timings.
Collapse
Affiliation(s)
- Igor Zhimulev
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Tatyana Vatolina
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Victor Levitsky
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| | - Anton Tsukanov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| |
Collapse
|
2
|
Kimura A, Go AC, Markow T, Ranz JM. Evidence of Nonrandom Patterns of Functional Chromosome Organization in Danaus plexippus. Genome Biol Evol 2024; 16:evae054. [PMID: 38488057 PMCID: PMC10972686 DOI: 10.1093/gbe/evae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2024] [Indexed: 05/01/2024] Open
Abstract
Our understanding on the interplay between gene functionality and gene arrangement at different chromosome scales relies on a few Diptera and the honeybee, species with quality reference genome assemblies, accurate gene annotations, and abundant transcriptome data. Using recently generated 'omic resources in the monarch butterfly Danaus plexippus, a species with many more and smaller chromosomes relative to Drosophila species and the honeybee, we examined the organization of genes preferentially expressed at broadly defined developmental stages (larva, pupa, adult males, and adult females) at both fine and whole-chromosome scales. We found that developmental stage-regulated genes do not form more clusters, but do form larger clusters, than expected by chance, a pattern consistent across the gene categories examined. Notably, out of the 30 chromosomes in the monarch genome, 12 of them, plus the fraction of the chromosome Z that corresponds to the ancestral Z in other Lepidoptera, were found enriched for developmental stage-regulated genes. These two levels of nonrandom gene organization are not independent as enriched chromosomes for developmental stage-regulated genes tend to harbor disproportionately large clusters of these genes. Further, although paralogous genes were overrepresented in gene clusters, their presence is not enough to explain two-thirds of the documented cases of whole-chromosome enrichment. The composition of the largest clusters often included paralogs from more than one multigene family as well as unrelated single-copy genes. Our results reveal intriguing patterns at the whole-chromosome scale in D. plexippus while shedding light on the interplay between gene expression and chromosome organization beyond Diptera and Hymenoptera.
Collapse
Affiliation(s)
- Ashlyn Kimura
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| | - Alwyn C Go
- Department of Biology, University of Winnipeg, Winnipeg, MB R3B 2E9, Canada
| | - Therese Markow
- Unidad de Genómica Avanzada (Langebio), CINVESTAV, Irapuato, GTO 36824, México
- Section of Cell and Developmental Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - José M Ranz
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| |
Collapse
|
3
|
Xie J, Ma A, Fennell A, Ma Q, Zhao J. It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data. Brief Bioinform 2020; 20:1449-1464. [PMID: 29490019 DOI: 10.1093/bib/bby014] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Revised: 01/16/2018] [Indexed: 12/12/2022] Open
Abstract
Biclustering is a powerful data mining technique that allows clustering of rows and columns, simultaneously, in a matrix-format data set. It was first applied to gene expression data in 2000, aiming to identify co-expressed genes under a subset of all the conditions/samples. During the past 17 years, tens of biclustering algorithms and tools have been developed to enhance the ability to make sense out of large data sets generated in the wake of high-throughput omics technologies. These algorithms and tools have been applied to a wide variety of data types, including but not limited to, genomes, transcriptomes, exomes, epigenomes, phenomes and pharmacogenomes. However, there is still a considerable gap between biclustering methodology development and comprehensive data interpretation, mainly because of the lack of knowledge for the selection of appropriate biclustering tools and further supporting computational techniques in specific studies. Here, we first deliver a brief introduction to the existing biclustering algorithms and tools in public domain, and then systematically summarize the basic applications of biclustering for biological data and more advanced applications of biclustering for biomedical data. This review will assist researchers to effectively analyze their big data and generate valuable biological knowledge and novel insights with higher efficiency.
Collapse
|
4
|
Different substrate regimes determine transcriptional profiles and gene co-expression in Methanosarcina barkeri (DSM 800). Appl Microbiol Biotechnol 2017; 101:7303-7316. [DOI: 10.1007/s00253-017-8457-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Revised: 07/26/2017] [Accepted: 07/30/2017] [Indexed: 01/15/2023]
|
5
|
Larsson J, Lönn M, Lind EE, Świeżak J, Smolarz K, Grahn M. Sewage treatment plant associated genetic differentiation in the blue mussel from the Baltic Sea and Swedish west coast. PeerJ 2016; 4:e2628. [PMID: 27812424 PMCID: PMC5088577 DOI: 10.7717/peerj.2628] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 09/29/2016] [Indexed: 12/31/2022] Open
Abstract
Human-derived environmental pollutants and nutrients that reach the aquatic environment through sewage effluents, agricultural and industrial processes are constantly contributing to environmental changes that serve as drivers for adaptive responses and evolutionary changes in many taxa. In this study, we examined how two types of point sources of aquatic environmental pollution, harbors and sewage treatment plants, affect gene diversity and genetic differentiation in the blue mussel in the Baltic Sea area and off the Swedish west coast (Skagerrak). Reference sites (REF) were geographically paired with sites from sewage treatments plant (STP) and harbors (HAR) with a nested sampling scheme, and genetic differentiation was evaluated using a high-resolution marker amplified fragment length polymorphism (AFLP). This study showed that genetic composition in the Baltic Sea blue mussel was associated with exposure to sewage treatment plant effluents. In addition, mussel populations from harbors were genetically divergent, in contrast to the sewage treatment plant populations, suggesting that there is an effect of pollution from harbors but that the direction is divergent and site specific, while the pollution effect from sewage treatment plants on the genetic composition of blue mussel populations acts in the same direction in the investigated sites.
Collapse
Affiliation(s)
- Josefine Larsson
- School of Natural Science, Technology and Environmental Studies, Södertörn University, Huddinge, Stockholm, Sweden
| | - Mikael Lönn
- School of Natural Science, Technology and Environmental Studies, Södertörn University, Huddinge, Stockholm, Sweden
| | - Emma E. Lind
- Department of Aquatic Resources, Swedish University of Agricultural Sciences, Drottningholm, Stockholm, Stockholm, Sweden
| | - Justyna Świeżak
- Department of Marine Ecosystem Functioning, University of Gdansk, Institute of Oceanography, Gdynia, Poland
| | - Katarzyna Smolarz
- Department of Marine Ecosystem Functioning, University of Gdansk, Institute of Oceanography, Gdynia, Poland
| | - Mats Grahn
- School of Natural Science, Technology and Environmental Studies, Södertörn University, Huddinge, Stockholm, Sweden
| |
Collapse
|
6
|
The clustering of functionally related genes contributes to CNV-mediated disease. Genome Res 2015; 25:802-13. [PMID: 25887030 PMCID: PMC4448677 DOI: 10.1101/gr.184325.114] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 04/13/2015] [Indexed: 12/20/2022]
Abstract
Clusters of functionally related genes can be disrupted by a single copy number variant (CNV). We demonstrate that the simultaneous disruption of multiple functionally related genes is a frequent and significant characteristic of de novo CNVs in patients with developmental disorders (P = 1 × 10−3). Using three different functional networks, we identified unexpectedly large numbers of functionally related genes within de novo CNVs from two large independent cohorts of individuals with developmental disorders. The presence of multiple functionally related genes was a significant predictor of a CNV's pathogenicity when compared to CNVs from apparently healthy individuals and a better predictor than the presence of known disease or haploinsufficient genes for larger CNVs. The functionally related genes found in the de novo CNVs belonged to 70% of all clusters of functionally related genes found across the genome. De novo CNVs were more likely to affect functional clusters and affect them to a greater extent than benign CNVs (P = 6 × 10−4). Furthermore, such clusters of functionally related genes are phenotypically informative: Different patients possessing CNVs that affect the same cluster of functionally related genes exhibit more similar phenotypes than expected (P < 0.05). The spanning of multiple functionally similar genes by single CNVs contributes substantially to how these variants exert their pathogenic effects.
Collapse
|
7
|
O'Neil ST, Dzurisin JDK, Williams CM, Lobo NF, Higgins JK, Deines JM, Carmichael RD, Zeng E, Tan JC, Wu GC, Emrich SJ, Hellmann JJ. Gene expression in closely related species mirrors local adaptation: consequences for responses to a warming world. Mol Ecol 2014; 23:2686-98. [PMID: 24766086 DOI: 10.1111/mec.12773] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Revised: 04/18/2014] [Accepted: 04/23/2014] [Indexed: 11/27/2022]
Abstract
Local adaptation of populations could preclude or slow range expansions in response to changing climate, particularly when dispersal is limited. To investigate the differential responses of populations to changing climatic conditions, we exposed poleward peripheral and central populations of two Lepidoptera to reciprocal, common-garden climatic conditions and compared their whole-transcriptome expression. We found evidence of simple population differentiation in both species, and in the species with previously identified population structure and phenotypic local adaptation, we found several hundred genes that responded in a synchronized and localized fashion. These genes were primarily involved in energy metabolism and oxidative stress, and expression levels were most divergent between populations in the same environment in which we previously detected divergence for metabolism. We found no localized genes in the species with less population structure and for which no local adaptation was previously detected. These results challenge the assumption that species are functionally similar across their ranges and poleward peripheral populations are preadapted to warmer conditions. Rather, some taxa deserve population-level consideration when predicting the effects of climate change because they respond in genetically based, distinctive ways to changing conditions.
Collapse
Affiliation(s)
- Shawn T O'Neil
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Rubin AF, Green P. Expression-based segmentation of the Drosophila genome. BMC Genomics 2013; 14:812. [PMID: 24256206 PMCID: PMC3909303 DOI: 10.1186/1471-2164-14-812] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Accepted: 11/18/2013] [Indexed: 01/22/2023] Open
Abstract
Background It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. Results We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. Conclusions We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding.
Collapse
Affiliation(s)
- Alan F Rubin
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | | |
Collapse
|
9
|
Rieder D, Ploner C, Krogsdam AM, Stocker G, Fischer M, Scheideler M, Dani C, Amri EZ, Müller WG, McNally JG, Trajanoski Z. Co-expressed genes prepositioned in spatial neighborhoods stochastically associate with SC35 speckles and RNA polymerase II factories. Cell Mol Life Sci 2013; 71:1741-59. [PMID: 24026398 DOI: 10.1007/s00018-013-1465-3] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Revised: 08/02/2013] [Accepted: 08/28/2013] [Indexed: 11/26/2022]
Abstract
Chromosomally separated, co-expressed genes can be in spatial proximity, but there is still debate about how this nuclear organization is achieved. Proposed mechanisms include global genome organization, preferential positioning of chromosome territories, or gene-gene sharing of various nuclear bodies. To investigate this question, we selected a set of genes that were co-expressed upon differentiation of human multipotent stem cells. We applied a novel multi-dimensional analysis procedure which revealed that prior to gene expression, the relative position of these genes was conserved in nuclei. Upon stem cell differentiation and concomitant gene expression, we found that co-expressed genes were closer together. In addition, we found that genes in the same 1-μm-diameter neighborhood associated with either the same splicing speckle or to a lesser extent with the same transcription factory. Dispersal of speckles by overexpression of the serine-arginine (SR) protein kinase cdc2-like kinase Clk2 led to a significant drop in the number of genes in shared neighborhoods. We demonstrate quantitatively that the frequencies of speckle and factory sharing can be explained by assuming stochastic selection of a nuclear body within a restricted sub-volume defined by the original global gene positioning present prior to gene expression. We conclude that the spatial organization of these genes is a two-step process in which transcription-induced association with nuclear bodies enhances and refines a pre-existing global organization.
Collapse
Affiliation(s)
- Dietmar Rieder
- Division of Bioinformatics, Biocenter, Innsbruck Medical University, Innrain 80, 6020, Innsbruck, Austria
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Fang G, Passalacqua KD, Hocking J, Llopis PM, Gerstein M, Bergman NH, Jacobs-Wagner C. Transcriptomic and phylogenetic analysis of a bacterial cell cycle reveals strong associations between gene co-expression and evolution. BMC Genomics 2013; 14:450. [PMID: 23829427 PMCID: PMC3829707 DOI: 10.1186/1471-2164-14-450] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2012] [Accepted: 05/13/2013] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The genetic network involved in the bacterial cell cycle is poorly understood even though it underpins the remarkable ability of bacteria to proliferate. How such network evolves is even less clear. The major aims of this work were to identify and examine the genes and pathways that are differentially expressed during the Caulobacter crescentus cell cycle, and to analyze the evolutionary features of the cell cycle network. RESULTS We used deep RNA sequencing to obtain high coverage RNA-Seq data of five C. crescentus cell cycle stages, each with three biological replicates. We found that 1,586 genes (over a third of the genome) display significant differential expression between stages. This gene list, which contains many genes previously unknown for their cell cycle regulation, includes almost half of the genes involved in primary metabolism, suggesting that these "house-keeping" genes are not constitutively transcribed during the cell cycle, as often assumed. Gene and module co-expression clustering reveal co-regulated pathways and suggest functionally coupled genes. In addition, an evolutionary analysis of the cell cycle network shows a high correlation between co-expression and co-evolution. Most co-expression modules have strong phylogenetic signals, with broadly conserved genes and clade-specific genes predominating different substructures of the cell cycle co-expression network. We also found that conserved genes tend to determine the expression profile of their module. CONCLUSION We describe the first phylogenetic and single-nucleotide-resolution transcriptomic analysis of a bacterial cell cycle network. In addition, the study suggests how evolution has shaped this network and provides direct biological network support that selective pressure is not on individual genes but rather on the relationship between genes, which highlights the importance of integrating phylogenetic analysis into biological network studies.
Collapse
Affiliation(s)
- Gang Fang
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA.
| | | | | | | | | | | | | |
Collapse
|
11
|
White-Cooper H, Caporilli S. Transcriptional and post-transcriptional regulation of Drosophila germline stem cells and their differentiating progeny. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 786:47-61. [PMID: 23696351 DOI: 10.1007/978-94-007-6621-1_4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
In this chapter we will concentrate on the transcriptional and translational regulations that govern the development and differentiation of male germline cells. Our focus will be on the processes that occur during differentiation, that distinguish the differentiating population of cells from their stem cell parents. We discuss how these defining features are established as cells transit from a stem cell character to that of a fully committed differentiating cell. The focus will be on how GSCs differentiate, via spermatogonia, to spermatocytes. We will achieve this by first describing the transcriptional activity in the differentiating spermatocytes, cataloguing the known transcriptional regulators in these cells and then investigating how the transcription programme is set up by processes in the progentior cells. This process is particularly interesting to study from a stem cell perspective as the male GSCs are unipotent, so lineage decisions in differentiating progeny of stem cells, which occurs in many other stem cell systems, do not impinge on the behaviour of these cells.
Collapse
|
12
|
Naseeb S, Delneri D. Impact of chromosomal inversions on the yeast DAL cluster. PLoS One 2012; 7:e42022. [PMID: 22916115 PMCID: PMC3419248 DOI: 10.1371/journal.pone.0042022] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2012] [Accepted: 07/02/2012] [Indexed: 11/19/2022] Open
Abstract
Chromosomal rearrangements occur readily in nature and are a major reshaping force during genome evolution. Such large scale modifications are usually deleterious causing several fitness defects, but sometimes can confer an advantage and become adaptive. For example the DAL metabolic cluster in yeast was assembled in recent evolutionary times in the Hemiascomycetes lineage, through a set of rearrangements that brought together the genes involved in the allantoin degradation pathway. In eukaryotes, the existence of physical clustering of genes with related functions supports the notion that neighbouring ORFs tend to be co-expressed and that the order of genes along the chromosomes may have biological significance, rather than being random as previously believed. In this study, we investigate the phenotypic effect that inversions have on the DAL gene cluster, expressed during nitrogen starvation. In all Saccharomyces "sensu stricto" species the order of the DAL cluster is conserved, while in the "sensu lato" species Naumovia castellii, which grows significantly worse than S. cerevisiae on allantoin, the cluster includes two nested inversions encompassing three DAL genes. We constructed several inverted and non-inverted S. cerevisiae strains possessing different inversions including those to mimic the configuration of the N. castellii DAL cluster. We showed that the inversion of DAL2 lower its own expression and reduces yeast fitness during nitrogen starvation. This rearrangement also altered the expression of the neighbouring genes DAL1 and DAL4. Moreover, we showed that the expression of the DAL4 anti-sense transcript (SUT614) does not change upon inversions of DAL2 and therefore is unlikely to be involved in its regulation. These results show that the order of the DAL cluster has an impact on the phenotype and gene expression, suggesting that these rearrangements may have been adaptive in the "sensu stricto" group in relation to the low availability of nitrogen in the environment.
Collapse
Affiliation(s)
- Samina Naseeb
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Daniela Delneri
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
- * E-mail:
| |
Collapse
|
13
|
Abstract
Males and females share most of the same genes, so selection in one sex will typically produce a correlated response in the other sex. Yet, the sexes have evolved to differ in a multitude of behavioral, morphological, and physiological traits. How did this sexual dimorphism evolve despite the presence of a common underlying genome? We investigated the potential role of gene duplication in the evolution of sexual dimorphism. Because duplication events provide extra genetic material, the sexes each might use this redundancy to facilitate sex-specific gene expression, permitting the evolution of dimorphism. We investigated this hypothesis at the genome-wide level in Drosophila melanogaster, using the presence of sex-biased expression as a proxy for the sex-specific specialization of gene function. We expected that if sexually antagonistic selection is a potent force acting upon individual genes, duplication will result in paralog families whose members differ in sex-biased expression. Gene members of the same duplicate family can have different expression patterns in males versus females. In particular, duplicate pairs containing a male-biased gene are found more frequently than expected, in agreement with previous studies. Furthermore, when the singleton ortholog is unbiased, duplication appears to allow one of the paralog copies to acquire male-biased expression. Conversely, female-biased expression is not common among duplicates; fewer duplicate genes are expressed in the female-soma and ovaries than in the male-soma and testes. Expression divergence exists more in older than in younger duplicates pairs, but expression divergence does not correlate with protein sequence divergence. Finally, genomic proximity may have an effect on whether paralogs differ in sex-biased expression. We conclude that the data are consistent with a role of gene duplication in fostering male-biased, but not female-biased, gene expression, thereby aiding the evolution of sexual dimorphism.
Collapse
Affiliation(s)
- Minyoung J Wyman
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada.
| | | | | |
Collapse
|
14
|
Faro A, Giordano D, Spampinato C. Combining literature text mining with microarray data: advances for system biology modeling. Brief Bioinform 2011; 13:61-82. [PMID: 21677032 DOI: 10.1093/bib/bbr018] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
A huge amount of important biomedical information is hidden in the bulk of research articles in biomedical fields. At the same time, the publication of databases of biological information and of experimental datasets generated by high-throughput methods is in great expansion, and a wealth of annotated gene databases, chemical, genomic (including microarray datasets), clinical and other types of data repositories are now available on the Web. Thus a current challenge of bioinformatics is to develop targeted methods and tools that integrate scientific literature, biological databases and experimental data for reducing the time of database curation and for accessing evidence, either in the literature or in the datasets, useful for the analysis at hand. Under this scenario, this article reviews the knowledge discovery systems that fuse information from the literature, gathered by text mining, with microarray data for enriching the lists of down and upregulated genes with elements for biological understanding and for generating and validating new biological hypothesis. Finally, an easy to use and freely accessible tool, GeneWizard, that exploits text mining and microarray data fusion for supporting researchers in discovering gene-disease relationships is described.
Collapse
Affiliation(s)
- Alberto Faro
- Department of Informatics and Telecommunication Engineering-University of Catania, Catania, Italy
| | | | | |
Collapse
|
15
|
Wurmser F, Ogereau D, Mary-Huard T, Loriod B, Joly D, Montchamp-Moreau C. Population transcriptomics: insights from Drosophila simulans, Drosophila sechellia and their hybrids. Genetica 2011; 139:465-77. [PMID: 21424276 DOI: 10.1007/s10709-011-9566-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2010] [Accepted: 03/07/2011] [Indexed: 01/03/2023]
Abstract
Sequence differentiation has been widely studied between populations and species, whereas interest in expression divergence is relatively recent. Using microarrays, we compared four geographically distinct populations of Drosophila simulans and a population of Drosophila sechellia, and interspecific hybrids. We observed few differences between populations, suggesting a slight population structure in D. simulans. This structure was observed in direct population comparisons, as well as in interspecific comparisons (hybrids vs. parents, D. sechellia vs. D. simulans). Expression variance is higher in the French and Zimbabwean populations than in the populations from the ancestral range of D. simulans (Kenya and Seychelles). This suggests a large scale phenomenon of decanalization following the invasion of a new environment. Comparing D. simulans and D. sechellia, we revealed 304 consistently differentially expressed genes, with striking overrepresentation of genes of the cytochrome P450 family, which could be related to their role in detoxification as well as in hormone regulation. We also revealed differences in genes involved in Juvenile hormone and Dopamine differentiation. We finally observed very few differentially expressed genes between hybrids and parental populations, with an overrepresentation of X-linked genes.
Collapse
Affiliation(s)
- François Wurmser
- Laboratoire Evolution, Génomes et Spéciation, CNRS UPR9034 Avenue de la Terrasse, Gif-sur-Yvette F-91198 Cedex, and Univ Paris-Sud, 91405 Orsay, France.
| | | | | | | | | | | |
Collapse
|
16
|
Weber CC, Hurst LD. Support for multiple classes of local expression clusters in Drosophila melanogaster, but no evidence for gene order conservation. Genome Biol 2011; 12:R23. [PMID: 21414197 PMCID: PMC3129673 DOI: 10.1186/gb-2011-12-3-r23] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2011] [Revised: 03/04/2011] [Accepted: 03/17/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Gene order in eukaryotic genomes is not random, with genes with similar expression profiles tending to cluster. In yeasts, the model taxon for gene order analysis, such syntenic clusters of non-homologous genes tend to be conserved over evolutionary time. Whether similar clusters show gene order conservation in other lineages is, however, undecided. Here, we examine this issue in Drosophila melanogaster using high-resolution chromosome rearrangement data. RESULTS We show that D. melanogaster has at least three classes of expression clusters: first, as observed in mammals, large clusters of functionally unrelated housekeeping genes; second, small clusters of functionally related highly co-expressed genes; and finally, as previously defined by Spellman and Rubin, larger domains of co-expressed but functionally unrelated genes. The latter are, however, not independent of the small co-expression clusters and likely reflect a methodological artifact. While the small co-expression and housekeeping/essential gene clusters resemble those observed in yeast, in contrast to yeast, we see no evidence that any of the three cluster types are preserved as synteny blocks. If anything, adjacent co-expressed genes are more likely to become rearranged than expected. Again in contrast to yeast, in D. melanogaster, gene pairs with short intergene distance or in divergent orientations tend to have higher rearrangement rates. These findings are consistent with co-expression being partly due to shared chromatin environment. CONCLUSIONS We conclude that, while similar in terms of cluster types, gene order evolution has strikingly different patterns in yeasts and in D. melanogaster, although recombination is associated with gene order rearrangement in both.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | | |
Collapse
|
17
|
Chang PL, Dilkes BP, McMahon M, Comai L, Nuzhdin SV. Homoeolog-specific retention and use in allotetraploid Arabidopsis suecica depends on parent of origin and network partners. Genome Biol 2010; 11:R125. [PMID: 21182768 PMCID: PMC3046485 DOI: 10.1186/gb-2010-11-12-r125] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2010] [Revised: 11/06/2010] [Accepted: 12/23/2010] [Indexed: 01/17/2023] Open
Abstract
Background Allotetraploids carry pairs of diverged homoeologs for most genes. With the genome doubled in size, the number of putative interactions is enormous. This poses challenges on how to coordinate the two disparate genomes, and creates opportunities by enhancing the phenotypic variation. New combinations of alleles co-adapt and respond to new environmental pressures. Three stages of the allopolyploidization process - parental species divergence, hybridization, and genome duplication - have been well analyzed. The last stage of evolutionary adjustments remains mysterious. Results Homoeolog-specific retention and use were analyzed in Arabidopsis suecica (As), a species derived from A. thaliana (At) and A. arenosa (Aa) in a single event 12,000 to 300,000 years ago. We used 405,466 diagnostic features on tiling microarrays to recognize At and Aa contributions to the As genome and transcriptome: 324 genes lacked Aa contributions and 614 genes lacked At contributions within As. In leaf tissues, 3,458 genes preferentially expressed At homoeologs while 4,150 favored Aa homoeologs. These patterns were validated with resequencing. Genes with preferential use of Aa homoeologs were enriched for expression functions, consistent with the dominance of Aa transcription. Heterologous networks - mixed from At and Aa transcripts - were underrepresented. Conclusions Thousands of deleted and silenced homoeologs in the genome of As were identified. Since heterologous networks may be compromised by interspecies incompatibilities, these networks evolve co-biases, expressing either only Aa or only At homoeologs. This progressive change towards predominantly pure parental networks might contribute to phenotypic variability and plasticity, and enable the species to exploit a larger range of environments.
Collapse
Affiliation(s)
- Peter L Chang
- Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 201, Los Angeles, CA 90089-2910, USA.
| | | | | | | | | |
Collapse
|
18
|
Neighbourhood continuity is not required for correct testis gene expression in Drosophila. PLoS Biol 2010; 8:e1000552. [PMID: 21151342 PMCID: PMC2994658 DOI: 10.1371/journal.pbio.1000552] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Accepted: 10/20/2010] [Indexed: 01/26/2023] Open
Abstract
Disrupting the linear organization of testis gene expression neighborhoods in the Drosophila genome does not affect gene expression, suggesting that neighborhood organization is not primarily driven by gene expression requirements. It is now widely accepted that gene organisation in eukaryotic genomes is non-random and it is proposed that such organisation may be important for gene expression and genome evolution. In particular, the results of several large-scale gene expression analyses in a range of organisms from yeast to human indicate that sets of genes with similar tissue-specific or temporal expression profiles are clustered within the genome in gene expression neighbourhoods. While the existence of neighbourhoods is clearly established, the underlying reason for this facet of genome organisation is currently unclear and there is little experimental evidence that addresses the genomic requisites for neighbourhood organisation. We report the targeted disruption of three well-defined male-specific gene expression neighbourhoods in the Drosophila genome by the synthesis of precisely mapped chromosomal inversions. We compare gene expression in individuals carrying inverted chromosomes with their non-inverted but otherwise identical progenitors using whole-transcriptome microarray analysis, validating these data with specific quantitative real-time PCR assays. For each neighbourhood we generate and examine multiple inversions. We find no significant differences in the expression of genes that define each of the neighbourhoods. We further show that the inversions spatially separate both halves of a neighbourhood in the nucleus. Thus, models explaining neighbourhood organisation in terms of local sequence interactions, enhancer crosstalk, or short-range chromatin effects are unlikely to account for this facet of genome organisation. Our study challenges the notion that, at least in the case of the testis, expression neighbourhoods are a feature of eukaryotic genome organisation necessary for correct gene expression. The order of genes within eukaryotic genomes is not completely random. In all genomes characterised to date there are regions of the genome, known as gene expression neighbourhoods, which contain clusters of genes that are expressed together in a particular tissue or at a particular developmental stage. Comparative genomics indicates that at least some neighbourhoods have been conserved during evolution, suggesting that this facet of genome organisation may be functionally advantageous. While several models explaining the organisation of the genome into neighbourhoods have been proposed, the functional significance of neighbourhood organisation has not been experimentally tested. Here, we report experiments that disrupt defined testis gene expression neighbourhoods in the Drosophila genome. We generated chromosomal inversions with a breakpoint within a neighbourhood, defined as having genes co-expressed within the testis. Comparing gene expression in flies carrying inversions with their otherwise identical progenitors shows that maintaining the linear organisation of genes in a neighbourhood is not necessary for correct gene expression. We also show that it is not necessary for genes in a neighbourhood to be in close proximity in the nucleus for them to be co-expressed, since the inversions disrupt the spatial organisation of neighbourhood genes in the nucleus. Our experiments indicate that the current models used to account for the existence of gene expression neighbourhoods are unlikely to be sufficient.
Collapse
|
19
|
A conserved gene cluster as a putative functional unit in insect innate immunity. FEBS Lett 2010; 584:4375-8. [PMID: 20951134 DOI: 10.1016/j.febslet.2010.10.014] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2010] [Revised: 10/07/2010] [Accepted: 10/08/2010] [Indexed: 11/23/2022]
Abstract
The Nimrod gene superfamily is an important component of the innate immune response. The majority of its member genes are located in close proximity within the Drosophila melanogaster genome and they lie in a larger conserved cluster ("Nimrod cluster"), made up of non-related groups (families, superfamilies) of genes. This cluster has been a part of the Arthropod genomes for about 300-350 million years. The available data suggest that the Nimrod cluster is a functional module of the insect innate immune response.
Collapse
|
20
|
Yan H, Venkatesan K, Beaver JE, Klitgord N, Yildirim MA, Hao T, Hill DE, Cusick ME, Perrimon N, Roth FP, Vidal M. A genome-wide gene function prediction resource for Drosophila melanogaster. PLoS One 2010; 5:e12139. [PMID: 20711346 PMCID: PMC2920829 DOI: 10.1371/journal.pone.0012139] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2010] [Accepted: 07/14/2010] [Indexed: 11/19/2022] Open
Abstract
Predicting gene functions by integrating large-scale biological data remains a challenge for systems biology. Here we present a resource for Drosophila melanogaster gene function predictions. We trained function-specific classifiers to optimize the influence of different biological datasets for each functional category. Our model predicted GO terms and KEGG pathway memberships for Drosophila melanogaster genes with high accuracy, as affirmed by cross-validation, supporting literature evidence, and large-scale RNAi screens. The resulting resource of prioritized associations between Drosophila genes and their potential functions offers a guide for experimental investigations.
Collapse
Affiliation(s)
- Han Yan
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kavitha Venkatesan
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - John E. Beaver
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Niels Klitgord
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Muhammed A. Yildirim
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Applied Physics Program, Division of Engineering and Applied Sciences, Graduate School of Arts and Sciences, Harvard University, Cambridge, Massachusetts, United States of America
| | - Tong Hao
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - David E. Hill
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Michael E. Cusick
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Norbert Perrimon
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Howard Hughes Medical Institute, Boston, Massachusetts, United States of America
| | - Frederick P. Roth
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail: (FPR); (MV)
| | - Marc Vidal
- Department of Cancer Biology, Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail: (FPR); (MV)
| |
Collapse
|
21
|
Levine MT, Eckert ML, Begun DJ. Whole-genome expression plasticity across tropical and temperate Drosophila melanogaster populations from Eastern Australia. Mol Biol Evol 2010; 28:249-56. [PMID: 20671040 DOI: 10.1093/molbev/msq197] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
The genotypic signature of spatially varying selection is ubiquitous across the Drosophila melanogaster genome. Spatially structured adaptive phenotypic differences are also commonly found, particularly along New World and Australian latitudinal gradients. However, investigation of gene expression variation in one or multiple environments across these well-studied populations is surprisingly limited. Here, we report genome-wide transcript levels of tropical and temperate eastern Australian populations reared at two temperatures. As expected, a large number of genes exhibit geographic origin-dependent expression plasticity. Less expected was evidence for an enrichment of down-regulated genes in both temperate and tropical populations when lines were reared at the temperature less commonly encountered in the native range; that is, evidence for significant differences in a "directionality" of plasticity across these two climatic regions. We also report evidence of small scale "neighborhood effects" around those genes significant for geographic origin-dependent plasticity, a result consistent with the evolution of high level, likely chromatin based gene regulation during range expansion in D. melanogaster populations.
Collapse
Affiliation(s)
- Mia T Levine
- Department of Evolution and Ecology, University of California, Davis, USA.
| | | | | |
Collapse
|
22
|
von Grotthuss M, Ashburner M, Ranz JM. Fragile regions and not functional constraints predominate in shaping gene organization in the genus Drosophila. Genome Res 2010; 20:1084-96. [PMID: 20601587 DOI: 10.1101/gr.103713.109] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
During evolution, gene repatterning across eukaryotic genomes is not uniform. Some genomic regions exhibit a gene organization conserved phylogenetically, while others are recurrently involved in chromosomal rearrangement, resulting in breakpoint reuse. Both gene order conservation and breakpoint reuse can result from the existence of functional constraints on where chromosomal breakpoints occur or from the existence of regions that are susceptible to breakage. The balance between these two mechanisms is still poorly understood. Drosophila species have very dynamic genomes and, therefore, can be very informative. We compared the gene organization of the main five chromosomal elements (Muller's elements A-E) of nine Drosophila species. Under a parsimonious evolutionary scenario, we estimate that 6116 breakpoints differentiate the gene orders of the species and that breakpoint reuse is associated with approximately 80% of the orthologous landmarks. The comparison of the observed patterns of change in gene organization with those predicted under different simulated modes of evolution shows that fragile regions alone can explain the observed key patterns of Muller's element A (X chromosome) more often than for any other Muller's element. High levels of fragility plus constraints operating on approximately 15% of the genome are sufficient to explain the observed patterns of change and conservation across species. The orthologous landmarks more likely to be under constraint exhibit both a remarkable internal functional heterogeneity and a lack of common functional themes with the exception of the presence of highly conserved noncoding elements. Fragile regions rather than functional constraints have been the main determinant of the evolution of the Drosophila chromosomes.
Collapse
Affiliation(s)
- Marcin von Grotthuss
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| | | | | |
Collapse
|
23
|
Li S, Shih CH, Kohn MH. Functional and evolutionary correlates of gene constellations in the Drosophila melanogaster genome that deviate from the stereotypical gene architecture. BMC Genomics 2010; 11:322. [PMID: 20497561 PMCID: PMC2891614 DOI: 10.1186/1471-2164-11-322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 05/24/2010] [Indexed: 01/19/2023] Open
Abstract
Background The biological dimensions of genes are manifold. These include genomic properties, (e.g., X/autosomal linkage, recombination) and functional properties (e.g., expression level, tissue specificity). Multiple properties, each generally of subtle influence individually, may affect the evolution of genes or merely be (auto-)correlates. Results of multidimensional analyses may reveal the relative importance of these properties on the evolution of genes, and therefore help evaluate whether these properties should be considered during analyses. While numerous properties are now considered during studies, most work still assumes the stereotypical solitary gene as commonly depicted in textbooks. Here, we investigate the Drosophila melanogaster genome to determine whether deviations from the stereotypical gene architecture correlate with other properties of genes. Results Deviations from the stereotypical gene architecture were classified as the following gene constellations: Overlapping genes were defined as those that overlap in the 5-prime, exonic, or intronic regions. Chromatin co-clustering genes were defined as genes that co-clustered within 20 kb of transcriptional territories. If this scheme is applied the stereotypical gene emerges as a rare occurrence (7.5%), slightly varied schemes yielded between ~1%-50%. Moreover, when following our scheme, paired-overlapping genes and chromatin co-clustering genes accounted for 50.1 and 42.4% of the genes analyzed, respectively. Gene constellation was a correlate of a number of functional and evolutionary properties of genes, but its statistical effect was ~1-2 orders of magnitude lower than the effects of recombination, chromosome linkage and protein function. Analysis of datasets on male reproductive proteins showed these were biased in their representation of gene constellations and evolutionary rate Ka/Ks estimates, but these biases did not overwhelm the biologically meaningful observation of high evolutionary rates of male reproductive genes. Conclusion Given the rarity of the solitary stereotypical gene, and the abundance of gene constellations that deviate from it, the presence of gene constellations, while once thought to be exceptional in large Eukaryote genomes, might have broader relevance to the understanding and study of the genome. However, according to our definition, while gene constellations can be significant correlates of functional properties of genes, they generally are weak correlates of the evolution of genes. Thus, the need for their consideration would depend on the context of studies.
Collapse
Affiliation(s)
- Shuwei Li
- Department of Ecology and Evolutionary Biology, Rice University, 6100 Main Street, MS 170, Houston, Texas 77005, USA
| | | | | |
Collapse
|
24
|
Ng YK, Wu W, Zhang L. Positive correlation between gene coexpression and positional clustering in the zebrafish genome. BMC Genomics 2009; 10:42. [PMID: 19159490 PMCID: PMC2654907 DOI: 10.1186/1471-2164-10-42] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2008] [Accepted: 01/22/2009] [Indexed: 11/10/2022] Open
Abstract
Background Co-expressing genes tend to cluster in eukaryotic genomes. This paper analyzes correlation between the proximity of eukaryotic genes and their transcriptional expression pattern in the zebrafish (Danio rerio) genome using available microarray data and gene annotation. Results The analyses show that neighbouring genes are significantly coexpressed in the zebrafish genome, and the coexpression level is influenced by the intergenic distance and transcription orientation. This fact is further supported by examining the coexpression level of genes within positional clusters in the neighbourhood model. There is a positive correlation between gene coexpression and positional clustering in the zebrafish genome. Conclusion The study provides another piece of evidence for the hypothesis that coexpressed genes do cluster in the eukaryotic genomes.
Collapse
Affiliation(s)
- Yen Kaow Ng
- Department of Mathematics, National University of Singapore, 2 Science Drive 2, Singapore 117543, Singapore.
| | | | | |
Collapse
|
25
|
Genetic changes accompanying the evolution of host specialization in Drosophila sechellia. Genetics 2008; 181:721-36. [PMID: 19033155 DOI: 10.1534/genetics.108.093419] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Changes in host specialization contribute to the diversification of phytophagous insects. When shifting to a new host, insects evolve new physiological, morphological, and behavioral adaptations. Our understanding of the genetic changes responsible for these adaptations is limited. For instance, we do not know how often host shifts involve gain-of-function vs. loss-of-function alleles. Recent work suggests that some genes involved in odor recognition are lost in specialists. Here we show that genes involved in detoxification and metabolism, as well as those affecting olfaction, have reduced gene expression in Drosophila sechellia-a specialist on the fruit of Morinda citrifolia. We screened for genes that differ in expression between D. sechellia and its generalist sister species, D. simulans. We also screened for genes that are differentially expressed in D. sechellia when these flies chose their preferred host vs. when they were forced onto other food. D. sechellia increases expression of genes involved with oogenesis and fatty acid metabolism when on its host. The majority of differentially expressed genes, however, appear downregulated in D. sechellia. For several functionally related genes, this decrease in expression is associated with apparent loss-of-function alleles. For example, the D. sechellia allele of Odorant binding protein 56e (Obp56e) harbors a premature stop codon. We show that knockdown of Obp56e activity significantly reduces the avoidance response of D. melanogaster toward M. citrifolia. We argue that apparent loss-of-function alleles like Obp56e potentially contributed to the initial adaptation of D. sechellia to its host. Our results suggest that a subset of genes reduce or lose function as a consequence of host specialization, which may explain why, in general, specialist insects tend to shift to chemically similar hosts.
Collapse
|
26
|
Lawniczak MKN, Holloway AK, Begun DJ, Jones CD. Genomic analysis of the relationship between gene expression variation and DNA polymorphism in Drosophila simulans. Genome Biol 2008; 9:R125. [PMID: 18700012 PMCID: PMC2575515 DOI: 10.1186/gb-2008-9-8-r125] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2008] [Revised: 05/20/2008] [Accepted: 08/12/2008] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Understanding how DNA sequence polymorphism relates to variation in gene expression is essential to connecting genotypic differences with phenotypic differences among individuals. Addressing this question requires linking population genomic data with gene expression variation. RESULTS Using whole genome expression data and recent light shotgun genome sequencing of six Drosophila simulans genotypes, we assessed the relationship between expression variation in males and females and nucleotide polymorphism across thousands of loci. By examining sequence polymorphism in gene features, such as untranslated regions and introns, we find that genes showing greater variation in gene expression between genotypes also have higher levels of sequence polymorphism in many gene features. Accordingly, X-linked genes, which have lower sequence polymorphism levels than autosomal genes, also show less expression variation than autosomal genes. We also find that sex-specifically expressed genes show higher local levels of polymorphism and divergence than both sex-biased and unbiased genes, and that they appear to have simpler regulatory regions. CONCLUSION The gene-feature-based analyses and the X-to-autosome comparisons suggest that sequence polymorphism in cis-acting elements is an important determinant of expression variation. However, this relationship varies among the different categories of sex-biased expression, and trans factors might contribute more to male-specific gene expression than cis effects. Our analysis of sex-specific gene expression also shows that female-specific genes have been overlooked in analyses that only point to male-biased genes as having unusual patterns of evolution and that studies of sexually dimorphic traits need to recognize that the relationship between genetic and expression variation at these traits is different from the genome as a whole.
Collapse
Affiliation(s)
- Mara K N Lawniczak
- Division of Cell and Molecular Biology, Imperial College London, London, SW7 2AZ, UK.
| | | | | | | |
Collapse
|