1
|
Moschetti R, Palazzo A, Lorusso P, Viggiano L, Massimiliano Marsano R. "What You Need, Baby, I Got It": Transposable Elements as Suppliers of Cis-Operating Sequences in Drosophila. BIOLOGY 2020; 9:E25. [PMID: 32028630 PMCID: PMC7168160 DOI: 10.3390/biology9020025] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 01/27/2020] [Accepted: 01/30/2020] [Indexed: 12/18/2022]
Abstract
Transposable elements (TEs) are constitutive components of both eukaryotic and prokaryotic genomes. The role of TEs in the evolution of genes and genomes has been widely assessed over the past years in a variety of model and non-model organisms. Drosophila is undoubtedly among the most powerful model organisms used for the purpose of studying the role of transposons and their effects on the stability and evolution of genes and genomes. Besides their most intuitive role as insertional mutagens, TEs can modify the transcriptional pattern of host genes by juxtaposing new cis-regulatory sequences. A key element of TE biology is that they carry transcriptional control elements that fine-tune the transcription of their own genes, but that can also perturb the transcriptional activity of neighboring host genes. From this perspective, the transposition-mediated modulation of gene expression is an important issue for the short-term adaptation of physiological functions to the environmental changes, and for long-term evolutionary changes. Here, we review the current literature concerning the regulatory and structural elements operating in cis provided by TEs in Drosophila. Furthermore, we highlight that, besides their influence on both TEs and host genes expression, they can affect the chromatin structure and epigenetic status as well as both the chromosome's structure and stability. It emerges that Drosophila is a good model organism to study the effect of TE-linked regulatory sequences, and it could help future studies on TE-host interactions in any complex eukaryotic genome.
Collapse
Affiliation(s)
- Roberta Moschetti
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Via Orabona 4, 70125 Bari, Italy; (R.M.); (P.L.); (L.V.)
| | - Antonio Palazzo
- Laboratory of Translational Nanotechnology, “Istituto Tumori Giovanni Paolo II” I.R.C.C.S, Viale Orazio Flacco 65, 70125 Bari, Italy;
| | - Patrizio Lorusso
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Via Orabona 4, 70125 Bari, Italy; (R.M.); (P.L.); (L.V.)
| | - Luigi Viggiano
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Via Orabona 4, 70125 Bari, Italy; (R.M.); (P.L.); (L.V.)
| | - René Massimiliano Marsano
- Dipartimento di Biologia, Università degli Studi di Bari “Aldo Moro”, Via Orabona 4, 70125 Bari, Italy; (R.M.); (P.L.); (L.V.)
| |
Collapse
|
2
|
Yang W, Jiang C, Zhu Y, Chen K, Wang G, Yuan D, Miao W, Xiong J. Tetrahymena Comparative Genomics Database (TCGD): a community resource for Tetrahymena. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2019; 2019:5365189. [PMID: 30810209 PMCID: PMC6391650 DOI: 10.1093/database/baz029] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 02/04/2019] [Accepted: 02/06/2019] [Indexed: 11/12/2022]
Abstract
Ciliates are a large and diverse group of unicellular organisms characterized by having the following two distinct type of nuclei within a single cell: micronucleus (MIC) and macronucleus (MAC). Although the genomes of several ciliates in different groups have been sequenced, comparative genomics data for multiple species within a ciliate genus are not yet available. Here we collected the genome information and comparative genomics analysis results for 10 species in the Tetrahymena genus, including the previously sequenced model organism Tetrahymena thermophila and 9 newly sequenced species, and constructed a genus-level comparative analysis platform, the Tetrahymena Comparative Genomics Database (TCGD). Genome sequences, transcriptomic data, gene models, functional annotation, ortholog groups and synteny maps were built into this database and a user-friendly interface was developed for searching, visualizing and analyzing these data. In summary, the TCGD (http://ciliate.ihb.ac.cn) will be an important and useful resource for the ciliate research community.
Collapse
Affiliation(s)
- Wentao Yang
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Chuanqi Jiang
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Ying Zhu
- Nextomics Biosciences Institute, Wuhan, China
| | - Kai Chen
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Guangying Wang
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Dongxia Yuan
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Wei Miao
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.,University of Chinese Academy of Sciences, Beijing, China.,State Key Laboratory of Freshwater Ecology and Biotechnology, Wuhan, China.,CAS Center for Excellence in Animal Evolution and Genetics, Kunming, China
| | - Jie Xiong
- Key Laboratory of Aquatic Biodiversity and Conservation, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| |
Collapse
|
3
|
Benowitz KM, Coleman JM, Matzkin LM. Assessing the Architecture of Drosophila mojavensis Locomotor Evolution with Bulk Segregant Analysis. G3 (BETHESDA, MD.) 2019; 9:1767-1775. [PMID: 30926724 PMCID: PMC6505136 DOI: 10.1534/g3.119.400036] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Accepted: 03/22/2019] [Indexed: 11/24/2022]
Abstract
Behavior is frequently predicted to be especially important for evolution in novel environments. If these predictions are accurate, there might be particular patterns of genetic architecture associated with recently diverged behaviors. Specifically, it has been predicted that behaviors linked to population divergence should be underpinned by a few genes of relatively large effect, compared to architectures of intrapopulation behavioral variation, which is considered to be highly polygenic. More mapping studies of behavioral variation between recently diverged populations are needed to continue assessing the generality of these predictions. Here, we used a bulk segregant mapping approach to dissect the genetic architecture of a locomotor trait that has evolved between two populations of the cactophilic fly Drosophila mojavensis We created an F8 mapping population of 1,500 individuals from advanced intercross lines and sequenced the 10% of individuals with the highest and lowest levels of locomotor activity. Using three alternative statistical approaches, we found strong evidence for two relatively large-effect QTL that is localized in a region homologous to a region of densely packed behavior loci in Drosophila melanogaster, suggesting that clustering of behavior genes may display relatively deep evolutionary conservation. Broadly, our data are most consistent with a polygenic architecture, though with several loci explaining a high proportion of variation in comparison to similar behavioral traits. We further note the presence of several antagonistic QTL linked to locomotion and discuss these results in light of theories regarding behavioral evolution and the effect size and direction of QTL for diverging traits in general.
Collapse
Affiliation(s)
- Kyle M Benowitz
- Department of Entomology, University of Arizona, Tucson, AZ 85721
| | - Joshua M Coleman
- Department of Entomology, University of Arizona, Tucson, AZ 85721
- Department of Biological Sciences, University of Alabama in Huntsville, Huntsville AL 35899
| | - Luciano M Matzkin
- Department of Entomology, University of Arizona, Tucson, AZ 85721
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721
- BIO5 Institute, University of Arizona, Tucson, AZ 85721
| |
Collapse
|
4
|
Nie H, Liu X, Pan J, Li W, Li Z, Zhang S, Chen S, Miao X, Zheng N, Su S. Identification of genes related to high royal jelly production in the honey bee (Apis mellifera) using microarray analysis. Genet Mol Biol 2017; 40:781-789. [PMID: 28981563 PMCID: PMC5738612 DOI: 10.1590/1678-4685-gmb-2017-0013] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 05/04/2017] [Indexed: 01/11/2023] Open
Abstract
China is the largest royal jelly producer and exporter in the world, and high
royal jelly-yielding strains have been bred in the country for approximately
three decades. However, information on the molecular mechanism underlying high
royal jelly production is scarce. Here, a cDNA microarray was used to screen and
identify differentially expressed genes (DEGs) to obtain an overview on the
changes in gene expression levels between high and low royal jelly producing
bees. We developed a honey bee gene chip that covered 11,689 genes, and this
chip was hybridised with cDNA generated from RNA isolated from heads of nursing
bees. A total of 369 DEGs were identified between high and low royal jelly
producing bees. Amongst these DEGs, 201 (54.47%) genes were up-regulated,
whereas 168 (45.53%) were down-regulated in high royal jelly-yielding bees. Gene
ontology (GO) analyses showed that they are mainly involved in four key
biological processes, and pathway analyses revealed that they belong to a total
of 46 biological pathways. These results provide a genetic basis for further
studies on the molecular mechanisms involved in high royal jelly production.
Collapse
Affiliation(s)
- Hongyi Nie
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xiaoyan Liu
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiao Pan
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Wenfeng Li
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Zhiguo Li
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shaowu Zhang
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China.,Research School of Biology, College of Medicine, Biology and Environment, The Australian National University, Canberra, Australia
| | - Shenglu Chen
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Xiaoqing Miao
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Nenggan Zheng
- College of Animal Sciences, Zhejiang University, Hangzhou, China.,Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China
| | - Songkun Su
- College of Bee Science, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
5
|
Etges WJ, de Oliveira CC, Rajpurohit S, Gibbs AG. Effects of temperature on transcriptome and cuticular hydrocarbon expression in ecologically differentiated populations of desert Drosophila. Ecol Evol 2017; 7:619-637. [PMID: 28116058 PMCID: PMC5243788 DOI: 10.1002/ece3.2653] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 11/08/2016] [Indexed: 12/20/2022] Open
Abstract
We assessed the effects of temperature differences on gene expression using whole-transcriptome microarrays and cuticular hydrocarbon variation in populations of cactophilic Drosophila mojavensis. Four populations from Baja California and mainland Mexico and Arizona were each reared on two different host cacti, reared to sexual maturity on laboratory media, and adults were exposed for 12 hr to 15, 25, or 35°C. Temperature differences influenced the expression of 3,294 genes, while population differences and host plants affected >2,400 each in adult flies. Enriched, functionally related groups of genes whose expression changed at high temperatures included heat response genes, as well as genes affecting chromatin structure. Gene expression differences between mainland and peninsular populations included genes involved in metabolism of secondary compounds, mitochondrial activity, and tRNA synthases. Flies reared on the ancestral host plant, pitaya agria cactus, showed upregulation of genes involved in metabolism, while flies reared on organ pipe cactus had higher expression of DNA repair and chromatin remodeling genes. Population × environment (G × E) interactions had widespread effects on the transcriptome where population × temperature interactions affected the expression of >5,000 orthologs, and there were >4,000 orthologs that showed temperature × host plant interactions. Adults exposed to 35°C had lower amounts of most cuticular hydrocarbons than those exposed to 15 or 25°C, including abundant unsaturated alkadienes. For insects adapted to different host plants and climatic regimes, our results suggest that temperature shifts associated with climate change have large and significant effects on transcriptomes of genetically differentiated natural populations.
Collapse
Affiliation(s)
- William J. Etges
- Program in Ecology and Evolutionary BiologyDepartment of Biological SciencesUniversity of ArkansasFayettevilleAR 72701USA
| | - Cássia C. de Oliveira
- Program in Ecology and Evolutionary BiologyDepartment of Biological SciencesUniversity of ArkansasFayettevilleAR 72701USA
- Present address: Math and Science DivisionLyon CollegeBatesvilleAR72501USA
| | - Subhash Rajpurohit
- School of Life SciencesUniversity of NevadaLas VegasNV 89919USA
- Present address: Department of BiologyUniversity of PennsylvaniaPhiladelphiaPA19104USA
| | - Allen G. Gibbs
- School of Life SciencesUniversity of NevadaLas VegasNV 89919USA
| |
Collapse
|
6
|
Barah P, Bones AM. Multidimensional approaches for studying plant defence against insects: from ecology to omics and synthetic biology. JOURNAL OF EXPERIMENTAL BOTANY 2015; 66:479-93. [PMID: 25538257 DOI: 10.1093/jxb/eru489] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The biggest challenge for modern biology is to integrate multidisciplinary approaches towards understanding the organizational and functional complexity of biological systems at different hierarchies, starting from the subcellular molecular mechanisms (microscopic) to the functional interactions of ecological communities (macroscopic). The plant-insect interaction is a good model for this purpose with the availability of an enormous amount of information at the molecular and the ecosystem levels. Changing global climatic conditions are abruptly resetting plant-insect interactions. Integration of discretely located heterogeneous information from the ecosystem to genes and pathways will be an advantage to understand the complexity of plant-insect interactions. This review will present the recent developments in omics-based high-throughput experimental approaches, with particular emphasis on studying plant defence responses against insect attack. The review highlights the importance of using integrative systems approaches to study plant-insect interactions from the macroscopic to the microscopic level. We analyse the current efforts in generating, integrating and modelling multiomics data to understand plant-insect interaction at a systems level. As a future prospect, we highlight the growing interest in utilizing the synthetic biology platform for engineering insect-resistant plants.
Collapse
Affiliation(s)
- Pankaj Barah
- Cell Molecular Biology and Genomics Group, Department of Biology, Norwegian University of Science and Technology (NTNU), N 7491 Trondheim, Norway
| | - Atle M Bones
- Cell Molecular Biology and Genomics Group, Department of Biology, Norwegian University of Science and Technology (NTNU), N 7491 Trondheim, Norway
| |
Collapse
|
7
|
Dishaw LJ, Cannon JP, Litman GW, Parker W. Immune-directed support of rich microbial communities in the gut has ancient roots. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2014; 47:36-51. [PMID: 24984114 PMCID: PMC4146740 DOI: 10.1016/j.dci.2014.06.011] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Revised: 05/30/2014] [Accepted: 06/21/2014] [Indexed: 05/12/2023]
Abstract
The animal gut serves as a primary location for the complex host-microbe interplay that is essential for homeostasis and may also reflect the types of ancient selective pressures that spawned the emergence of immunity in metazoans. In this review, we present a phylogenetic survey of gut host-microbe interactions and suggest that host defense systems arose not only to protect tissue directly from pathogenic attack but also to actively support growth of specific communities of mutualists. This functional dichotomy resulted in the evolution of immune systems much more tuned for harmonious existence with microbes than previously thought, existing as dynamic but primarily cooperative entities in the present day. We further present the protochordate Ciona intestinalis as a promising model for studying gut host-bacterial dialogue. The taxonomic position, gut physiology and experimental tractability of Ciona offer unique advantages in dissecting host-microbe interplay and can complement studies in other model systems.
Collapse
Affiliation(s)
- Larry J Dishaw
- Department of Pediatrics, University of South Florida Morsani College of Medicine, USF/ACH Children's Research Institute, 140 7th Avenue South, St. Petersburg, FL 33701, USA.
| | - John P Cannon
- Department of Pediatrics, University of South Florida Morsani College of Medicine, USF/ACH Children's Research Institute, 140 7th Avenue South, St. Petersburg, FL 33701, USA
| | - Gary W Litman
- Department of Pediatrics, University of South Florida Morsani College of Medicine, USF/ACH Children's Research Institute, 140 7th Avenue South, St. Petersburg, FL 33701, USA; Department of Molecular Genetics, All Children's Hospital-Johns Hopkins Medicine, 501 6th Avenue South, St. Petersburg, FL 33701, USA
| | - William Parker
- Department of Surgery, Duke University Medical Center, Box 2605, Durham, NC 27710, USA
| |
Collapse
|
8
|
Posnien N, Zeng V, Schwager EE, Pechmann M, Hilbrant M, Keefe JD, Damen WGM, Prpic NM, McGregor AP, Extavour CG. A comprehensive reference transcriptome resource for the common house spider Parasteatoda tepidariorum. PLoS One 2014; 9:e104885. [PMID: 25118601 PMCID: PMC4132015 DOI: 10.1371/journal.pone.0104885] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2014] [Accepted: 07/17/2014] [Indexed: 12/12/2022] Open
Abstract
Parasteatoda tepidariorum is an increasingly popular model for the study of spider development and the evolution of development more broadly. However, fully understanding the regulation and evolution of P. tepidariorum development in comparison to other animals requires a genomic perspective. Although research on P. tepidariorum has provided major new insights, gene analysis to date has been limited to candidate gene approaches. Furthermore, the few available EST collections are based on embryonic transcripts, which have not been systematically annotated and are unlikely to contain transcripts specific to post-embryonic stages of development. We therefore generated cDNA from pooled embryos representing all described embryonic stages, as well as post-embryonic stages including nymphs, larvae and adults, and using Illumina HiSeq technology obtained a total of 625,076,514 100-bp paired end reads. We combined these data with 24,360 ESTs available in GenBank, and 1,040,006 reads newly generated from 454 pyrosequencing of a mixed-stage embryo cDNA library. The combined sequence data were assembled using a custom de novo assembly strategy designed to optimize assembly product length, number of predicted transcripts, and proportion of raw reads incorporated into the assembly. The de novo assembly generated 446,427 contigs with an N50 of 1,875 bp. These sequences obtained 62,799 unique BLAST hits against the NCBI non-redundant protein data base, including putative orthologs to 8,917 Drosophila melanogaster genes based on best reciprocal BLAST hit identity compared with the D. melanogaster proteome. Finally, we explored the utility of the transcriptome for RNA-Seq studies, and showed that this resource can be used as a mapping scaffold to detect differential gene expression in different cDNA libraries. This resource will therefore provide a platform for future genomic, gene expression and functional approaches using P. tepidariorum.
Collapse
Affiliation(s)
- Nico Posnien
- Johann-Friedrich-Blumenbach-Institute for Zoology and Anthropology, Department of Developmental Biology, Georg-August-University Göttingen, GZMB Ernst-Caspari-Haus, Göttingen, Germany
- * E-mail: (NP); (CGE)
| | - Victor Zeng
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Evelyn E. Schwager
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, United Kingdom
| | - Matthias Pechmann
- Cologne Biocenter, Institute of Developmental Biology, University of Cologne, Cologne, Germany
| | - Maarten Hilbrant
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, United Kingdom
| | - Joseph D. Keefe
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, United Kingdom
| | - Wim G. M. Damen
- Department of Genetics, Friedrich Schiller University Jena, Jena, Germany
| | - Nikola-Michael Prpic
- Johann-Friedrich-Blumenbach-Institute for Zoology and Anthropology, Department of Developmental Biology, Georg-August-University Göttingen, GZMB Ernst-Caspari-Haus, Göttingen, Germany
| | - Alistair P. McGregor
- Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, United Kingdom
| | - Cassandra G. Extavour
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- * E-mail: (NP); (CGE)
| |
Collapse
|
9
|
Kim AR, Martinez C, Ionides J, Ramos AF, Ludwig MZ, Ogawa N, Sharp DH, Reinitz J. Rearrangements of 2.5 kilobases of noncoding DNA from the Drosophila even-skipped locus define predictive rules of genomic cis-regulatory logic. PLoS Genet 2013; 9:e1003243. [PMID: 23468638 PMCID: PMC3585115 DOI: 10.1371/journal.pgen.1003243] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Accepted: 11/30/2012] [Indexed: 01/19/2023] Open
Abstract
Rearrangements of about 2.5 kilobases of regulatory DNA located 5′ of the transcription start site of the Drosophila even-skipped locus generate large-scale changes in the expression of even-skipped stripes 2, 3, and 7. The most radical effects are generated by juxtaposing the minimal stripe enhancers MSE2 and MSE3 for stripes 2 and 3 with and without small “spacer” segments less than 360 bp in length. We placed these fusion constructs in a targeted transformation site and obtained quantitative expression data for these transformants together with their controlling transcription factors at cellular resolution. These data demonstrated that the rearrangements can alter expression levels in stripe 2 and the 2–3 interstripe by a factor of more than 10. We reasoned that this behavior would place tight constraints on possible rules of genomic cis-regulatory logic. To find these constraints, we confronted our new expression data together with previously obtained data on other constructs with a computational model. The model contained representations of thermodynamic protein–DNA interactions including steric interference and cooperative binding, short-range repression, direct repression, activation, and coactivation. The model was highly constrained by the training data, which it described within the limits of experimental error. The model, so constrained, was able to correctly predict expression patterns driven by enhancers for other Drosophila genes; even-skipped enhancers not included in the training set; stripe 2, 3, and 7 enhancers from various Drosophilid and Sepsid species; and long segments of even-skipped regulatory DNA that contain multiple enhancers. The model further demonstrated that elevated expression driven by a fusion of MSE2 and MSE3 was a consequence of the recruitment of a portion of MSE3 to become a functional component of MSE2, demonstrating that cis-regulatory “elements” are not elementary objects. Metazoan genes, including those of humans, contain large noncoding regions that are required for viability. Sequence variations in these regions are statistically associated with human disease, but the mechanisms underlying these associations are not well understood. These regions regulate transcription and are frequently larger than the gene's transcript by an order of magnitude. In this paper we attempt to elucidate the regulatory code of these noncoding segments of DNA by means of quantitative spatially resolved gene expression data and a computational model. The expression data comes from the early embryo of the fruit fly Drosophila melanogaster. We chose a family of DNA constructs to analyze that drive very different patterns of expression when very small changes in DNA sequence are made, reasoning that this sensitivity would reveal important properties of the regulatory code. The model reproduced the training data with precision greater than the expected accuracy of the training data itself. It was able to correctly predict from DNA sequence the expression of 44 segments of DNA from many genes and species.
Collapse
Affiliation(s)
- Ah-Ram Kim
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, New York, United States of America
| | - Carlos Martinez
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - John Ionides
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Alexandre F. Ramos
- Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, São Paulo, Brazil
| | - Michael Z. Ludwig
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Nobuo Ogawa
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
| | - David H. Sharp
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - John Reinitz
- Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Statistics, Department of Molecular Genetics and Cell Biology, and Institute of Genomics and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- * E-mail:
| |
Collapse
|
10
|
Li J, Hobman TC, Simmonds AJ. Gawky (GW) is the Drosophila melanogaster GW182 homologue. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 768:127-45. [PMID: 23224968 DOI: 10.1007/978-1-4614-5107-5_8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Jing Li
- Department of Cell Biology, University of Alberta, Edmonton, Canada.
| | | | | |
Collapse
|
11
|
Woodcock MR. Nested Hierarchal Organization of Conservation for MicroRNAs and Their Putative Targets to Drosophila melanogaster. Chem Biodivers 2012; 9:945-64. [DOI: 10.1002/cbdv.201100358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
12
|
Calvete O, González J, Betrán E, Ruiz A. Segmental duplication, microinversion, and gene loss associated with a complex inversion breakpoint region in Drosophila. Mol Biol Evol 2012; 29:1875-89. [PMID: 22328714 DOI: 10.1093/molbev/mss067] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ~13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ~9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics.
Collapse
Affiliation(s)
- Oriol Calvete
- Departament de Genètica i de Microbiologia, Facultat de Biociències, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
| | | | | | | |
Collapse
|
13
|
Dojer N, Biecek P, Tiuryn J. Bi-billboard: symmetrization and careful choice of informant species results in higher accuracy of regulatory element prediction. J Comput Biol 2011; 18:809-19. [PMID: 21563976 DOI: 10.1089/cmb.2010.0299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The identification of cis-regulatory modules (CRM) is one of the most important problems towards the understanding of transcriptional regulation in higher eukaryotes. Computational methods for CRM detection are gaining importance due to the availability of genomic data on one side, and costs and difficulties of experimental methods on the other side. One of proposed approaches, called Billboard, predicts CRMs based on the location of transcription factor binding sites in an analyzed sequence and a related one in so-called informant species. In the present article, we show how to combine information obtained in two symmetric runs (on the sequence of interest and on the related one) of the Billboard tool. In a series of experiments on data from various organisms, we show that the predictive power of our symmetric approach is significantly higher than the power of the one-way approach of Billboard. Moreover, we show that the evolutionary distance between organisms considerably influences the quality of prediction and we provide guidelines on the choice of an informant species.
Collapse
Affiliation(s)
- Norbert Dojer
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland.
| | | | | |
Collapse
|
14
|
Song X, Goicoechea JL, Ammiraju JSS, Luo M, He R, Lin J, Lee SJ, Sisneros N, Watts T, Kudrna DA, Golser W, Ashley E, Collura K, Braidotti M, Yu Y, Matzkin LM, McAllister BF, Markow TA, Wing RA. The 19 genomes of Drosophila: a BAC library resource for genus-wide and genome-scale comparative evolutionary research. Genetics 2011; 187:1023-30. [PMID: 21321134 PMCID: PMC3070512 DOI: 10.1534/genetics.111.126540] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2011] [Accepted: 02/05/2011] [Indexed: 11/18/2022] Open
Abstract
The genus Drosophila has been the subject of intense comparative phylogenomics characterization to provide insights into genome evolution under diverse biological and ecological contexts and to functionally annotate the Drosophila melanogaster genome, a model system for animal and insect genetics. Recent sequencing of 11 additional Drosophila species from various divergence points of the genus is a first step in this direction. However, to fully reap the benefits of this resource, the Drosophila community is faced with two critical needs: i.e., the expansion of genomic resources from a much broader range of phylogenetic diversity and the development of additional resources to aid in finishing the existing draft genomes. To address these needs, we report the first synthesis of a comprehensive set of bacterial artificial chromosome (BAC) resources for 19 Drosophila species from all three subgenera. Ten libraries were derived from the exact source used to generate 10 of the 12 draft genomes, while the rest were generated from a strategically selected set of species on the basis of salient ecological and life history features and their phylogenetic positions. The majority of the new species have at least one sequenced reference genome for immediate comparative benefit. This 19-BAC library set was rigorously characterized and shown to have large insert sizes (125-168 kb), low nonrecombinant clone content (0.3-5.3%), and deep coverage (9.1-42.9×). Further, we demonstrated the utility of this BAC resource for generating physical maps of targeted loci, refining draft sequence assemblies and identifying potential genomic rearrangements across the phylogeny.
Collapse
Affiliation(s)
- Xiang Song
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Jose Luis Goicoechea
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Jetty S. S. Ammiraju
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Meizhong Luo
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Ruifeng He
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Jinke Lin
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - So-Jeong Lee
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Nicholas Sisneros
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Tom Watts
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - David A. Kudrna
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Wolfgang Golser
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Elizabeth Ashley
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Kristi Collura
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Michele Braidotti
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Yeisoo Yu
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Luciano M. Matzkin
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Bryant F. McAllister
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Therese Ann Markow
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| | - Rod A. Wing
- Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, Division of Biological Sciences, University of California at San Diego, La Jolla, California 92093 and Department of Biology, University of Iowa, Iowa City, Iowa 52242
| |
Collapse
|
15
|
Etges WJ, De Oliveira CC, Noor MAF, Ritchie MG. Genetics of incipient speciation in Drosophila mojavensis. III. Life-history divergence in allopatry and reproductive isolation. Evolution 2011; 64:3549-69. [PMID: 20681983 DOI: 10.1111/j.1558-5646.2010.01096.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
We carried out a three-tiered genetic analysis of egg-to-adult development time and viability in ancestral and derived populations of cactophilic Drosophila mojavensis to test the hypothesis that evolution of these life-history characters has shaped premating reproductive isolation in this species. First, a common garden experiment with 11 populations from Baja California and mainland Mexico and Arizona reared on two host species revealed significant host plant X region and population interactions for viability and development time, evidence for host plant adaptation. Second, replicated line crosses with flies reared on both hosts revealed autosomal, X chromosome, cytoplasmic, and autosome X cactus influences on development time. Viability differences were influenced by host plants, autosomal dominance, and X chromosomal effects. Many of the F(1) , F(2) , and backcross generations showed evidence of heterosis for viability. Third, a QTL analysis of male courtship song and epicuticular hydrocarbon variation based on 1688 Baja × mainland F(2) males also revealed eight QTL influencing development time differences. Mainland alleles at six of these loci were associated with longer development times, consistent with population-level differences. Eight G × E interactions were also detected caused by longer development times of mainland alleles expressed on a mainland host with smaller differences among Baja genotypes reared on the Baja host plant. Four QTL influenced both development time and epicuticular hydrocarbon differences associated with courtship success, and there was a significant QTL-based correlation between development time and cuticular hydrocarbon variation. Thus, the regional shifts in life histories that evolved once D. mojavensis invaded mainland Mexico from Baja California by shifting host plants were genetically correlated with variation in cuticular hydrocarbon-based mate preferences.
Collapse
Affiliation(s)
- William J Etges
- Program in Ecology and Evolutionary Biology, Department of Biological Sciences, University of Arkansas, Fayetteville, Arizona 72701, USA.
| | | | | | | |
Collapse
|
16
|
Gabriško M, Janeček Š. Characterization of Maltase Clusters in the Genus Drosophila. J Mol Evol 2010; 72:104-18. [DOI: 10.1007/s00239-010-9406-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2010] [Accepted: 10/27/2010] [Indexed: 11/28/2022]
|
17
|
Lehmann J, Eisenhardt C, Stadler PF, Krauss V. Some novel intron positions in conserved Drosophila genes are caused by intron sliding or tandem duplication. BMC Evol Biol 2010; 10:156. [PMID: 20500887 PMCID: PMC2891723 DOI: 10.1186/1471-2148-10-156] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2009] [Accepted: 05/26/2010] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Positions of spliceosomal introns are often conserved between remotely related genes. Introns that reside in non-conserved positions are either novel or remnants of frequent losses of introns in some evolutionary lineages. A recent gain of such introns is difficult to prove. However, introns verified as novel are needed to evaluate contemporary processes of intron gain. RESULTS We identified 25 unambiguous cases of novel intron positions in 31 Drosophila genes that exhibit near intron pairs (NIPs). Here, a NIP consists of an ancient and a novel intron position that are separated by less than 32 nt. Within a single gene, such closely-spaced introns are very unlikely to have coexisted. In most cases, therefore, the ancient intron position must have disappeared in favour of the novel one. A survey for NIPs among 12 Drosophila genomes identifies intron sliding (migration) as one of the more frequent causes of novel intron positions. Other novel introns seem to have been gained by regional tandem duplications of coding sequences containing a proto-splice site. CONCLUSIONS Recent intron gains sometimes appear to have arisen by duplication of exonic sequences and subsequent intronization of one of the copies. Intron migration and exon duplication together may account for a significant amount of novel intron positions in conserved coding sequences.
Collapse
Affiliation(s)
- Jörg Lehmann
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, 04107 Leipzig, Germany
| | | | | | | |
Collapse
|
18
|
Kandul NP, Noor MAF. Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3. BMC Genet 2009; 10:67. [PMID: 19840385 PMCID: PMC2767349 DOI: 10.1186/1471-2156-10-67] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Accepted: 10/19/2009] [Indexed: 01/12/2023] Open
Abstract
Background Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution.
Collapse
Affiliation(s)
- Nikolai P Kandul
- Biology Department, Duke University, PO Box 90338, FFSC 4244, Durham, NC 27708, USA.
| | | |
Collapse
|
19
|
Walters KB, Grant P, Johnson DLE. Evolution of the GST omega gene family in 12 Drosophila species. ACTA ACUST UNITED AC 2009; 100:742-53. [PMID: 19608790 DOI: 10.1093/jhered/esp043] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Gene families provide a unique system to study the evolutionary relationships between related genes both within and between organisms. We can ascertain whether members of a gene family in different species are orthologs or paralogs. We may also search for evidence that may explain why duplicate genes are present. The availability of genome sequences for 12 Drosophila species allows us to address these questions with respect to the evolution of one gene family, the glutathione S transferase (GST) omega class. This gene family is of particular interest because of its relationship with human disease and its presence in a wide range of species.
Collapse
Affiliation(s)
- Kathryn B Walters
- Department of Biological Sciences, The George Washington University, Washington, DC 20052, USA
| | | | | |
Collapse
|
20
|
Johnson LA, Zhao Y, Golden K, Barolo S. Reverse-engineering a transcriptional enhancer: a case study in Drosophila. Tissue Eng Part A 2009; 14:1549-59. [PMID: 18687053 DOI: 10.1089/ten.tea.2008.0074] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Enhancers, or cis-regulatory elements, are the principal determinants of spatiotemporal patterning of gene expression. For reasons of clinical and research utility, it is desirable to build customized enhancers that drive novel gene expression patterns, but currently, we largely rely on "found" genomic elements. Synthetic enhancers, assembled from transcription factor binding sites taken from natural signal-regulated enhancers, generally fail to behave like their wild-type counterparts when placed in transgenic animals, suggesting that important aspects of enhancer function are still unexplored. As a step toward the creation of a truly synthetic regulatory element, we have undertaken an extensive structure-function study of an enhancer of the Drosophila decapentaplegic (dpp) gene that drives expression in the developing visceral mesoderm (VM). Although considerable past efforts have been made to dissect the dppVM enhancer, transgenic experiments presented here indicate that its activity cannot be explained by the known regulators alone. dppVM contains multiple, previously uncharacterized, regulatory sites, some of which exhibit functional redundancy. The results presented here suggest that even the best-studied enhancers must be further dissected before they can be fully understood, and before faithful synthetic elements based on them can be created. Implications for developmental genetics, mathematical modeling, and therapeutic applications are discussed.
Collapse
Affiliation(s)
- Lisa A Johnson
- Department of Cell and Developmental Biology, University of Michigan Medical School, Ann Arbor, Michigan, USA
| | | | | | | |
Collapse
|
21
|
Murakami K, Toyoda A, Hattori M, Kuroki Y, Fujiyama A, Kojima T, Matsuda M, Sakaki Y, Yamamoto MT. BAC library construction and BAC end sequencing of five Drosophila species: the comparative map with the D. melanogaster genome. Genes Genet Syst 2008; 83:245-56. [PMID: 18670136 DOI: 10.1266/ggs.83.245] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We constructed and characterized arrayed bacterial artificial chromosome (BAC) libraries of five Drosophila species (D. melanogaster, D. simulans, D. sechellia, D. auraria, and D. ananassae), which are genetically well characterized in the studies of meiosis, evolution, population genetics, and developmental biology. The BAC libraries comprise 8,000 to 12,500 clones for each species, estimated to cover the most of the genomes. We sequenced both ends of most of these BAC clones with a success rate of 91%. Of these, 53,701 clones consisting of non-repetitive BAC end sequences (BESs) were mapped with reference of the public D. melanogaster genome sequences. The BES mapping estimated that the BAC libraries of D. auraria and D. ananassae covered 47% and 57% of the D. melanogaster genome, respectively, and those of D. melanogaster, D. sechellia, and D. simulans covered 94-97%. The low coverage by BESs of D. auraria and D. ananassae may be due to the high sequence divergence with D. melanogaster. From the comparative BES mapping, 111 possible breakpoints of chromosomal rearrangements were identified in these four species. The breakpoints of the major chromosome rearrangement between D. simulans and D. melanogaster on the third chromosome were determined within 20 kb in 84E and 30 kb in 93E/F. Corresponding breakpoints were also identified in D. sechellia. The BAC clones described here will be an important addition to the Drosophila genomic resources.
Collapse
|
22
|
Wegener C, Gorbashov A. Molecular evolution of neuropeptides in the genus Drosophila. Genome Biol 2008; 9:R131. [PMID: 18717992 PMCID: PMC2575521 DOI: 10.1186/gb-2008-9-8-r131] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2008] [Revised: 07/24/2008] [Accepted: 08/21/2008] [Indexed: 11/10/2022] Open
Abstract
The first genomic and chemical characterization of fruit fly neuropeptides outside Drosophila melanogaster provides insights into the evolution of the neuropeptidome in this genus. Background Neuropeptides comprise the most diverse group of neuronal signaling molecules. They often occur as multiple sequence-related copies within single precursors (the prepropeptides). These multiple sequence-related copies have not arisen by gene duplication, and it is debated whether they are mutually redundant or serve specific functions. The fully sequenced genomes of 12 Drosophila species provide a unique opportunity to study the molecular evolution of neuropeptides. Results We data-mined the 12 Drosophila genomes for homologs of neuropeptide genes identified in Drosophila melanogaster. We then predicted peptide precursors and the neuropeptidome, and biochemically identified about half of the predicted peptides by direct mass spectrometric profiling of neuroendocrine tissue in four species covering main phylogenetic lines of Drosophila. We found that all species have an identical neuropeptidome and peptide hormone complement. Calculation of amino acid distances showed that ortholog peptide copies are highly sequence-conserved between species, whereas the observed sequence variability between peptide copies within single precursors must have occurred prior to the divergence of the Drosophila species. Conclusion We provide a first genomic and chemical characterization of fruit fly neuropeptides outside D. melanogaster. Our results suggest that neuropeptides including multiple peptide copies are under stabilizing selection, which suggests that multiple peptide copies are functionally important and not dispensable. The last common ancestor of Drosophila obviously had a set of neuropeptides and peptide hormones identical to that of modern fruit flies. This is remarkable, since drosophilid flies have adapted to very different environments.
Collapse
Affiliation(s)
- Christian Wegener
- Emmy Noether Neuropeptide Group, Animal Physiology, Department of Biology, Philipps-University, Karl-von-Frisch-Strasse, D-35032 Marburg, Germany.
| | | |
Collapse
|
23
|
Sequence signatures of a recent chromosomal rearrangement in Drosophila mojavensis. Genetica 2008; 136:5-11. [PMID: 18661244 DOI: 10.1007/s10709-008-9296-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2008] [Accepted: 07/12/2008] [Indexed: 12/22/2022]
Abstract
The X-chromosome inversion, Xe, distinguishes Drosophila mojavensis and D. arizonae. Earlier work mapped the breakpoints of this inversion to large intervals and provided hypotheses for the locations of the breakpoints within 3000-bp intergenic regions on the D. mojavensis genome sequence assembly. Here, we sequenced these regions directly in the putatively ancestral D. arizonae X-chromosome. We find that the two inversion breakpoints are near an inverted gene duplication and a common repetitive element, respectively, and these features were likely present in the non-inverted ancestral chromosome on the D. mojavensis lineage. Contrary to an earlier hypothesis, the inverted gene duplication appears to predate the inversion. We find no sequence similarity between the breakpoint regions in the D. mojavensis ancestor, excluding an ectopic-exchange model of chromosome rearrangements. We also found no evidence that staggered single-strand breaks caused the inversion. We suggest these features may have contributed to the chromosomal breakages resulting in this inversion.
Collapse
|
24
|
Gregory TR, Johnston JS. Genome size diversity in the family Drosophilidae. Heredity (Edinb) 2008; 101:228-38. [DOI: 10.1038/hdy.2008.49] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
|
25
|
GARDINER ANASTASIA, BARKER DANIEL, BUTLIN ROGERK, JORDAN WILLIAMC, RITCHIE MICHAELG. Drosophilachemoreceptor gene evolution: selection, specialization and genome size. Mol Ecol 2008; 17:1648-57. [DOI: 10.1111/j.1365-294x.2008.03713.x] [Citation(s) in RCA: 99] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
26
|
Ahnert SE, Fink TMA, Zinovyev A. How much non-coding DNA do eukaryotes require? J Theor Biol 2008; 252:587-92. [PMID: 18384817 DOI: 10.1016/j.jtbi.2008.02.005] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2007] [Revised: 02/05/2008] [Accepted: 02/05/2008] [Indexed: 11/27/2022]
Abstract
Despite tremendous advances in the field of genomics, the amount and function of the large non-coding part of the genome in higher organisms remains poorly understood. Here we report an observation, made for 37 fully sequenced eukaryotic genomes, which indicates that eukaryotes require a certain minimum amount of non-coding DNA (ncDNA). This minimum increases quadratically with the amount of DNA located in exons. Based on a simple model of the growth of regulatory networks, we derive a theoretical prediction of the required quantity of ncDNA and find it to be in excellent agreement with the data. The amount of additional ncDNA (in basepairs) which eukaryotes require obeys N(DEF)=1/2 (N(C)/N(P)) (N(C)-N(P)), where N(C) is the amount of exonic DNA, and N(P) is a constant of about 10 Mb. This value N(DEF) corresponds to a few percent of the genome in Homo sapiens and other mammals, and up to half the genome in simpler eukaryotes. Thus, our findings confirm that eukaryotic life depends on a substantial fraction of ncDNA and also make a prediction of the size of this fraction, which matches the data closely.
Collapse
|
27
|
Divergence between the Drosophila pseudoobscura and D. persimilis genome sequences in relation to chromosomal inversions. Genetics 2008; 177:1417-28. [PMID: 18039875 DOI: 10.1534/genetics.107.070672] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
As whole-genome sequence assemblies accumulate, a challenge is to determine how these can be used to address fundamental evolutionary questions, such as inferring the process of speciation. Here, we use the sequence assemblies of Drosophila pseudoobscura and D. persimilis to test hypotheses regarding divergence with gene flow. We observe low differentiation between the two genome sequences in pericentromeric and peritelomeric regions. We interpret this result as primarily a remnant of the correlation between levels of variation and local recombination rate observed within populations. However, we also observe lower differentiation far from the fixed chromosomal inversions distinguishing these species and greater differentiation within and near these inversions. This finding is consistent with models suggesting that chromosomal inversions facilitate species divergence despite interspecies gene flow. We also document heterogeneity among the inverted regions in their degree of differentiation, suggesting temporal differences in the origin of each inverted region consistent with the inversions arising during a process of divergence with gene flow. While this study provides insights into the speciation process using two single-genome sequences, it was informed by lower throughput but more rigorous examinations of polymorphism and divergence. This reliance highlights the need for complementary genomic and population genetic approaches for tackling fundamental evolutionary questions such as speciation.
Collapse
|
28
|
Abstract
Over the course of the past century, flies in the family Drosophilidae have been important models for understanding genetic, developmental, cellular, ecological, and evolutionary processes. Full genome sequences from a total of 12 species promise to extend this work by facilitating comparative studies of gene expression, of molecules such as proteins, of developmental mechanisms, and of ecological adaptation. Here we review basic biological and ecological information of the species whose genomes have recently been completely sequenced in the context of current research.
Collapse
|
29
|
Choi JH, Kim S, Tang H, Andrews J, Gilbert DG, Colbourne JK. A machine-learning approach to combined evidence validation of genome assemblies. ACTA ACUST UNITED AC 2008; 24:744-50. [PMID: 18204064 DOI: 10.1093/bioinformatics/btm608] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
MOTIVATION While it is common to refer to 'the genome sequence' as if it were a single, complete and contiguous DNA string, it is in fact an assembly of millions of small, partially overlapping DNA fragments. Sophisticated computer algorithms (assemblers and scaffolders) merge these DNA fragments into contigs, and place these contigs into sequence scaffolds using the paired-end sequences derived from large-insert DNA libraries. Each step in this automated process is susceptible to producing errors; hence, the resulting draft assembly represents (in practice) only a likely assembly that requires further validation. Knowing which parts of the draft assembly are likely free of errors is critical if researchers are to draw reliable conclusions from the assembled sequence data. RESULTS We develop a machine-learning method to detect assembly errors in sequence assemblies. Several in silico measures for assembly validation have been proposed by various researchers. Using three benchmarking Drosophila draft genomes, we evaluate these techniques along with some new measures that we propose, including the good-minus-bad coverage (GMB), the good-to-bad-ratio (RGB), the average Z-score (AZ) and the average absolute Z-score (ASZ). Our results show that the GMB measure performs better than the others in both its sensitivity and its specificity for assembly error detection. Nevertheless, no single method performs sufficiently well to reliably detect genomic regions requiring attention for further experimental verification. To utilize the advantages of all these measures, we develop a novel machine learning approach that combines these individual measures to achieve a higher prediction accuracy (i.e. greater than 90%). Our combined evidence approach avoids the difficult and often ad hoc selection of many parameters the individual measures require, and significantly improves the overall precisions on the benchmarking data sets.
Collapse
Affiliation(s)
- Jeong-Hyeon Choi
- The Center for Genomics and Bioinformatics, School of Informatics and Department of Biology, Indiana University, IN 47405, USA
| | | | | | | | | | | |
Collapse
|
30
|
Gallach M, Arnau V, Marín I. Global patterns of sequence evolution in Drosophila. BMC Genomics 2007; 8:408. [PMID: 17996078 PMCID: PMC2180185 DOI: 10.1186/1471-2164-8-408] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 11/09/2007] [Indexed: 01/30/2023] Open
Abstract
Background Sequencing of the genomes of several Drosophila allows for the first precise analyses of how global sequence patterns change among multiple, closely related animal species. A basic question is whether there are characteristic features that differentiate chromosomes within a species or between different species. Results We explored the euchromatin of the chromosomes of seven Drosophila species to establish their global patterns of DNA sequence diversity. Between species, differences in the types and amounts of simple sequence repeats were found. Within each species, the autosomes have almost identical oligonucleotide profiles. However, X chromosomes and autosomes have, in all species, a qualitatively different composition. The X chromosomes are less complex than the autosomes, containing both a higher amount of simple DNA sequences and, in several cases, chromosome-specific repetitive sequences. Moreover, we show that the right arm of the X chromosome of Drosophila pseudoobscura, which evolved from an autosome 10 – 18 millions of years ago, has a composition which is identical to that of the original, left arm of the X chromosome. Conclusion The consistent differences among species, differences among X chromosomes and autosomes and the convergent evolution of X and neo-X chromosomes demonstrate that strong forces are acting on drosophilid genomes to generate peculiar chromosomal landscapes. We discuss the relationships of the patterns observed with differential recombination and mutation rates and with the process of dosage compensation.
Collapse
Affiliation(s)
- Miguel Gallach
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas (IBV-CSIC), Valencia, Spain.
| | | | | |
Collapse
|
31
|
Villasante A, Abad JP, Planelló R, Méndez-Lago M, Celniker SE, de Pablos B. Drosophila telomeric retrotransposons derived from an ancestral element that was recruited to replace telomerase. Genome Res 2007; 17:1909-18. [PMID: 17989257 DOI: 10.1101/gr.6365107] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Drosophila telomeres do not have arrays of simple telomerase-generated G-rich repeats. Instead, Drosophila maintains its telomeres by occasional transposition of specific non-long terminal repeat (non-LTR) retrotransposons to chromosome ends. The genus Drosophila provides a superb model system for comparative telomere analysis. Here we present an evolutionary study of Drosophila telomeric elements to ascertain the significance of telomeric retrotransposons (TRs) in the maintenance of Drosophila telomeres. PCR and in silico surveys in the sibling species of Drosophila melanogaster and in more distantly related species show that multiple TRs maintain telomeres in Drosophila. In addition to TRs with two open reading frames (ORFs) capable of autonomous transposition, there are deleted telomeric retrotransposons that have lost their ORF2, which we refer to as half telomeric-retrotransposons (HTRs). The phylogenetic relationship among these telomeric elements is congruent with the phylogeny of the species, suggesting that they have been vertically inherited from a common ancestor. Our results suggest that an existing non-LTR retrotransposon was recruited to perform the cellular function of telomere maintenance.
Collapse
Affiliation(s)
- Alfredo Villasante
- Centro de Biología Molecular Severo Ochoa, Cantoblanco, 28049 Madrid, Spain.
| | | | | | | | | | | |
Collapse
|
32
|
Sawamura K, Zhi H, Setoguchi K, Yamada H, Miyo T, Matsuda M, Oguma Y. Genetic analysis of female mating recognition between Drosophila ananassae and Drosophila pallidosa: application of interspecific mosaic genome lines. Genetica 2007; 133:179-85. [PMID: 17768595 DOI: 10.1007/s10709-007-9198-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2007] [Accepted: 08/18/2007] [Indexed: 10/22/2022]
Abstract
Drosophila ananassae and Drosophila pallidosa are closely related species that can produce viable and fertile hybrids of both sexes, although strong sexual isolation exists between the two species. Females are thought to discriminate conspecific from heterospecific males based on their courtship songs. The genetic basis of female discrimination behavior was analyzed using isogenic females from interspecific mosaic genome lines that carry homozygous recombinant chromosomes. Multiple regression analysis indicated a highly significant effect of the left arm of chromosome 2 (2L) on the willingness of females to mate with D. ananassae males. Not only 2L but also the left arm of chromosome X (XL) and the right arm of chromosome 3 (3R) had significant effects on the females' willingness to mate with D. pallidosa males. All regions with strong effects on mate choice have chromosome arrangements characterized by species-specific inversions. Heterospecific combinations of 2L and 3R have previously been suggested to cause postzygotic reproductive isolation. Thus, genes involved in premating as well as postmating isolation are located in or near chromosomal inversions. This conclusion is consistent with the recently proposed hypothesis that "speciation genes" accumulate at a higher rate in non-recombining genome regions when species divergence occurs in the presence of gene flow.
Collapse
Affiliation(s)
- Kyoichi Sawamura
- Graduate School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 305-8572, Japan.
| | | | | | | | | | | | | |
Collapse
|
33
|
Langille MGI, Clark DV. Parent genes of retrotransposition-generated gene duplicates in Drosophila melanogaster have distinct expression profiles. Genomics 2007; 90:334-43. [PMID: 17628393 DOI: 10.1016/j.ygeno.2007.06.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2006] [Revised: 05/26/2007] [Accepted: 06/05/2007] [Indexed: 01/12/2023]
Abstract
Genes arising by retrotransposition are always different from their parent genes from the outset. In addition, the cDNA must insert into a region that allows expression or it will become a processed pseudogene. We sought to determine whether this class of gene duplication differs from other gene duplications based on functional criteria. Using amino acid sequences from Drosophila melanogaster, we identified retroduplicated gene pairs at various levels of sequence identity. Analysis of gene ontology annotations showed some enrichment of retroduplications in the cellular physiological processes class. Retroduplications show a higher level of nucleotide substitution than other gene duplications, suggesting a higher rate of divergence. Remarkably, analysis of microarray data for gene expression during embryogenesis showed that parent genes are more highly expressed relative to their retroduplicated copies, tandem duplications, and all genes. Furthermore, an expressed sequence tag library representation shows a broader distribution for parent genes than for all other genes and, as found previously by others, retroduplicated gene transcripts are found most abundantly in testes. Therefore, in examining retroduplicated gene pairs, we have found that parent genes of retroduplications are also a distinctive class in terms of transcript expression levels and distribution.
Collapse
Affiliation(s)
- Morgan G I Langille
- Department of Biology, University of New Brunswick, Fredericton, Canada NB E3B 6E1
| | | |
Collapse
|
34
|
Caldwell JC, Fineberg SK, Eberl DF. reduced ocelli encodes the leucine rich repeat protein Pray For Elves in Drosophila melanogaster. Fly (Austin) 2007; 1:146-52. [PMID: 18820435 DOI: 10.4161/fly.4562] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The ocelli are three simple photoreceptors on the vertex of the fruit fly head. We sought to identify the gene encoded by the classical ocellar mutant, reduced ocelli (rdo). Deficiency and inversion breakpoint mapping and P-element induced male recombination analyses were performed and Pray For Elves (PFE; CG15151; Fbgn0032661) emerged as a promising candidate for the rdo phenotype. The PFE locus maps to polytene region 36E on chromosome 2L between elfless (Fbgn0032660) and Arrestin 1 (Fbgn0000120). FlyBase annotation predicts that PFE encodes a serine/threonine kinase, yet protein prediction programs revealed no kinase domain. These analyses suggest that PFE simply encodes a leucine rich repeat molecule of unknown function, but presumably functions in nervous system protein-protein interaction. Two classical spontaneous alleles of rdo, rdo(1) and rdo(2), were characterized and the underlying mutations result from a small deletion spanning exon 1/intron 1 and a B104/roo insertion into the 3'UTR of PFE, respectively. Transposase-mediated excisions of several P-elements inserted into the PFE locus revert the rdo phenotype and a full-length PFE cDNA is sufficient to rescue rdo. A Gal4 enhancer trap reveals a broad adult neural expression pattern for PFE. Our identification and initial characterization of the rdo locus will contribute to the understanding of neurogenesis and neural development in the simple photoreceptors of the Drosophila visual system.
Collapse
Affiliation(s)
- Jason C Caldwell
- Department of Biological Sciences, University of Iowa, Iowa City, Iowa 52242-1324, USA
| | | | | |
Collapse
|
35
|
Machado CA, Haselkorn TS, Noor MAF. Evaluation of the genomic extent of effects of fixed inversion differences on intraspecific variation and interspecific gene flow in Drosophila pseudoobscura and D. persimilis. Genetics 2006; 175:1289-306. [PMID: 17179068 PMCID: PMC1840060 DOI: 10.1534/genetics.106.064758] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
There is increasing evidence that chromosomal inversions may facilitate the formation or persistence of new species by allowing genetic factors conferring species-specific adaptations or reproductive isolation to be inherited together and by reducing or eliminating introgression. However, the genomic domain of influence of the inverted regions on introgression has not been carefully studied. Here, we present a detailed study on the consequences that distance from inversion breakpoints has had on the inferred level of gene flow and divergence between Drosophila pseudoobscura and D. persimilis. We identified the locations of the inversion breakpoints distinguishing D. pseudoobscura and D. persimilis in chromosomes 2, XR, and XL. Population genetic data were collected at specific distances from the inversion breakpoints of the second chromosome and at two loci inside the XR and XL inverted regions. For loci outside the inverted regions, we found that distance from the nearest inversion breakpoint had a significant effect on several measures of divergence and gene flow between D. pseudoobscura and D. persimilis. The data fitted a logarithmic relationship, showing that the suppression of crossovers in inversion heterozygotes also extends to loci located outside the inversion but close to it (within 1-2 Mb). Further, we detected a significant reduction in nucleotide variation inside the inverted second chromosome region of D. persimilis and near one breakpoint, consistent with a scenario in which this inversion arose and was fixed in this species by natural selection.
Collapse
Affiliation(s)
- Carlos A Machado
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | |
Collapse
|
36
|
Allen JE, Salzberg SL. A phylogenetic generalized hidden Markov model for predicting alternatively spliced exons. Algorithms Mol Biol 2006; 1:14. [PMID: 16934144 PMCID: PMC1570466 DOI: 10.1186/1748-7188-1-14] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2006] [Accepted: 08/25/2006] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND An important challenge in eukaryotic gene prediction is accurate identification of alternatively spliced exons. Functional transcripts can go undetected in gene expression studies when alternative splicing only occurs under specific biological conditions. Non-expression based computational methods support identification of rarely expressed transcripts. RESULTS A non-expression based statistical method is presented to annotate alternatively spliced exons using a single genome sequence and evidence from cross-species sequence conservation. The computational method is implemented in the program ExAlt and an analysis of prediction accuracy is given for Drosophila melanogaster. CONCLUSION ExAlt identifies the structure of most alternatively spliced exons in the test set and cross-species sequence conservation is shown to improve the precision of predictions. The software package is available to run on Drosophila genomes to search for new cases of alternative splicing.
Collapse
Affiliation(s)
- Jonathan E Allen
- Center for Bioinformatics and Computational Biology, University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742, USA
- Department of Computer Science, Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Steven L Salzberg
- Center for Bioinformatics and Computational Biology, University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742, USA
- Department of Computer Science, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|