1
|
Auvinet J, Graça P, Ghigliotti L, Pisano E, Dettaï A, Ozouf-Costaz C, Higuet D. Insertion Hot Spots of DIRS1 Retrotransposon and Chromosomal Diversifications among the Antarctic Teleosts Nototheniidae. Int J Mol Sci 2019; 20:ijms20030701. [PMID: 30736325 PMCID: PMC6387122 DOI: 10.3390/ijms20030701] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2018] [Revised: 01/30/2019] [Accepted: 02/03/2019] [Indexed: 12/17/2022] Open
Abstract
By their faculty to transpose, transposable elements are known to play a key role in eukaryote genomes, impacting both their structuration and remodeling. Their integration in targeted sites may lead to recombination mechanisms involved in chromosomal rearrangements. The Antarctic fish family Nototheniidae went through several waves of species radiations. It is a suitable model to study transposable element (TE)-mediated mechanisms associated to genome and chromosomal diversifications. After the characterization of Gypsy (GyNoto), Copia (CoNoto), and DIRS1 (YNoto) retrotransposons in the genomes of Nototheniidae (diversity, distribution, conservation), we focused on their chromosome location with an emphasis on the three identified nototheniid radiations (the Trematomus, the plunderfishes, and the icefishes). The strong intrafamily TE conservation and wide distribution across species of the whole family suggest an ancestral acquisition with potential secondary losses in some lineages. GyNoto and CoNoto (including Hydra and GalEa clades) mostly produced interspersed signals along chromosomal arms. On the contrary, insertion hot spots accumulating in localized regions (mainly next to centromeric and pericentromeric regions) highlighted the potential role of YNoto in chromosomal diversifications as facilitator of the fusions which occurred in many nototheniid lineages, but not of the fissions.
Collapse
Affiliation(s)
- Juliette Auvinet
- Laboratoire Evolution Paris Seine, Sorbonne Université, CNRS, Univ Antilles, Institut de Biologie Paris Seine (IBPS), F-75005 Paris, France.
| | - Paula Graça
- Laboratoire Evolution Paris Seine, Sorbonne Université, CNRS, Univ Antilles, Institut de Biologie Paris Seine (IBPS), F-75005 Paris, France.
| | - Laura Ghigliotti
- Istituto per lo Studio degli Impatti Antropici e la Sostenibilità in Ambiente Marino (IAS), National Research Council (CNR), 16149 Genoa, Italy.
| | - Eva Pisano
- Istituto per lo Studio degli Impatti Antropici e la Sostenibilità in Ambiente Marino (IAS), National Research Council (CNR), 16149 Genoa, Italy.
| | - Agnès Dettaï
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Museum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, 57, rue Cuvier, 75005 Paris, France.
| | - Catherine Ozouf-Costaz
- Laboratoire Evolution Paris Seine, Sorbonne Université, CNRS, Univ Antilles, Institut de Biologie Paris Seine (IBPS), F-75005 Paris, France.
| | - Dominique Higuet
- Laboratoire Evolution Paris Seine, Sorbonne Université, CNRS, Univ Antilles, Institut de Biologie Paris Seine (IBPS), F-75005 Paris, France.
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Museum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, 57, rue Cuvier, 75005 Paris, France.
| |
Collapse
|
2
|
Pizarro D, Divakar PK, Grewe F, Leavitt SD, Huang JP, Dal Grande F, Schmitt I, Wedin M, Crespo A, Lumbsch HT. Phylogenomic analysis of 2556 single-copy protein-coding genes resolves most evolutionary relationships for the major clades in the most diverse group of lichen-forming fungi. FUNGAL DIVERS 2018. [DOI: 10.1007/s13225-018-0407-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
3
|
Greshake B, Zehr S, Dal Grande F, Meiser A, Schmitt I, Ebersberger I. Potential and pitfalls of eukaryotic metagenome skimming: a test case for lichens. Mol Ecol Resour 2015; 16:511-23. [PMID: 26345272 DOI: 10.1111/1755-0998.12463] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2015] [Revised: 07/28/2015] [Accepted: 08/22/2015] [Indexed: 11/30/2022]
Abstract
Whole-genome shotgun sequencing of multispecies communities using only a single library layout is commonly used to assess taxonomic and functional diversity of microbial assemblages. Here, we investigate to what extent such metagenome skimming approaches are applicable for in-depth genomic characterizations of eukaryotic communities, for example lichens. We address how to best assemble a particular eukaryotic metagenome skimming data, what pitfalls can occur, and what genome quality can be expected from these data. To facilitate a project-specific benchmarking, we introduce the concept of twin sets, simulated data resembling the outcome of a particular metagenome sequencing study. We show that the quality of genome reconstructions depends essentially on assembler choice. Individual tools, including the metagenome assemblers Omega and MetaVelvet, are surprisingly sensitive to low and uneven coverages. In combination with the routine of assembly parameter choice to optimize the assembly N50 size, these tools can preclude an entire genome from the assembly. In contrast, MIRA, an all-purpose overlap assembler, and SPAdes, a multisized de Bruijn graph assembler, facilitate a comprehensive view on the individual genomes across a wide range of coverage ratios. Testing assemblers on a real-world metagenome skimming data from the lichen Lasallia pustulata demonstrates the applicability of twin sets for guiding method selection. Furthermore, it reveals that the assembly outcome for the photobiont Trebouxia sp. falls behind the a priori expectation given the simulations. Although the underlying reasons remain still unclear, this highlights that further studies on this organism require special attention during sequence data generation and downstream analysis.
Collapse
Affiliation(s)
- Bastian Greshake
- Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany
| | - Simonida Zehr
- Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany
| | - Francesco Dal Grande
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Anlage 25, D-60325, Frankfurt, Germany
| | - Anjuli Meiser
- Institute of Ecology, Evolution and Diversity, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany
| | - Imke Schmitt
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Anlage 25, D-60325, Frankfurt, Germany.,Institute of Ecology, Evolution and Diversity, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany
| | - Ingo Ebersberger
- Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany
| |
Collapse
|
4
|
Bhuiyan SS, Kinoshita S, Wongwarangkana C, Asaduzzaman M, Asakawa S, Watabe S. Evolution of the myosin heavy chain gene MYH14 and its intronic microRNA miR-499: muscle-specific miR-499 expression persists in the absence of the ancestral host gene. BMC Evol Biol 2013; 13:142. [PMID: 24059862 PMCID: PMC3716903 DOI: 10.1186/1471-2148-13-142] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2012] [Accepted: 06/13/2013] [Indexed: 12/17/2022] Open
Abstract
Background A novel sarcomeric myosin heavy chain gene, MYH14, was identified following the completion of the human genome project. MYH14 contains an intronic microRNA, miR-499, which is expressed in a slow/cardiac muscle specific manner along with its host gene; it plays a key role in muscle fiber-type specification in mammals. Interestingly, teleost fish genomes contain multiple MYH14 and miR-499 paralogs. However, the evolutionary history of MYH14 and miR-499 has not been studied in detail. In the present study, we identified MYH14/miR-499 loci on various teleost fish genomes and examined their evolutionary history by sequence and expression analyses. Results Synteny and phylogenetic analyses depict the evolutionary history of MYH14/miR-499 loci where teleost specific duplication and several subsequent rounds of species-specific gene loss events took place. Interestingly, miR-499 was not located in the MYH14 introns of certain teleost fish. An MYH14 paralog, lacking miR-499, exhibited an accelerated rate of evolution compared with those containing miR-499, suggesting a putative functional relationship between MYH14 and miR-499. In medaka, Oryzias latipes, miR-499 is present where MYH14 is completely absent in the genome. Furthermore, by using in situ hybridization and small RNA sequencing, miR-499 was expressed in the notochord at the medaka embryonic stage and slow/cardiac muscle at the larval and adult stages. Comparing the flanking sequences of MYH14/miR-499 loci between torafugu Takifugu rubripes, zebrafish Danio rerio, and medaka revealed some highly conserved regions, suggesting that cis-regulatory elements have been functionally conserved in medaka miR-499 despite the loss of its host gene. Conclusions This study reveals the evolutionary history of the MYH14/miRNA-499 locus in teleost fish, indicating divergent distribution and expression of MYH14 and miR-499 genes in different teleost fish lineages. We also found that medaka miR-499 was even expressed in the absence of its host gene. To our knowledge, this is the first report that shows the conversion of intronic into non-intronic miRNA during the evolution of a teleost fish lineage.
Collapse
Affiliation(s)
- Sharmin Siddique Bhuiyan
- Department of Aquatic Bioscience, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo, Tokyo 113-8657, Japan.
| | | | | | | | | | | |
Collapse
|
5
|
Fosmid library end sequencing reveals a rarely known genome structure of marine shrimp Penaeus monodon. BMC Genomics 2011; 12:242. [PMID: 21575266 PMCID: PMC3124438 DOI: 10.1186/1471-2164-12-242] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2011] [Accepted: 05/17/2011] [Indexed: 11/28/2022] Open
Abstract
Background The black tiger shrimp (Penaeus monodon) is one of the most important aquaculture species in the world, representing the crustacean lineage which possesses the greatest species diversity among marine invertebrates. Yet, we barely know anything about their genomic structure. To understand the organization and evolution of the P. monodon genome, a fosmid library consisting of 288,000 colonies and was constructed, equivalent to 5.3-fold coverage of the 2.17 Gb genome. Approximately 11.1 Mb of fosmid end sequences (FESs) from 20,926 non-redundant reads representing 0.45% of the P. monodon genome were obtained for repetitive and protein-coding sequence analyses. Results We found that microsatellite sequences were highly abundant in the P. monodon genome, comprising 8.3% of the total length. The density and the average length of microsatellites were evidently higher in comparison to those of other taxa. AT-rich microsatellite motifs, especially poly (AT) and poly (AAT), were the most abundant. High abundance of microsatellite sequences were also found in the transcribed regions. Furthermore, via self-BlastN analysis we identified 103 novel repetitive element families which were categorized into four groups, i.e., 33 WSSV-like repeats, 14 retrotransposons, 5 gene-like repeats, and 51 unannotated repeats. Overall, various types of repeats comprise 51.18% of the P. monodon genome in length. Approximately 7.4% of the FESs contained protein-coding sequences, and the Inhibitor of Apoptosis Protein (IAP) gene and the Innexin 3 gene homologues appear to be present in high abundance in the P. monodon genome. Conclusions The redundancy of various repeat types in the P. monodon genome illustrates its highly repetitive nature. In particular, long and dense microsatellite sequences as well as abundant WSSV-like sequences highlight the uniqueness of genome organization of penaeid shrimp from those of other taxa. These results provide substantial improvement to our current knowledge not only for shrimp but also for marine crustaceans of large genome size.
Collapse
|
6
|
Ellison CK, Shaw KL. Mining non-model genomic libraries for microsatellites: BAC versus EST libraries and the generation of allelic richness. BMC Genomics 2010; 11:428. [PMID: 20624300 PMCID: PMC2996956 DOI: 10.1186/1471-2164-11-428] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2010] [Accepted: 07/12/2010] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Simple sequence repeats (SSRs) are tandemly repeated sequence motifs common in genomic nucleotide sequence that often harbor significant variation in repeat number. Frequently used as molecular markers, SSRs are increasingly identified via in silico approaches. Two common classes of genomic resources that can be mined are bacterial artificial chromosome (BAC) libraries and expressed sequence tag (EST) libraries. RESULTS 288 SSR loci were screened in the rapidly radiating Hawaiian swordtail cricket genus Laupala. SSRs were more densely distributed and contained longer repeat structures in BAC library-derived sequence than in EST library-derived sequence, although neither repeat density nor length was exceptionally elevated despite the relatively large genome size of Laupala. A non-random distribution favoring AT-rich SSRs was observed. Allelic diversity of SSRs was positively correlated with repeat length and was generally higher in AT-rich repeat motifs. CONCLUSION The first large-scale survey of Orthopteran SSR allelic diversity is presented. Selection contributes more strongly to the size and density distributions of SSR loci derived from EST library sequence than from BAC library sequence, although all SSRs likely are subject to similar physical and structural constraints, such as slippage of DNA replication machinery, that may generate increased allelic diversity in AT-rich sequence motifs. Although in silico approaches work well for SSR locus identification in both EST and BAC libraries, BAC library sequence and AT-rich repeat motifs are generally superior SSR development resources for most applications.
Collapse
Affiliation(s)
| | - Kerry L Shaw
- Department of Neurobiology and Behavior, Cornell University, Ithaca, NY 14850, USA
| |
Collapse
|
7
|
Cavallini A, Natali L, Zuccolo A, Giordani T, Jurman I, Ferrillo V, Vitacolonna N, Sarri V, Cattonaro F, Ceccarelli M, Cionini PG, Morgante M. Analysis of transposons and repeat composition of the sunflower (Helianthus annuus L.) genome. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2010; 120:491-508. [PMID: 19826774 DOI: 10.1007/s00122-009-1170-7] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2009] [Accepted: 09/27/2009] [Indexed: 05/25/2023]
Abstract
A sample-sequencing strategy combined with slot-blot hybridization and FISH was used to study the composition of the repetitive component of the sunflower genome. One thousand six hundred thirty-eight sequences for a total of 954,517 bp were analyzed. The fraction of sequences that can be classified as repetitive using computational and hybridization approaches amounts to 62% in total. Almost two thirds remain as yet uncharacterized in nature. Of those characterized, most belong to the gypsy superfamily of LTR-retrotransposons. Unlike in other species, where single families can account for large fractions of the genome, it appears that no transposon family has been amplified to very high levels in sunflower. All other known classes of transposable elements were also found. One family of unknown nature (contig 61) was the most repeated in the sunflower genome. The evolution of the repetitive component in the Helianthus genus and in other Asteraceae was studied by comparative analysis of the hybridization of total genomic DNAs from these species to the sunflower small-insert library and compared to gene-based phylogeny. Very little similarity is observed between Helianthus species and two related Asteraceae species outside of the genus. Most repetitive elements are similar in annual and perennial Helianthus species indicating that sequence amplification largely predates such divergence. Gypsy-like elements are more represented in the annuals than in the perennials, while copia-like elements are similarly represented, attesting a different amplification history of the two superfamilies of LTR-retrotransposons in the Helianthus genus.
Collapse
Affiliation(s)
- Andrea Cavallini
- Genetics Section, Department of Crop Plant Biology, University of Pisa, Pisa, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Noleto RB, de Souza Fonseca Guimarães F, Paludo KS, Vicari MR, Artoni RF, Cestari MM. Genome size evaluation in Tetraodontiform fishes from the Neotropical region. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2009; 11:680-685. [PMID: 19590923 DOI: 10.1007/s10126-009-9215-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2009] [Accepted: 06/17/2009] [Indexed: 05/28/2023]
Abstract
Smooth pufferfish of the family Tetraodontidae had become pure genomic models because of the remarkable compaction of their genome. This trait seems to be the result of DNA loss following its divergence from the sister family Diodontidae, which possess larger genomes. In this study, flow cytometry was used for estimate the genome size of four pufferfish species from the Neotropical region. Cytogenetic data and confocal microscopy were also used attempting to confirm relationships between DNA content and cytological parameters. The haploid genome size was 0.71 + or - 0.03 pg for Sphoeroides greeleyi, 0.34 + or - 0.01 pg for Sphoeroides spengleri, 0.82 + or - 0.03 pg for Sphoeroides testudineus (all Tetraodontidae), and 1.00 + or - 0.03 pg for Chilomycterus spinosus (Diodontidae). These differences are not related with ploidy level, because 46 chromosomes are considered basal for both families. The value for S. spengleri represents the smallest vertebrate genome reported to date. Since erythrocyte cell and nuclear sizes are strongly correlated with genome size, the variation in this last is considered under both adaptive and evolutionary perspectives.
Collapse
Affiliation(s)
- Rafael Bueno Noleto
- Departamento de Genética, Universidade Federal do Paraná, Curitiba, PR, Brazil.
| | | | | | | | | | | |
Collapse
|
9
|
DeBarry JD, Liu R, Bennetzen JL. Discovery and assembly of repeat family pseudomolecules from sparse genomic sequence data using the Assisted Automated Assembler of Repeat Families (AAARF) algorithm. BMC Bioinformatics 2008. [PMID: 18474116 DOI: 10.1186/1471‐2105‐9‐235] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Higher eukaryotic genomes are typically large, complex and filled with both genes and multiple classes of repetitive DNA. The repetitive DNAs, primarily transposable elements, are a rapidly evolving genome component that can provide the raw material for novel selected functions and also indicate the mechanisms and history of genome evolution in any ancestral lineage. Despite their abundance, universality and significance, studies of genomic repeat content have been largely limited to analyses of the repeats in fully sequenced genomes. RESULTS In order to facilitate a broader range of repeat analyses, the Assisted Automated Assembler of Repeat Families algorithm has been developed. This program, written in PERL and with numerous adjustable parameters, identifies sequence overlaps in small shotgun sequence datasets and walks them out to create long pseudomolecules representing the most abundant repeats in any genome. Testing of this program in maize indicated that it found and assembled all of the major repeats in one or more pseudomolecules, including coverage of the major Long Terminal Repeat retrotransposon families. Both Sanger sequence and 454 datasets were appropriate. CONCLUSION These results now indicate that hundreds of higher eukaryotic genomes can be efficiently characterized for the nature, abundance and evolution of their major repetitive DNA components.
Collapse
Affiliation(s)
- Jeremy D DeBarry
- Department of Genetics, University of Georgia, Athens, GA 30602-7223, USA.
| | | | | |
Collapse
|
10
|
DeBarry JD, Liu R, Bennetzen JL. Discovery and assembly of repeat family pseudomolecules from sparse genomic sequence data using the Assisted Automated Assembler of Repeat Families (AAARF) algorithm. BMC Bioinformatics 2008; 9:235. [PMID: 18474116 PMCID: PMC2412881 DOI: 10.1186/1471-2105-9-235] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2007] [Accepted: 05/13/2008] [Indexed: 11/25/2022] Open
Abstract
Background Higher eukaryotic genomes are typically large, complex and filled with both genes and multiple classes of repetitive DNA. The repetitive DNAs, primarily transposable elements, are a rapidly evolving genome component that can provide the raw material for novel selected functions and also indicate the mechanisms and history of genome evolution in any ancestral lineage. Despite their abundance, universality and significance, studies of genomic repeat content have been largely limited to analyses of the repeats in fully sequenced genomes. Results In order to facilitate a broader range of repeat analyses, the Assisted Automated Assembler of Repeat Families algorithm has been developed. This program, written in PERL and with numerous adjustable parameters, identifies sequence overlaps in small shotgun sequence datasets and walks them out to create long pseudomolecules representing the most abundant repeats in any genome. Testing of this program in maize indicated that it found and assembled all of the major repeats in one or more pseudomolecules, including coverage of the major Long Terminal Repeat retrotransposon families. Both Sanger sequence and 454 datasets were appropriate. Conclusion These results now indicate that hundreds of higher eukaryotic genomes can be efficiently characterized for the nature, abundance and evolution of their major repetitive DNA components.
Collapse
Affiliation(s)
- Jeremy D DeBarry
- Department of Genetics, University of Georgia, Athens, GA 30602-7223, USA.
| | | | | |
Collapse
|
11
|
Ikeda D, Ono Y, Snell P, Edwards YJK, Elgar G, Watabe S. Divergent evolution of the myosin heavy chain gene family in fish and tetrapods: evidence from comparative genomic analysis. Physiol Genomics 2007; 32:1-15. [PMID: 17940200 DOI: 10.1152/physiolgenomics.00278.2006] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Myosin heavy chain genes (MYHs) are the most important functional domains of myosins, which are highly conserved throughout evolution. The human genome contains 15 MYHs, whereas the corresponding number in teleost appears to be much higher. Although teleosts comprise more than one-half of all vertebrate species, our knowledge of MYHs in teleosts is rather limited. A comprehensive analysis of the torafugu (Takifugu rubripes) genome database enabled us to detect at least 28 MYHs, almost twice as many as in humans. RT-PCR revealed that at least 16 torafugu MYH representatives (5 fast skeletal, 3 cardiac, 2 slow skeletal, 1 superfast, 2 smooth, and 3 nonmuscle types) are actually transcribed. Among these, MYH(M743-2) and MYH(M5) of fast and slow skeletal types, respectively, are expressed during development of torafugu embryos. Syntenic analysis reveals that torafugu fast skeletal MYHs are distributed across five genomic regions, three of which form clusters. Interestingly, while human fast skeletal MYHs form one cluster, its syntenic region in torafugu is duplicated, although each locus contains just a single MYH in torafugu. The results of the syntenic analysis were further confirmed by corresponding analysis of MYHs based on databases from Tetraodon, zebrafish, and medaka genomes. Phylogenetic analysis suggests that fast skeletal MYHs evolved independently in teleosts and tetrapods after fast skeletal MYHs had diverged from four ancestral MYHs.
Collapse
Affiliation(s)
- Daisuke Ikeda
- Laboratory of Aquatic Molecular Biology and Biotechnology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo, Tokyo, Japan
| | | | | | | | | | | |
Collapse
|
12
|
Zhang Z, Deng Y, Tan J, Hu S, Yu J, Xue Q. A genome-wide microsatellite polymorphism database for the indica and japonica rice. DNA Res 2007; 14:37-45. [PMID: 17452422 PMCID: PMC2779893 DOI: 10.1093/dnares/dsm005] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Microsatellite (MS) polymorphism is an important source of genetic diversity, providing support for map-based cloning and molecular breeding. We have developed a new database that contains 52 845 polymorphic MS loci between indica and japonica, composed of ample Class II MS markers, and integrated 18 828 MS loci from IRGSP and genetic markers from RGP. Based on genetic marker positions on the rice genome (http://rise.genomics.org.cn/rice2/index.jsp ), we determined the approximate genetic distances of these MS loci and validated 100 randomly selected markers experimentally with 90% success rate. In addition, we recorded polymorphic MS positions in indica cv. 9311 that is the most important paternal parent of the two-line hybrid rice in China. Our database will undoubtedly facilitate the application of MS markers in genetic researches and marker-assisted breeding. The data set is freely available from www.wigs.zju.edu.cn/achievment/polySSR.
Collapse
Affiliation(s)
- Zhonghua Zhang
- James D. Watson Institute of Genome Science, Zhejiang University, Hangzhou 310008, China
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yajun Deng
- Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China
| | - Jun Tan
- Institute of Bioinformation, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
| | - Songnian Hu
- James D. Watson Institute of Genome Science, Zhejiang University, Hangzhou 310008, China
- Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China
| | - Jun Yu
- James D. Watson Institute of Genome Science, Zhejiang University, Hangzhou 310008, China
- Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 101300, China
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing 101300, China
| | - Qingzhong Xue
- James D. Watson Institute of Genome Science, Zhejiang University, Hangzhou 310008, China
- Department of Agronomy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310029, China
- To whom correspondence should be addressed. Tel. +86-571-86971611. Fax. +86-571-86971117, E-mail:
| |
Collapse
|
13
|
Abstract
It is unlikely that taxonomically diverse phylogenetic studies will be completed rapidly in the near future for nonmodel organisms on a whole-genome basis. However, one approach to advancing the field of "phylogenomics" is to estimate the structure of poorly known genomes by mining libraries of clones from suites of taxa, rather than from single species. The present analysis adopts this approach by taking advantage of megabase-scale end-sequence scanning of reptilian genomic clones to characterize diversity of CR1-like LINEs, the dominant family of transposable elements (TEs) in the sister group of mammals. As such, it helps close an important gap in the literature on the molecular systematics and evolution of retroelements in nonavian reptiles. Results from aligning more than 14 Mb of sequence from the American alligator (Alligator mississippiensis), painted turtle (Chrysemys picta), Bahamian green anole (Anolis smaragdinus), Tuatara (Sphenodon punctatus), Emu (Dromaius novaehollandiae), and Zebra Finch (Taeniopygia guttata) against a comprehensive library approximately 3000 TE-encoding peptides reflect an increasing abundance of LINE and non-long-terminal-repeat (non-LTR) retrotransposon repeat types with the age of common ancestry among exemplar reptilian clades. The hypothesis that repeat diversity is correlated with basal metabolic rate was tested using comparative methods and a significant nonlinear relationship was indicated. This analysis suggests that the age of divergence between an exemplary clade and its sister group as well as metabolic correlates should be considered in addition to genome size in explaining patterns of retroelement diversity. The first phylogenetic analysis of the largely unexplored chicken repeat 1 (CR1) 3' reverse transcriptase (RT) conserved domains 8 and 9 in nonavian reptiles reveals a pattern of multiple lineages with variable branch lengths, suggesting presence of both old and young elements and the existence of several distinct well-supported clades not apparent from previous characterization of CR1 subfamily structure in birds and the turtle. This mode of CR1 evolution contrasts with historical patterns of LINE 1 diversification in mammals and hints toward the existence of a rich but still largely unexplored diversity of nonavian retroelements of importance to advancing both comparative vertebrate genomics and amniote systematics.
Collapse
Affiliation(s)
- Andrew M Shedlock
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, Massachusetts 02138, USA.
| |
Collapse
|
14
|
Cruz F, Pérez M, Presa P. Distribution and abundance of microsatellites in the genome of bivalves. Gene 2005; 346:241-7. [PMID: 15716000 DOI: 10.1016/j.gene.2004.11.013] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2004] [Revised: 09/24/2004] [Accepted: 11/17/2004] [Indexed: 11/15/2022]
Abstract
Understanding how microsatellites are distributed in eukaryotic genomes is important to clarify the differential abundance of these repeats under an evolutionary scenario. We have concatenated data from 3165 DNA sequences of 326 Bivalvia species to search for taxonomic patterns of microsatellite distribution in genomic regions of markedly different functionality. Some microsatellite motifs in bivalves showed one of the lowest genomic densities observed among eukaryotes. Contrary to the expectation of a random distribution of microsatellites, they were overrepresented in introns (245 loci/Mb) compared to their frequency in exons (85 loci/Mb). Closely related species showed remarkable differences in microsatellite density suggesting species-specific properties as for mutation/repair efficiency on replication slippage. There was no evidence of a positive correlation between the density of microsatellites in intergenic DNA and the DNA-content. This research is relevant to better understand the forces shaping the distribution of microsatellites in the genome of bivalves.
Collapse
Affiliation(s)
- Fernando Cruz
- University of Vigo, Faculty of Biology, Department of Biochemistry, Genetics and Immunology, 36310 Vigo, Spain
| | | | | |
Collapse
|
15
|
Venkatesh B, Yap WH. Comparative genomics using fugu: a tool for the identification of conserved vertebrate cis-regulatory elements. Bioessays 2005; 27:100-7. [PMID: 15612032 DOI: 10.1002/bies.20134] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
With the imminent completion of the whole genome sequence of humans, increasing attention is being focused on the annotation of cis-regulatory elements in the human genome. Comparative genomics approaches based on evolutionary conservation have proved useful in the detection of conserved cis-regulatory elements. The pufferfish, Fugu rubripes, is an attractive vertebrate model for comparative genomics, by virtue of its compact genome and maximal phylogenetic distance from mammals. Fugu has lost a large proportion of nonessential DNA, and retained single orthologs for many duplicate genes that arose in the fish lineage. Non-coding sequences conserved between fugu and mammals have been shown to be functional cis-regulatory elements. Thus, fugu is a model fish genome of choice for discovering evolutionarily conserved regulatory elements in the human genome. Such evolutionarily conserved elements are likely to be shared by all vertebrates, and related to regulatory interactions fundamental to all vertebrates. The functions of these conserved vertebrate elements can be rapidly assayed in mammalian cell lines or in transgenic systems such as zebrafish/medaka and Xenopus, followed by validation of crucial elements in transgenic rodents.
Collapse
Affiliation(s)
- Byrappa Venkatesh
- Institute of Molecular and Cell Biology, 61 Biopolis Drive, Singapore 138673.
| | | |
Collapse
|
16
|
Abstract
Recent analyses of complete genome sequences have revealed that many genomes have been duplicated in their evolutionary past. Such events have been associated with important biological transitions, major leaps in evolution and adaptive radiations of species. Here, we consider recently developed computational methods to detect such ancient large-scale gene duplication events. Several new approaches have been used to show that large-scale gene duplications are more common than previously thought.
Collapse
Affiliation(s)
- Yves Van de Peer
- Department of Plant Systems Biology, Flanders Interuniversity, Institute for Biotechnology, Ghent, Belgium.
| |
Collapse
|
17
|
Trachtulec Z, Vlcek C, Mihola O, Forejt J. Comparative analysis of the PDCD2–TBP–PSMB1 region in vertebrates. Gene 2004; 335:151-7. [PMID: 15194198 DOI: 10.1016/j.gene.2004.03.021] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2003] [Revised: 02/25/2004] [Accepted: 03/18/2004] [Indexed: 10/26/2022]
Abstract
Three orthologous genes encoding programmed cell death 2 (PDCD2), TATA-binding protein (TBP), and proteasomal subunit C5 (PSMB1) proteins have been shown previously to be nonrandomly distributed in both mammalian and invertebrate genomes. Here we analyze a conserved synteny of the PDCD2, TBP, and PSMB1 orthologs in four nonmammalian vertebrates. Homologous genes of the chicken, zebrafish, fugu, and Tetraodon nigroviridis were identified. A chicken cosmid harboring the orthologs of these three genes was completely sequenced. The fish genes were analyzed in silico. In all seven vertebrates thus far investigated, the PDCD2 and TBP genes are located tail-to-tail. In six tested species but the zebrafish, the PSMB1 gene mapped head-to-head or in the close vicinity to the TBP, but even in the zebrafish, all three genes were syntenic. In contrast, a three times reused synteny breakpoint in the 5'-region from PDCD2 was detected. A comparative analysis revealed the distribution of putative matrix-attached regions (MARs), which may affect the synteny conservation.
Collapse
Affiliation(s)
- Zdenek Trachtulec
- Center for Integrated Genomics, Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, 142 20 Prague 4, Czech Republic.
| | | | | | | |
Collapse
|
18
|
Amores A, Suzuki T, Yan YL, Pomeroy J, Singer A, Amemiya C, Postlethwait JH. Developmental roles of pufferfish Hox clusters and genome evolution in ray-fin fish. Genome Res 2004; 14:1-10. [PMID: 14707165 PMCID: PMC314266 DOI: 10.1101/gr.1717804] [Citation(s) in RCA: 168] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The pufferfish skeleton lacks ribs and pelvic fins, and has fused bones in the cranium and jaw. It has been hypothesized that this secondarily simplified pufferfish morphology is due to reduced complexity of the pufferfish Hox complexes. To test this hypothesis, we determined the genomic structure of Hox clusters in the Southern pufferfish Spheroides nephelus and interrogated genomic databases for the Japanese pufferfish Takifugu rubripes (fugu). Both species have at least seven Hox clusters, including two copies of Hoxb and Hoxd clusters, a single Hoxc cluster, and at least two Hoxa clusters, with a portion of a third Hoxa cluster in fugu. Results support genome duplication before divergence of zebrafish and pufferfish lineages, followed by loss of a Hoxc cluster in the pufferfish lineage and loss of a Hoxd cluster in the zebrafish lineage. Comparative analysis shows that duplicate genes continued to be lost for hundreds of millions of years, contrary to predictions for the permanent preservation of gene duplicates. Gene expression analysis in fugu embryos by in situ hybridization revealed evolutionary change in gene expression as predicted by the duplication-degeneration-complementation model. These experiments rule out the hypothesis that the simplified pufferfish body plan is due to reduction in Hox cluster complexity, and support the notion that genome duplication contributed to the radiation of teleosts into half of all vertebrate species by increasing developmental diversification of duplicate genes in daughter lineages.
Collapse
Affiliation(s)
- Angel Amores
- Institute of Neuroscience, University of Oregon, Eugene, Oregon 97402, USA
| | | | | | | | | | | | | |
Collapse
|
19
|
Vandepoele K, De Vos W, Taylor JS, Meyer A, Van de Peer Y. Major events in the genome evolution of vertebrates: paranome age and size differ considerably between ray-finned fishes and land vertebrates. Proc Natl Acad Sci U S A 2004; 101:1638-43. [PMID: 14757817 PMCID: PMC341801 DOI: 10.1073/pnas.0307968100] [Citation(s) in RCA: 405] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It has been suggested that fish have more genes than humans. Whether most of these additional genes originated through a complete (fish-specific) genome duplication or through many lineage-specific tandem gene or smaller block duplications and family expansions continues to be debated. We analyzed the complete genome of the pufferfish Takifugu rubripes (Fugu) and compared it with the paranome of humans. We show that most paralogous genes of Fugu are the result of three complete genome duplications. Both relative and absolute dating of the complete predicted set of protein-coding genes suggest that initial genome duplications, estimated to have occurred at least 600 million years ago, shaped the genome of all vertebrates. In addition, analysis of >150 block duplications in the Fugu genome clearly supports a fish-specific genome duplication (approximately equal to 320 million years ago) that coincided with the vast radiation of most modern ray-finned fishes. Unlike the human genome, Fugu contains very few recently duplicated genes; hence, many human genes are much younger than fish genes. This lack of recent gene duplication, or, alternatively, the accelerated rate of gene loss, is possibly one reason for the drastic reduction of the genome size of Fugu observed during the past 100 million years or so, subsequent to the additional genome duplication that ray-finned fishes but not land vertebrates experienced.
Collapse
Affiliation(s)
- Klaas Vandepoele
- Department of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology, Ghent University, Technologiepark 927, B-9052 Ghent, Belgium
| | | | | | | | | |
Collapse
|
20
|
Montpetit A, Wilson MD, Chevrette M, Koop BF, Sinnett D. Analysis of the conservation of synteny between Fugu and human chromosome 12. BMC Genomics 2003; 4:30. [PMID: 12877756 PMCID: PMC179898 DOI: 10.1186/1471-2164-4-30] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2003] [Accepted: 07/23/2003] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The pufferfish Fugu rubripes (Fugu) with its compact genome is increasingly recognized as an important vertebrate model for comparative genomic studies. In particular, large regions of conserved synteny between human and Fugu genomes indicate its utility to identify disease-causing genes. The human chromosome 12p12 is frequently deleted in various hematological malignancies and solid tumors, but the actual tumor suppressor gene remains unidentified. RESULTS We investigated approximately 200 kb of the genomic region surrounding the ETV6 locus in Fugu (fETV6) in order to find conserved functional features, such as genes or regulatory regions, that could give insight into the nature of the genes targeted by deletions in human cancer cells. Seven genes were identified near the fETV6 locus. We found that the synteny with human chromosome 12 was conserved, but extensive genomic rearrangements occurred between the Fugu and human ETV6 loci. CONCLUSION This comparative analysis led to the identification of previously uncharacterized genes in the human genome and some potentially important regulatory sequences as well. This is a good indication that the analysis of the compact Fugu genome will be valuable to identify functional features that have been conserved throughout the evolution of vertebrates.
Collapse
Affiliation(s)
- Alexandre Montpetit
- Division of Hematology-Oncology, Charles-Bruneau Cancer Center, Research Center, Sainte-Justine Hospital, 3175 Côte Ste-Catherine, Montreal, QC, H3T 1C5, Canada
- Department of Biochemistry, University of Montreal, Montreal, QC, Canada
| | - Michael D Wilson
- Centre for Biomedical Research, University of Victoria, Victoria, BC, V8W 2Y2, Canada
| | - Mario Chevrette
- The Research Institute of the McGill University Health Centre and Department of Surgery, McGill University, Montreal, QC, H3G 1A4, Canada
| | - Ben F Koop
- Centre for Biomedical Research, University of Victoria, Victoria, BC, V8W 2Y2, Canada
| | - Daniel Sinnett
- Division of Hematology-Oncology, Charles-Bruneau Cancer Center, Research Center, Sainte-Justine Hospital, 3175 Côte Ste-Catherine, Montreal, QC, H3T 1C5, Canada
- Department of Biochemistry, University of Montreal, Montreal, QC, Canada
- Department of Pediatrics, University of Montreal, Montreal, QC, Canada
| |
Collapse
|
21
|
Volff JN, Schartl M. Evolution of signal transduction by gene and genome duplication in fish. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2003; 3:139-50. [PMID: 12836693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 03/03/2023]
Abstract
Fishes possess more genes encoding receptor tyrosine kinases from the epidermal growth factor receptor (EGFR) family than other organisms. Three of the four genes present in higher vertebrates have been duplicated early during the evolution of the ray-finned fish lineage possibly as a consequence of an event of whole genome duplication. In the fish Xiphophorus, a much more recent local event of gene duplication of the egfr co-orthologue egfr-b generated a eighth gene, the Xmrk oncogene. This duplicate acquired within a short time a constitutive activity and a pigment cell-specific overexpression responsible for the induction of melanoma in certain interspecific hybrids. Despite its frequent loss during evolution of the genus Xiphophorus, the maintenance of Xmrk in numerous species and its evolution under purifying selection suggest a so far unknown function under certain natural conditions. One of the known functions of Xmrk in tumor cells is the suppression of differentiation of melanocytes induced by the microphthalmia-associated transcription factor MITF. While only one gene with alternative 5' exons and promoters is present in higher vertebrates, two mitf genes were identified in fish. Subfunctionalization of mitf paralogues by differential degeneration of alternative exons and regulatory sequences led particularly to the formation of a mitf gene specifically expressed in the melanocyte lineage. These observations validate fish as an outstanding model to study the mechanisms and biological consequences of gene and genome duplication but underline the complexity of the fish model and the caution necessary in transferring knowledge from fish to higher vertebrates and vice versa.
Collapse
Affiliation(s)
- Jean-Nicolas Volff
- Physiologische Chemie I, Biofuture Research Group 'Evolutionary Fish Genomics', Biozentrum, University of Würzburg, Am Hubland, D-97074 Würzburg, Germany.
| | | |
Collapse
|
22
|
Sambrook JG, Campbell RD, Elgar G. Characterisation of a gene cluster in Fugu rubripes containing the complement component C4 gene. Gene 2003; 312:73-83. [PMID: 12909342 DOI: 10.1016/s0378-1119(03)00602-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
In this study, we describe the characterisation of the complement component C4 gene in Fugu rubripes. The Fugu C4 gene, orthologous to the tetrapod C4 gene, encompasses a genomic span of 9702 base pairs and contains 41 exons, encoding the typical C4 three-chain polypeptide. The gene encodes a protein containing 1703 amino acids. The Fugu C4 protein demonstrates the presence of 25 conserved cysteine residues, as well as conservation of the functionally important thioester site. Complete sequencing of one cosmid and sequence scans from a cluster of 18 overlapping BAC clones, centering around the C4 gene, have identified the short-range linkage with five orthologous human genes mapping to the Major Histocompatibility Complex (MHC) including: tenascin X (TNX); cytochrome P450, subfamily XXIA, polypeptide 2 (CYP21A2); allograft inflammatory factor 1 (AIF1) and casein kinase 2, beta polypeptide (CSNK2B), all found in the MHC class III region; and retinoid X receptor, beta (RXRB),which resides in the MHC extended class II region. To date, this syntenic association of the Fugu C4 and other MHC class III region genes has not been observed in other teleost fish. Data from the recent whole-genome shotgun assemblies reveal the Fugu MHC-related cluster of genes to be flanked predominantly by genes mapping to human chromosomes 7 and 19. All of the six identified Fugu MHC-related genes have been characterised at the genomic level.
Collapse
Affiliation(s)
- Jennifer G Sambrook
- Comparative Genomics, MRC UK HGMP Resource Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SB, UK.
| | | | | |
Collapse
|
23
|
Edwards YJK, Carver TJ, Vavouri T, Frith M, Bishop MJ, Elgar G. Theatre: A software tool for detailed comparative analysis and visualization of genomic sequence. Nucleic Acids Res 2003; 31:3510-7. [PMID: 12824356 PMCID: PMC168908 DOI: 10.1093/nar/gkg501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2002] [Revised: 01/17/2003] [Accepted: 01/27/2003] [Indexed: 01/12/2023] Open
Abstract
Theatre is a web-based computing system designed for the comparative analysis of genomic sequences, especially with respect to motifs likely to be involved in the regulation of gene expression. Theatre is an interface to commonly used sequence analysis tools and biological sequence databases to determine or predict the positions of coding regions, repetitive sequences and transcription factor binding sites in families of DNA sequences. The information is displayed in a manner that can be easily understood and can reveal patterns that might not otherwise have been noticed. In addition to web-based output, Theatre can produce publication quality colour hardcopies showing predicted features in aligned genomic sequences. A case study using the p53 promoter region of four mammalian species and two fish species is described. Unlike the mammalian sequences the promoter regions in fish have not been previously predicted or characterized and we report the differences in the p53 promoter region of four mammals and that predicted for two fish species. Theatre can be accessed at http://www.hgmp.mrc.ac.uk/Registered/Webapp/theatre/.
Collapse
Affiliation(s)
- Yvonne J K Edwards
- Comparative Genomics Group, Research Division, MRC UK Human Genome Mapping Project Resource Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SB, UK.
| | | | | | | | | | | |
Collapse
|
24
|
Miles CG, Rankin L, Smith SI, Niksic M, Elgar G, Hastie ND. Faithful expression of a tagged Fugu WT1 protein from a genomic transgene in zebrafish: efficient splicing of pufferfish genes in zebrafish but not mice. Nucleic Acids Res 2003; 31:2795-802. [PMID: 12771206 PMCID: PMC156718 DOI: 10.1093/nar/gkg383] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2003] [Revised: 04/03/2003] [Accepted: 04/03/2003] [Indexed: 11/12/2022] Open
Abstract
The teleost fish are widely used as model organisms in vertebrate biology. The compact genome of the pufferfish, Fugu rubripes, has proven a valuable tool in comparative genome analyses, aiding the annotation of mammalian genomes and the identification of conserved regulatory elements, whilst the zebrafish is particularly suited to genetic and developmental studies. We demonstrate that a pufferfish WT1 transgene can be expressed and spliced appropriately in transgenic zebrafish, contrasting with the situation in transgenic mice. By creating both transgenic mice and transgenic zebrafish with the same construct, we show that Fugu RNA is processed correctly in zebrafish but not in mice. Furthermore, we show for the first time that a Fugu genomic construct can produce protein in transgenic zebrafish: a full-length Fugu WT1 transgene with a C-terminal beta-galactosidase fusion is spliced and translated correctly in zebrafish, mimicking the expression of the endogenous WT1 gene. These data demonstrate that the zebrafish:Fugu system is a powerful and convenient tool for dissecting both vertebrate gene regulation and gene function in vivo.
Collapse
Affiliation(s)
- Colin G Miles
- Medical Research Council Human Genetics Unit, Western General Hospital, Creve Road, Edinburgh EH4 2XU, UK
| | | | | | | | | | | |
Collapse
|
25
|
Neafsey DE, Palumbi SR. Genome size evolution in pufferfish: a comparative analysis of diodontid and tetraodontid pufferfish genomes. Genome Res 2003; 13:821-30. [PMID: 12727902 PMCID: PMC430906 DOI: 10.1101/gr.841703] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Smooth pufferfish of the family Tetraodontidae have the smallest vertebrate genomes yet measured. They have a haploid genome size of approximately 400 million bp (Mb), which is almost eight times smaller than the human genome. Given that spiny pufferfish from the sister family Diodontidae and a fish from the outgroup Molidae have genomes twice as large as smooth puffers, it appears that the genome size of smooth puffers has contracted in the last 50-70 million years since their divergence from the spiny puffers. Here we use renaturation kinetics to compare the repetitive nature of the smooth and spiny puffer genomes. We also estimate the rates of small (<400 bp) insertions and deletions in smooth and spiny puffers using defunct non-LTR retrotransposons. We find a significantly greater abundance of a transposon-like repetitive DNA class in spiny puffers relative to smooth puffers, in addition to nearly identical indel rates. We comment on the role that large insertions may play in the evolution of genome size in these two groups.
Collapse
Affiliation(s)
- Daniel E Neafsey
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.
| | | |
Collapse
|
26
|
Taylor JS, Braasch I, Frickey T, Meyer A, Van de Peer Y. Genome duplication, a trait shared by 22000 species of ray-finned fish. Genome Res 2003; 13:382-90. [PMID: 12618368 PMCID: PMC430266 DOI: 10.1101/gr.640303] [Citation(s) in RCA: 674] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Through phylogeny reconstruction we identified 49 genes with a single copy in man, mouse, and chicken, one or two copies in the tetraploid frog Xenopus laevis, and two copies in zebrafish (Danio rerio). For 22 of these genes, both zebrafish duplicates had orthologs in the pufferfish (Takifugu rubripes). For another 20 of these genes, we found only one pufferfish ortholog but in each case it was more closely related to one of the zebrafish duplicates than to the other. Forty-three pairs of duplicated genes map to 24 of the 25 zebrafish linkage groups but they are not randomly distributed; we identified 10 duplicated regions of the zebrafish genome that each contain between two and five sets of paralogous genes. These phylogeny and synteny data suggest that the common ancestor of zebrafish and pufferfish, a fish that gave rise to approximately 22000 species, experienced a large-scale gene or complete genome duplication event and that the pufferfish has lost many duplicates that the zebrafish has retained.
Collapse
Affiliation(s)
- John S Taylor
- Department of Biology, University of Konstanz, D-78457, Konstanz, Germany
| | | | | | | | | |
Collapse
|
27
|
Oshiumi H, Tsujita T, Shida K, Matsumoto M, Ikeo K, Seya T. Prediction of the prototype of the human Toll-like receptor gene family from the pufferfish, Fugu rubripes, genome. Immunogenetics 2003; 54:791-800. [PMID: 12618912 DOI: 10.1007/s00251-002-0519-8] [Citation(s) in RCA: 243] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2002] [Revised: 10/23/2002] [Indexed: 11/29/2022]
Abstract
The insect Toll family of proteins and their mammalian counterparts seemingly shared one common ancestor and evolved independently. Here we demonstrated that the prototype of the mammalian-type (M-type) Toll family is shared by the fish and humans. According to the draft of the pufferfish Fugu genome project, the signature Toll-IL-1 receptor homology domain (TIR domain) has been conserved during evolution. FuguTLR2, 3, 5, 7, 8 and 9 members correspond structurally to respective mammalian TLRs. One Fugu TLR showed equally high amino acid identity to human TLR1, 6 and 10, and we named it FuguTLR1. Fugu rubripes has genes for TLR21 and 22, which are unique to fish. One possible interpretation of these findings is that TLR1, 2, 3, 4, 5, 7, 8, 9, 21 and 22 existed in the ancestral genome common to fish and mammals, and that TLR4 was lost in the fish lineage, while TLR21 and 22 were lost in the mammalian lineage. Strikingly, a solitary ascidian, Halocynthia roretzi, has only a few Toll-like proteins, which, like Caenorhabditis elegans Toll, represent primitive ones before the expansion of the Toll family. Therefore, the expansion of TLR genes should have occurred earlier than fish, but not C. intestinalis, separated evolutionarily from mammals. These results infer that the appearance of the M-type innate system was completed before or concomitant with the appearance of acquired immunity. We interpret the present data to mean that the differences of TLRs identified in this study between fishes and humans may be rather peripheral, partially due to selection pressure exerted by pathogens in distinct environments.
Collapse
Affiliation(s)
- Hiroyuki Oshiumi
- Department of Immunology, Osaka Medical Center for Cancer and Cardiovascular Diseases, Higashinari-ku, Japan
| | | | | | | | | | | |
Collapse
|
28
|
Gajewski M, Voolstra C. Comparative analysis of somitogenesis related genes of the hairy/Enhancer of split class in Fugu and zebrafish. BMC Genomics 2002; 3:21. [PMID: 12160468 PMCID: PMC126015 DOI: 10.1186/1471-2164-3-21] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2002] [Accepted: 08/05/2002] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Members of a class of bHLH transcription factors, namely the hairy (h), Enhancer of split (E(spl)) and hairy-related with YRPW motif (hey) (h/E(spl)/hey) genes are involved in vertebrate somitogenesis and some of them show cycling expression. By sequence comparison, identified orthologues of cycling somitogenesis genes from higher vertebrates do not show an appropriate expression pattern in zebrafish. The zebrafish genomic sequence is not available yet but the genome of Fugu rubripes was recently published. To allow comparative analysis, the currently known Her proteins from zebrafish were used to screen the genomic sequence database of Fugu rubripes. RESULTS 20 h/E(spl)/hey-related genes were identified in Fugu, which is twice the number of corresponding zebrafish genes known so far. A novel class of c-Hairy proteins was identified in the genomes of Fugu and Tetraodon. A screen of the human genome database with the Fugu proteins yielded 10 h/E(spl)/hey-related genes. By analysing the upstream sequences of the c-hairy class genes in zebrafish, Fugu and Tetraodon highly similar sequence stretches were identified that harbour Suppressor of hairless paired binding sites (SPS). This motif was also discovered in the upstream sequences of the her1 gene in the examined fish species. Here, the Su(h) sites are separated by longer intervening sequences. CONCLUSIONS Our study indicates that not all her homologues in zebrafish have been isolated. Comparison to the human genome suggests a selective duplication of h/E(spl) genes in pufferfish or loss of members of these genes during evolution to the human lineage.
Collapse
Affiliation(s)
- Martin Gajewski
- Universitaet zu Koeln, Institut fuer Genetik, Weyertal 121, 50931 Koeln, Germany
| | - Chris Voolstra
- Universitaet zu Koeln, Institut fuer Genetik, Weyertal 121, 50931 Koeln, Germany
| |
Collapse
|
29
|
Claiborne JB, Edwards SL, Morrison-Shetlar AI. Acid-base regulation in fishes: cellular and molecular mechanisms. THE JOURNAL OF EXPERIMENTAL ZOOLOGY 2002; 293:302-19. [PMID: 12115903 DOI: 10.1002/jez.10125] [Citation(s) in RCA: 201] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The mechanisms underlying acid-base transfers across the branchial epithelium of fishes have been studied for more than 70 years. These animals are able to compensate for changes to internal pH following a wide range of acid-base challenges, and the gill epithelium is the primary site of acid-base transfers to the water. This paper reviews recent molecular, immunohistochemical, and functional studies that have begun to define the protein transporters involved in the acid-base relevant ion transfers. Both Na(+)/H(+) exchange (NHE) and vacuolar-type H(+)-ATPase transport H(+) from the fish to the environment. While NHEs have been thought to carry out this function mainly in seawater-adapted animals, these proteins have now been localized to mitochondrial-rich cells in the gill epithelium of both fresh and saltwater-adapted fishes. NHEs have been found in the gill epithelium of elasmobranchs, teleosts, and an agnathan. In several species, apical isoforms (NHE2 and NHE3) appear to be up-regulated following acidosis. In freshwater teleosts, H(+)-ATPase drives H(+) excretion and is indirectly coupled to Na(+) uptake (via Na(+) channels). It has been localized to respiratory pavement cells and chloride cells of the gill epithelium. In the marine elasmobranch, both branchial NHE and H(+)-ATPase have been identified, suggesting that a combination of these mechanisms may be utilized by marine elasmobranchs for acid-base regulation. An apically located Cl(-)/HCO(3)(-) anion exchanger in chloride cells may be responsible for base excretion in fresh and seawater-adapted fishes. While only a few species have been examined to date, new molecular approaches applied to a wider range of fishes will continue to improve our understanding of the roles of the various gill membrane transport processes in acid-base balance.
Collapse
Affiliation(s)
- James B Claiborne
- Department of Biology, Georgia Southern University, Statesboro, Georgia 30460, USA.
| | | | | |
Collapse
|
30
|
Abrahams BS, Mak GM, Berry ML, Palmquist DL, Saionz JR, Tay A, Tan YH, Brenner S, Simpson EM, Venkatesh B. Novel vertebrate genes and putative regulatory elements identified at kidney disease and NR2E1/fierce loci. Genomics 2002; 80:45-53. [PMID: 12079282 DOI: 10.1006/geno.2002.6795] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Fierce (frc) mice are deleted for nuclear receptor 2e1 (Nr2e1), and exhibit cerebral hypoplasia, blindness, and extreme aggression. To characterize the Nr2e1 locus, which may also contain the mouse kidney disease (kd) allele, we compared sequence from human, mouse, and the puffer fish Fugu rubripes. We identified a novel gene, c222389, containing conserved elements in noncoding regions. We also discovered a novel vertebrate gene conserved across its length in prokaryotes and invertebrates. Based on a dramatic upregulation in lactating breast, we named this gene lactation elevated-1 (LACE1). Two separate 100-bp elements within the first NR2E1 intron were virtually identical between the three species, despite an estimated 450 million years of divergent evolution. These elements represent strong candidates for functional NR2E1 regulatory elements in vertebrates. A high degree of conservation across NR2E1 combined with a lack of interspersed repeats suggests that an array of regulatory elements embedded within the gene is required for proper gene expression.
Collapse
Affiliation(s)
- Brett S Abrahams
- Centre for Molecular Medicine & Therapeutics, British Columbia Research Institute for Children's and Women's Health, and Department of Medical Genetics, University of British Columbia, 980 West 28(th) Avenue, Vancouver, British Columbia, V5Z 4H4, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Eggers JH, Stock M, Fliegauf M, Vonderstrass B, Otto F. Genomic characterization of the RUNX2 gene of Fugu rubripes. Gene 2002; 291:159-67. [PMID: 12095689 DOI: 10.1016/s0378-1119(02)00592-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A 105 kb Fugu rubripes genomic region containing the RUNX2 ortholog (frunx2) was sequenced and analysed. Spanning 32 kb, frunx2 is seven times smaller than its human orthologue (223 kb). By comparison of Fugu and human genomic environment a stretch of conserved synteny, comprising the neighbouring genes on both sides, was identified. Except one exon that is alternatively spliced in human RUNX2, all other seven exons could be identified in frunx2. The predicted protein sequence of frunx2 shows a high degree of sequence conservation compared with RUNX2 (83% identity). Like all human paralogues, frunx2 possesses two promoter regions separated by a large intron. Both promoter regions are conserved between the two species and contain several RUNX binding sites pointing to a self-regulatory function. Three further conserved non-coding regions were identified possibly functioning as enhancer elements for tissue-specific expression of RUNX2.
Collapse
Affiliation(s)
- Jan Henrik Eggers
- Department of Hematology and Oncology, University of Freiburg Medical Center, Hugstetterstrasse 55, D-79106 Freiburg, Germany
| | | | | | | | | |
Collapse
|
32
|
Abstract
Host defenses at the mucosal surface of the airways evolved to present many layers of protection against inhaled microbes. Normally, the intrapulmonary airways are sterile. Airway secretions contain numerous factors with antimicrobial activity that contribute to innate defenses. Many protein and peptide components exert bacteriostatic or bacteriocidal effects against a wide variety of organisms and may act in synergistic or additive combinations. The beta-defensins are a relatively recently described family of peptide antimicrobials that are widely expressed at mucosal surfaces, including airway and submucosal gland epithelia. These small cationic peptides are products of individual genes that exhibit broad-spectrum activity against bacteria, fungi, and some enveloped viruses. Their expression in airway epithelia may be constitutive or inducible by bacterial products or pro-inflammatory cytokines. beta-defensins also act as chemokines for adaptive immune cells, including immature dendritic cells and T cells via the CCR6 receptor, and provide a link between innate and adaptive immunity. Alterations in the function of the beta-defensins may contribute to disease states. Here we review much of the biology of the beta-defensins, including gene discovery, genomic organization, molecular structure, regulation of expression, and function.
Collapse
Affiliation(s)
- Brian C Schutte
- Department of Pediatrics, Genetics Ph.D. Program, University of Iowa College of Medicine, Iowa City, Iowa, USA
| | | |
Collapse
|
33
|
Smith SF, Snell P, Gruetzner F, Bench AJ, Haaf T, Metcalfe JA, Green AR, Elgar G. Analyses of the extent of shared synteny and conserved gene orders between the genome of Fugu rubripes and human 20q. Genome Res 2002; 12:776-84. [PMID: 11997344 PMCID: PMC186584 DOI: 10.1101/gr.221802] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Cosmid and BAC contig maps have been constructed across two Fugu genomic regions containing the orthologs of human genes mapping to human chromosome 20q. Contig gene contents have been assessed by sample sequencing and comparative database analyses. Contigs are centered around two Fugu topoisomerase1 (top1) genes that were initially identified by sequence similarity to human TOP1 (20q12). Two other genes (SNAI1 and KRML) mapping to human chromosome 20 are also duplicated in Fugu. The two contigs have been mapped to separate Fugu chromosomes. Our data indicate that these linkage groups result from the duplication of an ancestral chromosome segment containing at least 40 genes that now map to the long arm of human chromosome 20. Although there is considerable conservation of synteny, gene orders are not well conserved between Fugu and human, with only very short sections of two to three adjacent genes being maintained in both organisms. Comparative analyses have allowed this duplication event to be dated before the separation of Fugu and zebrafish. Our data (which are best explained by regional duplication, followed by substantial gene loss) support the hypothesis that there have been a large number of gene and regional duplications (and corresponding gene loss) in the fish lineage, possibly resulting from a single whole genome duplication event.
Collapse
Affiliation(s)
- Sarah F Smith
- Fugu Genomics, United Kingdom Human Genome Mapping Project Resource Centre, Wellcome Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SB, United Kingdom.
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Altschmied J, Delfgaauw J, Wilde B, Duschl J, Bouneau L, Volff JN, Schartl M. Subfunctionalization of duplicate mitf genes associated with differential degeneration of alternative exons in fish. Genetics 2002; 161:259-67. [PMID: 12019239 PMCID: PMC1462118 DOI: 10.1093/genetics/161.1.259] [Citation(s) in RCA: 88] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The microphthalmia-associated transcription factor (MITF) exists in at least four isoforms. These are generated in higher vertebrates using alternative 5' exons and promoters from a single gene. Two separate genes (mitf-m and mitf-b), however, are present in different teleost fish species including the poeciliid Xiphophorus, the pufferfishes Fugu rubripes and Tetraodon nigroviridis, and the zebrafish Danio rerio. Fish proteins MITF-m and MITF-b correspond at both the structural and the expression levels to one particular bird/mammalian MITF isoform. In the teleost lineage subfunctionalization of mitf genes after duplication at least 100 million years ago is associated with the degeneration of alternative exons and, probably, regulatory elements and promoters. For example, a remnant of the first exon specific for MITF-m is detected within the pufferfish gene encoding MITF-b. Retracing the evolutionary history of mitf genes in vertebrates uncovered the differential recruitment of new introns specific for either the teleost or the bird/mammalian lineage.
Collapse
Affiliation(s)
- Joachim Altschmied
- Department of Physiological Chemistry I, Biocenter (Theodor-Boveri Institute), University of Würzburg, D-97074 Würzburg, Germany
| | | | | | | | | | | | | |
Collapse
|
35
|
Bouchireb N, Grützner F, Haaf T, Stephens RJ, Elgar G, Green AJ, Clark MS. Comparative mapping of the human 9q34 region in Fugu rubripes. Cytogenet Genome Res 2002; 94:173-9. [PMID: 11856876 DOI: 10.1159/000048811] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Twenty-seven genes have been cloned and mapped in Fugu which have orthologues within the human chromosome 9q34 region. The genes are arranged into five cosmid and BAC contigs which physically map to two different Fugu chromosomes. Considering the gene content of these contigs, it is more probable that a translocation event took place early in the Fugu lineage to split the ancestral 9q34 region onto two chromosomes rather than the alternative hypothesis of a large-scale duplication of the region into two chromosomes with subsequent rapid and dramatic gene loss. There are considerable differences in gene order between the two species, which would appear to be the result of a series of complex chromosome inversions; thus suggesting that there have been no positional constraints on this particular gene set.
Collapse
Affiliation(s)
- N Bouchireb
- MRC-HGMP Resource Centre, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | | | | | | | | | | |
Collapse
|
36
|
Elphick MR. Evolution of cannabinoid receptors in vertebrates: identification of a CB(2) gene in the puffer fish Fugu rubripes. THE BIOLOGICAL BULLETIN 2002; 202:104-107. [PMID: 11971807 DOI: 10.2307/1543648] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Affiliation(s)
- Maurice R Elphick
- School of Biological Sciences, Queen Mary, University of London, London E1 4NS, UK.
| |
Collapse
|
37
|
Dalle Nogare DE, Clark MS, Elgar G, Frame IG, Poulter RTM. Xena, a full-length basal retroelement from tetraodontid fish. Mol Biol Evol 2002; 19:247-55. [PMID: 11861884 DOI: 10.1093/oxfordjournals.molbev.a004078] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Mobile genetic elements are ubiquitous throughout the eukaryote superkingdom. We have sequenced a highly unusual full-length retroelement from the Fugu fish, Takifugu rubripes. This element, which we have named Xena, is similar in structure and sequence to the Penelope retroelement from Drosophila virilis and consists of a single long open reading frame containing a reverse transcriptase domain flanked by identical direct long terminal repeat (LTR) sequences. These LTRs show an organization similar to the terminal repeats already described in the Penelope retrotransposon of Drosophila but are structurally and functionally distinct from the LTRs carried by LTR-retrotransposons. In view of their distinctness, we refer to these repeats as PLTRs (Penelope-LTRs). Whereas the element contains a reverse transcriptase, no other domains or motifs commonly associated with retroelements are present. In the full-length Fugu element, the 5' direct PLTR is preceded by an inverted PLTR fragment. Additional elements, many showing various degrees of deletion, are described from the Fugu genome and from that of the freshwater pufferfish Tetraodon nigroviridis. Many of these additional elements are also preceded by inverted PLTR sequences. Xena-like elements are also described from the genomes of several other organisms. The Penelope-Xena lineage is apparently a basal group within the retrotransposons and therefore represents an evolutionarily important class of retroelement.
Collapse
|
38
|
Morgante M, Hanafey M, Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet 2002; 30:194-200. [PMID: 11799393 DOI: 10.1038/ng822] [Citation(s) in RCA: 591] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Microsatellites are a ubiquitous class of simple repetitive DNA sequence. An excess of such repetitive tracts has been described in all eukaryotes analyzed and is thought to result from the mutational effects of replication slippage. Large-scale genomic and EST sequencing provides the opportunity to evaluate the abundance and relative distribution of microsatellites between transcribed and nontranscribed regions and the relationship of these features to haploid genome size. Although this has been studied in microbial and animal genomes, information in plants is limited. We assessed microsatellite frequency in plant species with a 50-fold range in genome size that is mostly attributable to the recent amplification of repetitive DNA. Among species, the overall frequency of microsatellites was inversely related to genome size and to the proportion of repetitive DNA but remained constant in the transcribed portion of the genome. This indicates that most microsatellites reside in regions pre-dating the recent genome expansion in many plants. The microsatellite frequency was higher in transcribed regions, especially in the untranslated portions, than in genomic DNA. Contrary to previous reports suggesting a preferential mechanism for the origin of microsatellites from repetitive DNA in both animals and plants, our findings show a significant association with the low-copy fraction of plant genomes.
Collapse
Affiliation(s)
- Michele Morgante
- E.I. duPont de Nemours and Co., DuPont Agriculture & Nutrition-Molecular Genetics, Newark, Delaware, USA
| | | | | |
Collapse
|
39
|
Taylor MS, Semple CAM. Sushi gets serious: the draft genome sequence of the pufferfish Fugu rubripes. Genome Biol 2002; 3:reviews1025. [PMID: 12225591 PMCID: PMC139409 DOI: 10.1186/gb-2002-3-9-reviews1025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The publication of the Fugu rubripes draft genome sequence will take this fish from culinary delicacy to potent tool in deciphering the mysteries of human genome function.
Collapse
Affiliation(s)
- Martin S Taylor
- MRC Human Genetics Unit, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK
| | - Colin AM Semple
- MRC Human Genetics Unit, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK
| |
Collapse
|
40
|
Abstract
Several recent studies of genome evolution indicate that the rate of DNA loss exceeds that of DNA gain, leading to an underlying mutational pressure towards collapsing the length of noncoding DNA. That such a collapse is not observed suggests opposing mechanisms favoring longer noncoding regions. The presence of transposable elements alone also does not explain observed features of noncoding DNA. At present, a multidisciplinary approach--using population genetics techniques, large-scale genomic analyses, and in silico evolution--is beginning to provide new and valuable insights into the forces that shape the length of noncoding DNA and, ultimately, genome size. Recombination, in a broad sense, might be the missing key parameter for understanding the observed variation in length of noncoding DNA in eukaryotes.
Collapse
Affiliation(s)
- J M Comeron
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, Illinois 60637, USA.
| |
Collapse
|
41
|
Taylor JS, Van de Peer Y, Braasch I, Meyer A. Comparative genomics provides evidence for an ancient genome duplication event in fish. Philos Trans R Soc Lond B Biol Sci 2001; 356:1661-79. [PMID: 11604130 PMCID: PMC1088543 DOI: 10.1098/rstb.2001.0975] [Citation(s) in RCA: 365] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
There are approximately 25 000 species in the division Teleostei and most are believed to have arisen during a relatively short period of time ca. 200 Myr ago. The discovery of 'extra' Hox gene clusters in zebrafish (Danio rerio), medaka (Oryzias latipes), and pufferfish (Fugu rubripes), has led to the hypothesis that genome duplication provided the genetic raw material necessary for the teleost radiation. We identified 27 groups of orthologous genes which included one gene from man, mouse and chicken, one or two genes from tetraploid Xenopus and two genes from zebrafish. A genome duplication in the ancestor of teleost fishes is the most parsimonious explanation for the observations that for 15 of these genes, the two zebrafish orthologues are sister sequences in phylogenies that otherwise match the expected organismal tree, the zebrafish gene pairs appear to have been formed at approximately the same time, and are unlinked. Phylogenies of nine genes differ a little from the tree predicted by the fish-specific genome duplication hypothesis: one tree shows a sister sequence relationship for the zebrafish genes but differs slightly from the expected organismal tree and in eight trees, one zebrafish gene is the sister sequence to a clade which includes the second zebrafish gene and orthologues from Xenopus, chicken, mouse and man. For these nine gene trees, deviations from the predictions of the fish-specific genome duplication hypothesis are poorly supported. The two zebrafish orthologues for each of the three remaining genes are tightly linked and are, therefore, unlikely to have been formed during a genome duplication event. We estimated that the unlinked duplicated zebrafish genes are between 300 and 450 Myr. Thus, genome duplication could have provided the genetic raw material for teleost radiation. Alternatively, the loss of different duplicates in different populations (i.e. 'divergent resolution') may have promoted speciation in ancient teleost populations.
Collapse
Affiliation(s)
- J S Taylor
- Department of Biology, University of Konstanz, 78457, Konstanz, Germany
| | | | | | | |
Collapse
|
42
|
Meyers BC, Tingey SV, Morgante M. Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome. Genome Res 2001; 11:1660-76. [PMID: 11591643 PMCID: PMC311155 DOI: 10.1101/gr.188201] [Citation(s) in RCA: 329] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Long terminal repeat (LTR) retrotransposons have been shown to make up much of the maize genome. Although these elements are known to be prevalent in plant genomes of a middle-to-large size, little information is available on the relative proportions composed by specific families of elements in a single genome. We sequenced a library of randomly sheared genomic DNA from maize to characterize this genome. BLAST analysis of these sequences demonstrated that the maize genome is composed of diverse sequences that represent numerous families of retrotransposons. The largest families contain the previously described elements Huck, Ji, and Opie. Approximately 5% of the sequences are predicted to encode proteins. The genomic abundance of 16 families of elements was estimated by hybridization to an array of 10,752 maize bacterial artificial chromosome (BAC) clones. Comparisons of the number of elements present on individual BACs indicated that retrotransposons are in general randomly distributed across the maize genome. A second library was constructed that was selected to contain sequences hypomethylated in the maize genome. Sequence analysis of this library indicated that retroelements abundant in the genome are poorly represented in hypomethylated regions. Fifty-six retroelement sequences corresponding to the integrase and reverse transcriptase domains were isolated from approximately 407,000 maize expressed sequence tags (ESTs). Phylogenetic analysis of these and the genomic retroelement sequences indicated that elements most abundant in the genome are less abundant at the transcript level than are more rare retrotransposons. Additional phylogenies also demonstrated that rice and maize retrotransposon families are frequently more closely related to each other than to families within the same species. An analysis of the GC content of the maize genomic library and that of maize ESTs did not support recently published data that the gene space in maize is found within a narrow GC range, but does indicate that genic sequences have a higher GC content than intergenic sequences (52% vs. 47% GC).
Collapse
Affiliation(s)
- B C Meyers
- E.I. duPont de Nemours and Company, DuPont Crop Genetics-Genomics, Newark, Delaware 19714-6104, USA
| | | | | |
Collapse
|
43
|
Amemiya CT, Amores A, Ota T, Mueller G, Garrity D, Postlethwait JH, Litman GW. Generation of a P1 artificial chromosome library of the Southern pufferfish. Gene 2001; 272:283-9. [PMID: 11470535 DOI: 10.1016/s0378-1119(01)00525-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We describe the generation of a P1 artificial chromosome genomic library from the Southern pufferfish, Spheroides nephelus. The arrayed library consists of approximately 30,000 clones and has an average insert size of 125-150 kb. The coverage is estimated to encompass seven to eight genome equivalents. The library has been used for isolating numerous genomic clones and for establishing contigs of several multigene families. Analysis of several of the clones from this library suggests a preponderance of CA repeat tracts relative to their abundance in humans. The library and high-density filters have been made available to the scientific public through genomics distribution companies.
Collapse
Affiliation(s)
- C T Amemiya
- Center for Human Genetics, Boston University School of Medicine, 715 Albany Street W408, Boston, MA, USA.
| | | | | | | | | | | | | |
Collapse
|
44
|
Abstract
The human SART1 gene was initially identified in a screen for proteins recognised by IgE, which may be implicated in atopic disease. We have examined the genomic structure and cDNA sequence of the SART1 gene in the compact genomes of the pufferfish Fugu rubripes and Tetraodon nigroviridis. The entire coding regions of both the Fugu and Tetraodon SART1 genes are contained within single exons. The Fugu gene contains only one intron located in the 5' untranslated region. Southern blot hybridisation of Fugu genomic DNA confirmed the SART1 gene to be single copy. Partial genomic structures were also determined for the human, mouse, Drosophila and C. elegans SART1 homologues. The human and mouse genes both contain many introns in the coding region, the human gene possessing at least 20 exons. The Drosophila and C. elegans homologues contain 6 and 12 exons, respectively. This is only the second time such a difference in the organization of homologous Fugu and human genes has been reported. The Fugu and Tetraodon SART1 genes encode putative proteins of 772 and 774 aa, respectively, each having 65% amino acid identity to human SART1. Leucine zipper and basic motifs are conserved in the predicted Fugu and Tetraodon proteins.
Collapse
Affiliation(s)
- D J Bolland
- Institute of Genetics, Queen's Medical Centre, University of Nottingham, Nottingham, NG7 2UH, UK
| | | |
Collapse
|
45
|
Barton LM, Gottgens B, Gering M, Gilbert JG, Grafham D, Rogers J, Bentley D, Patient R, Green AR. Regulation of the stem cell leukemia (SCL) gene: a tale of two fishes. Proc Natl Acad Sci U S A 2001; 98:6747-52. [PMID: 11381108 PMCID: PMC34424 DOI: 10.1073/pnas.101532998] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2000] [Indexed: 11/18/2022] Open
Abstract
The stem cell leukemia (SCL) gene encodes a tissue-specific basic helix-loop-helix (bHLH) protein with a pivotal role in hemopoiesis and vasculogenesis. Several enhancers have been identified within the murine SCL locus that direct reporter gene expression to subdomains of the normal SCL expression pattern, and long-range sequence comparisons of the human and murine SCL loci have identified additional candidate enhancers. To facilitate the characterization of regulatory elements, we have sequenced and analyzed 33 kb of the SCL genomic locus from the pufferfish Fugu rubripes, a species with a highly compact genome. Although the pattern of SCL expression is highly conserved from mammals to teleost fish, the genes flanking pufferfish SCL were unrelated to those known to flank both avian and mammalian SCL genes. These data suggest that SCL regulatory elements are confined to the region between the upstream and downstream flanking genes, a region of 65 kb in human and 8.5 kb in pufferfish. Consistent with this hypothesis, the entire 33-kb pufferfish SCL locus directed appropriate expression to hemopoietic and neural tissue in transgenic zebrafish embryos, as did a 10.4-kb fragment containing the SCL gene and extending to the 5' and 3' flanking genes. These results demonstrate the power of combining the compact genome of the pufferfish with the advantages that zebrafish provide for studies of gene regulation during development. Furthermore, the pufferfish SCL locus provides a powerful tool for the manipulation of hemopoiesis and vasculogenesis in vivo.
Collapse
Affiliation(s)
- L M Barton
- Department of Hematology, Cambridge Institute for Medical Research, University of Cambridge, Addenbrookes Hospital, Cambridge CB2 2XY, United Kingdom
| | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Smith SF, Metcalfe JA, Elgar G. Characterisation of two topoisomerase 1 genes in the pufferfish (Fugu rubripes). Gene 2001; 265:195-204. [PMID: 11255022 DOI: 10.1016/s0378-1119(01)00366-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Eukaryotic DNA topoisomerase I manipulates the higher order structures of DNA. Only one functional topoisomerase 1 (top1) gene has previously been identified in any individual eukaryotic species. Here we report the identification and characterisation of two top1 genes in the pufferfish, Fugu rubripes. This shows that the copy number of top1, like that of other topoisomerases, may vary between eukaryotes. Both Fugu genes have 21 exons; a gene structure similar to that of human TOP1. Despite this conservation of structure, and some non-coding elements, both genes are less than a tenth of the size of the human gene. Sequence and phylogenetic analyses have shown that this duplication is ancient and also affects other species in the fish lineage.
Collapse
Affiliation(s)
- S F Smith
- Fugu Genomics Group, UK Human Genome Mapping Project Resource Centre, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, CB10 1SB, Cambridge, UK
| | | | | |
Collapse
|
47
|
Brunner B, Grützner F, Yaspo ML, Ropers HH, Haaf T, Kalscheuer VM. Molecular cloning and characterization of the Fugu rubripes MEST/COPG2 imprinting cluster and chromosomal localization in Fugu and Tetraodon nigroviridis. Chromosome Res 2001; 8:465-76. [PMID: 11032317 DOI: 10.1023/a:1009263504671] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We isolated Fugu genomic clones using the human MEST (Mesoderm-Specific Transcript) cDNA as probe. Sequence analysis revealed the presence of MEST and three additional genes which show homology to plant DNBP (DNA-Binding Protein), vertebrate COPG2 (Coat Protein Gamma 2), as well as to human and mouse UCN (Urocortin). Structures of Fugu and human MEST, COPG2 and UCN genes are very similar. Since MEST and COPG2 are neighboring genes on human chromosome 7q32, we can conclude that we identified their orthologs and that linkage of these genes is evolutionarily conserved in vertebrates. Unlike human MEST which underlies isoform-specific imprinting and is methylated in a parent-of-origin-specific fashion, the CpG island of the Fugu ortholog is completely methylated. The translation start of Fugu MEST is identical to the non-imprinted human isoform which is in good agreement with the assumption that genomic imprinting is restricted to mammals. Comparative mapping of these genes by fluorescence in-situ hybridization to metaphase chromosomes of Fugu rubripes and Tetraodon nigroviridis showed clear signals on one of the smallest acrocentric chromosomal pairs, which in Fugu, can be easily classified by its unique triangular shape.
Collapse
Affiliation(s)
- B Brunner
- Human Genetics, University Hospital Nijmegen, The Netherlands
| | | | | | | | | | | |
Collapse
|
48
|
Agüero F, Verdún RE, Frasch AC, Sánchez DO. A random sequencing approach for the analysis of the Trypanosoma cruzi genome: general structure, large gene and repetitive DNA families, and gene discovery. Genome Res 2000; 10:1996-2005. [PMID: 11116094 PMCID: PMC313047 DOI: 10.1101/gr.gr-1463r] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2000] [Accepted: 09/20/2000] [Indexed: 11/24/2022]
Abstract
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in approximately 4.3 Mb of readable sequences or approximately 10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with approximately 35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss. html.
Collapse
Affiliation(s)
- F Agüero
- Instituto de Investigaciones Biotecnológicas, Instituto Tecnológico de Chascomús, Universidad Nacional de General San Martín, Consejo Nacional de Investigaciones Científicas y Técnicas, San Martín, Provincia de Buenos Aires, Argentina
| | | | | | | |
Collapse
|
49
|
Woods IG, Kelly PD, Chu F, Ngo-Hazelett P, Yan YL, Huang H, Postlethwait JH, Talbot WS. A comparative map of the zebrafish genome. Genome Res 2000; 10:1903-14. [PMID: 11116086 PMCID: PMC313070 DOI: 10.1101/gr.10.12.1903] [Citation(s) in RCA: 305] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Zebrafish mutations define the functions of hundreds of essential genes in the vertebrate genome. To accelerate the molecular analysis of zebrafish mutations and to facilitate comparisons among the genomes of zebrafish and other vertebrates, we used a homozygous diploid meiotic mapping panel to localize polymorphisms in 691 previously unmapped genes and expressed sequence tags (ESTs). Together with earlier efforts, this work raises the total number of markers scored in the mapping panel to 2119, including 1503 genes and ESTs and 616 previously characterized simple-sequence length polymorphisms. Sequence analysis of zebrafish genes mapped in this study and in prior work identified putative human orthologs for 804 zebrafish genes and ESTs. Map comparisons revealed 139 new conserved syntenies, in which two or more genes are on the same chromosome in zebrafish and human. Although some conserved syntenies are quite large, there were changes in gene order within conserved groups, apparently reflecting the relatively frequent occurrence of inversions and other intrachromosomal rearrangements since the divergence of teleost and tetrapod ancestors. Comparative mapping also shows that there is not a one-to-one correspondence between zebrafish and human chromosomes. Mapping of duplicate gene pairs identified segments of 20 linkage groups that may have arisen during a genome duplication that occurred early in the evolution of teleosts after the divergence of teleost and mammalian ancestors. This comparative map will accelerate the molecular analysis of zebrafish mutations and enhance the understanding of the evolution of the vertebrate genome.
Collapse
Affiliation(s)
- I G Woods
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, California 94305, USA
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Postlethwait JH, Woods IG, Ngo-Hazelett P, Yan YL, Kelly PD, Chu F, Huang H, Hill-Force A, Talbot WS. Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res 2000; 10:1890-902. [PMID: 11116085 DOI: 10.1101/gr.164800] [Citation(s) in RCA: 468] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
To help understand mechanisms of vertebrate genome evolution, we have compared zebrafish and tetrapod gene maps. It has been suggested that translocations are fixed more frequently than inversions in mammals. Gene maps showed that blocks of conserved syntenies between zebrafish and humans were large, but gene orders were frequently inverted and transposed. This shows that intrachromosomal rearrangements have been fixed more frequently than translocations. Duplicated chromosome segments suggest that a genome duplication occurred in ray-fin phylogeny, and comparative studies suggest that this event happened deep in the ancestry of teleost fish. Consideration of duplicate chromosome segments shows that at least 20% of duplicated gene pairs may be retained from this event. Despite genome duplication, zebrafish and humans have about the same number of chromosomes, and zebrafish chromosomes are mosaically orthologous to several human chromosomes. Is this because of an excess of chromosome fissions in the human lineage or an excess of chromosome fusions in the zebrafish lineage? Comparative analysis suggests that an excess of chromosome fissions in the tetrapod lineage may account for chromosome numbers and provides histories for several human chromosomes.
Collapse
MESH Headings
- Animals
- Chromosome Mapping
- Chromosomes/genetics
- Chromosomes, Human, Pair 10/genetics
- Chromosomes, Human, Pair 11/genetics
- Chromosomes, Human, Pair 15/genetics
- Chromosomes, Human, Pair 17/genetics
- Chromosomes, Human, Pair 19/genetics
- Chromosomes, Human, Pair 9/genetics
- Evolution, Molecular
- Gene Duplication
- Genetic Linkage/genetics
- Genetic Markers
- Genome
- Humans
- Mice
- Models, Genetic
- Zebrafish/genetics
Collapse
Affiliation(s)
- J H Postlethwait
- Institute of Neuroscience, University of Oregon, Eugene, Oregon 97403, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|