151
|
Yap WH, Yeoh E, Tay A, Brenner S, Venkatesh B. STAT4 is a target of the hematopoietic zinc-finger transcription factor Ikaros in T cells. FEBS Lett 2005; 579:4470-8. [PMID: 16081070 DOI: 10.1016/j.febslet.2005.07.018] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2005] [Revised: 07/10/2005] [Accepted: 07/13/2005] [Indexed: 11/16/2022]
Abstract
STAT4 is a transcription factor activated in response to IL-12, and is involved in Th1 cell development. The molecular mechanisms controlling the transcription of the STAT4 gene are however, unclear. Sequence comparison of the 5' flanking regions of human, mouse and pufferfish (Fugu rubripes) Stat4 genes revealed a high frequency of Ikaros (Ik) binding elements in all three species. We then investigated the role of Ik binding elements in the human STAT4 promoter using Jurkat T cells. Transactivation, electrophoretic mobility shift assay and RNA interference-mediated gene knockdown experiments revealed that Ik is involved in the regulation of STAT4 in human T cells.
Collapse
|
152
|
Walter K, Abnizova I, Elgar G, Gilks WR. Striking nucleotide frequency pattern at the borders of highly conserved vertebrate non-coding sequences. Trends Genet 2005; 21:436-40. [PMID: 15979195 DOI: 10.1016/j.tig.2005.06.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2005] [Revised: 04/11/2005] [Accepted: 06/03/2005] [Indexed: 11/18/2022]
Abstract
In a recent study, 1373 highly conserved non-coding elements (CNEs) were detected by aligning the human and Takifugu rubripes (Fugu) genomes. The remarkable degree of sequence conservation in CNEs compared with their surroundings suggested comparing the base composition within CNEs with their 5' and 3' flanking regions. The analysis reveals a novel, sharp and distinct signal of nucleotide frequency bias precisely at the border between CNEs and flanking regions.
Collapse
|
153
|
Dehal P, Boore JL. Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol 2005; 3:e314. [PMID: 16128622 PMCID: PMC1197285 DOI: 10.1371/journal.pbio.0030314] [Citation(s) in RCA: 1029] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2004] [Accepted: 07/08/2005] [Indexed: 11/18/2022] Open
Abstract
The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish-tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage.
Collapse
|
154
|
Wan L, Almers W, Chen W. Two ribeye genes in teleosts: the role of Ribeye in ribbon formation and bipolar cell development. J Neurosci 2005; 25:941-9. [PMID: 15673675 PMCID: PMC6725632 DOI: 10.1523/jneurosci.4657-04.2005] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Ribeye is the only known protein specific to synaptic ribbon, but its function is unclear. We show that the teleost fish, Fugu and zebrafish, have two ribeye genes, ribeye a and ribeye b. Whole-mount in situ hybridization revealed that ribeye a is expressed in tissues containing synaptic ribbons, including the pineal gland, inner ear, and retina. Ribeye b is absent in the pineal gland. In the retina, ribeye a is expressed in both photoreceptors and bipolar cells, whereas ribeye b is detected only in photoreceptors. To study the function of Ribeye a in retina, we depleted it by morpholino antisense oligos. Fish deficient in Ribeye a lack an optokinetic response and have shorter synaptic ribbons in photoreceptors and fewer synaptic ribbons in bipolar cells. Their bipolar cells still target Syntaxin-3 proteins to the inner plexiform layer and have abundant vsx1 mRNA. However, they lack large synaptic terminals and show increased apoptosis. Rod bipolar cells are fewer in number and/or deficient in PKCalpha. Recovery of Ribeye a levels rescues the optokinetic response, increases the number of PKCalpha-positive bipolar cells, and stops apoptosis. We conclude that Ribeye a is important for late steps in bipolar cell development.
Collapse
|
155
|
Bates MD, Wells JM, Venkatesh B. Comparative genomics of the Hlx homeobox gene and protein: conservation of structure and expression from fish to mammals. Gene 2005; 352:45-56. [PMID: 15935575 DOI: 10.1016/j.gene.2005.03.021] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2004] [Revised: 02/08/2005] [Accepted: 03/01/2005] [Indexed: 11/24/2022]
Abstract
Hlx is a homeobox transcription factor gene that is expressed in intestinal and hepatic mesenchyme of the developing mouse embryo and is essential for normal intestinal and hepatic development. Because of the morphological and molecular similarities in the development of the digestive system across species, we hypothesized that the Hlx gene and protein sequences and expression patterns would be conserved among vertebrates. Comparison of the Hlx gene orthologues of human, chimpanzee, mouse, rat, pufferfish (Fugu) and zebrafish demonstrates that these six genes share an identical organization with four exons and three introns. Comparison of the inferred Hlx protein sequences from these and three additional species (chick, Spanish ribbed newt and rainbow trout) reveals significant sequence identity, with identical homeodomains. The expression of Hlx in the mesenchyme of developing chick embryos is highly similar to that of mouse. Fugu Hlx is expressed in a tissue-specific manner that is similar though not identical to that of mouse, suggesting a conservation of Hlx function between mammals and birds. The mammalian and fish Hlx genes share a putative 5' upstream enhancer as well as an inverted repeat containing CCAAT boxes on opposite strands that we have previously shown to be important for mouse Hlx gene expression. These results suggest that the function of Hlx and the mechanisms regulating its expression are highly conserved in mammals, birds, amphibians and fish.
Collapse
|
156
|
Ohtani M, Miyadai T, Hiroishi S. Molecular cloning of the BCL-6 gene, a transcriptional repressor for B-cell differentiation, in torafugu (Takifugu rubripes). Mol Immunol 2005; 43:1047-53. [PMID: 16076494 DOI: 10.1016/j.molimm.2005.06.036] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2005] [Indexed: 11/24/2022]
Abstract
B-cell lymphoma-6 (BCL-6) is a transcriptional repressor that prevents the terminal differentiation of mature B-cells to plasma cells, and is essential for germinal center formation in the primary lymphoid organs of mammals. In this study, we identified the BCL-6 gene in torafugu (Takifugu rubripes) using the torafugu genome database, and analyzed the expression of BCL-6 mRNA in various tissues of torafugu, using RT-PCR. The BCL-6 gene consisted of eight exons and seven introns spanning a genome of ca. 3.3 kb. BCL-6 mRNA contained a 2112 bp open reading frame encoding 703 amino acids, with a predicted protein size of 78.8 kDa. The predicted torafugu BCL-6 primary structure contains two conserved specific motifs, the BTB/POZ domain at the N-terminus and the sixC2H2-type zinc finger motifs at the C-terminal region. The homology of torafugu BCL-6 to those of zebrafish (Danio rerio), Xenopus laevis, mouse (Mus musculus) and human (Homo sapiens) is 76, 59, 60 and 60%, respectively. RT-PCR analysis revealed that BCL-6 mRNA is highly expressed in pronephros, thymus, intestine, ovary, brain, nasal cavity and muscle. These results imply that torafugu BCL-6 is involved in regulation of B-cell differentiation in torafugu.
Collapse
|
157
|
Itoi S, Misaki R, Hirayama M, Nakaniwa M, Liang CS, Kondo H, Watabe S. Identification of three isoforms for mitochondrial adenine nucleotide translocator in the pufferfish Takifugu rubripes. Mitochondrion 2005; 5:162-72. [PMID: 16050982 DOI: 10.1016/j.mito.2005.01.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2004] [Revised: 12/20/2004] [Accepted: 01/26/2005] [Indexed: 11/21/2022]
Abstract
Three adenine nucleotide translocator (ANT) genes were identified through in silico data mining of the Fugu genome database along with isolation of their corresponding cDNAs in vivo from the pufferfish (Takifugu rubripes). As a result of phylogenetic analysis, the ANT gene on scaffold_254 corresponded to mammalian ANT1, whereas both of those on scaffold_6 and scaffold_598 to mammalian ANT3. The ANT gene encoded by scaffold_6 was expressed ubiquitously in various tissues, whereas the ANT genes encoded by scaffold_254 and scaffold_598 were predominantly expressed in skeletal muscle and heart, respectively.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Base Sequence
- Cloning, Molecular
- Computational Biology
- Conserved Sequence
- DNA, Complementary
- Genome
- Isoenzymes/chemistry
- Isoenzymes/genetics
- Isoenzymes/isolation & purification
- Mitochondria, Heart/enzymology
- Mitochondria, Muscle/enzymology
- Mitochondrial ADP, ATP Translocases/chemistry
- Mitochondrial ADP, ATP Translocases/genetics
- Mitochondrial ADP, ATP Translocases/isolation & purification
- Molecular Sequence Data
- Nucleic Acid Amplification Techniques
- Phylogeny
- Polymerase Chain Reaction
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Takifugu/genetics
- Tissue Distribution
- Transcription, Genetic
Collapse
|
158
|
Roach JC, Glusman G, Rowen L, Kaur A, Purcell MK, Smith KD, Hood LE, Aderem A. The evolution of vertebrate Toll-like receptors. Proc Natl Acad Sci U S A 2005; 102:9577-82. [PMID: 15976025 PMCID: PMC1172252 DOI: 10.1073/pnas.0502272102] [Citation(s) in RCA: 832] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The complete sequences of Takifugu Toll-like receptor (TLR) loci and gene predictions from many draft genomes enable comprehensive molecular phylogenetic analysis. Strong selective pressure for recognition of and response to pathogen-associated molecular patterns has maintained a largely unchanging TLR recognition in all vertebrates. There are six major families of vertebrate TLRs. This repertoire is distinct from that of invertebrates. TLRs within a family recognize a general class of pathogen-associated molecular patterns. Most vertebrates have exactly one gene ortholog for each TLR family. The family including TLR1 has more species-specific adaptations than other families. A major family including TLR11 is represented in humans only by a pseudogene. Coincidental evolution plays a minor role in TLR evolution. The sequencing phase of this study produced finished genomic sequences for the 12 Takifugu rubripes TLRs. In addition, we have produced >70 gene models, including sequences from the opossum, chicken, frog, dog, sea urchin, and sea squirt.
Collapse
|
159
|
Kai W, Kikuchi K, Fujita M, Suetake H, Fujiwara A, Yoshiura Y, Ototake M, Venkatesh B, Miyaki K, Suzuki Y. A genetic linkage map for the tiger pufferfish, Takifugu rubripes. Genetics 2005; 171:227-38. [PMID: 15972462 PMCID: PMC1456513 DOI: 10.1534/genetics.105.042051] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The compact genome of the tiger pufferfish, Takifugu rubripes (fugu), has been sequenced to the "draft" level and annotated to identify all the genes. However, the assembly of the draft genome sequence is highly fragmented due to the lack of a genetic or a physical map. To determine the long-range linkage relationship of the sequences, we have constructed the first genetic linkage map for fugu. The maps for the male and female spanning 697.1 and 1213.5 cM, respectively, were arranged into 22 linkage groups by markers heterozygous in both parents. The resulting map consists of 200 microsatellite loci physically linked to genome sequences spanning approximately 39 Mb in total. Comparisons of the genome maps of fugu, other teleosts, and mammals suggest that syntenic relationship is more conserved in the teleost lineage than in the mammalian lineage. Map comparisons also show a pufferfish lineage-specific rearrangement of the genome resulting in colocalization of two Hox gene clusters in one linkage group. This map provides a foundation for development of a complete physical map, a basis for comparison of long-range linkage of genes with other vertebrates, and a resource for mapping loci responsible for phenotypic differences among Takifugu species.
Collapse
|
160
|
Poulin F, Nobrega MA, Plajzer-Frick I, Holt A, Afzal V, Rubin EM, Pennacchio LA. In vivo characterization of a vertebrate ultraconserved enhancer. Genomics 2005; 85:774-81. [PMID: 15885503 DOI: 10.1016/j.ygeno.2005.03.003] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2005] [Revised: 03/04/2005] [Accepted: 03/07/2005] [Indexed: 02/03/2023]
Abstract
Genomic sequence comparisons among human, mouse, and pufferfish (Takifugu rubripes (Fugu)) have revealed a set of extremely conserved noncoding sequences. While this high degree of sequence conservation suggests severe evolutionary constraint and predicts a lack of tolerance to change to retain in vivo functionality, such elements have been minimally explored experimentally. In this study, we describe the in-depth characterization of an ancient conserved enhancer, Dc2, located near the dachshund gene, which displays a human-Fugu identity of 84% over 424 basepairs (bp). In addition to this large overall conservation, we find that Dc2 is characterized by the presence of a large block of sequence (144 bp) that is completely identical among human, mouse, chicken, zebrafish, and Fugu. Through the testing of reporter vector constructs in transgenic mice, we observed that the 424-bp Dc2-conserved element is necessary and sufficient for brain tissue enhancer activity. In vivo analyses also revealed that the 144-bp 100% conserved sequence is necessary, but not sufficient, to replicate Dc2 enhancer function. However, the introduction of two separate 16-bp insertions into the highly conserved enhancer core did not cause any detectable modification of its in vivo activity. Our observations indicate that the 144-bp 100% conserved element is tolerant of change at least at the resolution of this transgenic mouse assay and suggest that purifying selection on the Dc2 sequence might not be as strong as we predicted or that some unknown property also constrains this highly conserved enhancer sequence.
Collapse
|
161
|
Baroukh N, Ahituv N, Chang J, Shoukry M, Afzal V, Rubin EM, Pennacchio LA. Comparative genomic analysis reveals a distant liver enhancer upstream of the COUP-TFII gene. Mamm Genome 2005; 16:91-5. [PMID: 15859353 DOI: 10.1007/s00335-004-2442-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
COUP-TFII is a central nuclear hormone receptor that tightly regulates the expression of numerous target lipid metabolism genes in vertebrates. However, it remains unclear how COUP-TFII itself is transcriptionally controlled since studies with its promoter and upstream region fail to recapitulate the gene's liver expression. In an attempt to identify liver enhancers in the vicinity of COUP-TFII, we employed a comparative genomic approach. Initial comparisons between humans and mice of the 3470-kb gene-poor region surrounding COUP-TFII revealed 2023 conserved noncoding elements. To prioritize a subset of these elements for functional studies, we performed further genomic comparisons with the orthologous pufferfish (Fugu rubripes) locus and uncovered two anciently conserved noncoding sequences (CNS) upstream of COUP-TFII (CNS-62kb and CNS-66kb). Testing these two elements using reporter constructs in liver cells (HepG2) revealed that CNS-66kb, but not CNS-62kb, yielded robust in vitro enhancer activity. In addition, an in vivo reporter assay using naked DNA transfer with CNS-66kb linked to luciferase displayed strong reproducible liver expression in adult mice, further supporting its role as a liver enhancer. Together, these studies further support the utility of comparative genomics to uncover gene regulatory sequences based on evolutionary conservation and provide the substrates to better understand the regulation and expression of COUP-TFII.
Collapse
|
162
|
Kurokawa T, Uji S, Suzuki T. Identification of cDNA coding for a homologue to mammalian leptin from pufferfish, Takifugu rubripes. Peptides 2005; 26:745-50. [PMID: 15808904 DOI: 10.1016/j.peptides.2004.12.017] [Citation(s) in RCA: 164] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2004] [Revised: 12/21/2004] [Accepted: 12/21/2004] [Indexed: 11/30/2022]
Abstract
We identified cDNA coding for a homologue to mammalian leptin in puffer, Takifugu rubripes, using genomic synteny around the human leptin gene. In addition to significant sequence homologies, the puffer leptin (pLEP) displays characteristic structural features in common with mammalian leptin. The pLEP mRNA was expressed mostly in the liver that contained abundant lipids. In addition, homologues to pLEP were found in the databanks for three fish species (salmon, medaka, and Tetraodon) and two amphibians (salamander and Xenopus). The phylogenetic analysis shows rapid rates of molecular divergence among leptins from different vertebrate classes, but not between mammals and avians.
Collapse
|
163
|
Imanishi Y, Yang L, Sokal I, Filipek S, Palczewski K, Baehr W. Diversity of guanylate cyclase-activating proteins (GCAPs) in teleost fish: characterization of three novel GCAPs (GCAP4, GCAP5, GCAP7) from zebrafish (Danio rerio) and prediction of eight GCAPs (GCAP1-8) in pufferfish (Fugu rubripes). J Mol Evol 2005; 59:204-217. [PMID: 15486694 PMCID: PMC1351297 DOI: 10.1007/s00239-004-2614-y] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2003] [Accepted: 02/16/2004] [Indexed: 11/25/2022]
Abstract
The guanylate cyclase-activating proteins (GCAPs) are Ca(2+)-binding proteins of the calmodulin (CaM) gene superfamily that function in the regulation of photoreceptor guanylate cyclases (GCs). In the mammalian retina, two GCAPs (GCAP 1-2) and two transmembrane GCs have been identified as part of a complex regulatory system responsive to fluctuating levels of free Ca(2+). A third GCAP, GCAP3, is expressed in human and zebrafish (Danio rerio) retinas, and a guanylate cyclase-inhibitory protein (GCIP) has been shown to be present in frog cones. To explore the diversity of GCAPs in more detail, we searched the pufferfish (Fugu rubripes) and zebrafish (Danio rerio) genomes for GCAP-related gene sequences (fuGCAPs and zGCAPs, respectively) and found that at least five additional GCAPs (GCAP4-8) are predicted to be present in these species. We identified genomic contigs encoding fuGCAPl-8, fuGCIP, zGCAPl-5, zGCAP7 and zGCIP. We describe cloning, expression and localization of three novel GCAPs present in the zebrafish retina (zGCAP4, zGCAP5, and zGCAP7). The results show that recombinant zGCAP4 stimulated bovine rod outer segment GC in a Ca(2+)-dependent manner. RT-PCR with zGCAP specific primers showed specific expression of zGCAPs and zGCIP in the retina, while zGCAPl mRNA is also present in the brain. In situ hybridization with anti-sense zGCAP4, zGCAP5 and zGCAP7 RNA showed exclusive expression in zebrafish cone photoreceptors. The presence of at least eight GCAP genes suggests an unexpected diversity within this subfamily of Ca(2+)-binding proteins in the teleost retina, and suggests additional functions for GCAPs apart from stimulation of GC. Based on genome searches and EST analyses, the mouse and human genomes do not harbor GCAP4-8 or GCIP genes.
Collapse
|
164
|
Hikima JI, Lennard ML, Wilson MR, Miller NW, Clem LW, Warr GW. Evolution of vertebrate E protein transcription factors: comparative analysis of the E protein gene family in Takifugu rubripes and humans. Physiol Genomics 2005; 21:144-51. [PMID: 15713784 DOI: 10.1152/physiolgenomics.00312.2004] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
E proteins are essential for B lymphocyte development and function, including immunoglobulin (Ig) gene rearrangement and expression. Previous studies of B cells in the channel catfish ( Ictalurus punctatus) identified E protein homologs that are capable of binding the μE5 motif and driving a strong transcriptional response. There are three E protein genes in mammals, HEB (TCF12), E2A (TCF3), and E2-2 (TCF4). The major expressed E proteins found in catfish B cells are homologs of HEB and of E2A. Here we sought to define the complete family of E protein genes in a teleost fish, Takifugu rubripes, taking advantage of the completed genome sequence. The catfish CFEB (HEB homolog) sequence identified homologous E-protein-encoding sequences in five scaffolds in the Takifugu genome database. Detailed comparative analysis with the human genome revealed the presence of five E protein homologs in Takifugu. Single genes orthologous to HEB and to E2-2 were identified. In contrast, two members of the E2A gene family were identified in Takifugu; one of these shows the alternative processing of transcripts that identifies it as the ortholog of the E12/E47-encoding mammalian E2A gene, whereas the second Takifugu E2A gene has no predicted alternative splice products. A novel fifth E protein gene (EX) was identified in Takifugu. Phylogenetic analysis revealed four E protein branches among vertebrates: EX, E2A, HEB, and E2-2.
Collapse
|
165
|
Subramanian S, Kumar S. Gene expression intensity shapes evolutionary rates of the proteins encoded by the vertebrate genome. Genetics 2005; 168:373-81. [PMID: 15454550 PMCID: PMC1448110 DOI: 10.1534/genetics.104.028944] [Citation(s) in RCA: 196] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Natural selection leaves its footprints on protein-coding sequences by modulating their silent and replacement evolutionary rates. In highly expressed genes in invertebrates, these footprints are seen in the higher codon usage bias and lower synonymous divergence. In mammals, the highly expressed genes have a shorter gene length in the genome and the breadth of expression is known to constrain the rate of protein evolution. Here we have examined how the rates of evolution of proteins encoded by the vertebrate genomes are modulated by the amount (intensity) of gene expression. To understand how natural selection operates on proteins that appear to have arisen in earlier and later phases of animal evolution, we have contrasted patterns of mouse proteins that have homologs in invertebrate and protist genomes (Precambrian genes) with those that do not have such detectable homologs (vertebrate-specific genes). We find that the intensity of gene expression relates inversely to the rate of protein sequence evolution on a genomic scale. The most highly expressed genes actually show the lowest total number of substitutions per polypeptide, consistent with cumulative effects of purifying selection on individual amino acid replacements. Precambrian genes exhibit a more pronounced difference in protein evolutionary rates (up to three times) between the genes with high and low expression levels as compared to the vertebrate-specific genes, which appears to be due to the narrower breadth of expression of the vertebrate-specific genes. These results provide insights into the differential relationship and effect of the increasing complexity of animal body form on evolutionary rates of proteins.
Collapse
|
166
|
Neafsey DE, Hartl DL. Convergent loss of an anciently duplicated, functionally divergent RH2 opsin gene in the fugu and Tetraodon pufferfish lineages. Gene 2005; 350:161-71. [PMID: 15820147 DOI: 10.1016/j.gene.2005.02.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2004] [Revised: 01/21/2005] [Accepted: 02/22/2005] [Indexed: 11/16/2022]
Abstract
We describe the complete opsin gene families from the sequenced fugu and Tetraodon pufferfish genomes. We report the convergent loss of function of an anciently duplicated, functionally divergent RH2 or "green-sensitive" opsin gene in both pufferfish lineages, designated RH2-2. In fugu, RH2-2 apparently ceased to function very recently following a transposon-induced deletion that truncated the N-terminal 115 amino acids from the translated protein. Although a lack of frameshift or nonsense mutations in the fugu RH2-2 pseudogene suggests that the gene was lost very recently in this lineage, we were unable to detect any evidence of a selective sweep associated with the fixation of the truncated allele from population data. Interspecific comparison of the remaining fugu RH2-2 coding sequence paradoxically indicates that the gene was under strong purifying selection until the truncation occurred.
Collapse
|
167
|
Dijkstra JM, Somamoto T, Moore L, Hordvik I, Ototake M, Fischer U. Identification and characterization of a second CD4-like gene in teleost fish. Mol Immunol 2005; 43:410-9. [PMID: 16337483 DOI: 10.1016/j.molimm.2005.03.005] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2005] [Accepted: 03/04/2005] [Indexed: 11/17/2022]
Abstract
In fish, T cell subdivision is not well studied, although CD8 and CD4 homologues have been reported. This study describes a second teleost CD4-like gene, CD4-like 2 (CD4L-2). Two rainbow trout copies of this gene were found, -2a and -2b, encoding molecules sharing 81% aa identity. The 2a/2b duplication may be related to tetraploid ancestry of salmonid fishes. In the Fugu genome CD4L-2 lies head to tail with an earlier reported, very different CD4-like gene [Suetake, H., Araki, K., Suzuki, Y., 2004. Cloning, expression, and characterization of fugu CD4, the first ectothermic animal CD4. Immunogenetics 56, 368-374], which was designated CD4L-1 in the present article. The flanking genes of the Fugu CD4L-1 and CD4L-2 are reminiscent of the genes surrounding CD4 and LAG-3 in mammals. However, neither synteny nor phylogenetic analysis could decide between CD4 and LAG-3 identity for the fish CD4L genes. CD4L-1 and CD4L-2 share a tyrosine protein kinase p56(lck) binding motif in the cytoplasmic tail with CD4 but not with LAG-3. Trout CD4L-2 expression is highest in the thymus, similar to mammalian and chicken CD4, whereas Fugu CD4L-1 expression was highest in the spleen. However, CD4L-2 encodes only two IG-like domains, whereas CD4L-1, CD4 and LAG-3 encode four. The CD4-like genes 1 and 2 in fish apparently went through an evolution different from that of LAG-3 and CD4 in higher vertebrates.
Collapse
|
168
|
Savan R, Kono T, Igawa D, Sakai M. A novel tumor necrosis factor (TNF) gene present in tandem with theTNF-? gene on the same chromosome in teleosts. Immunogenetics 2005; 57:140-50. [PMID: 15759114 DOI: 10.1007/s00251-005-0768-4] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2004] [Revised: 12/07/2004] [Indexed: 10/25/2022]
Abstract
Tumor necrosis factors (TNFs) are pleiotropic cytokines implicated in inflammation, apoptosis, cell proliferation, and a general stimulation of immune system. Although the TNF/lymphotoxin (LT) locus is present in the MHC region of the chromosome in mammals, no such locus has been described from teleosts. Furthermore, in teleosts only the TNF-alpha-like gene has been cloned and analyzed. In this study, for the first time in teleosts, a similar TNF locus has also been found in zebrafish and fugu. This locus harbors a new TNF ligand in tandem with TNF-alpha gene. The fugu and zebrafish TNF locus harbors TNF-alpha and TNF-N genes that are 7.5 kb and 19 kb apart from each other, respectively. Several genes associated with the TNF/LT locus in mammals are found conserved in fish as well. Fish TNF-alpha genes cloned possess a transmembrane domain, two conserved cysteines, and a conserved TNF family signature with a four-exon and three-intron genomic structure. The newly identified TNF-N genes from teleosts are present upstream of TNF-alpha genes in the same transcriptional orientation. However, the coding regions of these genes span into five and four exons in fugu and zebrafish, respectively. The transcription sites in the promoter regions of teleost TNF ligands are fairly conserved in comparison to mammals. Expression analysis indicates a constitutive expression of both the genes in fugu, while differential expression of TNF ligands is seen in zebrafish tissues.
Collapse
|
169
|
Araki K, Suetake H, Kikuchi K, Suzuki Y. Characterization and expression analysis of CD3? and CD3?/? in fugu, Takifugu rubripes. Immunogenetics 2005; 57:158-63. [PMID: 15756549 DOI: 10.1007/s00251-005-0772-8] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2004] [Revised: 12/12/2004] [Indexed: 10/25/2022]
Abstract
CD3 is an essential component of the CD3-TCR complex. In this report, we describe the cloning, characterization, and expression analysis of the CD3varepsilon and CD3gamma/delta chain genes from fugu, Takifugu rubripes. Two distinct CD3varepsilon homologue cDNAs, designated as CD3varepsilon-1 and CD3varepsilon-2, and a CD3gamma/delta homologue cDNA were isolated from the fugu thymus. The deduced amino acid sequences of these cDNAs exhibit conserved essential CD3 chain motifs and overall structures. RT-PCR analysis demonstrated that the CD3varepsilon and CD3gamma/delta genes were expressed in lymphoid organs (e.g. thymus, head kidney, trunk kidney and spleen), mucosal tissues (gill, skin, and intestine), and peripheral blood leucocytes (PBL). The CD3 and TCRalpha genes were expressed only in the surface IgM- population, which were separated from PBL using an anti-fugu IgM monoclonal antibody. In addition, in situ hybridization confirmed that CD3-expressing cells were distributed randomly in the head kidney, trunk kidney, and spleen, but in the thymus were restricted to the lymphoid outer zone and epithelioid inner zone only. Collectively, these results suggest that CD3 molecules are useful markers for the identification of T cells in teleost fish. The present study thus provides a critical step in identifying T cells in this model organism.
Collapse
|
170
|
Abstract
It is well known that repositioning of a gene often exerts a strong impact on its own expression and whole development. Here we report the results of genome-wide analyses suggesting that repositioning may also radically change the evolutionary fate of gene duplicates. As an indicator of these changes, we used the GC content of gene pairs which originated by duplication. This indicator turned out to be duplicate-asymmetric, which means that genes in a pair differ significantly in GC content despite their apparent origin from a common ancestor. Such an asymmetry necessarily implies that after duplication two originally identical genes mutated in opposite directions-toward GC-rich and GC-poor content, respectively. In mammalian genomes, this trend is definitely associated with presumably methylated hypermutable CpG sites, and in a typical GC-asymmetric gene pair, its two member genes are embedded in GC-contrasting isochores. However, we unexpectedly found similar significant GC asymmetry in fish, fly, worm, and yeast. This means that neither methylation alone nor methylation in combination with isochores can be counted as a primary cause of the GC asymmetry; rather they represent specific realizations of some universal principle of genome evolution. Remarkably, genes from pairs with the greatest GC asymmetry tend to be on different chromosomes, suggesting that the mutational difference between gene duplicates is associated with translocation of a new gene to a different place in the genome, whereas GC symmetric pairs demonstrate the opposite tendency. A recently emerged extra gene copy is usually on the same chromosome as is its parent but quickly, by 0.05 substitution per synonymous site, either has perished or occupies a different chromosome. During this earliest posttranslocation period, the ratio of nonsynonymous/synonymous base substitutions is unusually high, suggesting a rapid adaptive evolution of novel functions. In a general context of evolution by gene duplication, our interpretation of this position-dependent GC asymmetry between duplicated genes is that evolution of redundant genes toward a new function has often been associated with their very early, postduplication repositioning in the genome, with a concomitant abrupt change in epigenetic control of tissue/stage-specific expression and an increase in the mutation rate. Of eight eukaryotic genomes studied, the most distinguished in this respect is the human genome.
Collapse
|
171
|
Sakamoto D, Kudo H, Inohaya K, Yokoi H, Narita T, Naruse K, Mitani H, Araki K, Shima A, Ishikawa Y, Imai Y, Kudo A. A mutation in the gene for delta-aminolevulinic acid dehydratase (ALAD) causes hypochromic anemia in the medaka, Oryzias latipes. Mech Dev 2005; 121:747-52. [PMID: 15210182 DOI: 10.1016/j.mod.2004.03.030] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2003] [Revised: 03/31/2004] [Accepted: 03/31/2004] [Indexed: 11/17/2022]
Abstract
A genetic screen for mutations affecting embryogenesis in the medaka, Oryzias latipes, identified a mutant, whiteout (who), that exhibited hypochromic anemia. The who mutant initially had the normal number of blood cells, but it then gradually decreased during the embryonic and larval stages. The blood cells in the who mutants show an elongated morphology and little hemoglobin activity. Genetic mapping localized who to the vicinity of a LG12 marker, olgc1. By utilizing the highly conserved synteny between medaka and pufferfish, we identified a gene for delta-aminolevulinic acid dehydratase (ALAD), which is the second enzyme in the heme synthetic pathway, as a candidate for who. We found a missense mutation in the alad gene that was tightly linked to the who phenotype, strongly suggesting that the hypochromic anemia phenotype in the who mutant is caused by a loss of the alad function. Thus, who mutants represent a model for the human disease ALAD-deficiency porphyria.
Collapse
|
172
|
Venkatesh B, Yap WH. Comparative genomics using fugu: a tool for the identification of conserved vertebrate cis-regulatory elements. Bioessays 2005; 27:100-7. [PMID: 15612032 DOI: 10.1002/bies.20134] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
With the imminent completion of the whole genome sequence of humans, increasing attention is being focused on the annotation of cis-regulatory elements in the human genome. Comparative genomics approaches based on evolutionary conservation have proved useful in the detection of conserved cis-regulatory elements. The pufferfish, Fugu rubripes, is an attractive vertebrate model for comparative genomics, by virtue of its compact genome and maximal phylogenetic distance from mammals. Fugu has lost a large proportion of nonessential DNA, and retained single orthologs for many duplicate genes that arose in the fish lineage. Non-coding sequences conserved between fugu and mammals have been shown to be functional cis-regulatory elements. Thus, fugu is a model fish genome of choice for discovering evolutionarily conserved regulatory elements in the human genome. Such evolutionarily conserved elements are likely to be shared by all vertebrates, and related to regulatory interactions fundamental to all vertebrates. The functions of these conserved vertebrate elements can be rapidly assayed in mammalian cell lines or in transgenic systems such as zebrafish/medaka and Xenopus, followed by validation of crucial elements in transgenic rodents.
Collapse
|
173
|
Guo D, Holmlund C, Henriksson R, Hedman H. The LRIG gene family has three vertebrate paralogs widely expressed in human and mouse tissues and a homolog in Ascidiacea. Genomics 2005; 84:157-65. [PMID: 15203213 DOI: 10.1016/j.ygeno.2004.01.013] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2003] [Accepted: 01/27/2004] [Indexed: 11/22/2022]
Abstract
Human LRIG1 (formerly LIG1), human LRIG2, and mouse Lrig1 (also known as Lig-1) encode integral membrane proteins. The human genes are located at chromosomes 3p14 and 1p13, which are regions frequently deleted in human cancers. We have searched for additional members of the LRIG family and by molecular cloning identified human LRIG3 and its mouse ortholog Lrig3. Human LRIG3 is located at chromosome 12q13. In silico analysis of public databases revealed a mouse Lrig2 mRNA, three LRIG homologs in the puffer fish Fugu rubripes, and one LRIG homolog in the ascidian tunicate Ciona intestinalis. The human and mouse LRIG polypeptides have the same predicted domain organization: a signal peptide, 15 tandem leucine-rich repeats with cysteine-rich N- and C-flanking domains, three immunoglobulin-like domains, a transmembrane domain, and a cytoplasmic tail. The extracellular part--especially the IgC2.2 domain, the transmembrane domain, and the membrane-proximal part of the cytoplasmic tail--are the most conserved regions. Northern blot analysis and real-time RT-PCR revealed that the three LRIG paralogs are widely expressed in human and mouse tissues. In conclusion, the LRIG gene family was found to have three widely expressed mammalian paralogs, corresponding orthologs in fish, and a homolog in Ascidiacea.
Collapse
|
174
|
Zdobnov EM, Campillos M, Harrington ED, Torrents D, Bork P. Protein coding potential of retroviruses and other transposable elements in vertebrate genomes. Nucleic Acids Res 2005; 33:946-54. [PMID: 15716312 PMCID: PMC549403 DOI: 10.1093/nar/gki236] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We suggest an annotation strategy for genes encoded by retroviruses and transposable elements (RETRA genes) based on a set of marker protein domains. Usually RETRA genes are masked in vertebrate genomes prior to the application of automated gene prediction pipelines under the assumption that they provide no selective advantage to the host. Yet, we show that about 1000 genes in four vertebrate gene sets analyzed contain at least one RETRA gene marker domain. Using the conservation of genomic neighborhood (synteny), we were able to discriminate between RETRA genes with putative functionality in the vertebrates and those that probably function only in the context of mobile elements. We identified 35 such genes in human, along with their corresponding mouse and rat orthologs; which included almost all known human genes with similarity to mobile elements. The results also imply that the vast majority of the remaining RETRA genes in current gene sets are unlikely to encode vertebrate functions. To automatically annotate RETRA genes in other vertebrate genomes, we provide as a tool a set of marker protein domains and a manually refined list of domesticated or ancestral RETRA genes for rescuing genes with vertebrate functions.
Collapse
|
175
|
Fredriksson R, Schiöth HB. The repertoire of G-protein-coupled receptors in fully sequenced genomes. Mol Pharmacol 2005; 67:1414-25. [PMID: 15687224 DOI: 10.1124/mol.104.009001] [Citation(s) in RCA: 460] [Impact Index Per Article: 24.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The superfamily of G-protein-coupled receptors (GPCRs) is one of the largest and most studied families of proteins. We created Hidden Markov Models derived from sorted groups of GPCRs from our previous detailed phylogenetic classification of human GPCRs and added several other models derived from receptors not found in mammals. We used these models to search entire Genscan data sets from 13 species whose genomes are nearly completely sequenced. We found more than 5000 unique GPCRs that were divided into 15 main groups, and the largest one, the Rhodopsin family, was subdivided into 13 subclasses. The results show that the main families in the human genome, Glutamate, Rhodopsin, Adhesion, Frizzled, and Secretin, arose before the split of nematodes from the chordate lineage. Moreover, several of the subgroups of the Rhodopsin family arose before the split of the linage leading to vertebrates. We also searched expressed sequence tag (EST) databases and identified more than 20,000 sequences that match GPCRs. Although the GPCRs represent typically 1 to 2% of the Genscan predictions, the ESTs that match GPCRs are typically only 0.01 to 0.001%, indicating that GPCRs in most of the groups are expressed at low levels. We also provide searchable data sets that may be used for annotation and further detailed analysis of the GPCR family. This study provides an extensive overview of the expansion of the gene repertoire for families and subgroups of GPCRs.
Collapse
|