201
|
Timblin B, Rehli M, Skidgel RA. Structural characterization of the human carboxypeptidase D gene and its promoter. Int Immunopharmacol 2002; 2:1907-17. [PMID: 12489804 DOI: 10.1016/s1567-5769(02)00149-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Human carboxypeptidase D (CPD) is a 180-kDa type I membrane protein with three tandem active site domains. CPD is a B-type (or kininase I-type) carboxypeptidase that cleaves C-terminal basic residues from proteins and peptides, such as Arg9 from bradykinin. The human carboxypeptidase D (CPD) gene was found to encompass approximately 88.3 kb of genomic sequence, containing 21 exons ranging in size from 65 to 1813 bp, and 21 introns ranging in size from 112 bp to 35.6 kb. Although CPD and CPM belong to the same metallocarboxypeptidase subfamily, their intron/exon structures differ significantly. Multiple transcription start sites were found in the CPD gene within a GC-rich sequence lacking the typical TATA box, but containing three GC boxes. Luciferase reporter assays with various size constructs containing the promoter region upstream of the start sites showed that it was active in three different cell lines, especially in the human hepatoma cell line HepG2 and the human monocytic cell line THP-1, which have high constitutive expression of CPD.
Collapse
Affiliation(s)
- Barbara Timblin
- Department of Pharmacology, College of Medicine, University of Illinois, Chicago, IL 60612, USA
| | | | | |
Collapse
|
202
|
Abstract
New genes that originated by various molecular mechanisms are an essential component in understanding the evolution of genetic systems. We investigated the pattern of origin of the genes created by retroposition in Drosophila. We surveyed the wholeDrosophila melanogaster genome for such new retrogenes and experimentally analyzed their functionality and evolutionary process. These retrogenes, functional as revealed by the analysis of expression, substitution, and population genetics, show a surprisingly asymmetric pattern in their origin. There is a significant excess of retrogenes that originate from the X chromosome and retropose to autosomes; new genes retroposed from autosomes are scarce. Further, we found that most of these X-derived autosomal retrogenes had evolved a testis expression pattern. These observations may be explained by natural selection favoring those new retrogenes that moved to autosomes and avoided the spermatogenesis X inactivation, and suggest the important role of genome position for the origin of new genes.[The sequence data from this study have been submitted to GenBank under accession nos. AY150701–AY150797. The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: M.-L. Wu, F. Lemeunier, and P. Gibert.]
Collapse
|
203
|
Ge W, Wu J, Zhai J, Nie Z, Lin H, Schlaepfer WW, Canete-Soler R. Binding of p190RhoGEF to a destabilizing element on the light neurofilament mRNA is competed by BC1 RNA. J Biol Chem 2002; 277:42701-5. [PMID: 12215442 DOI: 10.1074/jbc.m206635200] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The enhancement of RNA-mediated motor neuron degeneration in transgenic mice by mutating a major mRNA instability determinant in a light neurofilament (NF-L) transgene implicates cognate RNA binding factors in the pathogenesis of motor neuron degeneration. p190RhoGEF is a neuron-enriched guanine exchange factor (GEF) that binds to the NF-L-destabilizing element, to c-Jun N-terminal kinase-interactive protein-1 (JIP-1), and to 14-3-3 and may link neurofilament expression to pathways affecting neuronal homeostasis. This study was undertaken to identify additional RNA species that bind p190RhoGEF and could affect interactions of the exchange factor with NF-L transcripts. The C-terminal domain of p190RhoGEF, containing the RNA-binding site, was expressed as a glutathione S-transferase fusion protein and was used as an affinity probe to isolate interactive RNAs in rat brain extracts. As expected, NF-L mRNA was identified as an RNA specie eluted from the affinity column. In addition, BC1 RNA was also found enriched in the bound RNA fraction. BC1 is a 152-nucleotide RNA that is highly expressed but untranslated in differentiated neurons. We show that BC1 and NF-L mRNA bind to a similar site in the C-terminal domain of p190RhoGEF, and their bindings to p190RhoGEF are readily cross-competed. Moreover, we identify a novel binding site in BC1 to account for its interaction with p190RhoGEF. The findings suggest a novel role of BC1 in differentiated neurons involving RNA-protein interactions of p190RhoGEF.
Collapse
Affiliation(s)
- Weiwen Ge
- Division of Neuropathology, University of Pennsylvania Medical School, Philadelphia, Pennsylvania 19104-6100, USA
| | | | | | | | | | | | | |
Collapse
|
204
|
Thompson RF, Langford GM. Myosin superfamily evolutionary history. THE ANATOMICAL RECORD 2002; 268:276-89. [PMID: 12382324 DOI: 10.1002/ar.10160] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The superfamily of myosin proteins found in eukaryotic cells is known to contain at least 18 different classes. Members are classified based on the phylogenetic analysis of the head domains located at the amino terminus of the polypeptide. While phylogenetic relationships provide insights into the functional relatedness of myosins within and between families, the evolutionary history of the myosin superfamily is not revealed by such studies. In order to establish the evolutionary history of the superfamily, we analyzed the representation of myosin gene families in a range of organisms covering the taxonomic spectrum. The amino acid sequences of 232 myosin heavy chains, as well as 65 organisms representing the protist, plant, and animal kingdoms, were included in this study. A phylogenetic tree of organisms was constructed based on several complementary taxonomic classification schemes. The results of the analysis support an evolutionary hypothesis in which myosins II and I evolved the earliest of all the myosin groups. Myosins V and XI evolved from a common myosin II-like ancestor, but the two families diverged to either the plant (XI) or animal (V) lineage. Class VII myosin appeared fourth among the families, and classes VI and IX appeared later during the early period of metazoan radiation. Myosins III, XV, and XVIII appeared after this group, and X appeared during the formative phases of vertebrate evolution. The remaining members of the myosin superfamily (IV, VI, XII, XIII, XIV, XVI, and XVII) are limited in distribution to one or more groups of organisms. The evolutionary data permits one to predict the likelihood that myosin genes absent from a given species are either missing (not found yet because of insufficient data) or lost due to a mutation that removed the gene from an organism's lineage. In conclusion, an analysis of the evolutionary history of the myosin superfamily suggests that early-appearing myosin families function as generalists, carrying out a number of functions in a variety of cell types, while more recently evolved myosin families function as specialists and are limited to a few organisms or a few cell types within organisms.
Collapse
Affiliation(s)
- Reid F Thompson
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire 03755, USA
| | | |
Collapse
|
205
|
Locascio A, Vega S, de Frutos CA, Manzanares M, Nieto MA. Biological potential of a functional human SNAIL retrogene. J Biol Chem 2002; 277:38803-9. [PMID: 12151403 DOI: 10.1074/jbc.m205358200] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Snail genes encode zinc finger transcription factors required for the development of vertebrate and invertebrate embryos. They trigger epithelial to mesenchymal transitions (EMTs), thereby allowing epithelial cells to emigrate from their place of origin and form tissues such as the mesoderm and the neural crest. Snail genes are also involved in the EMTs responsible for the acquisition of invasiveness during tumor progression. This aspect of their activity is associated with their ability to directly repress E-cadherin transcription. Here we describe the existence of an active human Snail retrogene, inserted within an intron of a novel evolutionarily conserved gene and expressed in different human tissues and cell lines. Functional analyses in cell culture show that this retrogene maintains the potential to induce EMTs, conferring migratory and invasive properties to epithelial cells. In light of this data, we have renamed it SNAIL-like, a new player that must be considered in both physiological and pathological studies of SNAIL function in humans.
Collapse
Affiliation(s)
- Annamaria Locascio
- Instituto Cajal, Consejo Superior de Investigaciones Cientificas, Av. Doctor Arce 37, 28002 Madrid, Spain
| | | | | | | | | |
Collapse
|
206
|
Abstract
The eukaryotic genome has undergone a series of epidemics of amplification of mobile elements that have resulted in most eukaryotic genomes containing much more of this 'junk' DNA than actual coding DNA. The majority of these elements utilize an RNA intermediate and are termed retroelements. Most of these retroelements appear to amplify in evolutionary waves that insert in the genome and then gradually diverge. In humans, almost half of the genome is recognizably derived from retroelements, with the two elements that are currently actively amplifying, L1 and Alu, making up about 25% of the genome and contributing extensively to disease. The mechanisms of this amplification process are beginning to be understood, although there are still more questions than answers. Insertion of new retroelements may directly damage the genome, and the presence of multiple copies of these elements throughout the genome has longer-term influences on recombination events in the genome and more subtle influences on gene expression.
Collapse
Affiliation(s)
- Prescott L Deininger
- Tulane Cancer Center, Department of Environmental Health Sciences, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA.
| | | |
Collapse
|
207
|
Hickox DM, Gibbs G, Morrison JR, Sebire K, Edgar K, Keah HH, Alter K, Loveland KL, Hearn MTW, de Kretser DM, O'Bryan MK. Identification of a novel testis-specific member of the phosphatidylethanolamine binding protein family, pebp-2. Biol Reprod 2002; 67:917-27. [PMID: 12193403 DOI: 10.1095/biolreprod.101.001446] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022] Open
Abstract
The phosphatidylethanolamine binding proteins (pebps) are an evolutionarily conserved family of proteins recently implicated in mitogen-activated protein (MAP) kinase pathway regulation, where they are called raf kinase inhibitory proteins. Here, we describe the cloning, cellular localization, and partial characterization of a new member, pebp-2, with potential roles in male fertility. Expression data show that pebp-2 is a testis-specific 21-kDa protein found within late meiotic and haploid germ cells in a stage-specific pattern that is temporally distinct from that of pebp-1. Sequence analyses suggest that pebp-2 forms a distinct subset of the pebp family within mammals. Database analyses revealed the existence of a third subset. Analysis suggests that the specificity/regulation of the distinct pebps subsets is likely to be determined by the amino terminal 40 amino acids or the 3' untranslated region, where the majority of sequence differences occur. Protein homology modeling suggests that pebp-2 protein is, however, topologically similar to other pebps and composed of Greek key fold motifs, a dominant beta-sheet formed from five anti-parallel beta strands forming a shallow groove associated with a putative phosphatidylethanolamine binding site. The pebp-2 gene is intronless and data suggest that it is a retrogene derived from pebp-1. Further, pebp-2 colocalizes with members of the MAP kinase pathway in late spermatocytes and spermatids and on the midpiece of epididymal sperm. These data raise the possibility that pebp-2 is a novel participant in the MAP kinase signaling pathway, with a role in spermatogenesis or posttesticular sperm maturation.
Collapse
Affiliation(s)
- Deborah M Hickox
- Monash Institute of Reproduction and Development, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Clayton, Victoria 3168, Australia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
208
|
Abstract
The heterogeneous, short RNAs produced from the high, copy, short mobile elements (SINEs) interact with proteins to form RNA-protein (RNP) complexes. In particular, the BC1 RNA, which is transcribed to high levels specifically in brain and testis from one locus of the ID SINE family, exists as a discrete RNP complex. We expressed a series of altered BC1, and other SINE-related RNAs, in several cell lines and tested for the mobility of the resulting RNP complexes in a native PAGE assay to determine which portions of these SINE RNAs contribute to protein binding. When different SINE RNAs were substituted for the BC1 ID sequence, the resulting RNPs exhibited the same mobility as BC1. This indicates that the protein(s) binding to the ID portion of BC1 is not sequence specific and may be more dependent upon the secondary structure of the RNA. It also suggests that all SINE RNAs may bind a similar set of cellular proteins. Deletion of the A-rich region of BC1 RNA has a marked effect on the mobility of the RNP. Rodent cell lines exhibit a slightly different mobility for this shifted complex when compared to human cell lines, reflecting evolutionary differences in one or more of the protein components. On the basis of mobility change observed in RNP complexes when the A-rich region is removed, we decided to examine poly(A) binding protein (PABP) as a candidate member of the RNP. An antibody against the C terminus of PABP is able to immunoprecipitate BC1 RNA, confirming PABP's presence in the BC1 RNP. Given the ubiquitous role of poly(A) regions in the retrotransposition process, these data suggest that PABP may contribute to the SINE retrotransposition process.
Collapse
Affiliation(s)
- Neva West
- Tulane Cancer Center, SL-66, Department of Environmental Health Sciences, Tulane University Health Sciences Center, 1430 Tulane Avenue, New Orleans, LA 70112, USA
| | | | | | | | | |
Collapse
|
209
|
Hilgard P, Huang T, Wolkoff AW, Stockert RJ. Translated Alu sequence determines nuclear localization of a novel catalytic subunit of casein kinase 2. Am J Physiol Cell Physiol 2002; 283:C472-83. [PMID: 12107057 DOI: 10.1152/ajpcell.00070.2002] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Casein kinase 2 (CK2) is a tetrameric enzyme constitutively expressed in all eukaryotic tissues. The two known isoforms of the catalytic subunit, CK2alpha and CK2alpha', have been reported to have distinct tissue-dependent subcellular distributions. We recently described a third isoform of the catalytic subunit, designated CK2alpha", which is highly expressed in liver. Immunoblot analysis of HuH-7 human hepatoma cell fractions as well as immunofluorescent microscopy revealed that CK2alpha" was exclusively localized to the nucleus and preferentially associated with the nuclear matrix. CK2alpha and CK2alpha' were found in nuclear, membrane, and cytosolic compartments. Deletion of the carboxy-terminal 32 amino acids from the CK2alpha" sequence resulted in release of the truncated green fluorescent protein fusion protein from the nuclear matrix and redistribution to both the nucleus and the cytoplasm. Demonstration that the carboxy terminus is necessary but not sufficient for nuclear retention indicates that the underlying mechanism of CK2alpha" nuclear localization is dependent on the secondary structure of the holoenzyme directed by the carboxy-terminal sequence.
Collapse
Affiliation(s)
- Philip Hilgard
- Marion Bessin Liver Research Center, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | | | | | | |
Collapse
|
210
|
Betrán E, Wang W, Jin L, Long M. Evolution of the phosphoglycerate mutase processed gene in human and chimpanzee revealing the origin of a new primate gene. Mol Biol Evol 2002; 19:654-63. [PMID: 11961099 DOI: 10.1093/oxfordjournals.molbev.a004124] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Processed genes are created by retroposition from messenger RNA of expressed genes. The estimated amount of processed copies of genes in the human genome is 10,000-14,000. Some of these might be pseudogenes with the expected pattern for nonfunctional sequences, but some others might be an important source of new genes. We have studied the evolution of a Phosphoglycerate mutase processed gene (PGAM3) described in humans and believed to be a pseudogene. We sequenced PGAM3 in chimpanzee and macaque and obtained polymorphism data for human coding region. We found evidence that PGAM3 likely produces a functional protein, as an example of addressing functionality for human processed pseudogenes. First, the open reading frame was intact despite many deletions that occurred in the 3' untranslated region. Second, it appears that the gene is expressed. Finally, interspecies and intraspecies variation for PGAM3 was not consistent with the neutral model proposed for pseudogenes, suggesting that a new functional primate gene has originated. Amino acid divergence was significantly higher than synonymous divergence in PGAM3 lineage, supporting positive selection acting in this gene. This role of selection was further supported by the excess of rare alleles in a population genetic analysis. PGAM3 is located in a region of very low recombination; therefore, it is conceivable that the rapid fixation events in this newly arising gene may have contributed to a selective sweep of variation in the region.
Collapse
Affiliation(s)
- Esther Betrán
- Department of Ecology and Evolution, The University of Chicago, 1101 East 57th Street, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
211
|
Wang W, Brunet FG, Nevo E, Long M. Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster. Proc Natl Acad Sci U S A 2002; 99:4448-53. [PMID: 11904380 PMCID: PMC123668 DOI: 10.1073/pnas.072066399] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2-3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique "Evolution Canyon" in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments.
Collapse
Affiliation(s)
- Wen Wang
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
212
|
García-Maroto F, Garrido-Cárdenas JA, Rodríguez-Ruiz J, Vilches-Ferrón M, Adam AC, Polaina J, Alonso DL. Cloning and molecular characterization of the delta6-desaturase from two echium plant species: production of GLA by heterologous expression in yeast and tobacco. Lipids 2002; 37:417-26. [PMID: 12030323 DOI: 10.1007/s1145-002-0910-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
The synthesis of GLA (delta6,9,12-1-8:3) is carried out in a number of plant taxa by introducing a double bond at the delta6 position of its precursor, linoleic acid (delta9,12-18:2), through a reaction catalyzed by a delta6-desaturase enzyme. We have cloned genes encoding the delta6-desaturase (D6DES) from two different Macaronesian Echium species, E. pitardii and E. gentianoides (Boraginaceae), which are characterized by the accumulation of high amounts of GLA in their seeds. The Echium D6DES genes encode proteins of 438 amino acids bearing the prototypical cytochrome b(5) domain at the N-terminus. Cladistic analysis of desaturases from higher plants groups the Echium D6DES proteins together with other delta6-desaturases in a different cluster from that of the highly related delta8-desaturases. Expression analysis carried out in E. pitardii shows a positive correlation between the D6DES transcript level and GLA accumulation in different tissues of the plant. Although a ubiquitous expression in all organs is observed, the transcript is particularly abundant in developing fruits, whereas a much lower level is present in mature leaves. Functional characterization of the D6DES gene from E. gentianoides has been achieved by heterologous expression in tobacco plants and in the yeast Saccharomyces cerevisiae. In both cases, overexpression of the gene led to the synthesis of GLA. Biotechnological application of these results can be envisaged as an initial step toward the generation of transgenic oleaginous plants producing GLA.
Collapse
Affiliation(s)
- Federico García-Maroto
- Departamento de Bioquímica, Facultad de Ciencias Experimentales, Universidad de Almería, Spain
| | | | | | | | | | | | | |
Collapse
|
213
|
Abstract
This study examines the intragenomic spread of the human endogenous retrovirus family HERV-W from insertions present within the draft sequence of the human genome. Identification of shared diagnostic differences and phylogenetic analyses revealed the existence of three main subfamilies. The average divergence between sequences for each of the subfamilies suggests that most of the HERV-W elements were inserted within the genome during a short period of evolutionary time. Each one of the subfamilies consists of two types of insertions, the expected proviral sequences and other sequences resembling the structure of processed retrogenes. These HERV-W retrosequences extend from the R region of the 5' long-terminal repeat (LTR) to the R region of the 3' LTR (as viral genomic RNAs), end in poly(A) 3' tails, and are flanked by direct repeats longer than the proviral integrations. Furthermore, several of the HERV-W retrosequences are 5'-truncated at different sites. I suggest the involvement of the L1 machinery in these integrations and discuss the characteristic features of the evolutionary history of HERV-W, with emphasis on the putative impact of HERV-W retrosequence integrations on the mammalian genome.
Collapse
Affiliation(s)
- Javier Costas
- Departamento de Bioloxía Fundamental, Universidade de Santiago de Compostela, E-15782 Santiago de Compostela, Spain.
| |
Collapse
|
214
|
Thiselton DL, McDowall J, Brandau O, Ramser J, d'Esposito F, Bhattacharya SS, Ross MT, Hardcastle AJ, Meindl A. An integrated, functionally annotated gene map of the DXS8026-ELK1 interval on human Xp11.3-Xp11.23: potential hotspot for neurogenetic disorders. Genomics 2002; 79:560-72. [PMID: 11944989 DOI: 10.1006/geno.2002.6733] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Human chromosome Xp11.3-Xp11.23 encompasses the map location for a growing number of diseases with a genetic basis or genetic component. These include several eye disorders, syndromic and nonsyndromic forms of X-linked mental retardation (XLMR), X-linked neuromuscular diseases and susceptibility loci for schizophrenia, type 1 diabetes, and Graves' disease. We have constructed an approximately 2.7-Mb high-resolution physical map extending from DXS8026 to ELK1, corresponding to a genetic distance of approximately 5.5 cM. A combination of chromosome walking and sequence-tagged site (STS)-content mapping resulted in an integrated framework and transcript map, precisely positioning 10 polymorphic microsatellites (one of which is novel), 16 ESTs, and 12 known genes (RP2, PCTK1, UHX1, UBE1, RBM10, ZNF157, SYN1, ARAF1, TIMP1, PFC, ELK1, UXT). The composite map is currently anchored with 89 STSs to give an average resolution of approximately 1 STS every 30 kb. By a combination of EST database searches and in silico detection of UniGene clusters within genomic sequence generated from this template map, we have mapped several novel genes within this interval: a Na+/H+ exchanger (SLC9A7), at least two zincfinger transcription factors (KIAA0215 and Hs.68318), carbohydrate sulfotransferase-7 (CHST7), regucalcin (RGN), inactivation-escape-1 (INE1), the human ortholog of mouse neuronal protein 15.6, and four putative novel genes. Further genomic analysis enabled annotation of the sequence interval with 20 predicted pseudogenes and 21 UniGene clusters of unknown function. The combined PAC/BAC transcript map and YAC scaffold presented here clarifies previously conflicting data for markers and genes within the Xp11.3-Xp11.23 interval and provides a powerful integrated resource for functional characterization of this clonally unstable, yet gene-rich and clinically significant region of proximal Xp.
Collapse
Affiliation(s)
- Dawn L Thiselton
- Department of Molecular Genetics, Institute of Ophthalmology, University College, London, EC1V 9EL, UK.
| | | | | | | | | | | | | | | | | |
Collapse
|
215
|
Pavlícek A, Paces J, Elleder D, Hejnar J. Processed pseudogenes of human endogenous retroviruses generated by LINEs: their integration, stability, and distribution. Genome Res 2002; 12:391-9. [PMID: 11875026 PMCID: PMC155283 DOI: 10.1101/gr.216902] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
We report here the presence of numerous processed pseudogenes derived from the W family of endogenous retroviruses in the human genome. These pseudogenes are structurally colinear with the retroviral mRNA followed by a poly(A) tail. Our analysis of insertion sites of HERV-W processed pseudogenes shows a strong preference for the insertion motif of long interspersed nuclear element (LINE) retrotransposons. The genomic distribution, stability during evolution, and frequent truncations at the 5' end resemble those of the pseudogenes generated by LINEs. We therefore suggest that HERV-W processed pseudogenes arose by multiple and independent LINE-mediated retrotransposition of retroviral mRNA. These data document that the majority of HERV-W copies are actually nontranscribed promoterless pseudogenes. The current search for HERV-Ws associated with several human diseases should concentrate on a small subset of transcriptionally competent elements.
Collapse
Affiliation(s)
- Adam Pavlícek
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Prague 6, CZ-16637, Czech Republic
| | | | | | | |
Collapse
|
216
|
Li J, Rehli M, Timblin B, Tan F, Krause SW, Skidgel RA. Structure of the human carboxypeptidase M gene. Identification of a proximal GC-rich promoter and a unique distal promoter that consists of repetitive elements. Gene 2002; 284:189-202. [PMID: 11891060 DOI: 10.1016/s0378-1119(01)00898-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The human carboxypeptidase M (CPM) gene was found to encompass about 112.6 kb of genomic sequence, containing 11 exons of which eight (exons 2-9) are common to all transcripts and contain the entire coding region. We have cloned several alternative variants of CPM transcripts that result from differential promoter usage and alternative splicing. Although CPM belongs to the same metallocarboxypeptidase subfamily as CPE, their intron/exon structures differ significantly. Multiple transcription start sites were found in the CPM gene that cluster in two regions separated by about 30 kb and are flanked by two unique functional promoters. One ('proximal') is immediately upstream of the coding region and contains GC-rich sequences and a typical TATA box whereas the other ('distal') consists almost entirely of repetitive elements. Luciferase reporter assays with constructs of the promoter regions showed they were both quite active in several cell lines. However, the proximal promoter was much stronger than the distal one in two of the human cell lines tested (HepG2 and HEK293) whereas both promoters were highly and equally active in the human monocytic cell line THP-1, which has high constitutive expression of CPM.
Collapse
Affiliation(s)
- Jingqiu Li
- Department of Pharmacology, University of Illinois College of Medicine, 835 S. Wolcott, Chicago, IL 60612, USA
| | | | | | | | | | | |
Collapse
|
217
|
Ogiwara I, Miya M, Ohshima K, Okada N. V-SINEs: a new superfamily of vertebrate SINEs that are widespread in vertebrate genomes and retain a strongly conserved segment within each repetitive unit. Genome Res 2002; 12:316-24. [PMID: 11827951 PMCID: PMC155270 DOI: 10.1101/gr.212302] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have identified a new superfamily of vertebrate short interspersed repetitive elements (SINEs), designated V-SINEs, that are widespread in fishes and frogs. Each V-SINE includes a central conserved domain preceded by a 5'-end tRNA-related region and followed by a potentially recombinogenic (TG)(n) tract, with a 3' tail derived from the 3' untranslated region (UTR) of the corresponding partner long interspersed repetitive element (LINE) that encodes a functional reverse transcriptase. The central domain is strongly conserved and is even found in SINEs in the lamprey genome, suggesting that V-SINEs might be approximately 550 Myr old or older in view of the timing of divergence of the lamprey lineage from the bony fish lineage. The central conserved domain might have been subject to some form of positive selection. Although the contemporary 3' tails of V-SINEs differ from one another, it is possible that the original 3' tail might have been replaced, via recombination, by the 3' tails of more active partner LINEs, thereby retaining retropositional activity and the ability to survive for long periods on the evolutionary time scale. It seems plausible that V-SINEs may have some function(s) that have been maintained by the coevolution of SINEs and LINEs during the evolution of vertebrates.
Collapse
Affiliation(s)
- Ikuo Ogiwara
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama 226-8501, Japan
| | | | | | | |
Collapse
|
218
|
Abstract
The human endogenous retroviruses database (HERVd) is maintained at the Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, and is accessible via the World Wide Web at http://herv.img.cas.cz. The HERVd provides complex information on and analysis of retroviral elements found in the human genome. It can be used for searches of individual HERV families, identification of HERV parts, graphical output of HERV structures, comparison of HERVs and identification of retrovirus integration sites.
Collapse
Affiliation(s)
- Jan Paces
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo 2, CZ-16637 Prague, Czech Republic
| | | | | |
Collapse
|
219
|
Ganko EW, Fielman KT, McDonald JF. Evolutionary history of Cer elements and their impact on the C. elegans genome. Genome Res 2001; 11:2066-74. [PMID: 11731497 PMCID: PMC311226 DOI: 10.1101/gr.196201] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2001] [Accepted: 10/10/2001] [Indexed: 11/25/2022]
Abstract
We report the results of sequence analysis and chromosomal distribution of all distinguishable long terminal repeat (LTR) retrotransposons (Cer elements) in the Caenorhabditis elegans genome. Included in this analysis are all readily recognizable full-length and fragmented elements, as well as solo LTRs. Our results indicate that there are 19 families of Cer elements, some of which display significant subfamily structure. Cer elements can be clustered based on their tRNA primer binding sites (PBSs). These clusters are in concordance with our reverse transcriptase- and LTR-based phylogenies. Although we find that most Cer elements are located in the gene depauperate chromosome ends, some elements are located in or near putative genes and may contribute to gene structure and function. The results of RT-PCR analyses are consistent with this prediction.
Collapse
Affiliation(s)
- E W Ganko
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | | | | |
Collapse
|
220
|
Abstract
Much progress in understanding the evolution of new genes has been accomplished in the past few years. Molecular mechanisms such as illegitimate recombination and LINE element mediated 3' transduction underlying exon shuffling, a major process for generating new genes, are better understood. The identification of young genes in invertebrates and vertebrates has revealed a significant role of adaptive evolution acting on initially rudimentary gene structures created as if by evolutionary tinkers. New genes in humans and our primate relatives add a new component to the understanding of genetic divergence between humans and non-humans.
Collapse
Affiliation(s)
- M Long
- Department of Ecology and Evolution, The University of Chicago, 1101 East 57th Street, Chicago Illinois 60637, USA.
| |
Collapse
|
221
|
Miranda-Vizuete A, Ljung J, Damdimopoulos AE, Gustafsson JA, Oko R, Pelto-Huikko M, Spyrou G. Characterization of Sptrx, a novel member of the thioredoxin family specifically expressed in human spermatozoa. J Biol Chem 2001; 276:31567-74. [PMID: 11399755 DOI: 10.1074/jbc.m101760200] [Citation(s) in RCA: 113] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Thioredoxins (Trx) are small ubiquitous proteins that participate in different cellular processes via redox-mediated reactions. We report here the identification and characterization of a novel member of the thioredoxin family in humans, named Sptrx (sperm-specific trx), the first with a tissue-specific distribution, located exclusively in spermatozoa. Sptrx open reading frame encodes for a protein of 486 amino acids composed of two clear domains: an N-terminal domain consisting of 23 highly conserved repetitions of a 15-residue motif and a C-terminal domain typical of thioredoxins. Northern analysis and in situ hybridization shows that Sptrx mRNA is only expressed in human testis, specifically in round and elongating spermatids. Immunostaining of human testis sections identified Sptrx protein in spermatids, while immunofluorescence and immunogold electron microscopy analysis demonstrated Sptrx localization in the cytoplasmic droplet of ejaculated sperm. Sptrx appears to have a multimeric structure in native conditions and is able to reduce insulin disulfide bonds in the presence of NADPH and thioredoxin reductase. During mammalian spermiogenesis in testis seminiferous tubules and later maturation in epididymis, extensive reorganization of disulfide bonds is required to stabilize cytoskeletal sperm structures. However, the molecular mechanisms that control these processes are not known. The identification of Sptrx with an expression pattern restricted to the postmeiotic phase of spermatogenesis, when the sperm tail is organized, suggests that Sptrx might be an important factor in regulating critical steps of human spermiogenesis.
Collapse
Affiliation(s)
- A Miranda-Vizuete
- Department of Biosciences at Novum, Center for Biotechnology, Karolinska Institute, S-141 57 Huddinge, Sweden.
| | | | | | | | | | | | | |
Collapse
|
222
|
Brosius J, Tiedge H. Neuronal BC1 RNA: intracellular transport and activity-dependent modulation. Results Probl Cell Differ 2001; 34:129-38. [PMID: 11288672 DOI: 10.1007/978-3-540-40025-7_8] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Affiliation(s)
- J Brosius
- Institute of Experimental Pathology/Molecular Neurobiology, Center for Molecular Biology of Inflammation, University of Münster, 48149 Münster, Germany
| | | |
Collapse
|
223
|
Gardiol A, Racca C, Triller A. RNA transport and local protein synthesis in the dendritic compartment. Results Probl Cell Differ 2001; 34:105-28. [PMID: 11288671 DOI: 10.1007/978-3-540-40025-7_7] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- A Gardiol
- Laboratoire de Biologie Cellulaire de la Synapse N&P INSERM U497 Ecole Normale Supérieure, 46 rue d'Ulm, 75005 Paris, France
| | | | | |
Collapse
|
224
|
Ono T, Kondoh Y, Kagiyama N, Sonta S, Yoshida MC. Genomic organization and chromosomal distribution of rat ID elements. Genes Genet Syst 2001; 76:213-20. [PMID: 11732629 DOI: 10.1266/ggs.76.213] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Identifier (ID) elements are members of a family of short interspersed nuclear elements (SINEs) in rodents. We investigated the genomic organization and chromosomal distribution of the ID elements in the rat, mouse and Chinese hamster. Southern blot hybridization analysis revealed that the ID elements are widespread in the rat genome, but concentrated in the mouse and Chinese hamster genomes, and that the copy of ID elements in the rat is about 5 times and 50 times that in the mouse and Chinese hamster, respectively. FISH analysis showed that the ID elements are predominantly distributed in the R-band regions of rat chromosomes. In mouse and Chinese hamster chromosomes, no specific distribution pattern of the ID elements was found. Furthermore, we found a distinct group of derivative ID elements in the rat, which contain partially repeated ID core domains, by PCR amplification using an ID core sequence. Such derivatives were not found in either the mouse or Chinese hamster. These findings suggest that explosive amplification of the ID elements in the rat has been accompanied by the occurrence of derivative ID elements and a predominant localization to the R-band regions. Similar associations found in the Alu family, one of the human SINEs, allow us to speculate that the rat ID elements and the human Alu family have analogous functions in chromosomal organization.
Collapse
Affiliation(s)
- T Ono
- Division of Bioscience, Graduate School of Environmental Earth Sciences, Hokkaido University, Sapporo, Japan.
| | | | | | | | | |
Collapse
|
225
|
Landry JR, Medstrand P, Mager DL. Repetitive elements in the 5' untranslated region of a human zinc-finger gene modulate transcription and translation efficiency. Genomics 2001; 76:110-6. [PMID: 11549323 DOI: 10.1006/geno.2001.6604] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A substantial proportion of the human genome consists of repetitive sequences. Although most of these sequences are nonessential for the organism, retroelements, such as Alu sequences, L1s, and HERVs (human endogenous retroviruses), have recently been implicated in the regulation of various genes. Our laboratory previously identified a novel, alternatively spliced zinc-finger gene, ZNF177, which incorporates Alu L1, and HERV segments into the 5' untranslated region (UTR) of transcripts. In this study, we investigated the genomic structure and functional significance of the repetitive sequences in the 5' UTR of ZNF177 mRNAs. Using luciferase and GFP reporter constructs, we assessed the effect of the HERV, Alu, and L1 sequences on gene expression levels. Our results indicate that the presence of the retroelement sequences, particularly the Alu and L1 segments which form one 5' UTR exon, modifies the expression level of both reporter genes. We present evidence that the Alu and L1 sequences alter both the RNA and protein levels of reporter genes by increasing transcription efficiency while decreasing translation efficiency. Our findings indicate that the Alu and L1 repeats in the 5' UTR of ZNF177 exert a positive transcriptional enhancer effect, but repress translation of the zinc finger gene. In addition, our analysis of a 5' UTR database suggests that 4% of human 5' UTRs harbor Alu sequences, indicating that the expression of many genes might be influenced by Alu repeats. These results illustrate the complex regulatory effects that retroelements can have on human gene expression.
Collapse
Affiliation(s)
- J R Landry
- Terry Fox Laboratory, British Columbia Cancer Agency and Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | |
Collapse
|
226
|
Kurdyukov SG, Lebedev YB, Artamonova II, Gorodentseva TN, Batrak AV, Mamedov IZ, Azhikina TL, Legchilina SP, Efimenko IG, Gardiner K, Sverdlov ED. Full-sized HERV-K (HML-2) human endogenous retroviral LTR sequences on human chromosome 21: map locations and evolutionary history. Gene 2001; 273:51-61. [PMID: 11483360 DOI: 10.1016/s0378-1119(01)00570-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
One of the evolutionary mechanisms for acquisition of novel functional sequences can be domestication of exogenous retroviruses that have been integrated into the germ line. The whole genome mapping of such elements in various species could reveal differences in positions of the retroviral integration and suggest possible roles of these differences in speciation. Here, we describe the number, locations and sequence features of the human endogenous retrovirus HERV-K (HML-2) long terminal repeat (LTR) sequences on human chromosome 21. We show that their distribution along the chromosome is not only non-random but also roughly correlated with the gene density. Amplification of orthologous LTR sites from a number of primate genomes produced patterns of presence and absence for each LTR sequence and allowed determination of the phylogenetic ages and evolutionary order of appearance of individual LTRs. The identity level and phylogenetic age of the LTRs did not correlate with their map locations. Thus, despite the non-random distribution of LTRs, they have apparently been inserted randomly into the chromosome relative to each other. As evidenced in previous studies of chromosomes 19 and 22, this is a characteristic of HERV-K integration.
Collapse
Affiliation(s)
- S G Kurdyukov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Science, 16/10 Miklukho-Maklaya, Moscow, 117871, Russia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
227
|
Kuryshev VY, Skryabin BV, Kremerskothen J, Jurka J, Brosius J. Birth of a gene: locus of neuronal BC200 snmRNA in three prosimians and human BC200 pseudogenes as archives of change in the Anthropoidea lineage. J Mol Biol 2001; 309:1049-66. [PMID: 11399078 DOI: 10.1006/jmbi.2001.4725] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The gene encoding brain-specific dendritic BC200 small non-messenger RNA is limited to the primate order and arose from a monomeric Alu element. It is present and neuronally expressed in all Anthropoidea examined. By comparing the human sequence of about 13.2 kb with each of the prosimian (lemur 14.6 kb, galago 12 kb, and tarsier 13.8 kb) orthologous loci, we could establish that the BC200 RNA gene is absent from the prosimian lineages. In Strepsirhini (lemurs and lorises), a dimeric AluJ-like element integrated very close to the BC200 insertion point, while the corresponding tarsier region is devoid of any repetitive element. Consequently, insertion of the Alu monomer that gave rise to the BC200 RNA gene must have occurred after the anthropoid lineage diverged from the prosimian lineage(s). Shared insertions of other repetitive elements favor proximity of simians and tarsiers in support of their grouping into Haplorhini and the omomyid hypothesis. On the other hand, the nucleotide sequences in the segment that is available for comparison in all four species reveal less exchanges between Strepsirhini (lemur and galago) and human than between tarsier and human. Our data imply that the early activity of dimeric Alu sequences must have been concurrent with the activity of monomeric Alu elements that persisted longer than is usually thought. As BC200 RNA gave rise to more than 200 pseudogenes, we used their consensus sequence variations as a molecular archive recording the BC200 RNA sequence changes in the anthropoid lineage leading to Homo sapiens and timed these alterations over the past 35-55 million years.
Collapse
Affiliation(s)
- V Y Kuryshev
- Institute of Experimental Pathology/Molecular Neurobiology, ZMBE, University of Münster, Von-Esmarch-Str. 56, Münster, D-48149, Germany.
| | | | | | | | | |
Collapse
|
228
|
Bogdanova N, Markoff A, Gerke V, McCluskey M, Horst J, Dworniczak B. Homologues to the first gene for autosomal dominant polycystic kidney disease are pseudogenes. Genomics 2001; 74:333-41. [PMID: 11414761 DOI: 10.1006/geno.2001.6568] [Citation(s) in RCA: 68] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
PKD1 is the first gene identified to be causative for the condition of autosomal dominant polycystic kidney disease. There are several genes homologous to PKD1 that are located proximal to the master gene on the same chromosome. Two of these genes have been recently covered in a large sequencing work on chromosome 16, and their structure has been broadly analyzed. However, the major question whether homologous genes (HG) code for functionally active polypeptides has not been resolved so far. The current study identifies and partially characterizes four more homologues of PKD1, different from the previously published sequence, two of which were found by screening of a BAC library and the other two contained in available databases. Analysis of HG transcripts shows that they are not translated in the model cell line T98G. Taken together, these findings suggest that homologues to PKD1 form a family of pseudogenes.
Collapse
Affiliation(s)
- N Bogdanova
- Institut für Humangenetik, Westfälische Wilchelms-Universität Münster, Münster, D-48149, Germany
| | | | | | | | | | | |
Collapse
|
229
|
Collins C, Volik S, Kowbel D, Ginzinger D, Ylstra B, Cloutier T, Hawkins T, Predki P, Martin C, Wernick M, Kuo WL, Alberts A, Gray JW. Comprehensive Genome Sequence Analysis of a Breast Cancer Amplicon. Genome Res 2001. [DOI: 10.1101/gr.174301] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Gene amplification occurs in most solid tumors and is associated with poor prognosis. Amplification of 20q13.2 is common to several tumor types including breast cancer. The 1 Mb of sequence spanning the 20q13.2 breast cancer amplicon is one of the most exhaustively studied segments of the human genome. These studies have included amplicon mapping by comparative genomic hybridization (CGH), fluorescent in-situ hybridization (FISH), array-CGH, quantitative microsatellite analysis (QUMA), and functional genomic studies. Together these studies revealed a complex amplicon structure suggesting the presence of at least two driver genes in some tumors. One of these, ZNF217, is capable of immortalizing human mammary epithelial cells (HMEC) when overexpressed. In addition, we now report the sequencing of this region in human and mouse, and on quantitative expression studies in tumors. Amplicon localization now is straightforward and the availability of human and mouse genomic sequence facilitates their functional analysis. However, comprehensive annotation of megabase-scale regions requires integration of vast amounts of information. We present a system for integrative analysis and demonstrate its utility on 1.2 Mb of sequence spanning the 20q13.2 breast cancer amplicon and 865 kb of syntenic murine sequence. We integrate tumor genome copy number measurements with exhaustive genome landscape mapping, showing that amplicon boundaries are associated with maxima in repetitive element density and a region of evolutionary instability. This integration of comprehensive sequence annotation, quantitative expression analysis, and tumor amplicon boundaries provide evidence for an additional driver gene prefoldin 4 (PFDN4), coregulated genes, conserved noncoding regions, and associate repetitive elements with regions of genomic instability at this locus.
Collapse
|
230
|
Collins C, Volik S, Kowbel D, Ginzinger D, Ylstra B, Cloutier T, Hawkins T, Predki P, Martin C, Wernick M, Kuo WL, Alberts A, Gray JW. Comprehensive genome sequence analysis of a breast cancer amplicon. Genome Res 2001; 11:1034-42. [PMID: 11381030 PMCID: PMC311107 DOI: 10.1101/gr.gr1743r] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Gene amplification occurs in most solid tumors and is associated with poor prognosis. Amplification of 20q13.2 is common to several tumor types including breast cancer. The 1 Mb of sequence spanning the 20q13.2 breast cancer amplicon is one of the most exhaustively studied segments of the human genome. These studies have included amplicon mapping by comparative genomic hybridization (CGH), fluorescent in-situ hybridization (FISH), array-CGH, quantitative microsatellite analysis (QUMA), and functional genomic studies. Together these studies revealed a complex amplicon structure suggesting the presence of at least two driver genes in some tumors. One of these, ZNF217, is capable of immortalizing human mammary epithelial cells (HMEC) when overexpressed. In addition, we now report the sequencing of this region in human and mouse, and on quantitative expression studies in tumors. Amplicon localization now is straightforward and the availability of human and mouse genomic sequence facilitates their functional analysis. However, comprehensive annotation of megabase-scale regions requires integration of vast amounts of information. We present a system for integrative analysis and demonstrate its utility on 1.2 Mb of sequence spanning the 20q13.2 breast cancer amplicon and 865 kb of syntenic murine sequence. We integrate tumor genome copy number measurements with exhaustive genome landscape mapping, showing that amplicon boundaries are associated with maxima in repetitive element density and a region of evolutionary instability. This integration of comprehensive sequence annotation, quantitative expression analysis, and tumor amplicon boundaries provide evidence for an additional driver gene prefoldin 4 (PFDN4), coregulated genes, conserved noncoding regions, and associate repetitive elements with regions of genomic instability at this locus.
Collapse
Affiliation(s)
- C Collins
- University of California San Francisco Cancer Center, San Francisco, California 94143-0808, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
231
|
Abstract
How genes with newly characterized functions originate remains a fundamental question. PMCHL1 and PMCHL2, two chimeric genes derived from the melanin-concentrating hormone (MCH) gene, offer an opportunity to examine such an issue in the human lineage. Detailed structural, expression, and phylogenetic analysis showed that the PMCHL1 gene was created near 25 million years ago (Ma) by a complex mechanism of exon shuffling through retrotransposition of an antisense MCH messenger RNA coupled to de novo creation of splice sites. PMCHL2 arose 5 to 10 Ma by an event of duplication involving a large chromosomal region encompassing the PMCHL1 locus. The RNA expression patterns of those chimeric genes suggest that they have been submitted to strong regulatory constraints during primate evolution.
Collapse
Affiliation(s)
- A Courseaux
- Institut de Pharmacologie Moléculaire et Cellulaire, UMR CNRS 6097, 660 route des Lucioles Sophia Antipolis 06560 Valbonne, France
| | | |
Collapse
|
232
|
Medstrand P, Landry JR, Mager DL. Long terminal repeats are used as alternative promoters for the endothelin B receptor and apolipoprotein C-I genes in humans. J Biol Chem 2001; 276:1896-903. [PMID: 11054415 DOI: 10.1074/jbc.m006557200] [Citation(s) in RCA: 151] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
To examine the potential regulatory involvement of retroelements in the human genome, we screened the transcribed sequences of GenBank and expressed sequence tag data bases with long terminal repeat (LTR) elements derived from different human endogenous retroviruses. These screenings detected human transcripts containing LTRs belonging to the human endogenous retrovirus-E family fused to the apolipoprotein CI (apoC-I) and the endothelin B receptor (EBR) genes. However, both genes are known to have non-LTR (native) promoters. Initial reverse transcription-polymerase chain reaction experiments confirmed and authenticated the presence of transcripts from both the native and LTR promoters. Using a 5'-rapid amplification of cDNA ends protocol, we showed that the alternative transcripts of apoC-I and EBR are initiated and promoted by the LTRs. The LTR-apoC-I fusion and native apoC-I transcripts are present in many of the tissues tested. As expected, we found apoC-I preferentially expressed in liver, where about 15% of the transcripts are derived from the LTR promoter. Transient transfections suggest that the expression is not dependent on the LTR itself, but the presence of the LTR increases activity of the apoC-I promoter from both humans and baboons. The native EBR-driven transcripts were also detected in many tissues, whereas the LTR-driven transcripts appear limited to placenta. In contrast to the LTR of apoC-I, the EBR LTR promotes a significant proportion of the total EBR transcripts, and transient transfection results indicate that the LTR acts as a strong promoter and enhancer in a placental cell line. This investigation reports two examples where LTR sequences contribute to increased transcription of human genes and illustrates the impact of mobile elements on gene and genome evolution.
Collapse
Affiliation(s)
- P Medstrand
- Terry Fox Laboratory, British Columbia Cancer Agency and Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, V5Z 1L3, Canada
| | | | | |
Collapse
|
233
|
Shi X, Potvin B, Huang T, Hilgard P, Spray DC, Suadicani SO, Wolkoff AW, Stanley P, Stockert RJ. A novel casein kinase 2 alpha-subunit regulates membrane protein traffic in the human hepatoma cell line HuH-7. J Biol Chem 2001; 276:2075-82. [PMID: 11038365 DOI: 10.1074/jbc.m008583200] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
A previously isolated endocytic trafficking mutant (TRF1) isolated from HuH-7 cells is defective in the distribution of subpopulations of cell-surface receptors for asialoorosomucoid (asialoglycoprotein receptor (ASGR)), transferrin, and mannose-terminating glycoproteins. The pleiotropic phenotype of TRF1 also includes an increased sensitivity to Pseudomonas toxin and deficient assembly and function of gap junctions. HuH-7xTRF1 hybrids exhibited a normal subcellular distribution of ASGR, consistent with the TRF1 mutation being recessive. A cDNA expression library derived from HuH-7 mRNA was transfected into TRF1 cells, which were subsequently selected for resistance to Pseudomonas toxin. Sequence analysis of a recovered cDNA revealed a unique isoform of casein kinase 2 (CK2), CK2alpha". Western blot analysis of TRF1 proteins revealed a 60% reduction in total CK2alpha expression. Consistent with this finding, the hybrids HuH-7xHuH-7 and HuH-7xTRF1 expressed equivalent amounts of total CK2alpha. Immunoblots using antibodies against peptides unique to the previously described CK2 isoforms CK2alpha and CK2alpha' and the novel CK2alpha" isoform showed that, although TRF1 and parental HuH-7 cells expressed comparable amounts of CK2alpha and CK2alpha', the mutant did not express CK2alpha". Based on the genomic DNA sequence, RNA transcripts encoding CK2alpha" apparently originate from alternative splicing of a primary transcript. Protein overexpression following transfection of TRF1 cells with cDNAs encoding either CK2alpha or the newly cloned CK2alpha" restored the parental HuH-7 phenotype, including Pseudomonas toxin resistance, cell-surface ASGR binding activity, phosphorylation, and the assembly of gap junctions. This study suggests that HuH-7 cells express at least three CK2alpha isoforms and that the pleiotropic TRF1 phenotype is a consequence of a reduction in total CK2 expression.
Collapse
Affiliation(s)
- X Shi
- Marion Bessin Liver Research Center, Albert Einstein College of Medicine, Bronx, New York 10461, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
234
|
|
235
|
Feschotte C, Fourrier N, Desmons I, Mouchès C. Birth of a retroposon: the Twin SINE family from the vector mosquito Culex pipiens may have originated from a dimeric tRNA precursor. Mol Biol Evol 2001; 18:74-84. [PMID: 11141194 DOI: 10.1093/oxfordjournals.molbev.a003721] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SINEs are short interspersed repetitive elements found in many eukaryotic genomes and are believed to propagate by retroposition. Almost all SINEs reported to date have a composite structure made of a 5' tRNA-related region followed by a tRNA-unrelated region. Here, we describe a new type of tRNA-derived SINEs from the genome of the mosquito Culex pipiens. These elements, called TWINs, are approximately 220 bp long and reiterated at approximately 500 copies per haploid genome. TWINs have a unique structure compared with other tRNA-SINEs described so far. They consist of two tRNA(Arg)-related regions separated by a 39-bp spacer. Other tRNA-unrelated sequences include a 5-bp leader preceding the left tRNA-like unit and a short trailer located downstream of the right tRNA-like region. This 3' trailer is a 10-bp sequence that is ended by a TTTT motif and followed by a polyA tract of variable length. The right tRNA-like unit also contains a 16-bp sequence which is absent in the left one and appears to be located in the ancestral anticodon stem precisely at a position expected for a nuclear tRNA intron. According to this singular structure, we hypothesize that the TWIN: SINE family originated from an unprocessed polymerase III transcript containing two tRNA sequences. We suggest that some peculiar properties acquired by this dicistronic transcript, such as a polyA tail and a 3' stem-loop secondary structure, promote its retroposition by increasing its chances of being recognized by a reverse transcriptase encoded elsewhere in the C. pipiens genome.
Collapse
Affiliation(s)
- C Feschotte
- Laboratoire Ecologie Moléculaire et Faculté Sciences et Techniques Côte-Basque, Université de Pau et des Pays de l'Adour, Pau, France
| | | | | | | |
Collapse
|
236
|
Abstract
The nature of the role played by mobile elements in host genome evolution is reassessed considering numerous recent developments in many areas of biology. It is argued that easy popular appellations such as "selfish DNA" and "junk DNA" may be either inaccurate or misleading and that a more enlightened view of the transposable element-host relationship encompasses a continuum from extreme parasitism to mutualism. Transposable elements are potent, broad spectrum, endogenous mutators that are subject to the influence of chance as well as selection at several levels of biological organization. Of particular interest are transposable element traits that early evolve neutrally at the host level but at a later stage of evolution are co-opted for new host functions.
Collapse
Affiliation(s)
- M G Kidwell
- Department of Ecology and Evolutionary Biology, The University of Arizona, Tucson 85721, USA.
| | | |
Collapse
|
237
|
Abstract
Using a bacterial artificial chromosome (BAC) clone, we have mapped the human cytochrome P450 3A (CYP3A) locus containing the genes encoding for CYP3A4, CYP3A5 and CYP3A7. The genes lie in a head-to-tail orientation in the order of 3A4, 3A7 and 3A5. In both intergenic regions (3A4-3A7 and 3A7-3A5), we have detected several additional cytochrome P450 3A exons, forming two CYP3A pseudogenes. These pseudogenes have the same orientation as the CYP3A genes. To our surprise, a 3A7 mRNA species has been detected in which the exons 2 and 13 of one of the pseudogenes (the one that is downstream of 3A7) are spliced after the 3A7 terminal exon. This results in an mRNA molecule that consists of the 13 3A7 exons and two additional exons at the 3' end. The additional two exons originating from the pseudogene are in an altered reading frame and consequently have the capability to code a completely different amino acid sequence than the canonical CYP3A exons 2 and 13. These findings may represent a generalized evolutionary process with genes having the potential to capture neighboring sequences and use them as functional exons.
Collapse
Affiliation(s)
- C Finta
- Center for Nutrition and Toxicology, Department of Biosciences at NOVUM, Karolinska Institute, SE-14157, Huddinge, Sweden
| | | |
Collapse
|
238
|
Abstract
Interspersed repetitive sequences are major components of eukaryotic genomes. Repetitive elements comprise over 50% of the mammalian genome. Because the specific function of these elements remains to be defined and because of their unusual 'behaviour' in the genome, they are often quoted as a selfish or junk DNA. Our view of the entire phenomenon of repetitive elements has to now be revised in the light of data on their biology and evolution, especially in the light of what we know about the retroposons. I would like to argue that even if we cannot define the specific function of these elements, we still can show that they are not useless pieces of the genomes. The repetitive elements interact with the surrounding sequences and nearby genes. They may serve as recombination hot spots or acquire specific cellular functions such as RNA transcription control or even become part of protein coding regions. Finally, they provide very efficient mechanism for genomic shuffling. As such, repetitive elements should be called genomic scrap yard rather than junk DNA. Tables listing examples of recruited (exapted) transposable elements are available at http://www.ncbi.nlm.gov/Makalowski/ScrapYard/
Collapse
Affiliation(s)
- W Makałowski
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|
239
|
Abstract
Telomeres, the eukaryotic chromosome termini, are deoxyribonucleoprotein structures that distinguish natural chromosome ends from broken DNA. In most organisms, telomeres are extended by a reverse transcriptase (RT) with an integrated RNA template, telomerase; in Drosophila melanogaster, however, telomere-specific retrotransposons, HeT-A and TART, transpose specifically to chromosome ends. Whether telomeres are extended by a telomerase or by retrotransposons, an RT is a key component. RT has been studied extensively, both for its important role in converting RNA genomes to DNA, which has great evolutionary impact, and as a therapeutic target in human retroviral diseases. Here we discuss a few important aspects of RT usage during retrotransposition and telomere elongation.Key words: telomeres, telomerase, retrotransposons, reverse transcriptase.
Collapse
|
240
|
Whitford KL, Oakes JA, Scholnick J, Saha MS. Tissue-specific developmental expression of OAX, a Xenopus repetitive element. Mech Dev 2000; 94:209-12. [PMID: 10842074 DOI: 10.1016/s0925-4773(00)00307-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Approximately 1% of the Xenopus laevis genome consists of highly repetitive DNA known alternatively as OAX (for Oocyte Activation in Xenopus), Satellite I, or Repetitive HindIII Monomer 2. Present as tandemly repeated units of approximately 750 base pairs, OAX encodes a family of small RNA species transcribed by RNA polymerase III. Although the subject of many of the classic studies on early embryonic gene regulation, reports on OAX expression remain contradictory and incomplete. Using whole-mount in situ hybridization and RNase protection assays, we have therefore examined in detail the expression pattern of OAX in Xenopus embryos of various stages. OAX is initially expressed during gastrula stages; by tailbud stages embryos display discrete zones of expression at the dorsal boundary of the cement gland, in the developing somites and differentiating skeletal muscle, as well as in the dorsal aspect of the neural tube. These data demonstrate that OAX is expressed in a dynamic pattern under tight spatial and temporal regulation.
Collapse
Affiliation(s)
- K L Whitford
- Department of Biology, College of William and Mary, Williamsburg, VA 23187-8795, USA
| | | | | | | |
Collapse
|