51
|
Bork P. Hundreds of ankyrin-like repeats in functionally diverse proteins: mobile modules that cross phyla horizontally? Proteins 1993; 17:363-74. [PMID: 8108379 DOI: 10.1002/prot.340170405] [Citation(s) in RCA: 411] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Based on pattern searches and systematic database screening, almost 650 different ankyrin-like (ANK) repeats from nearly all phyla have been identified; more than 150 of them are reported here for the first time. Their presence in functionally diverse proteins such as enzymes, toxins, and transcription factors strongly suggests domain shuffling, but their occurrence in prokaryotes and yeast excludes exon shuffling. The spreading mechanism remains unknown, but in at least three cases horizontal gene transfer appears to be involved. ANK repeats occur in at least four consecutive copies. The terminal repeats are more variable in sequence. One feature of the internal repeats is a predicted central hydrophobic alpha-helix, which is likely to interact with other repeats. The functions of the ankyrin-like repeats are compatible with a role in protein-protein interactions.
Collapse
Affiliation(s)
- P Bork
- Max-Delbrück-Centre of Molecular Medicine, Berlin, Germany
| |
Collapse
|
52
|
|
53
|
Retief JD, Dixon GH. Evolution of pro-protamine P2 genes in primates. EUROPEAN JOURNAL OF BIOCHEMISTRY 1993; 214:609-15. [PMID: 8513810 DOI: 10.1111/j.1432-1033.1993.tb17960.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Protamines P1 and P2 form a family of small basic peptides that represent the major sperm proteins in placental mammals. In human and mouse protamine P2 is one of the most abundant sperm proteins. The protamine P2 gene codes for a P2 precursor, pro-P2 which is later processed by proteolytic cleavages in its N-terminal region to form the mature P2 protamines. We have used polymerase chain amplification to directly sequence the pro-P2 genes of the five major primate families: red howler (Alouatta seniculus) is a New World monkey (Cebidae); the two macaque species, Macaca mulatta and M. nemistrina are Old World monkeys (Cercopithecidae), the gibbon, Hylobates lar, represents one branch of the apes (Hylobatidae); the orangutan, Pongo pygmaeus, gorilla, Gorilla gorilla and two species of chimpanzee Pan paniscus and Pan troglodytes represent a second ape family (Pongidae). These pro-P2 genes are compared with that of human [Domenjoud, L., Nussbaum, G., Adham, I. M., Greeske, G. & Engel, W. (1990) Genomics 8, 127-133]. The overall size and organization of the genes are conserved within the group. The mean length of pro-P2 is 101 residues, with an increase to 102 in M. nemistrina and a decrease to 99 residues in red howler (A. seniculus). In gorilla and red howler one of two 79-bp tandem repeats that occurs 3' of the gene is deleted. Of the 101 deduced amino acids examined, an amino acid change occurs in one or more primates at 45 positions. Considering only the most recently diverged group, the human/gorilla/chimpanzee clade, this represents a very high mutation rate of 0.99 changes/100 sites in 10(6) years. This rapid mutation rate is characteristic of both members of the protamine gene family, P1 and P2. Consideration of the variable nature of the sequences at the multiple sites of proteolysis during the processing of the pro-P2 indicates either that there are several processing enzymes of differing specificities, or more likely that the folded structure of the pro-P2 limits accessibility of a non-specific protease to certain exposed sites.
Collapse
Affiliation(s)
- J D Retief
- Department of Medical Biochemistry, Faculty of Medicine, University of Calgary, Alberta, Canada
| | | |
Collapse
|
54
|
Brugidou C, Marty I, Chartier Y, Meyer Y. The Nicotiana tabacum genome encodes two cytoplasmic thioredoxin genes which are differently expressed. MOLECULAR & GENERAL GENETICS : MGG 1993; 238:285-93. [PMID: 8479434 DOI: 10.1007/bf00279557] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
A Nicotiana tabacum thioredoxin h gene (EMBL Accession No. Z11803) encoding a new thioredoxin (called h2) was isolated using thioredoxin h1 cDNA (X58527), and represents the first thioredoxin h gene isolated from a higher plant. It encodes a polypeptide of 118 amino acids with the conserved thioredoxin active site Trp-Cys-Gly-Pro-Cys. This gene comprises two introns which have lengths of 1071 and 147 bp respectively, and three exons which encode peptides of 29, 41 and 48 amino acids, respectively. This thioredoxin h shows 66% identity with the amino acid sequence of thioredoxin h1 (X58527) and only around 35% with the choroplastic thioredoxins. The two thioredoxins, h1 and h2, do not have any signal peptides and are most probably cytoplasmic. Using the 3' regions of the mRNAs, two probes specific for thioredoxins h1 and h2 have been prepared. Southern blot analysis shows that thioredoxin sequences are present in only two genomic EcoRI fragments: a 3.3 kb fragment encodes h1 and a 4.5 kb fragment encodes h2. Analysis of the ancestors of the allotetraploid N. tabacum shows that thioredoxin h2 is present in N. sylvestris and N. tomentosiformis but that thioredoxin h1 is absent from both putative ancestors. Thus, the thioredoxin h1 gene has probably been recently introduced in to N. tabacum as a gene of agronomic importance, or linked to such genes. Northern blot analysis shows that both genes are expressed in N. tabacum, mostly in organs or tissues that contain growing cells.(ABSTRACT TRUNCATED AT 250 WORDS)
Collapse
Affiliation(s)
- C Brugidou
- Laboratoire de Biologie Moléculaire et Physiologie Végétale, Unité associée au CNRS 565, Université de Perpignan, France
| | | | | | | |
Collapse
|
55
|
Affiliation(s)
- G Beckmann
- Max-Delbrück-Centre of Molecular Medicine, Berlin-Buch, Germany
| | | |
Collapse
|
56
|
Abstract
There has been a growing recognition that there are many conserved features among apparently diverse RNA splicing systems, suggesting that they may have a common origin. However, pre-tRNA splicing is an apparent exception in nearly all respects. Features of this unique class should be considered in any comprehensive discussion of the origin(s) of splicing and its implications for the evolution of gene structure.
Collapse
Affiliation(s)
- E M Phizicky
- Department of Biochemistry, University of Rochester Medical Center, NY 14642
| | | |
Collapse
|
57
|
Abstract
Evolutionary genetics embodies a broad research area that ranges from the DNA level to studies of genetic aspects in populations. In all cases the purpose is to determine the impact of genetic variation on evolutionary change. The broad range of evolutionary genetics requires the involvement of a diverse group of researchers: molecular biologists, (population) geneticists, biochemists, physiologists, ecologists, ethologists and theorists, each of which has its own insights and interests. For example, biochemists are often not concerned with the physiological function of a protein (with respect to pH, substrates, temperature, etc.), while ecologists, in turn, are often not interested in the biochemical-physiological aspects underlying the traits they study. This review deals with several evolutionary aspects of the Drosophila alcohol dehydrogenase gene-enzyme system, and includes my own personal viewpoints. I have tried to condense and integrate the current knowledge in this field as it has developed since the comprehensive review by van Delden (1982). Details on specific issues may be gained from Sofer and Martin (1987), Sullivan, Atkinson and Starmer (1990); Chambers (1988, 1991); Geer, Miller and Heinstra (1991); and Winberg and McKinley-McKee (1992).
Collapse
Affiliation(s)
- P W Heinstra
- Department of Plant Ecology & Evolutionary Biology, Utrecht University, The Netherlands
| |
Collapse
|
58
|
Dixon B, Pohajdak B. Did the ancestral globin gene of plants and animals contain only two introns? Trends Biochem Sci 1992; 17:486-8. [PMID: 1471256 DOI: 10.1016/0968-0004(92)90334-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
All vertebrate globin genes contain two introns, while plant globin genes contain three. It is widely thought that the plant gene structure reflects the structure of the primordial globin gene and that a common ancestor of all animals lost the central intron shortly after the divergence of plants and animals more than one billion years ago. The recent discovery of a discordant central intron in some animal globin genes suggests that this model is incorrect. We propose that the typical vertebrate two-intron gene structure is the primordial eukaryotic form, and that following the divergence of plants and animals, a common ancestor of plants gained a central intron in the globin gene.
Collapse
Affiliation(s)
- B Dixon
- Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada
| | | |
Collapse
|
59
|
Trezise AE, Szpirer C, Buchwald M. Localization of the gene encoding the cystic fibrosis transmembrane conductance regulator (CFTR) in the rat to chromosome 4 and implications for the evolution of mammalian chromosomes. Genomics 1992; 14:869-74. [PMID: 1282491 DOI: 10.1016/s0888-7543(05)80107-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
We have isolated a partial cDNA encoding the cystic fibrosis transmembrane conductance regulator (CFTR) in the rat. This cDNA hybridizes to a 6.1-kb RNA transcript from the human T84 epithelial cell line and a similarly sized transcript from the rat parotid gland. The nucleotide sequence of this cDNA shows 80.5% identity to the human CFTR cDNA sequence, and the deduced amino acid sequence of rat CFTR shows 75.5% identity to the amino acid sequence of human CFTR. We have used this cDNA to map the location of the gene encoding CFTR to rat chromosome 4. This result places CFTR within a syntenic group on rat chromosome 4 and on human chromosome 7 that includes the genes encoding interleukin 6 (IL6), erythropoietin (EPO), P-glycoprotein 1 (PGY1), and T cell receptor beta chain (TCRB). This group is divided between chromosomes 5 and 6 in the mouse. Mapping of CFTR to rat chromosome 4 shows that this syntenic group has been divided in the mouse lineage during the past 15 million years and further localizes that breakpoint to a sequence homologous to the human chromosome 7q21.1 and 7q32 region. Similarly, a group of five genes, CFTR, TCRB, HOX1, parathyroid hormone-like hormone (PTHLH), and Kirsten rat sarcoma 2 viral (v-Ki-ras2) oncogene homolog (KRAS2), is syntenic on rat chromosome 4 and mouse chromosome 6, but is divided between human chromosomes 7 and 12. These data suggest that the ancestral mammalian chromosome appeared as the present day rat chromosome 4, with all six genes grouped together, and that chromosomal breakages have occurred in the mouse and human lineages since the mammalian divergence.
Collapse
Affiliation(s)
- A E Trezise
- Department of Genetics, Hospital for Sick Children, Toronto, Ontario, Canada
| | | | | |
Collapse
|
60
|
Abstract
Flagellar radial spokes contribute to the regulation of dynein arm activity and thus the pattern of flagellar bending. We have sequenced the genes for radial spoke protein 4 (RSP4) and RSP6, two of the five proteins that make up the radial spoke head in Chlamydomonas reinhardtii. The two genes, which are tightly linked genetically (B. Huang, G. Piperno, Z. Ramanis, and D.J.L. Luck, J. Cell Biol. 88:80-88, 1981), are separated by only 435 bp. They encode proline-rich polypeptides of 49.8 kDa (RSP4) and 48.8 kDa (RSP6), which are 48% identical to each other but do not resemble any previously sequenced proteins. The transcription start sites of these genes and an additional radial spoke protein gene, that for RSP3, were determined, and patterns of mRNA accumulation during flagellar regeneration were examined for the three radial spoke protein genes. These studies provide the molecular tools for a detailed analysis of radial spoke head function and assembly and for a determination of the mechanism by which the genes required to build a complex organelle are regulated.
Collapse
|
61
|
Bach I, Pontoglio M, Yaniv M. Structure of the gene encoding hepatocyte nuclear factor 1 (HNF1). Nucleic Acids Res 1992; 20:4199-204. [PMID: 1354855 PMCID: PMC334126 DOI: 10.1093/nar/20.16.4199] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Genomic clones have been isolated that cover the entire gene for the transcription factor HNF1 (hepatocyte nuclear factor 1). This protein governs the expression of many genes, synthesized in the liver in a tissue-specific manner. We have determined the intron/exon structure of the HNF1 gene, which is strictly conserved between rat and mouse and estimate that it spans not more than 40kb in the rat genome. Whereas most homeoprotein genes do not contain introns within the homeodomain, HNF1 displays an intron between the regions encoding the second and the third helices. We discuss possible evolutionary mechanisms leading to this homeobox intron/exon pattern.
Collapse
Affiliation(s)
- I Bach
- UA 1149 du CNRS, Département des Biotechnologies, Institut Pasteur, Paris, France
| | | | | |
Collapse
|
62
|
Curry AM, Williams BD, Rosenbaum JL. Sequence analysis reveals homology between two proteins of the flagellar radial spoke. Mol Cell Biol 1992; 12:3967-77. [PMID: 1508197 PMCID: PMC360281 DOI: 10.1128/mcb.12.9.3967-3977.1992] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Flagellar radial spokes contribute to the regulation of dynein arm activity and thus the pattern of flagellar bending. We have sequenced the genes for radial spoke protein 4 (RSP4) and RSP6, two of the five proteins that make up the radial spoke head in Chlamydomonas reinhardtii. The two genes, which are tightly linked genetically (B. Huang, G. Piperno, Z. Ramanis, and D.J.L. Luck, J. Cell Biol. 88:80-88, 1981), are separated by only 435 bp. They encode proline-rich polypeptides of 49.8 kDa (RSP4) and 48.8 kDa (RSP6), which are 48% identical to each other but do not resemble any previously sequenced proteins. The transcription start sites of these genes and an additional radial spoke protein gene, that for RSP3, were determined, and patterns of mRNA accumulation during flagellar regeneration were examined for the three radial spoke protein genes. These studies provide the molecular tools for a detailed analysis of radial spoke head function and assembly and for a determination of the mechanism by which the genes required to build a complex organelle are regulated.
Collapse
Affiliation(s)
- A M Curry
- Department of Biology, Yale University, New Haven, Connecticut 06511
| | | | | |
Collapse
|
63
|
Fedorov A, Suboch G, Bujakov M, Fedorova L. Analysis of nonuniformity in intron phase distribution. Nucleic Acids Res 1992; 20:2553-7. [PMID: 1598214 PMCID: PMC312392 DOI: 10.1093/nar/20.10.2553] [Citation(s) in RCA: 57] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling.
Collapse
Affiliation(s)
- A Fedorov
- Department of Molecular Basis of Human Genetics, Academy of Sciences of Russia, Moscow
| | | | | | | |
Collapse
|
64
|
Neidle E, Hartnett C, Ornston LN, Bairoch A, Rekik M, Harayama S. cis-diol dehydrogenases encoded by the TOL pWW0 plasmid xylL gene and the Acinetobacter calcoaceticus chromosomal benD gene are members of the short-chain alcohol dehydrogenase superfamily. EUROPEAN JOURNAL OF BIOCHEMISTRY 1992; 204:113-20. [PMID: 1740120 DOI: 10.1111/j.1432-1033.1992.tb16612.x] [Citation(s) in RCA: 69] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
In the aerobic degradation of benzoate by bacteria, benzoate is first dihydroxylated by a ring-hydroxylating dioxygenase to form a cis-diol (1,2-dihydroxycyclohexa-3,4-diene carboxylate) which is subsequently transformed to a catechol by an NAD(+)-dependent cis-diol dehydrogenase. The structural gene for this dehydrogenase, encoded on TOL plasmid pWW0 of Pseudomonas putida (xylL) and that encoded on the chromosome of Acinetobacter calcoaceticus (benD), were sequenced. They encode polypeptides of about 28 kDa in size. These proteins are similar to each other, exhibiting 58% sequence identity. They are also similar to other proteins of at least 20 different functions, which are members of the short-chain alcohol dehydrogenase family. The alignment of these proteins suggest two amino acids, lysine and tyrosine, as catalytically important residues.
Collapse
Affiliation(s)
- E Neidle
- Department of Biology, Yale University, New Haven
| | | | | | | | | | | |
Collapse
|
65
|
Purugganan M, Wessler S. The splicing of transposable elements and its role in intron evolution. Genetica 1992; 86:295-303. [PMID: 1334914 DOI: 10.1007/bf00133728] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Recent studies have demonstrated that transposable elements in maize and Drosophila are spliced from pre-mRNA. These transposable element introns represent the first examples of recent addition of introns into nuclear genes. The eight reported examples of transposable element splicing include members of the maize Ac/Ds and Spm/dSpm and the Drosophila P and 412 element families. The details of the splicing of these transposable elements and their relevance to models of intron origin are discussed.
Collapse
Affiliation(s)
- M Purugganan
- Botany Department, University of Georgia, Athens 30602
| | | |
Collapse
|
66
|
Kaiser E, Eberhard D, Knippers R. Exons encoding the highly conserved part of human glutaminyl-tRNA synthetase. J Mol Evol 1992; 34:45-53. [PMID: 1556743 DOI: 10.1007/bf00163851] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Aminoacyl-tRNA synthetases are important components of the genetic apparatus. In spite of common catalytic properties, synthetases with different amino acid specificities are widely diverse in their primary structures, subunit sizes, and subunit composition. However, synthetases with given amino acid specificities are well conserved throughout evolution. We have been studying the human glutaminyl-tRNA synthetase possessing a sequence of about 400 amino acid residues (the core region) that is very similar to sequences in the corresponding enzymes from bacteria and yeast. The conserved sequence appears to be essential for the basic function of the enzyme, the charging of tRNA with glutamine. As a first step to a better understanding of the evolution of this enzyme, we determined the coding region for the conserved part of the human glutaminyl-tRNA synthetase. The coding region is composed of eight exons. It appears that individual exons encode defined secondary structural elements as parts of functionally important domains of the enzyme. Evolution of the gene by assembly of individual exons seems to be a viable hypothesis; alternative pathways are discussed.
Collapse
Affiliation(s)
- E Kaiser
- Fakultät für Biologie, Universität Konstanz, Germany
| | | | | |
Collapse
|
67
|
Abstract
Accumulating evidence that introns are highly restricted in their phylogenetic distribution strongly supports the view that introns were inserted late in eukaryotic evolution into preformed genes and, hence, that exon-shuffling played no role in the assembly of primordial genes. Potential mechanisms of intron insertion and the possible evolution of nuclear introns and their splicing machinery from self-splicing group II introns are also discussed.
Collapse
Affiliation(s)
- J D Palmer
- Department of Biology, Indiana University, Bloomington 47405
| | | |
Collapse
|
68
|
Jackson IJ, Chambers DM, Budd PS, Johnson R. The tyrosinase-related protein-1 gene has a structure and promoter sequence very different from tyrosinase. Nucleic Acids Res 1991; 19:3799-804. [PMID: 1907365 PMCID: PMC328466 DOI: 10.1093/nar/19.14.3799] [Citation(s) in RCA: 68] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
We have determined the exon structure of the mouse tyrosinase-related protein-1 (TRP-1) gene. The gene is only 15kb in length, but contains seven introns, in contrast to the tyrosinase gene which is almost 100kb long with only four introns. Only two introns are located in homologous positions in both genes. Intron I of TRP-1 has three alternative 5' splice sites clustered within 21bp, which all splice to the same 3' site. Intron V has a very unusual 5' splice site, which has the dinucleotide GC rather than the conventional GT. We show that as little as 370bp of 5'-flanking DNA is sufficient to direct cell-specific expression of the chloramphenicol acetyl transferase gene. The flanking DNA of TRP-1, unlike tyrosinase, does not contain a TATA box or a CCAAT box. Both mouse genes, however, share an 11bp sequence, also found in human tyrosinase, which we suggest may be a melanocyte-specific promoter element.
Collapse
Affiliation(s)
- I J Jackson
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, UK
| | | | | | | |
Collapse
|
69
|
|
70
|
Abstract
In a recent paper, Walter Gilbert's group has estimated the number of original exons from which all extant proteins might have been constructed. The approach used is subjected to a critical analysis here. It is shown that there are flawed assumptions about both the mechanism and generality of exon-shuffling and in the sequence comparison procedures employed, the latter failing to distinguish chance similarity from similarity due to common ancestry. These methodological errors lead to the omission of many known cases of exon-shuffling and the inclusion of others which may not be genuine. In consequence, the analysis from the Gilbert group cannot give a reliable estimate of those modules that actually participated in exon-shuffling and provides no information on the number of protein archetypes that did not participate in these processes.
Collapse
Affiliation(s)
- L Patthy
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest
| |
Collapse
|