201
|
Tripoli G, D'Elia D, Barsanti P, Caggese C. Comparison of the oxidative phosphorylation (OXPHOS) nuclear genes in the genomes of Drosophila melanogaster, Drosophila pseudoobscura and Anopheles gambiae. Genome Biol 2005; 6:R11. [PMID: 15693940 PMCID: PMC551531 DOI: 10.1186/gb-2005-6-2-r11] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2004] [Revised: 12/08/2004] [Accepted: 01/07/2005] [Indexed: 01/16/2023] Open
Abstract
An analysis of nuclear-encoded oxidative phosphorylation genes in Drosophila and Anopheles reveals that pairs of duplicated genes have strikingly different expression patterns. Background In eukaryotic cells, oxidative phosphorylation (OXPHOS) uses the products of both nuclear and mitochondrial genes to generate cellular ATP. Interspecies comparative analysis of these genes, which appear to be under strong functional constraints, may shed light on the evolutionary mechanisms that act on a set of genes correlated by function and subcellular localization of their products. Results We have identified and annotated the Drosophila melanogaster, D. pseudoobscura and Anopheles gambiae orthologs of 78 nuclear genes encoding mitochondrial proteins involved in oxidative phosphorylation by a comparative analysis of their genomic sequences and organization. We have also identified 47 genes in these three dipteran species each of which shares significant sequence homology with one of the above-mentioned OXPHOS orthologs, and which are likely to have originated by duplication during evolution. Gene structure and intron length are essentially conserved in the three species, although gain or loss of introns is common in A. gambiae. In most tissues of D. melanogaster and A. gambiae the expression level of the duplicate gene is much lower than that of the original gene, and in D. melanogaster at least, its expression is almost always strongly testis-biased, in contrast to the soma-biased expression of the parent gene. Conclusions Quickly achieving an expression pattern different from the parent genes may be required for new OXPHOS gene duplicates to be maintained in the genome. This may be a general evolutionary mechanism for originating phenotypic changes that could lead to species differentiation.
Collapse
Affiliation(s)
- Gaetano Tripoli
- University of Bari, DAPEG Section of Genetics, via Amendola 165/A, 70126 Bari, Italy
| | - Domenica D'Elia
- CNR, Institute of Biomedical Technology, Section of Bari, via Amendola 122/D, 70126 Bari, Italy
| | - Paolo Barsanti
- University of Bari, DAPEG Section of Genetics, via Amendola 165/A, 70126 Bari, Italy
| | - Corrado Caggese
- University of Bari, DAPEG Section of Genetics, via Amendola 165/A, 70126 Bari, Italy
| |
Collapse
|
202
|
Vendramini D. Noncoding DNA and the teem theory of inheritance, emotions and innate behaviour. Med Hypotheses 2005; 64:512-9. [PMID: 15617858 DOI: 10.1016/j.mehy.2004.08.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Accepted: 08/25/2004] [Indexed: 10/26/2022]
Abstract
The evolutionary function of noncoding 'junk' DNA remains one of the most challenging mysteries of genetics. Here a new model of DNA is proposed to explain this function. The hypothesis asserts the DNA molecule contains not one, but two separate modes of inheritance. In addition to exons that code for proteins and physical traits, it is argued noncoding repetitive elements code for the inheritance of emotions and innate behaviour in metazoans. That is to say, noncoding DNA functions as the medium of a second, hitherto unknown evolutionary process that genetically archives adaptive information, configured as emotions and acquired during the life of an organism, into an inheritable form. This second evolutionary process, here called 'Teemosis', is a selectionist process, but paradoxically, because it does not affect physical traits, it has no maladaptive Lamarckian consequences. The medical implications of the hypothesis are discussed.
Collapse
|
203
|
Crawford AC, Kricker JA, Anderson AJ, Richardson NR, Mather PB. Structure and function of a cellulase gene in redclaw crayfish, Cherax quadricarinatus. Gene 2004; 340:267-74. [PMID: 15475168 DOI: 10.1016/j.gene.2004.06.060] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2004] [Revised: 06/14/2004] [Accepted: 06/19/2004] [Indexed: 11/16/2022]
Abstract
The most abundant organic compound produced by plants is cellulose; however, it has long been accepted that most animals do not produce endogenous enzymes required for its degradation, but rely instead on symbiotic relationships with microbes that produce the necessary enzymes. Here, we present the genomic organisation of an endogenous glycosyl hydrolase family (GHF) 9 gene in redclaw crayfish (Cherax quadricarinatus), consolidated from a cDNA sequence determined by Byrne et al. [Gene 239 (1999) 317-324.]. Comparison with several other invertebrate GHF9 genes reveals the conservation of both intron position/phase and splice sequence, which adds support to an argument for an ancestral animal cellulase gene. Furthermore, two introns in plant GHF9 genes are also identical in position, implying a more ancient origin for this class of animal cellulase. Protein purification from redclaw gastric fluid via fast performance liquid chromatography (FPLC) indicated the presence of two endoglucanase enzymes. The molecular weights of these components were determined by matrix-assisted laser desorption/ionisation-time-of-flight (MALDI-TOF) to be 47,887 Da (Cel1) and 50,295 Da (Cel2). Cel1 is possibly the functional product of the described cellulase gene, with N-terminal amino acid residues identical to the translated amino acid sequence from the corresponding gene region. Cel2 was identical to Cel1 for 7 of 11 N-terminal residues and likely to be the product of a paralogous endoglucanase gene. These results suggest that redclaw crayfish possess at least one and possibly two functional, endoglucanase enzymes, although further work is required to confirm their origin and attributes.
Collapse
Affiliation(s)
- Allison C Crawford
- School of Natural Resource Sciences, Queensland University of Technology, GPO Box 2434, Brisbane Q. 4001, Australia
| | | | | | | | | |
Collapse
|
204
|
Prachumwat A, DeVincentis L, Palopoli MF. Intron size correlates positively with recombination rate in Caenorhabditis elegans. Genetics 2004; 166:1585-90. [PMID: 15082572 PMCID: PMC1470791 DOI: 10.1534/genetics.166.3.1585] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A negative correlation between intron size and recombination rate has been reported for the Drosophila melanogaster and human genomes. Population-genetic models suggest that this pattern could be caused by an interaction between recombination rate and the efficacy of natural selection. To test this idea, we examined variation in intron size and recombination rate across the genome of the nematode Caenorhabditis elegans. Interestingly, we found that intron size correlated positively with recombination rate in this species.
Collapse
|
205
|
Bryson-Richardson RJ, Logan DW, Currie PD, Jackson IJ. Large-scale analysis of gene structure in rhodopsin-like GPCRs: evidence for widespread loss of an ancient intron. Gene 2004; 338:15-23. [PMID: 15302402 DOI: 10.1016/j.gene.2004.05.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2003] [Revised: 04/05/2004] [Accepted: 05/06/2004] [Indexed: 10/26/2022]
Abstract
The G-protein-coupled receptors (GPCRs) are a large superfamily of seven transmembrane domain-spanning proteins that mediate signal transduction by activation of G-proteins. Mammalian GPCR genes are reputed to be largely intron-deficient, but there have been no reports of using genome-wide analyses of gene structure to investigate this. Using complete genome sequences, we analysed the intron content of over 850 members of the rhodopsin-like GPCR subfamily (family A G-protein-coupled receptor or GPCR-A) in four species. We find that mouse and human GPCR-As have a large and significant reduction in intron number compared to the rest of their genome. In contrast, invertebrate GPCR-As have an intron repertoire similar to, or slightly greater than, the rest of their genome, suggesting that the reduced intron content in mammals is due to widespread intron loss. Furthermore, we provide a specific example of intron loss through analysis of an intron that is conserved in position and phase within a phylogenetically diverse range of GPCR-As within six vertebrate and invertebrate species. Together, these two lines of evidence provide compelling evidence for the widespread loss of introns during the evolution of the mammalian GPCR-A family.
Collapse
|
206
|
Fritz DT, Liu D, Xu J, Jiang S, Rogers MB. Conservation of Bmp2 post-transcriptional regulatory mechanisms. J Biol Chem 2004; 279:48950-8. [PMID: 15358784 DOI: 10.1074/jbc.m409620200] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Bone morphogenetic protein (BMP) orthologs from diverse species like flies and humans are functionally interchangeable and play key roles in fundamental processes such as dorso-ventral axis formation in metazoans. Because both transcriptional and post-transcriptional mechanisms play central roles in modulating developmental protein levels, we have analyzed the 3'-untranslated region (3'UTR) of the Bmp 2 gene. This 3'UTR is unusually long and is alternatively polyadenylated. Mouse, human, and dog mRNAs are 83-87% identical within this region. A 265-nucleotide sequence, conserved between mammals, birds, frogs, and fish, is present in Bmp2 but not Bmp4. The ability of AmphiBMP2/4, a chordate ortholog to Bmp2 and Bmp4, to align with this sequence suggests that its function may have been lost in Bmp4. Activation of reporter genes by the conserved region acts by a post-transcriptional mechanism. Mouse, human, chick, and zebrafish Bmp2 synthetic RNAs decay rapidly in extracts from cells not expressing Bmp2. In contrast, these RNAs are relatively stable in extracts from Bmp2-expressing cells. Thus, Bmp2 RNA half-lives in vitro correlate with natural Bmp2 mRNA levels. The fact that non-murine RNAs interact appropriately with the mouse decay machinery suggests that the function of these cis-regulatory regions has been conserved for 450 million years since the fish and tetrapod lineages diverged. Overall, our results suggest that the Bmp2 3'UTR contains essential regulatory elements that act post-transcriptionally.
Collapse
Affiliation(s)
- David T Fritz
- Department of Biochemistry and Molecular Biology, University of Medicine and Dentistry of New Jersey (UMDNJ)-NJ Medical School, Newark, New Jersey 07101, USA
| | | | | | | | | |
Collapse
|
207
|
Krauss V, Pecyna M, Kurz K, Sass H. Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ. Mol Biol Evol 2004; 22:74-84. [PMID: 15356279 DOI: 10.1093/molbev/msh255] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic translation initiation factor 2 (eIF2) is a G protein that delivers the methionyl initiator tRNA to the small ribosomal subunit and releases it upon GTP hydrolysis after the recognition of the initiation codon. eIF2 is composed of three subunits, alpha, beta, and gamma. Subunit gamma shows the strongest conservation, and it confers both tRNA and GTP/GDP binding. Using intron positioning and protein sequence alignment, here we show that eIF2gamma is a suitable phylogenetic marker for eukaryotes. We determined or completed the sequences of 13 arthropod eIF2gamma genes. Analyzing the phylogenetic distribution of 52 different intron positions in 55 distantly related eIF2gamma genes, we identified ancient ones and shared derived introns in our data set. Obviously, intron positioning in eIF2gamma is evolutionarily conserved. However, there were episodes of complete and partial intron losses followed by intron gains. We identified 17 clusters of intron positions based on their distribution. The evolution of these clusters appears to be connected with preferred exon length and can be used to estimate the relative timing of intron gain because nearby precursor introns had to be erased from the gene before the new introns could be inserted. Moreover, we identified a putative case of intron sliding that constitutes a synapomorphic character state supporting monophyly of Coleoptera, Lepidoptera, and Diptera excluding Hymenoptera. We also performed tree reconstructions using the eIF2gamma protein sequences and intron positioning as phylogenetic information. Our results support the monophyly of Viridoplantae, Ascomycota, Homobasidiomyceta, and Apicomplexa.
Collapse
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Leipzig, Germany.
| | | | | | | |
Collapse
|
208
|
Karan D, Lesbats M, David JR, Capy P. Evolution of the AMP-forming acetyl-CoA synthetase gene in the Drosophilidae family. J Mol Evol 2004; 57 Suppl 1:S297-303. [PMID: 15008428 DOI: 10.1007/s00239-003-0040-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Analysis of the AMP-forming ACS gene was performed in 12 species of the Drosophilidae family. Systematically four introns, aligned at the same positions, were detected, but none of them showed a position similar to those known for species outside the Drosophilidae family. The average length of introns varied from 63 to 75 bp but in two species Drosophila takahashii and D. kikkawai the length of the second intron was 343 and 210 bp, respectively. In coding regions, about 80% of the third codon positions were substituted while first and second positions showed, respectively, 14% and 6% substitutions. Interestingly, the divergence observed at the protein level between species was very low. The phylogenetic tree based on the DNA sequences of the exons was mainly in agreement with taxonomic classification and previous molecular phylogenies except for D. ananassae, which appeared more closely related to D. subobscura and D. funebris than to the species of the melanogaster group.
Collapse
Affiliation(s)
- Dev Karan
- Laboratoire Populations, Génétique et Evolution, Centre National de la Recherche Scientifique, 91198 Gif-sur-Yvette Cedex, France
| | | | | | | |
Collapse
|
209
|
Grover CE, Kim H, Wing RA, Paterson AH, Wendel JF. Incongruent patterns of local and global genome size evolution in cotton. Genome Res 2004; 14:1474-82. [PMID: 15256507 PMCID: PMC509256 DOI: 10.1101/gr.2673204] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2004] [Accepted: 05/26/2004] [Indexed: 11/24/2022]
Abstract
Genome sizes in plants vary over several orders of magnitude, reflecting a combination of differentially acting local and global forces such as biases in indel accumulation and transposable element proliferation or removal. To gain insight into the relative role of these and other forces, approximately 105 kb of contiguous sequence surrounding the cellulose synthase gene CesA1 was compared for the two coresident genomes (AT and DT) of the allopolyploid cotton species, Gossypium hirsutum. These two genomes differ approximately twofold in size, having diverged from a common ancestor approximately 5-10 million years ago (Mya) and been reunited in the same nucleus at the time of polyploid formation, approximately 1-2 Mya. Gene content, order, and spacing are largely conserved between the two genomes, although a few transposable elements and a single cpDNA fragment distinguish the two homoeologs. Sequence conservation is high in both intergenic and genic regions, with 14 conserved genes detected in both genomes yielding a density of 1 gene every 7.5 kb. In contrast to the twofold overall difference in DNA content, no disparity in size was observed for this 105-kb region, and 555 indels were detected that distinguish the two homoeologous BACs, approximately equally distributed between AT and DT in number and aggregate size. The data demonstrate that genome size evolution at this phylogenetic scale is not primarily caused by mechanisms that operate uniformly across different genomic regions and components; instead, the twofold overall difference in DNA content must reflect locally operating forces between gene islands or in largely gene-free regions.
Collapse
Affiliation(s)
- Corrinne E Grover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
| | | | | | | | | |
Collapse
|
210
|
Cvrčková F, Novotný M, Pícková D, Žárský V. Formin homology 2 domains occur in multiple contexts in angiosperms. BMC Genomics 2004; 5:44. [PMID: 15256004 PMCID: PMC509240 DOI: 10.1186/1471-2164-5-44] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2004] [Accepted: 07/15/2004] [Indexed: 11/10/2022] Open
Abstract
Background Involvement of conservative molecular modules and cellular mechanisms in the widely diversified processes of eukaryotic cell morphogenesis leads to the intriguing question: how do similar proteins contribute to dissimilar morphogenetic outputs. Formins (FH2 proteins) play a central part in the control of actin organization and dynamics, providing a good example of evolutionarily versatile use of a conserved protein domain in the context of a variety of lineage-specific structural and signalling interactions. Results In order to identify possible plant-specific sequence features within the FH2 protein family, we performed a detailed analysis of angiosperm formin-related sequences available in public databases, with particular focus on the complete Arabidopsis genome and the nearly finished rice genome sequence. This has led to revision of the current annotation of half of the 22 Arabidopsis formin-related genes. Comparative analysis of the two plant genomes revealed a good conservation of the previously described two subfamilies of plant formins (Class I and Class II), as well as several subfamilies within them that appear to predate the separation of monocot and dicot plants. Moreover, a number of plant Class II formins share an additional conserved domain, related to the protein phosphatase/tensin/auxilin fold. However, considerable inter-species variability sets limits to generalization of any functional conclusions reached on a single species such as Arabidopsis. Conclusions The plant-specific domain context of the conserved FH2 domain, as well as plant-specific features of the domain itself, may reflect distinct functional requirements in plant cells. The variability of formin structures found in plants far exceeds that known from both fungi and metazoans, suggesting a possible contribution of FH2 proteins in the evolution of the plant type of multicellularity.
Collapse
Affiliation(s)
- Fatima Cvrčková
- Department of Plant Physiology, Faculty of Sciences, Charles University, Viničná 5, CZ 128 44 Praha 2, Czech Republic
| | - Marian Novotný
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Husargatan 3, Box 570, S 751 23 Uppsala, Sweden
| | - Denisa Pícková
- Department of Plant Physiology, Faculty of Sciences, Charles University, Viničná 5, CZ 128 44 Praha 2, Czech Republic
- Institute of Experimental Botany, Faculty of Sciences of the Czech Republic, Rozvojová 135, CZ 165 02 Praha 6, Czech Republic
| | - Viktor Žárský
- Department of Plant Physiology, Faculty of Sciences, Charles University, Viničná 5, CZ 128 44 Praha 2, Czech Republic
- Institute of Experimental Botany, Faculty of Sciences of the Czech Republic, Rozvojová 135, CZ 165 02 Praha 6, Czech Republic
| |
Collapse
|
211
|
Kovaleva ES, Masler EP, Skantar AM, Chitwood DJ. Novel matrix metalloproteinase from the cyst nematodes Heterodera glycines and Globodera rostochiensis. Mol Biochem Parasitol 2004; 136:109-12. [PMID: 15138072 DOI: 10.1016/j.molbiopara.2004.03.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2003] [Revised: 03/05/2004] [Accepted: 03/08/2004] [Indexed: 10/26/2022]
Affiliation(s)
- Elena S Kovaleva
- Nematology Laboratory, U.S. Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA
| | | | | | | |
Collapse
|
212
|
Vinogradov AE. Compactness of human housekeeping genes: selection for economy or genomic design? Trends Genet 2004; 20:248-53. [PMID: 15109779 DOI: 10.1016/j.tig.2004.03.006] [Citation(s) in RCA: 123] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Alexander E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave 4, St Petersburg 194064, Russia.
| |
Collapse
|
213
|
Singh B, Gupta RS. Genomic organization and linkage via a bidirectional promoter of the AP-3 (adaptor protein-3) mu3A and AK (adenosine kinase) genes: deletion mutants of AK in Chinese hamster cells extend into the AP-3 mu3A gene. Biochem J 2004; 378:519-28. [PMID: 14575525 PMCID: PMC1223951 DOI: 10.1042/bj20031219] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2003] [Revised: 10/07/2003] [Accepted: 10/24/2003] [Indexed: 11/17/2022]
Abstract
The cDNA and genomic DNA for the mu3A subunit of the AP-3 (adaptor protein-3) complex were cloned from Chinese hamster cells. The AP-3 mu3A genes in Chinese hamster, human and mouse each comprise nine exons and eight introns, with all introns located in identical positions in the species studied. The AP-3 mu3A genes in these species are linked in a head-to-head fashion with the gene for the purine salvage pathway enzyme AK (adenosine kinase). These genes share the first exon, and a 512 bp fragment covering the intervening untranslated sequence has the characteristic of a CpG island promoter, and it effectively carried out transcription in both directions. Deletion studies indicate that this region contains both positive and negative regulatory elements affecting transcription of these genes. In comparison with the AP-3 mu3A gene (27 kb), the AK gene in human is very large (558 kb), with average exon and intron lengths of approx. 100 bp and 55.7 kb respectively. The ratio of non-coding to coding sequence in the human AK gene is >550, which is the highest reported for any gene. We also present evidence that a number of AK- mutants of Chinese hamster ovary cells contain large deletions that affect both of these genes. In addition to lacking part of the AK gene, two of these mutants also lacked all of the exons and introns corresponding to the AP-3 mu3A gene. These mutants should prove useful in elucidating the role of AP-3 mu3A in vesicle-mediated protein sorting--a process that is altered in Hermansky-Pudlak syndrome. Detailed phylogenetic analysis of the micro family of proteins presented here also provides insight into how different AP complexes are related and may have evolved.
Collapse
Affiliation(s)
- Bhag Singh
- Department of Biochemistry, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | | |
Collapse
|
214
|
Drabenstot SD, Kupfer DM, White JD, Dyer DW, Roe BA, Buchanan KL, Murphy JW. FELINES: a utility for extracting and examining EST-defined introns and exons. Nucleic Acids Res 2004; 31:e141. [PMID: 14602934 PMCID: PMC275578 DOI: 10.1093/nar/gng141] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
FELINES (Finding and Examining Lots of Intron 'N' Exon Sequences) is a utility written to automate construction and analysis of high quality intron and exon sequence databases produced from EST (expressed sequence tag) to genomic sequence alignments. We demonstrated the various programs of the FELINES utility by creating intron and exon sequence databases for the fungal organism Schizosaccharomyces pombe from alignments of EST to genomic sequences. In addition, we analyzed our constructed S.pombe sequence databases and the well-established Saccharomyces cerevisiae intron database from Manuel Ares' Laboratory for conserved sequence motifs. FELINES was shown to be useful for characterizing branchsites, polypyrimidine tracts and 5' and 3' splice sites in the intron databases and exonic splicing enhancers (ESEs) in S.pombe exons. FELINES is available at http://www.genome.ou.edu/informatics.html.
Collapse
Affiliation(s)
- Scott D Drabenstot
- Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, PO Box 26901, BMSB 1053, Oklahoma City, OK 73190, USA
| | | | | | | | | | | | | |
Collapse
|
215
|
Järving R, Järving I, Kurg R, Brash AR, Samel N. On the Evolutionary Origin of Cyclooxygenase (COX) Isozymes. J Biol Chem 2004; 279:13624-33. [PMID: 14732711 DOI: 10.1074/jbc.m313258200] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
In vertebrates, COX-1 and COX-2, two cyclooxygenase isozymes with different physiological functions and gene regulation, catalyze identical reactions in prostaglandin synthesis. It is still not understood why there are multiple forms of COX enzyme in the same cell type and when the evolutionary duplication of the COX gene occurred. Here we report the structure of two genes encoding for COX isozymes in the coral Gersemia fruticosa, the first non-vertebrate organism from which a cyclooxygenase was characterized. Both genes are about 20 kb in size and consist of nine exons. Intron/exon boundaries are well conserved between coral and mammalian COX genes. mRNAs of the previously reported G. fruticosa COX-A (GenBank trade mark accession number AY004222) and the novel COX-B share 94% sequence identity in the coding regions and less than 30% in the 5'- and 3'-untranslated region. Transcripts of both COX genes are detectable in coral cells, although the transcriptional level of COX-A is 2 orders of magnitude higher than COX-B. Expression of both coral genes in mammalian cells gave functional proteins with similar catalytic properties. By data base analyses we also detected and constructed different pairs of COX genes from the primitive chordates, Ciona savignyi and Ciona intestinalis. These two gene pairs encode proteins with 50% intra-species and only 70% cross-species sequence identity. Our results suggest that invertebrate COX gene pairs do not correspond to vertebrate COX-1 and COX-2 and are consistent with duplication of the COX gene having occurred independently in corals, ascidians, and vertebrates. It is evident that due to the importance and complexity of its regulatory role, COX has multiple isoforms in all organisms known to express it, and the genes encoding for the isozymes may to be regulated differently.
Collapse
Affiliation(s)
- Reet Järving
- Department of Chemistry, Tallinn Technical University, Akadeemia tee 15, Tallinn 12618
| | | | | | | | | |
Collapse
|
216
|
Abstract
Comparative analyses of ESTs and cDNAs with genomic DNA predict a high frequency of alternative splicing in human genes. However, there is an ongoing debate as to how many of these predicted splice variants are functional and how many are the result of aberrant splicing (or 'noise'). To address this question, we compared alternatively spliced cassette exons that are conserved between human and mouse with EST-predicted cassette exons that are not conserved in the mouse genome. Presumably, conserved exon-skipping events represent functional alternative splicing. We show that conserved (functional) cassette exons possess unique characteristics in size, repeat content and in their influence on the protein. By contrast, most non-conserved cassette exons do not share these characteristics. We conclude that a significant portion of cassette exons evident in EST databases is not functional, and might result from aberrant rather than regulated splicing.
Collapse
Affiliation(s)
- Rotem Sorek
- Department of Human Genetics, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | | | | |
Collapse
|
217
|
Khalaj V, Smith L, Brookman J, Tuckwell D. Identification of a novel class of annexin genes. FEBS Lett 2004; 562:79-86. [PMID: 15044005 DOI: 10.1016/s0014-5793(04)00186-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2003] [Revised: 02/16/2004] [Accepted: 02/16/2004] [Indexed: 01/15/2023]
Abstract
The annexins are a family of calcium- and phospholipid-binding proteins that have been widely studied in animals. Investigation of annexins in the fungus Aspergillus fumigatus identified a novel annexin-like gene (ANXC4) as well as two conventional annexins (ANXC3.1 and ANXC3.2). The genes were initially identified by bioinformatics, and sequences were then determined experimentally. Reverse transcription polymerase chain reaction indicated that all three genes were expressed. ANXC4 lacked calcium-binding consensus sequences and had a 553 residue N-terminal tail. However, bioinformatics indicated that ANXC4 is an annexin and homologues were identified in other filamentous fungi. ANXC4 therefore represents a new grouping within the annexin family.
Collapse
Affiliation(s)
- Vahid Khalaj
- F2G Ltd., P.O. Box 1, Lankro Way, Eccles, Manchester M30 0BH, UK
| | | | | | | |
Collapse
|
218
|
Valenzuela A, Talavera D, Orozco M, de la Cruz X. Alternative splicing mechanisms for the modulation of protein function: conservation between human and other species. J Mol Biol 2004; 335:495-502. [PMID: 14672658 DOI: 10.1016/j.jmb.2003.10.061] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Alternative splicing (AS) is an important process in eukaryotic organisms by which a given gene may express a set of different protein isoforms depending on the tissue, or the developmental stage of the individual. In the present work, we have compared AS among species, focusing on the conservation of AS mechanisms for the modulation of protein function. For this purpose, we first analysed the frequency with which different species, human, mouse, rat and fruitfly, utilise them. Second, we focused more directly on the conservation among species of the mechanisms themselves. To this end, we compared biologically equivalent AS events between human and mouse, or rat. Our results indicate only minor differences in the frequency of use of these mechanisms, as well as a high degree of conservation among the species studied.
Collapse
Affiliation(s)
- Abel Valenzuela
- Unitat de Modelització Molecular i Bioinformática, Institut de Recerca Biomèdica de Barcelona, Parc Científic de Barcelona, C/Josep Samitier, 1-5, 08028 Barcelona, Spain
| | | | | | | |
Collapse
|
219
|
Zhang Z, Harrison PM, Liu Y, Gerstein M. Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. Genome Res 2004; 13:2541-58. [PMID: 14656962 PMCID: PMC403796 DOI: 10.1101/gr.1429003] [Citation(s) in RCA: 319] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Processed pseudogenes were created by reverse-transcription of mRNAs; they provide snapshots of ancient genes existing millions of years ago in the genome. To find them in the present-day human, we developed a pipeline using features such as intron-absence, frame-disruption, polyadenylation, and truncation. This has enabled us to identify in recent genome drafts approximately 8000 processed pseudogenes (distributed from http://pseudogene.org). Overall, processed pseudogenes are very similar to their closest corresponding human gene, being 94% complete in coding regions, with sequence similarity of 75% for amino acids and 86% for nucleotides. Their chromosomal distribution appears random and dispersed, with the numbers on chromosomes proportional to length, suggesting sustained "bombardment" over evolution. However, it does vary with GC-content: Processed pseudogenes occur mostly in intermediate GC-content regions. This is similar to Alus but contrasts with functional genes and L1-repeats. Pseudogenes, moreover, have age profiles similar to Alus. The number of pseudogenes associated with a given gene follows a power-law relationship, with a few genes giving rise to many pseudogenes and most giving rise to few. The prevalence of processed pseudogenes agrees well with germ-line gene expression. Highly expressed ribosomal proteins account for approximately 20% of the total. Other notables include cyclophilin-A, keratin, GAPDH, and cytochrome c.
Collapse
Affiliation(s)
- Zhaolei Zhang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8114, USA
| | | | | | | |
Collapse
|
220
|
Long M, Betrán E, Thornton K, Wang W. The origin of new genes: glimpses from the young and old. Nat Rev Genet 2003; 4:865-75. [PMID: 14634634 DOI: 10.1038/nrg1204] [Citation(s) in RCA: 650] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Genome data have revealed great variation in the numbers of genes in different organisms, which indicates that there is a fundamental process of genome evolution: the origin of new genes. However, there has been little opportunity to explore how genes with new functions originate and evolve. The study of ancient genes has highlighted the antiquity and general importance of some mechanisms of gene origination, and recent observations of young genes at early stages in their evolution have unveiled unexpected molecular and evolutionary processes.
Collapse
Affiliation(s)
- Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, 1101 East 57th Street, Chicago, Illinois 60637, USA.
| | | | | | | |
Collapse
|
221
|
Abstract
AbstractIntron sizes show an asymmetrical distribution in a number of organisms, with a large number of “short” introns clustered around a minimal intron length and a much broader distribution of longer introns. In Drosophila melanogaster, the short intron class is centered around 61 bp. The narrow length distribution suggests that natural selection may play a role in maintaining intron size. A comparison of 15 orthologous introns among species of the D. melanogaster subgroup indicates that, in general, short introns are not under greater DNA sequence or length constraints than long introns. There is a bias toward deletions in all introns (deletion/insertion ratio is 1.66), and the vast majority of indels are of short length (<10 bp). Indels occurring on the internal branches of the phylogenetic tree are significantly longer than those occurring on the terminal branches. These results are consistent with a compensatory model of intron length evolution in which slightly deleterious short deletions are frequently fixed within species by genetic drift, and relatively rare larger insertions that restore intron length are fixed by positive selection. A comparison of paralogous introns shared among duplicated genes suggests that length constraints differ between introns within the same gene. The janusA, janusB, and ocnus genes share two short introns derived from a common ancestor. The first of these introns shows significantly fewer indels than the second intron, although the two introns show a comparable number of substitutions. This indicates that intron-specific selective constraints have been maintained following gene duplication, which preceded the divergence of the D. melanogaster species subgroup.
Collapse
Affiliation(s)
- John Parsch
- Department of Biology II, Section of Evolutionary Biology, University of Munich (LMU), Munich 80333, Germany.
| |
Collapse
|
222
|
Mattick JS. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. Bioessays 2003; 25:930-9. [PMID: 14505360 DOI: 10.1002/bies.10332] [Citation(s) in RCA: 313] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The central dogma of biology holds that genetic information normally flows from DNA to RNA to protein. As a consequence it has been generally assumed that genes generally code for proteins, and that proteins fulfil not only most structural and catalytic but also most regulatory functions, in all cells, from microbes to mammals. However, the latter may not be the case in complex organisms. A number of startling observations about the extent of non-protein-coding RNA (ncRNA) transcription in the higher eukaryotes and the range of genetic and epigenetic phenomena that are RNA-directed suggests that the traditional view of the structure of genetic regulatory systems in animals and plants may be incorrect. ncRNA dominates the genomic output of the higher organisms and has been shown to control chromosome architecture, mRNA turnover and the developmental timing of protein expression, and may also regulate transcription and alternative splicing. This paper re-examines the available evidence and suggests a new framework for considering and understanding the genomic programming of biological complexity, autopoietic development and phenotypic variation.
Collapse
Affiliation(s)
- John S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St. Lucia QLD 4072, Australia.
| |
Collapse
|
223
|
Abstract
Complete genomic sequences from diverse phylogenetic lineages reveal notable increases in genome complexity from prokaryotes to multicellular eukaryotes. The changes include gradual increases in gene number, resulting from the retention of duplicate genes, and more abrupt increases in the abundance of spliceosomal introns and mobile genetic elements. We argue that many of these modifications emerged passively in response to the long-term population-size reductions that accompanied increases in organism size. According to this model, much of the restructuring of eukaryotic genomes was initiated by nonadaptive processes, and this in turn provided novel substrates for the secondary evolution of phenotypic complexity by natural selection. The enormous long-term effective population sizes of prokaryotes may impose a substantial barrier to the evolution of complex genomes and morphologies.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| | | |
Collapse
|
224
|
Clyne PJ, Brotman JS, Sweeney ST, Davis G. Green Fluorescent Protein Tagging Drosophila Proteins at Their Native Genomic Loci With Small P Elements. Genetics 2003; 165:1433-41. [PMID: 14668392 PMCID: PMC1462835 DOI: 10.1093/genetics/165.3.1433] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Abstract
We describe a technique to tag Drosophila proteins with GFP at their native genomic loci. This technique uses a new, small P transposable element (the Wee-P) that is composed primarily of the green fluorescent protein (GFP) sequence flanked by consensus splice acceptor and splice donor sequences. We demonstrate that insertion of the Wee-P can generate GFP fusions with native proteins. We further demonstrate that GFP-tagged proteins have correct subcellular localization and can be expressed at near-normal levels. We have used the Wee-P to tag genes with a wide variety of functions, including transmembrane proteins. A genetic analysis of 12 representative fusion lines demonstrates that loss-of-function phenotypes are not caused by the Wee-P insertion. This technology allows the generation of GFP-tagged reagents on a genome-wide scale with diverse potential applications.
Collapse
Affiliation(s)
- Peter J Clyne
- Department of Biochemistry and Biophysics, University of California, San Francisco, California 94143-0448, USA
| | | | | | | |
Collapse
|
225
|
Pozzoli U, Elgar G, Cagliani R, Riva L, Comi GP, Bresolin N, Bardoni A, Sironi M. Comparative analysis of vertebrate dystrophin loci indicate intron gigantism as a common feature. Genome Res 2003; 13:764-72. [PMID: 12727896 PMCID: PMC430921 DOI: 10.1101/gr.776503] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes.
Collapse
Affiliation(s)
- Uberto Pozzoli
- IRCCS E. Medea, Associazione La Nostra Famiglia, 23842 Bosisio Parini (LC), Italy.
| | | | | | | | | | | | | | | |
Collapse
|
226
|
Nascimento AM, Goldman GH, Park S, Marras SAE, Delmas G, Oza U, Lolans K, Dudley MN, Mann PA, Perlin DS. Multiple resistance mechanisms among Aspergillus fumigatus mutants with high-level resistance to itraconazole. Antimicrob Agents Chemother 2003; 47:1719-26. [PMID: 12709346 PMCID: PMC153329 DOI: 10.1128/aac.47.5.1719-1726.2003] [Citation(s) in RCA: 191] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A collection of Aspergillus fumigatus mutants highly resistant to itraconazole (RIT) at 100 micro g ml(-1) were selected in vitro (following UV irradiation as a preliminary step) to investigate mechanisms of drug resistance in this clinically important pathogen. Eight of the RIT mutants were found to have a mutation at Gly54 (G54E, -K, or -R) in the azole target gene CYP51A. Primers designed for highly conserved regions of multidrug resistance (MDR) pumps were used in reverse transcriptase PCR amplification reactions to identify novel genes encoding potential MDR efflux pumps in A. fumigatus. Two genes, AfuMDR3 and AfuMDR4, showed prominent changes in expression levels in many RIT mutants and were characterized in more detail. Analysis of the deduced amino acid sequence encoded by AfuMDR3 revealed high similarity to major facilitator superfamily transporters, while AfuMDR4 was a typical member of the ATP-binding cassette superfamily. Real-time quantitative PCR with molecular beacon probes was used to assess expression levels of AfuMDR3 and AfuMDR4. Most RIT mutants showed either constitutive high-level expression of both genes or induction of expression upon exposure to itraconazole. Our results suggest that overexpression of one or both of these newly identified drug efflux pump genes of A. fumigatus and/or selection of drug target site mutations are linked to high-level itraconazole resistance and are mechanistic considerations for the emergence of clinical resistance to itraconazole.
Collapse
Affiliation(s)
- Adriana M Nascimento
- Departamento de Genética, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, São Paulo, Brazil
| | | | | | | | | | | | | | | | | | | |
Collapse
|
227
|
Desseyn JL, Laine A. Characterization of mouse muc6 and evidence of conservation of the gel-forming mucin gene cluster between human and mouse. Genomics 2003; 81:433-6. [PMID: 12676567 DOI: 10.1016/s0888-7543(03)00036-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Using degenerate primers designed from conserved cysteine-rich domains of gel-forming mucins, we cloned two new mouse mucin cDNAs. Blast searching showed that they belong to the same new gene assigned to chromosome 7 band F5. This gene is clustered with the three secreted large gel-forming mucins Muc2, Muc5ac, and Muc5b in a region that exhibits synteny with human chromosome 11p15. Computer analysis and sequence alignments with mucin genes predict that the new gene is composed of 33 exons and spans 30 kb from the initiation ATG codon to the Stop codon. Sequence similarities, domain organization of the deduced peptide, and expression analysis allow us to conclude that this newly cloned mouse gene is Muc6, i.e., the mouse ortholog of human MUC6. Like those of their human homologs, the genomic order and arrangement of the four mucins within the cluster of mucin genes are conserved.
Collapse
|
228
|
Bon E, Casaregola S, Blandin G, Llorente B, Neuvéglise C, Munsterkotter M, Guldener U, Mewes HW, Van Helden J, Dujon B, Gaillardin C. Molecular evolution of eukaryotic genomes: hemiascomycetous yeast spliceosomal introns. Nucleic Acids Res 2003; 31:1121-35. [PMID: 12582231 PMCID: PMC150231 DOI: 10.1093/nar/gkg213] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2002] [Accepted: 12/19/2002] [Indexed: 11/12/2022] Open
Abstract
As part of the exploratory sequencing program Génolevures, visual scrutinisation and bioinformatic tools were used to detect spliceosomal introns in seven hemiascomycetous yeast species. A total of 153 putative novel introns were identified. Introns are rare in yeast nuclear genes (<5% have an intron), mainly located at the 5' end of ORFs, and not highly conserved in sequence. They all share a clear non-random vocabulary: conserved splice sites and conserved nucleotide contexts around splice sites. Homologues of metazoan snRNAs and putative homologues of SR splicing factors were identified, confirming that the spliceosomal machinery is highly conserved in eukaryotes. Several introns' features were tested as possible markers for phylogenetic analysis. We found that intron sizes vary widely within each genome, and according to the phylogenetic position of the yeast species. The evolutionary origin of spliceosomal introns was examined by analysing the degree of conservation of intron positions in homologous yeast genes. Most introns appeared to exist in the last common ancestor of present day yeast species, and then to have been differentially lost during speciation. However, in some cases, it is difficult to exclude a possible sliding event affecting a pre-existing intron or a gain of a novel intron. Taken together, our results indicate that the origin of spliceosomal introns is complex within a given genome, and that present day introns may have resulted from a dynamic flux between intron conservation, intron loss and intron gain during the evolution of hemiascomycetous yeasts.
Collapse
Affiliation(s)
- Elisabeth Bon
- Laboratoire de Génétique Moléculaire et Cellulaire CNRS-INRA, Institut National Agronomique Paris-Grignon, F-78850 Thiverval-Grignon, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
229
|
Wendel JF, Cronn RC, Alvarez I, Liu B, Small RL, Senchina DS. Intron size and genome size in plants. Mol Biol Evol 2002; 19:2346-52. [PMID: 12446829 DOI: 10.1093/oxfordjournals.molbev.a004062] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
230
|
Hartung F, Blattner FR, Puchta H. Intron gain and loss in the evolution of the conserved eukaryotic recombination machinery. Nucleic Acids Res 2002; 30:5175-81. [PMID: 12466542 PMCID: PMC137963 DOI: 10.1093/nar/gkf649] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Intron conservation, intron gain or loss and putative intron sliding events were determined for a set of three genes (SPO11, MRE11 and DMC1) involved in basic aspects of recombination in eukaryotes. These are ancient genes and present in nearly all of the major kingdoms. MRE11 is of bacterial origin and can be found in all kingdoms. DMC1 is a specialized homolog of the bacterial RecA protein, whereas the SPO11 gene is of archaebacterial origin. Only unique homologs of SPO11 are found in animals and fungi whereas three distantly related SPO11 copies are present in plant genomes. A comparison of the respective intron positions and phases of all genes was performed, demonstrating that a quarter of the intron positions were perfectly conserved over more than 1 000 000 000 years. Regarding the remaining three quarters of the introns we found insertions to be about three times more frequent than deletions. Aligning the introns of the three different SPO11 homologs of Arabidopsis thaliana we propose a conclusive model of their evolution. We postulate that at least one duplication event occurred shortly after the divergence of plants from animals and fungi and that a respective homolog has been retained in a protist group, the apicomplexa.
Collapse
Affiliation(s)
- Frank Hartung
- Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 3, D-06466 Gatersleben, Germany
| | | | | |
Collapse
|
231
|
Majewski J, Ott J. Distribution and characterization of regulatory elements in the human genome. Genome Res 2002; 12:1827-36. [PMID: 12466286 PMCID: PMC187578 DOI: 10.1101/gr.606402] [Citation(s) in RCA: 230] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2002] [Accepted: 10/10/2002] [Indexed: 11/24/2022]
Abstract
The regulation of transcription and subsequent gene splicing are crucial to correct gene expression. Although a number of regulatory sequences involved in both processes are known, it is not clear how general their functions are in the genomic context, nor how the regulatory regions are distributed throughout the genome. Here we study the distribution of known mutagenic elements within human introns and exons to deduce the properties of regions essential for splicing and transcription. We show that intronic splicing regulators are generally found close to the splice sites, but may be found as far as 200 nucleotides away from the splice junctions. Similarly, sequences important for splicing may be located as far as 125 nucleotides away from the junctions, within exons. We characterize several types of simple repetitive sequences and low-complexity regions that are overrepresented close to both intron ends and are likely to play important roles in the splicing process. We show that the first introns within most genes play a particularly important regulatory role that is most likely, however, to be involved in transcription control. We also study the distribution of two known regulatory motifs, the GGG trinucleotide and the CpG dinucleotide, and deduce their respective importance to splicing and transcription regulation.
Collapse
|
232
|
Ptak SE, Petrov DA. How intron splicing affects the deletion and insertion profile in Drosophila melanogaster. Genetics 2002; 162:1233-44. [PMID: 12454069 PMCID: PMC1462315 DOI: 10.1093/genetics/162.3.1233] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Studies of "dead-on-arrival" transposable elements in Drosophila melanogaster found that deletions outnumber insertions approximately 8:1 with a median size for deletions of approximately 10 bp. These results are consistent with the deletion and insertion profiles found in most other Drosophila pseudogenes. In contrast, a recent study of D. melanogaster introns found a deletion/insertion ratio of 1.35:1, with 84% of deletions being shorter than 10 bp. This discrepancy could be explained if deletions, especially long deletions, are more frequently strongly deleterious than insertions and are eliminated disproportionately from intron sequences. To test this possibility, we use analysis and simulations to examine how deletions and insertions of different lengths affect different components of splicing and determine the distribution of deletions and insertions that preserve the original exons. We find that, consistent with our predictions, longer deletions affect splicing at a much higher rate compared to insertions and short deletions. We also explore other potential constraints in introns and show that most of these also disproportionately affect large deletions. Altogether we demonstrate that constraints in introns may explain much of the difference in the pattern of deletions and insertions observed in Drosophila introns and pseudogenes.
Collapse
Affiliation(s)
- Susan E Ptak
- Department of Biological Sciences, Stanford University, California 94305, USA.
| | | |
Collapse
|
233
|
Kaessmann H, Zöllner S, Nekrutenko A, Li WH. Signatures of domain shuffling in the human genome. Genome Res 2002; 12:1642-50. [PMID: 12421750 PMCID: PMC187552 DOI: 10.1101/gr.520702] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
To elucidate the role of exon shuffling in shaping the complexity of the human genome/proteome, we have systematically analyzed intron phase distributions in the coding sequence of human protein domains. We found that introns at the boundaries of domains show high excess of symmetrical phase combinations (i.e., 0-0, 1-1, and 2-2), whereas nonboundary introns show no excess symmetry. This suggests that exon shuffling has primarily involved rearrangement of structural and functional domains as a whole. Furthermore, we found that domains flanked by phase 1 introns have dramatically expanded in the human genome due to domain shuffling and that 1-1 symmetrical domains and domain families are nonrandomly distributed with respect to their age. The predominance and extracellular location of 1-1 symmetrical domains among domains specific to metazoans suggests that they are associated with the rise of multicellularity. On the other hand, 0-0 symmetrical domains tend to be over-represented among ancient protein domains that are shared between the eukaryotic and prokaryotic kingdoms, which is compatible with the suggestion of primordial domain shuffling in the progenote. To see whether the human data reflect general genomic patterns of metazoans, similar analyses were done for the nematode Caenorhabditis elegans. Although the C. elegans data generally concur with the human patterns, we identified fewer intron-bounded domains in this organism, consistent with the lower complexity of C. elegans genes. [The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: Z. Gu and R. Stevens.]
Collapse
Affiliation(s)
- Henrik Kaessmann
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA.
| | | | | | | |
Collapse
|
234
|
Rose AB. Requirements for intron-mediated enhancement of gene expression in Arabidopsis. RNA (NEW YORK, N.Y.) 2002; 8:1444-53. [PMID: 12458797 PMCID: PMC1370350 DOI: 10.1017/s1355838202020551] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
To explore possible mechanisms of intron-mediated enhancement of gene expression, the features of PAT1 intron 1 required to elevate mRNA accumulation were systematically tested in transgenic Arabidopsis. This intron is remarkably resilient, retaining some ability to increase mRNA accumulation when splicing was prevented by mutation of 5' and 3' splice sites, branchpoint sequences, or when intron U-richness was reduced. Enhancement was abolished by simultaneously eliminating branchpoints and the 5' splice site, structures involved in the first two steps of spliceosome assembly. Although this suggests that the splicing machinery is required, intron splicing is clearly not enough to enhance mRNA accumulation. Five other introns were all efficiently spliced but varied widely in their ability to increase mRNA levels. Furthermore, PAT1 intron 1 was spliced but lost the ability to elevate mRNA accumulation when moved to the 3' UTR. These findings demonstrate that splicing per se is neither necessary nor sufficient for an intron to enhance mRNA accumulation, and suggest a mechanism that requires intron recognition by the splicing machinery but also involves nonconserved intron sequences.
Collapse
Affiliation(s)
- Alan B Rose
- Molecular and Cellular Biology, University of California, Davis, California, 95616, USA.
| |
Collapse
|
235
|
Balraj P, Concannon P, Jamal R, Beghini A, Hoe TS, Khoo AS, Volpi L. An unusual mutation in RECQ4 gene leading to Rothmund-Thomson syndrome. Mutat Res 2002; 508:99-105. [PMID: 12379465 DOI: 10.1016/s0027-5107(02)00189-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Rothmund-Thomson syndrome (OMIM #268400) is a severe autosomal recessive genodermatosis: characterised by growth retardation, hyperpigmentation and frequently accompanied by congenital bone defects, brittle hair and hypogonadism. Mutations in helicase RECQ4 gene are responsible for a subset of cases of RTS. Only six mutations have been reported, thus, far and each affecting the coding sequence or the splice junctions. We report the first homozygous mutation in RECQ4 helicase: 2746-2756-delTGGGCTGAGGC in IVS8 responsible for the severe phenotype associated with RTS in a Malaysian pedigree. We report also a 5321 G-->A transition in exon 17 and the updated list of the RECQ4 gene mutations.
Collapse
Affiliation(s)
- Pauline Balraj
- Division of Molecular Pathology, Cancer Research Centre, Institute for Medical Research, 50588 Kuala Lumpur, Malaysia
| | | | | | | | | | | | | |
Collapse
|
236
|
Rogozin IB, Makarova KS, Natale DA, Spiridonov AN, Tatusov RL, Wolf YI, Yin J, Koonin EV. Congruent evolution of different classes of non-coding DNA in prokaryotic genomes. Nucleic Acids Res 2002; 30:4264-71. [PMID: 12364605 PMCID: PMC140549 DOI: 10.1093/nar/gkf549] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Prokaryotic genomes are considered to be 'wall-to-wall' genomes, which consist largely of genes for proteins and structural RNAs, with only a small fraction of the genomic DNA allotted to intergenic regions, which are thought to typically contain regulatory signals. The majority of bacterial and archaeal genomes contain 6-14% non-coding DNA. Significant positive correlations were detected between the fraction of non-coding DNA and inter- and intra-operonic distances, suggesting that different classes of non-coding DNA evolve congruently. In contrast, no correlation was found between any of these characteristics of non-coding sequences and the number of genes or genome size. Thus, the non-coding regions and the gene sets in prokaryotes seem to evolve in different regimes. The evolution of non-coding regions appears to be determined primarily by the selective pressure to minimize the amount of non-functional DNA, while maintaining essential regulatory signals, because of which the content of non-coding DNA in different genomes is relatively uniform and intra- and inter-operonic non-coding regions evolve congruently. In contrast, the gene set is optimized for the particular environmental niche of the given microbe, which results in the lack of correlation between the gene number and the characteristics of non-coding regions.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | | | | | | | |
Collapse
|
237
|
Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA. Selection for short introns in highly expressed genes. Nat Genet 2002; 31:415-8. [PMID: 12134150 DOI: 10.1038/ng940] [Citation(s) in RCA: 378] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Transcription is a slow and expensive process: in eukaryotes, approximately 20 nucleotides can be transcribed per second at the expense of at least two ATP molecules per nucleotide. Thus, at least for highly expressed genes, transcription of long introns, which are particularly common in mammals, is costly. Using data on the expression of genes that encode proteins in Caenorhabditis elegans and Homo sapiens, we show that introns in highly expressed genes are substantially shorter than those in genes that are expressed at low levels. This difference is greater in humans, such that introns are, on average, 14 times shorter in highly expressed genes than in genes with low expression, whereas in C. elegans the difference in intron length is only twofold. In contrast, the density of introns in a gene does not strongly depend on the level of gene expression. Thus, natural selection appears to favor short introns in highly expressed genes to minimize the cost of transcription and other molecular processes, such as splicing.
Collapse
Affiliation(s)
- Cristian I Castillo-Davis
- Department of Organismic and Evolutionary Biology, Harvard University, 16 Divinity Avenue, Cambridge, Massachusetts 02138, USA
| | | | | | | | | |
Collapse
|
238
|
Abstract
The protein coding sequences of most eukaryotic messenger RNA precursors (pre-mRNAs) are interrupted by non-coding sequences called introns. Pre-mRNA splicing is the process by which introns are removed and the protein coding elements assembled into mature mRNAs. Alternative pre-mRNA splicing selectively joins different protein coding elements to form mRNAs that encode proteins with distinct functions, and is therefore an important source of protein diversity. The elaboration of this mechanism may have had a significant role in the expansion of metazoan proteomes during evolution.
Collapse
Affiliation(s)
- Tom Maniatis
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | | |
Collapse
|
239
|
Abstract
Alu repetitive elements are found in approximately 1.4 million copies in the human genome, comprising more than one-tenth of it. Numerous studies describe exonizations of Alu elements, that is, splicing-mediated insertions of parts of Alu sequences into mature mRNAs. To study the connection between the exonization of Alu elements and alternative splicing, we used a database of ESTs and cDNAs aligned to the human genome. We compiled two exon sets, one of 1176 alternatively spliced internal exons, and another of 4151 constitutively spliced internal exons. Sixty one alternatively spliced internal exons (5.2%) had a significant BLAST hit to an Alu sequence, but none of the constitutively spliced internal exons had such a hit. The vast majority (84%) of the Alu-containing exons that appeared within the coding region of mRNAs caused a frame-shift or a premature termination codon. Alu-containing exons were included in transcripts at lower frequencies than alternatively spliced exons that do not contain an Alu sequence. These results indicate that internal exons that contain an Alu sequence are predominantly, if not exclusively, alternatively spliced. Presumably, evolutionary events that cause a constitutive insertion of an Alu sequence into an mRNA are deleterious and selected against.
Collapse
Affiliation(s)
- Rotem Sorek
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel.
| | | | | |
Collapse
|
240
|
Abstract
To gauge the processes that might direct the length of introns, I studied the balance of indels (insertions or deletions, determined using Alu and LINE1 retroposon repeats) and the density of these repeats in the introns of the human genome. The indel balance is biased in favour of deletions and correlated with the divergence of repeats. At fixed repeat divergence, the indel bias correlated with the intron size: the shorter the intron, the more deletions were favoured over insertions. This correlation with the intron size was stronger than with the gene-wide or isochore-wide parameters. The density of repeats (the number of repeats in a unit of intron length) correlated positively with the intron size. Thus, quite different mechanisms, the indel bias and the integration and/or persistence of retroposons, act in the same direction in regards to intron size, which suggests selection for the size of individual introns.
Collapse
Affiliation(s)
- Alexander E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave. 4, St Petersburg 194064, Russia.
| |
Collapse
|
241
|
Abstract
Debate over the mechanisms responsible for the phylogenetic and genomic distribution of introns has proceeded largely without consideration of the population-genetic forces influencing the establishment and retention of novel genetic elements. However, a simple model incorporating random genetic drift and weak mutation pressure against intron-containing alleles yields predictions consistent with a diversity of observations: (i) the rarity of introns in unicellular organisms with large population sizes, and their expansion after the origin of multicellular organisms with reduced population sizes; (ii) the relationship between intron abundance and the stringency of splice-site requirements; (iii) the tendency for introns to be more numerous and longer in regions of low recombination; and (iv) the bias toward phase-0 introns. This study provides a second example of a mechanism whereby genomic complexity originates passively as a "pathological" response to small population size, and raises difficulties for the idea that ancient introns played a major role in the origin of genes by exon shuffling.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
242
|
Schoonderwoert VTG, Martens GJM. Structural gene organization and evolutionary aspects of the V-ATPase accessory subunit Ac45. BIOCHIMICA ET BIOPHYSICA ACTA 2002; 1574:245-54. [PMID: 11997089 DOI: 10.1016/s0167-4781(01)00368-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The vacuolar H+-ATPase (V-ATPase) is a multisubunit enzyme that couples ATP hydrolysis to proton pumping across membranes. The intracellular targeting and activity of the V-ATPase may be regulated via proteins that interact with the pump such as the accessory subunit Ac45. Here we report the isolation and characterization of the gene encoding Ac45. This single-copy gene is located in a gene-dense region of chromosome Xq and consists of 10 exons spanning approximately 8 kb in the mouse and human genomes. The gene structure is poorly conserved in that its invertebrate orthologs of Caenorhabditis elegans and Drosophila melanogaster encompass only six and four exons extending over 4.1 and 2.1 kb, respectively. Furthermore, the overall degree of amino acid sequence identity between the mammalian and invertebrate Ac45 proteins is very low (<18%), except for a surprisingly highly conserved putative targeting motif in the carboxy-terminal region. Primer extension analysis revealed that the mouse Ac45 gene contains two major transcription initiation sites. The start sites are not preceded by a clear CAAT-box and are located in a CpG island. The most downstream start site contains a TATA-box and transcriptional regulatory elements such as PEA-3, F2F, Maz and Sp1. The limited number of regulatory DNA elements common in the genes encoding Ac45 and V-ATPase subunits suggests a differential regulation of these genes. Together with the finding that Ac45 appears to occur only in multicellular organisms, these results indicate that this accessory subunit directs the V-ATPase to specialized and complex vacuolar systems.
Collapse
Affiliation(s)
- Vincent Th G Schoonderwoert
- Department of Animal Physiology, University of Nijmegen, Geert Grooteplein Zuid 28, RT193, 6525 GA, Nijmegen, The Netherlands
| | | |
Collapse
|
243
|
Takahashi I, Takahashi T, Komatsu M, Sato T, Takada G. An exonic mutation of the GH-1 gene causing familial isolated growth hormone deficiency type II. Clin Genet 2002; 61:222-5. [PMID: 12000366 DOI: 10.1034/j.1399-0004.2002.610310.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
A heterozygous base change was identified in exon 3 of the growth hormone (GH)-1 gene in a Japanese family with autosomal dominant GH deficiency. All of the patients from this family had a heterozygous G to T transversion at the first 5'-site nucleotide of exon 3. Analysis of the GH-1 cDNA, synthesized from lymphoblasts of the patients, revealed an abnormal shorter transcript as well as a normal-sized transcript. Direct sequencing of this abnormal transcript showed that the transcript completely lacked exon 3. In familial isolated GH deficiency (IGHD) type II, several heterozygous mutations have been reported at the donor splice site in intron 3 of the GH-1 gene or inside intron 3, which causes aberrant GH messenger RNA splicing, resulting in the deletion of exon 3. This deletion causes a lack of amino acid residues 32-71 in the mature GH protein. This mutant GH is well-known to exert a dominant negative effect on the secretion of mature normal GH protein. Thus, in the subject family, a heterozygous G-to-T transversion at the first nucleotide of the exon 3 deletes exon 3 in mature GH mRNA and causes GH deficiency. The present authors suggest that the first nucleotide of exon 3 is critical for the splicing of GH-1 mRNA.
Collapse
Affiliation(s)
- I Takahashi
- Department of Pediatrics, Akita University School of Medicine, Akita, Japan
| | | | | | | | | |
Collapse
|
244
|
Webb CT, Shabalina SA, Ogurtsov AY, Kondrashov AS. Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae. Nucleic Acids Res 2002; 30:1233-9. [PMID: 11861916 PMCID: PMC101251 DOI: 10.1093/nar/30.5.1233] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2001] [Revised: 01/11/2002] [Accepted: 01/11/2002] [Indexed: 11/12/2022] Open
Abstract
Patterns of similarity between genomes of related species reflect the distribution of selective constraint within DNA. We analyzed alignments of 142 orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae and found a mosaic pattern with regions of high similarity (phylogenetic footprints) interspersed with non-alignable sequences. Footprints cover approximately 20% of intergenic regions, often occur in clumps and are rare within 5' UTRs but common within 3' UTRs. The footprints have a higher ratio of transitions to transversions than expected at random and a higher GC content than the rest of the intergenic region. The number of footprints and the GC content of footprints within an intergenic region are higher when genes are oriented so that their 5' ends form the boundaries of the intergenic region. Overall, the patterns and characteristics identified here, along with other comparative and experimental studies, suggest that many footprints have a regulatory function, although other types of function are also possible. These conclusions may be quite general across eukaryotes, and the characteristics of conserved regulatory elements determined from genomic comparisons can be useful in prediction of regulation sites within individual DNA sequences.
Collapse
Affiliation(s)
- Colleen T Webb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | | | |
Collapse
|
245
|
Glanzer JG, Liao L, Baker T, McMullen MH, Langan AS, Crandall LZ, Vorce RL. Organization and regulation of the human rasGAP gene. Gene 2002; 285:149-56. [PMID: 12039041 DOI: 10.1016/s0378-1119(02)00415-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
ras GTPase activating protein (rasGAP) is highly conserved among mammalian species and is required for normal cardiovascular system development. Expression of this protein exhibits both quantitative and qualitative variability among tissues. Using a combination of DNA sequencing and database analyses, we have determined that the human rasGAP gene spans 122 kb and is composed of 25 exons; the size of each intron and the intron/exon junctions also have been elucidated. With one exception, all intron/exon boundaries conform to the GT/AG rule; the splice donor site of intron 3 is GC/AG. Results of RNA ligase mediated rapid amplification of cDNA ends followed by sequence determination indicate that the transcription start point (TSP) is approximately 588 bp upstream from the translational start site and is uninterrupted by introns; this extremely long 5' untranslated region is continuous with the first coding exon. Analysis of 1 kb of sequence upstream of the TSP did not identify any of the typical promoter elements (TATA or CAAT boxes). Sequential deletions of this 1 kb region followed by secreted alkaline phosphatase reporter gene analysis revealed that transcription is supported by this region of the rasGAP gene. Because the highest efficiency is demonstrated by a 213 bp sequence just upstream from the TSP (-786 to -584), this region is identified as containing the rasGAP minimal promoter. Sequence analysis of this 213 bp sequence shows few candidate sites for transcription factor binding. A 406 bp fragment surrounding the TSP exhibits characteristics of a CpG island (68% C+G; observed/expected ratio of CpG=0.95). RapidScan analysis revealed that high levels of rasGAP transcript are present in placenta and testis, but transcript is not detectable in kidney and intestinal tract. These data suggest that rasGAP transcription is regulated by an atypical mechanism capable of producing quantitative variability among tissue types.
Collapse
Affiliation(s)
- Jason G Glanzer
- Department of Pharmacology, University of Nebraska Medical Center, Omaha, NE, USA
| | | | | | | | | | | | | |
Collapse
|
246
|
Barker FK, Barrowclough GF, Groth JG. A phylogenetic hypothesis for passerine birds: taxonomic and biogeographic implications of an analysis of nuclear DNA sequence data. Proc Biol Sci 2002; 269:295-308. [PMID: 11839199 PMCID: PMC1690884 DOI: 10.1098/rspb.2001.1883] [Citation(s) in RCA: 289] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Passerine birds comprise over half of avian diversity, but have proved difficult to classify. Despite a long history of work on this group, no comprehensive hypothesis of passerine family-level relationships was available until recent analyses of DNA-DNA hybridization data. Unfortunately, given the value of such a hypothesis in comparative studies of passerine ecology and behaviour, the DNA-hybridization results have not been well tested using independent data and analytical approaches. Therefore, we analysed nucleotide sequence variation at the nuclear RAG-1 and c-mos genes from 69 passerine taxa, including representatives of most currently recognized families. In contradiction to previous DNA-hybridization studies, our analyses suggest paraphyly of suboscine passerines because the suboscine New Zealand wren Acanthisitta was found to be sister to all other passerines. Additionally, we reconstructed the parvorder Corvida as a basal paraphyletic grade within the oscine passerines. Finally, we found strong evidence that several family-level taxa are misplaced in the hybridization results, including the Alaudidae, Irenidae, and Melanocharitidae. The hypothesis of relationships we present here suggests that the oscine passerines arose on the Australian continental plate while it was isolated by oceanic barriers and that a major northern radiation of oscines (i.e. the parvorder Passerida) originated subsequent to dispersal from the south.
Collapse
Affiliation(s)
- F Keith Barker
- Department of Ornithology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA.
| | | | | |
Collapse
|
247
|
|
248
|
Gregory TR. A bird's-eye view of the C-value enigma: genome size, cell size, and metabolic rate in the class aves. Evolution 2002; 56:121-30. [PMID: 11913657 DOI: 10.1111/j.0014-3820.2002.tb00854.x] [Citation(s) in RCA: 180] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
For half a century, variation in genome size (C-value) has been an unresolved puzzle in evolutionary biology. While the initial "C-value paradox" was solved with the discovery of noncoding DNA, a much more complex "C-value enigma" remains. The present study focuses on one aspect of this puzzle, namely the small genome sizes of birds. Significant negative correlations are reported between resting metabolic rate and both C-value and erythrocyte size. Cell size is positively correlated with both nucleus size and C-value in birds, as in other vertebrates. These findings shed light on the constraints acting on genome size in birds and illustrate the importance of interactions among various levels of the biological hierarchy, ranging from the subchromosomal to the ecological. Following from a discussion of the mechanistic bases of the correlations reported and the processes by which birds achieved and/or maintain small genomes, a pluralistic approach to the C-value enigma is recommended.
Collapse
Affiliation(s)
- T Ryan Gregory
- Department of Zoology, University of Guelph, Ontario, Canada.
| |
Collapse
|
249
|
Sakharkar M, Passetti F, de Souza JE, Long M, de Souza SJ. ExInt: an Exon Intron Database. Nucleic Acids Res 2002; 30:191-4. [PMID: 11752290 PMCID: PMC99089 DOI: 10.1093/nar/30.1.191] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2001] [Accepted: 09/26/2001] [Indexed: 11/14/2022] Open
Abstract
The Exon/Intron Database (ExInt) stores information of all GenBank eukaryotic entries containing an annotated intron sequence. Data are available through a retrieval system, as flat-files and as a MySQL dump file. In this report we discuss several implementations added to ExInt, which is accessible at http://intron.bic.nus.edu.sg/exint/newexint/exint.html.
Collapse
Affiliation(s)
- M Sakharkar
- Ludwig Institute for Cancer Research, Sao Paulo Branch, Rua Prof. Antonio Prudente 109, 01509-010, Sao Paulo, Brazil
| | | | | | | | | |
Collapse
|
250
|
Boudet N, Aubourg S, Toffano-Nioche C, Kreis M, Lecharny A. Evolution of intron/exon structure of DEAD helicase family genes in Arabidopsis, Caenorhabditis, and Drosophila. Genome Res 2001; 11:2101-14. [PMID: 11731501 PMCID: PMC311229 DOI: 10.1101/gr.200801] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The DEAD box RNA helicase (RH) proteins are homologs involved in diverse cellular functions in all of the organisms from prokaryotes to eukaryotes. Nevertheless, there is a lack of conservation in the splicing pattern in the 53 Arabidopsis thaliana (AtRHs), the 32 Caenorhabditis elegans (CeRHs) and the 29 Drosophila melanogaster (DmRHs) genes. Of the 153 different observed intron positions, 4 are conserved between AtRHs, CeRHs, and DmRHs, and one position is also found in RHs from yeast and human. Of the 27 different AtRH structures with introns, 20 have at least one predicted ancient intron in the regions coding for the catalytic domain. In all of the organisms examined, we found at least one gene with most of its intron predicted to be ancient. In A. thaliana, the large diversity in RH structures suggests that duplications of the ancestral RH were followed by a high number of intron deletions and additions. The very high bias toward phase 0 introns is in favor of intron addition, preferentially in phase 0. Results from this comparative study of the same gene family in a plant and in two animals are discussed in terms of the general mechanisms of gene family evolution.
Collapse
Affiliation(s)
- N Boudet
- Institut de Biotechnologie des Plantes, Unité Mixte de Recherche-Centre National Recherche Scientifique 8618, Université de Paris-Sud, Bât. 630, F-91405 Orsay Cedex, France
| | | | | | | | | |
Collapse
|