1
|
Dignam JD, Lebovitz RM, Roeder RG. Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res 1983; 11:1475-89. [PMID: 6828386 PMCID: PMC325809 DOI: 10.1093/nar/11.5.1475] [Citation(s) in RCA: 10162] [Impact Index Per Article: 242.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
We have developed a procedure for preparing extracts from nuclei of human tissue culture cells that directs accurate transcription initiation in vitro from class II promoters. Conditions of extraction and assay have been optimized for maximum activity using the major late promoter of adenovirus 2. The extract also directs accurate transcription initiation from other adenovirus promoters and cellular promoters. The extract also directs accurate transcription initiation from class III promoters (tRNA and Ad 2 VA).
Collapse
|
research-article |
42 |
10162 |
2
|
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 1997; 25:955-64. [PMID: 9023104 PMCID: PMC146525 DOI: 10.1093/nar/25.5.955] [Citation(s) in RCA: 7798] [Impact Index Per Article: 278.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
We describe a program, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases. Two previously described tRNA detection programs are used as fast, first-pass prefilters to identify candidate tRNAs, which are then analyzed by a highly selective tRNA covariance model. This work represents a practical application of RNA covariance models, which are general, probabilistic secondary structure profiles based on stochastic context-free grammars. tRNAscan-SE searches at approximately 30 000 bp/s. Additional extensions to tRNAscan-SE detect unusual tRNA homologues such as selenocysteine tRNAs, tRNA-derived repetitive elements and tRNA pseudogenes.
Collapse
|
research-article |
28 |
7798 |
3
|
Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG. Sequence and organization of the human mitochondrial genome. Nature 1981; 290:457-65. [PMID: 7219534 DOI: 10.1038/290457a0] [Citation(s) in RCA: 6433] [Impact Index Per Article: 146.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The complete sequence of the 16,569-base pair human mitochondrial genome is presented. The genes for the 12S and 16S rRNAs, 22 tRNAs, cytochrome c oxidase subunits I, II and III, ATPase subunit 6, cytochrome b and eight other predicted protein coding genes have been located. The sequence shows extreme economy in that the genes have none or only a few noncoding bases between them, and in many cases the termination codons are not coded in the DNA but are created post-transcriptionally by polyadenylation of the mRNAs.
Collapse
|
|
44 |
6433 |
4
|
Blattner FR, Plunkett G, Bloch CA, Perna NT, Burland V, Riley M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, Gregor J, Davis NW, Kirkpatrick HA, Goeden MA, Rose DJ, Mau B, Shao Y. The complete genome sequence of Escherichia coli K-12. Science 1997; 277:1453-62. [PMID: 9278503 DOI: 10.1126/science.277.5331.1453] [Citation(s) in RCA: 5375] [Impact Index Per Article: 192.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.
Collapse
MESH Headings
- Bacterial Proteins/chemistry
- Bacterial Proteins/genetics
- Bacterial Proteins/metabolism
- Bacteriophage lambda/genetics
- Base Composition
- Binding Sites
- Chromosome Mapping
- DNA Replication
- DNA Transposable Elements
- DNA, Bacterial/genetics
- Escherichia coli/genetics
- Genes, Bacterial
- Genome, Bacterial
- Molecular Sequence Data
- Mutation
- Operon
- RNA, Bacterial/genetics
- RNA, Transfer/genetics
- Recombination, Genetic
- Regulatory Sequences, Nucleic Acid
- Repetitive Sequences, Nucleic Acid
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
Collapse
|
|
28 |
5375 |
5
|
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M, Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, Durand E, Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M, Pezzotti M, Lecharny A, Scarpelli C, Artiguenave F, Pè ME, Valle G, Morgante M, Caboche M, Adam-Blondon AF, Weissenbach J, Quétier F, Wincker P. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 2007; 449:463-7. [PMID: 17721507 DOI: 10.1038/nature06148] [Citation(s) in RCA: 2264] [Impact Index Per Article: 125.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2007] [Accepted: 08/07/2007] [Indexed: 01/12/2023]
Abstract
The analysis of the first plant genomes provided unexpected evidence for genome duplication events in species that had previously been considered as true diploids on the basis of their genetics. These polyploidization events may have had important consequences in plant evolution, in particular for species radiation and adaptation and for the modulation of functional capacities. Here we report a high-quality draft of the genome sequence of grapevine (Vitis vinifera) obtained from a highly homozygous genotype. The draft sequence of the grapevine genome is the fourth one produced so far for flowering plants, the second for a woody species and the first for a fruit crop (cultivated for both fruit and beverage). Grapevine was selected because of its important place in the cultural heritage of humanity beginning during the Neolithic period. Several large expansions of gene families with roles in aromatic features are observed. The grapevine genome has not undergone recent genome duplication, thus enabling the discovery of ancestral traits and features of the genetic organization of flowering plants. This analysis reveals the contribution of three ancestral genomes to the grapevine haploid content. This ancestral arrangement is common to many dicotyledonous plants but is absent from the genome of rice, which is a monocotyledon. Furthermore, we explain the chronology of previously described whole-genome duplication events in the evolution of flowering plants.
Collapse
|
Journal Article |
18 |
2264 |
6
|
Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 2004; 32:11-6. [PMID: 14704338 PMCID: PMC373265 DOI: 10.1093/nar/gkh152] [Citation(s) in RCA: 2025] [Impact Index Per Article: 96.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A computer program, ARAGORN, identifies tRNA and tmRNA genes. The program employs heuristic algorithms to predict tRNA secondary structure, based on homology with recognized tRNA consensus sequences and ability to form a base-paired cloverleaf. tmRNA genes are identified using a modified version of the BRUCE program. ARAGORN achieves a detection sensitivity of 99% from a set of 1290 eubacterial, eukaryotic and archaeal tRNA genes and detects all complete tmRNA sequences in the tmRNA database, improving on the performance of the BRUCE program. Recently discovered tmRNA genes in the chloroplasts of two species from the 'green' algae lineage are detected. The output of the program reports the proposed tRNA secondary structure and, for tmRNA genes, the secondary structure of the tRNA domain, the tmRNA gene sequence, the tag peptide and a list of organisms with matching tmRNA peptide tags.
Collapse
|
Journal Article |
21 |
2025 |
7
|
Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. GeSeq - versatile and accurate annotation of organelle genomes. Nucleic Acids Res 2017; 45:W6-W11. [PMID: 28486635 PMCID: PMC5570176 DOI: 10.1093/nar/gkx391] [Citation(s) in RCA: 1987] [Impact Index Per Article: 248.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Revised: 04/13/2017] [Accepted: 04/27/2017] [Indexed: 11/21/2022] Open
Abstract
We have developed the web application GeSeq (https://chlorobox.mpimp-golm.mpg.de/geseq.html) for the rapid and accurate annotation of organellar genome sequences, in particular chloroplast genomes. In contrast to existing tools, GeSeq combines batch processing with a fully customizable reference sequence selection of organellar genome records from NCBI and/or references uploaded by the user. For the annotation of chloroplast genomes, the application additionally provides an integrated database of manually curated reference sequences. GeSeq identifies genes or other feature-encoding regions by BLAT-based homology searches and additionally, by profile HMM searches for protein and rRNA coding genes and two de novo predictors for tRNA genes. These unique features enable the user to conveniently compare the annotations of different state-of-the-art methods, thus supporting high-quality annotations. The main output of GeSeq is a GenBank file that usually requires only little curation and is instantly visualized by OGDRAW. GeSeq also offers a variety of optional additional outputs that facilitate downstream analyzes, for example comparative genomic or phylogenetic studies.
Collapse
|
research-article |
8 |
1987 |
8
|
Abstract
A 3'-end proximal segment of most of the putative mRNAs encoded in the heavy strand of HeLa cell mtDNA has been partially sequences and aligned with the DNA sequence. In all cases, the 3'-end nucleotide of the individual mRNA coding sequences has been found to be immediately contiguous to a tRNA gene or another mRNA coding sequence. These and previous results indicate that the heavy (H) strand sequences coding for the rRNA, poly(A)-containing RNA and tRNA species form a continuum extending over almost the entire length of this strand. We propose that the H strand is transcribed into a single polycistronic RNA molecule, which is processed later into mature species by precise endonucleolytic cleavages which occur, in most cases, immediately before and after a tRNA sequence.
Collapse
|
|
44 |
1907 |
9
|
Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res 2005; 33:W686-9. [PMID: 15980563 PMCID: PMC1160127 DOI: 10.1093/nar/gki366] [Citation(s) in RCA: 1857] [Impact Index Per Article: 92.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Transfer RNAs (tRNAs) and small nucleolar RNAs (snoRNAs) are two of the largest classes of non-protein-coding RNAs. Conventional gene finders that detect protein-coding genes do not find tRNA and snoRNA genes because they lack the codon structure and statistical signatures of protein-coding genes. Previously, we developed tRNAscan-SE, snoscan and snoGPS for the detection of tRNAs, methylation-guide snoRNAs and pseudouridylation-guide snoRNAs, respectively. tRNAscan-SE is routinely applied to completed genomes, resulting in the identification of thousands of tRNA genes. Snoscan has successfully detected methylation-guide snoRNAs in a variety of eukaryotes and archaea, and snoGPS has identified novel pseudouridylation-guide snoRNAs in yeast and mammals. Although these programs have been quite successful at RNA gene detection, their use has been limited by the need to install and configure the software packages on UNIX workstations. Here, we describe online implementations of these RNA detection tools that make these programs accessible to a wider range of research biologists. The tRNAscan-SE, snoscan and snoGPS servers are available at http://lowelab.ucsc.edu/tRNAscan-SE/, http://lowelab.ucsc.edu/snoscan/ and http://lowelab.ucsc.edu/snoGPS/, respectively.
Collapse
|
research-article |
20 |
1857 |
10
|
Bibb MJ, Van Etten RA, Wright CT, Walberg MW, Clayton DA. Sequence and gene organization of mouse mitochondrial DNA. Cell 1981; 26:167-80. [PMID: 7332926 DOI: 10.1016/0092-8674(81)90300-7] [Citation(s) in RCA: 1252] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The complete sequence of the 16,295 bp mouse L cell mitochondrial DNA genome has been determined. Genes for the 12S and 16S ribosomal RNAs; 22 tRNAs; cytochrome c oxidase subunits I, II and III; ATPase subunit 6; cytochrome b; and eight unidentified proteins have been located. The genome displays exceptional economy of organization, with tRNA genes interspersed between rRNA and protein-coding genes with zero or few noncoding nucleotides between coding sequences. Only two significant portions of the genome, the 879 nucleotide displacement-loop region containing the origin of heavy-strand replication and the 32 nucleotide origin of light-strand replication, do not encode a functional RNA species. All of the remaining nucleotide sequence serves as a defined coding function, with the exception of 32 nucleotides, of which 18 occur at the 5' ends of open reading frames. Mouse mitochondrial DNA is unique in that the translational start codon is AUN, with any of the four nucleotides in the third position, whereas the only translational stop codon is the orthodox UAA. The mouse mitochondrial DNA genome is highly homologous in overall sequence and in gene organization to human mitochondrial DNA, with the descending order of conserved regions being tRNA genes; origin of light-strand replication; rRNA genes; known protein-coding genes; unidentified protein-coding genes; displacement-loop region.
Collapse
|
|
44 |
1252 |
11
|
Dujon B, Sherman D, Fischer G, Durrens P, Casaregola S, Lafontaine I, De Montigny J, Marck C, Neuvéglise C, Talla E, Goffard N, Frangeul L, Aigle M, Anthouard V, Babour A, Barbe V, Barnay S, Blanchin S, Beckerich JM, Beyne E, Bleykasten C, Boisramé A, Boyer J, Cattolico L, Confanioleri F, De Daruvar A, Despons L, Fabre E, Fairhead C, Ferry-Dumazet H, Groppi A, Hantraye F, Hennequin C, Jauniaux N, Joyet P, Kachouri R, Kerrest A, Koszul R, Lemaire M, Lesur I, Ma L, Muller H, Nicaud JM, Nikolski M, Oztas S, Ozier-Kalogeropoulos O, Pellenz S, Potier S, Richard GF, Straub ML, Suleau A, Swennen D, Tekaia F, Wésolowski-Louvel M, Westhof E, Wirth B, Zeniou-Meyer M, Zivanovic I, Bolotin-Fukuhara M, Thierry A, Bouchier C, Caudron B, Scarpelli C, Gaillardin C, Weissenbach J, Wincker P, Souciet JL. Genome evolution in yeasts. Nature 2004; 430:35-44. [PMID: 15229592 DOI: 10.1038/nature02579] [Citation(s) in RCA: 1221] [Impact Index Per Article: 58.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2004] [Accepted: 04/19/2004] [Indexed: 11/09/2022]
Abstract
Identifying the mechanisms of eukaryotic genome evolution by comparative genomics is often complicated by the multiplicity of events that have taken place throughout the history of individual lineages, leaving only distorted and superimposed traces in the genome of each living organism. The hemiascomycete yeasts, with their compact genomes, similar lifestyle and distinct sexual and physiological properties, provide a unique opportunity to explore such mechanisms. We present here the complete, assembled genome sequences of four yeast species, selected to represent a broad evolutionary range within a single eukaryotic phylum, that after analysis proved to be molecularly as diverse as the entire phylum of chordates. A total of approximately 24,200 novel genes were identified, the translation products of which were classified together with Saccharomyces cerevisiae proteins into about 4,700 families, forming the basis for interspecific comparisons. Analysis of chromosome maps and genome redundancies reveal that the different yeast lineages have evolved through a marked interplay between several distinct molecular mechanisms, including tandem gene repeat formation, segmental duplication, a massive genome duplication and extensive gene loss.
Collapse
|
Research Support, Non-U.S. Gov't |
21 |
1221 |
12
|
Wilson R, Ainscough R, Anderson K, Baynes C, Berks M, Bonfield J, Burton J, Connell M, Copsey T, Cooper J. 2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegans. Nature 1994; 368:32-8. [PMID: 7906398 DOI: 10.1038/368032a0] [Citation(s) in RCA: 1175] [Impact Index Per Article: 37.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
As part of our effort to sequence the 100-megabase (Mb) genome of the nematode Caenorhabditis elegans, we have completed the nucleotide sequence of a contiguous 2,181,032 base pairs in the central gene cluster of chromosome III. Analysis of the finished sequence has indicated an average density of about one gene per five kilobases; comparison with the public sequence databases reveals similarities to previously known genes for about one gene in three. In addition, the genomic sequence contains several intriguing features, including putative gene duplications and a variety of other repeats with potential evolutionary implications.
Collapse
|
|
31 |
1175 |
13
|
Abstract
A unique transfer RNA (tRNA)/aminoacyl-tRNA synthetase pair has been generated that expands the number of genetically encoded amino acids in Escherichia coli. When introduced into E. coli, this pair leads to the in vivo incorporation of the synthetic amino acid O-methyl-l-tyrosine into protein in response to an amber nonsense codon. The fidelity of translation is greater than 99%, as determined by analysis of dihydrofolate reductase containing the unnatural amino acid. This approach should provide a general method for increasing the genetic repertoire of living cells to include a variety of amino acids with novel structural, chemical, and physical properties not found in the common 20 amino acids.
Collapse
MESH Headings
- Anticodon
- Codon/genetics
- Codon/metabolism
- Codon, Terminator
- Escherichia coli/genetics
- Escherichia coli/growth & development
- Escherichia coli/metabolism
- Genetic Code
- Mass Spectrometry
- Methanococcus/enzymology
- Methanococcus/genetics
- Methyltyrosines/metabolism
- Mutation
- Protein Biosynthesis
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- RNA, Transfer, Tyr/genetics
- RNA, Transfer, Tyr/metabolism
- Suppression, Genetic
- Transfer RNA Aminoacylation
- Transformation, Bacterial
- Tyrosine-tRNA Ligase/chemistry
- Tyrosine-tRNA Ligase/genetics
- Tyrosine-tRNA Ligase/metabolism
Collapse
|
|
24 |
1154 |
14
|
Wolstenholme DR. Animal mitochondrial DNA: structure and evolution. INTERNATIONAL REVIEW OF CYTOLOGY 1992; 141:173-216. [PMID: 1452431 DOI: 10.1016/s0074-7696(08)62066-5] [Citation(s) in RCA: 1120] [Impact Index Per Article: 33.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
Review |
33 |
1120 |
15
|
Cannone JJ, Subramanian S, Schnare MN, Collett JR, D'Souza LM, Du Y, Feng B, Lin N, Madabusi LV, Müller KM, Pande N, Shang Z, Yu N, Gutell RR. The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics 2002; 3:2. [PMID: 11869452 PMCID: PMC65690 DOI: 10.1186/1471-2105-3-2] [Citation(s) in RCA: 1114] [Impact Index Per Article: 48.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2001] [Accepted: 01/17/2002] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Comparative analysis of RNA sequences is the basis for the detailed and accurate predictions of RNA structure and the determination of phylogenetic relationships for organisms that span the entire phylogenetic tree. Underlying these accomplishments are very large, well-organized, and processed collections of RNA sequences. This data, starting with the sequences organized into a database management system and aligned to reveal their higher-order structure, and patterns of conservation and variation for organisms that span the phylogenetic tree, has been collected and analyzed. This type of information can be fundamental for and have an influence on the study of phylogenetic relationships, RNA structure, and the melding of these two fields. RESULTS We have prepared a large web site that disseminates our comparative sequence and structure models and data. The four major types of comparative information and systems available for the three ribosomal RNAs (5S, 16S, and 23S rRNA), transfer RNA (tRNA), and two of the catalytic intron RNAs (group I and group II) are: (1) Current Comparative Structure Models; (2) Nucleotide Frequency and Conservation Information; (3) Sequence and Structure Data; and (4) Data Access Systems. CONCLUSIONS This online RNA sequence and structure information, the result of extensive analysis, interpretation, data collection, and computer program and web development, is accessible at our Comparative RNA Web (CRW) Site http://www.rna.icmb.utexas.edu. In the future, more data and information will be added to these existing categories, new categories will be developed, and additional RNAs will be studied and presented at the CRW Site.
Collapse
MESH Headings
- Base Sequence/genetics
- Databases, Nucleic Acid
- Internet
- Molecular Sequence Data
- Nucleic Acid Conformation
- RNA/chemistry
- RNA/genetics
- RNA, Archaeal/chemistry
- RNA, Archaeal/genetics
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 23S/chemistry
- RNA, Ribosomal, 23S/genetics
- RNA, Ribosomal, 5S/chemistry
- RNA, Ribosomal, 5S/genetics
- RNA, Transfer/chemistry
- RNA, Transfer/genetics
Collapse
|
research-article |
23 |
1114 |
16
|
Eichinger L, Pachebat J, Glöckner G, Rajandream MA, Sucgang R, Berriman M, Song J, Olsen R, Szafranski K, Xu Q, Tunggal B, Kummerfeld S, Madera M, Konfortov BA, Rivero F, Bankier AT, Lehmann R, Hamlin N, Davies R, Gaudet P, Fey P, Pilcher K, Chen G, Saunders D, Sodergren E, Davis P, Kerhornou A, Nie X, Hall N, Anjard C, Hemphill L, Bason N, Farbrother P, Desany B, Just E, Morio T, Rost R, Churcher C, Cooper J, Haydock S, van Driessche N, Cronin A, Goodhead I, Muzny D, Mourier T, Pain A, Lu M, Harper D, Lindsay R, Hauser H, James K, Quiles M, Babu MM, Saito T, Buchrieser C, Wardroper A, Felder M, Thangavelu M, Johnson D, Knights A, Loulseged H, Mungall K, Oliver K, Price C, Quail M, Urushihara H, Hernandez J, Rabbinowitsch E, Steffen D, Sanders M, Ma J, Kohara Y, Sharp S, Simmonds M, Spiegler S, Tivey A, Sugano S, White B, Walker D, Woodward J, Winckler T, Tanaka Y, Shaulsky G, Schleicher M, Weinstock G, Rosenthal A, Cox E, Chisholm RL, Gibbs R, Loomis WF, Platzer M, Kay RR, Williams J, Dear PH, Noegel AA, Barrell B, Kuspa A. The genome of the social amoeba Dictyostelium discoideum. Nature 2005; 435:43-57. [PMID: 15875012 PMCID: PMC1352341 DOI: 10.1038/nature03481] [Citation(s) in RCA: 977] [Impact Index Per Article: 48.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2004] [Accepted: 02/17/2005] [Indexed: 02/07/2023]
Abstract
The social amoebae are exceptional in their ability to alternate between unicellular and multicellular forms. Here we describe the genome of the best-studied member of this group, Dictyostelium discoideum. The gene-dense chromosomes of this organism encode approximately 12,500 predicted proteins, a high proportion of which have long, repetitive amino acid tracts. There are many genes for polyketide synthases and ABC transporters, suggesting an extensive secondary metabolism for producing and exporting small molecules. The genome is rich in complex repeats, one class of which is clustered and may serve as centromeres. Partial copies of the extrachromosomal ribosomal DNA (rDNA) element are found at the ends of each chromosome, suggesting a novel telomere structure and the use of a common mechanism to maintain both the rDNA and chromosomal termini. A proteome-based phylogeny shows that the amoebozoa diverged from the animal-fungal lineage after the plant-animal split, but Dictyostelium seems to have retained more of the diversity of the ancestral genome than have plants, animals or fungi.
Collapse
|
research-article |
20 |
977 |
17
|
Xie K, Minkenberg B, Yang Y. Boosting CRISPR/Cas9 multiplex editing capability with the endogenous tRNA-processing system. Proc Natl Acad Sci U S A 2015; 112:3570-5. [PMID: 25733849 PMCID: PMC4371917 DOI: 10.1073/pnas.1420294112] [Citation(s) in RCA: 949] [Impact Index Per Article: 94.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 nuclease (Cas9) system is being harnessed as a powerful tool for genome engineering in basic research, molecular therapy, and crop improvement. This system uses a small guide RNA (gRNA) to direct Cas9 endonuclease to a specific DNA site; thus, its targeting capability is largely constrained by the gRNA-expressing device. In this study, we developed a general strategy to produce numerous gRNAs from a single polycistronic gene. The endogenous tRNA-processing system, which precisely cleaves both ends of the tRNA precursor, was engineered as a simple and robust platform to boost the targeting and multiplex editing capability of the CRISPR/Cas9 system. We demonstrated that synthetic genes with tandemly arrayed tRNA-gRNA architecture were efficiently and precisely processed into gRNAs with desired 5' targeting sequences in vivo, which directed Cas9 to edit multiple chromosomal targets. Using this strategy, multiplex genome editing and chromosomal-fragment deletion were readily achieved in stable transgenic rice plants with a high efficiency (up to 100%). Because tRNA and its processing system are virtually conserved in all living organisms, this method could be broadly used to boost the targeting capability and editing efficiency of CRISPR/Cas9 toolkits.
Collapse
|
research-article |
10 |
949 |
18
|
Clary DO, Wolstenholme DR. The mitochondrial DNA molecular of Drosophila yakuba: nucleotide sequence, gene organization, and genetic code. J Mol Evol 1985; 22:252-71. [PMID: 3001325 DOI: 10.1007/bf02099755] [Citation(s) in RCA: 943] [Impact Index Per Article: 23.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The sequence of the 16,019 nucleotide-pair mitochondrial DNA (mtDNA) molecule of Drosophila yakuba is presented. This molecule contains the genes for two rRNAs, 22 tRNAs, six identified proteins [cytochrome b, cytochrome c oxidase subunits I, II, and III (COI-III), and ATPase subunits 6 and 8] and seven presumptive proteins (URF1-6 and URF4L). Replication originates within a region of 1077 nucleotides that is 92.8% A + T and lacks any open reading frame larger than 123 nucleotides. An equivalent to the sequence found in all mammalian mtCDNAs that is associated with initiation of second-strand DNA synthesis is not present in D. yakuba mtDNA. Introns are absent from D. yakuba mitochondrial genes and there are few (0-31) intergenic nucleotides. The genes found in D. yakuba and mammalian mtDNAs are the same, but there are differences in their arrangement and in the relative proportions of the complementary strands of the molecule that serve as templates for transcription. Although the D. yakuba small and large mitochondrial rRNA genes are exceptionally low in G and C and are shorter than any other metazoan rRNA genes reported, they can be folded into secondary structures remarkably similar to the secondary structures proposed for mammalian mitochondrial rRNAs. D. yakuba mitochondrial tRNA genes, like their mammalian counterparts, are more variable in sequence than nonorganelle tRNAs. In mitochondrial protein genes ATG, ATT, ATA, and in one case (COI) ATAA appear to be used as translation initiation codons. The only termination codon found in these genes is TAA. In the D. yakuba mitochondrial genetic code, AGA, ATA, and TGA specify serine, isoleucine, and tryptophan, respectively. Fifty-nine types of sense condon are used in the D. yakuba mitochondrial protein genes, but 93.8% of all codons end in A or T. Codon-anticodon interactions may include both G-A and C-A pairing in the wobble position. Evidence is summarized that supports the hypothesis that A and T nucleotides are favored at all locations in the D. yakuba mtDNA molecule where these nucleotides are compatible with function.
Collapse
|
|
40 |
943 |
19
|
Abstract
The nucleic acid sequence bank now contains over 600 protein coding genes of which 107 are from prokaryotic organisms. Codon frequencies in each new prokaryotic gene are given. Analysis of genetic code usage in the 83 sequenced genes of the Escherichia coli genome (chromosome, transposons and plasmids) is presented, taking into account new data on gene expressivity and regulation as well as iso-tRNA specificity and cellular concentration. The codon composition of each gene is summarized using two indexes: one is based on the differential usage of iso-tRNA species during gene translation, the other on choice between Cytosine and Uracil for third base. A strong relationship between codon composition and mRNA expressivity is confirmed, even for genes transcribed in the same operon. The influence of codon use of peptide elongation rate and protein yield is discussed. Finally, the evolutionary aspect of codon selection in mRNA sequences is studied.
Collapse
|
research-article |
43 |
883 |
20
|
Abstract
Studies of diseases caused by mitochondrial DNA mutations suggest that a variety of degenerative processes may be associated with defects in oxidative phosphorylation (OXPHOS). Application of this hypothesis has provided new insights into such diverse clinical problems as ischemic heart disease, late-onset diabetes, Parkinson's disease, Alzheimer's disease, and aging.
Collapse
|
|
33 |
875 |
21
|
Abstract
Virulence factors of pathogenic bacteria (adhesins, toxins, invasins, protein secretion systems, iron uptake systems, and others) may be encoded by particular regions of the prokaryotic genome termed pathogenicity islands. Pathogenicity islands were first described in human pathogens of the species Escherichia coli, but have recently been found in the genomes of various pathogens of humans, animals, and plants. Pathogenicity islands comprise large genomic regions [10-200 kilobases (kb) in size] that are present on the genomes of pathogenic strains but absent from the genomes of nonpathogenic members of the same or related species. The finding that the G+C content of pathogenicity islands often differs from that of the rest of the genome, the presence of direct repeats at their ends, the association of pathogenicity islands with transfer RNA genes, the presence of integrase determinants and other mobility loci, and their genetic instability argue for the generation of pathogenicity islands by horizontal gene transfer, a process that is well known to contribute to microbial evolution. In this article we review these and other aspects of pathogenicity islands and discuss the concept that they represent a subclass of genomic islands. Genomic islands are present in the majority of genomes of pathogenic as well as nonpathogenic bacteria and may encode accessory functions which have been previously spread among bacterial populations.
Collapse
|
Review |
24 |
823 |
22
|
Brown WM, Prager EM, Wang A, Wilson AC. Mitochondrial DNA sequences of primates: tempo and mode of evolution. J Mol Evol 1982; 18:225-39. [PMID: 6284948 DOI: 10.1007/bf01734101] [Citation(s) in RCA: 822] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
We cloned and sequenced a segment of mitochondrial DNA from human, chimpanzee, gorilla, orangutan, and gibbon. This segment is 896 bp in length, contains the genes for three transfer RNAs and parts of two proteins, and is homologous in all 5 primates. The 5 sequences differ from one another by base substitutions at 283 positions and by a deletion of one base pair. The sequence differences range from 9 to 19% among species, in agreement with estimates from cleavage map comparisons, thus confirming that the rate of mtDNA evolution in primates is 5 to 10 times higher than in nuclear DNA. The most striking new finding to emerge from these comparisons is that transitions greatly outnumber transversions. Ninety-two percent of the differences among the most closely related species (human, chimpanzee, and gorilla) are transitions. For pairs of species with longer divergence times, the observed percentage of transitions falls until, in the case of comparisons between primates and non-primates, it reaches a value of 45. The time dependence is probably due to obliteration of the record of transitions by multiple substitutions at the same nucleotide site. This finding illustrates the importance of choosing closely related species for analysis of evolutionary process. The remarkable bias toward transitions in mtDNA evolution necessitates the revision of equations that correct for multiple substitutions at the same site. With revised equations, we calculated the incidence of silent and replacement substitutions in the two protein-coding genes. The silent substitution rate is 4 to 6 times higher than the replacement rate, indicating strong functional constraints at replacement sites. Moreover, the silent rate for these two genes is about 10% per million years, a value 10 times higher than the silent rate for the nuclear genes studied so far. In addition, the mean substitution rate in the three mitochondrial tRNA genes is at least 100 times higher than in nuclear tRNA genes. Finally, genealogical analysis of the sequence differences supports the view that the human lineage branched off only slightly before the gorilla and chimpanzee lineages diverged and strengthens the hypothesis that humans are more related to gorillas and chimpanzees than is the orangutan.
Collapse
|
Comparative Study |
43 |
822 |
23
|
Horton R, Wilming L, Rand V, Lovering RC, Bruford EA, Khodiyar VK, Lush MJ, Povey S, Talbot CC, Wright MW, Wain HM, Trowsdale J, Ziegler A, Beck S. Gene map of the extended human MHC. Nat Rev Genet 2004; 5:889-99. [PMID: 15573121 DOI: 10.1038/nrg1489] [Citation(s) in RCA: 821] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The major histocompatibility complex (MHC) is the most important region in the vertebrate genome with respect to infection and autoimmunity, and is crucial in adaptive and innate immunity. Decades of biomedical research have revealed many MHC genes that are duplicated, polymorphic and associated with more diseases than any other region of the human genome. The recent completion of several large-scale studies offers the opportunity to assimilate the latest data into an integrated gene map of the extended human MHC. Here, we present this map and review its content in relation to paralogy, polymorphism, immune function and disease.
Collapse
|
Review |
21 |
821 |
24
|
Newport J, Kirschner M. A major developmental transition in early Xenopus embryos: II. Control of the onset of transcription. Cell 1982; 30:687-96. [PMID: 7139712 DOI: 10.1016/0092-8674(82)90273-2] [Citation(s) in RCA: 780] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We have shown in the accompanying paper that a developmental transition occurs at the midblastula stage (cleavage 12) in Xenopus embryos, and that this midblastula transition (MBT) is apparently initiated when the ratio of nucleus to cytoplasm reaches a critical value. One manifestation of this transition is the onset of transcription. We show here that a plasmid containing a cloned gene coding for a yeast leucine tRNA comes under developmental control when injected into cleaving eggs. In pre-MBT eggs this plasmid is transiently transcribed and then becomes inactive; however, it becomes transcriptionally active again at the MBT. This pre-MBT suppression of transcription can be reversed by addition of competing DNA. The amount of DNA needed to induce premature transcription is equal to the amount of nuclear DNA present after 12 cleavages (24 ng), suggesting that the MBT is triggered by the DNA through titration of suppressor components present in the egg.
Collapse
|
|
43 |
780 |
25
|
Sprinzl M, Horn C, Brown M, Ioudovitch A, Steinberg S. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res 1998; 26:148-53. [PMID: 9399820 PMCID: PMC147216 DOI: 10.1093/nar/26.1.148] [Citation(s) in RCA: 732] [Impact Index Per Article: 27.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Sequences of 3279 sequences of tRNA genes and tRNAs published up to December 1996 are included in the compilation. Alignment of the sequences, which is most compatible with the tRNA phylogeny and known three-dimensional structures of tRNA, is used. Sequences and references are available under http://www.uni-bayreuth. de/departments/biochemie/trna/
Collapse
|
research-article |
27 |
732 |