26
|
Ougland R, Lando D, Jonson I, Dahl JA, Moen MN, Nordstrand LM, Rognes T, Lee JT, Klungland A, Kouzarides T, Larsen E. ALKBH1 is a histone H2A dioxygenase involved in neural differentiation. Stem Cells 2013; 30:2672-82. [PMID: 22961808 PMCID: PMC3546389 DOI: 10.1002/stem.1228] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Accepted: 08/19/2012] [Indexed: 12/18/2022]
Abstract
AlkB homolog 1 (ALKBH1) is one of nine members of the family of mammalian AlkB homologs. Most Alkbh1(-/-) mice die during embryonic development, and survivors are characterized by defects in tissues originating from the ectodermal lineage. In this study, we show that deletion of Alkbh1 prolonged the expression of pluripotency markers in embryonic stem cells and delayed the induction of genes involved in early differentiation. In vitro differentiation to neural progenitor cells (NPCs) displayed an increased rate of apoptosis in the Alkbh1(-/-) NPCs when compared with wild-type cells. Whole-genome expression analysis and chromatin immunoprecipitation revealed that ALKBH1 regulates both directly and indirectly, a subset of genes required for neural development. Furthermore, our in vitro enzyme activity assays demonstrate that ALKBH1 is a histone dioxygenase that acts specifically on histone H2A. Mass spectrometric analysis demonstrated that histone H2A from Alkbh1(-/-) mice are improperly methylated. Our results suggest that ALKBH1 is involved in neural development by modifying the methylation status of histone H2A.
Collapse
|
27
|
Backe PH, Simm R, Laerdahl JK, Dalhus B, Fagerlund A, Okstad OA, Rognes T, Alseth I, Kolstø AB, Bjørås M. A new family of proteins related to the HEAT-like repeat DNA glycosylases with affinity for branched DNA structures. J Struct Biol 2013; 183:66-75. [PMID: 23623903 DOI: 10.1016/j.jsb.2013.04.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Revised: 04/09/2013] [Accepted: 04/16/2013] [Indexed: 12/27/2022]
Abstract
The recently discovered HEAT-like repeat (HLR) DNA glycosylase superfamily is widely distributed in all domains of life. The present bioinformatics and phylogenetic analysis shows that HLR DNA glycosylase superfamily members in the genus Bacillus form three subfamilies: AlkC, AlkD and AlkF/AlkG. The crystal structure of AlkF shows structural similarity with the DNA glycosylases AlkC and AlkD, however neither AlkF nor AlkG display any DNA glycosylase activity. Instead, both proteins have affinity to branched DNA structures such as three-way and Holliday junctions. A unique β-hairpin in the AlkF/AlkG subfamily is most likely inserted into the DNA major groove, and could be a structural determinant regulating DNA substrate affinity. We conclude that AlkF and AlkG represent a new family of HLR proteins with affinity for branched DNA structures.
Collapse
|
28
|
Søberg K, Jahnsen T, Rognes T, Skålhegg BS, Laerdahl JK. Evolutionary paths of the cAMP-dependent protein kinase (PKA) catalytic subunits. PLoS One 2013; 8:e60935. [PMID: 23593352 PMCID: PMC3625193 DOI: 10.1371/journal.pone.0060935] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2012] [Accepted: 03/05/2013] [Indexed: 11/19/2022] Open
Abstract
3',5'-cyclic adenosine monophosphate (cAMP) dependent protein kinase or protein kinase A (PKA) has served as a prototype for the large family of protein kinases that are crucially important for signal transduction in eukaryotic cells. The PKA catalytic subunits Cα and Cβ, encoded by the two genes PRKACA and PRKACB, respectively, are among the best understood and characterized human kinases. Here we have studied the evolution of this gene family in chordates, arthropods, mollusks and other animals employing probabilistic methods and show that Cα and Cβ arose by duplication of an ancestral PKA catalytic subunit in a common ancestor of vertebrates. The two genes have subsequently been duplicated in teleost fishes. The evolution of the PRKACG retroposon in simians was also investigated. Although the degree of sequence conservation in the PKA Cα/Cβ kinase family is exceptionally high, a small set of signature residues defining Cα and Cβ subfamilies were identified. These conserved residues might be important for functions that are unique to the Cα or Cβ clades. This study also provides a good example of a seemingly simple phylogenetic problem which, due to a very high degree of sequence conservation and corresponding weak phylogenetic signals, combined with problematic nonphylogenetic signals, is nontrivial for state-of-the-art probabilistic phylogenetic methods.
Collapse
|
29
|
Weel-Sneve R, Kristiansen KI, Odsbu I, Dalhus B, Booth J, Rognes T, Skarstad K, Bjørås M. Single transmembrane peptide DinQ modulates membrane-dependent activities. PLoS Genet 2013; 9:e1003260. [PMID: 23408903 PMCID: PMC3567139 DOI: 10.1371/journal.pgen.1003260] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 12/05/2012] [Indexed: 11/18/2022] Open
Abstract
The functions of several SOS regulated genes in Escherichia coli are still unknown, including dinQ. In this work we characterize dinQ and two small RNAs, agrA and agrB, with antisense complementarity to dinQ. Northern analysis revealed five dinQ transcripts, but only one transcript (+44) is actively translated. The +44 dinQ transcript translates into a toxic single transmembrane peptide localized in the inner membrane. AgrB regulates dinQ RNA by RNA interference to counteract DinQ toxicity. Thus the dinQ-agr locus shows the classical features of a type I TA system and has many similarities to the tisB-istR locus. DinQ overexpression depolarizes the cell membrane and decreases the intracellular ATP concentration, demonstrating that DinQ can modulate membrane-dependent processes. Augmented DinQ strongly inhibits marker transfer by Hfr conjugation, indicating a role in recombination. Furthermore, DinQ affects transformation of nucleoid morphology in response to UV damage. We hypothesize that DinQ is a transmembrane peptide that modulates membrane-dependent activities such as nucleoid compaction and recombination. Exposure of the bacterium Escherichia coli to DNA damaging agents induces the SOS response, which up-regulates gene functions involved in numerous cellular processes such as DNA repair, cell division, and replication. Most of the SOS regulated genes in E. coli have been characterized, but still there are several genes of unknown function. One of these uncharacterized genes is dinQ. In this work we characterize dinQ and two novel small RNAs, agrA and agrB, that regulate expression of dinQ. The DinQ peptide is localized in the inner membrane as a single transmembrane peptide of 27 amino acids. Small proteins of less than 50 amino acids are important in cellular processes such as regulation, signalling, and antibacterial action. Here we demonstrate that DinQ modulates recombination and transformation of nucleoid morphology in response to UV damage. Our results provide new insights into small hydrophobic peptides that could regulate important DNA metabolic processes dependent on the inner membrane of the cell.
Collapse
|
30
|
Nordstrand LM, Furu K, Paulsen J, Rognes T, Klungland A. Alkbh1 and Tzfp repress a non-repeat piRNA cluster in pachytene spermatocytes. Nucleic Acids Res 2012; 40:10950-63. [PMID: 22965116 PMCID: PMC3505970 DOI: 10.1093/nar/gks839] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Piwi proteins and Piwi-interacting small RNAs (piRNAs) have known functions in transposon silencing in the male germline of fetal and newborn mice. Both are also present in adult testes; however, their function here remains a mystery. Here, we confirm that most piRNAs in meiotic spermatocytes originate from clusters in non-repeat intergenic regions of DNA. The regulation of these piRNA clusters, including the processing of the precursor transcripts into individual piRNAs, is accomplished through mostly unknown processes. We present a possible regulatory mechanism for one such cluster, named cluster 1082B, located on chromosome 7 in the mouse genome. The 1082B precursor transcript and its 788 unique piRNAs are repressed by the Alkbh1 dioxygenase and the testis-specific transcription repressor Tzfp. We observe a remarkable >1000-fold upregulation of individual piRNAs in pachytene spermatocytes isolated from Alkbh1- and Tzfp-deficient murine testes. Repression of cluster 1082B is further supported by the identification of a 10-bp Tzfp recognition sequence contained within the precursor transcript. Downregulation of LINE1 and IAP transcripts in the Alkbh1- and Tzfp-deficient mice leads us to propose a potential role for the 1082B-encoded piRNAs in transposon control.
Collapse
|
31
|
Rognes T. Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation. BMC Bioinformatics 2011; 12:221. [PMID: 21631914 PMCID: PMC3120707 DOI: 10.1186/1471-2105-12-221] [Citation(s) in RCA: 136] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2011] [Accepted: 06/01/2011] [Indexed: 11/30/2022] Open
Abstract
Background The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was to explore whether further speed could be gained by other approaches to parallelisation. Results A faster approach and implementation is described and benchmarked. In the new tool SWIPE, residues from sixteen different database sequences are compared in parallel to one query residue. Using a 375 residue query sequence a speed of 106 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon X5650 six-core processor system, which is over six times more rapid than software based on Farrar's 'striped' approach. SWIPE was about 2.5 times faster when the programs used only a single thread. For shorter queries, the increase in speed was larger. SWIPE was about twice as fast as BLAST when using the BLOSUM50 score matrix, while BLAST was about twice as fast as SWIPE for the BLOSUM62 matrix. The software is designed for 64 bit Linux on processors with SSSE3. Source code is available from http://dna.uio.no/swipe/ under the GNU Affero General Public License. Conclusions Efficient parallelisation using SIMD on standard hardware makes it possible to run Smith-Waterman database searches more than six times faster than before. The approach described here could significantly widen the potential application of Smith-Waterman searches. Other applications that require optimal local alignment scores could also benefit from improved performance.
Collapse
|
32
|
Yang M, Aamodt RM, Dalhus B, Balasingham S, Helle I, Andersen P, Tønjum T, Alseth I, Rognes T, Bjørås M. The ada operon of Mycobacterium tuberculosis encodes two DNA methyltransferases for inducible repair of DNA alkylation damage. DNA Repair (Amst) 2011; 10:595-602. [PMID: 21570366 DOI: 10.1016/j.dnarep.2011.03.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Revised: 03/01/2011] [Accepted: 03/15/2011] [Indexed: 11/26/2022]
Abstract
The ada operon of Mycobacterium tuberculosis, which encodes a composite protein of AdaA and AlkA and a separate AdaB/Ogt protein, was characterized. M. tuberculosis treated with N-methyl-N'-nitro-N-nitrosoguanidine induced transcription of the adaA-alkA and adaB genes, suggesting that M. tuberculosis mount an inducible response to methylating agents. Survival assays of the methyltransferase defective Escherichia coli mutant KT233 (ada ogt), showed that expression of the adaB gene rescued the alkylation sensitivity. Further, adaB but not adaA-alkA complemented the hypermutator phenotype of KT233. Purified AdaA-AlkA and AdaB possessed methyltransferase activity. These data suggested that AdaB counteract the cytotoxic and mutagenic effect of O(6)-methylguanine, while AdaA-AlkA most likely transfers methyl groups from innocuous methylphosphotriesters. AdaA-AlkA did not possess alkylbase DNA glycosylase activity nor rescue the alkylation sensitivity of the E. coli mutant BK2118 (tag alkA). We propose that AdaA-AlkA is a positive regulator of the adaptive response in M. tuberculosis. It thus appears that the ada operon of M. tuberculosis suppresses the mutagenic effect of alkylation but not the cytotoxic effect of lesions such as 3-methylpurines. Collectively, these data indicate that M. tuberculosis hypermutator strains with defective adaptive response genes might sustain robustness to cytotoxic alkylation DNA damage and confer a selective advantage contributing to host adaptation.
Collapse
|
33
|
Thomassen GOS, Weel-Sneve R, Rowe AD, Booth JA, Lindvall JM, Lagesen K, Kristiansen KI, Bjørås M, Rognes T. Tiling array analysis of UV treated Escherichia coli predicts novel differentially expressed small peptides. PLoS One 2010; 5:e15356. [PMID: 21203457 PMCID: PMC3009722 DOI: 10.1371/journal.pone.0015356] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2010] [Accepted: 11/09/2010] [Indexed: 11/19/2022] Open
Abstract
Background Despite comprehensive investigation, the Escherichia coli SOS response system is not yet fully understood. We have applied custom designed whole genome tiling arrays to measure UV invoked transcriptional changes in E. coli. This study provides a more complete insight into the transcriptome and the UV irradiation response of this microorganism. Results We detected a number of novel differentially expressed transcripts in addition to the expected SOS response genes (such as sulA, recN, uvrA, lexA, umuC and umuD) in the UV treated cells. Several of the differentially expressed transcripts might play important roles in regulation of the cellular response to UV damage. We have predicted 23 novel small peptides from our set of detected non-gene transcripts. Further, three of the predicted peptides were cloned into protein expression vectors to test the biological activity. All three constructs expressed the predicted peptides, in which two of them were highly toxic to the cell. Additionally, a remarkably high overlap with previously in-silico predicted non-coding RNAs (ncRNAs) was detected. Generally we detected a far higher transcriptional activity than the annotation suggests, and these findings correspond with previous transcription mappings from E. coli and other organisms. Conclusions Here we demonstrate that the E. coli transcriptome consists of far more transcripts than the present annotation suggests, of which many transcripts seem important to the bacterial stress response. Sequence alignment of promoter regions suggest novel regulatory consensus sequences for some of the upregulated genes. Finally, several of the novel transcripts identified in this study encode putative small peptides, which are biologically active.
Collapse
|
34
|
Nordstrand LM, Svärd J, Larsen E, Nilsen A, Ougland R, Furu K, Lien GF, Rognes T, Namekawa SH, Lee JT, Klungland A. Mice lacking Alkbh1 display sex-ratio distortion and unilateral eye defects. PLoS One 2010; 5:e13827. [PMID: 21072209 PMCID: PMC2972218 DOI: 10.1371/journal.pone.0013827] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2010] [Accepted: 10/14/2010] [Indexed: 11/26/2022] Open
Abstract
Background Eschericia coli AlkB is a 2-oxoglutarate- and iron-dependent dioxygenase that reverses alkylated DNA damage by oxidative demethylation. Mouse AlkB homolog 1 (Alkbh1) is one of eight members of the newly discovered family of mammalian dioxygenases. Methods and Findings In the present study we show non-Mendelian inheritance of the Alkbh1 targeted allele in mice. Both Alkbh1−/− and heterozygous Alkbh1+/− offspring are born at a greatly reduced frequency. Additionally, the sex-ratio is considerably skewed against female offspring, with one female born for every three to four males. Most mechanisms that cause segregation distortion, act in the male gametes and affect male fertility. The skewing of the sexes appears to be of paternal origin, and might be set in the pachythene stage of meiosis during spermatogenesis, in which Alkbh1 is upregulated more than 10-fold. In testes, apoptotic spermatids were revealed in 5–10% of the tubules in Alkbh1−/− adults. The deficiency of Alkbh1 also causes misexpression of Bmp2, 4 and 7 at E11.5 during embryonic development. This is consistent with the incompletely penetrant phenotypes observed, particularly recurrent unilateral eye defects and craniofacial malformations. Conclusions Genetic and phenotypic assessment suggests that Alkbh1 mediates gene regulation in spermatogenesis, and that Alkbh1 is essential for normal sex-ratio distribution and embryonic development in mice.
Collapse
|
35
|
Thomassen GOS, Røsok Ø, Rognes T. Computational prediction of microRNAs encoded in viral and other genomes. J Biomed Biotechnol 2010; 2006:95270. [PMID: 17057374 PMCID: PMC1559940 DOI: 10.1155/jbb/2006/95270] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
We present an overview of selected computational methods for microRNA prediction. It is especially aimed at viral miRNA detection. As the number of microRNAs increases and the range of genomes encoding miRNAs expands, it seems that these small regulators have a more important role than has been previously thought. Most microRNAs have been detected by cloning and Northern blotting, but experimental methods are biased towards abundant microRNAs as well as being time-consuming. Computational detection methods must therefore be refined to serve as a faster, better, and more affordable method for microRNA detection. We also present data from a small study investigating the problems of computational miRNA prediction. Our findings suggest that the prediction of microRNA precursor candidates is fairly easy, while excluding false positives as well as exact prediction of the mature microRNA is hard. Finally, we discuss possible improvements to computational microRNA detection.
Collapse
|
36
|
Fensgård Ø, Kassahun H, Bombik I, Rognes T, Lindvall JM, Nilsen H. A two-tiered compensatory response to loss of DNA repair modulates aging and stress response pathways. Aging (Albany NY) 2010; 2:133-59. [PMID: 20382984 PMCID: PMC2871243 DOI: 10.18632/aging.100127] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2010] [Accepted: 03/27/2010] [Indexed: 05/29/2023]
Abstract
Activation of oxidative stress-responses and downregulation of insulin-like signaling (ILS) is seen in Nucleotide Excision Repair (NER) deficient segmental progeroid mice. Evidence suggests that this is a survival response to persistent transcription-blocking DNA damage, although the relevant lesions have not been identified. Here we show that loss of NTH-1, the only Base Excision Repair (BER) enzyme known to initiate repair of oxidative DNA damage inC. elegans, restores normal lifespan of the short-lived NER deficient xpa-1 mutant. Loss of NTH-1 leads to oxidative stress and global expression profile changes that involve upregulation of genes responding to endogenous stress and downregulation of ILS. A similar, but more extensive, transcriptomic shift is observed in the xpa-1 mutant whereas loss of both NTH-1 and XPA-1 elicits a different profile with downregulation of Aurora-B and Polo-like kinase 1 signaling networks as well as DNA repair and DNA damage response genes. The restoration of normal lifespan and absence oxidative stress responses in nth-1;xpa-1 indicate that BER contributes to generate transcription blocking lesions from oxidative DNA damage. Hence, our data strongly suggests that the DNA lesions relevant for aging are repair intermediates resulting from aberrant or attempted processing by BER of lesions normally repaired by NER.
Collapse
|
37
|
Nakken S, Rognes T, Hovig E. The disruptive positions in human G-quadruplex motifs are less polymorphic and more conserved than their neutral counterparts. Nucleic Acids Res 2009; 37:5749-56. [PMID: 19617376 PMCID: PMC2761265 DOI: 10.1093/nar/gkp590] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Specific guanine-rich sequence motifs in the human genome have considerable potential to form four-stranded structures known as G-quadruplexes or G4 DNA. The enrichment of these motifs in key chromosomal regions has suggested a functional role for the G-quadruplex structure in genomic regulation. In this work, we have examined the spectrum of nucleotide substitutions in G4 motifs, and related this spectrum to G4 prevalence. Data collected from the large repository of human SNPs indicates that the core feature of G-quadruplex motifs, 5′-GGG-3′, exhibits specific mutational patterns that preserve the potential for G4 formation. In particular, we find a genome-wide pattern in which sites that disrupt the guanine triplets are more conserved and less polymorphic than their neutral counterparts. This also holds when considering non-CpG sites only. However, the low level of polymorphisms in guanine tracts is not only confined to G4 motifs. A complete mapping of DNA three-mers at guanine polymorphisms indicated that short guanine tracts are the most under-represented sequence context at polymorphic sites. Furthermore, we provide evidence for a strand bias upstream of human genes. Here, a significantly lower rate of G4-disruptive SNPs on the non-template strand supports a higher relative influence of G4 formation on this strand during transcription.
Collapse
|
38
|
Thomassen GOS, Rowe AD, Lagesen K, Lindvall JM, Rognes T. Custom design and analysis of high-density oligonucleotide bacterial tiling microarrays. PLoS One 2009; 4:e5943. [PMID: 19536279 PMCID: PMC2691959 DOI: 10.1371/journal.pone.0005943] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Accepted: 05/18/2009] [Indexed: 11/21/2022] Open
Abstract
Background High-density tiling microarrays are a powerful tool for the characterization of complete genomes. The two major computational challenges associated with custom-made arrays are design and analysis. Firstly, several genome dependent variables, such as the genome's complexity and sequence composition, need to be considered in the design to ensure a high quality microarray. Secondly, since tiling projects today very often exceed the limits of conventional array-experiments, researchers cannot use established computer tools designed for commercial arrays, and instead have to redesign previous methods or create novel tools. Principal Findings Here we describe the multiple aspects involved in the design of tiling arrays for transcriptome analysis and detail the normalisation and analysis procedures for such microarrays. We introduce a novel design method to make two 280,000 feature microarrays covering the entire genome of the bacterial species Escherichia coli and Neisseria meningitidis, respectively, as well as the use of multiple copies of control probe-sets on tiling microarrays. Furthermore, a novel normalisation and background estimation procedure for tiling arrays is presented along with a method for array analysis focused on detection of short transcripts. The design, normalisation and analysis methods have been applied in various experiments and several of the detected novel short transcripts have been biologically confirmed by Northern blot tests. Conclusions Tiling-arrays are becoming increasingly applicable in genomic research, but researchers still lack both the tools for custom design of arrays, as well as the systems and procedures for analysis of the vast amount of data resulting from such experiments. We believe that the methods described herein will be a useful contribution and resource for researchers designing and analysing custom tiling arrays for both bacteria and higher organisms.
Collapse
|
39
|
Ambur OH, Davidsen T, Frye SA, Balasingham SV, Lagesen K, Rognes T, Tønjum T. Genome dynamics in major bacterial pathogens. FEMS Microbiol Rev 2009; 33:453-70. [PMID: 19396949 PMCID: PMC2734928 DOI: 10.1111/j.1574-6976.2009.00173.x] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Pathogenic bacteria continuously encounter multiple forms of stress in their hostile environments, which leads to DNA damage. With the new insight into biology offered by genome sequences, the elucidation of the gene content encoding proteins provides clues toward understanding the microbial lifestyle related to habitat and niche. Campylobacter jejuni, Haemophilus influenzae, Helicobacter pylori, Mycobacterium tuberculosis, the pathogenic Neisseria, Streptococcus pneumoniae, Streptococcus pyogenes and Staphylococcus aureus are major human pathogens causing detrimental morbidity and mortality at a global scale. An algorithm for the clustering of orthologs was established in order to identify whether orthologs of selected genes were present or absent in the genomes of the pathogenic bacteria under study. Based on the known genes for the various functions and their orthologs in selected pathogenic bacteria, an overview of the presence of the different types of genes was created. In this context, we focus on selected processes enabling genome dynamics in these particular pathogens, namely DNA repair, recombination and horizontal gene transfer. An understanding of the precise molecular functions of the enzymes participating in DNA metabolism and their importance in the maintenance of bacterial genome integrity has also, in recent years, indicated a future role for these enzymes as targets for therapeutic intervention.
Collapse
|
40
|
Robertson AB, Klungland A, Rognes T, Leiros I. DNA repair in mammalian cells: Base excision repair: the long and short of it. Cell Mol Life Sci 2009; 66:981-93. [PMID: 19153658 PMCID: PMC11131461 DOI: 10.1007/s00018-009-8736-z] [Citation(s) in RCA: 406] [Impact Index Per Article: 27.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Base excision repair (BER) is the primary DNA repair pathway that corrects base lesions that arise due to oxidative, alkylation, deamination, and depurinatiation/depyrimidination damage. BER facilitates the repair of damaged DNA via two general pathways - short-patch and long-patch. The shortpatch BER pathway leads to a repair tract of a single nucleotide. Alternatively, the long-patch BER pathway produces a repair tract of at least two nucleotides. The BER pathway is initiated by one of many DNA glycosylases, which recognize and catalyze the removal of damaged bases. The completion of the BER pathway is accomplished by the coordinated action of at least three additional enzymes. These downstream enzymes carry out strand incision, gap-filling and ligation. The high degree of BER conservation between E. coli and mammals has lead to advances in our understanding of mammalian BER. This review will provide a general overview of the mammalian BER pathway. (Part of a Multi-author Review).
Collapse
|
41
|
Nakken S, Rødland EA, Rognes T, Hovig E. Large-scale inference of the point mutational spectrum in human segmental duplications. BMC Genomics 2009; 10:43. [PMID: 19161616 PMCID: PMC2640414 DOI: 10.1186/1471-2164-10-43] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 01/22/2009] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Recent segmental duplications are relatively large (> or = 1 kb) genomic regions of high sequence identity (> or = 90%). They cover approximately 4-5% of the human genome and play important roles in gene evolution and genomic disease. The DNA sequence differences between copies of a segmental duplication represent the result of various mutational events over time, since any two duplication copies originated from the same ancestral DNA sequence. Based on this fact, we have developed a computational scheme for inference of point mutational events in human segmental duplications, which we collectively term duplication-inferred mutations (DIMs). We have characterized these nucleotide substitutions by comparing them with high-quality SNPs from dbSNP, both in terms of sequence context and frequency of substitution types. RESULTS Overall, DIMs show a lower ratio of transitions relative to transversions than SNPs, although this ratio approaches that of SNPs when considering DIMs within most recent duplications. Our findings indicate that DIMs and SNPs in general are caused by similar mutational mechanisms, with some deviances at the CpG dinucleotide. Furthermore, we discover a large number of reference SNPs that coincide with computationally inferred DIMs. The latter reflects how sequence variation in duplicated sequences can be misinterpreted as ordinary allelic variation. CONCLUSION In summary, we show how DNA sequence analysis of segmental duplications can provide a genome-wide mutational spectrum that mirrors recent genome evolution. The inferred set of nucleotide substitutions represents a valuable complement to SNPs for the analysis of genetic variation and point mutagenesis.
Collapse
|
42
|
Skotheim RI, Thomassen GOS, Eken M, Lind GE, Micci F, Ribeiro FR, Cerveira N, Teixeira MR, Heim S, Rognes T, Lothe RA. A universal assay for detection of oncogenic fusion transcripts by oligo microarray analysis. Mol Cancer 2009; 8:5. [PMID: 19152679 PMCID: PMC2633275 DOI: 10.1186/1476-4598-8-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2008] [Accepted: 01/19/2009] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND The ability to detect neoplasia-specific fusion genes is important not only in cancer research, but also increasingly in clinical settings to ensure that correct diagnosis is made and the optimal treatment is chosen. However, the available methodologies to detect such fusions all have their distinct short-comings. RESULTS We describe a novel oligonucleotide microarray strategy whereby one can screen for all known oncogenic fusion transcripts in a single experiment. To accomplish this, we combine measurements of chimeric transcript junctions with exon-wise measurements of individual fusion partners. To demonstrate the usefulness of the approach, we designed a DNA microarray containing 68,861 oligonucleotide probes that includes oligos covering all combinations of chimeric exon-exon junctions from 275 pairs of fusion genes, as well as sets of oligos internal to all the exons of the fusion partners. Using this array, proof of principle was demonstrated by detection of known fusion genes (such as TCF3:PBX1, ETV6:RUNX1, and TMPRSS2:ERG) from all six positive controls consisting of leukemia cell lines and prostate cancer biopsies. CONCLUSION This new method bears promise of an important complement to currently used diagnostic and research tools for the detection of fusion genes in neoplastic diseases.
Collapse
|
43
|
Røyrvik EC, Ahlquist T, Rognes T, Lothe RA. Slip slidin' away: a duodecennial review of targeted genes in mismatch repair deficient colorectal cancer. Crit Rev Oncog 2008; 13:229-57. [PMID: 18298386 DOI: 10.1615/critrevoncog.v13.i3.20] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Roughly 15% of colorectal tumors are characterized by microsatellite instability (MSI), a deficiency caused by defective DNA mismatch repair, which leads to profuse insertions and deletions in microsatellites. Downstream target genes of this defective repair are those prone to exhibit these insertion/deletion mutations in their coding regions and potentially having functional consequences in, and providing a growth advantage for, the cancer cell. This review presents the last 12 years of research on these MSI target genes, systematizing the mutation details of the more than 160 genes identified to date, and includes their mutation frequencies in colorectal and other MSI (e.g., gastric and endometrial) tumors. Functional aspects of certain targets and the target gene concept itself are also discussed, as is the comparative wealth of potential target genes assessed by scanning the coding sequences of the human genome for mononucleotide repeats--yet to be investigated.
Collapse
|
44
|
Cameron J, Holla OL, Laerdahl JK, Kulseth MA, Ranheim T, Rognes T, Berge KE, Leren TP. Characterization of novel mutations in the catalytic domain of the PCSK9 gene. J Intern Med 2008; 263:420-31. [PMID: 18266662 DOI: 10.1111/j.1365-2796.2007.01915.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
OBJECTIVES To expand our understanding of the structure and function of proprotein convertase subtilisin/kexin type 9 (PCSK9) by studying how naturally occurring mutations in PCSK9 disrupt the function of PCSK9. DESIGN Mutations in PCSK9 were identified by sequencing of DNA from subjects with hypo- or hypercholesterolemia. The effect of the identified mutations on the autocatalytic cleavage and secretion of PCSK9, as well as the effect on PCSK9-mediated degradation of the low density lipoprotein receptors, were determined in HepG2 or HEK293 cells transiently transfected with mutant PCSK9-containing plasmids. The findings were collated to the clinical characteristics of the subjects possessing these mutations, and the phenotypic effects were analysed in terms of available structural data for PCSK9. RESULTS Five novel mutations in PCSK9 were identified. Mutation R215H was a gain-of-function mutation which causes hypercholesterolemia. Mutation G236S and N354I were loss-of-function mutations due to failure to exit the endoplasmic reticulum or failure to undergo autocatalytic cleavage, respectively. Mutations A245T and R272Q were most likely normal genetic variants. By comparing the number of patients with gain-of-function mutations in PCSK9 with the number of familial hypercholesterolemia heterozygotes among subjects with hypercholesterolemia, the prevalence of subjects with gain-of-function mutations in PCSK9 in Norway can be estimated to one in 15,000. CONCLUSION This study has provided novel information about the structural requirements for the normal function of PCSK9. However, more studies are needed to determine the mechanisms by which gain-of-function mutations in PCSK9 cause hypercholesterolemia.
Collapse
|
45
|
Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 2007; 35:3100-8. [PMID: 17452365 PMCID: PMC1888812 DOI: 10.1093/nar/gkm160] [Citation(s) in RCA: 4628] [Impact Index Per Article: 272.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The publication of a complete genome sequence is usually accompanied by annotations of its genes. In contrast to protein coding genes, genes for ribosomal RNA (rRNA) are often poorly or inconsistently annotated. This makes comparative studies based on rRNA genes difficult. We have therefore created computational predictors for the major rRNA species from all kingdoms of life and compiled them into a program called RNAmmer. The program uses hidden Markov models trained on data from the 5S ribosomal RNA database and the European ribosomal RNA database project. A pre-screening step makes the method fast with little loss of sensitivity, enabling the analysis of a complete bacterial genome in less than a minute. Results from running RNAmmer on a large set of genomes indicate that the location of rRNAs can be predicted with a very high level of accuracy. Novel, unannotated rRNAs are also predicted in many genomes. The software as well as the genome analysis results are available at the CBS web server.
Collapse
|
46
|
Dalhus B, Helle IH, Backe PH, Alseth I, Rognes T, Bjørås M, Laerdahl JK. Structural insight into repair of alkylated DNA by a new superfamily of DNA glycosylases comprising HEAT-like repeats. Nucleic Acids Res 2007; 35:2451-9. [PMID: 17395642 PMCID: PMC1874660 DOI: 10.1093/nar/gkm039] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
3-methyladenine DNA glycosylases initiate repair of cytotoxic and promutagenic alkylated bases in DNA. We demonstrate by comparative modelling that Bacillus cereus AlkD belongs to a new, fifth, structural superfamily of DNA glycosylases with an alpha–alpha superhelix fold comprising six HEAT-like repeats. The structure reveals a wide, positively charged groove, including a putative base recognition pocket. This groove appears to be suitable for the accommodation of double-stranded DNA with a flipped-out alkylated base. Site-specific mutagenesis within the recognition pocket identified several residues essential for enzyme activity. The results suggest that the aromatic side chain of a tryptophan residue recognizes electron-deficient alkylated bases through stacking interactions, while an interacting aspartate–arginine pair is essential for removal of the damaged base. A structural model of AlkD bound to DNA with a flipped-out purine moiety gives insight into the catalytic machinery for this new class of DNA glycosylases.
Collapse
|
47
|
Nakken S, Alseth I, Rognes T. Computational prediction of the effects of non-synonymous single nucleotide polymorphisms in human DNA repair genes. Neuroscience 2006; 145:1273-9. [PMID: 17055652 DOI: 10.1016/j.neuroscience.2006.09.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2006] [Revised: 09/08/2006] [Accepted: 09/12/2006] [Indexed: 10/24/2022]
Abstract
Non-synonymous single nucleotide polymorphisms (nsSNPs) represent common genetic variation that alters encoded amino acids in proteins. All nsSNPs may potentially affect the structure or function of expressed proteins and could therefore have an impact on complex diseases. In an effort to evaluate the phenotypic effect of all known nsSNPs in human DNA repair genes, we have characterized each polymorphism in terms of different functional properties. The properties are computed based on amino acid characteristics (e.g. residue volume change); position-specific phylogenetic information from multiple sequence alignments and from prediction programs such as SIFT (Sorting Intolerant From Tolerant) and PolyPhen (Polymorphism Phenotyping). We provide a comprehensive, updated list of all validated nsSNPs from dbSNP (public database of human single nucleotide polymorphisms at National Center for Biotechnology Information, USA) located in human DNA repair genes. The list includes repair enzymes, genes associated with response to DNA damage as well as genes implicated with genetic instability or sensitivity to DNA damaging agents. Out of a total of 152 genes involved in DNA repair, 95 had validated nsSNPs in them. The fraction of nsSNPs that had high probability of being functionally significant was predicted to be 29.6% and 30.9%, by SIFT and PolyPhen respectively. The resulting list of annotated nsSNPs is available online (http://dna.uio.no/repairSNP), and is an ongoing project that will continue assessing the function of coding SNPs in human DNA repair genes.
Collapse
|
48
|
Alseth I, Rognes T, Lindbäck T, Solberg I, Robertsen K, Kristiansen KI, Mainieri D, Lillehagen L, Kolstø AB, Bjørås M. A new protein superfamily includes two novel 3-methyladenine DNA glycosylases from Bacillus cereus, AlkC and AlkD. Mol Microbiol 2006; 59:1602-9. [PMID: 16468998 PMCID: PMC1413580 DOI: 10.1111/j.1365-2958.2006.05044.x] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Soil bacteria are heavily exposed to environmental methylating agents such as methylchloride and may have special requirements for repair of alkylation damage on DNA. We have used functional complementation of an Escherichia coli tag alkA mutant to screen for 3-methyladenine DNA glycosylase genes in genomic libraries of the soil bacterium Bacillus cereus. Three genes were recovered: alkC, alkD and alkE. The amino acid sequence of AlkE is homologous to the E. coli AlkA sequence. AlkC and AlkD represent novel proteins without sequence similarity to any protein of known function. However, iterative and indirect sequence similarity searches revealed that AlkC and AlkD are distant homologues of each other within a new protein superfamily that is ubiquitous in the prokaryotic kingdom. Homologues of AlkC and AlkD were also identified in the amoebas Entamoeba histolytica and Dictyostelium discoideum, but no other eukaryotic counterparts of the superfamily were found. The alkC and alkD genes were expressed in E. coli and the proteins were purified to homogeneity. Both proteins were found to be specific for removal of N-alkylated bases, and showed no activity on oxidized or deaminated base lesions in DNA. B. cereus AlkC and AlkD thus define novel families of alkylbase DNA glycosylases within a new protein superfamily.
Collapse
|
49
|
Sæbø PE, Andersen SM, Myrseth J, Laerdahl JK, Rognes T. PARALIGN: rapid and sensitive sequence similarity searches powered by parallel computing technology. Nucleic Acids Res 2005; 33:W535-9. [PMID: 15980529 PMCID: PMC1160184 DOI: 10.1093/nar/gki423] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
PARALIGN is a rapid and sensitive similarity search tool for the identification of distantly related sequences in both nucleotide and amino acid sequence databases. Two algorithms are implemented, accelerated Smith-Waterman and ParAlign. The ParAlign algorithm is similar to Smith-Waterman in sensitivity, while as quick as BLAST for protein searches. A form of parallel computing technology known as multimedia technology that is available in modern processors, but rarely used by other bioinformatics software, has been exploited to achieve the high speed. The software is also designed to run efficiently on computer clusters using the message-passing interface standard. A public search service powered by a large computer cluster has been set-up and is freely available at www.paralign.org, where the major public databases can be searched. The software can also be downloaded free of charge for academic use.
Collapse
|
50
|
Saetrom P, Sneve R, Kristiansen KI, Snøve O, Grünfeld T, Rognes T, Seeberg E. Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming. Nucleic Acids Res 2005; 33:3263-70. [PMID: 15942029 PMCID: PMC1143698 DOI: 10.1093/nar/gki644] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Several methods exist for predicting non-coding RNA (ncRNA) genes in Escherichia coli (E.coli). In addition to about sixty known ncRNA genes excluding tRNAs and rRNAs, various methods have predicted more than thousand ncRNA genes, but only 95 of these candidates were confirmed by more than one study. Here, we introduce a new method that uses automatic discovery of sequence patterns to predict ncRNA genes. The method predicts 135 novel candidates. In addition, the method predicts 152 genes that overlap with predictions in the literature. We test sixteen predictions experimentally, and show that twelve of these are actual ncRNA transcripts. Six of the twelve verified candidates were novel predictions. The relatively high confirmation rate indicates that many of the untested novel predictions are also ncRNAs, and we therefore speculate that E.coli contains more ncRNA genes than previously estimated.
Collapse
|