1
|
Santos MM, Johnson MC, Fiedler L, Zegerman P. Global early replication disrupts gene expression and chromatin conformation in a single cell cycle. Genome Biol 2022; 23:217. [PMID: 36253803 PMCID: PMC9575230 DOI: 10.1186/s13059-022-02788-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 10/10/2022] [Indexed: 12/03/2022] Open
Abstract
Background The early embryonic divisions of many organisms, including fish, flies, and frogs, are characterized by a very rapid S-phase caused by high rates of replication initiation. In somatic cells, S-phase is much longer due to both a reduction in the total number of initiation events and the imposition of a temporal order of origin activation. The physiological importance of changes in the rate and timing of replication initiation in S-phase remains unclear. Results Here we assess the importance of the temporal control of replication initiation using a conditional system in budding yeast to drive the early replication of the majority of origins in a single cell cycle. We show that global early replication disrupts the expression of over a quarter of all genes. By deleting individual origins, we show that delaying replication is sufficient to restore normal gene expression, directly implicating origin firing control in this regulation. Global early replication disrupts nucleosome positioning and transcription factor binding during S-phase, suggesting that the rate of S-phase is important to regulate the chromatin landscape. Conclusions Together, these data provide new insight into the role of the temporal control of origin firing during S-phase for coordinating replication, gene expression, and chromatin establishment as occurs in the early embryo. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02788-7.
Collapse
Affiliation(s)
- Miguel M Santos
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK.,Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK
| | - Mark C Johnson
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK.,Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK
| | - Lukáš Fiedler
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK.,Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK
| | - Philip Zegerman
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK. .,Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK.
| |
Collapse
|
2
|
Abstract
Budding yeast harbors a simple point centromere, which is originally believed to be sequence dependent without much epigenetic regulation and is transcription incompatible, as inserting a strong promoter upstream inactivates the centromere completely. Here, we demonstrate that an optimal level centromeric noncoding RNA is required for budding yeast centromere activity. Centromeric transcription is induced in S phase, coinciding with the assembly of new centromeric proteins. Too much or too little centromeric noncoding RNA leads to centromere malfunction. Overexpression of centromeric noncoding RNA reduces the protein levels and chromatin localization of inner centromere and kinetochore proteins, such as CENP-A, CENP-C, and the chromosome passenger complex. This work shows that point centromere is epigenetically regulated by noncoding RNA. In budding yeast, which possesses simple point centromeres, we discovered that all of its centromeres express long noncoding RNAs (cenRNAs), especially in S phase. Induction of cenRNAs coincides with CENP-ACse4 loading time and is dependent on DNA replication. Centromeric transcription is repressed by centromere-binding factor Cbf1 and histone H2A variant H2A.ZHtz1. Deletion of CBF1 and H2A.ZHTZ1 results in an up-regulation of cenRNAs; an increased loss of a minichromosome; elevated aneuploidy; a down-regulation of the protein levels of centromeric proteins CENP-ACse4, CENP-A chaperone HJURPScm3, CENP-CMif2, SurvivinBir1, and INCENPSli15; and a reduced chromatin localization of CENP-ACse4, CENP-CMif2, and Aurora BIpl1. When the RNA interference system was introduced to knock down all cenRNAs from the endogenous chromosomes, but not the cenRNA from the circular minichromosome, an increase in minichromosome loss was still observed, suggesting that cenRNA functions in trans to regulate centromere activity. CenRNA knockdown partially alleviates minichromosome loss in cbf1Δ, htz1Δ, and cbf1Δ htz1Δ in a dose-dependent manner, demonstrating that cenRNA level is tightly regulated to epigenetically control point centromere function.
Collapse
|
3
|
Pietrzyk Ł. Food properties and dietary habits in colorectal cancer prevention and development. INTERNATIONAL JOURNAL OF FOOD PROPERTIES 2017. [DOI: 10.1080/10942912.2016.1236813] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Łukasz Pietrzyk
- Department of Didactics and Medical Simulation, Chair of Human Anatomy, Medical University of Lublin, Lublin, Poland
- Department of General, Oncological and Minimally Invasive Surgery, 1st Military Clinical Hospital in Lublin, Lublin, Poland
| |
Collapse
|
4
|
Müller CA, Nieduszynski CA. DNA replication timing influences gene expression level. J Cell Biol 2017; 216:1907-1914. [PMID: 28539386 PMCID: PMC5496624 DOI: 10.1083/jcb.201701061] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Revised: 04/06/2017] [Accepted: 04/19/2017] [Indexed: 12/31/2022] Open
Abstract
Eukaryotic genomes are replicated in a reproducible temporal order whose physiological significance is poorly understood. Müller and Nieduszynski compare the temporal order of genome replication in phylogenetically diverse yeast species and identify genes for which conserved replication timing contributes to maximal expression. Eukaryotic genomes are replicated in a reproducible temporal order; however, the physiological significance is poorly understood. We compared replication timing in divergent yeast species and identified genomic features with conserved replication times. Histone genes were among the earliest replicating loci in all species. We specifically delayed the replication of HTA1-HTB1 and discovered that this halved the expression of these histone genes. Finally, we showed that histone and cell cycle genes in general are exempt from Rtt109-dependent dosage compensation, suggesting the existence of pathways excluding specific loci from dosage compensation mechanisms. Thus, we have uncovered one of the first physiological requirements for regulated replication time and demonstrated a direct link between replication timing and gene expression.
Collapse
Affiliation(s)
- Carolin A Müller
- Sir William Dunn School of Pathology, University of Oxford, Oxford, England, UK
| | | |
Collapse
|
5
|
Luo Y, Wang F, Szolovits P. Tensor factorization toward precision medicine. Brief Bioinform 2017; 18:511-514. [PMID: 26994614 PMCID: PMC6078180 DOI: 10.1093/bib/bbw026] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 01/08/2016] [Indexed: 11/13/2022] Open
Abstract
Precision medicine initiatives come amid the rapid growth in quantity and variety of biomedical data, which exceeds the capacity of matrix-oriented data representations and many current analysis algorithms. Tensor factorizations extend the matrix view to multiple modalities and support dimensionality reduction methods that identify latent groups of data for meaningful summarization of both features and instances. In this opinion article, we analyze the modest literature on applying tensor factorization to various biomedical fields including genotyping and phenotyping. Based on the cited work including work of our own, we suggest that tensor applications could serve as an effective tool to enable frequent updating of medical knowledge based on the continually growing scientific and clinical evidence. We encourage extensive experimental studies to tackle challenges including design choice of factorizations, integrating temporality and algorithm scalability.
Collapse
|
6
|
Aiello KA, Alter O. Platform-Independent Genome-Wide Pattern of DNA Copy-Number Alterations Predicting Astrocytoma Survival and Response to Treatment Revealed by the GSVD Formulated as a Comparative Spectral Decomposition. PLoS One 2016; 11:e0164546. [PMID: 27798635 PMCID: PMC5087864 DOI: 10.1371/journal.pone.0164546] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Accepted: 09/27/2016] [Indexed: 01/07/2023] Open
Abstract
We use the generalized singular value decomposition (GSVD), formulated as a comparative spectral decomposition, to model patient-matched grades III and II, i.e., lower-grade astrocytoma (LGA) brain tumor and normal DNA copy-number profiles. A genome-wide tumor-exclusive pattern of DNA copy-number alterations (CNAs) is revealed, encompassed in that previously uncovered in glioblastoma (GBM), i.e., grade IV astrocytoma, where GBM-specific CNAs encode for enhanced opportunities for transformation and proliferation via growth and developmental signaling pathways in GBM relative to LGA. The GSVD separates the LGA pattern from other sources of biological and experimental variation, common to both, or exclusive to one of the tumor and normal datasets. We find, first, and computationally validate, that the LGA pattern is correlated with a patient's survival and response to treatment. Second, the GBM pattern identifies among the LGA patients a subtype, statistically indistinguishable from that among the GBM patients, where the CNA genotype is correlated with an approximately one-year survival phenotype. Third, cross-platform classification of the Affymetrix-measured LGA and GBM profiles by using the Agilent-derived GBM pattern shows that the GBM pattern is a platform-independent predictor of astrocytoma outcome. Statistically, the pattern is a better predictor (corresponding to greater median survival time difference, proportional hazard ratio, and concordance index) than the patient's age and the tumor's grade, which are the best indicators of astrocytoma currently in clinical use, and laboratory tests. The pattern is also statistically independent of these indicators, and, combined with either one, is an even better predictor of astrocytoma outcome. Recurring DNA CNAs have been observed in astrocytoma tumors' genomes for decades, however, copy-number subtypes that are predictive of patients' outcomes were not identified before. This is despite the growing number of datasets recording different aspects of the disease, and due to an existing fundamental need for mathematical frameworks that can simultaneously find similarities and dissimilarities across the datasets. This illustrates the ability of comparative spectral decompositions to find what other methods miss.
Collapse
Affiliation(s)
- Katherine A. Aiello
- Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Orly Alter
- Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, United States of America
- * E-mail:
| |
Collapse
|
7
|
Chitforoushzadeh Z, Ye Z, Sheng Z, LaRue S, Fry RC, Lauffenburger DA, Janes KA. TNF-insulin crosstalk at the transcription factor GATA6 is revealed by a model that links signaling and transcriptomic data tensors. Sci Signal 2016; 9:ra59. [PMID: 27273097 DOI: 10.1126/scisignal.aad3373] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Signal transduction networks coordinate transcriptional programs activated by diverse extracellular stimuli, such as growth factors and cytokines. Cells receive multiple stimuli simultaneously, and mapping how activation of the integrated signaling network affects gene expression is a challenge. We stimulated colon adenocarcinoma cells with various combinations of the cytokine tumor necrosis factor (TNF) and the growth factors insulin and epidermal growth factor (EGF) to investigate signal integration and transcriptional crosstalk. We quantitatively linked the proteomic and transcriptomic data sets by implementing a structured computational approach called tensor partial least squares regression. This statistical model accurately predicted transcriptional signatures from signaling arising from single and combined stimuli and also predicted time-dependent contributions of signaling events. Specifically, the model predicted that an early-phase, AKT-associated signal downstream of insulin repressed a set of transcripts induced by TNF. Through bioinformatics and cell-based experiments, we identified the AKT-repressed signal as glycogen synthase kinase 3 (GSK3)-catalyzed phosphorylation of Ser(37) on the long form of the transcription factor GATA6. Phosphorylation of GATA6 on Ser(37) promoted its degradation, thereby preventing GATA6 from repressing transcripts that are induced by TNF and attenuated by insulin. Our analysis showed that predictive tensor modeling of proteomic and transcriptomic data sets can uncover pathway crosstalk that produces specific patterns of gene expression in cells receiving multiple stimuli.
Collapse
Affiliation(s)
- Zeinab Chitforoushzadeh
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA. Department of Pharmacology, University of Virginia, Charlottesville, VA 22908, USA
| | - Zi Ye
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Ziran Sheng
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Silvia LaRue
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Rebecca C Fry
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Kevin A Janes
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.
| |
Collapse
|
8
|
Descorps-Declère S, Saguez C, Cournac A, Marbouty M, Rolland T, Ma L, Bouchier C, Moszer I, Dujon B, Koszul R, Richard GF. Genome-wide replication landscape of Candida glabrata. BMC Biol 2015; 13:69. [PMID: 26329162 PMCID: PMC4556013 DOI: 10.1186/s12915-015-0177-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 08/05/2015] [Indexed: 11/25/2022] Open
Abstract
Background The opportunistic pathogen Candida glabrata is a member of the Saccharomycetaceae yeasts. Like its close relative Saccharomyces cerevisiae, it underwent a whole-genome duplication followed by an extensive loss of genes. Its genome contains a large number of very long tandem repeats, called megasatellites. In order to determine the whole replication program of the C. glabrata genome and its general chromosomal organization, we used deep-sequencing and chromosome conformation capture experiments. Results We identified 253 replication fork origins, genome wide. Centromeres, HML and HMR loci, and most histone genes are replicated early, whereas natural chromosomal breakpoints are located in late-replicating regions. In addition, 275 autonomously replicating sequences (ARS) were identified during ARS-capture experiments, and their relative fitness was determined during growth competition. Analysis of ARSs allowed us to identify a 17-bp consensus, similar to the S. cerevisiae ARS consensus sequence but slightly more constrained. Megasatellites are not in close proximity to replication origins or termini. Using chromosome conformation capture, we also show that early origins tend to cluster whereas non-subtelomeric megasatellites do not cluster in the yeast nucleus. Conclusions Despite a shorter cell cycle, the C. glabrata replication program shares unexpected striking similarities to S. cerevisiae, in spite of their large evolutionary distance and the presence of highly repetitive large tandem repeats in C. glabrata. No correlation could be found between the replication program and megasatellites, suggesting that their formation and propagation might not be directly caused by replication fork initiation or termination. Electronic supplementary material The online version of this article (doi:10.1186/s12915-015-0177-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Stéphane Descorps-Declère
- Institut Pasteur, Center of Bioinformatics, Biostatistics and Integrative Biology (C3BI), F-75015, Paris, France.
| | - Cyril Saguez
- Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.
| | - Axel Cournac
- CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
| | - Martial Marbouty
- CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
| | - Thomas Rolland
- Present address: Institut Pasteur, Unité de Génétique Humaine et Fonctions Cognitives, Département des Neurosciences, F-75015, Paris, France.
| | - Laurence Ma
- Institut Pasteur, Plate-forme Génomique, Département Génomes & Génétique, F-75015, Paris, France.
| | - Christiane Bouchier
- Institut Pasteur, Plate-forme Génomique, Département Génomes & Génétique, F-75015, Paris, France.
| | - Ivan Moszer
- Present address: Plate-forme Bio-informatique/Biostatistique, Institut de Neurosciences Translationnelles IHU-A-ICM, Hôpital Pitié-Salpêtrière, 47-83 bd de l'Hôpital, 75561, Paris, Cedex 13, France.
| | - Bernard Dujon
- Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.
| | - Romain Koszul
- CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
| | - Guy-Franck Richard
- Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.
| |
Collapse
|
9
|
Luo Y, Xin Y, Hochberg E, Joshi R, Uzuner O, Szolovits P. Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text. J Am Med Inform Assoc 2015; 22:1009-19. [PMID: 25862765 PMCID: PMC4986663 DOI: 10.1093/jamia/ocv016] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Revised: 01/18/2015] [Accepted: 02/16/2015] [Indexed: 02/04/2023] Open
Abstract
OBJECTIVE Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxes. Moreover, training data for these automated approaches at often sparsely annotated at best. The authors target unsupervised learning for modeling clinical narrative text, aiming at improving both accuracy and interpretability. METHODS The authors introduce a novel framework named subgraph augmented non-negative tensor factorization (SANTF). In addition to relying on atomic features (e.g., words in clinical narrative text), SANTF automatically mines higher-order features (e.g., relations of lymphoid cells expressing antigens) from clinical narrative text by converting sentences into a graph representation and identifying important subgraphs. The authors compose a tensor using patients, higher-order features, and atomic features as its respective modes. We then apply non-negative tensor factorization to cluster patients, and simultaneously identify latent groups of higher-order features that link to patient clusters, as in clinical guidelines where a panel of immunophenotypic features and laboratory results are used to specify diagnostic criteria. RESULTS AND CONCLUSION SANTF demonstrated over 10% improvement in averaged F-measure on patient clustering compared to widely used non-negative matrix factorization (NMF) and k-means clustering methods. Multiple baselines were established by modeling patient data using patient-by-features matrices with different feature configurations and then performing NMF or k-means to cluster patients. Feature analysis identified latent groups of higher-order features that lead to medical insights. We also found that the latent groups of atomic features help to better correlate the latent groups of higher-order features.
Collapse
Affiliation(s)
- Yuan Luo
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology
| | - Yu Xin
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology
| | - Ephraim Hochberg
- Center for Lymphoma, Massachusetts General Hospital and Department of Medicine, Harvard Medical School
| | - Rohit Joshi
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology
| | - Ozlem Uzuner
- Department of Information Studies, State University of New York at Albany
| | - Peter Szolovits
- Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology
| |
Collapse
|
10
|
Sankaranarayanan P, Schomay TE, Aiello KA, Alter O. Tensor GSVD of patient- and platform-matched tumor and normal DNA copy-number profiles uncovers chromosome arm-wide patterns of tumor-exclusive platform-consistent alterations encoding for cell transformation and predicting ovarian cancer survival. PLoS One 2015; 10:e0121396. [PMID: 25875127 PMCID: PMC4398562 DOI: 10.1371/journal.pone.0121396] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 01/31/2015] [Indexed: 11/28/2022] Open
Abstract
The number of large-scale high-dimensional datasets recording different aspects of a single disease is growing, accompanied by a need for frameworks that can create one coherent model from multiple tensors of matched columns, e.g., patients and platforms, but independent rows, e.g., probes. We define and prove the mathematical properties of a novel tensor generalized singular value decomposition (GSVD), which can simultaneously find the similarities and dissimilarities, i.e., patterns of varying relative significance, between any two such tensors. We demonstrate the tensor GSVD in comparative modeling of patient- and platform-matched but probe-independent ovarian serous cystadenocarcinoma (OV) tumor, mostly high-grade, and normal DNA copy-number profiles, across each chromosome arm, and combination of two arms, separately. The modeling uncovers previously unrecognized patterns of tumor-exclusive platform-consistent co-occurring copy-number alterations (CNAs). We find, first, and validate that each of the patterns across only 7p and Xq, and the combination of 6p+12p, is correlated with a patient’s prognosis, is independent of the tumor’s stage, the best predictor of OV survival to date, and together with stage makes a better predictor than stage alone. Second, these patterns include most known OV-associated CNAs that map to these chromosome arms, as well as several previously unreported, yet frequent focal CNAs. Third, differential mRNA, microRNA, and protein expression consistently map to the DNA CNAs. A coherent picture emerges for each pattern, suggesting roles for the CNAs in OV pathogenesis and personalized therapy. In 6p+12p, deletion of the p21-encoding CDKN1A and p38-encoding MAPK14 and amplification of RAD51AP1 and KRAS encode for human cell transformation, and are correlated with a cell’s immortality, and a patient’s shorter survival time. In 7p, RPA3 deletion and POLD2 amplification are correlated with DNA stability, and a longer survival. In Xq, PABPC5 deletion and BCAP31 amplification are correlated with a cellular immune response, and a longer survival.
Collapse
MESH Headings
- Carcinoma, Ovarian Epithelial
- Cell Transformation, Neoplastic/genetics
- Chromosome Mapping
- Chromosomes/genetics
- Cystadenocarcinoma, Serous/diagnosis
- Cystadenocarcinoma, Serous/genetics
- Cystadenocarcinoma, Serous/pathology
- DNA Copy Number Variations/genetics
- Female
- Gene Expression Regulation, Neoplastic
- Humans
- MicroRNAs/biosynthesis
- Models, Theoretical
- Mutation
- Neoplasm Proteins/biosynthesis
- Neoplasms, Glandular and Epithelial/diagnosis
- Neoplasms, Glandular and Epithelial/genetics
- Neoplasms, Glandular and Epithelial/pathology
- Ovarian Neoplasms/diagnosis
- Ovarian Neoplasms/genetics
- Ovarian Neoplasms/pathology
- Prognosis
- RNA, Messenger/biosynthesis
- RNA, Messenger/genetics
- Survival Analysis
Collapse
Affiliation(s)
- Preethi Sankaranarayanan
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Theodore E. Schomay
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Katherine A. Aiello
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Orly Alter
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, United States of America
- * E-mail:
| |
Collapse
|
11
|
He S, Yin J, Li H, Wang X. Graphical model selection and estimation for high dimensional tensor data. J MULTIVARIATE ANAL 2014. [DOI: 10.1016/j.jmva.2014.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
12
|
SVD identifies transcript length distribution functions from DNA microarray data and reveals evolutionary forces globally affecting GBM metabolism. PLoS One 2013; 8:e78913. [PMID: 24282503 PMCID: PMC3839928 DOI: 10.1371/journal.pone.0078913] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 09/25/2013] [Indexed: 01/10/2023] Open
Abstract
To search for evolutionary forces that might act upon transcript length, we use the singular value decomposition (SVD) to identify the length distribution functions of sets and subsets of human and yeast transcripts from profiles of mRNA abundance levels across gel electrophoresis migration distances that were previously measured by DNA microarrays. We show that the SVD identifies the transcript length distribution functions as “asymmetric generalized coherent states” from the DNA microarray data and with no a-priori assumptions. Comparing subsets of human and yeast transcripts of the same gene ontology annotations, we find that in both disparate eukaryotes, transcripts involved in protein synthesis or mitochondrial metabolism are significantly shorter than typical, and in particular, significantly shorter than those involved in glucose metabolism. Comparing the subsets of human transcripts that are overexpressed in glioblastoma multiforme (GBM) or normal brain tissue samples from The Cancer Genome Atlas, we find that GBM maintains normal brain overexpression of significantly short transcripts, enriched in transcripts that are involved in protein synthesis or mitochondrial metabolism, but suppresses normal overexpression of significantly longer transcripts, enriched in transcripts that are involved in glucose metabolism and brain activity. These global relations among transcript length, cellular metabolism and tumor development suggest a previously unrecognized physical mode for tumor and normal cells to differentially regulate metabolism in a transcript length-dependent manner. The identified distribution functions support a previous hypothesis from mathematical modeling of evolutionary forces that act upon transcript length in the manner of the restoring force of the harmonic oscillator.
Collapse
|
13
|
Mohr H, Mohr CA, Schneider MR, Scrivano L, Adler B, Kraner-Schreiber S, Schnieke A, Dahlhoff M, Wolf E, Koszinowski UH, Ruzsics Z. Cytomegalovirus replicon-based regulation of gene expression in vitro and in vivo. PLoS Pathog 2012; 8:e1002728. [PMID: 22685399 PMCID: PMC3369935 DOI: 10.1371/journal.ppat.1002728] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2011] [Accepted: 04/18/2012] [Indexed: 12/14/2022] Open
Abstract
There is increasing evidence for a connection between DNA replication and the expression of adjacent genes. Therefore, this study addressed the question of whether a herpesvirus origin of replication can be used to activate or increase the expression of adjacent genes. Cell lines carrying an episomal vector, in which reporter genes are linked to the murine cytomegalovirus (MCMV) origin of lytic replication (oriLyt), were constructed. Reporter gene expression was silenced by a histone-deacetylase-dependent mechanism, but was resolved upon lytic infection with MCMV. Replication of the episome was observed subsequent to infection, leading to the induction of gene expression by more than 1000-fold. oriLyt-based regulation thus provided a unique opportunity for virus-induced conditional gene expression without the need for an additional induction mechanism. This principle was exploited to show effective late trans-complementation of the toxic viral protein M50 and the glycoprotein gO of MCMV. Moreover, the application of this principle for intracellular immunization against herpesvirus infection was demonstrated. The results of the present study show that viral infection specifically activated the expression of a dominant-negative transgene, which inhibited viral growth. This conditional system was operative in explant cultures of transgenic mice, but not in vivo. Several applications are discussed. All herpesviruses show a precisely regulated gene expression profile, including true-late genes, which are turned on only after the onset of DNA replication. We used this intrinsic viral mechanism to generate a versatile conditional gene expression system that exploits the activity of the murine cytomegalovirus (MCMV) viral origin of lytic replication (oriLyt). Upon virus infection, replication of the viral genome also led to the replication and activation of the oriLyt-coupled episomal transgene. The oriLyt-based replicons were silenced in all stable cell lines and transgenic mice; however, virus infection liberated the plasmids from histone-deacetylase-induced inactivation. As maximum gene expression relied on relief from silencing via replication of the episomal constructs, very strong induction of the reporter gene was achieved. We showed that this system can be used for trans-complementation of late, toxic viral genes, to block virus production by activating dominant-negative (DN) transgenes, and to provide a new tool to study the principles of viral replication.
Collapse
Affiliation(s)
- Hermine Mohr
- Max von Pettenkofer-Institute, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Christian A. Mohr
- Max von Pettenkofer-Institute, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Marlon R. Schneider
- Institute of Molecular Animal Breeding and Biotechnology, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Laura Scrivano
- Institute of Molecular Animal Breeding and Biotechnology, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Barbara Adler
- Max von Pettenkofer-Institute, Ludwig-Maximilians-Universität München, Munich, Germany
| | | | - Angelika Schnieke
- Chair of Livestock Biotechnology, Technische Universität München, Freising, Germany
| | - Maik Dahlhoff
- Chair of Livestock Biotechnology, Technische Universität München, Freising, Germany
| | - Eckhard Wolf
- Institute of Molecular Animal Breeding and Biotechnology, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Ulrich H. Koszinowski
- Max von Pettenkofer-Institute, Ludwig-Maximilians-Universität München, Munich, Germany
- * E-mail:
| | - Zsolt Ruzsics
- Max von Pettenkofer-Institute, Ludwig-Maximilians-Universität München, Munich, Germany
| |
Collapse
|
14
|
Di Rienzi SC, Lindstrom KC, Mann T, Noble WS, Raghuraman MK, Brewer BJ. Maintaining replication origins in the face of genomic change. Genome Res 2012; 22:1940-52. [PMID: 22665441 PMCID: PMC3460189 DOI: 10.1101/gr.138248.112] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Origins of replication present a paradox to evolutionary biologists. As a collection, they are absolutely essential genomic features, but individually are highly redundant and nonessential. It is therefore difficult to predict to what extent and in what regard origins are conserved over evolutionary time. Here, through a comparative genomic analysis of replication origins and chromosomal replication patterns in the budding yeasts Saccharomyces cerevisiae and Lachancea waltii, we assess to what extent replication origins survived genomic change produced from 150 million years of evolution. We find that L. waltii origins exhibit a core consensus sequence and nucleosome occupancy pattern highly similar to those of S. cerevisiae origins. We further observe that the overall progression of chromosomal replication is similar between L. waltii and S. cerevisiae. Nevertheless, few origins show evidence of being conserved in location between the two species. Among the conserved origins are those surrounding centromeres and adjacent to histone genes, suggesting that proximity to an origin may be important for their regulation. We conclude that, over evolutionary time, origins maintain sequence, structure, and regulation, but are continually being created and destroyed, with the result that their locations are generally not conserved.
Collapse
Affiliation(s)
- Sara C Di Rienzi
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | |
Collapse
|
15
|
Abstract
Computers are organized into hardware and software. Using a theoretical approach to identify patterns in gene expression in a variety of species, organs, and cell types, we found that biological systems similarly are comprised of a relatively unchanging hardware-like gene pattern. Orthogonal patterns of software-like transcripts vary greatly, even among tumors of the same type from different individuals. Two distinguishable classes could be identified within the hardware-like component: those transcripts that are highly expressed and stable and an adaptable subset with lower expression that respond to external stimuli. Importantly, we demonstrate that this structure is conserved across organisms. Deletions of transcripts from the highly stable core are predicted to result in cell mortality. The approach provides a conceptual thermodynamic-like framework for the analysis of gene-expression levels and networks and their variations in diseased cells.
Collapse
|
16
|
Lee CH, Alpert BO, Sankaranarayanan P, Alter O. GSVD comparison of patient-matched normal and tumor aCGH profiles reveals global copy-number alterations predicting glioblastoma multiforme survival. PLoS One 2012; 7:e30098. [PMID: 22291905 PMCID: PMC3264559 DOI: 10.1371/journal.pone.0030098] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Accepted: 12/09/2011] [Indexed: 11/18/2022] Open
Abstract
Despite recent large-scale profiling efforts, the best prognostic predictor of glioblastoma multiforme (GBM) remains the patient's age at diagnosis. We describe a global pattern of tumor-exclusive co-occurring copy-number alterations (CNAs) that is correlated, possibly coordinated with GBM patients' survival and response to chemotherapy. The pattern is revealed by GSVD comparison of patient-matched but probe-independent GBM and normal aCGH datasets from The Cancer Genome Atlas (TCGA). We find that, first, the GSVD, formulated as a framework for comparatively modeling two composite datasets, removes from the pattern copy-number variations (CNVs) that occur in the normal human genome (e.g., female-specific X chromosome amplification) and experimental variations (e.g., in tissue batch, genomic center, hybridization date and scanner), without a-priori knowledge of these variations. Second, the pattern includes most known GBM-associated changes in chromosome numbers and focal CNAs, as well as several previously unreported CNAs in >3% of the patients. These include the biochemically putative drug target, cell cycle-regulated serine/threonine kinase-encoding TLK2, the cyclin E1-encoding CCNE1, and the Rb-binding histone demethylase-encoding KDM5A. Third, the pattern provides a better prognostic predictor than the chromosome numbers or any one focal CNA that it identifies, suggesting that the GBM survival phenotype is an outcome of its global genotype. The pattern is independent of age, and combined with age, makes a better predictor than age alone. GSVD comparison of matched profiles of a larger set of TCGA patients, inclusive of the initial set, confirms the global pattern. GSVD classification of the GBM profiles of an independent set of patients validates the prognostic contribution of the pattern.
Collapse
Affiliation(s)
- Cheng H. Lee
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
| | - Benjamin O. Alpert
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Preethi Sankaranarayanan
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
| | - Orly Alter
- Scientific Computing and Imaging (SCI) Institute, University of Utah, Salt Lake City, Utah, United States of America
- Department of Bioengineering, University of Utah, Salt Lake City, Utah, United States of America
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, United States of America
| |
Collapse
|
17
|
Ponnapalli SP, Saunders MA, Van Loan CF, Alter O. A higher-order generalized singular value decomposition for comparison of global mRNA expression from multiple organisms. PLoS One 2011; 6:e28072. [PMID: 22216090 PMCID: PMC3245232 DOI: 10.1371/journal.pone.0028072] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Accepted: 10/31/2011] [Indexed: 11/18/2022] Open
Abstract
The number of high-dimensional datasets recording multiple aspects of a single phenomenon is increasing in many areas of science, accompanied by a need for mathematical frameworks that can compare multiple large-scale matrices with different row dimensions. The only such framework to date, the generalized singular value decomposition (GSVD), is limited to two matrices. We mathematically define a higher-order GSVD (HO GSVD) for N≥2 matrices , each with full column rank. Each matrix is exactly factored as Di = UiΣiVT, where V, identical in all factorizations, is obtained from the eigensystem SV = VΛ of the arithmetic mean S of all pairwise quotients of the matrices , i≠j. We prove that this decomposition extends to higher orders almost all of the mathematical properties of the GSVD. The matrix S is nondefective with V and Λ real. Its eigenvalues satisfy λk≥1. Equality holds if and only if the corresponding eigenvector vk is a right basis vector of equal significance in all matrices Di and Dj, that is σi,k/σj,k = 1 for all i and j, and the corresponding left basis vector ui,k is orthogonal to all other vectors in Ui for all i. The eigenvalues λk = 1, therefore, define the “common HO GSVD subspace.” We illustrate the HO GSVD with a comparison of genome-scale cell-cycle mRNA expression from S. pombe, S. cerevisiae and human. Unlike existing algorithms, a mapping among the genes of these disparate organisms is not required. We find that the approximately common HO GSVD subspace represents the cell-cycle mRNA expression oscillations, which are similar among the datasets. Simultaneous reconstruction in the common subspace, therefore, removes the experimental artifacts, which are dissimilar, from the datasets. In the simultaneous sequence-independent classification of the genes of the three organisms in this common subspace, genes of highly conserved sequences but significantly different cell-cycle peak times are correctly classified.
Collapse
Affiliation(s)
- Sri Priya Ponnapalli
- Department of Electrical and Computer Engineering, University of Texas at Austin, Texas, United States of America
| | - Michael A. Saunders
- Department of Management Science and Engineering, Stanford University, Stanford, California, United States of America
| | - Charles F. Van Loan
- Department of Computer Science, Cornell University, Ithaca, New York, United States of America
| | - Orly Alter
- Scientific Computing and Imaging (SCI) Institute and Departments of Bioengineering and Human Genetics, University of Utah, Salt Lake City, Utah, United States of America
- * E-mail:
| |
Collapse
|
18
|
Siow CC, Nieduszynska SR, Müller CA, Nieduszynski CA. OriDB, the DNA replication origin database updated and extended. Nucleic Acids Res 2011; 40:D682-6. [PMID: 22121216 PMCID: PMC3245157 DOI: 10.1093/nar/gkr1091] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
OriDB (http://www.oridb.org/) is a database containing collated genome-wide mapping studies of confirmed and predicted replication origin sites. The original database collated and curated Saccharomyces cerevisiae origin mapping studies. Here, we report that the OriDB database and web site have been revamped to improve user accessibility to curated data sets, to greatly increase the number of curated origin mapping studies, and to include the collation of replication origin sites in the fission yeast Schizosaccharomyces pombe. The revised database structure underlies these improvements and will facilitate further expansion in the future. The updated OriDB for S. cerevisiae is available at http://cerevisiae.oridb.org/ and for S. pombe at http://pombe.oridb.org/.
Collapse
Affiliation(s)
- Cheuk C Siow
- Centre for Genetics and Genomics, The University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK
| | | | | | | |
Collapse
|
19
|
Khobta A, Epe B. Interactions between DNA damage, repair, and transcription. Mutat Res 2011; 736:5-14. [PMID: 21907218 DOI: 10.1016/j.mrfmmm.2011.07.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2010] [Revised: 06/22/2011] [Accepted: 07/25/2011] [Indexed: 01/16/2023]
Abstract
This review addresses a variety of mechanisms by which DNA repair interacts with transcription and vice versa. Blocking of transcriptional elongation is the best studied of these mechanisms. Transcription recovery after damage therefore has often been used as a surrogate marker of DNA repair in cells. However, it has become evident that relationships between DNA damage, repair, and transcription are more complex due to various indirect effects of DNA damage on gene transcription. These include inhibition of transcription by DNA repair intermediates as well as regulation of transcription and of the epigenetic status of the genes by DNA repair-related mechanisms. In addition, since transcription is emerging as an important endogenous source of DNA damage in cells, we briefly summarise recent advances in understanding the nature of co-transcriptionally induced DNA damage and the DNA repair pathways involved.
Collapse
Affiliation(s)
- Andriy Khobta
- Institute of Pharmacy and Biochemistry, University of Mainz, Mainz, Germany
| | | |
Collapse
|
20
|
Brázda V, Laister RC, Jagelská EB, Arrowsmith C. Cruciform structures are a common DNA feature important for regulating biological processes. BMC Mol Biol 2011; 12:33. [PMID: 21816114 PMCID: PMC3176155 DOI: 10.1186/1471-2199-12-33] [Citation(s) in RCA: 177] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2011] [Accepted: 08/05/2011] [Indexed: 04/10/2023] Open
Abstract
DNA cruciforms play an important role in the regulation of natural processes involving DNA. These structures are formed by inverted repeats, and their stability is enhanced by DNA supercoiling. Cruciform structures are fundamentally important for a wide range of biological processes, including replication, regulation of gene expression, nucleosome structure and recombination. They also have been implicated in the evolution and development of diseases including cancer, Werner's syndrome and others. Cruciform structures are targets for many architectural and regulatory proteins, such as histones H1 and H5, topoisomerase IIβ, HMG proteins, HU, p53, the proto-oncogene protein DEK and others. A number of DNA-binding proteins, such as the HMGB-box family members, Rad54, BRCA1 protein, as well as PARP-1 polymerase, possess weak sequence specific DNA binding yet bind preferentially to cruciform structures. Some of these proteins are, in fact, capable of inducing the formation of cruciform structures upon DNA binding. In this article, we review the protein families that are involved in interacting with and regulating cruciform structures, including (a) the junction-resolving enzymes, (b) DNA repair proteins and transcription factors, (c) proteins involved in replication and (d) chromatin-associated proteins. The prevalence of cruciform structures and their roles in protein interactions, epigenetic regulation and the maintenance of cell homeostasis are also discussed.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics, Academy of Sciences of the Czech Republic, v,v,i,, Královopolská 135, Brno, 612 65, Czech Republic.
| | | | | | | |
Collapse
|
21
|
Ding Q, MacAlpine DM. Defining the replication program through the chromatin landscape. Crit Rev Biochem Mol Biol 2011; 46:165-79. [PMID: 21417598 DOI: 10.3109/10409238.2011.560139] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
DNA replication is an essential cell cycle event required for the accurate and timely duplication of the chromosomes. It is essential that the genome is replicated accurately and completely within the confines of S-phase. Failure to completely copy the genome has the potential to result in catastrophic genomic instability. Replication initiates in a coordinated manner from multiple locations, termed origins of replication, distributed across each of the chromosomes. The selection of these origins of replication is a dynamic process responding to both developmental and tissue-specific signals. In this review, we explore the role of the local chromatin environment in regulating the DNA replication program at the level of origin selection and activation. Finally, there is increasing molecular evidence that the DNA replication program itself affects the chromatin landscape, suggesting that DNA replication is critical for both genetic and epigenetic inheritance.
Collapse
Affiliation(s)
- Queying Ding
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, NC 27710, USA
| | | |
Collapse
|
22
|
Tensor decomposition reveals concurrent evolutionary convergences and divergences and correlations with structural motifs in ribosomal RNA. PLoS One 2011; 6:e18768. [PMID: 21625625 PMCID: PMC3094155 DOI: 10.1371/journal.pone.0018768] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Accepted: 03/17/2011] [Indexed: 11/19/2022] Open
Abstract
Evolutionary relationships among organisms are commonly described by using a hierarchy derived from comparisons of ribosomal RNA (rRNA) sequences. We propose that even on the level of a single rRNA molecule, an organism's evolution is composed of multiple pathways due to concurrent forces that act independently upon different rRNA degrees of freedom. Relationships among organisms are then compositions of coexisting pathway-dependent similarities and dissimilarities, which cannot be described by a single hierarchy. We computationally test this hypothesis in comparative analyses of 16S and 23S rRNA sequence alignments by using a tensor decomposition, i.e., a framework for modeling composite data. Each alignment is encoded in a cuboid, i.e., a third-order tensor, where nucleotides, positions and organisms, each represent a degree of freedom. A tensor mode-1 higher-order singular value decomposition (HOSVD) is formulated such that it separates each cuboid into combinations of patterns of nucleotide frequency variation across organisms and positions, i.e., "eigenpositions" and corresponding nucleotide-specific segments of "eigenorganisms," respectively, independent of a-priori knowledge of the taxonomic groups or rRNA structures. We find, in support of our hypothesis that, first, the significant eigenpositions reveal multiple similarities and dissimilarities among the taxonomic groups. Second, the corresponding eigenorganisms identify insertions or deletions of nucleotides exclusively conserved within the corresponding groups, that map out entire substructures and are enriched in adenosines, unpaired in the rRNA secondary structure, that participate in tertiary structure interactions. This demonstrates that structural motifs involved in rRNA folding and function are evolutionary degrees of freedom. Third, two previously unknown coexisting subgenic relationships between Microsporidia and Archaea are revealed in both the 16S and 23S rRNA alignments, a convergence and a divergence, conferred by insertions and deletions of these motifs, which cannot be described by a single hierarchy. This shows that mode-1 HOSVD modeling of rRNA alignments might be used to computationally predict evolutionary mechanisms.
Collapse
|
23
|
Kravchenko-Balasha N, Remacle F, Gross A, Rotter V, Levitzki A, Levine RD. Convergence of logic of cellular regulation in different premalignant cells by an information theoretic approach. BMC SYSTEMS BIOLOGY 2011; 5:42. [PMID: 21410932 PMCID: PMC3072338 DOI: 10.1186/1752-0509-5-42] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2010] [Accepted: 03/16/2011] [Indexed: 11/10/2022]
Abstract
BACKGROUND Surprisal analysis is a thermodynamic-like molecular level approach that identifies biological constraints that prevents the entropy from reaching its maximum. To examine the significance of altered gene expression levels in tumorigenesis we apply surprisal analysis to the WI-38 model through its precancerous states. The constraints identified by the analysis are transcription patterns underlying the process of transformation. Each pattern highlights the role of a group of genes that act coherently to define a transformed phenotype. RESULTS We identify a major transcription pattern that represents a contraction of signaling networks accompanied by induction of cellular proliferation and protein metabolism, which is essential for full transformation. In addition, a more minor, "tumor signature" transcription pattern completes the transformation process. The variation with time of the importance of each transcription pattern is determined. Midway through the transformation, at the stage when cells switch from slow to fast growth rate, the major transcription pattern undergoes a total inversion of its weight while the more minor pattern does not contribute before that stage. CONCLUSIONS A similar network reorganization occurs in two very different cellular transformation models: WI-38 and the cervical cancer HF1 models. Our results suggest that despite differences in a list of transcripts expressed in different cancer models the rationale of the network reorganization remains essentially the same.
Collapse
Affiliation(s)
- Nataly Kravchenko-Balasha
- Unit of Cellular Signaling, Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
| | | | | | | | | | | |
Collapse
|
24
|
Behnke MS, Wootton JC, Lehmann MM, Radke JB, Lucas O, Nawas J, Sibley LD, White MW. Coordinated progression through two subtranscriptomes underlies the tachyzoite cycle of Toxoplasma gondii. PLoS One 2010; 5:e12354. [PMID: 20865045 PMCID: PMC2928733 DOI: 10.1371/journal.pone.0012354] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Accepted: 06/12/2010] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND Apicomplexan parasites replicate by varied and unusual processes where the typically eukaryotic expansion of cellular components and chromosome cycle are coordinated with the biosynthesis of parasite-specific structures essential for transmission. METHODOLOGY/PRINCIPAL FINDINGS Here we describe the global cell cycle transcriptome of the tachyzoite stage of Toxoplasma gondii. In dividing tachyzoites, more than a third of the mRNAs exhibit significant cyclical profiles whose timing correlates with biosynthetic events that unfold during daughter parasite formation. These 2,833 mRNAs have a bimodal organization with peak expression occurring in one of two transcriptional waves that are bounded by the transition into S phase and cell cycle exit following cytokinesis. The G1-subtranscriptome is enriched for genes required for basal biosynthetic and metabolic functions, similar to most eukaryotes, while the S/M-subtranscriptome is characterized by the uniquely apicomplexan requirements of parasite maturation, development of specialized organelles, and egress of infectious daughter cells. Two dozen AP2 transcription factors form a series through the tachyzoite cycle with successive sharp peaks of protein expression in the same timeframes as their mRNA patterns, indicating that the mechanisms responsible for the timing of protein delivery might be mediated by AP2 domains with different promoter recognition specificities. CONCLUSION/SIGNIFICANCE Underlying each of the major events in apicomplexan cell cycles, and many more subordinate actions, are dynamic changes in parasite gene expression. The mechanisms responsible for cyclical gene expression timing are likely crucial to the efficiency of parasite replication and may provide new avenues for interfering with parasite growth.
Collapse
Affiliation(s)
- Michael S. Behnke
- Department of Veterinary Molecular Biology, Montana State University, Bozeman, Montana, United States of America
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - John C. Wootton
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Margaret M. Lehmann
- Department of Veterinary Molecular Biology, Montana State University, Bozeman, Montana, United States of America
| | - Josh B. Radke
- Department of Veterinary Molecular Biology, Montana State University, Bozeman, Montana, United States of America
- Departments of Molecular Medicine and Global Health, University of South Florida, Tampa, Florida, United States of America
| | - Olivier Lucas
- Departments of Molecular Medicine and Global Health, University of South Florida, Tampa, Florida, United States of America
| | - Julie Nawas
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - L. David Sibley
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Michael W. White
- Department of Veterinary Molecular Biology, Montana State University, Bozeman, Montana, United States of America
- Departments of Molecular Medicine and Global Health, University of South Florida, Tampa, Florida, United States of America
| |
Collapse
|