Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Omberg L, Meyerson JR, Kobayashi K, Drury LS, Diffley JFX, Alter O. Global effects of DNA replication and DNA replication origin activity on eukaryotic gene expression. Mol Syst Biol 2009;5:312. [PMID: 19888207 PMCID: PMC2779084 DOI: 10.1038/msb.2009.70] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2009] [Accepted: 08/19/2009] [Indexed: 11/09/2022] Open

For:	Omberg L, Meyerson JR, Kobayashi K, Drury LS, Diffley JFX, Alter O. Global effects of DNA replication and DNA replication origin activity on eukaryotic gene expression. Mol Syst Biol 2009;5:312. [PMID: 19888207 PMCID: PMC2779084 DOI: 10.1038/msb.2009.70] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2009] [Accepted: 08/19/2009] [Indexed: 11/09/2022] Open

Number

Cited by Other Article(s)

Santos MM, Johnson MC, Fiedler L, Zegerman P. Global early replication disrupts gene expression and chromatin conformation in a single cell cycle. Genome Biol 2022;23:217. [PMID: 36253803 PMCID: PMC9575230 DOI: 10.1186/s13059-022-02788-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 10/10/2022] [Indexed: 12/03/2022] Open

Point centromere activity requires an optimal level of centromeric noncoding RNA. Proc Natl Acad Sci U S A 2019;116:6270-6279. [PMID: 30850541 PMCID: PMC6442628 DOI: 10.1073/pnas.1821384116] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

Budding yeast harbors a simple point centromere, which is originally believed to be sequence dependent without much epigenetic regulation and is transcription incompatible, as inserting a strong promoter upstream inactivates the centromere completely. Here, we demonstrate that an optimal level centromeric noncoding RNA is required for budding yeast centromere activity. Centromeric transcription is induced in S phase, coinciding with the assembly of new centromeric proteins. Too much or too little centromeric noncoding RNA leads to centromere malfunction. Overexpression of centromeric noncoding RNA reduces the protein levels and chromatin localization of inner centromere and kinetochore proteins, such as CENP-A, CENP-C, and the chromosome passenger complex. This work shows that point centromere is epigenetically regulated by noncoding RNA.

In budding yeast, which possesses simple point centromeres, we discovered that all of its centromeres express long noncoding RNAs (cenRNAs), especially in S phase. Induction of cenRNAs coincides with CENP-A^Cse4 loading time and is dependent on DNA replication. Centromeric transcription is repressed by centromere-binding factor Cbf1 and histone H2A variant H2A.Z^Htz1. Deletion of CBF1 and H2A.Z^HTZ1 results in an up-regulation of cenRNAs; an increased loss of a minichromosome; elevated aneuploidy; a down-regulation of the protein levels of centromeric proteins CENP-A^Cse4, CENP-A chaperone HJURP^Scm3, CENP-C^Mif2, Survivin^Bir1, and INCENP^Sli15; and a reduced chromatin localization of CENP-A^Cse4, CENP-C^Mif2, and Aurora B^Ipl1. When the RNA interference system was introduced to knock down all cenRNAs from the endogenous chromosomes, but not the cenRNA from the circular minichromosome, an increase in minichromosome loss was still observed, suggesting that cenRNA functions in trans to regulate centromere activity. CenRNA knockdown partially alleviates minichromosome loss in cbf1Δ, htz1Δ, and cbf1Δ htz1Δ in a dose-dependent manner, demonstrating that cenRNA level is tightly regulated to epigenetically control point centromere function.

Collapse

Pietrzyk Ł. Food properties and dietary habits in colorectal cancer prevention and development. INTERNATIONAL JOURNAL OF FOOD PROPERTIES 2017. [DOI: 10.1080/10942912.2016.1236813] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Müller CA, Nieduszynski CA. DNA replication timing influences gene expression level. J Cell Biol 2017;216:1907-1914. [PMID: 28539386 PMCID: PMC5496624 DOI: 10.1083/jcb.201701061] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Revised: 04/06/2017] [Accepted: 04/19/2017] [Indexed: 12/31/2022] Open

Luo Y, Wang F, Szolovits P. Tensor factorization toward precision medicine. Brief Bioinform 2017;18:511-514. [PMID: 26994614 PMCID: PMC6078180 DOI: 10.1093/bib/bbw026] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 01/08/2016] [Indexed: 11/13/2022] Open

Aiello KA, Alter O. Platform-Independent Genome-Wide Pattern of DNA Copy-Number Alterations Predicting Astrocytoma Survival and Response to Treatment Revealed by the GSVD Formulated as a Comparative Spectral Decomposition. PLoS One 2016;11:e0164546. [PMID: 27798635 PMCID: PMC5087864 DOI: 10.1371/journal.pone.0164546] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Accepted: 09/27/2016] [Indexed: 01/07/2023] Open

Abstract

We use the generalized singular value decomposition (GSVD), formulated as a comparative spectral decomposition, to model patient-matched grades III and II, i.e., lower-grade astrocytoma (LGA) brain tumor and normal DNA copy-number profiles. A genome-wide tumor-exclusive pattern of DNA copy-number alterations (CNAs) is revealed, encompassed in that previously uncovered in glioblastoma (GBM), i.e., grade IV astrocytoma, where GBM-specific CNAs encode for enhanced opportunities for transformation and proliferation via growth and developmental signaling pathways in GBM relative to LGA. The GSVD separates the LGA pattern from other sources of biological and experimental variation, common to both, or exclusive to one of the tumor and normal datasets. We find, first, and computationally validate, that the LGA pattern is correlated with a patient's survival and response to treatment. Second, the GBM pattern identifies among the LGA patients a subtype, statistically indistinguishable from that among the GBM patients, where the CNA genotype is correlated with an approximately one-year survival phenotype. Third, cross-platform classification of the Affymetrix-measured LGA and GBM profiles by using the Agilent-derived GBM pattern shows that the GBM pattern is a platform-independent predictor of astrocytoma outcome. Statistically, the pattern is a better predictor (corresponding to greater median survival time difference, proportional hazard ratio, and concordance index) than the patient's age and the tumor's grade, which are the best indicators of astrocytoma currently in clinical use, and laboratory tests. The pattern is also statistically independent of these indicators, and, combined with either one, is an even better predictor of astrocytoma outcome. Recurring DNA CNAs have been observed in astrocytoma tumors' genomes for decades, however, copy-number subtypes that are predictive of patients' outcomes were not identified before. This is despite the growing number of datasets recording different aspects of the disease, and due to an existing fundamental need for mathematical frameworks that can simultaneously find similarities and dissimilarities across the datasets. This illustrates the ability of comparative spectral decompositions to find what other methods miss.

Collapse

Chitforoushzadeh Z, Ye Z, Sheng Z, LaRue S, Fry RC, Lauffenburger DA, Janes KA. TNF-insulin crosstalk at the transcription factor GATA6 is revealed by a model that links signaling and transcriptomic data tensors. Sci Signal 2016;9:ra59. [PMID: 27273097 DOI: 10.1126/scisignal.aad3373] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Descorps-Declère S, Saguez C, Cournac A, Marbouty M, Rolland T, Ma L, Bouchier C, Moszer I, Dujon B, Koszul R, Richard GF. Genome-wide replication landscape of Candida glabrata. BMC Biol 2015;13:69. [PMID: 26329162 PMCID: PMC4556013 DOI: 10.1186/s12915-015-0177-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 08/05/2015] [Indexed: 11/25/2022] Open

Abstract

Background

The opportunistic pathogen Candida glabrata is a member of the Saccharomycetaceae yeasts. Like its close relative Saccharomyces cerevisiae, it underwent a whole-genome duplication followed by an extensive loss of genes. Its genome contains a large number of very long tandem repeats, called megasatellites. In order to determine the whole replication program of the C. glabrata genome and its general chromosomal organization, we used deep-sequencing and chromosome conformation capture experiments.

Results

We identified 253 replication fork origins, genome wide. Centromeres, HML and HMR loci, and most histone genes are replicated early, whereas natural chromosomal breakpoints are located in late-replicating regions. In addition, 275 autonomously replicating sequences (ARS) were identified during ARS-capture experiments, and their relative fitness was determined during growth competition. Analysis of ARSs allowed us to identify a 17-bp consensus, similar to the S. cerevisiae ARS consensus sequence but slightly more constrained. Megasatellites are not in close proximity to replication origins or termini. Using chromosome conformation capture, we also show that early origins tend to cluster whereas non-subtelomeric megasatellites do not cluster in the yeast nucleus.

Conclusions

Despite a shorter cell cycle, the C. glabrata replication program shares unexpected striking similarities to S. cerevisiae, in spite of their large evolutionary distance and the presence of highly repetitive large tandem repeats in C. glabrata. No correlation could be found between the replication program and megasatellites, suggesting that their formation and propagation might not be directly caused by replication fork initiation or termination.

Electronic supplementary material

The online version of this article (doi:10.1186/s12915-015-0177-6) contains supplementary material, which is available to authorized users.

Collapse

Affiliation(s)

Stéphane Descorps-Declère Institut Pasteur, Center of Bioinformatics, Biostatistics and Integrative Biology (C3BI), F-75015, Paris, France.
Cyril Saguez Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.
Axel Cournac CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
Martial Marbouty CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
Thomas Rolland Present address: Institut Pasteur, Unité de Génétique Humaine et Fonctions Cognitives, Département des Neurosciences, F-75015, Paris, France.
Laurence Ma Institut Pasteur, Plate-forme Génomique, Département Génomes & Génétique, F-75015, Paris, France.
Christiane Bouchier Institut Pasteur, Plate-forme Génomique, Département Génomes & Génétique, F-75015, Paris, France.
Ivan Moszer Present address: Plate-forme Bio-informatique/Biostatistique, Institut de Neurosciences Translationnelles IHU-A-ICM, Hôpital Pitié-Salpêtrière, 47-83 bd de l'Hôpital, 75561, Paris, Cedex 13, France.
Bernard Dujon Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.
Romain Koszul CNRS, UMR3525, F-75015, Paris, France. .,Institut Pasteur, Groupe Régulation Spatiale des Génomes, Département Génomes & Génétique, F-75015, Paris, France.
Guy-Franck Richard Institut Pasteur, Unité de Génétique Moléculaire des Levures, Département Génomes & Génétique, F-75015, Paris, France. .,CNRS, UMR3525, F-75015, Paris, France. .,Sorbonne Universités, UPMC Univ Paris 06, 4 Place Jussieu, 75252, Paris, Cedex 05, France.

Collapse

Luo Y, Xin Y, Hochberg E, Joshi R, Uzuner O, Szolovits P. Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text. J Am Med Inform Assoc 2015;22:1009-19. [PMID: 25862765 PMCID: PMC4986663 DOI: 10.1093/jamia/ocv016] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Revised: 01/18/2015] [Accepted: 02/16/2015] [Indexed: 02/04/2023] Open

Abstract

OBJECTIVE

Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxes. Moreover, training data for these automated approaches at often sparsely annotated at best. The authors target unsupervised learning for modeling clinical narrative text, aiming at improving both accuracy and interpretability.

METHODS

The authors introduce a novel framework named subgraph augmented non-negative tensor factorization (SANTF). In addition to relying on atomic features (e.g., words in clinical narrative text), SANTF automatically mines higher-order features (e.g., relations of lymphoid cells expressing antigens) from clinical narrative text by converting sentences into a graph representation and identifying important subgraphs. The authors compose a tensor using patients, higher-order features, and atomic features as its respective modes. We then apply non-negative tensor factorization to cluster patients, and simultaneously identify latent groups of higher-order features that link to patient clusters, as in clinical guidelines where a panel of immunophenotypic features and laboratory results are used to specify diagnostic criteria.

RESULTS AND CONCLUSION

SANTF demonstrated over 10% improvement in averaged F-measure on patient clustering compared to widely used non-negative matrix factorization (NMF) and k-means clustering methods. Multiple baselines were established by modeling patient data using patient-by-features matrices with different feature configurations and then performing NMF or k-means to cluster patients. Feature analysis identified latent groups of higher-order features that lead to medical insights. We also found that the latent groups of atomic features help to better correlate the latent groups of higher-order features.

Collapse

Sankaranarayanan P, Schomay TE, Aiello KA, Alter O. Tensor GSVD of patient- and platform-matched tumor and normal DNA copy-number profiles uncovers chromosome arm-wide patterns of tumor-exclusive platform-consistent alterations encoding for cell transformation and predicting ovarian cancer survival. PLoS One 2015;10:e0121396. [PMID: 25875127 PMCID: PMC4398562 DOI: 10.1371/journal.pone.0121396] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 01/31/2015] [Indexed: 11/28/2022] Open

Abstract

The number of large-scale high-dimensional datasets recording different aspects of a single disease is growing, accompanied by a need for frameworks that can create one coherent model from multiple tensors of matched columns, e.g., patients and platforms, but independent rows, e.g., probes. We define and prove the mathematical properties of a novel tensor generalized singular value decomposition (GSVD), which can simultaneously find the similarities and dissimilarities, i.e., patterns of varying relative significance, between any two such tensors. We demonstrate the tensor GSVD in comparative modeling of patient- and platform-matched but probe-independent ovarian serous cystadenocarcinoma (OV) tumor, mostly high-grade, and normal DNA copy-number profiles, across each chromosome arm, and combination of two arms, separately. The modeling uncovers previously unrecognized patterns of tumor-exclusive platform-consistent co-occurring copy-number alterations (CNAs). We find, first, and validate that each of the patterns across only 7p and Xq, and the combination of 6p+12p, is correlated with a patient’s prognosis, is independent of the tumor’s stage, the best predictor of OV survival to date, and together with stage makes a better predictor than stage alone. Second, these patterns include most known OV-associated CNAs that map to these chromosome arms, as well as several previously unreported, yet frequent focal CNAs. Third, differential mRNA, microRNA, and protein expression consistently map to the DNA CNAs. A coherent picture emerges for each pattern, suggesting roles for the CNAs in OV pathogenesis and personalized therapy. In 6p+12p, deletion of the p21-encoding CDKN1A and p38-encoding MAPK14 and amplification of RAD51AP1 and KRAS encode for human cell transformation, and are correlated with a cell’s immortality, and a patient’s shorter survival time. In 7p, RPA3 deletion and POLD2 amplification are correlated with DNA stability, and a longer survival. In Xq, PABPC5 deletion and BCAP31 amplification are correlated with a cellular immune response, and a longer survival.

Collapse

He S, Yin J, Li H, Wang X. Graphical model selection and estimation for high dimensional tensor data. J MULTIVARIATE ANAL 2014. [DOI: 10.1016/j.jmva.2014.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

SVD identifies transcript length distribution functions from DNA microarray data and reveals evolutionary forces globally affecting GBM metabolism. PLoS One 2013;8:e78913. [PMID: 24282503 PMCID: PMC3839928 DOI: 10.1371/journal.pone.0078913] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 09/25/2013] [Indexed: 01/10/2023] Open

Abstract

To search for evolutionary forces that might act upon transcript length, we use the singular value decomposition (SVD) to identify the length distribution functions of sets and subsets of human and yeast transcripts from profiles of mRNA abundance levels across gel electrophoresis migration distances that were previously measured by DNA microarrays. We show that the SVD identifies the transcript length distribution functions as “asymmetric generalized coherent states” from the DNA microarray data and with no a-priori assumptions. Comparing subsets of human and yeast transcripts of the same gene ontology annotations, we find that in both disparate eukaryotes, transcripts involved in protein synthesis or mitochondrial metabolism are significantly shorter than typical, and in particular, significantly shorter than those involved in glucose metabolism. Comparing the subsets of human transcripts that are overexpressed in glioblastoma multiforme (GBM) or normal brain tissue samples from The Cancer Genome Atlas, we find that GBM maintains normal brain overexpression of significantly short transcripts, enriched in transcripts that are involved in protein synthesis or mitochondrial metabolism, but suppresses normal overexpression of significantly longer transcripts, enriched in transcripts that are involved in glucose metabolism and brain activity. These global relations among transcript length, cellular metabolism and tumor development suggest a previously unrecognized physical mode for tumor and normal cells to differentially regulate metabolism in a transcript length-dependent manner. The identified distribution functions support a previous hypothesis from mathematical modeling of evolutionary forces that act upon transcript length in the manner of the restoring force of the harmonic oscillator.

Collapse

Mohr H, Mohr CA, Schneider MR, Scrivano L, Adler B, Kraner-Schreiber S, Schnieke A, Dahlhoff M, Wolf E, Koszinowski UH, Ruzsics Z. Cytomegalovirus replicon-based regulation of gene expression in vitro and in vivo. PLoS Pathog 2012;8:e1002728. [PMID: 22685399 PMCID: PMC3369935 DOI: 10.1371/journal.ppat.1002728] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2011] [Accepted: 04/18/2012] [Indexed: 12/14/2022] Open

Abstract

There is increasing evidence for a connection between DNA replication and the expression of adjacent genes. Therefore, this study addressed the question of whether a herpesvirus origin of replication can be used to activate or increase the expression of adjacent genes. Cell lines carrying an episomal vector, in which reporter genes are linked to the murine cytomegalovirus (MCMV) origin of lytic replication (oriLyt), were constructed. Reporter gene expression was silenced by a histone-deacetylase-dependent mechanism, but was resolved upon lytic infection with MCMV. Replication of the episome was observed subsequent to infection, leading to the induction of gene expression by more than 1000-fold. oriLyt-based regulation thus provided a unique opportunity for virus-induced conditional gene expression without the need for an additional induction mechanism. This principle was exploited to show effective late trans-complementation of the toxic viral protein M50 and the glycoprotein gO of MCMV. Moreover, the application of this principle for intracellular immunization against herpesvirus infection was demonstrated. The results of the present study show that viral infection specifically activated the expression of a dominant-negative transgene, which inhibited viral growth. This conditional system was operative in explant cultures of transgenic mice, but not in vivo. Several applications are discussed.

All herpesviruses show a precisely regulated gene expression profile, including true-late genes, which are turned on only after the onset of DNA replication. We used this intrinsic viral mechanism to generate a versatile conditional gene expression system that exploits the activity of the murine cytomegalovirus (MCMV) viral origin of lytic replication (oriLyt). Upon virus infection, replication of the viral genome also led to the replication and activation of the oriLyt-coupled episomal transgene. The oriLyt-based replicons were silenced in all stable cell lines and transgenic mice; however, virus infection liberated the plasmids from histone-deacetylase-induced inactivation. As maximum gene expression relied on relief from silencing via replication of the episomal constructs, very strong induction of the reporter gene was achieved. We showed that this system can be used for trans-complementation of late, toxic viral genes, to block virus production by activating dominant-negative (DN) transgenes, and to provide a new tool to study the principles of viral replication.

Collapse

Di Rienzi SC, Lindstrom KC, Mann T, Noble WS, Raghuraman MK, Brewer BJ. Maintaining replication origins in the face of genomic change. Genome Res 2012;22:1940-52. [PMID: 22665441 PMCID: PMC3460189 DOI: 10.1101/gr.138248.112] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

On a fundamental structure of gene networks in living cells. Proc Natl Acad Sci U S A 2012;109:4702-7. [PMID: 22392990 DOI: 10.1073/pnas.1200790109] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Lee CH, Alpert BO, Sankaranarayanan P, Alter O. GSVD comparison of patient-matched normal and tumor aCGH profiles reveals global copy-number alterations predicting glioblastoma multiforme survival. PLoS One 2012;7:e30098. [PMID: 22291905 PMCID: PMC3264559 DOI: 10.1371/journal.pone.0030098] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Accepted: 12/09/2011] [Indexed: 11/18/2022] Open

Abstract

Despite recent large-scale profiling efforts, the best prognostic predictor of glioblastoma multiforme (GBM) remains the patient's age at diagnosis. We describe a global pattern of tumor-exclusive co-occurring copy-number alterations (CNAs) that is correlated, possibly coordinated with GBM patients' survival and response to chemotherapy. The pattern is revealed by GSVD comparison of patient-matched but probe-independent GBM and normal aCGH datasets from The Cancer Genome Atlas (TCGA). We find that, first, the GSVD, formulated as a framework for comparatively modeling two composite datasets, removes from the pattern copy-number variations (CNVs) that occur in the normal human genome (e.g., female-specific X chromosome amplification) and experimental variations (e.g., in tissue batch, genomic center, hybridization date and scanner), without a-priori knowledge of these variations. Second, the pattern includes most known GBM-associated changes in chromosome numbers and focal CNAs, as well as several previously unreported CNAs in >3% of the patients. These include the biochemically putative drug target, cell cycle-regulated serine/threonine kinase-encoding TLK2, the cyclin E1-encoding CCNE1, and the Rb-binding histone demethylase-encoding KDM5A. Third, the pattern provides a better prognostic predictor than the chromosome numbers or any one focal CNA that it identifies, suggesting that the GBM survival phenotype is an outcome of its global genotype. The pattern is independent of age, and combined with age, makes a better predictor than age alone. GSVD comparison of matched profiles of a larger set of TCGA patients, inclusive of the initial set, confirms the global pattern. GSVD classification of the GBM profiles of an independent set of patients validates the prognostic contribution of the pattern.

Collapse

Ponnapalli SP, Saunders MA, Van Loan CF, Alter O. A higher-order generalized singular value decomposition for comparison of global mRNA expression from multiple organisms. PLoS One 2011;6:e28072. [PMID: 22216090 PMCID: PMC3245232 DOI: 10.1371/journal.pone.0028072] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Accepted: 10/31/2011] [Indexed: 11/18/2022] Open

Abstract

The number of high-dimensional datasets recording multiple aspects of a single phenomenon is increasing in many areas of science, accompanied by a need for mathematical frameworks that can compare multiple large-scale matrices with different row dimensions. The only such framework to date, the generalized singular value decomposition (GSVD), is limited to two matrices. We mathematically define a higher-order GSVD (HO GSVD) for N≥2 matrices , each with full column rank. Each matrix is exactly factored as D_i = U_iΣ_iV^T, where V, identical in all factorizations, is obtained from the eigensystem SV = VΛ of the arithmetic mean S of all pairwise quotients of the matrices , i≠j. We prove that this decomposition extends to higher orders almost all of the mathematical properties of the GSVD. The matrix S is nondefective with V and Λ real. Its eigenvalues satisfy λ_k≥1. Equality holds if and only if the corresponding eigenvector v_k is a right basis vector of equal significance in all matrices D_i and D_j, that is σ_i,k/σ_j,k = 1 for all i and j, and the corresponding left basis vector u_i,k is orthogonal to all other vectors in U_i for all i. The eigenvalues λ_k = 1, therefore, define the “common HO GSVD subspace.” We illustrate the HO GSVD with a comparison of genome-scale cell-cycle mRNA expression from S. pombe, S. cerevisiae and human. Unlike existing algorithms, a mapping among the genes of these disparate organisms is not required. We find that the approximately common HO GSVD subspace represents the cell-cycle mRNA expression oscillations, which are similar among the datasets. Simultaneous reconstruction in the common subspace, therefore, removes the experimental artifacts, which are dissimilar, from the datasets. In the simultaneous sequence-independent classification of the genes of the three organisms in this common subspace, genes of highly conserved sequences but significantly different cell-cycle peak times are correctly classified.

Collapse

Siow CC, Nieduszynska SR, Müller CA, Nieduszynski CA. OriDB, the DNA replication origin database updated and extended. Nucleic Acids Res 2011;40:D682-6. [PMID: 22121216 PMCID: PMC3245157 DOI: 10.1093/nar/gkr1091] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Khobta A, Epe B. Interactions between DNA damage, repair, and transcription. Mutat Res 2011;736:5-14. [PMID: 21907218 DOI: 10.1016/j.mrfmmm.2011.07.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2010] [Revised: 06/22/2011] [Accepted: 07/25/2011] [Indexed: 01/16/2023]

Brázda V, Laister RC, Jagelská EB, Arrowsmith C. Cruciform structures are a common DNA feature important for regulating biological processes. BMC Mol Biol 2011;12:33. [PMID: 21816114 PMCID: PMC3176155 DOI: 10.1186/1471-2199-12-33] [Citation(s) in RCA: 177] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2011] [Accepted: 08/05/2011] [Indexed: 04/10/2023] Open

Ding Q, MacAlpine DM. Defining the replication program through the chromatin landscape. Crit Rev Biochem Mol Biol 2011;46:165-79. [PMID: 21417598 DOI: 10.3109/10409238.2011.560139] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Tensor decomposition reveals concurrent evolutionary convergences and divergences and correlations with structural motifs in ribosomal RNA. PLoS One 2011;6:e18768. [PMID: 21625625 PMCID: PMC3094155 DOI: 10.1371/journal.pone.0018768] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Accepted: 03/17/2011] [Indexed: 11/19/2022] Open

Abstract

Evolutionary relationships among organisms are commonly described by using a hierarchy derived from comparisons of ribosomal RNA (rRNA) sequences. We propose that even on the level of a single rRNA molecule, an organism's evolution is composed of multiple pathways due to concurrent forces that act independently upon different rRNA degrees of freedom. Relationships among organisms are then compositions of coexisting pathway-dependent similarities and dissimilarities, which cannot be described by a single hierarchy. We computationally test this hypothesis in comparative analyses of 16S and 23S rRNA sequence alignments by using a tensor decomposition, i.e., a framework for modeling composite data. Each alignment is encoded in a cuboid, i.e., a third-order tensor, where nucleotides, positions and organisms, each represent a degree of freedom. A tensor mode-1 higher-order singular value decomposition (HOSVD) is formulated such that it separates each cuboid into combinations of patterns of nucleotide frequency variation across organisms and positions, i.e., "eigenpositions" and corresponding nucleotide-specific segments of "eigenorganisms," respectively, independent of a-priori knowledge of the taxonomic groups or rRNA structures. We find, in support of our hypothesis that, first, the significant eigenpositions reveal multiple similarities and dissimilarities among the taxonomic groups. Second, the corresponding eigenorganisms identify insertions or deletions of nucleotides exclusively conserved within the corresponding groups, that map out entire substructures and are enriched in adenosines, unpaired in the rRNA secondary structure, that participate in tertiary structure interactions. This demonstrates that structural motifs involved in rRNA folding and function are evolutionary degrees of freedom. Third, two previously unknown coexisting subgenic relationships between Microsporidia and Archaea are revealed in both the 16S and 23S rRNA alignments, a convergence and a divergence, conferred by insertions and deletions of these motifs, which cannot be described by a single hierarchy. This shows that mode-1 HOSVD modeling of rRNA alignments might be used to computationally predict evolutionary mechanisms.

Collapse

Kravchenko-Balasha N, Remacle F, Gross A, Rotter V, Levitzki A, Levine RD. Convergence of logic of cellular regulation in different premalignant cells by an information theoretic approach. BMC SYSTEMS BIOLOGY 2011;5:42. [PMID: 21410932 PMCID: PMC3072338 DOI: 10.1186/1752-0509-5-42] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2010] [Accepted: 03/16/2011] [Indexed: 11/10/2022]

Behnke MS, Wootton JC, Lehmann MM, Radke JB, Lucas O, Nawas J, Sibley LD, White MW. Coordinated progression through two subtranscriptomes underlies the tachyzoite cycle of Toxoplasma gondii. PLoS One 2010;5:e12354. [PMID: 20865045 PMCID: PMC2928733 DOI: 10.1371/journal.pone.0012354] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Accepted: 06/12/2010] [Indexed: 01/29/2023] Open