1
|
Abujudeh S, Zeki SS, van Lanschot MCJ, Pusung M, Weaver JMJ, Li X, Noorani A, Metz AJ, Bornschein J, Bower L, Miremadi A, Fitzgerald RC, Morrissey ER, Lynch AG. Low-cost and clinically applicable copy number profiling using repeat DNA. BMC Genomics 2022; 23:599. [PMID: 35978291 PMCID: PMC9386984 DOI: 10.1186/s12864-022-08681-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 06/10/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Somatic copy number alterations (SCNAs) are an important class of genomic alteration in cancer. They are frequently observed in cancer samples, with studies showing that, on average, SCNAs affect 34% of a cancer cell's genome. Furthermore, SCNAs have been shown to be major drivers of tumour development and have been associated with response to therapy and prognosis. Large-scale cancer genome studies suggest that tumours are driven by somatic copy number alterations (SCNAs) or single-nucleotide variants (SNVs). Despite the frequency of SCNAs and their clinical relevance, the use of genomics assays in the clinic is biased towards targeted gene panels, which identify SNVs but provide limited scope to detect SCNAs throughout the genome. There is a need for a comparably low-cost and simple method for high-resolution SCNA profiling. RESULTS We present conliga, a fully probabilistic method that infers SCNA profiles from a low-cost, simple, and clinically-relevant assay (FAST-SeqS). When applied to 11 high-purity oesophageal adenocarcinoma samples, we obtain good agreement (Spearman's rank correlation coefficient, rs=0.94) between conliga's inferred SCNA profiles using FAST-SeqS data (approximately £14 per sample) and those inferred by ASCAT using high-coverage WGS (gold-standard). We find that conliga outperforms CNVkit (rs=0.89), also applied to FAST-SeqS data, and is comparable to QDNAseq (rs=0.96) applied to low-coverage WGS, which is approximately four-fold more expensive, more laborious and less clinically-relevant. By performing an in silico dilution series experiment, we find that conliga is particularly suited to detecting SCNAs in low tumour purity samples. At two million reads per sample, conliga is able to detect SCNAs in all nine samples at 3% tumour purity and as low as 0.5% purity in one sample. Crucially, we show that conliga's hidden state information can be used to decide when a sample is abnormal or normal, whereas CNVkit and QDNAseq cannot provide this critical information. CONCLUSIONS We show that conliga provides high-resolution SCNA profiles using a convenient, low-cost assay. We believe conliga makes FAST-SeqS a more clinically valuable assay as well as a useful research tool, enabling inexpensive and fast copy number profiling of pre-malignant and cancer samples.
Collapse
Affiliation(s)
- Sam Abujudeh
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK.
| | - Sebastian S Zeki
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK. .,Department of Gastroenterology, Guy's and St Thomas' NHS Trust, London, SE1 7EH, UK.
| | | | - Mark Pusung
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Jamie M J Weaver
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK.,Department of Medical Oncology, The Christie NHS Foundation Trust, Manchester, M20 4TX, UK
| | - Xiaodun Li
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Ayesha Noorani
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Andrew J Metz
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Jan Bornschein
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Lawrence Bower
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK
| | - Ahmad Miremadi
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Rebecca C Fitzgerald
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK.
| | - Edward R Morrissey
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK. .,Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| | - Andy G Lynch
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK. .,School of Mathematics and Statistics/School of Medicine, University of St Andrews, St Andrews, UK.
| |
Collapse
|
2
|
Affiliation(s)
- Edward R Morrissey
- Cancer Research UK - Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Louis Vermeulen
- Laboratory for Experimental Oncology and Radiobiology, Center for Experimental Molecular Medicine, Academic Medical Center, Amsterdam, The Netherlands
| |
Collapse
|
3
|
Thomas L, Hodgson DA, Wentzel A, Nieselt K, Ellingsen TE, Moore J, Morrissey ER, Legaie R, Wohlleben W, Rodríguez-García A, Martín JF, Burroughs NJ, Wellington EMH, Smith MCM. Metabolic switches and adaptations deduced from the proteomes of Streptomyces coelicolor wild type and phoP mutant grown in batch culture. Mol Cell Proteomics 2011; 11:M111.013797. [PMID: 22147733 DOI: 10.1074/mcp.m111.013797] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Bacteria in the genus Streptomyces are soil-dwelling oligotrophs and important producers of secondary metabolites. Previously, we showed that global messenger RNA expression was subject to a series of metabolic and regulatory switches during the lifetime of a fermentor batch culture of Streptomyces coelicolor M145. Here we analyze the proteome from eight time points from the same fermentor culture and, because phosphate availability is an important regulator of secondary metabolite production, compare this to the proteome of a similar time course from an S. coelicolor mutant, INB201 (ΔphoP), defective in the control of phosphate utilization. The proteomes provide a detailed view of enzymes involved in central carbon and nitrogen metabolism. Trends in protein expression over the time courses were deduced from a protein abundance index, which also revealed the importance of stress pathway proteins in both cultures. As expected, the ΔphoP mutant was deficient in expression of PhoP-dependent genes, and several putatively compensatory metabolic and regulatory pathways for phosphate scavenging were detected. Notably there is a succession of switches that coordinately induce the production of enzymes for five different secondary metabolite biosynthesis pathways over the course of the batch cultures.
Collapse
Affiliation(s)
- Louise Thomas
- Institute of Medical Sciences, University of Aberdeen, Foresterhill, Aberdeen AB25 2ZD, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Morrissey ER, Juárez MA, Denby KJ, Burroughs NJ. On reverse engineering of gene interaction networks using time course data with repeated measurements. ACTA ACUST UNITED AC 2010; 26:2305-12. [PMID: 20639410 DOI: 10.1093/bioinformatics/btq421] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Gene expression measurements are the most common data source for reverse engineering gene interaction networks. When dealing with destructive sampling in time course experiments, it is common to average any available measurements for each time point and to treat this as the actual time series data for fitting the network, neglecting the variability contained in the repeated measurements. Proceeding in such a way can affect the retrieved network topology. RESULTS We propose a fully Bayesian method for reverse engineering a gene interaction network, based on time course data with repeated measurements. The observations are treated as surrogate measurements of the underlying gene expression. As these measurements often contain outliers, we use a non-Gaussian specification for dealing with measurement error. The network interactions are assumed linear and an autoregressive model is specified, augmented with indicator variables that allow inference on the topology of the network. We analyse two in silico and one in vivo experiments, the latter dealing with the circadian clock in Arabidopsis thaliana. A systematic attenuation of the estimated regulation strengths and a concomitant overestimation of their precision is demonstrated when measurement error is disregarded. Thus, a clear improvement in the inferred topology for the synthetic datasets is demonstrated when this is included. Also, the influence of outliers in the retrieved network is demonstrated when using the in vivo data. AVAILABILITY Matlab code and data used in the article are available from http://go.warwick.ac.uk/majuarez/home/materials.
Collapse
Affiliation(s)
- E R Morrissey
- Warwick Systems Biology Centre and Warwick HRI, University of Warwick, Coventry, UK
| | | | | | | |
Collapse
|
5
|
Abstract
Pomelo II (http://pomelo2.bioinfo.cnio.es) is an open-source, web-based, freely available tool for the analysis of gene (and protein) expression and tissue array data. Pomelo II implements: permutation-based tests for class comparisons (t-test, ANOVA) and regression; survival analysis using Cox model; contingency table analysis with Fisher's exact test; linear models (of which t-test and ANOVA are especial cases) that allow additional covariates for complex experimental designs and use empirical Bayes moderated statistics. Permutation-based and Cox model analysis use parallel computing, which permits taking advantage of multicore CPUs and computing clusters. Access to, and further analysis of, additional biological information and annotations (PubMed references, Gene Ontology terms, KEGG and Reactome pathways) are available either for individual genes (from clickable links in tables and figures) or sets of genes. The source code is available, allowing for extending and reusing the software. A comprehensive test suite is also available, and covers both the user interface and the numerical results. The possibility of including additional covariates, parallelization of computation, open-source availability of the code and comprehensive testing suite make Pomelo II a unique tool.
Collapse
Affiliation(s)
- Edward R Morrissey
- Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor FernA!ndez Almagro 3, Madrid, 28029, Spain
| | | |
Collapse
|
6
|
Díaz-Uriarte R, Alibés A, Morrissey ER, Cañada A, Rueda OM, Neves ML. Asterias: integrated analysis of expression and aCGH data using an open-source, web-based, parallelized software suite. Nucleic Acids Res 2007; 35:W75-80. [PMID: 17488846 PMCID: PMC1933128 DOI: 10.1093/nar/gkm229] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Asterias (http://www.asterias.info) is an open-source, web-based, suite for the analysis of gene expression and aCGH data. Asterias implements validated statistical methods, and most of the applications use parallel computing, which permits taking advantage of multicore CPUs and computing clusters. Access to, and further analysis of, additional biological information and annotations (PubMed references, Gene Ontology terms, KEGG and Reactome pathways) are available either for individual genes (from clickable links in tables and figures) or sets of genes. These applications cover from array normalization to imputation and preprocessing, differential gene expression analysis, class and survival prediction and aCGH analysis. The source code is available, allowing for extention and reuse of the software. The links and analysis of additional functional information, parallelization of computation and open-source availability of the code make Asterias a unique suite that can exploit features specific to web-based environments.
Collapse
Affiliation(s)
- Ramón Díaz-Uriarte
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain.
| | | | | | | | | | | |
Collapse
|
7
|
Alibés A, Morrissey ER, Cañada A, Rueda OM, Casado D, Yankilevich P, Díaz-Uriarte R. Asterias: a parallelized web-based suite for the analysis of expression and aCGH data. Cancer Inform 2007; 3:1-9. [PMID: 19455230 PMCID: PMC2675829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The analysis of expression and CGH arrays plays a central role in the study of complex diseases, especially cancer, including finding markers for early diagnosis and prognosis, choosing an optimal therapy, or increasing our understanding of cancer development and metastasis. Asterias (http://www.asterias.info) is an integrated collection of freely-accessible web tools for the analysis of gene expression and aCGH data. Most of the tools use parallel computing (via MPI) and run on a server with 60 CPUs for computation; compared to a desktop or server-based but not parallelized application, parallelization provides speed ups of factors up to 50. Most of our applications allow the user to obtain additional information for user-selected genes (chromosomal location, PubMed ids, Gene Ontology terms, etc.) by using clickable links in tables and/or figures. Our tools include: normalization of expression and aCGH data (DNMAD); converting between different types of gene/clone and protein identifiers (IDconverter/IDClight); filtering and imputation (preP); finding differentially expressed genes related to patient class and survival data (Pomelo II); searching for models of class prediction (Tnasas); using random forests to search for minimal models for class prediction or for large subsets of genes with predictive capacity (GeneSrF); searching for molecular signatures and predictive genes with survival data (SignS); detecting regions of genomic DNA gain or loss (ADaCGH). The capability to send results between different applications, access to additional functional information, and parallelized computation make our suite unique and exploit features only available to web-based applications.
Collapse
|
8
|
Alibés A, Morrissey ER, Cañada A, Rueda OM, Casado D, Yankilevich P, Díaz-Uriarte R. Asterias: A Parallelized Web-based Suite for the Analysis of Expression and aCGH Data. Cancer Inform 2007. [DOI: 10.1177/117693510700300007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The analysis of expression and CGH arrays plays a central role in the study of complex diseases, especially cancer, including finding markers for early diagnosis and prognosis, choosing an optimal therapy, or increasing our understanding of cancer development and metastasis. Asterias ( http://www.asterias.info ) is an integrated collection of freely-accessible web tools for the analysis of gene expression and aCGH data. Most of the tools use parallel computing (via MPI) and run on a server with 60 CPUs for computation; compared to a desktop or server-based but not parallelized application, parallelization provides speed ups of factors up to 50. Most of our applications allow the user to obtain additional information for user-selected genes (chromosomal location, PubMed ids, Gene Ontology terms, etc.) by using clickable links in tables and/or figures. Our tools include: normalization of expression and aCGH data (DNMAD); converting between different types of gene/clone and protein identifiers (IDconverter/IDClight); filtering and imputation (preP); finding differentially expressed genes related to patient class and survival data (Pomelo II); searching for models of class prediction (Tnasas); using random forests to search for minimal models for class prediction or for large subsets of genes with predictive capacity (GeneSrF); searching for molecular signatures and predictive genes with survival data (SignS); detecting regions of genomic DNA gain or loss (ADaCGH). The capability to send results between different applications, access to additional functional information, and parallelized computation make our suite unique and exploit features only available to web-based applications.
Collapse
Affiliation(s)
- Andreu Alibés
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - Edward R. Morrissey
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - Andrés Cañada
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - Oscar M. Rueda
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - David Casado
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - Patricio Yankilevich
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| | - Ramón Díaz-Uriarte
- Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain
| |
Collapse
|
9
|
Abstract
The increasing prevalence of multiple drug use among adolescents presents researchers with complex conceptual and measurement issues. This work examines definitions of and indicators of multiple drug use involvement, as well as the interrelationships between indices of drug involvement incorporating different dimensions. Data regarding the relationship between patterns of alcohol use and illicit drug use in a population of 1473 teenagers interviewed in a northwestern metropolitan community indicate that : (1) regardless of the dimensions incorporated into four different indices of drug involvement examined, intercorrelations between indices were strong; (2) although the correlations between indices of drug involvement and overall levels of alcohol intake were uniformly weak, it was shown that levels of drug involvement vary directly with the maximum volume of alcohol used. Although the results of the current investigation must be viewed cautiously because of the concentration of heavy users of both alcohol and drugs in the population upon which the research is based, among adolescents, the spacing of drinking episodes and the patterning of alcohol intake may be more important determinants of polydrug involvement thant the quantity of alcohol consumed.
Collapse
|
10
|
Schuckit MA, Morrissey ER. Psychiatric problems in women admitted to an alcoholic detoxification center. Am J Psychiatry 1979; 136:611-7. [PMID: 426154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The authors studied 293 consecutive female admissions to a public detoxification facility. One-half had a primary diagnosis of alcoholism; the rest had primary diagnosis of affective disorder (14%), antisocial personality (14%), drug abuse (6%) or did not meet the criteria for alcoholism (13%). Each subgroup had unique social backgrounds and characteristic histories of involvement with alcohol, drugs, and antisocial problems. The primary alcoholic woman in the detoxification center resembles the primary alcoholic man in the same setting. The authors underscore the importance of socioeconomic strata and primary versus secondary diagnosis in understanding the prognosis for alcoholics.
Collapse
|
11
|
Schuckit MA, Morrissey ER. Drug abuse among alcoholic women. Am J Psychiatry 1979; 136:607-11. [PMID: 426153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Of 293 women in an alcoholic treatment center, two-thirds have received prescriptions for drugs of potential abuse, usually hypnotic and antianxiety drugs. One-third of the women admitted abusing substances; 80% of these subjects got prescriptions for potential drugs of abuse while actively abusing. These figures are alarming in light of the lack of evidence that antianxiety drugs, hypnotics, and stimulants are effective in treating alcoholics and drug abusers. Drug-misusing women in this sample reported more suicide attempts and early antisocial problems and had received more psychiatric care than nonabusers. The authors urge that hypnotics, antianxiety drugs, and stimulants almost never be prescribed to outpatient alcoholics and that analgesics be prescribed only with great care.
Collapse
|
12
|
Abstract
In the roles of student and homemaker, youthful and female problem drinkers can easily hide alcoholism. Compared with adult male alcoholics, their intake of alcohol is less but they are more likely to use other drugs concomitantly. An understanding of their unique characteristics will help the physician recognize patients in these groups with alcohol-related problems.
Collapse
|
13
|
Abstract
No strong temporal associations were found between the occurrence of stressful life events and the onset of alcohol problems.
Collapse
|
14
|
Schuckit MA, Morrissey ER. Propoxyphene and Phencyclidine (PCP) use in adolescents. J Clin Psychiatry 1978; 39:7-13. [PMID: 580263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Trained interviewers utilized a structured research instrument to analyze drug use patterns, personal and familial psychiatric problems, and social backgrounds of a consecutive series of 355 adolescent referred by courts to alcohol counselling and education centers in King County. Within this sample, 4% of the subjects used propoxyphene alone, 16% reported use of phencyclidine (PCP) alone, and 6% reported abuse of both substances. The data consistently reveals that the use of one of these more unusual substances is likely to occur in people with more antisocial, drug, and alcohol problems. Use of both drugs was associated with the most pervasive antisocial history, with over one-third of these subjects fulfilling strict criteria for the antisocial personality. The treatment and prognostic implications of these findings are discussed.
Collapse
|
15
|
Abstract
The relationship between drug and alcohol use and self-esteem was investigated using students living in a co-ed dormitory. A questionnaire was distributed containing four sections including demographic information, the Coopersmith Self-esteem Inventory (1967), an A Priori Identity Crisis scale, and a frequency of drug/alcohol use scale. 66% of the questionnaires were returned, yielding a sample of 171 males and 219 females. Heavy use of alcohol was related to low self-esteem. No relationship, however, was found between drug use and self-esteem. The relationship between drug and alcohol use and psychopathology is discussed as being a continuum.
Collapse
|