1
|
Zhang S, Cao B, Su T, Wu Y, Feng Z, Xiong J, Zhang TY. Crystallographic phase identifier of a convolutional self-attention neural network (CPICANN) on powder diffraction patterns. IUCRJ 2024; 11:634-642. [PMID: 38958016 PMCID: PMC11220882 DOI: 10.1107/s2052252524005323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 06/05/2024] [Indexed: 07/04/2024]
Abstract
Spectroscopic data, particularly diffraction data, are essential for materials characterization due to their comprehensive crystallographic information. The current crystallographic phase identification, however, is very time consuming. To address this challenge, we have developed a real-time crystallographic phase identifier based on a convolutional self-attention neural network (CPICANN). Trained on 692 190 simulated powder X-ray diffraction (XRD) patterns from 23 073 distinct inorganic crystallographic information files, CPICANN demonstrates superior phase-identification power. Single-phase identification on simulated XRD patterns yields 98.5 and 87.5% accuracies with and without elemental information, respectively, outperforming JADE software (68.2 and 38.7%, respectively). Bi-phase identification on simulated XRD patterns achieves 84.2 and 51.5% accuracies, respectively. In experimental settings, CPICANN achieves an 80% identification accuracy, surpassing JADE software (61%). Integration of CPICANN into XRD refinement software will significantly advance the cutting-edge technology in XRD materials characterization.
Collapse
Affiliation(s)
- Shouyang Zhang
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
| | - Bin Cao
- Guangzhou Municipal Key Laboratory of Materials Informatics, Sustainable Energy and Environment Thrust, Advanced Materials Thrust, Hong Kong University of Science and Technology (Guangzhou), Guangzhou511400, Guangdong, People’s Republic of China
| | - Tianhao Su
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
| | - Yue Wu
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
| | - Zhenjie Feng
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
| | - Jie Xiong
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
| | - Tong-Yi Zhang
- Materials Genome InstituteShanghai UniversityShanghai200444People’s Republic of China
- Guangzhou Municipal Key Laboratory of Materials Informatics, Sustainable Energy and Environment Thrust, Advanced Materials Thrust, Hong Kong University of Science and Technology (Guangzhou), Guangzhou511400, Guangdong, People’s Republic of China
| |
Collapse
|
2
|
De Iaco S, Cappello C, Congedi A, Palma M. Multivariate Modeling for Spatio-Temporal Radon Flux Predictions. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1104. [PMID: 37510051 PMCID: PMC10378277 DOI: 10.3390/e25071104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 06/16/2023] [Accepted: 06/27/2023] [Indexed: 07/30/2023]
Abstract
Nowadays, various fields in environmental sciences require the availability of appropriate techniques to exploit the information given by multivariate spatial or spatio-temporal observations. In particular, radon flux data which are of high interest to monitor greenhouse gas emissions and to assess human exposure to indoor radon are determined by the deposit of uranium and radio (precursor elements). Furthermore, they are also affected by various atmospheric variables, such as humidity, temperature, precipitation and evapotranspiration. To this aim, a significant role can be recognized to the tools of multivariate geostatistics which supports the modeling and prediction of variables under study. In this paper, the spatio-temporal distribution of radon flux densities over the Veneto Region (Italy) and its estimation at unsampled points in space and time are discussed. In particular, the spatio-temporal linear coregionalization model is identified on the basis of the joint diagonalization of the empirical covariance matrices evaluated at different spatio-temporal lags and is used to produce predicted radon flux maps for different months. Probability maps, that the radon flux density in the upcoming months is greater than three historical statistics, are then built. This might be of interest especially in summer months when the risk of radon exhalation is higher. Moreover, a comparison with respect to alternative models in the univariate and multivariate context is provided.
Collapse
Affiliation(s)
- Sandra De Iaco
- National Future Center of Biodiversity, 90133 Palermo, Italy
- DES-Sect. of Mathematics and Statistics, University of Salento, 73100 Lecce, Italy
- National Center of High Performance Computing, Big Data and Quantum Computing, 40121 Bologna, Italy
| | - Claudia Cappello
- DES-Sect. of Mathematics and Statistics, University of Salento, 73100 Lecce, Italy
| | - Antonella Congedi
- DES-Sect. of Mathematics and Statistics, University of Salento, 73100 Lecce, Italy
| | - Monica Palma
- DES-Sect. of Mathematics and Statistics, University of Salento, 73100 Lecce, Italy
- National Center of High Performance Computing, Big Data and Quantum Computing, 40121 Bologna, Italy
| |
Collapse
|
3
|
Jiang B. Blind signal separation based on signal minimum projection. SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER APPLICATIONS AND INFORMATION SYSTEMS (ISCAIS 2023) 2023. [DOI: 10.1117/12.2683576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
4
|
Ginebaugh SP, Hagner M, Ray A, Erzurum SC, Comhair SAA, Denlinger LC, Jarjour NN, Castro M, Woodruff PG, Christenson SA, Bleecker ER, Meyers DA, Hastie AT, Moore WC, Mauger DT, Israel E, Levy BD, Wenzel SE, Camiolo MJ. Bronchial epithelial cell transcriptional responses to inhaled corticosteroids dictate severe asthmatic outcomes. J Allergy Clin Immunol 2023; 151:1513-1524. [PMID: 36796454 PMCID: PMC10257752 DOI: 10.1016/j.jaci.2023.01.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 01/08/2023] [Accepted: 01/12/2023] [Indexed: 02/16/2023]
Abstract
BACKGROUND Inhaled corticosteroids (CSs) are the backbone of asthma treatment, improving quality of life, exacerbation rates, and mortality. Although effective for most, a subset of patients with asthma experience CS-resistant disease despite receiving high-dose medication. OBJECTIVE We sought to investigate the transcriptomic response of bronchial epithelial cells (BECs) to inhaled CSs. METHODS Independent component analysis was performed on datasets, detailing the transcriptional response of BECs to CS treatment. The expression of these CS-response components was examined in 2 patient cohorts and investigated in relation to clinical parameters. Supervised learning was used to predict BEC CS responses using peripheral blood gene expression. RESULTS We identified a signature of CS response that was closely correlated with CS use in patients with asthma. Participants could be separated on the basis of CS-response genes into groups with high and low signature expression. Patients with low expression of CS-response genes, particularly those with a severe asthma diagnosis, showed worse lung function and quality of life. These individuals demonstrated enrichment for T-lymphocyte infiltration in endobronchial brushings. Supervised machine learning identified a 7-gene signature from peripheral blood that reliably identified patients with poor CS-response expression in BECs. CONCLUSIONS Loss of CS transcriptional responses within bronchial epithelium was related to impaired lung function and poor quality of life, particularly in patients with severe asthma. These individuals were identified using minimally invasive blood sampling, suggesting these findings may enable earlier triage to alternative treatments.
Collapse
Affiliation(s)
- Scott P Ginebaugh
- Integrative Systems Biology, University of Pittsburgh, Pittsburgh, Pa
| | | | - Anuradha Ray
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pa; Department of Immunology, University of Pittsburgh School of Medicine, Pittsburgh, Pa
| | | | | | - Loren C Denlinger
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wis
| | - Nizar N Jarjour
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wis
| | - Mario Castro
- University of Kansas School of Medicine, Kansas City, Mo
| | - Prescott G Woodruff
- University of California, San Francisco School of Medicine, San Francisco, Calif
| | | | - Eugene R Bleecker
- Division for Genetics, Genomics and Personalized Medicine, University of Arizona College of Medicine, Tucson, Ariz
| | - Deborah A Meyers
- Division for Genetics, Genomics and Personalized Medicine, University of Arizona College of Medicine, Tucson, Ariz
| | | | - Wendy C Moore
- Wake Forest University School of Medicine, Winston-Salem, NC
| | | | - Elliot Israel
- Department of Medicine, Divisions of Pulmonary & Critical Care Medicine & Allergy & Immunology, Brigham & Women's Hospital, Harvard Medical School, Boston, Mass
| | - Bruce D Levy
- Department of Medicine, Divisions of Pulmonary & Critical Care Medicine & Allergy & Immunology, Brigham & Women's Hospital, Harvard Medical School, Boston, Mass
| | - Sally E Wenzel
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pa; Department of Immunology, University of Pittsburgh School of Medicine, Pittsburgh, Pa; Department of Environmental Medicine and Occupational Health, Graduate School of Public Health, University of Pittsburgh School of Medicine, Pittsburgh, Pa
| | | |
Collapse
|
5
|
Muehlmann C, De Iaco S, Nordhausen K. Blind recovery of sources for multivariate space-time random fields. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT : RESEARCH JOURNAL 2022; 37:1593-1613. [PMID: 37041981 PMCID: PMC10081984 DOI: 10.1007/s00477-022-02348-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 11/14/2022] [Indexed: 06/19/2023]
Abstract
With advances in modern worlds technology, huge datasets that show dependencies in space as well as in time occur frequently in practice. As an example, several monitoring stations at different geographical locations track hourly concentration measurements of a number of air pollutants for several years. Such a dataset contains thousands of multivariate observations, thus, proper statistical analysis needs to account for dependencies in space and time between and among the different monitored variables. To simplify the consequent multivariate spatio-temporal statistical analysis it might be of interest to detect linear transformations of the original observations that result in straightforward interpretative, spatio-temporally uncorrelated processes that are also highly likely to have a real physical meaning. Blind source separation (BSS) represents a statistical methodology which has the aim to recover so-called latent processes, that exactly meet the former requirements. BSS was already successfully used in sole temporal and sole spatial applications with great success, but, it was not yet introduced for the spatio-temporal case. In this contribution, a reasonable and innovative generalization of BSS for multivariate space-time random fields (stBSS), under second-order stationarity, is proposed, together with two space-time extensions of the well-known algorithms for multiple unknown signals extraction (stAMUSE) and the second-order blind identification (stSOBI) which solve the formulated problem. Furthermore, symmetry and separability properties of the model are elaborated and connections to the space-time linear model of coregionalization and to the classical principal component analysis are drawn. Finally, the usefulness of the new methods is shown in a thorough simulation study and on a real environmental application.
Collapse
Affiliation(s)
- C. Muehlmann
- Institute of Statistics and Mathematical Methods in Economics, TU Wien / Technische Universität Wien / Vienna University of Technology, Vienna, Austria
| | - S. De Iaco
- Department of Economic Sciences-Sect. of Mathematics and Statistics, University of Salento, Lecce, Italy
- Centro Nazionale di Biodiversità, University of Salento, Lecce, Italy
| | - K. Nordhausen
- Department of Mathematics and Statistics, University of Jyväskylä, Jyväskylä, Finland
| |
Collapse
|
6
|
Duda-Goławska J, Imbir KK, Żygierewicz J. ERP Analysis Using a Multi-Channel Matching Pursuit Algorithm. Neuroinformatics 2022; 20:827-862. [PMID: 35286575 DOI: 10.1007/s12021-022-09575-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2022] [Indexed: 12/31/2022]
Abstract
In this study, we propose a new algorithm for analysing event-related components observed in EEG signals in psychological experiments. We investigate its capabilities and limitations. The algorithm is based on multivariate matching pursuit and clustering. It is aimed to find patterns in EEG signals which are similar across different experimental conditions, but it allows for variations in amplitude and slight variability in topography. The method proved to yield expected results in numerical simulations. For the real data coming from an emotional categorisation task experiment, we obtained two indications. First, the method can be used as a specific filter that reduces the variability of components, as defined classically, within each experimental condition. Second, equivalent dipoles fitted to items of the activity clusters identified by the algorithm localise in compact brain areas related to the task performed by the subjects across experimental conditions. Thus this activity may be studied as candidates for hypothetical latent components. The proposed algorithm is a promising new tool in ERP studies, which deserves further experimental evaluations.
Collapse
Affiliation(s)
- Joanna Duda-Goławska
- Faculty of Physics, University of Warsaw, L. Pasteura 5 Street, Warsaw, 02-093, Poland.
| | - Kamil K Imbir
- Faculty of Psychology, University of Warsaw, Stawki 5/7 Street, Warsaw, 10-587, Poland
| | - Jarosław Żygierewicz
- Faculty of Physics, University of Warsaw, L. Pasteura 5 Street, Warsaw, 02-093, Poland
| |
Collapse
|
7
|
Pan Y, Matilainen M, Taskinen S, Nordhausen K. A review of second-order blind identification methods. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL STATISTICS 2022; 14:e1550. [PMID: 36249858 PMCID: PMC9540980 DOI: 10.1002/wics.1550] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 01/06/2021] [Accepted: 01/07/2021] [Indexed: 11/24/2022]
Abstract
Second-order source separation (SOS) is a data analysis tool which can be used for revealing hidden structures in multivariate time series data or as a tool for dimension reduction. Such methods are nowadays increasingly important as more and more high-dimensional multivariate time series data are measured in numerous fields of applied science. Dimension reduction is crucial, as modeling such high-dimensional data with multivariate time series models is often impractical as the number of parameters describing dependencies between the component time series is usually too high. SOS methods have their roots in the signal processing literature, where they were first used to separate source signals from an observed signal mixture. The SOS model assumes that the observed time series (signals) is a linear mixture of latent time series (sources) with uncorrelated components. The methods make use of the second-order statistics-hence the name "second-order source separation." In this review, we discuss the classical SOS methods and their extensions to more complex settings. An example illustrates how SOS can be performed. This article is categorized under:Statistical Models > Time Series ModelsStatistical and Graphical Methods of Data Analysis > Dimension ReductionData: Types and Structure > Time Series, Stochastic Processes, and Functional Data.
Collapse
Affiliation(s)
- Yan Pan
- Department of Mathematics and StatisticsUniversity of JyväskyläFinland
| | - Markus Matilainen
- Turku PET CentreTurku University Hospital and University of TurkuFinland
| | - Sara Taskinen
- Department of Mathematics and StatisticsUniversity of JyväskyläFinland
| | - Klaus Nordhausen
- Department of Mathematics and StatisticsUniversity of JyväskyläFinland
- Institute of Statistics and Mathematical Methods in Economics, TUViennaAustria
| |
Collapse
|
8
|
|
9
|
Tounsi M, Zitouni M. Matrix-variate Lindley distributions and its applications. BRAZ J PROBAB STAT 2021. [DOI: 10.1214/21-bjps504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Mariem Tounsi
- Computer Engineering and Applied Mathematics Department, Sfax National Engineering School, B.P. 1173, 3038, Tunisia
| | - Mouna Zitouni
- Applied Mathematics Department, Sfax Faculty of Sciences, B.P. 802, 3038, Tunisia
| |
Collapse
|
10
|
Cappello C, De Iaco S, Palma M. Computational advances for spatio-temporal multivariate environmental models. Comput Stat 2021. [DOI: 10.1007/s00180-021-01132-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
AbstractIn multivariate Geostatistics, the linear coregionalization model (LCM) has been widely used over the last decades, in order to describe the spatial dependence which characterizes two or more variables of interest. However, in spatio-temporal multiple modeling, the identification of the main elements of a space–time linear coregionalization model (ST-LCM), as well as of the latent structures underlying the analyzed phenomenon, represents a tough task. In this paper, some computational advances which support the selection of an ST-LCM are described, gathering all the necessary steps which allow the analyst to easily and properly detect the basic space–time components for the phenomenon under study. The implemented algorithm is applied on space–time air quality data measured in Scotland in 2017.
Collapse
|
11
|
Virta J, Lietzén N, Ilmonen P, Nordhausen K. Fast tensorial JADE. Scand Stat Theory Appl 2021; 48:164-187. [PMID: 33664538 PMCID: PMC7891388 DOI: 10.1111/sjos.12445] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/24/2019] [Accepted: 01/05/2020] [Indexed: 11/28/2022]
Abstract
We propose a novel method for tensorial-independent component analysis. Our approach is based on TJADE and k-JADE, two recently proposed generalizations of the classical JADE algorithm. Our novel method achieves the consistency and the limiting distribution of TJADE under mild assumptions and at the same time offers notable improvement in computational speed. Detailed mathematical proofs of the statistical properties of our method are given and, as a special case, a conjecture on the properties of k-JADE is resolved. Simulations and timing comparisons demonstrate remarkable gain in speed. Moreover, the desired efficiency is obtained approximately for finite samples. The method is applied successfully to large-scale video data, for which neither TJADE nor k-JADE is feasible. Finally, an experimental procedure is proposed to select the values of a set of tuning parameters. Supplementary material including the R-code for running the examples and the proofs of the theoretical results is available online.
Collapse
Affiliation(s)
- Joni Virta
- Department of Mathematics and Systems AnalysisAalto University School of Science
- Department of Mathematics and StatisticsUniversity of Turku
| | - Niko Lietzén
- Department of Mathematics and Systems AnalysisAalto University School of Science
| | - Pauliina Ilmonen
- Department of Mathematics and Systems AnalysisAalto University School of Science
| | - Klaus Nordhausen
- Institute of Statistics & Mathematical Methods in EconomicsVienna University of Technology
| |
Collapse
|
12
|
Lee S, Shen H, Truong Y. Sampling Properties of color Independent Component Analysis. J MULTIVARIATE ANAL 2020; 181. [PMID: 33162620 DOI: 10.1016/j.jmva.2020.104692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Independent Component Analysis (ICA) offers an effective data-driven approach for blind source extraction encountered in many signal and image processing problems. Although many ICA methods have been developed, they have received relatively little attention in the statistics literature, especially in terms of rigorous theoretical investigation for statistical inference. The current paper aims at narrowing this gap and investigates the statistical sampling properties of the colorICA (cICA) method. The cICA incorporates the correlation structure within sources through parametric time series models in the frequency domain and outperforms several existing ICA alternatives numerically. We establish the consistency and asymptotic normality of the cICA estimates, which then enables statistical inference based on the estimates. These asymptotic properties are further validated using simulation studies.
Collapse
Affiliation(s)
- Seonjoo Lee
- Department of Psychiatry and Biostatistics, Columbia University, New York, NY, USA.,Mental Health Data Science, New York State Psychiatric Institute and Research Foundation for Mental Hygiene, Inc., New York, NY, USA
| | - Haipeng Shen
- Innovation and Information Management, Faculty of Business and Economics, University of Hong Kong, Hong Kong, China
| | - Young Truong
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
13
|
Li A, Herbst RH, Canner D, Schenkel JM, Smith OC, Kim JY, Hillman M, Bhutkar A, Cuoco MS, Rappazzo CG, Rogers P, Dang C, Jerby-Arnon L, Rozenblatt-Rosen O, Cong L, Birnbaum M, Regev A, Jacks T. IL-33 Signaling Alters Regulatory T Cell Diversity in Support of Tumor Development. Cell Rep 2020; 29:2998-3008.e8. [PMID: 31801068 PMCID: PMC6990979 DOI: 10.1016/j.celrep.2019.10.120] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 09/07/2019] [Accepted: 10/29/2019] [Indexed: 02/08/2023] Open
Abstract
Regulatory T cells (Tregs) can impair anti-tumor immune responses and are associated with poor prognosis in multiple cancer types. Tregs in human tumors span diverse transcriptional states distinct from those of peripheral Tregs, but their contribution to tumor development remains unknown. Here, we use single-cell RNA sequencing (RNA-seq) to longitudinally profile dynamic shifts in the distribution of Tregs in a genetically engineered mouse model of lung adenocarcinoma. In this model, interferon-responsive Tregs are more prevalent early in tumor development, whereas a specialized effector phenotype characterized by enhanced expression of the interleukin-33 receptor ST2 is predominant in advanced disease. Treg-specific deletion of ST2 alters the evolution of effector Treg diversity, increases infiltration of CD8+ T cells into tumors, and decreases tumor burden. Our study shows that ST2 plays a critical role in Treg-mediated immunosuppression in cancer, highlighting potential paths for therapeutic intervention.
Collapse
Affiliation(s)
- Amy Li
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA; Harvard Medical School, 25 Shattuck Street, Boston, MA 02115, USA
| | - Rebecca H Herbst
- Harvard Medical School, 25 Shattuck Street, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - David Canner
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Jason M Schenkel
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Pathology, Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Olivia C Smith
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA
| | - Jonathan Y Kim
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA
| | - Michelle Hillman
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA
| | - Arjun Bhutkar
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA
| | - Michael S Cuoco
- Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - C Garrett Rappazzo
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biological Engineering, Massachusetts Institute of Technology, 21 Ames Street, Cambridge, MA 02142, USA
| | - Patricia Rogers
- Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Celeste Dang
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA
| | - Livnat Jerby-Arnon
- Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | | | - Le Cong
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Michael Birnbaum
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biological Engineering, Massachusetts Institute of Technology, 21 Ames Street, Cambridge, MA 02142, USA
| | - Aviv Regev
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02139, USA; Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| |
Collapse
|
14
|
Ning J, Fang M, Ran W, Chen C, Li Y. Rapid Multi-Sensor Feature Fusion Based on Non-Stationary Kernel JADE for the Small-Amplitude Hunting Monitoring of High-Speed Trains. SENSORS 2020; 20:s20123457. [PMID: 32570955 PMCID: PMC7349269 DOI: 10.3390/s20123457] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Revised: 06/07/2020] [Accepted: 06/16/2020] [Indexed: 11/16/2022]
Abstract
Joint Approximate Diagonalization of Eigen-matrices (JADE) cannot deal with non-stationary data. Therefore, in this paper, a method called Non-stationary Kernel JADE (NKJADE) is proposed, which can extract non-stationary features and fuse multi-sensor features precisely and rapidly. In this method, the non-stationarity of the data is considered and the data from multi-sensor are used to fuse the features efficiently. The method is compared with EEMD-SVD-LTSA and EEMD-JADE using the bearing fault data of CWRU, and the validity of the method is verified. Considering that the vibration signals of high-speed trains are typically non-stationary, it is necessary to utilize a rapid feature fusion method to identify the evolutionary trends of hunting motions quickly before the phenomenon is fully manifested. In this paper, the proposed method is applied to identify the evolutionary trend of hunting motions quickly and accurately. Results verify that the accuracy of this method is much higher than that of the EEMD-JADE and EEMD-SVD-LTSA methods. This method can also be used to fuse multi-sensor features of non-stationary data rapidly.
Collapse
Affiliation(s)
- Jing Ning
- School of Mechanical Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China; (M.F.); (W.R.); (C.C.); (Y.L.)
- Technology and Equipment of Rail Transit Operation and Maintenance Key Laboratory of Sichuan Province, Chengdu 610031, Sichuan, China
- Correspondence: ; Tel./Fax: +86-28-87600690
| | - Mingkuan Fang
- School of Mechanical Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China; (M.F.); (W.R.); (C.C.); (Y.L.)
| | - Wei Ran
- School of Mechanical Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China; (M.F.); (W.R.); (C.C.); (Y.L.)
| | - Chunjun Chen
- School of Mechanical Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China; (M.F.); (W.R.); (C.C.); (Y.L.)
| | - Yanping Li
- School of Mechanical Engineering, Southwest Jiaotong University, Chengdu 610031, Sichuan, China; (M.F.); (W.R.); (C.C.); (Y.L.)
| |
Collapse
|
15
|
Nordhausen K, Fischer G, Filzmoser P. Blind Source Separation for Compositional Time Series. MATHEMATICAL GEOSCIENCES 2020; 53:905-924. [PMID: 34721726 PMCID: PMC8550155 DOI: 10.1007/s11004-020-09869-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 05/01/2020] [Indexed: 06/13/2023]
Abstract
Many geological phenomena are regularly measured over time to follow developments and changes. For many of these phenomena, the absolute values are not of interest, but rather the relative information, which means that the data are compositional time series. Thus, the serial nature and the compositional geometry should be considered when analyzing the data. Multivariate time series are already challenging, especially if they are higher dimensional, and latent variable models are a popular way to deal with this kind of data. Blind source separation techniques are well-established latent factor models for time series, with many variants covering quite different time series models. Here, several such methods and their assumptions are reviewed, and it is shown how they can be applied to high-dimensional compositional time series. Also, a novel blind source separation method is suggested which is quite flexible regarding the assumptions of the latent time series. The methodology is illustrated using simulations and in an application to light absorbance data from water samples taken from a small stream in Lower Austria.
Collapse
Affiliation(s)
- Klaus Nordhausen
- CSTAT - Computational Statistics Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstr. 7, 1040 Vienna, Austria
| | - Gregor Fischer
- CSTAT - Computational Statistics Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstr. 8-10, 1040 Vienna, Austria
| | - Peter Filzmoser
- CSTAT - Computational Statistics Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstr. 8-10, 1040 Vienna, Austria
| |
Collapse
|
16
|
Virta J, Li B, Nordhausen K, Oja H. Independent component analysis for multivariate functional data. J MULTIVARIATE ANAL 2020. [DOI: 10.1016/j.jmva.2019.104568] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
17
|
Miettinen J, Matilainen M, Nordhausen K, Taskinen S. Extracting Conditionally Heteroskedastic Components using Independent Component Analysis. JOURNAL OF TIME SERIES ANALYSIS 2020; 41:293-311. [PMID: 32508370 PMCID: PMC7266430 DOI: 10.1111/jtsa.12505] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Revised: 08/12/2019] [Accepted: 08/13/2019] [Indexed: 06/11/2023]
Abstract
In the independent component model, the multivariate data are assumed to be a mixture of mutually independent latent components. The independent component analysis (ICA) then aims at estimating these latent components. In this article, we study an ICA method which combines the use of linear and quadratic autocorrelations to enable efficient estimation of various kinds of stationary time series. Statistical properties of the estimator are studied by finding its limiting distribution under general conditions, and the asymptotic variances are derived in the case of ARMA-GARCH model. We use the asymptotic results and a finite sample simulation study to compare different choices of a weight coefficient. As it is often of interest to identify all those components which exhibit stochastic volatility features we suggest a test statistic for this problem. We also show that a slightly modified version of the principal volatility component analysis can be seen as an ICA method. Finally, we apply the estimators in analysing a data set which consists of time series of exchange rates of seven currencies to US dollar. Supporting information including proofs of the theorems is available online.
Collapse
Affiliation(s)
- Jari Miettinen
- Department of Signal Processing and AcousticsAalto UniversityHelsinkiFinland
| | - Markus Matilainen
- Department of Mathematics and StatisticsUniversity of TurkuTurkuFinland
- Turku PET CentreTurku University Hospital and University of TurkuFinland
| | - Klaus Nordhausen
- Institute of Statistics & Mathematical Methods in EconomicsVienna University of TechnologyWienAustria
| | - Sara Taskinen
- Department of Mathematics and StatisticsUniversity of JyvaskylaJyväskyläFinland
| |
Collapse
|
18
|
Bachoc F, Genton MG, Nordhausen K, Ruiz-Gazen A, Virta J. Spatial blind source separation. Biometrika 2020. [DOI: 10.1093/biomet/asz079] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Summary
Recently a blind source separation model was suggested for spatial data, along with an estimator based on the simultaneous diagonalization of two scatter matrices. The asymptotic properties of this estimator are derived here, and a new estimator based on the joint diagonalization of more than two scatter matrices is proposed. The asymptotic properties and merits of the novel estimator are verified in simulation studies. A real-data example illustrates application of the method.
Collapse
Affiliation(s)
- François Bachoc
- Institut de Mathématiques de Toulouse, Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse, France
| | - Marc G Genton
- Statistics Program, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Klaus Nordhausen
- Computational Statistics, Vienna University of Technology, Wiedner Hauptstr. 7, A-1040 Vienna, Austria
| | - Anne Ruiz-Gazen
- Toulouse School of Economics, University of Toulouse Capitole, 1, Esplanade de l’Université, 31080 Toulouse Cedex 06, France
| | - Joni Virta
- Department of Mathematics and Statistics, University of Turku, 20014 Turun yliopisto, Finland
| |
Collapse
|
19
|
Ng SR, Rideout WM, Akama-Garren EH, Bhutkar A, Mercer KL, Schenkel JM, Bronson RT, Jacks T. CRISPR-mediated modeling and functional validation of candidate tumor suppressor genes in small cell lung cancer. Proc Natl Acad Sci U S A 2020; 117:513-521. [PMID: 31871154 PMCID: PMC6955235 DOI: 10.1073/pnas.1821893117] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Small cell lung cancer (SCLC) is a highly aggressive subtype of lung cancer that remains among the most lethal of solid tumor malignancies. Recent genomic sequencing studies have identified many recurrently mutated genes in human SCLC tumors. However, the functional roles of most of these genes remain to be validated. Here, we have adapted the CRISPR-Cas9 system to a well-established murine model of SCLC to rapidly model loss-of-function mutations in candidate genes identified from SCLC sequencing studies. We show that loss of the gene p107 significantly accelerates tumor progression. Notably, compared with loss of the closely related gene p130, loss of p107 results in fewer but larger tumors as well as earlier metastatic spread. In addition, we observe differences in proliferation and apoptosis as well as altered distribution of initiated tumors in the lung, resulting from loss of p107 or p130 Collectively, these data demonstrate the feasibility of using the CRISPR-Cas9 system to model loss of candidate tumor suppressor genes in SCLC, and we anticipate that this approach will facilitate efforts to investigate mechanisms driving tumor progression in this deadly disease.
Collapse
Affiliation(s)
- Sheng Rong Ng
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - William M Rideout
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Elliot H Akama-Garren
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Arjun Bhutkar
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Kim L Mercer
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Jason M Schenkel
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Pathology, Brigham and Women's Hospital, Boston, MA 02115
| | - Roderick T Bronson
- Department of Pathology, Tufts University School of Veterinary Medicine, North Grafton, MA 01536
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139;
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
20
|
|
21
|
Jin Z, Risk BB, Matteson DS. Optimization and testing in linear non‐Gaussian component analysis. Stat Anal Data Min 2019. [DOI: 10.1002/sam.11403] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Ze Jin
- Statistical Science Cornell University New York
| | | | | |
Collapse
|
22
|
Virta J, Nordhausen K. Estimating the number of signals using principal component analysis. Stat (Int Stat Inst) 2019. [DOI: 10.1002/sta4.231] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Joni Virta
- Department of Mathematics and Systems AnalysisAalto University Espoo Finland
| | - Klaus Nordhausen
- Institute of Statistics & Mathematical Methods in EconomicsVienna University of Technology Vienna Austria
| |
Collapse
|
23
|
Matilainen M, Croux C, Nordhausen K, Oja H. Sliced average variance estimation for multivariate time series. STATISTICS-ABINGDON 2019. [DOI: 10.1080/02331888.2019.1605515] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- M. Matilainen
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
- Turku PET Centre, Turku, Finland
| | - C. Croux
- EDHEC Business School, Lille, France
| | - K. Nordhausen
- Institute of Statistics & Mathematical Methods in Economics, Vienna University of Technology, Wien, Austria
| | - H. Oja
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
| |
Collapse
|
24
|
Affiliation(s)
- Joni Virta
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
| | - Bing Li
- Department of Statistics, Pennsylvania State University, University Park, PA
| | - Klaus Nordhausen
- Institute of Statistics & Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
| | - Hannu Oja
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
| |
Collapse
|
25
|
Virta J, Li B, Nordhausen K, Oja H. Independent component analysis for tensor-valued data. J MULTIVARIATE ANAL 2017. [DOI: 10.1016/j.jmva.2017.09.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
26
|
Gocheva V, Naba A, Bhutkar A, Guardia T, Miller KM, Li CMC, Dayton TL, Sanchez-Rivera FJ, Kim-Kiselak C, Jailkhani N, Winslow MM, Del Rosario A, Hynes RO, Jacks T. Quantitative proteomics identify Tenascin-C as a promoter of lung cancer progression and contributor to a signature prognostic of patient survival. Proc Natl Acad Sci U S A 2017; 114:E5625-E5634. [PMID: 28652369 PMCID: PMC5514763 DOI: 10.1073/pnas.1707054114] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
The extracellular microenvironment is an integral component of normal and diseased tissues that is poorly understood owing to its complexity. To investigate the contribution of the microenvironment to lung fibrosis and adenocarcinoma progression, two pathologies characterized by excessive stromal expansion, we used mouse models to characterize the extracellular matrix (ECM) composition of normal lung, fibrotic lung, lung tumors, and metastases. Using quantitative proteomics, we identified and assayed the abundance of 113 ECM proteins, which revealed robust ECM protein signatures unique to fibrosis, primary tumors, or metastases. These analyses indicated significantly increased abundance of several S100 proteins, including Fibronectin and Tenascin-C (Tnc), in primary lung tumors and associated lymph node metastases compared with normal tissue. We further showed that Tnc expression is repressed by the transcription factor Nkx2-1, a well-established suppressor of metastatic progression. We found that increasing the levels of Tnc, via CRISPR-mediated transcriptional activation of the endogenous gene, enhanced the metastatic dissemination of lung adenocarcinoma cells. Interrogation of human cancer gene expression data revealed that high TNC expression correlates with worse prognosis for lung adenocarcinoma, and that a three-gene expression signature comprising TNC, S100A10, and S100A11 is a robust predictor of patient survival independent of age, sex, smoking history, and mutational load. Our findings suggest that the poorly understood ECM composition of the fibrotic and tumor microenvironment is an underexplored source of diagnostic markers and potential therapeutic targets for cancer patients.
Collapse
Affiliation(s)
- Vasilena Gocheva
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Alexandra Naba
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139;
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Arjun Bhutkar
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Talia Guardia
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Kathryn M Miller
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Carman Man-Chung Li
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Talya L Dayton
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Francisco J Sanchez-Rivera
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Caroline Kim-Kiselak
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Noor Jailkhani
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Monte M Winslow
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Amanda Del Rosario
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Richard O Hynes
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139;
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
27
|
Sasi NK, Bhutkar A, Lanning NJ, MacKeigan JP, Weinreich M. DDK Promotes Tumor Chemoresistance and Survival via Multiple Pathways. Neoplasia 2017; 19:439-450. [PMID: 28448802 PMCID: PMC5406526 DOI: 10.1016/j.neo.2017.03.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 03/10/2017] [Accepted: 03/13/2017] [Indexed: 12/12/2022] Open
Abstract
DBF4-dependent kinase (DDK) is a two-subunit kinase required for initiating DNA replication at individual origins and is composed of CDC7 kinase and its regulatory subunit DBF4. Both subunits are highly expressed in many diverse tumor cell lines and primary tumors, and this is correlated with poor prognosis. Inhibiting DDK causes apoptosis of tumor cells, but not normal cells, through a largely unknown mechanism. Firstly, to understand why DDK is often overexpressed in tumors, we identified gene expression signatures that correlate with DDK high- and DDK low-expressing lung adenocarcinomas. We found that increased DDK expression is highly correlated with inactivation of RB1-E2F and p53 tumor suppressor pathways. Both CDC7 and DBF4 promoters bind E2F, suggesting that increased E2F activity in RB1 mutant cancers promotes increased DDK expression. Surprisingly, increased DDK expression levels are also correlated with both increased chemoresistance and genome-wide mutation frequencies. Our data further suggest that high DDK levels directly promote elevated mutation frequencies. Secondly, we performed an RNAi screen to investigate how DDK inhibition causes apoptosis of tumor cells. We identified 23 kinases and phosphatases required for apoptosis when DDK is inhibited. These hits include checkpoint genes, G2/M cell cycle regulators, and known tumor suppressors leading to the hypothesis that inhibiting mitotic progression can protect against DDKi-induced apoptosis. Characterization of one novel hit, the LATS2 tumor suppressor, suggests that it promotes apoptosis independently of the upstream MST1/2 kinases in the Hippo signaling pathway.
Collapse
Affiliation(s)
- Nanda Kumar Sasi
- Laboratory of Genome Integrity and Tumorigenesis, Van Andel Research Institute (VARI), Grand Rapids, MI 49503; Laboratory of Systems Biology, VARI; Graduate Program in Genetics, Michigan State University, East Lansing, MI 48824
| | - Arjun Bhutkar
- David H. Koch Institute for Integrative Cancer Research, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | | | - Michael Weinreich
- Laboratory of Genome Integrity and Tumorigenesis, Van Andel Research Institute (VARI), Grand Rapids, MI 49503.
| |
Collapse
|