1
|
Identification of Breast Cancer Subtype-Specific Biomarkers by Integrating Copy Number Alterations and Gene Expression Profiles. ACTA ACUST UNITED AC 2021; 57:medicina57030261. [PMID: 33809336 PMCID: PMC7998437 DOI: 10.3390/medicina57030261] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Revised: 03/01/2021] [Accepted: 03/09/2021] [Indexed: 12/20/2022]
Abstract
Background and Objectives: Breast cancer is a heterogeneous disease categorized into four subtypes. Previous studies have shown that copy number alterations of several genes are implicated with the development and progression of many cancers. This study evaluates the effects of DNA copy number alterations on gene expression levels in different breast cancer subtypes. Materials and Methods: We performed a computational analysis integrating copy number alterations and gene expression profiles in 1024 breast cancer samples grouped into four molecular subtypes: luminal A, luminal B, HER2, and basal. Results: Our analyses identified several genes correlated in all subtypes such as KIAA1967 and MCPH1. In addition, several subtype-specific genes that showed a significant correlation between copy number and gene expression profiles were detected: SMARCB1, AZIN1, MTDH in luminal A, PPP2R5E, APEX1, GCN5 in luminal B, TNFAIP1, PCYT2, DIABLO in HER2, and FAM175B, SENP5, SCAF1 in basal subtype. Conclusions: This study showed that computational analyses integrating copy number and gene expression can contribute to unveil the molecular mechanisms of cancer and identify new subtype-specific biomarkers.
Collapse
|
2
|
In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer. Int J Mol Sci 2018; 19:ijms19030910. [PMID: 29562723 PMCID: PMC5877771 DOI: 10.3390/ijms19030910] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Revised: 03/15/2018] [Accepted: 03/16/2018] [Indexed: 12/12/2022] Open
Abstract
Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC.
Collapse
|
3
|
Vella D, Zoppis I, Mauri G, Mauri P, Di Silvestre D. From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2017; 2017:6. [PMID: 28477207 PMCID: PMC5359264 DOI: 10.1186/s13637-017-0059-z] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 03/09/2017] [Indexed: 12/19/2022]
Abstract
The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.
Collapse
Affiliation(s)
- Danila Vella
- Institute for Biomedical Technologies - National Research Council (ITB-CNR), 93 Fratelli Cervi, Segrate, Milan, Italy.,Department of Computer Science, Systems and Communication DiSCo, University of Milano-Bicocca, 336 Viale Sarca, Milan, Italy
| | - Italo Zoppis
- Department of Computer Science, Systems and Communication DiSCo, University of Milano-Bicocca, 336 Viale Sarca, Milan, Italy
| | - Giancarlo Mauri
- Department of Computer Science, Systems and Communication DiSCo, University of Milano-Bicocca, 336 Viale Sarca, Milan, Italy
| | - Pierluigi Mauri
- Institute for Biomedical Technologies - National Research Council (ITB-CNR), 93 Fratelli Cervi, Segrate, Milan, Italy
| | - Dario Di Silvestre
- Institute for Biomedical Technologies - National Research Council (ITB-CNR), 93 Fratelli Cervi, Segrate, Milan, Italy.
| |
Collapse
|
4
|
Cava C, Bertoli G, Castiglioni I. Pathway-based expression profile for breast cancer diagnoses. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2016; 2014:1151-4. [PMID: 25570167 DOI: 10.1109/embc.2014.6943799] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Microarray experiments have made possible to identify breast cancer marker gene signatures. However, gene expression-based signatures present limitations because they do not consider metabolic role of the genes and are affected by genetic heterogeneity across patient cohorts. Considering the activity of entire pathways rather than the expression levels of individual genes can be a way to exceed these limits. We evaluated and compared five methods of pathway-level aggregation of gene expression data. Our results confirmed the important role of pathway expression profile in breast cancer diagnostic classification (accuracy >90%). However, although assessed on a limited number of samples and datasets, this study shows that using dissimilarity representation among patients does not improve the classification of pathway-based expression profiles.
Collapse
|
5
|
Galli M, Zoppis I, Smith A, Magni F, Mauri G. Machine learning approaches in MALDI-MSI: clinical applications. Expert Rev Proteomics 2016; 13:685-96. [PMID: 27322705 DOI: 10.1080/14789450.2016.1200470] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Despite the unquestionable advantages of Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry Imaging in visualizing the spatial distribution and the relative abundance of biomolecules directly on-tissue, the yielded data is complex and high dimensional. Therefore, analysis and interpretation of this huge amount of information is mathematically, statistically and computationally challenging. AREAS COVERED This article reviews some of the challenges in data elaboration with particular emphasis on machine learning techniques employed in clinical applications, and can be useful in general as an entry point for those who want to study the computational aspects. Several characteristics of data processing are described, enlightening advantages and disadvantages. Different approaches for data elaboration focused on clinical applications are also provided. Practical tutorial based upon Orange Canvas and Weka software is included, helping familiarization with the data processing. Expert commentary: Recently, MALDI-MSI has gained considerable attention and has been employed for research and diagnostic purposes, with successful results. Data dimensionality constitutes an important issue and statistical methods for information-preserving data reduction represent one of the most challenging aspects. The most common data reduction methods are characterized by collecting independent observations into a single table. However, the incorporation of relational information can improve the discriminatory capability of the data.
Collapse
Affiliation(s)
- Manuel Galli
- a Department of Medicine and Surgery , University of Milano Bicocca , Monza Brianza , Italy
| | - Italo Zoppis
- b Department of Informatics, Systems and Communication , University of Milano Bicocca , Milano , Italy
| | - Andrew Smith
- a Department of Medicine and Surgery , University of Milano Bicocca , Monza Brianza , Italy
| | - Fulvio Magni
- a Department of Medicine and Surgery , University of Milano Bicocca , Monza Brianza , Italy
| | - Giancarlo Mauri
- b Department of Informatics, Systems and Communication , University of Milano Bicocca , Milano , Italy
| |
Collapse
|
6
|
A Support Vector Machine Classification of Thyroid Bioptic Specimens Using MALDI-MSI Data. Adv Bioinformatics 2016; 2016:3791214. [PMID: 27293431 PMCID: PMC4886047 DOI: 10.1155/2016/3791214] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 04/24/2016] [Indexed: 02/03/2023] Open
Abstract
Biomarkers able to characterise and predict multifactorial diseases are still one of the most important targets for all the “omics” investigations. In this context, Matrix-Assisted Laser Desorption/Ionisation-Mass Spectrometry Imaging (MALDI-MSI) has gained considerable attention in recent years, but it also led to a huge amount of complex data to be elaborated and interpreted. For this reason, computational and machine learning procedures for biomarker discovery are important tools to consider, both to reduce data dimension and to provide predictive markers for specific diseases. For instance, the availability of protein and genetic markers to support thyroid lesion diagnoses would impact deeply on society due to the high presence of undetermined reports (THY3) that are generally treated as malignant patients. In this paper we show how an accurate classification of thyroid bioptic specimens can be obtained through the application of a state-of-the-art machine learning approach (i.e., Support Vector Machines) on MALDI-MSI data, together with a particular wrapper feature selection algorithm (i.e., recursive feature elimination). The model is able to provide an accurate discriminatory capability using only 20 out of 144 features, resulting in an increase of the model performances, reliability, and computational efficiency. Finally, tissue areas rather than average proteomic profiles are classified, highlighting potential discriminating areas of clinical interest.
Collapse
|
7
|
Cava C, Bertoli G, Castiglioni I. Integrating genetics and epigenetics in breast cancer: biological insights, experimental, computational methods and therapeutic potential. BMC SYSTEMS BIOLOGY 2015; 9:62. [PMID: 26391647 PMCID: PMC4578257 DOI: 10.1186/s12918-015-0211-x] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2015] [Accepted: 09/15/2015] [Indexed: 12/11/2022]
Abstract
BACKGROUND Development of human cancer can proceed through the accumulation of different genetic changes affecting the structure and function of the genome. Combined analyses of molecular data at multiple levels, such as DNA copy-number alteration, mRNA and miRNA expression, can clarify biological functions and pathways deregulated in cancer. The integrative methods that are used to investigate these data involve different fields, including biology, bioinformatics, and statistics. RESULTS These methodologies are presented in this review, and their implementation in breast cancer is discussed with a focus on integration strategies. We report current applications, recent studies and interesting results leading to the identification of candidate biomarkers for diagnosis, prognosis, and therapy in breast cancer by using both individual and combined analyses. CONCLUSION This review presents a state of art of the role of different technologies in breast cancer based on the integration of genetics and epigenetics, and shares some issues related to the new opportunities and challenges offered by the application of such integrative approaches.
Collapse
Affiliation(s)
- Claudia Cava
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy.
| | - Gloria Bertoli
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy.
| | - Isabella Castiglioni
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy.
| |
Collapse
|
8
|
Integrative Analysis with Monte Carlo Cross-Validation Reveals miRNAs Regulating Pathways Cross-Talk in Aggressive Breast Cancer. BIOMED RESEARCH INTERNATIONAL 2015; 2015:831314. [PMID: 26240829 PMCID: PMC4512830 DOI: 10.1155/2015/831314] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Revised: 05/31/2015] [Accepted: 06/08/2015] [Indexed: 12/11/2022]
Abstract
In this work an integrated approach was used to identify functional miRNAs regulating gene pathway cross-talk in breast cancer (BC). We first integrated gene expression profiles and biological pathway information to explore the underlying associations between genes differently expressed among normal and BC samples and pathways enriched from these genes. For each pair of pathways, a score was derived from the distribution of gene expression levels by quantifying their pathway cross-talk. Random forest classification allowed the identification of pairs of pathways with high cross-talk. We assessed miRNAs regulating the identified gene pathways by a mutual information analysis. A Fisher test was applied to demonstrate their significance in the regulated pathways. Our results suggest interesting networks of pathways that could be key regulatory of target genes in BC, including stem cell pluripotency, coagulation, and hypoxia pathways and miRNAs that control these networks could be potential biomarkers for diagnostic, prognostic, and therapeutic development in BC. This work shows that standard methods of predicting normal and tumor classes such as differentially expressed miRNAs or transcription factors could lose intrinsic features; instead our approach revealed the responsible molecules of the disease.
Collapse
|
9
|
Castelnuovo G, Zoppis I, Santoro E, Ceccarini M, Pietrabissa G, Manzoni GM, Corti S, Borrello M, Giusti EM, Cattivelli R, Melesi A, Mauri G, Molinari E, Sicurello F. Managing chronic pathologies with a stepped mHealth-based approach in clinical psychology and medicine. Front Psychol 2015; 6:407. [PMID: 25926801 PMCID: PMC4396192 DOI: 10.3389/fpsyg.2015.00407] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Accepted: 03/23/2015] [Indexed: 12/12/2022] Open
Abstract
Chronic diseases and conditions typically require long-term monitoring and treatment protocols both in traditional settings and in out-patient frameworks. The economic burden of chronic conditions is a key challenge and new and mobile technologies could offer good solutions. mHealth could be considered an evolution of eHealth and could be defined as the practice of medicine and public health supported by mobile communication devices. mHealth approach could overcome limitations linked with the traditional, restricted, and highly expensive in-patient treatment of many chronic pathologies. Possible applications include stepped mHealth approach, where patients can be monitored and treated in their everyday contexts. Unfortunately, many barriers for the spread of mHealth are still present. Due the significant impact of psychosocial factors on disease evolution, psychotherapies have to be included into the chronic disease protocols. Existing psychological theories of health behavior change have to be adapted to the new technological contexts and requirements. In conclusion, clinical psychology and medicine have to face the "chronic care management" challenge in both traditional and mHealth settings.
Collapse
Affiliation(s)
- Gianluca Castelnuovo
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, Catholic University of MilanMilan, Italy
| | - Italo Zoppis
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-BicoccaMilano, Italy
| | - Eugenio Santoro
- Laboratory of Medical Informatics, Department of Epidemiology, Istituto di Ricovero e Cura a Carattere Scientifico – Istituto di Ricerche Farmacologiche Mario NegriMilano, Italy
| | - Martina Ceccarini
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, University of BergamoBergamo, Italy
| | - Giada Pietrabissa
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, Catholic University of MilanMilan, Italy
| | - Gian Mauro Manzoni
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, Catholic University of MilanMilan, Italy
| | - Stefania Corti
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, University of BergamoBergamo, Italy
| | - Maria Borrello
- Department of Psychology, University of BergamoBergamo, Italy
| | | | - Roberto Cattivelli
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
| | - Anna Melesi
- Department of Electronics, Information and Bioengineering, Politecnico di MilanoMilano, Italy
| | - Giancarlo Mauri
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-BicoccaMilano, Italy
| | - Enrico Molinari
- Istituto Auxologico Italiano IRCCS, Psychology Research Laboratory, Ospedale San GiuseppeVerbania, Italy
- Department of Psychology, Catholic University of MilanMilan, Italy
| | - Francesco Sicurello
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-BicoccaMilano, Italy
| |
Collapse
|
10
|
Cava C, Bertoli G, Ripamonti M, Mauri G, Zoppis I, Rosa PAD, Gilardi MC, Castiglioni I. Integration of mRNA expression profile, copy number alterations, and microRNA expression levels in breast cancer to improve grade definition. PLoS One 2014; 9:e97681. [PMID: 24866763 PMCID: PMC4035288 DOI: 10.1371/journal.pone.0097681] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2013] [Accepted: 04/23/2014] [Indexed: 12/20/2022] Open
Abstract
Defining the aggressiveness and growth rate of a malignant cell population is a key step in the clinical approach to treating tumor disease. The correct grading of breast cancer (BC) is a fundamental part in determining the appropriate treatment. Biological variables can make it difficult to elucidate the mechanisms underlying BC development. To identify potential markers that can be used for BC classification, we analyzed mRNAs expression profiles, gene copy numbers, microRNAs expression and their association with tumor grade in BC microarray-derived datasets. From mRNA expression results, we found that grade 2 BC is most likely a mixture of grade 1 and grade 3 that have been misclassified, being described by the gene signature of either grade 1 or grade 3. We assessed the potential of the new approach of integrating mRNA expression profile, copy number alterations, and microRNA expression levels to select a limited number of genomic BC biomarkers. The combination of mRNA profile analysis and copy number data with microRNA expression levels led to the identification of two gene signatures of 42 and 4 altered genes (FOXM1, KPNA4, H2AFV and DDX19A) respectively, the latter obtained through a meta-analytical procedure. The 42-based gene signature identifies 4 classes of up- or down-regulated microRNAs (17 microRNAs) and of their 17 target mRNA, and the 4-based genes signature identified 4 microRNAs (Hsa-miR-320d, Hsa-miR-139-5p, Hsa-miR-567 and Hsa-let-7c). These results are discussed from a biological point of view with respect to pathological features of BC. Our identified mRNAs and microRNAs were validated as prognostic factors of BC disease progression, and could potentially facilitate the implementation of assays for laboratory validation, due to their reduced number.
Collapse
Affiliation(s)
- Claudia Cava
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy
| | - Gloria Bertoli
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy
| | - Marilena Ripamonti
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy
| | - Giancarlo Mauri
- Department of Informatics, Systems and Communications, University of Milan–Bicocca, Milan, Italy
| | - Italo Zoppis
- Department of Informatics, Systems and Communications, University of Milan–Bicocca, Milan, Italy
| | | | - Maria Carla Gilardi
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy
| | - Isabella Castiglioni
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Milan, Italy
| |
Collapse
|