1
|
Sutiene K, Schwendner P, Sipos C, Lorenzo L, Mirchev M, Lameski P, Kabasinskas A, Tidjani C, Ozturkkal B, Cerneviciene J. Enhancing portfolio management using artificial intelligence: literature review. Front Artif Intell 2024; 7:1371502. [PMID: 38650961 PMCID: PMC11033520 DOI: 10.3389/frai.2024.1371502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 03/12/2024] [Indexed: 04/25/2024] Open
Abstract
Building an investment portfolio is a problem that numerous researchers have addressed for many years. The key goal has always been to balance risk and reward by optimally allocating assets such as stocks, bonds, and cash. In general, the portfolio management process is based on three steps: planning, execution, and feedback, each of which has its objectives and methods to be employed. Starting from Markowitz's mean-variance portfolio theory, different frameworks have been widely accepted, which considerably renewed how asset allocation is being solved. Recent advances in artificial intelligence provide methodological and technological capabilities to solve highly complex problems, and investment portfolio is no exception. For this reason, the paper reviews the current state-of-the-art approaches by answering the core question of how artificial intelligence is transforming portfolio management steps. Moreover, as the use of artificial intelligence in finance is challenged by transparency, fairness and explainability requirements, the case study of post-hoc explanations for asset allocation is demonstrated. Finally, we discuss recent regulatory developments in the European investment business and highlight specific aspects of this business where explainable artificial intelligence could advance transparency of the investment process.
Collapse
Affiliation(s)
- Kristina Sutiene
- Department of Mathematical Modeling, Kaunas University of Technology, Kaunas, Lithuania
| | - Peter Schwendner
- School of Management and Law, Institute of Wealth and Asset Management, Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Ciprian Sipos
- Department of Economics and Modelling, West University of Timisoara, Timisoara, Romania
| | - Luis Lorenzo
- Faculty of Statistic Studies, Complutense University of Madrid, Madrid, Spain
| | - Miroslav Mirchev
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Skopje, North Macedonia
- Complexity Science Hub Vienna, Vienna, Austria
| | - Petre Lameski
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Skopje, North Macedonia
| | - Audrius Kabasinskas
- Department of Mathematical Modeling, Kaunas University of Technology, Kaunas, Lithuania
| | - Chemseddine Tidjani
- Division of Firms and Industrial Economics, Research Center in Applied Economics for Development, Algiers, Algeria
| | - Belma Ozturkkal
- Department of International Trade and Finance, Kadir Has University, Istanbul, Türkiye
| | - Jurgita Cerneviciene
- Department of Mathematical Modeling, Kaunas University of Technology, Kaunas, Lithuania
| |
Collapse
|
2
|
Sobański JA, Klasa K, Dembińska E, Mielimąka M, Citkowska-Kisielewska A, Jęda P, Rutkowski K. Central psychological symptoms from a network analysis of patients with anxiety, somatoform or personality disorders before psychotherapy. J Affect Disord 2023; 339:1-21. [PMID: 37399849 DOI: 10.1016/j.jad.2023.06.040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 03/05/2023] [Accepted: 06/20/2023] [Indexed: 07/05/2023]
Abstract
BACKGROUND Cross-sectional network analysis examines the relationships between symptoms to explain how they constitute disorders. Up to now, research focuses mostly on depression, posttraumatic stress disorder, and rarely assesses larger networks of various symptoms measured with instruments independent of classifications. Studies on large groups of psychotherapy patients are also rare. METHODS Analyzing triangulated maximally filtered graph (TMFG) networks of 62 psychological symptoms reported by 4616 consecutive nonpsychotic adults in 1980-2015. RESULTS Case-dropping and nonparametric bootstrap proved the accuracy, stability and reliability of networks in patients' sex-, age-, and time of visit divided subgroups. Feeling that others are prejudiced against the patient was the most central symptom, followed by catastrophic fears, feeling inferior and underestimated. Sadness, panic, and sex-related complaints were less central than we expected. All analysed symptoms were connected, and we found only small sex-related differences between subsamples' networks. No differences were observed for time of visit and age of patients. LIMITATION Analyses were cross-sectional and retrospective, not allowing examination of directionality or causality. Further, data are at the between-person level; thus, it is unknown whether the network remains constant for any person over time. One self-report checklist and building binary network method may bias results. Our results indicate how symptoms co-occured before psychotherapy, not longitudinally. Our sample included public university hospital patients, all White-Europeans, predominantly females and university students. CONCLUSIONS Hostile projection, catastrophic fears, feeling inferior and underestimated were the most important psychological phenomena reported before psychotherapy. Exploring these symptoms would possibly lead to enhancement of treatments.
Collapse
Affiliation(s)
- Jerzy A Sobański
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland.
| | - Katarzyna Klasa
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland
| | - Edyta Dembińska
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland
| | - Michał Mielimąka
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland
| | | | - Patrycja Jęda
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland
| | - Krzysztof Rutkowski
- Jagiellonian University Medical College, Faculty of Medicine, Department of Psychotherapy, Poland
| |
Collapse
|
3
|
Jiang C, He Y, Betzel RF, Wang YS, Xing XX, Zuo XN. Optimizing network neuroscience computation of individual differences in human spontaneous brain activity for test-retest reliability. Netw Neurosci 2023; 7:1080-1108. [PMID: 37781147 PMCID: PMC10473278 DOI: 10.1162/netn_a_00315] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 03/22/2023] [Indexed: 10/03/2023] Open
Abstract
A rapidly emerging application of network neuroscience in neuroimaging studies has provided useful tools to understand individual differences in intrinsic brain function by mapping spontaneous brain activity, namely intrinsic functional network neuroscience (ifNN). However, the variability of methodologies applied across the ifNN studies-with respect to node definition, edge construction, and graph measurements-makes it difficult to directly compare findings and also challenging for end users to select the optimal strategies for mapping individual differences in brain networks. Here, we aim to provide a benchmark for best ifNN practices by systematically comparing the measurement reliability of individual differences under different ifNN analytical strategies using the test-retest design of the Human Connectome Project. The results uncovered four essential principles to guide ifNN studies: (1) use a whole brain parcellation to define network nodes, including subcortical and cerebellar regions; (2) construct functional networks using spontaneous brain activity in multiple slow bands; and (3) optimize topological economy of networks at individual level; and (4) characterize information flow with specific metrics of integration and segregation. We built an interactive online resource of reliability assessments for future ifNN (https://ibraindata.com/research/ifNN).
Collapse
Affiliation(s)
- Chao Jiang
- School of Psychology, Capital Normal University, Beijing, China
| | - Ye He
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
| | - Richard F. Betzel
- Department of Psychological and Brain Sciences, Indiana University, Bloomington, Indiana, USA
| | - Yin-Shan Wang
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China
- Developmental Population Neuroscience Research Center, International Data Group/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
| | - Xiu-Xia Xing
- Department of Applied Mathematics, College of Mathematics, Faculty of Science, Beijing University of Technology, Beijing, China
| | - Xi-Nian Zuo
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China
- Developmental Population Neuroscience Research Center, International Data Group/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
- National Basic Science Data Center, Beijing, China
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
4
|
Chen Q, Christensen AP, Kenett YN, Ren Z, Condon DM, Bilder RM, Qiu J, Beaty RE. Mapping the Creative Personality: A Psychometric Network Analysis of Highly Creative Artists and Scientists. CREATIVITY RESEARCH JOURNAL 2023. [DOI: 10.1080/10400419.2023.2184558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2023]
Affiliation(s)
- Qunlin Chen
- Southwest University
- Pennsylvania State University
| | | | | | | | | | | | | | | |
Collapse
|
5
|
Cohesion and segregation in the value migration network: Evidence from network partitioning based on sector classification and clustering. SOCIAL NETWORK ANALYSIS AND MINING 2023. [DOI: 10.1007/s13278-023-01027-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
AbstractCluster structure detection of the network is a basic problem of complex network analysis. This study investigates the structure of the value migration network using data from 499 stocks listed in the S&P500 as of the end of 2021. An examination is carried out whether the process of value migration creates a cluster structure in the network of companies according to economic activity. Specifically, the cohesion and segregation of the extracted modules in the network division according to (i) sector classification, (ii) community division, and (iii) network clustering decomposition are assessed. The results of this study show that the sector classification of the value migration network has a non-cohesive structure, which means that the flow of value in the financial market occurs between companies from various industries. Moreover, the divisions of the value migration network based on community detection and clustering algorithm are characterized by intra-cluster similarity between the vertices and have a strong community structure. The structure of the network division into modules corresponding to the classification of economic sectors differs significantly from the partition based on the algorithms applied.
Collapse
|
6
|
He Y, Yang Y, Su X, Zhao B, Xiong S, Hu L. Incorporating higher order network structures to improve miRNA-disease association prediction based on functional modularity. Brief Bioinform 2023; 24:6958503. [PMID: 36562706 DOI: 10.1093/bib/bbac562] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/29/2022] [Accepted: 11/19/2022] [Indexed: 12/24/2022] Open
Abstract
As microRNAs (miRNAs) are involved in many essential biological processes, their abnormal expressions can serve as biomarkers and prognostic indicators to prevent the development of complex diseases, thus providing accurate early detection and prognostic evaluation. Although a number of computational methods have been proposed to predict miRNA-disease associations (MDAs) for further experimental verification, their performance is limited primarily by the inadequacy of exploiting lower order patterns characterizing known MDAs to identify missing ones from MDA networks. Hence, in this work, we present a novel prediction model, namely HiSCMDA, by incorporating higher order network structures for improved performance of MDA prediction. To this end, HiSCMDA first integrates miRNA similarity network, disease similarity network and MDA network to preserve the advantages of all these networks. After that, it identifies overlapping functional modules from the integrated network by predefining several higher order connectivity patterns of interest. Last, a path-based scoring function is designed to infer potential MDAs based on network paths across related functional modules. HiSCMDA yields the best performance across all datasets and evaluation metrics in the cross-validation and independent validation experiments. Furthermore, in the case studies, 49 and 50 out of the top 50 miRNAs, respectively, predicted for colon neoplasms and lung neoplasms have been validated by well-established databases. Experimental results show that rich higher order organizational structures exposed in the MDA network gain new insight into the MDA prediction based on higher order connectivity patterns.
Collapse
Affiliation(s)
- Yizhou He
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Yue Yang
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Xiaorui Su
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Bowei Zhao
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Shengwu Xiong
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| |
Collapse
|
7
|
Christensen AP, Cardillo ER, Chatterjee A. What kind of impacts can artwork have on viewers? Establishing a taxonomy for aesthetic impacts. Br J Psychol 2022; 114:335-351. [PMID: 36519205 DOI: 10.1111/bjop.12623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 12/05/2022] [Indexed: 12/23/2022]
Abstract
What kinds of impacts can visual art have on a viewer? To identify potential art impacts, we recruited five aesthetics experts from different academic disciplines: art history, neuroscience, philosophy, psychology and theology. Together, the group curated a set of terms that corresponded to descriptive features (124 terms) and cognitive-affective impacts (69 terms) of artworks. Using these terms as prompts, participants (n = 899) were given one minute to generate words for each term related to how an artwork looked (descriptive features) or made them think or feel (cognitive-affective impacts). Using network psychometric approaches, we identified terms that were semantically similar based on participants' responses and applied hierarchical exploratory graph analysis to map the relationships between the terms. Our analyses identified 17 descriptive dimensions, which could be further reduced to 5, and 11 impact dimensions, which could be further reduced to 4. The resulting taxonomy demonstrated overlap between the descriptive and impact networks as well as consistency with empirical evidence. This taxonomy could serve as the foundation to empirically evaluate art's impacts on viewers.
Collapse
Affiliation(s)
- Alexander P Christensen
- Penn Center for Neuroaesthetics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.,Psychology and Human Development, Peabody College, Vanderbilt University, Nashville, Tennessee, USA
| | - Eileen R Cardillo
- Penn Center for Neuroaesthetics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Anjan Chatterjee
- Penn Center for Neuroaesthetics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
8
|
|
9
|
Choi S, Gwak D, Song JW, Chang W. Stock market network based on bi-dimensional histogram and autoencoder. INTELL DATA ANAL 2022. [DOI: 10.3233/ida-215819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In this study, we propose a deep learning related framework to analyze S&P500 stocks using bi-dimensional histogram and autoencoder. The bi-dimensional histogram consisting of daily returns of stock price and stock trading volume is plotted for each stock. Autoencoder is applied to the bi-dimensional histogram to reduce data dimension and extract meaningful features of a stock. The histogram distance matrix for stocks are made of the extracted features of stocks, and stock market network is built by applying Planar Maximally Filtered Graph(PMFG) algorithm to the histogram distance matrix. The constructed stock market network represents the latent space of bi-dimensional histogram, and network analysis is performed to investigate the structural properties of the stock market. we discover that the structural properties of stock market network are related to the dispersion of bi-dimensional histogram. Also, we confirm that the autoencoder is effective in extracting the latent feature of the bi-dimensional histogram. Portfolios using the features of bi-dimensional histogram network are constructed and their investment performance is evaluated in comparison with other benchmark portfolios. We observe that the portfolio consisting of stocks corresponding to the peripheral nodes of bi-dimensional histogram network shows better investment performance than other benchmark stock portfolios.
Collapse
Affiliation(s)
- Sungyoon Choi
- Department of Industrial Engineering, Seoul National University, Seoul, Korea
| | - Dongkyu Gwak
- Department of Industrial Engineering, Seoul National University, Seoul, Korea
| | - Jae Wook Song
- Department of Industrial Engineering, Hanyang University, Seoul, Korea
| | - Woojin Chang
- Department of Industrial Engineering, Seoul National University, Seoul, Korea
- Institute for Industrial Systems Innovation, Seoul National University, Seoul, Korea
- SNU Institute for Research in Finance and Economics, Seoul National University, Seoul, Korea
| |
Collapse
|
10
|
Wang M, Song WM, Ming C, Wang Q, Zhou X, Xu P, Krek A, Yoon Y, Ho L, Orr ME, Yuan GC, Zhang B. Guidelines for bioinformatics of single-cell sequencing data analysis in Alzheimer's disease: review, recommendation, implementation and application. Mol Neurodegener 2022; 17:17. [PMID: 35236372 PMCID: PMC8889402 DOI: 10.1186/s13024-022-00517-z] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Accepted: 01/18/2022] [Indexed: 12/13/2022] Open
Abstract
Alzheimer's disease (AD) is the most common form of dementia, characterized by progressive cognitive impairment and neurodegeneration. Extensive clinical and genomic studies have revealed biomarkers, risk factors, pathways, and targets of AD in the past decade. However, the exact molecular basis of AD development and progression remains elusive. The emerging single-cell sequencing technology can potentially provide cell-level insights into the disease. Here we systematically review the state-of-the-art bioinformatics approaches to analyze single-cell sequencing data and their applications to AD in 14 major directions, including 1) quality control and normalization, 2) dimension reduction and feature extraction, 3) cell clustering analysis, 4) cell type inference and annotation, 5) differential expression, 6) trajectory inference, 7) copy number variation analysis, 8) integration of single-cell multi-omics, 9) epigenomic analysis, 10) gene network inference, 11) prioritization of cell subpopulations, 12) integrative analysis of human and mouse sc-RNA-seq data, 13) spatial transcriptomics, and 14) comparison of single cell AD mouse model studies and single cell human AD studies. We also address challenges in using human postmortem and mouse tissues and outline future developments in single cell sequencing data analysis. Importantly, we have implemented our recommended workflow for each major analytic direction and applied them to a large single nucleus RNA-sequencing (snRNA-seq) dataset in AD. Key analytic results are reported while the scripts and the data are shared with the research community through GitHub. In summary, this comprehensive review provides insights into various approaches to analyze single cell sequencing data and offers specific guidelines for study design and a variety of analytic directions. The review and the accompanied software tools will serve as a valuable resource for studying cellular and molecular mechanisms of AD, other diseases, or biological systems at the single cell level.
Collapse
Affiliation(s)
- Minghui Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Won-min Song
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Chen Ming
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Qian Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Xianxiao Zhou
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Peng Xu
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Azra Krek
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 USA
| | - Yonejung Yoon
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Lap Ho
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Miranda E. Orr
- Department of Internal Medicine, Section of Gerontology and Geriatric Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina USA
- Sticht Center for Healthy Aging and Alzheimer’s Prevention, Wake Forest School of Medicine, Winston-Salem, North Carolina USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 USA
| | - Bin Zhang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| |
Collapse
|
11
|
A survey of the application of graph-based approaches in stock market analysis and prediction. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2022. [DOI: 10.1007/s41060-021-00306-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
AbstractGraph-based approaches are revolutionizing the analysis of different real-life systems, and the stock market is no exception. Individual stocks and stock market indices are connected, and interesting patterns appear when the stock market is considered as a graph. Researchers are analyzing the stock market using graph-based approaches in recent years, and there is a need to survey those works from multiple perspectives. We discuss the existing graph-based works from five perspectives: (i) stock market graph formulation, (ii) stock market graph filtering, (iii) stock market graph clustering, (iv) stock movement prediction, and (v) portfolio optimization. This study contains a concise description of major techniques and algorithms relevant to graph-based approaches for the stock market.
Collapse
|
12
|
Estimating the Stability of Psychological Dimensions via Bootstrap Exploratory Graph Analysis: A Monte Carlo Simulation and Tutorial. PSYCH 2021. [DOI: 10.3390/psych3030032] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Exploratory Graph Analysis (EGA) has emerged as a popular approach for estimating the dimensionality of multivariate data using psychometric networks. Sampling variability, however, has made reproducibility and generalizability a key issue in network psychometrics. To address this issue, we have developed a novel bootstrap approach called Bootstrap Exploratory Graph Analysis (bootEGA). bootEGA generates a sampling distribution of EGA results where several statistics can be computed. Descriptive statistics (median, standard error, and dimension frequency) provide researchers with a general sense of the stability of their empirical EGA dimensions. Structural consistency estimates how often dimensions are replicated exactly across the bootstrap replicates. Item stability statistics provide information about whether dimensions are unstable due to misallocation (e.g., item placed in the wrong dimension), multidimensionality (e.g., item belonging to more than one dimension), and item redundancy (e.g., similar semantic content). Using a Monte Carlo simulation, we determine guidelines for acceptable item stability. After, we provide an empirical example that demonstrates how bootEGA can be used to identify structural consistency issues (including a fully reproducible R tutorial). In sum, we demonstrate that bootEGA is a robust approach for identifying the stability and robustness of dimensionality in multivariate data.
Collapse
|
13
|
Hu L, Zhang J, Pan X, Yan H, You ZH. HiSCF: leveraging higher-order structures for clustering analysis in biological networks. Bioinformatics 2021; 37:542-550. [PMID: 32931549 DOI: 10.1093/bioinformatics/btaa775] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 05/12/2020] [Accepted: 09/03/2020] [Indexed: 02/06/2023] Open
Abstract
MOTIVATION Clustering analysis in a biological network is to group biological entities into functional modules, thus providing valuable insight into the understanding of complex biological systems. Existing clustering techniques make use of lower-order connectivity patterns at the level of individual biological entities and their connections, but few of them can take into account of higher-order connectivity patterns at the level of small network motifs. RESULTS Here, we present a novel clustering framework, namely HiSCF, to identify functional modules based on the higher-order structure information available in a biological network. Taking advantage of higher-order Markov stochastic process, HiSCF is able to perform the clustering analysis by exploiting a variety of network motifs. When compared with several state-of-the-art clustering models, HiSCF yields the best performance for two practical clustering applications, i.e. protein complex identification and gene co-expression module detection, in terms of accuracy. The promising performance of HiSCF demonstrates that the consideration of higher-order network motifs gains new insight into the analysis of biological networks, such as the identification of overlapping protein complexes and the inference of new signaling pathways, and also reveals the rich higher-order organizational structures presented in biological networks. AVAILABILITY AND IMPLEMENTATION HiSCF is available at https://github.com/allenv5/HiSCF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China.,School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, China
| | - Jun Zhang
- School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, China
| | - Xiangyu Pan
- School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, China
| | - Hong Yan
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong 999077, China
| | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China
| |
Collapse
|
14
|
“Small things matter most”: The spillover effects in the cryptocurrency market and gold as a silver bullet. THE NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE 2020; 54. [PMCID: PMC7434496 DOI: 10.1016/j.najef.2020.101277] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Cryptos with small market capitalization are more likely to be sources of shocks than their larger counterparts. Bitcoins can be considered as a better hedge due to its relative independence. USDT strong anchoring with US$ makes it very volatile. The idiosyncrasy of Gold enables it to weather adverse crypto market’s movements. Having gold in the portfolio with cryptocurrency helps fruitful diversification.
The cryptocurrencies with small market capitalization are often overlooked despite they can potentially be the source of shocks to other cryptocurrencies in the market. To address this caveat, this paper attempts to investigate the spillover effects among 14 cryptocurrencies by employing transfer entropy. Our results suggest that among different types of cryptos, Bitcoin is still the most appropriate instrument for hedging, while Tether (USDT) which have a strong anchor with the US dollar is significantly volatile. Interestingly, we document that the small coins are more likely to be shock creators in the cryptocurrency market. Using the same approach, we further explored the link between gold prices and cryptocurrency prices. The results show that gold could be a good hedging instrument for cryptocurrencies due to its independence. In light of empirical results, it is advisable to carefully consider the coins with small capitalization. Further, investors should conduct portfolio rebalancing by including gold to hedge against the unexpected movement in the cryptocurrency market. Our paper not only contributes in terms of the application of advanced empirical methodology but also provides evidence on idiosyncratic features of the cryptocurrency market.
Collapse
|
15
|
Zhang L, Peng TL, Wang L, Meng XH, Zhu W, Zeng Y, Zhu JQ, Zhou Y, Xiao HM, Deng HW. Network-based Transcriptome-wide Expression Study for Postmenopausal Osteoporosis. J Clin Endocrinol Metab 2020; 105:5850085. [PMID: 32483604 PMCID: PMC7320836 DOI: 10.1210/clinem/dgaa319] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 05/27/2020] [Indexed: 01/08/2023]
Abstract
PURPOSE Menopause is a crucial physiological transition during a woman's life, and it occurs with growing risks of health issues like osteoporosis. To identify postmenopausal osteoporosis-related genes, we performed transcriptome-wide expression analyses for human peripheral blood monocytes (PBMs) using Affymetrix 1.0 ST arrays in 40 Caucasian postmenopausal women with discordant bone mineral density (BMD) levels. METHODS We performed multiscale embedded gene coexpression network analysis (MEGENA) to study functionally orchestrating clusters of differentially expressed genes in the form of functional networks. Gene sets net correlations analysis (GSNCA) was applied to assess how the coexpression structure of a predefined gene set differs in high and low BMD groups. Bayesian network (BN) analysis was used to identify important regulation patterns between potential risk genes for osteoporosis. A small interfering ribonucleic acid (siRNA)-based gene silencing in vitro experiment was performed to validate the findings from BN analysis. RESULT MEGENA showed that the "T cell receptor signaling pathway" and the "osteoclast differentiation pathway" were significantly enriched in the identified compact network, which is significantly correlated with BMD variation. GSNCA revealed that the coexpression structure of the "Signaling by TGF-beta receptor complex pathway" is significantly different between the 2 BMD discordant groups; the hub genes in the postmenopausal low and high BMD group are FURIN and SMAD3 respectively. With siRNA in vitro experiments, we confirmed the regulation relationship of TGFBR2-SMAD7 and TGFBR1-SMURF2. MAIN CONCLUSION The present study suggests that biological signals involved in monocyte recruitment, monocyte/macrophage lineage development, osteoclast formation, and osteoclast differentiation might function together in PBMs that contribute to the pathogenesis of postmenopausal osteoporosis.
Collapse
Affiliation(s)
- Lan Zhang
- Center for Biomedical informatics and Genomics, Department of Medicine, Tulane University, New Orleans, Louisiana
| | - Tian-Liu Peng
- Institute of Reproduction and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, Hunan, China
| | - Le Wang
- Institute of Reproduction and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, Hunan, China
| | - Xiang-He Meng
- Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University, Changsha, Hunan, China
| | - Wei Zhu
- Center for Biomedical informatics and Genomics, Department of Medicine, Tulane University, New Orleans, Louisiana
| | - Yong Zeng
- Center for Biomedical informatics and Genomics, Department of Medicine, Tulane University, New Orleans, Louisiana
| | - Jia-Qiang Zhu
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan
| | - Yu Zhou
- Center for Biomedical informatics and Genomics, Department of Medicine, Tulane University, New Orleans, Louisiana
| | - Hong-Mei Xiao
- Institute of Reproduction and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, Hunan, China
| | - Hong-Wen Deng
- Center for Biomedical informatics and Genomics, Department of Medicine, Tulane University, New Orleans, Louisiana
- Correspondence and Reprint Requests: Hong-Wen Deng, Center for Biomedical Informatics and Genomics, Department of Medicine, School of Medicine, Tulane University, New Orleans, LA 70112, USA. E-mail:
| |
Collapse
|
16
|
Golino H, Shi D, Christensen AP, Garrido LE, Nieto MD, Sadana R, Thiyagarajan JA, Martinez-Molina A. Investigating the performance of exploratory graph analysis and traditional techniques to identify the number of latent factors: A simulation and tutorial. Psychol Methods 2020; 25:292-320. [PMID: 32191105 PMCID: PMC7244378 DOI: 10.1037/met0000255] [Citation(s) in RCA: 146] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Exploratory graph analysis (EGA) is a new technique that was recently proposed within the framework of network psychometrics to estimate the number of factors underlying multivariate data. Unlike other methods, EGA produces a visual guide-network plot-that not only indicates the number of dimensions to retain, but also which items cluster together and their level of association. Although previous studies have found EGA to be superior to traditional methods, they are limited in the conditions considered. These issues are addressed through an extensive simulation study that incorporates a wide range of plausible structures that may be found in practice, including continuous and dichotomous data, and unidimensional and multidimensional structures. Additionally, two new EGA techniques are presented: one that extends EGA to also deal with unidimensional structures, and the other based on the triangulated maximally filtered graph approach (EGAtmfg). Both EGA techniques are compared with 5 widely used factor analytic techniques. Overall, EGA and EGAtmfg are found to perform as well as the most accurate traditional method, parallel analysis, and to produce the best large-sample properties of all the methods evaluated. To facilitate the use and application of EGA, we present a straightforward R tutorial on how to apply and interpret EGA, using scores from a well-known psychological instrument: the Marlowe-Crowne Social Desirability Scale. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
Collapse
|
17
|
Network structure of the Wisconsin Schizotypy Scales-Short Forms: Examining psychometric network filtering approaches. Behav Res Methods 2019. [PMID: 29520631 DOI: 10.3758/s13428-018-1032-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Schizotypy is a multidimensional construct that provides a useful framework for understanding the etiology, development, and risk for schizophrenia-spectrum disorders. Past research has applied traditional methods, such as factor analysis, to uncovering common dimensions of schizotypy. In the present study, we aimed to advance the construct of schizotypy, measured by the Wisconsin Schizotypy Scales-Short Forms (WSS-SF), beyond this general scope by applying two different psychometric network filtering approaches-the state-of-the-art approach (lasso), which has been employed in previous studies, and an alternative approach (information-filtering networks; IFNs). First, we applied both filtering approaches to two large, independent samples of WSS-SF data (ns = 5,831 and 2,171) and assessed each approach's representation of the WSS-SF's schizotypy construct. Both filtering approaches produced results similar to those from traditional methods, with the IFN approach producing results more consistent with previous theoretical interpretations of schizotypy. Then we evaluated how well both filtering approaches reproduced the global and local network characteristics of the two samples. We found that the IFN approach produced more consistent results for both global and local network characteristics. Finally, we sought to evaluate the predictability of the network centrality measures for each filtering approach, by determining the core, intermediate, and peripheral items on the WSS-SF and using them to predict interview reports of schizophrenia-spectrum symptoms. We found some similarities and differences in their effectiveness, with the IFN approach's network structure providing better overall predictive distinctions. We discuss the implications of our findings for schizotypy and for psychometric network analysis more generally.
Collapse
|
18
|
Marcaccioli R, Livan G. A Pólya urn approach to information filtering in complex networks. Nat Commun 2019; 10:745. [PMID: 30765706 PMCID: PMC6375975 DOI: 10.1038/s41467-019-08667-3] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 01/23/2019] [Indexed: 11/18/2022] Open
Abstract
The increasing availability of data demands for techniques to filter information in large complex networks of interactions. A number of approaches have been proposed to extract network backbones by assessing the statistical significance of links against null hypotheses of random interaction. Yet, it is well known that the growth of most real-world networks is non-random, as past interactions between nodes typically increase the likelihood of further interaction. Here, we propose a filtering methodology inspired by the Pólya urn, a combinatorial model driven by a self-reinforcement mechanism, which relies on a family of null hypotheses that can be calibrated to assess which links are statistically significant with respect to a given network's own heterogeneity. We provide a full characterization of the filter, and show that it selects links based on a non-trivial interplay between their local importance and the importance of the nodes they belong to.
Collapse
Affiliation(s)
- Riccardo Marcaccioli
- Department of Computer Science, University College London, 66-72 Gower Street, London, WC1E 6EA, UK
| | - Giacomo Livan
- Department of Computer Science, University College London, 66-72 Gower Street, London, WC1E 6EA, UK.
- Systemic Risk Centre, London School of Economics and Political Sciences, Houghton Street, London, WC2A 2AE, UK.
| |
Collapse
|
19
|
Wang WH, Xie TY, Xie GL, Ren ZL, Li JM. An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data. Genes (Basel) 2018; 9:E397. [PMID: 30072645 PMCID: PMC6115727 DOI: 10.3390/genes9080397] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 07/18/2018] [Accepted: 07/27/2018] [Indexed: 02/08/2023] Open
Abstract
Identifying molecular subtypes of colorectal cancer (CRC) may allow for more rational, patient-specific treatment. Various studies have identified molecular subtypes for CRC using gene expression data, but they are inconsistent and further research is necessary. From a methodological point of view, a progressive approach is needed to identify molecular subtypes in human colon cancer using gene expression data. We propose an approach to identify the molecular subtypes of colon cancer that integrates denoising by the Bayesian robust principal component analysis (BRPCA) algorithm, hierarchical clustering by the directed bubble hierarchical tree (DBHT) algorithm, and feature gene selection by an improved differential evolution based feature selection method (DEFSW) algorithm. In this approach, the normal samples being completely and exclusively clustered into one class is considered to be the standard of reasonable clustering subtypes, and the feature selection pays attention to imbalances of samples among subtypes. With this approach, we identified the molecular subtypes of colon cancer on the mRNA gene expression dataset of 153 colon cancer samples and 19 normal control samples of the Cancer Genome Atlas (TCGA) project. The colon cancer was clustered into 7 subtypes with 44 feature genes. Our approach could identify finer subtypes of colon cancer with fewer feature genes than the other two recent studies and exhibits a generic methodology that might be applied to identify the subtypes of other cancers.
Collapse
Affiliation(s)
- Wen-Hui Wang
- State Key Laboratory of Organ Failure Research, Division of Nephrology, Southern Medical University, Guangzhou 510515, China.
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.
- Network Information Center, The Sixth Affiliated Hospital of Sun Yat-Sen University, Guangzhou 510655, China.
| | - Ting-Yan Xie
- State Key Laboratory of Organ Failure Research, Division of Nephrology, Southern Medical University, Guangzhou 510515, China.
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.
| | - Guang-Lei Xie
- State Key Laboratory of Organ Failure Research, Division of Nephrology, Southern Medical University, Guangzhou 510515, China.
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.
| | - Zhong-Lu Ren
- Center for Systems Medical Genetics, Department of Obstetrics & Gynecology Nanfang Hospital, Southern Medical University, Guangzhou 510515, China.
- Laboratory of Systems Neuroscience, Institute of Mental Health Southern Medical University, Southern Medical University, Guangzhou 510515, China.
| | - Jin-Ming Li
- State Key Laboratory of Organ Failure Research, Division of Nephrology, Southern Medical University, Guangzhou 510515, China.
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
20
|
Christensen AP, Cotter KN, Silvia PJ. Reopening Openness to Experience: A Network Analysis of Four Openness to Experience Inventories. J Pers Assess 2018; 101:574-588. [DOI: 10.1080/00223891.2018.1467428] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
| | | | - Paul J. Silvia
- Department of Psychology, University of North Carolina at Greensboro
| |
Collapse
|
21
|
Gu X, Angelov P, Kangin D, Principe J. Self-Organised direction aware data partitioning algorithm. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2017.09.025] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
22
|
Gong C, Chen H, He W, Zhang Z. Improved multi-objective clustering algorithm using particle swarm optimization. PLoS One 2017; 12:e0188815. [PMID: 29206880 PMCID: PMC5716574 DOI: 10.1371/journal.pone.0188815] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2017] [Accepted: 09/11/2017] [Indexed: 11/20/2022] Open
Abstract
Multi-objective clustering has received widespread attention recently, as it can obtain more accurate and reasonable solution. In this paper, an improved multi-objective clustering framework using particle swarm optimization (IMCPSO) is proposed. Firstly, a novel particle representation for clustering problem is designed to help PSO search clustering solutions in continuous space. Secondly, the distribution of Pareto set is analyzed. The analysis results are applied to the leader selection strategy, and make algorithm avoid trapping in local optimum. Moreover, a clustering solution-improved method is proposed, which can increase the efficiency in searching clustering solution greatly. In the experiments, 28 datasets are used and nine state-of-the-art clustering algorithms are compared, the proposed method is superior to other approaches in the evaluation index ARI.
Collapse
Affiliation(s)
- Congcong Gong
- PLA University of Science and Technology, Nanjing, PR China
| | - Haisong Chen
- PLA University of Science and Technology, Nanjing, PR China
- * E-mail:
| | - Weixiong He
- PLA University of Science and Technology, Nanjing, PR China
| | | |
Collapse
|
23
|
Zhao Y, Forst CV, Sayegh CE, Wang IM, Yang X, Zhang B. Molecular and genetic inflammation networks in major human diseases. MOLECULAR BIOSYSTEMS 2017; 12:2318-41. [PMID: 27303926 DOI: 10.1039/c6mb00240d] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
It has been well-recognized that inflammation alongside tissue repair and damage maintaining tissue homeostasis determines the initiation and progression of complex diseases. Albeit with the accomplishment of having captured the most critical inflammation-involved molecules, genetic susceptibilities, epigenetic factors, and environmental factors, our schemata on the role of inflammation in complex diseases remain largely patchy, in part due to the success of reductionism in terms of research methodology per se. Omics data alongside the advances in data integration technologies have enabled reconstruction of molecular and genetic inflammation networks which shed light on the underlying pathophysiology of complex diseases or clinical conditions. Given the proven beneficial role of anti-inflammation in coronary heart disease as well as other complex diseases and immunotherapy as a revolutionary transition in oncology, it becomes timely to review our current understanding of the molecular and genetic inflammation networks underlying major human diseases. In this review, we first briefly discuss the complexity of infectious diseases and then highlight recently uncovered molecular and genetic inflammation networks in other major human diseases including obesity, type II diabetes, coronary heart disease, late onset Alzheimer's disease, Parkinson's disease, and sporadic cancer. The commonality and specificity of these molecular networks are addressed in the context of genetics based on genome-wide association study (GWAS). The double-sword role of inflammation, such as how the aberrant type 1 and/or type 2 immunity leads to chronic and severe clinical conditions, remains open in terms of the inflammasome and the core inflammatome network features. Increasingly available large Omics and clinical data in tandem with systems biology approaches have offered an exciting yet challenging opportunity toward reconstruction of more comprehensive and dynamic molecular and genetic inflammation networks, which hold great promise in transiting network snapshots to video-style multi-scale interplays of disease mechanisms, in turn leading to effective clinical intervention.
Collapse
Affiliation(s)
- Yongzhong Zhao
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA. and Institute of Genomics and Multiscale Biology, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA
| | - Christian V Forst
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA. and Institute of Genomics and Multiscale Biology, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA
| | - Camil E Sayegh
- Vertex Pharmaceuticals (Canada) Incorporated, 275 Armand-Frappier, Laval, Quebec H7V 4A7, Canada
| | - I-Ming Wang
- Informatics and Analysis, Merck Research Laboratories, Merck & Co., Inc., 770 Sumneytown Pike, West Point, PA 19486, USA.
| | - Xia Yang
- Department of Integrative Biology and Physiology, University of California, Los Angeles, CA 90025, USA.
| | - Bin Zhang
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA. and Institute of Genomics and Multiscale Biology, Mount Sinai School of Medicine, 1425 Madison Avenue, NY 10029, USA
| |
Collapse
|
24
|
Hung FH, Chiu HW. Cancer subtype prediction from a pathway-level perspective by using a support vector machine based on integrated gene expression and protein network. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2017; 141:27-34. [PMID: 28241965 DOI: 10.1016/j.cmpb.2017.01.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2016] [Revised: 01/06/2017] [Accepted: 01/16/2017] [Indexed: 06/06/2023]
Abstract
BACKGROUND AND OBJECTIVE Distinguishing cancer subtypes is critical for selecting the appropriate treatment strategy. Bioinformatics approaches have gradually taken the place of clinical observations and pathological experiments. However, these approaches are typically only used in gene expression profiling. Previous studies have primarily focused on the gene level or specific diseases, and thus pathway-level factors have not been considered. Therefore, a computational method that integrates gene expression and pathway is necessary. METHODS This study presented an approach to determine potential fragments of activated pathways around protein networks in different stages of disease. We used a scored equation that integrates genomic and proteomic information and determined the intensity of the pathway link change. A support vector machine (SVM) was used to train and test subtype-predicted models. RESULTS The performance of the proposed method was evaluated by calculating prediction accuracy. The average prediction accuracy was 67.64% for three subtypes in tumors of neuroepithelial tissues. The results demonstrate that the proposed method applies fewer features than gene expression methods used to obtain similar results CONCLUSIONS: This study suggests a method to implement a cancer subtype classifier based on an SVM from a pathway-level perspective.
Collapse
Affiliation(s)
- Fei-Hung Hung
- Graduate Institute of Biomedical Informatics, Taipei Medical University, 250 Wu-Hsing Street, Taipei 11031, Taiwan
| | - Hung-Wen Chiu
- Graduate Institute of Biomedical Informatics, Taipei Medical University, 250 Wu-Hsing Street, Taipei 11031, Taiwan.
| |
Collapse
|
25
|
Barfuss W, Massara GP, Di Matteo T, Aste T. Parsimonious modeling with information filtering networks. Phys Rev E 2016; 94:062306. [PMID: 28085404 DOI: 10.1103/physreve.94.062306] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2016] [Indexed: 06/06/2023]
Abstract
We introduce a methodology to construct parsimonious probabilistic models. This method makes use of information filtering networks to produce a robust estimate of the global sparse inverse covariance from a simple sum of local inverse covariances computed on small subparts of the network. Being based on local and low-dimensional inversions, this method is computationally very efficient and statistically robust, even for the estimation of inverse covariance of high-dimensional, noisy, and short time series. Applied to financial data our method results are computationally more efficient than state-of-the-art methodologies such as Glasso producing, in a fraction of the computation time, models that can have equivalent or better performances but with a sparser inference structure. We also discuss performances with sparse factor models where we notice that relative performances decrease with the number of factors. The local nature of this approach allows us to perform computations in parallel and provides a tool for dynamical adaptation by partial updating when the properties of some variables change without the need of recomputing the whole model. This makes this approach particularly suitable to handle big data sets with large numbers of variables. Examples of practical application for forecasting, stress testing, and risk allocation in financial systems are also provided.
Collapse
Affiliation(s)
- Wolfram Barfuss
- Department of Physics, FAU Erlangen-Nürnberg, Nägelsbachstrasse 49b, 91052 Erlangen, Germany
| | - Guido Previde Massara
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | - T Di Matteo
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, United Kingdom
- Department of Mathematics, King's College London, The Strand, London, WC2R 2LS, United Kingdom
- Systemic Risk Centre, London School of Economics and Political Sciences, London, WC2A2AE, United Kingdom
| | - Tomaso Aste
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, United Kingdom
- Systemic Risk Centre, London School of Economics and Political Sciences, London, WC2A2AE, United Kingdom
| |
Collapse
|
26
|
Hung FH, Chiu HW. Differentiating disease subtypes by using pathway patterns constructed from gene expressions and protein networks. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2016; 2015:6519-22. [PMID: 26737786 DOI: 10.1109/embc.2015.7319886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Gene expression profiles differ in different diseases. Even if diseases are at the same stage, such diseases exhibit different gene expressions, not to mention the different subtypes at a single lesion site. Distinguishing different disease subtypes at a single lesion site is difficult. In early cases, subtypes were initially distinguished by doctors. Subsequently, further differences were found through pathological experiments. For example, a brain tumor can be classified according to its origin, its cell-type origin, or the tumor site. Because of the advancements in bioinformatics and the techniques for accumulating gene expressions, researchers can use gene expression data to classify disease subtypes. Because the operation of a biopathway is closely related to the disease mechanism, the application of gene expression profiles for clustering disease subtypes is insufficient. In this study, we collected gene expression data of healthy and four myelodysplastic syndrome subtypes and applied a method that integrated protein-protein interaction and gene expression data to identify different patterns of disease subtypes. We hope it is efficient for the classification of disease subtypes in adventure.
Collapse
|
27
|
Donnat P, Marti G, Very P. Toward a generic representation of random variables for machine learning. Pattern Recognit Lett 2016. [DOI: 10.1016/j.patrec.2015.11.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
28
|
Song WM, Zhang B. Multiscale Embedded Gene Co-expression Network Analysis. PLoS Comput Biol 2015; 11:e1004574. [PMID: 26618778 PMCID: PMC4664553 DOI: 10.1371/journal.pcbi.1004574] [Citation(s) in RCA: 195] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Accepted: 09/24/2015] [Indexed: 02/02/2023] Open
Abstract
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Collapse
Affiliation(s)
- Won-Min Song
- Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Bin Zhang
- Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| |
Collapse
|
29
|
Pathogenesis of progressive scarring trachoma in Ethiopia and Tanzania and its implications for disease control: two cohort studies. PLoS Negl Trop Dis 2015; 9:e0003763. [PMID: 25970613 PMCID: PMC4430253 DOI: 10.1371/journal.pntd.0003763] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 04/15/2015] [Indexed: 11/19/2022] Open
Abstract
Background Trachoma causes blindness through a conjunctival scarring process initiated by ocular Chlamydia trachomatis infection; however, the rates, drivers and pathophysiological determinants are poorly understood. We investigated progressive scarring and its relationship to conjunctival infection, inflammation and transcript levels of cytokines and fibrogenic factors. Methodology/Principal Findings We recruited two cohorts, one each in Ethiopia and Tanzania, of individuals with established trachomatous conjunctival scarring. They were followed six-monthly for two years, with clinical examinations and conjunctival swab sample collection. Progressive scarring cases were identified by comparing baseline and two-year photographs, and compared to individuals without progression. Samples were tested for C. trachomatis by PCR and transcript levels of S100A7, IL1B, IL13, IL17A, CXCL5, CTGF, SPARCL1, CEACAM5, MMP7, MMP9 and CD83 were estimated by quantitative RT-PCR. Progressive scarring was found in 135/585 (23.1%) of Ethiopian participants and 173/577 (30.0%) of Tanzanian participants. There was a strong relationship between progressive scarring and increasing inflammatory episodes (Ethiopia: OR 5.93, 95%CI 3.31–10.6, p<0.0001. Tanzania: OR 5.76, 95%CI 2.60–12.7, p<0.0001). No episodes of C. trachomatis infection were detected in the Ethiopian cohort and only 5 episodes in the Tanzanian cohort. Clinical inflammation, but not scarring progression, was associated with increased expression of S100A7, IL1B, IL17A, CXCL5, CTGF, CEACAM5, MMP7, CD83 and reduced SPARCL1. Conclusions/Significance Scarring progressed in the absence of detectable C. trachomatis, which raises uncertainty about the primary drivers of late-stage trachoma. Chronic conjunctival inflammation appears to be central and is associated with enriched expression of pro-inflammatory factors and altered expression of extracellular matrix regulators. Host determinants of scarring progression appear more complex and subtle than the features of inflammation. Overall this indicates a potential role for anti-inflammatory interventions to interrupt progression and the need for trichiasis disease surveillance and surgery long after chlamydial infection has been controlled at community level. Blinding trachoma is believed to be the end result of a long-term progressive scarring process that is initiated by recurrent infection by the bacterium Chlamydia trachomatis starting in childhood. Scar tissue predominantly develops on the inner surface of the upper eyelids (conjunctiva). However, the rates, drivers and pathophysiological determinants are poorly understood. We investigated progressive scarring and its relationship to conjunctival infection, inflammation and transcript levels of cytokines and fibrogenic factors in two cohorts of adults in Tanzania and Ethiopia. These groups of people already had a degree of scarring and were regularly followed-up with over a two-year period. We found scarring progressed in about a quarter of people over this time. The progression was closely associated with episodes of conjunctival inflammation but not to the detection of C. trachomatis infection. This raises uncertainty about the primary drivers of late-stage trachoma. Chronic conjunctival inflammation appears to be central and is associated with enriched expression of pro-inflammatory factors and altered expression of extracellular matrix regulators. Host determinants of scarring progression appear more complex and subtle than the features of inflammation. Overall this indicates the likely need for trichiasis disease surveillance and surgery long after chlamydial infection has been controlled at community level.
Collapse
|
30
|
Musmeci N, Aste T, Di Matteo T. Relation between financial market structure and the real economy: comparison between clustering methods. PLoS One 2015; 10:e0116201. [PMID: 25786703 PMCID: PMC4365074 DOI: 10.1371/journal.pone.0116201] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2014] [Accepted: 12/07/2014] [Indexed: 12/04/2022] Open
Abstract
We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing the clustering structure with the underlying industrial activity classification. We apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. By taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover, we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging.
Collapse
Affiliation(s)
- Nicoló Musmeci
- Department of Mathematics, King’s College London, The Strand, London, WC2R 2LS, UK
| | - Tomaso Aste
- Department of Computer Science, UCL, Gower Street, London, WC1E 6BT, UK
- Systemic Risk Centre, London School of Economics and Political Sciences, London, WC2A2AE, UK
- * E-mail:
| | - T. Di Matteo
- Department of Mathematics, King’s College London, The Strand, London, WC2R 2LS, UK
| |
Collapse
|
31
|
Dependency structure and scaling properties of financial time series are related. Sci Rep 2014; 4:4589. [PMID: 24699417 PMCID: PMC3980463 DOI: 10.1038/srep04589] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Accepted: 03/10/2014] [Indexed: 12/03/2022] Open
Abstract
We report evidence of a deep interplay between cross-correlations hierarchical properties and multifractality of New York Stock Exchange daily stock returns. The degree of multifractality displayed by different stocks is found to be positively correlated to their depth in the hierarchy of cross-correlations. We propose a dynamical model that reproduces this observation along with an array of other empirical properties. The structure of this model is such that the hierarchical structure of heterogeneous risks plays a crucial role in the time evolution of the correlation matrix, providing an interpretation to the mechanism behind the interplay between cross-correlation and multifractality in financial markets, where the degree of multifractality of stocks is associated to their hierarchical positioning in the cross-correlation structure. Empirical observations reported in this paper present a new perspective towards the merging of univariate multi scaling and multivariate cross-correlation properties of financial time series.
Collapse
|
32
|
Pozzi F, Di Matteo T, Aste T. Spread of risk across financial markets: better to invest in the peripheries. Sci Rep 2013; 3:1665. [PMID: 23588852 PMCID: PMC3627193 DOI: 10.1038/srep01665] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Accepted: 03/28/2013] [Indexed: 11/09/2022] Open
Abstract
Risk is not uniformly spread across financial markets and this fact can be exploited to reduce investment risk contributing to improve global financial stability. We discuss how, by extracting the dependency structure of financial equities, a network approach can be used to build a well-diversified portfolio that effectively reduces investment risk. We find that investments in stocks that occupy peripheral, poorly connected regions in financial filtered networks, namely Minimum Spanning Trees and Planar Maximally Filtered Graphs, are most successful in diversifying, improving the ratio between returns' average and standard deviation, reducing the likelihood of negative returns, while keeping profits in line with the general market average even for small baskets of stocks. On the contrary, investments in subsets of central, highly connected stocks are characterized by greater risk and worse performance. This methodology has the added advantage of visualizing portfolio choices directly over the graphic layout of the network.
Collapse
Affiliation(s)
- F Pozzi
- Department of Applied Mathematics, Research School of Physical Sciences, The Australian National University, 0200 Canberra, ACT, Australia
| | | | | |
Collapse
|
33
|
Petri G, Scolamiero M, Donato I, Vaccarino F. Topological Strata of Weighted Complex Networks. PLoS One 2013; 8:e66506. [PMID: 23805226 PMCID: PMC3689815 DOI: 10.1371/journal.pone.0066506] [Citation(s) in RCA: 114] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2013] [Accepted: 05/07/2013] [Indexed: 11/18/2022] Open
Abstract
The statistical mechanical approach to complex networks is the dominant paradigm in describing natural and societal complex systems. The study of network properties, and their implications on dynamical processes, mostly focus on locally defined quantities of nodes and edges, such as node degrees, edge weights and -more recently- correlations between neighboring nodes. However, statistical methods quickly become cumbersome when dealing with many-body properties and do not capture the precise mesoscopic structure of complex networks. Here we introduce a novel method, based on persistent homology, to detect particular non-local structures, akin to weighted holes within the link-weight network fabric, which are invisible to existing methods. Their properties divide weighted networks in two broad classes: one is characterized by small hierarchically nested holes, while the second displays larger and longer living inhomogeneities. These classes cannot be reduced to known local or quasilocal network properties, because of the intrinsic non-locality of homological properties, and thus yield a new classification built on high order coordination patterns. Our results show that topology can provide novel insights relevant for many-body interactions in social and spatial networks. Moreover, this new method creates the first bridge between network theory and algebraic topology, which will allow to import the toolset of algebraic methods to complex systems.
Collapse
Affiliation(s)
| | - Martina Scolamiero
- ISI Foundation, Torino, Italy
- Dipartimento di Ingegneria Gestionale e della Produzione, Politecnico di Torino, Torino, Italy
| | - Irene Donato
- ISI Foundation, Torino, Italy
- Dipartimento di Scienze Matematiche, Politecnico di Torino, Torino, Italy
| | - Francesco Vaccarino
- ISI Foundation, Torino, Italy
- Dipartimento di Scienze Matematiche, Politecnico di Torino, Torino, Italy
| |
Collapse
|
34
|
Wu MY, Dai DQ, Zhang XF, Zhu Y. Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. PLoS One 2013; 8:e66256. [PMID: 23799085 PMCID: PMC3684607 DOI: 10.1371/journal.pone.0066256] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Accepted: 05/02/2013] [Indexed: 11/29/2022] Open
Abstract
In cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical reasons. These undiscovered subtypes may be heterogeneous with respect to underlying networks or pathways, and are related with only a few of interdependent biomarkers. This motivates a need for the robust gene expression-based methods capable of discovering such subtypes, elucidating the corresponding network structures and identifying cancer related biomarkers. This study proposes a penalized model-based Student’s t clustering with unconstrained covariance (PMT-UC) to discover cancer subtypes with cluster-specific networks, taking gene dependencies into account and having robustness against outliers. Meanwhile, biomarker identification and network reconstruction are achieved by imposing an adaptive penalty on the means and the inverse scale matrices. The model is fitted via the expectation maximization algorithm utilizing the graphical lasso. Here, a network-based gene selection criterion that identifies biomarkers not as individual genes but as subnetworks is applied. This allows us to implicate low discriminative biomarkers which play a central role in the subnetwork by interconnecting many differentially expressed genes, or have cluster-specific underlying network structures. Experiment results on simulated datasets and one available cancer dataset attest to the effectiveness, robustness of PMT-UC in cancer subtype discovering. Moveover, PMT-UC has the ability to select cancer related biomarkers which have been verified in biochemical or biomedical research and learn the biological significant correlation among genes.
Collapse
Affiliation(s)
- Meng-Yun Wu
- Center for Computer Vision and Department of Mathematics, Sun Yat-Sen University, Guangzhou, China
| | - Dao-Qing Dai
- Center for Computer Vision and Department of Mathematics, Sun Yat-Sen University, Guangzhou, China
- * E-mail:
| | - Xiao-Fei Zhang
- Center for Computer Vision and Department of Mathematics, Sun Yat-Sen University, Guangzhou, China
| | - Yuan Zhu
- Center for Computer Vision and Department of Mathematics, Sun Yat-Sen University, Guangzhou, China
- Department of Mathematics, Guangdong University of Business Studies, Guangzhou, China
| |
Collapse
|
35
|
Cui Y, Zheng CH, Yang J. Identifying subspace gene clusters from microarray data using low-rank representation. PLoS One 2013; 8:e59377. [PMID: 23527177 PMCID: PMC3602020 DOI: 10.1371/journal.pone.0059377] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Accepted: 02/13/2013] [Indexed: 12/23/2022] Open
Abstract
Identifying subspace gene clusters from the gene expression data is useful for discovering novel functional gene interactions. In this paper, we propose to use low-rank representation (LRR) to identify the subspace gene clusters from microarray data. LRR seeks the lowest-rank representation among all the candidates that can represent the genes as linear combinations of the bases in the dataset. The clusters can be extracted based on the block diagonal representation matrix obtained using LRR, and they can well capture the intrinsic patterns of genes with similar functions. Meanwhile, the parameter of LRR can balance the effect of noise so that the method is capable of extracting useful information from the data with high level of background noise. Compared with traditional methods, our approach can identify genes with similar functions yet without similar expression profiles. Also, it could assign one gene into different clusters. Moreover, our method is robust to the noise and can identify more biologically relevant gene clusters. When applied to three public datasets, the results show that the LRR based method is superior to existing methods for identifying subspace gene clusters.
Collapse
Affiliation(s)
- Yan Cui
- School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing, Jiangsu, China
| | - Chun-Hou Zheng
- College of Electrical Engineering and Automation, Anhui University, Hefei, Anhui, China
| | - Jian Yang
- School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing, Jiangsu, China
- * E-mail:
| |
Collapse
|
36
|
Hoppe A. What mRNA Abundances Can Tell us about Metabolism. Metabolites 2012; 2:614-31. [PMID: 24957650 PMCID: PMC3901220 DOI: 10.3390/metabo2030614] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2012] [Revised: 08/24/2012] [Accepted: 09/04/2012] [Indexed: 01/23/2023] Open
Abstract
Inferring decreased or increased metabolic functions from transcript profiles is at first sight a bold and speculative attempt because of the functional layers in between: proteins, enzymatic activities, and reaction fluxes. However, the growing interest in this field can easily be explained by two facts: the high quality of genome-scale metabolic network reconstructions and the highly developed technology to obtain genome-covering RNA profiles. Here, an overview of important algorithmic approaches is given by means of criteria by which published procedures can be classified. The frontiers of the methods are sketched and critical voices are being heard. Finally, an outlook for the prospects of the field is given.
Collapse
Affiliation(s)
- Andreas Hoppe
- Institute for Biochemistry, Charité University Medicine Berlin, Charitéplatz 1, Berlin 10117, Germany.
| |
Collapse
|
37
|
Aste T, Gramatica R, Di Matteo T. Exploring complex networks via topological embedding on surfaces. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:036109. [PMID: 23030982 DOI: 10.1103/physreve.86.036109] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2011] [Revised: 07/21/2012] [Indexed: 05/15/2023]
Abstract
We demonstrate that graphs embedded on surfaces are a powerful and practical tool to generate, to characterize, and to simulate networks with a broad range of properties. Any network can be embedded on a surface with sufficiently high genus and therefore the study of topologically embedded graphs is non-restrictive. We show that the local properties of the network are affected by the surface genus which determines the average degree, which influences the degree distribution, and which controls the clustering coefficient. The global properties of the graph are also strongly affected by the surface genus which is constraining the degree of interwovenness, changing the scaling properties of the network from large-world kind (small genus) to small- and ultrasmall-world kind (large genus). Two elementary moves allow the exploration of all networks embeddable on a given surface and naturally introduce a tool to develop a statistical mechanics description for these networks. Within such a framework, we study the properties of topologically embedded graphs which dynamically tend to lower their energy towards a ground state with a given reference degree distribution. We show that the cooling dynamics between high and low "temperatures" is strongly affected by the surface genus with the manifestation of a glass-like transition occurring when the distance from the reference distribution is low. We prove, with examples, that topologically embedded graphs can be built in a way to contain arbitrary complex networks as subgraphs. This method opens a new avenue to build geometrically embedded networks on hyperbolic manifolds.
Collapse
Affiliation(s)
- Tomaso Aste
- School of Physical Sciences, University of Kent, CT2 7NZ, United Kingdom.
| | | | | |
Collapse
|
38
|
Song WM, Di Matteo T, Aste T. Building complex networks with Platonic solids. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:046115. [PMID: 22680546 DOI: 10.1103/physreve.85.046115] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Indexed: 06/01/2023]
Abstract
We propose a unified model to build planar graphs with diverse topological characteristics which are of relevance in real applications. Here convex regular polyhedra (Platonic solids) are used as the building blocks for the construction of a variety of complex planar networks. These networks are obtained by merging polyhedra face by face on a tree-structure leading to planar graphs. We investigate two different constructions: (1) a fully deterministic construction where a self-similar fractal structure is built by using a single kind of polyhedron which is iteratively attached to every face and (2) a stochastic construction where at each step a polyhedron is attached to a randomly chosen face. These networks are scale-free, small-world, clustered, and sparse, sharing several characteristics of real-world complex networks. We derive analytical expressions for the degree distribution, the clustering coefficient, and the mean degree of nearest neighbors showing that these networks have power-law degree distributions with tunable exponents associated with the building polyhedron, and they possess a hierarchical organization that is determined by planarity.
Collapse
Affiliation(s)
- Won-Min Song
- Department of Applied Mathematics, Research School of Physics and Engineering, Australian National University, Canberra, ACT 0200, Australia.
| | | | | |
Collapse
|