1
|
Wu Y, Hu L, Hu J. Modeling Tree-like Heterophily on Symmetric Matrix Manifolds. ENTROPY (BASEL, SWITZERLAND) 2024; 26:377. [PMID: 38785627 PMCID: PMC11120610 DOI: 10.3390/e26050377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 04/14/2024] [Accepted: 04/25/2024] [Indexed: 05/25/2024]
Abstract
Tree-like structures, characterized by hierarchical relationships and power-law distributions, are prevalent in a multitude of real-world networks, ranging from social networks to citation networks and protein-protein interaction networks. Recently, there has been significant interest in utilizing hyperbolic space to model these structures, owing to its capability to represent them with diminished distortions compared to flat Euclidean space. However, real-world networks often display a blend of flat, tree-like, and circular substructures, resulting in heterophily. To address this diversity of substructures, this study aims to investigate the reconstruction of graph neural networks on the symmetric manifold, which offers a comprehensive geometric space for more effective modeling of tree-like heterophily. To achieve this objective, we propose a graph convolutional neural network operating on the symmetric positive-definite matrix manifold, leveraging Riemannian metrics to facilitate the scheme of information propagation. Extensive experiments conducted on semi-supervised node classification tasks validate the superiority of the proposed approach, demonstrating that it outperforms comparative models based on Euclidean and hyperbolic geometries.
Collapse
Affiliation(s)
| | | | - Juncheng Hu
- College of Computer Science and Technology, Jilin University, Changchun 130012, China; (Y.W.); (L.H.)
| |
Collapse
|
2
|
Wei R, Liu Y, Song J, Xie Y, Zhou K. Exploring Hierarchical Information in Hyperbolic Space for Self-Supervised Image Hashing. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024; 33:1768-1781. [PMID: 38442063 DOI: 10.1109/tip.2024.3371358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
In real-world datasets, visually related images often form clusters, and these clusters can be further grouped into larger categories with more general semantics. These inherent hierarchical structures can help capture the underlying distribution of data, making it easier to learn robust hash codes that lead to better retrieval performance. However, existing methods fail to make use of this hierarchical information, which in turn prevents the accurate preservation of relationships between data points in the learned hash codes, resulting in suboptimal performance. In this paper, our focus is on applying visual hierarchical information to self-supervised hash learning and addressing three key challenges, including the construction, embedding, and exploitation of visual hierarchies. We propose a new self-supervised hashing method named Hierarchical Hyperbolic Contrastive Hashing (HHCH), making breakthroughs in three aspects. First, we propose to embed continuous hash codes into hyperbolic space for accurate semantic expression since embedding hierarchies in the hyperbolic space generates less distortion than in the hyper-sphere or Euclidean space. Second, we update the K-Means algorithm to make it run in the hyperbolic space. The proposed hierarchical hyperbolic K-Means algorithm can achieve the adaptive construction of hierarchical semantic structures. Last but not least, to exploit the hierarchical semantic structures in hyperbolic space, we propose the hierarchical contrastive learning algorithm, including hierarchical instance-wise and hierarchical prototype-wise contrastive learning. Extensive experiments on four benchmark datasets demonstrate that the proposed method outperforms state-of-the-art self-supervised hashing methods. Our codes are released at https://github.com/HUST-IDSM-AI/HHCH.git.
Collapse
|
3
|
Ramazi S, Tabatabaei SAH, Khalili E, Nia AG, Motarjem K. Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences. Database (Oxford) 2024; 2024:baad094. [PMID: 38245002 PMCID: PMC10799748 DOI: 10.1093/database/baad094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 11/30/2023] [Accepted: 12/20/2023] [Indexed: 01/22/2024]
Abstract
The post-translational modifications occur as crucial molecular regulatory mechanisms utilized to regulate diverse cellular processes. Malonylation of proteins, a reversible post-translational modification of lysine/k residues, is linked to a variety of biological functions, such as cellular regulation and pathogenesis. This modification plays a crucial role in metabolic pathways, mitochondrial functions, fatty acid oxidation and other life processes. However, accurately identifying malonylation sites is crucial to understand the molecular mechanism of malonylation, and the experimental identification can be a challenging and costly task. Recently, approaches based on machine learning (ML) have been suggested to address this issue. It has been demonstrated that these procedures improve accuracy while lowering costs and time constraints. However, these approaches also have specific shortcomings, including inappropriate feature extraction out of protein sequences, high-dimensional features and inefficient underlying classifiers. As a result, there is an urgent need for effective predictors and calculation methods. In this study, we provide a comprehensive analysis and review of existing prediction models, tools and benchmark datasets for predicting malonylation sites in protein sequences followed by a comparison study. The review consists of the specifications of benchmark datasets, explanation of features and encoding methods, descriptions of the predictions approaches and their embedding ML or deep learning models and the description and comparison of the existing tools in this domain. To evaluate and compare the prediction capability of the tools, a new bunch of data has been extracted based on the most updated database and the tools have been assessed based on the extracted data. Finally, a hybrid architecture consisting of several classifiers including classical ML models and a deep learning model has been proposed to ensemble the prediction results. This approach demonstrates the better performance in comparison with all prediction tools included in this study (the source codes of the models presented in this manuscript are available in https://github.com/Malonylation). Database URL: https://github.com/A-Golshan/Malonylation.
Collapse
Affiliation(s)
| | - Seyed Amir Hossein Tabatabaei
- Department of Computer Science, Faculty of Mathematical Sciences, University of Guilan, Namjoo St. Postal, Rasht 41938-33697, Iran
- Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Jalal AleAhmad, Tehran 14117-13116, Iran
| | - Elham Khalili
- Department of Plant Sciences, Faculty of Science, Tarbiat Modares University, Jalal AleAhmad, Tehran 14117-13116, Iran
| | - Amirhossein Golshan Nia
- Department of Mathematics and Computer Science, Amirkabir University of Technology, No. 350, Hafez Ave, Tehran 15916-34311, Iran
| | - Kiomars Motarjem
- Department of Statistics, Faculty of Mathematical Sciences, Tarbiat Modares University, Jalal AleAhmad, Tehran 14117-13116, Iran
| |
Collapse
|
4
|
Petoukhov SV. The principle "like begets like" in algebra-matrix genetics and code biology. Biosystems 2023; 233:105019. [PMID: 37690530 DOI: 10.1016/j.biosystems.2023.105019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 09/01/2023] [Accepted: 09/01/2023] [Indexed: 09/12/2023]
Abstract
The article is devoted to analysis of emergent properties of the system of binary oppositions in the genetic code ensemble. The epochal model of the double helix of DNA by Watson and Crick showed that the multiple reproduction of genetic information on DNA strands uses the ancient principle "like begets like" based on the simple complementarity in pairs of nucleobases. Each of these pairs is built on the binary opposition "purine-pyrimidine". But the system of DNA n-plet alphabets and genetic coding is much richer in types of binary oppositions, which also have some coding meanings related to this principle. The article contains the results of the application of the author's "method of hierarchy binary stochastics" (HBS-method) to the analysis of the quasi-stochastic organization of binary sequences of hydrogen bonds in genomic single-stranded DNAs. This analysis revealed hidden probability rules related to dichotomous fractal-like probability trees. The relationship between inherited bodily dichotomies in living organisms and the discovered probability dichotomies in information sequences of genomic DNAs is discussed. The encoding properties of molecular binary oppositions in the DNA nucleotide system allows the algorithmic construction of (2n∗2n)-matrices of probabilities of n-plets in these binary sequences, which are matrix representations of 2n-dimensional hyperbolic numbers. Connections of these multidimensional numbers with some inherited physiological phenomena and deep neural networks are noted. A unified algebra-numeric certification of the DNAs of genomes and genes - based on these multidimensional numerical systems - is proposed.
Collapse
Affiliation(s)
- Sergey V Petoukhov
- Mechanical Engineering Research Institute of Russian Academy of Sciences, M. Kharitonievskiy pereulok, 4, 101990, Moscow, Russia.
| |
Collapse
|
5
|
Wang Z, Lu H, Yan H, Kan H, Jin L. Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy. Sci Rep 2023; 13:11178. [PMID: 37429966 DOI: 10.1038/s41598-023-38320-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 07/06/2023] [Indexed: 07/12/2023] Open
Abstract
Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model's performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.
Collapse
Affiliation(s)
- Zijian Wang
- School of Medicine and Information Engineering, Anhui University of Chinese Medicine, Hefei, 230012, China
- Hefei University of Technology, Hefei, 230009, China
| | - Haimei Lu
- School of Basic Medical Sciences, Anhui Medical University, Hefei, 230032, China
| | - Haixin Yan
- Hefei University of Technology, Hefei, 230009, China
| | - Hongxing Kan
- School of Medicine and Information Engineering, Anhui University of Chinese Medicine, Hefei, 230012, China
| | - Li Jin
- School of Medicine and Information Engineering, Anhui University of Chinese Medicine, Hefei, 230012, China.
| |
Collapse
|
6
|
Xu Y, Zang Z, Xia J, Tan C, Geng Y, Li SZ. Structure-preserving visualization for single-cell RNA-Seq profiles using deep manifold transformation with batch-correction. Commun Biol 2023; 6:369. [PMID: 37016133 PMCID: PMC10073100 DOI: 10.1038/s42003-023-04662-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 03/06/2023] [Indexed: 04/06/2023] Open
Abstract
Dimensionality reduction and visualization play an important role in biological data analysis, such as data interpretation of single-cell RNA sequences (scRNA-seq). It is desired to have a visualization method that can not only be applicable to various application scenarios, including cell clustering and trajectory inference, but also satisfy a variety of technical requirements, especially the ability to preserve inherent structure of data and handle with batch effects. However, no existing methods can accommodate these requirements in a unified framework. In this paper, we propose a general visualization method, deep visualization (DV), that possesses the ability to preserve inherent structure of data and handle batch effects and is applicable to a variety of datasets from different application domains and dataset scales. The method embeds a given dataset into a 2- or 3-dimensional visualization space, with either a Euclidean or hyperbolic metric depending on a specified task type with type static (at a time point) or dynamic (at a sequence of time points) scRNA-seq data, respectively. Specifically, DV learns a structure graph to describe the relationships between data samples, transforms the data into visualization space while preserving the geometric structure of the data and correcting batch effects in an end-to-end manner. The experimental results on nine datasets in complex tissue from human patients or animal development demonstrate the competitiveness of DV in discovering complex cellular relations, uncovering temporal trajectories, and addressing complex batch factors. We also provide a preliminary attempt to pre-train a DV model for visualization of new incoming data.
Collapse
Affiliation(s)
- Yongjie Xu
- Zhejiang University, Hangzhou, 310058, China
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China
| | - Zelin Zang
- Zhejiang University, Hangzhou, 310058, China
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China
| | - Jun Xia
- Zhejiang University, Hangzhou, 310058, China
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China
| | - Cheng Tan
- Zhejiang University, Hangzhou, 310058, China
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China
| | - Yulan Geng
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China
| | - Stan Z Li
- AI Division, School of Engineering, Westlake University, Hangzhou, 310024, China.
| |
Collapse
|
7
|
Gao Z, Wu Y, Harandi M, Jia Y. Curvature-Adaptive Meta-Learning for Fast Adaptation to Manifold Data. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:1545-1562. [PMID: 35380955 DOI: 10.1109/tpami.2022.3164894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Meta-learning methods are shown to be effective in quickly adapting a model to novel tasks. Most existing meta-learning methods represent data and carry out fast adaptation in euclidean space. In fact, data of real-world applications usually resides in complex and various Riemannian manifolds. In this paper, we propose a curvature-adaptive meta-learning method that achieves fast adaptation to manifold data by producing suitable curvature. Specifically, we represent data in the product manifold of multiple constant curvature spaces and build a product manifold neural network as the base-learner. In this way, our method is capable of encoding complex manifold data into discriminative and generic representations. Then, we introduce curvature generation and curvature updating schemes, through which suitable product manifolds for various forms of data manifolds are constructed via few optimization steps. The curvature generation scheme identifies task-specific curvature initialization, leading to a shorter optimization trajectory. The curvature updating scheme automatically produces appropriate learning rate and search direction for curvature, making a faster and more adaptive optimization paradigm compared to hand-designed optimization schemes. We evaluate our method on a broad set of problems including few-shot classification, few-shot regression, and reinforcement learning tasks. Experimental results show that our method achieves substantial improvements as compared to meta-learning methods ignoring the geometry of the underlying space.
Collapse
|
8
|
Jia Y, Lin M, Wang Y, Li J, Chen K, Siebert J, Zhang GZ, Liao Q. Extrapolation over temporal knowledge graph via hyperbolic embedding. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2023. [DOI: 10.1049/cit2.12186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Affiliation(s)
- Yan Jia
- Pencheng Laboratory Harbin Institute of Technology (Shenzhen) Shenzhen China
| | - Menqi Lin
- Harbin Institute of Technology (Shenzhen) Shenzhen China
| | - Ye Wang
- Harbin Institute of Technology (Shenzhen) Shenzhen China
- National University of Defense Technology Changsha China
| | - Jianming Li
- Harbin Institute of Technology (Shenzhen) Shenzhen China
| | - Kai Chen
- National University of Defense Technology Changsha China
| | - Joanna Siebert
- Harbin Institute of Technology (Shenzhen) Shenzhen China
| | | | - Qing Liao
- Pencheng Laboratory Harbin Institute of Technology (Shenzhen) Shenzhen China
| |
Collapse
|
9
|
Geometry interaction network alignment. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.06.077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
10
|
Luo Q, Yu D, Maradapu Vera Venkata Sai A, Cai Z, Cheng X. A survey of structural representation learning for social networks. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|