1
|
Lei Y, Meng Y, Guo X, Ning K, Bian Y, Li L, Hu Z, Anashkina AA, Jiang Q, Dong Y, Zhu X. Overview of structural variation calling: Simulation, identification, and visualization. Comput Biol Med 2022; 145:105534. [DOI: 10.1016/j.compbiomed.2022.105534] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 04/09/2022] [Accepted: 04/14/2022] [Indexed: 12/11/2022]
|
2
|
Cornman RS, Cryan PM. Positively selected genes in the hoary bat ( Lasiurus cinereus) lineage: prominence of thymus expression, immune and metabolic function, and regions of ancient synteny. PeerJ 2022; 10:e13130. [PMID: 35317076 PMCID: PMC8934532 DOI: 10.7717/peerj.13130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 02/25/2022] [Indexed: 01/12/2023] Open
Abstract
Background Bats of the genus Lasiurus occur throughout the Americas and have diversified into at least 20 species among three subgenera. The hoary bat (Lasiurus cinereus) is highly migratory and ranges farther across North America than any other wild mammal. Despite the ecological importance of this species as a major insect predator, and the particular susceptibility of lasiurine bats to wind turbine strikes, our understanding of hoary bat ecology, physiology, and behavior remains poor. Methods To better understand adaptive evolution in this lineage, we used whole-genome sequencing to identify protein-coding sequence and explore signatures of positive selection. Gene models were predicted with Maker and compared to seven well-annotated and phylogenetically representative species. Evolutionary rate analysis was performed with PAML. Results Of 9,447 single-copy orthologous groups that met evaluation criteria, 150 genes had a significant excess of nonsynonymous substitutions along the L. cinereus branch (P < 0.001 after manual review of alignments). Selected genes as a group had biased expression, most strongly in thymus tissue. We identified 23 selected genes with reported immune functions as well as a divergent paralog of Steep1 within suborder Yangochiroptera. Seventeen genes had roles in lipid and glucose metabolic pathways, partially overlapping with 15 mitochondrion-associated genes; these adaptations may reflect the metabolic challenges of hibernation, long-distance migration, and seasonal variation in prey abundance. The genomic distribution of positively selected genes differed significantly from background expectation by discrete Kolmogorov-Smirnov test (P < 0.001). Remarkably, the top three physical clusters all coincided with islands of conserved synteny predating Mammalia, the largest of which shares synteny with the human cat-eye critical region (CECR) on 22q11. This observation coupled with the expansion of a novel Tbx1-like gene family may indicate evolutionary innovation during pharyngeal arch development: both the CECR and Tbx1 cause dosage-dependent congenital abnormalities in thymus, heart, and head, and craniodysmorphy is associated with human orthologs of other positively selected genes as well.
Collapse
|
3
|
Xue F, Gao L, Chen T, Chen H, Zhang H, Wang T, Han Z, Gao S, Wang L, Hu Y, Tang J, Huang L, Liu G, Zhang Y. Parkinson's Disease rs117896735 Variant Regulates INPP5F Expression in Brain Tissues and Increases Risk of Alzheimer's Disease. J Alzheimers Dis 2022; 89:67-77. [PMID: 35848021 DOI: 10.3233/jad-220086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Both INPP5D and INPP5F are members of INPP5 family. INPP5F rs117896735 variant was associated with Parkinson's disease (PD) risk, and INPP5D was an Alzheimer's disease (AD) risk gene. However, it remains unclear about the roles of INPP5F rs117896735 variant in AD. OBJECTIVE We aim to investigate the roles of rs117896735 in AD. METHODS First, we conducted a candidate variant study to evaluate the association of rs117896735 variant with AD risk using the large-scale AD GWAS dataset. Second, we conducted a gene expression analysis of INPP5F to investigate the expression difference of INPP5F in different human tissues using two large-scale gene expression datasets. Third, we conducted an expression quantitative trait loci analysis to evaluate whether rs117896735 variant regulate the expression of INPP5F. Fourth, we explore the potentially differential expression of INPP5F in AD and control using multiple AD-control gene expression datasets in human brain tissues and whole blood. RESULTS We found that 1) rs117896735 A allele was associated with the increased risk of AD with OR = 1.15, 95% CI 1.005-1.315, p = 0.042; 2) rs117896735 A allele could increase INPP5F expression in multiple human tissues; 3) INPP5F showed different expression in different human tissues, especially in brain tissues; 4) INPP5F showed significant expression dysregulation in AD compared with controls in human brain tissues. CONCLUSION Conclusion: We demonstrate that PD rs117896735 variant could regulate INPP5F expression in brain tissues and increase the risk of AD. These finding may provide important information about the role of rs117896735 in AD.
Collapse
Affiliation(s)
- Feng Xue
- Department of Neurosurgery, Tianjin Hospital of ITCWM Nan Kai Hospital, Tianjin, China
| | - Luyan Gao
- Department of Neurology, Tianjin Fourth Central Hospital, The Fourth Central Hospital Affiliated to Nankai University, The Fourth Central Clinical College of Tianjin Medical University, Tianjin, China
| | - TingTing Chen
- Department of Oncology, Tianjin Hospital of ITCWM Nan Kai Hospital, Tianjin, China
| | - Hongyuan Chen
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Haihua Zhang
- Beijing Institute of Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, China
| | - Tao Wang
- Chinese Institute for Brain Research, Beijing, China
| | - Zhifa Han
- School of Medicine, School of Pharmaceutical Sciences, THU-PKU Center for Life Sciences, Tsinghua University, Beijing, China
| | - Shan Gao
- Beijing Institute of Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, China
| | - Longcai Wang
- Department of Anesthesiology, The Affiliated Hospital of Weifang Medical University, Weifang, China
| | - Yang Hu
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Jiangwei Tang
- Department of Neurology, Tianjin Fourth Central Hospital, The Fourth Central Hospital Affiliated to Nankai University, The Fourth Central Clinical College of Tianjin Medical University, Tianjin, China
| | - Lei Huang
- Department of Neurology, Tianjin Fourth Central Hospital, The Fourth Central Hospital Affiliated to Nankai University, The Fourth Central Clinical College of Tianjin Medical University, Tianjin, China
| | - Guiyou Liu
- Beijing Institute of Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, China
- Chinese Institute for Brain Research, Beijing, China
- Beijing Key Laboratory of Hypoxia Translational Medicine, National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Yan Zhang
- Department of Pathology, The Affiliated Hospital of Weifang Medical University, Weifang, China
| |
Collapse
|
4
|
Guo X, Zhou W, Yu Y, Cai Y, Zhang Y, Du A, Lu Q, Ding Y, Li C. Multiple Laplacian Regularized RBF Neural Network for Assessing Dry Weight of Patients With End-Stage Renal Disease. Front Physiol 2021; 12:790086. [PMID: 34966294 PMCID: PMC8711098 DOI: 10.3389/fphys.2021.790086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 11/17/2021] [Indexed: 11/28/2022] Open
Abstract
Dry weight (DW) is an important dialysis index for patients with end-stage renal disease. It can guide clinical hemodialysis. Brain natriuretic peptide, chest computed tomography image, ultrasound, and bioelectrical impedance analysis are key indicators (multisource information) for assessing DW. By these approaches, a trial-and-error method (traditional measurement method) is employed to assess DW. The assessment of clinician is time-consuming. In this study, we developed a method based on artificial intelligence technology to estimate patient DW. Based on the conventional radial basis function neural (RBFN) network, we propose a multiple Laplacian-regularized RBFN (MLapRBFN) model to predict DW of patient. Compared with other model and body composition monitor, our method achieves the lowest value (1.3226) of root mean square error. In Bland-Altman analysis of MLapRBFN, the number of out agreement interval is least (17 samples). MLapRBFN integrates multiple Laplace regularization terms, and employs an efficient iterative algorithm to solve the model. The ratio of out agreement interval is 3.57%, which is lower than 5%. Therefore, our method can be tentatively applied for clinical evaluation of DW in hemodialysis patients.
Collapse
Affiliation(s)
- Xiaoyi Guo
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China.,Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Wei Zhou
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Yan Yu
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Yinghua Cai
- Department of Nursing, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Yuan Zhang
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Aiyan Du
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Qun Lu
- Department of Nursing, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi, China
| | - Yijie Ding
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Chao Li
- General Hospital of Heilongjiang Province Land Reclamation Bureau, Harbin, China
| |
Collapse
|
5
|
Guo Y, Cheng H, Yuan Z, Liang Z, Wang Y, Du D. Testing Gene-Gene Interactions Based on a Neighborhood Perspective in Genome-wide Association Studies. Front Genet 2021; 12:801261. [PMID: 34956337 PMCID: PMC8693929 DOI: 10.3389/fgene.2021.801261] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 11/15/2021] [Indexed: 12/21/2022] Open
Abstract
Unexplained genetic variation that causes complex diseases is often induced by gene-gene interactions (GGIs). Gene-based methods are one of the current statistical methodologies for discovering GGIs in case-control genome-wide association studies that are not only powerful statistically, but also interpretable biologically. However, most approaches include assumptions about the form of GGIs, which results in poor statistical performance. As a result, we propose gene-based testing based on the maximal neighborhood coefficient (MNC) called gene-based gene-gene interaction through a maximal neighborhood coefficient (GBMNC). MNC is a metric for capturing a wide range of relationships between two random vectors with arbitrary, but not necessarily equal, dimensions. We established a statistic that leverages the difference in MNC in case and in control samples as an indication of the existence of GGIs, based on the assumption that the joint distribution of two genes in cases and controls should not be substantially different if there is no interaction between them. We then used a permutation-based statistical test to evaluate this statistic and calculate a statistical p-value to represent the significance of the interaction. Experimental results using both simulation and real data showed that our approach outperformed earlier methods for detecting GGIs.
Collapse
Affiliation(s)
- Yingjie Guo
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China.,Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Honghong Cheng
- School of Information, Shanxi University of Finance and Economics, Taiyuan, China
| | - Zhian Yuan
- Research Institute of Big Data Science and Industry, Shanxi University, Taiyuan, China
| | - Zhen Liang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Yang Wang
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| | - Debing Du
- Beidahuang Industry Group General Hospital, Harbin, China
| |
Collapse
|
6
|
Guo Y, Wu C, Yuan Z, Wang Y, Liang Z, Wang Y, Zhang Y, Xu L. Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies. Front Cell Dev Biol 2021; 9:801113. [PMID: 34977040 PMCID: PMC8716787 DOI: 10.3389/fcell.2021.801113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Accepted: 11/23/2021] [Indexed: 11/30/2022] Open
Abstract
Among the myriad of statistical methods that identify gene–gene interactions in the realm of qualitative genome-wide association studies, gene-based interactions are not only powerful statistically, but also they are interpretable biologically. However, they have limited statistical detection by making assumptions on the association between traits and single nucleotide polymorphisms. Thus, a gene-based method (GGInt-XGBoost) originated from XGBoost is proposed in this article. Assuming that log odds ratio of disease traits satisfies the additive relationship if the pair of genes had no interactions, the difference in error between the XGBoost model with and without additive constraint could indicate gene–gene interaction; we then used a permutation-based statistical test to assess this difference and to provide a statistical p-value to represent the significance of the interaction. Experimental results on both simulation and real data showed that our approach had superior performance than previous experiments to detect gene–gene interactions.
Collapse
Affiliation(s)
- Yingjie Guo
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| | - Chenxi Wu
- Department of Mathematics, University of Wisconsin-Madison, Madison, WI, United States
| | - Zhian Yuan
- Research Institute of Big Data Science and Industry, Shanxi University, Taiyuan, China
| | - Yansu Wang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| | - Zhen Liang
- School of Life Science, Shanxi University, Taiyuan, China
| | - Yang Wang
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| | - Yi Zhang
- Beidahuang Industry Group General Hospital, Harbin, China
- *Correspondence: Yi Zhang, ; Lei Xu,
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
- *Correspondence: Yi Zhang, ; Lei Xu,
| |
Collapse
|
7
|
Lin X. Genomic Variation Prediction: A Summary From Different Views. Front Cell Dev Biol 2021; 9:795883. [PMID: 34901036 PMCID: PMC8656232 DOI: 10.3389/fcell.2021.795883] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 11/11/2021] [Indexed: 12/02/2022] Open
Abstract
Structural variations in the genome are closely related to human health and the occurrence and development of various diseases. To understand the mechanisms of diseases, find pathogenic targets, and carry out personalized precision medicine, it is critical to detect such variations. The rapid development of high-throughput sequencing technologies has accelerated the accumulation of large amounts of genomic mutation data, including synonymous mutations. Identifying pathogenic synonymous mutations that play important roles in the occurrence and development of diseases from all the available mutation data is of great importance. In this paper, machine learning theories and methods are reviewed, efficient and accurate pathogenic synonymous mutation prediction methods are developed, and a standardized three-level variant analysis framework is constructed. In addition, multiple variation tolerance prediction models are studied and integrated, and new ideas for structural variation detection based on deep information mining are explored.
Collapse
Affiliation(s)
- Xiuchun Lin
- College of Information and Electrical Engineering, China Agricultural University, Beijing, China
| |
Collapse
|
8
|
Ao C, Zou Q, Yu L. RFhy-m2G: Identification of RNA N2-methylguanosine modification sites based on random forest and hybrid features. Methods 2021; 203:32-39. [PMID: 34033879 DOI: 10.1016/j.ymeth.2021.05.016] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Revised: 05/04/2021] [Accepted: 05/20/2021] [Indexed: 12/31/2022] Open
Abstract
N2-methylguanosine is a post-transcriptional modification of RNA that is found in eukaryotes and archaea. The biological function of m2G modification discovered so far is to control and stabilize the three-dimensional structure of tRNA and the dynamic barrier of reverse transcription. To discover additional biological functions of m2G, it is necessary to develop time-saving and labor-saving calculation tools to identify m2G. In this paper, based on hybrid features and a random forest, a novel predictor, RFhy-m2G, was developed to identify the m2G modification sites for three species. The hybrid feature used by the predictor is used to fuse the three features of ENAC, PseDNC, and NPPS. These three features include primary sequence derivation properties, physicochemical properties, and position-specific properties. Since there are redundant features in hybrid features, MRMD2.0 is used for optimal feature selection. Through feature analysis, it is found that the optimal hybrid features obtained still contain three kinds of properties, and the hybrid features can more accurately identify m2G modification sites and improve prediction performance. Based on five-fold cross-validation and independent testing to evaluate the prediction model, the accuracies obtained were 0.9982 and 0.9417, respectively. The robustness of the predictor is demonstrated by comparisons with other predictors.
Collapse
Affiliation(s)
- Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China.
| |
Collapse
|
9
|
Zou Y, Wu H, Guo X, Peng L, Ding Y, Tang J, Guo F. MK-FSVM-SVDD: A Multiple Kernel-based Fuzzy SVM Model for Predicting DNA-binding Proteins via Support Vector Data Description. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200607173829] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Detecting DNA-binding proteins (DBPs) based on biological and chemical
methods is time-consuming and expensive.
Objective:
In recent years, the rise of computational biology methods based on Machine Learning
(ML) has greatly improved the detection efficiency of DBPs.
Method:
In this study, the Multiple Kernel-based Fuzzy SVM Model with Support Vector Data
Description (MK-FSVM-SVDD) is proposed to predict DBPs. Firstly, sex features are extracted
from the protein sequence. Secondly, multiple kernels are constructed via these sequence features.
Then, multiple kernels are integrated by Centered Kernel Alignment-based Multiple Kernel
Learning (CKA-MKL). Next, fuzzy membership scores of training samples are calculated with
Support Vector Data Description (SVDD). FSVM is trained and employed to detect new DBPs.
Results:
Our model is evaluated on several benchmark datasets. Compared with other methods, MKFSVM-
SVDD achieves best Matthew's Correlation Coefficient (MCC) on PDB186 (0.7250) and
PDB2272 (0.5476).
Conclusion:
We can conclude that MK-FSVM-SVDD is more suitable than common SVM, as the
classifier for DNA-binding proteins identification.
Collapse
Affiliation(s)
- Yi Zou
- School of Internet of Things Engineering, Jiangnan University, Wuxi, 214122, China
| | - Hongjie Wu
- School of Electronic and Information Engineering, Suzhou University of Science and Technology, No. 1 Kerui Road, 215009, Suzhou, China
| | - Xiaoyi Guo
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, 214000, Wuxi, China
| | - Li Peng
- School of Internet of Things Engineering, Jiangnan University, Wuxi, 214122, China
| | - Yijie Ding
- School of Electronic and Information Engineering, Suzhou University of Science and Technology, No. 1 Kerui Road, 215009, Suzhou, China
| | - Jijun Tang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, 300350, Tianjin, China
| | - Fei Guo
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, 300350, Tianjin, China
| |
Collapse
|
10
|
Guo X, Zhou W, Shi B, Wang X, Du A, Ding Y, Tang J, Guo F. An Efficient Multiple Kernel Support Vector Regression Model for Assessing Dry Weight of Hemodialysis Patients. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200614172536] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Dry Weight (DW) is the lowest weight after dialysis, and patients with
lower weight usually have symptoms of hypotension and shock. Several clinical-based approaches
have been presented to assess the dry weight of hemodialysis patients. However, these traditional
methods all depend on special instruments and professional technicians.
Objective:
In order to avoid this limitation, we need to find a machine-independent way to assess dry
weight, therefore we collected some clinical influencing characteristic data and constructed a
Machine Learning-based (ML) model to predict the dry weight of hemodialysis patients.
Methods::
In this paper, 476 hemodialysis patients' demographic data, anthropometric measurements,
and Bioimpedance spectroscopy (BIS) were collected. Among them, these patients' age, sex, Body
Mass Index (BMI), Blood Pressure (BP) and Heart Rate (HR) and Years of Dialysis (YD) were
closely related to their dry weight. All these relevant data were used to enter the regression equation.
Multiple Kernel Support Vector Regression-based on Maximizes the Average Similarity (MKSVRMAS)
model was proposed to predict the dry weight of hemodialysis patients.
Result:
The experimental results show that dry weight is positively correlated with BMI and HR.
And age, sex, systolic blood pressure, diastolic blood pressure and hemodialysis time are negatively
correlated with dry weight. Moreover, the Root Mean Square Error (RMSE) of our model was
1.3817.
Conclusion:
Our proposed model could serve as a viable alternative for dry weight estimation of
hemodialysis patients, thus providing a new way for clinical practice. Our proposed model could serve as a viable alternative of dry weight estimation for hemodialysis patients,
thus providing a new way for the clinic.
Collapse
Affiliation(s)
- Xiaoyi Guo
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, 214000, Wuxi, China
| | - Wei Zhou
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, 214000, Wuxi, China
| | - Bin Shi
- Hemodialysis Center, Northern Jiangsu People's Hospital, 225001, Yangzhou, China
| | - Xiaohua Wang
- Department of Urology, the First Affiliated Hospital of Soochow University, 215006, Suzhou, China
| | - Aiyan Du
- Hemodialysis Center, The Affiliated Wuxi People's Hospital of Nanjing Medical University, 214000, Wuxi, China
| | - Yijie Ding
- School of Electronic and Information Engineering, Suzhou University of Science and Technology, 215009, Suzhou, China
| | - Jijun Tang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, 300350, Tianjin, China
| | - Fei Guo
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, 300350, Tianjin, China
| |
Collapse
|
11
|
Lv Y, Huang S, Zhang T, Gao B. Application of Multilayer Network Models in Bioinformatics. Front Genet 2021; 12:664860. [PMID: 33868392 PMCID: PMC8044439 DOI: 10.3389/fgene.2021.664860] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 02/26/2021] [Indexed: 11/24/2022] Open
Abstract
Multilayer networks provide an efficient tool for studying complex systems, and with current, dramatic development of bioinformatics tools and accumulation of data, researchers have applied network concepts to all aspects of research problems in the field of biology. Addressing the combination of multilayer networks and bioinformatics, through summarizing the applications of multilayer network models in bioinformatics, this review classifies applications and presents a summary of the latest results. Among them, we classify the applications of multilayer networks according to the object of study. Furthermore, because of the systemic nature of biology, we classify the subjects into several hierarchical categories, such as cells, tissues, organs, and groups, according to the hierarchical nature of biological composition. On the basis of the complexity of biological systems, we selected brain research for a detailed explanation. We describe the application of multilayer networks and chronological networks in brain research to demonstrate the primary ideas associated with the application of multilayer networks in biological studies. Finally, we mention a quality assessment method focusing on multilayer and single-layer networks as an evaluation method emphasizing network studies.
Collapse
Affiliation(s)
- Yuanyuan Lv
- Hainan Key Laboratory for Computational Science and Application, Hainan Normal University, Haikou, China
- Yangtze Delta Region Institute, University of Electronic Science and Technology of China, Quzhou, China
| | - Shan Huang
- Department of Neurology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Tianjiao Zhang
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Bo Gao
- Department of Radiology, The Second Affiliated Hospital, Harbin Medical University, Harbin, China
| |
Collapse
|
12
|
Wang X, Yang Y, Liu J, Wang G. The stacking strategy-based hybrid framework for identifying non-coding RNAs. Brief Bioinform 2021; 22:6165004. [PMID: 33693454 DOI: 10.1093/bib/bbab023] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 01/16/2021] [Indexed: 12/12/2022] Open
Abstract
With the development of next-generation sequencing technology, a large number of transcripts need to be analyzed, and it has been a challenge to distinguish non-coding ribonucleic acid (RNAs) (ncRNAs) from coding RNAs. And for non-model organisms, due to the lack of transcriptional data, many existing methods cannot identify them. Therefore, in addition to using deoxyribonucleic acid-based and RNA-based features, we also proposed a hybrid framework based on the stacking strategy to identify ncRNAs, and we innovatively added eight features based on predicted peptides. The proposed framework was based on stacking two-layer classifier which combined random forest (RF), LightGBM, XGBoost and logistic regression (LR) models. We used this framework to build two types of models. For cross-species ncRNAs identification model, we tested it on six different species: human, mouse, zebrafish, fruit fly, worm and Arabidopsis. Compared with other tools, our model was the best in datasets of Arabidopsis, worm and zebrafish with the accuracy of 98.36%, 99.65% and 94.12%. For performance metrics analysis, the datasets of the six species were considered as a whole set, and the sensitivity, accuracy, precision and F1 values of our model were the best. For the plant-specific ncRNAs identification model, the average values of the six metrics of the two experiments were all greater than 95%, which demonstrated it can be used to identify ncRNAs in plants. The above indicates that the hybrid framework we designed is universal between animals and plants and has significant advantages in the identification of cross-species ncRNAs.
Collapse
Affiliation(s)
- Xin Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Yang Yang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Jian Liu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Guohua Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| |
Collapse
|
13
|
Zhang S, Zhang C, Du J, Zhang R, Yang S, Li B, Wang P, Deng W. Prediction of Lymph-Node Metastasis in Cancers Using Differentially Expressed mRNA and Non-coding RNA Signatures. Front Cell Dev Biol 2021; 9:605977. [PMID: 33644044 PMCID: PMC7905047 DOI: 10.3389/fcell.2021.605977] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 01/07/2021] [Indexed: 12/12/2022] Open
Abstract
Accurate prediction of lymph-node metastasis in cancers is pivotal for the next targeted clinical interventions that allow favorable prognosis for patients. Different molecular profiles (mRNA and non-coding RNAs) have been widely used to establish classifiers for cancer prediction (e.g., tumor origin, cancerous or non-cancerous state, cancer subtype). However, few studies focus on lymphatic metastasis evaluation using these profiles, and the performance of classifiers based on different profiles has also not been compared. Here, differentially expressed mRNAs, miRNAs, and lncRNAs between lymph-node metastatic and non-metastatic groups were identified as molecular signatures to construct classifiers for lymphatic metastasis prediction in different cancers. With this similar feature selection strategy, support vector machine (SVM) classifiers based on different profiles were systematically compared in their prediction performance. For representative cancers (a total of nine types), these classifiers achieved comparative overall accuracies of 81.00% (67.96-92.19%), 81.97% (70.83-95.24%), and 80.78% (69.61-90.00%) on independent mRNA, miRNA, and lncRNA datasets, with a small set of biomarkers (6, 12, and 4 on average). Therefore, our proposed feature selection strategies are economical and efficient to identify biomarkers that aid in developing competitive classifiers for predicting lymph-node metastasis in cancers. A user-friendly webserver was also deployed to help researchers in metastasis risk determination by submitting their expression profiles of different origins.
Collapse
Affiliation(s)
- Shihua Zhang
- College of Life Science and Health, Wuhan University of Science and Technology, Wuhan, China
| | - Cheng Zhang
- College of Life Science and Health, Wuhan University of Science and Technology, Wuhan, China
| | - Jinke Du
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Rui Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Shixiong Yang
- Central Laboratory, Xiaogan Hospital Affiliated to Wuhan University of Science and Technology, Xiaogan, China
| | - Bo Li
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China
| | - Pingping Wang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Wensheng Deng
- College of Life Science and Health, Wuhan University of Science and Technology, Wuhan, China
| |
Collapse
|
14
|
Assessing Dry Weight of Hemodialysis Patients via Sparse Laplacian Regularized RVFL Neural Network with L 2,1-Norm. BIOMED RESEARCH INTERNATIONAL 2021; 2021:6627650. [PMID: 33628794 PMCID: PMC7880720 DOI: 10.1155/2021/6627650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2021] [Revised: 01/21/2021] [Accepted: 01/25/2021] [Indexed: 11/28/2022]
Abstract
Dry weight is the normal weight of hemodialysis patients after hemodialysis. If the amount of water in diabetes is too much (during hemodialysis), the patient will experience hypotension and shock symptoms. Therefore, the correct assessment of the patient's dry weight is clinically important. These methods all rely on professional instruments and technicians, which are time-consuming and labor-intensive. To avoid this limitation, we hope to use machine learning methods on patients. This study collected demographic and anthropometric data of 476 hemodialysis patients, including age, gender, blood pressure (BP), body mass index (BMI), years of dialysis (YD), and heart rate (HR). We propose a Sparse Laplacian regularized Random Vector Functional Link (SLapRVFL) neural network model on the basis of predecessors. When we evaluate the prediction performance of the model, we fully compare SLapRVFL with the Body Composition Monitor (BCM) instrument and other models. The Root Mean Square Error (RMSE) of SLapRVFL is 1.3136, which is better than other methods. The SLapRVFL neural network model could be a viable alternative of dry weight assessment.
Collapse
|
15
|
Bai Z, Chen M, Lin Q, Ye Y, Fan H, Wen K, Zeng J, Huang D, Mo W, Lei Y, Liao Z. Identification of Methicillin-Resistant Staphylococcus Aureus From Methicillin-Sensitive Staphylococcus Aureus and Molecular Characterization in Quanzhou, China. Front Cell Dev Biol 2021; 9:629681. [PMID: 33553185 PMCID: PMC7858276 DOI: 10.3389/fcell.2021.629681] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 01/04/2021] [Indexed: 12/17/2022] Open
Abstract
To distinguish Methicillin-Resistant Staphylococcus aureus (MRSA) from Methicillin-Sensitive Staphylococcus aureus (MSSA) in the protein sequences level, test the susceptibility to antibiotic of all Staphylococcus aureus isolates from Quanzhou hospitals, define the virulence factor and molecular characteristics of the MRSA isolates. MRSA and MSSA Pfam protein sequences were used to extract feature vectors of 188D, n-gram and 400D. Weka software was applied to classify the two Staphylococcus aureus and performance effect was evaluated. Antibiotic susceptibility testing of the 81 Staphylococcus aureus was performed by the Mérieux Microbial Analysis Instrument. The 65 MRSA isolates were characterized by Panton-Valentine leukocidin (PVL), X polymorphic region of Protein A (spa), multilocus sequence typing test (MLST), staphylococcus chromosomal cassette mec (SCCmec) typing. After comparing the results of Weka six classifiers, the highest correctly classified rates were 91.94, 70.16, and 62.90% from 188D, n-gram and 400D, respectively. Antimicrobial susceptibility test of the 81 Staphylococcus aureus: Penicillin-resistant rate was 100%. No resistance to teicoplanin, linezolid, and vancomycin. The resistance rate of the MRSA isolates to clindamycin, erythromycin and tetracycline was higher than that of the MSSAs. Among the 65 MRSA isolates, the positive rate of PVL gene was 47.7% (31/65). Seventeen sequence types (STs) were identified among the 65 isolates, and ST59 was the most prevalent. SCCmec type III and IV were observed at 24.6 and 72.3%, respectively. Two isolates did not be typed. Twenty-one spa types were identified, spa t437 (34/65, 52.3%) was the most predominant type. MRSA major clone type of molecular typing was CC59-ST59-spa t437-IV (28/65, 43.1%). Overall, 188D feature vectors can be applied to successfully distinguish MRSA from MSSA. In Quanzhou, the detection rate of PVL virulence factor was high, suggesting a high pathogenic risk of MRSA infection. The cross-infection of CA-MRSA and HA-MRSA was presented, the molecular characteristics were increasingly blurred, HA-MRSA with typical CA-MRSA molecular characteristics has become an important cause of healthcare-related infections. CC59-ST59-spa t437-IV was the main clone type in Quanzhou, which was rare in other parts of mainland China.
Collapse
Affiliation(s)
- Zhimin Bai
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China.,Department of Clinical Laboratory, Jinjiang Municipal Hospital, Jinjiang, China
| | - Min Chen
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China.,Microbiological Laboratory Sanming Center for Disease Control and Prevention, Sanming, China
| | - Qiaofa Lin
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Ying Ye
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Hongmei Fan
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Kaizhen Wen
- Department of Clinical Laboratory, Jinjiang Municipal Hospital, Jinjiang, China
| | - Jianxing Zeng
- Department of Clinical Laboratory, Jinjiang Municipal Hospital, Jinjiang, China
| | - Donghong Huang
- Department of Clinical Laboratory, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Wenfei Mo
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Ying Lei
- Department of Clinical Laboratory, Quanzhou Women's and Children's Hospital, Quanzhou, China
| | - Zhijun Liao
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| |
Collapse
|
16
|
Hu Y, Sun JY, Zhang Y, Zhang H, Gao S, Wang T, Han Z, Wang L, Sun BL, Liu G. rs1990622 variant associates with Alzheimer's disease and regulates TMEM106B expression in human brain tissues. BMC Med 2021; 19:11. [PMID: 33461566 PMCID: PMC7814705 DOI: 10.1186/s12916-020-01883-5] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 12/08/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND It has been well established that the TMEM106B gene rs1990622 variant was a frontotemporal dementia (FTD) risk factor. Until recently, growing evidence highlights the role of TMEM106B in Alzheimer's disease (AD). However, it remains largely unclear about the role of rs1990622 variant in AD. METHODS Here, we conducted comprehensive analyses including genetic association study, gene expression analysis, eQTLs analysis, and colocalization analysis. In stage 1, we conducted a genetic association analysis of rs1990622 using large-scale genome-wide association study (GWAS) datasets from International Genomics of Alzheimer's Project (21,982 AD and 41,944 cognitively normal controls) and UK Biobank (314,278 participants). In stage 2, we performed a gene expression analysis of TMEM106B in 49 different human tissues using the gene expression data in GTEx. In stage 3, we performed an expression quantitative trait loci (eQTLs) analysis using multiple datasets from UKBEC, GTEx, and Mayo RNAseq Study. In stage 4, we performed a colocalization analysis to provide evidence of the AD GWAS and eQTLs pair influencing both AD and the TMEM106B expression at a particular region. RESULTS We found (1) rs1990622 variant T allele contributed to AD risk. A sex-specific analysis in UK Biobank further indicated that rs1990622 T allele only contributed to increased AD risk in females, but not in males; (2) TMEM106B showed different expression in different human brain tissues especially high expression in cerebellum; (3) rs1990622 variant could regulate the expression of TMEM106B in human brain tissues, which vary considerably in different disease statuses, the mean ages at death, the percents of females, and the different descents of the selected donors; (4) colocalization analysis provided suggestive evidence that the same variant contributed to AD risk and TMEM106B expression in cerebellum. CONCLUSION Our comprehensive analyses highlighted the role of FTD rs1990622 variant in AD risk. This cross-disease approach may delineate disease-specific and common features, which will be important for both diagnostic and therapeutic development purposes. Meanwhile, these findings highlight the importance to better understand TMEM106B function and dysfunction in the context of normal aging and neurodegenerative diseases.
Collapse
Affiliation(s)
- Yang Hu
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150080, China
| | - Jing-Yi Sun
- Shandong Provincial Hospital, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, 250021, China
| | - Yan Zhang
- Department of Pathology, The Affiliated Hospital of Weifang Medical University, Weifang, 261053, China
| | - Haihua Zhang
- Beijing Institute for Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, 100069, China
| | - Shan Gao
- Beijing Institute for Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, 100069, China
| | - Tao Wang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China.,Chinese Institute for Brain Research, Beijing, China
| | - Zhifa Han
- School of Medicine, School of Pharmaceutical Sciences, THU-PKU Center for Life Sciences, Tsinghua University, Beijing, China.,State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Beijing, China.,Department of Pathophysiology, Peking Union Medical College, Beijing, China
| | - Longcai Wang
- Department of Anesthesiology, The Affiliated Hospital of Weifang Medical University, Weifang, 261053, China
| | - Bao-Liang Sun
- Key Laboratory of Cerebral Microcirculation in Universities of Shandong; Department of Neurology, Second Affiliated Hospital; Shandong First Medical University & Shandong Academy of Medical Sciences, Taian, 271000, Shandong, China
| | - Guiyou Liu
- Beijing Institute for Brain Disorders, Laboratory of Brain Disorders, Ministry of Science and Technology, Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing, 100069, China. .,Chinese Institute for Brain Research, Beijing, China. .,Key Laboratory of Cerebral Microcirculation in Universities of Shandong; Department of Neurology, Second Affiliated Hospital; Shandong First Medical University & Shandong Academy of Medical Sciences, Taian, 271000, Shandong, China. .,National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China. .,Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China.
| |
Collapse
|
17
|
Tsai DJ, Fang WH, Wu LW, Tai MC, Kao CC, Huang SM, Chen WT, Hsiao PJ, Chiu CC, Su W, Wu CC, Su SL. The Polymorphism at PLCB4 Promoter (rs6086746) Changes the Binding Affinity of RUNX2 and Affects Osteoporosis Susceptibility: An Analysis of Bioinformatics-Based Case-Control Study and Functional Validation. Front Endocrinol (Lausanne) 2021; 12:730686. [PMID: 34899595 PMCID: PMC8657146 DOI: 10.3389/fendo.2021.730686] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 11/09/2021] [Indexed: 12/13/2022] Open
Abstract
PURPOSE Genome-wide association studies have identified numerous genetic variants that are associated with osteoporosis risk; however, most of them are present in the non-coding regions of the genome and the functional mechanisms are unknown. In this study, we aimed to investigate the potential variation in runt domain transcription factor 2 (RUNX2), which is an osteoblast-specific transcription factor that normally stimulates bone formation and osteoblast differentiation, regarding variants within RUNX2 binding sites and risk of osteoporosis in postmenopausal osteoporosis (PMOP). METHODS We performed bioinformatics-based prediction by combining whole genome sequencing and chromatin immunoprecipitation sequencing to screen functional SNPs in the RUNX2 binding site using data from the database of Taiwan Biobank; Case-control studies with 651 postmenopausal women comprising 107 osteoporosis patients, 290 osteopenia patients, and 254 controls at Tri-Service General Hospital between 2015 and 2019 were included. The subjects were examined for bone mass density and classified into normal and those with osteopenia or osteoporosis by T-scoring with dual-energy X-ray absorptiometry. Furthermore, mRNA expression and luciferase reporter assay were used to provide additional evidence regarding the associations identified in the association analyses. Chi-square tests and logistic regression were mainly used for statistical assessment. RESULTS Through candidate gene approaches, 3 SNPs in the RUNX2 binding site were selected. A novel SNP rs6086746 in the PLCB4 promoter was identified to be associated with osteoporosis in Chinese populations. Patients with AA allele had higher risk of osteoporosis than those with GG+AG (adjusted OR = 6.89; 95% confidence intervals: 2.23-21.31, p = 0.001). Moreover, the AA genotype exhibited lower bone mass density (p < 0.05). Regarding mRNA expression, there were large differences in the correlation between PLCB4 and different RUNX2 alleles (Cohen's q = 0.91). Functionally, the rs6086746 A allele reduces the RUNX2 binding affinity, thus enhancing the suppression of PLCB4 expression (p < 0.05). CONCLUSIONS Our results provide further evidence to support the important role of the SNP rs6086746 in the etiology of osteopenia/osteoporosis, thereby enhancing the current understanding of the susceptibility to osteoporosis. We further studied the mechanism underlying osteoporosis regulation by PLCB4.
Collapse
Affiliation(s)
- Dung-Jang Tsai
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
- School of Public Health, National Defense Medical Center, Taipei, Taiwan
| | - Wen-Hui Fang
- Department of Family and Community Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Li-Wei Wu
- Department of Family and Community Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Ming-Cheng Tai
- Department of Ophthalmology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chung-Cheng Kao
- Superintendent’s Office, Tri-Service General Hospital Songshan Branch, National Defense Medical Center, Taipei, Taiwan
| | - Shih-Ming Huang
- Department of Biochemistry, National Defense Medical Center, Taipei, Taiwan
| | - Wei-Teing Chen
- Division of Thoracic Medicine, Department of Medicine, Cheng Hsin General Hospital, Taipei, Taiwan
- Department of Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, ROC, Taiwan
| | - Po-Jen Hsiao
- Department of Internal Medicine, Taoyuan Armed Forces General Hospital, Taoyuan, Taiwan
- Division of Nephrology, Department of Internal Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chih-Chien Chiu
- Division of Infectious Diseases, Department of Internal Medicine, Taoyuan Armed Forces General Hospital, National Defense Medical Center, Taoyuan, Taiwan
| | - Wen Su
- Graduate Institute of Aerospace and Undersea Medicine, National Defense Medical Center, Taipei, Taiwan
| | - Chia-Chun Wu
- Department of Orthopedics, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Sui-Lung Su
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
- School of Public Health, National Defense Medical Center, Taipei, Taiwan
- *Correspondence: Sui-Lung Su,
| |
Collapse
|
18
|
Hu Y, Zhang H, Liu B, Gao S, Wang T, Han Z, Ji X, Liu G. rs34331204 regulates TSPAN13 expression and contributes to Alzheimer's disease with sex differences. Brain 2020; 143:e95. [PMID: 33175954 PMCID: PMC7719023 DOI: 10.1093/brain/awaa302] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Affiliation(s)
- Yang Hu
- School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Haihua Zhang
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing, 100069, China
| | - Bian Liu
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| | - Shan Gao
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing, 100069, China
| | - Tao Wang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Chinese Institute for Brain Research, Beijing, China
| | - Zhifa Han
- School of Medicine, School of Pharmaceutical Sciences, THU-PKU Center for Life Sciences, Tsinghua University, Beijing, China
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Beijing, China
- Department of Pathophysiology, Peking Union Medical College, Beijing, China
| | - Xunming Ji
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing, 100069, China
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| | - Guiyou Liu
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing, 100069, China
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| |
Collapse
|
19
|
Liu G, Zhao W, Zhang H, Wang T, Han Z, Ji X. rs1769793 variant reduces EGLN1 expression in skeletal muscle and hippocampus and contributes to high aerobic capacity in hypoxia. Proc Natl Acad Sci U S A 2020; 117:29283-29285. [PMID: 33109725 PMCID: PMC7703601 DOI: 10.1073/pnas.2010073117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Guiyou Liu
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing 100069, China;
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| | - Wenbo Zhao
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| | - Haihua Zhang
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing 100069, China
- National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
| | - Tao Wang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Department of Bioinformatics, Chinese Institute for Brain Research, Beijing 102206, China
| | - Zhifa Han
- School of Medicine, Tsinghua University-Peking University Center for Life Sciences, Tsinghua University, Beijing 100084, China
- School of Pharmaceutical Sciences, Tsinghua University-Peking University Center for Life Sciences, Tsinghua University, Beijing 100084, China
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Beijing 100005, China
- Department of Pathophysiology, Peking Union Medical College, Beijing 100021, China
| | - Xunming Ji
- Beijing Institute for Brain Disorders, Capital Medical University, Beijing 100069, China;
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- National Engineering Laboratory of Internet Medical Diagnosis and Treatment Technology, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
- Collaborative Innovation Center for Brain Disorders, Capital Medical University, Beijing 100069, China
| |
Collapse
|
20
|
Genetic variant rs7820258 regulates the expression of indoleamine 2,3-dioxygenase 1 in brain regions. Proc Natl Acad Sci U S A 2020; 117:24035-24036. [PMID: 32994208 PMCID: PMC7533890 DOI: 10.1073/pnas.2007022117] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
|
21
|
Zhuang H, Zhang Y, Yang S, Cheng L, Liu SL. A Mendelian Randomization Study on Infant Length and Type 2 Diabetes Mellitus Risk. Curr Gene Ther 2020; 19:224-231. [PMID: 31553296 DOI: 10.2174/1566523219666190925115535] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 06/15/2019] [Accepted: 06/16/2019] [Indexed: 12/12/2022]
Abstract
OBJECTIVE Infant length (IL) is a positively associated phenotype of type 2 diabetes mellitus (T2DM), but the causal relationship of which is still unclear. Here, we applied a Mendelian randomization (MR) study to explore the causal relationship between IL and T2DM, which has the potential to provide guidance for assessing T2DM activity and T2DM- prevention in young at-risk populations. MATERIALS AND METHODS To classify the study, a two-sample MR, using genetic instrumental variables (IVs) to explore the causal effect was applied to test the influence of IL on the risk of T2DM. In this study, MR was carried out on GWAS data using 8 independent IL SNPs as IVs. The pooled odds ratio (OR) of these SNPs was calculated by the inverse-variance weighted method for the assessment of the risk the shorter IL brings to T2DM. Sensitivity validation was conducted to identify the effect of individual SNPs. MR-Egger regression was used to detect pleiotropic bias of IVs. RESULTS The pooled odds ratio from the IVW method was 1.03 (95% CI 0.89-1.18, P = 0.0785), low intercept was -0.477, P = 0.252, and small fluctuation of ORs ranged from -0.062 ((0.966 - 1.03) / 1.03) to 0.05 ((1.081 - 1.03) / 1.03) in leave-one-out validation. CONCLUSION We validated that the shorter IL causes no additional risk to T2DM. The sensitivity analysis and the MR-Egger regression analysis also provided adequate evidence that the above result was not due to any heterogeneity or pleiotropic effect of IVs.
Collapse
Affiliation(s)
- He Zhuang
- Systemomics Center, College of Pharmacy, and Genomics Research Center (State-Province Key Laboratories of Biomedicine- Pharmaceutics of China), Harbin Medical University, Harbin, China.,HMU-UCFM Centre for Infection and Genomics, Harbin Medical University, Harbin, China
| | - Ying Zhang
- Department of Pharmacy, Heilongjiang Province Land Reclamation Headquarters General Hospital, 150001, Harbin, China
| | - Shuo Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shu-Lin Liu
- Systemomics Center, College of Pharmacy, and Genomics Research Center (State-Province Key Laboratories of Biomedicine- Pharmaceutics of China), Harbin Medical University, Harbin, China.,HMU-UCFM Centre for Infection and Genomics, Harbin Medical University, Harbin, China.,Department of Microbiology, Immunology and Infectious Diseases, University of Calgary, Calgary, Canada.,Department of Infectious Diseases, The First Affiliated Hospital, Harbin Medical University, Harbin, China.,Translational Medicine Research and Cooperation Center of Northern China, Heilongjiang Academy of Medical Sciences, Harbin, China
| |
Collapse
|
22
|
Dao FY, Lv H, Yang YH, Zulfiqar H, Gao H, Lin H. Computational identification of N6-methyladenosine sites in multiple tissues of mammals. Comput Struct Biotechnol J 2020; 18:1084-1091. [PMID: 32435427 PMCID: PMC7229270 DOI: 10.1016/j.csbj.2020.04.015] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2020] [Revised: 04/20/2020] [Accepted: 04/21/2020] [Indexed: 12/12/2022] Open
Abstract
N6-methyladenosine (m6A) is the methylation of the adenosine at the nitrogen-6 position, which is the most abundant RNA methylation modification and involves a series of important biological processes. Accurate identification of m6A sites in genome-wide is invaluable for better understanding their biological functions. In this work, an ensemble predictor named iRNA-m6A was established to identify m6A sites in multiple tissues of human, mouse and rat based on the data from high-throughput sequencing techniques. In the proposed predictor, RNA sequences were encoded by physical-chemical property matrix, mono-nucleotide binary encoding and nucleotide chemical property. Subsequently, these features were optimized by using minimum Redundancy Maximum Relevance (mRMR) feature selection method. Based on the optimal feature subset, the best m6A classification models were trained by Support Vector Machine (SVM) with 5-fold cross-validation test. Prediction results on independent dataset showed that our proposed method could produce the excellent generalization ability. We also established a user-friendly webserver called iRNA-m6A which can be freely accessible at http://lin-group.cn/server/iRNA-m6A. This tool will provide more convenience to users for studying m6A modification in different tissues.
Collapse
Affiliation(s)
| | | | - Yu-He Yang
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hasan Zulfiqar
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hui Gao
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hao Lin
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
23
|
Dong H, Zhou W, Wang P, Zuo E, Ying X, Chai S, Fei T, Jin L, Chen C, Ma G, Liu H. Comprehensive Analysis of the Genetic and Epigenetic Mechanisms of Osteoporosis and Bone Mineral Density. Front Cell Dev Biol 2020; 8:194. [PMID: 32269995 PMCID: PMC7109267 DOI: 10.3389/fcell.2020.00194] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 03/09/2020] [Indexed: 01/11/2023] Open
Abstract
Osteoporosis is a skeletal disorder characterized by a systemic impairment of bone mineral density (BMD). Genome-wide association studies (GWAS) have identified hundreds of susceptibility loci for osteoporosis and BMD. However, the vast majority of susceptibility loci are located in non-coding regions of the genome and provide limited information about the genetic mechanisms of osteoporosis. Herein we performed a comprehensive functional analysis to investigate the genetic and epigenetic mechanisms of osteoporosis and BMD. BMD and osteoporosis are found to share many common susceptibility loci, and the corresponding susceptibility genes are significantly enriched in bone-related biological pathways. The regulatory element enrichment analysis indicated that BMD and osteoporosis susceptibility loci are significantly enriched in 5′UTR and DNase I hypersensitive sites (DHSs) of peripheral blood immune cells. By integrating GWAS and expression Quantitative Trait Locus (eQTL) data, we found that 15 protein-coding genes are regulated by the osteoporosis and BMD susceptibility loci. Our analysis provides new clues for a better understanding of the pathogenic mechanisms and offers potential therapeutic targets for osteoporosis.
Collapse
Affiliation(s)
- Hui Dong
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China.,Department of Stomatology, The First Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Wenyang Zhou
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Pingping Wang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Enjun Zuo
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Xiaoxia Ying
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Songling Chai
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Tao Fei
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Laidi Jin
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Chen Chen
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Guowu Ma
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| | - Huiying Liu
- Department of Oral Prosthodontics, School of Stomatology, Dalian Medical University, Dalian, China
| |
Collapse
|
24
|
Wang C, Zhao N, Yuan L, Liu X. Computational Detection of Breast Cancer Invasiveness with DNA Methylation Biomarkers. Cells 2020; 9:E326. [PMID: 32019269 PMCID: PMC7072524 DOI: 10.3390/cells9020326] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 01/28/2020] [Accepted: 01/28/2020] [Indexed: 12/14/2022] Open
Abstract
Breast cancer is the most common female malignancy. It has high mortality, primarily due to metastasis and recurrence. Patients with invasive and noninvasive breast cancer require different treatments, so there is an urgent need for predictive tools to guide clinical decision making and avoid overtreatment of noninvasive breast cancer and undertreatment of invasive cases. Here, we divided the sample set based on the genome-wide methylation distance to make full use of metastatic cancer data. Specifically, we implemented two differential methylation analysis methods to identify specific CpG sites. After effective dimensionality reduction, we constructed a methylation-based classifier using the Random Forest algorithm to categorize the primary breast cancer. We took advantage of breast cancer (BRCA) HM450 DNA methylation data and accompanying clinical data from The Cancer Genome Atlas (TCGA) database to validate the performance of the classifier. Overall, this study demonstrates DNA methylation as a potential biomarker to predict breast tumor invasiveness and as a possible parameter that could be included in the studies aiming to predict breast cancer aggressiveness. However, more comparative studies are needed to assess its usability in the clinic. Towards this, we developed a website based on these algorithms to facilitate its use in studies and predictions of breast cancer invasiveness.
Collapse
Affiliation(s)
- Chunyu Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Ning Zhao
- School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China;
| | - Linlin Yuan
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China;
| | - Xiaoyan Liu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| |
Collapse
|
25
|
Taxonomy dimension reduction for colorectal cancer prediction. Comput Biol Chem 2019; 83:107160. [DOI: 10.1016/j.compbiolchem.2019.107160] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 11/02/2019] [Accepted: 11/04/2019] [Indexed: 02/01/2023]
|
26
|
Zhao T, Wang D, Hu Y, Zhang N, Zang T, Wang Y. Identifying Alzheimer’s Disease-related miRNA Based on Semi-clustering. Curr Gene Ther 2019; 19:216-223. [DOI: 10.2174/1566523219666190924113737] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 06/05/2019] [Accepted: 06/12/2019] [Indexed: 01/14/2023]
Abstract
Background:
More and more scholars are trying to use it as a specific biomarker for Alzheimer’s
Disease (AD) and mild cognitive impairment (MCI). Multiple studies have indicated that
miRNAs are associated with poor axonal growth and loss of synaptic structures, both of which are early
events in AD. The overall loss of miRNA may be associated with aging, increasing the incidence of
AD, and may also be involved in the disease through some specific molecular mechanisms.
Objective:
Identifying Alzheimer’s disease-related miRNA can help us find new drug targets, early
diagnosis.
Materials and Methods:
We used genes as a bridge to connect AD and miRNAs. Firstly, proteinprotein
interaction network is used to find more AD-related genes by known AD-related genes. Then,
each miRNA’s correlation with these genes is obtained by miRNA-gene interaction. Finally, each
miRNA could get a feature vector representing its correlation with AD. Unlike other studies, we do not
generate negative samples randomly with using classification method to identify AD-related miRNAs.
Here we use a semi-clustering method ‘one-class SVM’. AD-related miRNAs are considered as outliers
and our aim is to identify the miRNAs that are similar to known AD-related miRNAs (outliers).
Results and Conclusion:
We identified 257 novel AD-related miRNAs and compare our method with
SVM which is applied by generating negative samples. The AUC of our method is much higher than
SVM and we did case studies to prove that our results are reliable.
Collapse
Affiliation(s)
- Tianyi Zhao
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Donghua Wang
- Department of General Surgery, General Hospital of Heilongjiang Province Land Reclamation Bureau, Harbin, China
| | - Yang Hu
- School of life Science and Tenchnology, Harbin Institute of Technology, Harbin, China
| | - Ningyi Zhang
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Tianyi Zang
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Yadong Wang
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| |
Collapse
|
27
|
Reply to Liu et al.: Tissue specificity of SIM1 gene expression and erectile dysfunction. Proc Natl Acad Sci U S A 2019; 116:3349-3350. [PMID: 30755537 DOI: 10.1073/pnas.1900162116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|