Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang D, Li JR, Zhang YH, Chen L, Huang T, Cai YD. Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms. Genes (Basel) 2018. [PMID: 29534550 PMCID: PMC5867876 DOI: 10.3390/genes9030155] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

For:	Wang D, Li JR, Zhang YH, Chen L, Huang T, Cai YD. Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms. Genes (Basel) 2018. [PMID: 29534550 PMCID: PMC5867876 DOI: 10.3390/genes9030155] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Number

Cited by Other Article(s)

Zhang ZY, Sun ZJ, Gao D, Hao YD, Lin H, Liu F. Excavation of gene markers associated with pancreatic ductal adenocarcinoma based on interrelationships of gene expression. IET Syst Biol 2024. [PMID: 38530028 DOI: 10.1049/syb2.12090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/06/2024] [Accepted: 03/10/2024] [Indexed: 03/27/2024] Open

Wu Y, Xiao Q, Wang S, Xu H, Fang Y. Establishment and Analysis of an Artificial Neural Network Model for Early Detection of Polycystic Ovary Syndrome Using Machine Learning Techniques. J Inflamm Res 2023;16:5667-5676. [PMID: 38050562 PMCID: PMC10693771 DOI: 10.2147/jir.s438838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/10/2023] [Indexed: 12/06/2023] Open

Mohamed TIA, Ezugwu AE, Fonou-Dombeu JV, Ikotun AM, Mohammed M. A bio-inspired convolution neural network architecture for automatic breast cancer detection and classification using RNA-Seq gene expression data. Sci Rep 2023;13:14644. [PMID: 37670037 PMCID: PMC10480180 DOI: 10.1038/s41598-023-41731-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Accepted: 08/30/2023] [Indexed: 09/07/2023] Open

Abstract

Breast cancer is considered one of the significant health challenges and ranks among the most prevalent and dangerous cancer types affecting women globally. Early breast cancer detection and diagnosis are crucial for effective treatment and personalized therapy. Early detection and diagnosis can help patients and physicians discover new treatment options, provide a more suitable quality of life, and ensure increased survival rates. Breast cancer detection using gene expression involves many complexities, such as the issue of dimensionality and the complicatedness of the gene expression data. This paper proposes a bio-inspired CNN model for breast cancer detection using gene expression data downloaded from the cancer genome atlas (TCGA). The data contains 1208 clinical samples of 19,948 genes with 113 normal and 1095 cancerous samples. In the proposed model, Array-Array Intensity Correlation (AAIC) is used at the pre-processing stage for outlier removal, followed by a normalization process to avoid biases in the expression measures. Filtration is used for gene reduction using a threshold value of 0.25. Thereafter the pre-processed gene expression dataset was converted into images which were later converted to grayscale to meet the requirements of the model. The model also uses a hybrid model of CNN architecture with a metaheuristic algorithm, namely the Ebola Optimization Search Algorithm (EOSA), to enhance the detection of breast cancer. The traditional CNN and five hybrid algorithms were compared with the classification result of the proposed model. The competing hybrid algorithms include the Whale Optimization Algorithm (WOA-CNN), the Genetic Algorithm (GA-CNN), the Satin Bowerbird Optimization (SBO-CNN), the Life Choice-Based Optimization (LCBO-CNN), and the Multi-Verse Optimizer (MVO-CNN). The results show that the proposed model determined the classes with high-performance measurements with an accuracy of 98.3%, a precision of 99%, a recall of 99%, an f1-score of 99%, a kappa of 90.3%, a specificity of 92.8%, and a sensitivity of 98.9% for the cancerous class. The results suggest that the proposed method has the potential to be a reliable and precise approach to breast cancer detection, which is crucial for early diagnosis and personalized therapy.

Collapse

Zhou J, Jiang Z, Fu L, Qu F, Dai M, Xie N, Zhang S, Wang F. Contribution of labor related gene subtype classification on heterogeneity of polycystic ovary syndrome. PLoS One 2023;18:e0282292. [PMID: 36857354 PMCID: PMC9977056 DOI: 10.1371/journal.pone.0282292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 02/11/2023] [Indexed: 03/02/2023] Open

Abstract

OBJECTIVE

As one of the most common endocrine disorders in women of reproductive age, polycystic ovary syndrome (PCOS) is highly heterogeneous with varied clinical features and diverse gestational complications among individuals. The patients with PCOS have 2-fold higher risk of preterm labor which is associated with substantial infant morbidity and mortality and great socioeconomic cost. The study was designated to identify molecular subtypes and the related hub genes to facilitate the susceptibility assessment of preterm labor in women with PCOS.

METHODS

Four mRNA datasets (GSE84958, GSE5090, GSE43264 and GSE98421) were obtained from Gene Expression Omnibus database. Twenty-eight candidate genes related to preterm labor or labor were yielded from the researches and our unpublished data. Then, we utilized unsupervised clustering to identify molecular subtypes in PCOS based on the expression of above candidate genes. Key modules were generated with weighted gene co-expression network analysis R package, and their hub genes were generated with CytoHubba. The probable biological function and mechanism were explored through Gene Ontology analysis and Kyoto Encyclopedia of Genes and Genomes pathway analysis. In addition, STRING and Cytoscape software were used to identify the protein-protein interaction (PPI) network, and the molecular complex detection (MCODE) was used to identify the hub genes. Then the overlapping hub genes were predicted.

RESULTS

Two molecular subtypes were found in women with PCOS based on the expression similarity of preterm labor or labor-related genes, in which two modules were highlighted. The key modules and PPI network have five overlapping five hub genes, two of which, GTF2F2 and MYO6 gene, were further confirmed by the comparison between clustering subgroups according to the expression of hub genes.

CONCLUSIONS

Distinct PCOS molecular subtypes were identified with preterm labor or labor-related genes, which might uncover the potential mechanism underlying heterogeneity of clinical pregnancy complications in women with PCOS.

Collapse

Wang S, Liu W, Ye Z, Xia X, Guo M. Development of a joint diagnostic model of thyroid papillary carcinoma with artificial neural network and random forest. Front Genet 2022;13:957718. [PMCID: PMC9585230 DOI: 10.3389/fgene.2022.957718] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 09/21/2022] [Indexed: 11/13/2022] Open

Bahadory S, Sadraei J, Zibaei M, Pirestani M, Dalimi A. In vitro anti-gastrointestinal cancer activity of Toxocara canis-derived peptide: Analyzing the expression level of factors related to cell proliferation and tumor growth. Front Pharmacol 2022;13:878724. [PMID: 36204226 PMCID: PMC9530354 DOI: 10.3389/fphar.2022.878724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 08/01/2022] [Indexed: 11/23/2022] Open

Shao D, Dai Y, Li N, Cao X, Zhao W, Cheng L, Rong Z, Huang L, Wang Y, Zhao J. Artificial intelligence in clinical research of cancers. Brief Bioinform 2021;23:6470966. [PMID: 34929741 PMCID: PMC8769909 DOI: 10.1093/bib/bbab523] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 11/06/2021] [Accepted: 11/13/2021] [Indexed: 12/16/2022] Open

Habehh H, Gohel S. Machine Learning in Healthcare. Curr Genomics 2021;22:291-300. [PMID: 35273459 PMCID: PMC8822225 DOI: 10.2174/1389202922666210705124359] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 05/12/2021] [Accepted: 06/04/2021] [Indexed: 11/22/2022] Open

Liu Q, Cheng B, Jin Y, Hu P. Bayesian tensor factorization-drive breast cancer subtyping by integrating multi-omics data. J Biomed Inform 2021;125:103958. [PMID: 34839017 DOI: 10.1016/j.jbi.2021.103958] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Revised: 10/13/2021] [Accepted: 11/19/2021] [Indexed: 12/12/2022]

The Blood Gene Expression Signature for Kawasaki Disease in Children Identified with Advanced Feature Selection Methods. BIOMED RESEARCH INTERNATIONAL 2021;2020:6062436. [PMID: 32685506 PMCID: PMC7327570 DOI: 10.1155/2020/6062436] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 06/12/2020] [Indexed: 01/22/2023]

Li D, Lin H, Li L. Multiple Feature Selection Strategies Identified Novel Cardiac Gene Expression Signature for Heart Failure. Front Physiol 2020;11:604241. [PMID: 33304275 PMCID: PMC7693561 DOI: 10.3389/fphys.2020.604241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 10/15/2020] [Indexed: 02/02/2023] Open

Xia Q, Shu Z, Ye T, Zhang M. Identification and Analysis of the Blood lncRNA Signature for Liver Cirrhosis and Hepatocellular Carcinoma. Front Genet 2020;11:595699. [PMID: 33365048 PMCID: PMC7750531 DOI: 10.3389/fgene.2020.595699] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 10/13/2020] [Indexed: 12/12/2022] Open

Katz L, Woolman M, Tata A, Zarrine-Afsar A. Potential impact of tissue molecular heterogeneity on ambient mass spectrometry profiles: a note of caution in choosing the right disease model. Anal Bioanal Chem 2020;413:2655-2664. [PMID: 33247337 DOI: 10.1007/s00216-020-03054-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 11/02/2020] [Accepted: 11/10/2020] [Indexed: 02/07/2023]

Wu Z, Shou L, Wang J, Huang T, Xu X. The Methylation Pattern for Knee and Hip Osteoarthritis. Front Cell Dev Biol 2020;8:602024. [PMID: 33240895 PMCID: PMC7677303 DOI: 10.3389/fcell.2020.602024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 10/22/2020] [Indexed: 01/08/2023] Open

Zhu JH, Yan QL, Wang JW, Chen Y, Ye QH, Wang ZJ, Huang T. The Key Genes for Perineural Invasion in Pancreatic Ductal Adenocarcinoma Identified With Monte-Carlo Feature Selection Method. Front Genet 2020;11:554502. [PMID: 33193628 PMCID: PMC7593847 DOI: 10.3389/fgene.2020.554502] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 08/17/2020] [Indexed: 12/20/2022] Open

Li S, Jiang L, Tang J, Gao N, Guo F. Kernel Fusion Method for Detecting Cancer Subtypes via Selecting Relevant Expression Data. Front Genet 2020;11:979. [PMID: 33133130 PMCID: PMC7511763 DOI: 10.3389/fgene.2020.00979] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 08/03/2020] [Indexed: 12/19/2022] Open

Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network. BIOMED RESEARCH INTERNATIONAL 2020;2020:2613091. [PMID: 32884937 PMCID: PMC7455828 DOI: 10.1155/2020/2613091] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 07/27/2020] [Accepted: 08/03/2020] [Indexed: 12/14/2022]

Ren X, Wang S, Huang T. Decipher the connections between proteins and phenotypes. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2020;1868:140503. [PMID: 32707349 DOI: 10.1016/j.bbapap.2020.140503] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 06/30/2020] [Accepted: 07/16/2020] [Indexed: 10/23/2022]

Pan X, Zeng T, Zhang YH, Chen L, Feng K, Huang T, Cai YD. Investigation and Prediction of Human Interactome Based on Quantitative Features. Front Bioeng Biotechnol 2020;8:730. [PMID: 32766217 PMCID: PMC7379396 DOI: 10.3389/fbioe.2020.00730] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Accepted: 06/09/2020] [Indexed: 01/27/2023] Open

Fajarda O, Duarte-Pereira S, Silva RM, Oliveira JL. Merging microarray studies to identify a common gene expression signature to several structural heart diseases. BioData Min 2020;13:8. [PMID: 32670412 PMCID: PMC7346458 DOI: 10.1186/s13040-020-00217-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 06/05/2020] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

Heart disease is the leading cause of death worldwide. Knowing a gene expression signature in heart disease can lead to the development of more efficient diagnosis and treatments that may prevent premature deaths. A large amount of microarray data is available in public repositories and can be used to identify differentially expressed genes. However, most of the microarray datasets are composed of a reduced number of samples and to obtain more reliable results, several datasets have to be merged, which is a challenging task. The identification of differentially expressed genes is commonly done using statistical methods. Nonetheless, these methods are based on the definition of an arbitrary threshold to select the differentially expressed genes and there is no consensus on the values that should be used.

RESULTS

Nine publicly available microarray datasets from studies of different heart diseases were merged to form a dataset composed of 689 samples and 8354 features. Subsequently, the adjusted p-value and fold change were determined and by combining a set of adjusted p-values cutoffs with a list of different fold change thresholds, 12 sets of differentially expressed genes were obtained. To select the set of differentially expressed genes that has the best accuracy in classifying samples from patients with heart diseases and samples from patients with no heart condition, the random forest algorithm was used. A set of 62 differentially expressed genes having a classification accuracy of approximately 95% was identified.

CONCLUSIONS

We identified a gene expression signature common to different cardiac diseases and supported our findings by showing their involvement in the pathophysiology of the heart. The approach used in this study is suitable for the identification of gene expression signatures, and can be extended to different diseases.

Collapse

Li M, Chen F, Zhang Y, Xiong Y, Li Q, Huang H. Identification of Post-myocardial Infarction Blood Expression Signatures Using Multiple Feature Selection Strategies. Front Physiol 2020;11:483. [PMID: 32581823 PMCID: PMC7287215 DOI: 10.3389/fphys.2020.00483] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 04/20/2020] [Indexed: 12/24/2022] Open

Retained or altered expression of major histocompatibility complex class I in patient-derived xenograft models in breast cancer. Immunol Res 2020;67:469-477. [PMID: 31900802 DOI: 10.1007/s12026-019-09109-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Analysis of gene expression profiles of lung cancer subtypes with machine learning algorithms. Biochim Biophys Acta Mol Basis Dis 2020;1866:165822. [PMID: 32360590 DOI: 10.1016/j.bbadis.2020.165822] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 04/13/2020] [Accepted: 04/22/2020] [Indexed: 12/14/2022]

Xue W, Ton H, Zhang J, Xie T, Chen X, Zhou B, Guo Y, Fang J, Wang S, Zhang W. Patient‑derived orthotopic xenograft glioma models fail to replicate the magnetic resonance imaging features of the original patient tumor. Oncol Rep 2020;43:1619-1629. [PMID: 32323818 PMCID: PMC7107810 DOI: 10.3892/or.2020.7538] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 02/12/2020] [Indexed: 12/14/2022] Open

Abstract

Patient-derived orthotopic glioma xenograft models are important platforms used for pre-clinical research of glioma. In the present study, the diagnostic ability of magnetic resonance imaging (MRI) was examined with regard to the identification of biomarkers obtained from patient-derived glioma xenografts and human tumors. Conventional MRI, diffusion weighted imaging and dynamic contrast-enhanced (DCE)-MRI were used to analyze seven pairs of high grade gliomas with their corresponding xenografts obtained from non-obese diabetic-severe-combined immunodeficiency nude mice. Tumor samples were collected for transcriptome sequencing and histopathological staining, and differentially expressed genes were screened between the original tumors and the corresponding xenografts. Gene Ontology (GO) analysis was performed to predict the functions of these genes. In 6 cases of xenografts with diffuse growth, the degree of enhancement was significantly lower compared with the original tumors. Histopathological staining indicated that the microvascular area and microvascular diameter of the xenografts were significantly lower compared with the original tumors (P=0.009 and P=0.007, respectively). In one case, there was evidence of nodular tumor growth in the mouse. Both MRI and histopathological staining showed a clear demarcation between the transplanted tumors and the normal brain tissues. The relative apparent diffusion coefficient values of the 7 cases examined were significantly higher compared with the corresponding original tumors (P=0.001) and transfer coefficient values derived from DCE-MRI of the tumor area was significantly lower compared with the original tumors (P=0.016). GO analysis indicated that the expression levels of extracellular matrix-associated genes, angiogenesis-associated genes and immune function-associated genes in the original tumors were higher compared with the corresponding xenografts. In conclusion, the data demonstrated that the MRI features of patient-derived xenograft glioma models in mice were different compared with those of the original patient tumors. Differential gene expression may underlie the differences noted in the MRI features between original tumors and corresponding xenografts. The results of the present study highlight the precautions that should be taken when extrapolating data from patient-derived xenograft studies, and their applicability to humans.

Collapse

Chen L, Pan X, Guo W, Gan Z, Zhang YH, Niu Z, Huang T, Cai YD. Investigating the gene expression profiles of cells in seven embryonic stages with machine learning algorithms. Genomics 2020;112:2524-2534. [PMID: 32045671 DOI: 10.1016/j.ygeno.2020.02.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 12/26/2019] [Accepted: 02/07/2020] [Indexed: 12/15/2022]

Chen L, Pan X, Zeng T, Zhang YH, Zhang Y, Huang T, Cai YD. Immunosignature Screening for Multiple Cancer Subtypes Based on Expression Rule. Front Bioeng Biotechnol 2019;7:370. [PMID: 31850330 PMCID: PMC6901955 DOI: 10.3389/fbioe.2019.00370] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 11/13/2019] [Indexed: 12/13/2022] Open

Li J, Wang D, Wang Y. IBI: Identification of Biomarker Genes in Individual Tumor Samples. Front Genet 2019;10:1236. [PMID: 31850079 PMCID: PMC6902017 DOI: 10.3389/fgene.2019.01236] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 11/07/2019] [Indexed: 12/12/2022] Open

Identifying Methylation Pattern and Genes Associated with Breast Cancer Subtypes. Int J Mol Sci 2019;20:ijms20174269. [PMID: 31480430 PMCID: PMC6747348 DOI: 10.3390/ijms20174269] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 08/19/2019] [Accepted: 08/29/2019] [Indexed: 12/18/2022] Open

Gene selection for microarray data classification via adaptive hypergraph embedded dictionary learning. Gene 2019;706:188-200. [DOI: 10.1016/j.gene.2019.04.060] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2018] [Revised: 04/03/2019] [Accepted: 04/22/2019] [Indexed: 01/19/2023]

Li J, Lu L, Zhang YH, Xu Y, Liu M, Feng K, Chen L, Kong X, Huang T, Cai YD. Identification of leukemia stem cell expression signatures through Monte Carlo feature selection strategy and support vector machine. Cancer Gene Ther 2019;27:56-69. [PMID: 31138902 DOI: 10.1038/s41417-019-0105-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 04/28/2019] [Accepted: 05/04/2019] [Indexed: 01/09/2023]

Analysis of Expression Pattern of snoRNAs in Different Cancer Types with Machine Learning Algorithms. Int J Mol Sci 2019;20:ijms20092185. [PMID: 31052553 PMCID: PMC6539089 DOI: 10.3390/ijms20092185] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Revised: 04/29/2019] [Accepted: 04/30/2019] [Indexed: 01/17/2023] Open

Chen L, Pan X, Zhang YH, Kong X, Huang T, Cai YD. Tissue differences revealed by gene expression profiles of various cell lines. J Cell Biochem 2019;120:7068-7081. [PMID: 30368905 DOI: 10.1002/jcb.27977] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 10/04/2018] [Indexed: 01/24/2023]

Chen L, Pan X, Zhang YH, Huang T, Cai YD. Analysis of Gene Expression Differences between Different Pancreatic Cells. ACS OMEGA 2019;4:6421-6435. [DOI: 10.1021/acsomega.8b02171] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]

Chen X, Jin Y, Feng Y. Evaluation of Plasma Extracellular Vesicle MicroRNA Signatures for Lung Adenocarcinoma and Granuloma With Monte-Carlo Feature Selection Method. Front Genet 2019;10:367. [PMID: 31105742 PMCID: PMC6498093 DOI: 10.3389/fgene.2019.00367] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 04/05/2019] [Indexed: 12/24/2022] Open

The next generation personalized models to screen hidden layers of breast cancer tumorigenicity. Breast Cancer Res Treat 2019;175:277-286. [PMID: 30810866 DOI: 10.1007/s10549-019-05159-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 02/05/2019] [Indexed: 10/27/2022]

Abstract

BACKGROUND

Breast cancer (BC) is a challenging disease and major cause of death amongst women worldwide who die due to tumor relapse or sidelong diseases. BC main complexity comes from the heterogeneous nature of breast tumors that demands customized treatments in the form of personalized medicine.

REVIEW OF THE LITERATURE AND DISCUSSION

Spatiotemporally dynamic and heterogeneous nature of BC tumors is shaped by their clonal evolution and sub-clonal selections and shapes resistance to collective or group therapies that drives cancer recurrence and tumor metastasis. Personalized intervention promises to administer medications that selectively target each individual patient tumor and even further each colonized secondary tumor. Such personalized regimens will require creation of in vitro and in vivo models genuinely recapitulating characteristics of each tumor type as initiating platforms for two main purposes: to closely monitor the tumorigenic processes that shape tumor heterogeneity and evolution as the main driving forces behind tumor chemo-resistance and relapse, and subsequently to establish patient-specific preventive and therapeutic measures. While application of tumor modeling for personalized drug screening and design requires a separate review, here we discuss the personalized utilities of xenograft modeling in investigating BC tumor formation and progression toward metastasis. We will further elaborate on the impact of innovative technologies on personalized modeling of BC tumorigenicity at improved resolution.

CONCLUSION

Heterogeneous nature of each BC tumor requires personalized intervention implying that modeling breast tumors is inevitable for better disease understanding, detection and cure. Patient-derived xenografts are just the initiating piece of the puzzle for ideal management of breast cancer. Emerging technologies promise to model BC more personalized than before.

Collapse

Mirza B, Wang W, Wang J, Choi H, Chung NC, Ping P. Machine Learning and Integrative Analysis of Biomedical Big Data. Genes (Basel) 2019;10:E87. [PMID: 30696086 PMCID: PMC6410075 DOI: 10.3390/genes10020087] [Citation(s) in RCA: 157] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Revised: 01/08/2019] [Accepted: 01/21/2019] [Indexed: 12/11/2022] Open

Affiliation(s)

Bilal Mirza NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Physiology, University of California Los Angeles, Los Angeles, CA 90095, USA.
Wei Wang NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA. Scalable Analytics Institute (ScAi), University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Bioinformatics, University of California Los Angeles, Los Angeles, CA 90095, USA.
Jie Wang NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Physiology, University of California Los Angeles, Los Angeles, CA 90095, USA.
Howard Choi NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Physiology, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Bioinformatics, University of California Los Angeles, Los Angeles, CA 90095, USA.
Neo Christopher Chung NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Institute of Informatics, Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland.
Peipei Ping NIH BD2K Center of Excellence for Biomedical Computing, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Physiology, University of California Los Angeles, Los Angeles, CA 90095, USA. Scalable Analytics Institute (ScAi), University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Bioinformatics, University of California Los Angeles, Los Angeles, CA 90095, USA. Department of Medicine (Cardiology), University of California Los Angeles, Los Angeles, CA 90095, USA.

Collapse

Rahman MF, Rahman MR, Islam T, Zaman T, Shuvo MAH, Hossain MT, Islam MR, Karim MR, Moni MA. A bioinformatics approach to decode core genes and molecular pathways shared by breast cancer and endometrial cancer. INFORMATICS IN MEDICINE UNLOCKED 2019. [DOI: 10.1016/j.imu.2019.100274] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Chen L, Pan X, Zhang YH, Liu M, Huang T, Cai YD. Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network. Comput Struct Biotechnol J 2018;17:49-60. [PMID: 30595815 PMCID: PMC6307323 DOI: 10.1016/j.csbj.2018.12.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 12/07/2018] [Accepted: 12/09/2018] [Indexed: 02/06/2023] Open

Abstract

A tissue-specific gene expression shapes the formation of tissues, while gene expression changes reflect the immune response of the human body to environmental stimulations or pressure, particularly in disease conditions, such as cancers. A few genes are commonly expressed across tissues or various cancers, while others are not. To investigate the functional differences between widely and rarely expressed genes, we defined the genes that were expressed in 32 normal tissues/cancers (i.e., called widely expressed genes; FPKM >1 in all samples) and those that were not detected (i.e., called rarely expressed genes; FPKM <1 in all samples) based on the large gene expression data set provided by Uhlen et al. Each gene was encoded using the gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment scores. Minimum redundancy maximum relevance (mRMR) was used to measure and rank these features on the mRMR feature list. Thereafter, we applied the incremental feature selection method with a supervised classifier recurrent neural network (RNN) to select the discriminate features for classifying widely expressed genes from rarely expressed genes and construct an optimum RNN classifier. The Youden's indexes generated by the optimum RNN classifier and evaluated using a 10-fold cross validation were 0.739 for normal tissues and 0.639 for cancers. Furthermore, the underlying mechanisms of the key discriminate GO and KEGG features were analyzed. Results can facilitate the identification of the expression landscape of genes and elucidation of how gene expression shapes tissues and the microenvironment of cancers.

•

Some genes are widely expressed across tissues or various cancers.

•

A number of genes are rarely expressed across tissues or various cancers.

•

The functional differences between widely and rarely expressed genes were studied.

•

Several GO terms and KEGG pathways were extracted and analyzed.

Collapse

Chen L, Zhang S, Pan X, Hu X, Zhang YH, Yuan F, Huang T, Cai YD. HIV infection alters the human epigenetic landscape. Gene Ther 2018;26:29-39. [PMID: 30443044 DOI: 10.1038/s41434-018-0051-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 10/30/2018] [Accepted: 10/31/2018] [Indexed: 02/07/2023]

The early detection of asthma based on blood gene expression. Mol Biol Rep 2018;46:217-223. [PMID: 30421126 DOI: 10.1007/s11033-018-4463-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 11/01/2018] [Indexed: 01/10/2023]

Chen L, Zhang YH, Pan X, Liu M, Wang S, Huang T, Cai YD. Tissue Expression Difference between mRNAs and lncRNAs. Int J Mol Sci 2018;19:ijms19113416. [PMID: 30384456 PMCID: PMC6274976 DOI: 10.3390/ijms19113416] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 10/26/2018] [Accepted: 10/28/2018] [Indexed: 12/15/2022] Open

Abstract

Messenger RNA (mRNA) and long noncoding RNA (lncRNA) are two main subgroups of RNAs participating in transcription regulation. With the development of next generation sequencing, increasing lncRNAs are identified. Many hidden functions of lncRNAs are also revealed. However, the differences in lncRNAs and mRNAs are still unclear. For example, we need to determine whether lncRNAs have stronger tissue specificity than mRNAs and which tissues have more lncRNAs expressed. To investigate such tissue expression difference between mRNAs and lncRNAs, we encoded 9339 lncRNAs and 14,294 mRNAs with 71 expression features, including 69 maximum expression features for 69 types of cells, one feature for the maximum expression in all cells, and one expression specificity feature that was measured as Chao-Shen-corrected Shannon's entropy. With advanced feature selection methods, such as maximum relevance minimum redundancy, incremental feature selection methods, and random forest algorithm, 13 features presented the dissimilarity of lncRNAs and mRNAs. The 11 cell subtype features indicated which cell types of the lncRNAs and mRNAs had the largest expression difference. Such cell subtypes may be the potential cell models for lncRNA identification and function investigation. The expression specificity feature suggested that the cell types to express mRNAs and lncRNAs were different. The maximum expression feature suggested that the maximum expression levels of mRNAs and lncRNAs were different. In addition, the rule learning algorithm, repeated incremental pruning to produce error reduction algorithm, was also employed to produce effective classification rules for classifying lncRNAs and mRNAs, which gave competitive results compared with random forest and could give a clearer picture of different expression patterns between lncRNAs and mRNAs. Results not only revealed the heterogeneous expression pattern of lncRNA and mRNA, but also gave rise to the development of a new tool to identify the potential biological functions of such RNA subgroups.

Collapse

Lu S, Zhao K, Wang X, Liu H, Ainiwaer X, Xu Y, Ye M. Use of Laplacian Heat Diffusion Algorithm to Infer Novel Genes With Functions Related to Uveitis. Front Genet 2018;9:425. [PMID: 30349554 PMCID: PMC6186792 DOI: 10.3389/fgene.2018.00425] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 09/10/2018] [Indexed: 12/17/2022] Open

A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes. Genes (Basel) 2018;9:genes9090449. [PMID: 30205473 PMCID: PMC6162521 DOI: 10.3390/genes9090449] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 09/01/2018] [Accepted: 09/04/2018] [Indexed: 02/06/2023] Open

Li J, Lan CN, Kong Y, Feng SS, Huang T. Identification and Analysis of Blood Gene Expression Signature for Osteoarthritis With Advanced Feature Selection Methods. Front Genet 2018;9:246. [PMID: 30214455 PMCID: PMC6125376 DOI: 10.3389/fgene.2018.00246] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/22/2018] [Indexed: 12/15/2022] Open