1
|
Joo MS, Pyo KH, Chung JM, Cho BC. Artificial intelligence-based non-small cell lung cancer transcriptome RNA-sequence analysis technology selection guide. Front Bioeng Biotechnol 2023; 11:1081950. [PMID: 36873350 PMCID: PMC9975749 DOI: 10.3389/fbioe.2023.1081950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 01/24/2023] [Indexed: 02/17/2023] Open
Abstract
The incidence and mortality rates of lung cancer are high worldwide, where non-small cell lung cancer (NSCLC) accounts for more than 85% of lung cancer cases. Recent non-small cell lung cancer research has been focused on analyzing patient prognosis after surgery and identifying mechanisms in connection with clinical cohort and ribonucleic acid (RNA) sequencing data, including single-cell ribonucleic acid (scRNA) sequencing data. This paper investigates statistical techniques and artificial intelligence (AI) based non-small cell lung cancer transcriptome data analysis methods divided into target and analysis technology groups. The methodologies of transcriptome data were schematically categorized so researchers can easily match analysis methods according to their goals. The most widely known and frequently utilized transcriptome analysis goal is to find essential biomarkers and classify carcinomas and cluster NSCLC subtypes. Transcriptome analysis methods are divided into three major categories: Statistical analysis, machine learning, and deep learning. Specific models and ensemble techniques typically used in NSCLC analysis are summarized in this paper, with the intent to lay a foundation for advanced research by converging and linking the various analysis methods available.
Collapse
Affiliation(s)
- Min Soo Joo
- School of Electrical and Electronic Engineering, College of Engineering, Yonsei University, Seoul, Republic of Korea
| | - Kyoung-Ho Pyo
- Department of Oncology, Severance Hospital, College of Medicine, Yonsei University, Seoul, Republic of Korea.,Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea.,Yonsei New Il Han Institute for Integrative Lung Cancer Research, Yonsei University College of Medicine, Seoul, Republic of Korea.,Division of Medical Oncology, Department of Internal Medicine and Yonsei Cancer Center, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jong-Moon Chung
- School of Electrical and Electronic Engineering, College of Engineering, Yonsei University, Seoul, Republic of Korea.,Department of Emergency Medicine, College of Medicine, Yonsei University, Seoul, Republic of Korea
| | - Byoung Chul Cho
- Division of Medical Oncology, Department of Internal Medicine and Yonsei Cancer Center, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
2
|
Zhu H, Xu X, Zheng E, Ni J, Jiang X, Yang M, Zhao G. LncRNA RP11‑805J14.5 functions as a ceRNA to regulate CCND2 by sponging miR‑34b‑3p and miR‑139‑5p in lung adenocarcinoma. Oncol Rep 2022; 48:161. [PMID: 35866595 PMCID: PMC9350987 DOI: 10.3892/or.2022.8376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 07/08/2021] [Indexed: 11/05/2022] Open
Abstract
Lung adenocarcinoma (LUAD) is the most common lung cancer with high incidence. The prognosis of LUAD is poor due to its aggressive behavior. Long non‑coding RNAs (lncRNAs) have been reported as a key modulator on LUAD progression. Therefore, the present study aimed to clarify the molecular mechanism of lncRNAs in LUAD development. The expression of lncRNA RP11‑805J14.5 (RP11‑805J14.5) in LUAD tissues and cells was quantified based on the data in The Cancer Genome Atlas (TCGA). Cell viability was determined using Cell Counting Kit‑8 method. Apoptotic cells were sorted and determined by flow cytometry. Cell migration and invasion abilities were detected by the Transwell assay. Luciferase reporter experiment and RNA pull‑down assay were utilized to determine the interactions between RP11‑805J14.5, microRNA (miR)‑34b‑3p, miR‑139‑5p, and cyclin D2 (CCND2). A xenograft tumor was established to determine tumor growth in vivo. RP11‑805J14.5 was highly expressed in LUAD and associated with poor survival of LUAD patients. Knockdown of RP11‑805J14.5 suppressed LUAD cell growth, invasion, migration and tumor growth, indicating that RP11‑805J14.5 is an important regulator of LUAD. Our study demonstrated that the regulation of RP11‑805J14.5 on LUAD was mediated by CCND2 whose expression was regulated by sponging miR‑34b‑3p and miR‑139‑5p. The expression of RP11‑805J14.5 was increased in LUAD, and the knockdown of RP11‑805J14.5 expression suppressed LUAD cell growth, invasion and migration by downregulating CCND2 by sponging miR‑34b‑3p and miR‑139‑5p, indicating that RP11‑805J14.5 could be a prospective target for LUAD therapy.
Collapse
Affiliation(s)
- Huangkai Zhu
- Medical School of Ningbo University, Ningbo, Zhejiang 315211, P.R. China
| | - Xiang Xu
- Department of Thoracic Surgery, Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo, Zhejiang 315010, P.R. China
| | - Enkuo Zheng
- Department of Thoracic Surgery, Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo, Zhejiang 315010, P.R. China
| | - Junjun Ni
- Department of Thoracic Surgery, Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo, Zhejiang 315010, P.R. China
| | - Xu Jiang
- Department of Thoracic Surgery, Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo, Zhejiang 315010, P.R. China
| | - Minglei Yang
- Medical School of Ningbo University, Ningbo, Zhejiang 315211, P.R. China
| | - Guofang Zhao
- Medical School of Ningbo University, Ningbo, Zhejiang 315211, P.R. China
| |
Collapse
|
3
|
Genetic variability assessment of 127 Triticum turgidum L. accessions for mycorrhizal susceptibility-related traits detection. Sci Rep 2021; 11:13426. [PMID: 34183734 PMCID: PMC8239029 DOI: 10.1038/s41598-021-92837-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 06/02/2021] [Indexed: 02/06/2023] Open
Abstract
Positive effects of arbuscular mycorrhizal fungi (AMF)-wheat plant symbiosis have been well discussed by research, while the actual role of the single wheat genotype in establishing this type of association is still poorly investigated. In this work, the genetic diversity of Triticum turgidum wheats was exploited to detect roots susceptibility to AMF and to identify genetic markers in linkage with chromosome regions involved in this symbiosis. A tetraploid wheat collection of 127 accessions was genotyped using 35K single-nucleotide polymorphism (SNP) array and inoculated with the AMF species Funneliformis mosseae (F. mosseae) and Rhizoglomus irregulare (R. irregulare), and a genome-wide association study (GWAS) was conducted. Six clusters of genetically related accessions were identified, showing a different mycorrhizal colonization among them. GWAS revealed four significant quantitative trait nucleotides (QTNs) involved in mycorrhizal symbiosis, located on chromosomes 1A, 2A, 2B and 6A. The results of this work enrich future breeding activities aimed at developing new grains on the basis of genetic diversity on low or high susceptibility to mycorrhization, and, possibly, maximizing the symbiotic effects.
Collapse
|
4
|
Pasetto A, Lu YC. Single-Cell TCR and Transcriptome Analysis: An Indispensable Tool for Studying T-Cell Biology and Cancer Immunotherapy. Front Immunol 2021; 12:689091. [PMID: 34163487 PMCID: PMC8215674 DOI: 10.3389/fimmu.2021.689091] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 05/10/2021] [Indexed: 12/18/2022] Open
Abstract
T cells have been known to be the driving force for immune response and cancer immunotherapy. Recent advances on single-cell sequencing techniques have empowered scientists to discover new biology at the single-cell level. Here, we review the single-cell techniques used for T-cell studies, including T-cell receptor (TCR) and transcriptome analysis. In addition, we summarize the approaches used for the identification of T-cell neoantigens, an important aspect for T-cell mediated cancer immunotherapy. More importantly, we discuss the applications of single-cell techniques for T-cell studies, including T-cell development and differentiation, as well as the role of T cells in autoimmunity, infectious disease and cancer immunotherapy. Taken together, this powerful tool not only can validate previous observation by conventional approaches, but also can pave the way for new discovery, such as previous unidentified T-cell subpopulations that potentially responsible for clinical outcomes in patients with autoimmunity or cancer.
Collapse
Affiliation(s)
- Anna Pasetto
- Department of Laboratory Medicine, Division of Clinical Microbiology, ANA FUTURA, Karolinska Institutet, Stockholm, Sweden
| | - Yong-Chen Lu
- Department of Pathology, University of Arkansas for Medical Sciences, Little Rock, AR, United States.,Winthrop P. Rockefeller Cancer Institute, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| |
Collapse
|
5
|
Li D, Lin H, Li L. Multiple Feature Selection Strategies Identified Novel Cardiac Gene Expression Signature for Heart Failure. Front Physiol 2020; 11:604241. [PMID: 33304275 PMCID: PMC7693561 DOI: 10.3389/fphys.2020.604241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 10/15/2020] [Indexed: 02/02/2023] Open
Abstract
Heart failure (HF) is a serious condition in which the support of blood pumped by the heart is insufficient to meet the demands of body at a normal cardiac filling pressure. Approximately 26 million patients worldwide are suffering from heart failure and about 17–45% of patients with heart failure die within 1-year, and the majority die within 5-years admitted to a hospital. The molecular mechanisms underlying the progression of heart failure have been poorly studied. We compared the gene expression profiles between patients with heart failure (n = 177) and without heart failure (n = 136) using multiple feature selection strategies and identified 38 HF signature genes. The support vector machine (SVM) classifier based on these 38 genes evaluated with leave-one-out cross validation (LOOCV) achieved great performance with sensitivity of 0.983 and specificity of 0.963. The network analysis suggested that the hub gene SMOC2 may play important roles in HF. Other genes, such as FCN3, HMGN2, and SERPINA3, also showed great promises. Our results can facilitate the early detection of heart failure and can reveal its molecular mechanisms.
Collapse
Affiliation(s)
- Dan Li
- Department of Cardiovascular Medicine, First Hospital Affiliated to Harbin Medical University, Harbin, China
| | - Hong Lin
- Internal Medicine-Cardiovascular Department, Harbin Chest Hospital, Harbin, China
| | - Luyifei Li
- Department of Cardiovascular Medicine, First Hospital Affiliated to Harbin Medical University, Harbin, China
| |
Collapse
|
6
|
Das S, Rai SN. Statistical Approach for Biologically Relevant Gene Selection from High-Throughput Gene Expression Data. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1205. [PMID: 33286973 PMCID: PMC7712650 DOI: 10.3390/e22111205] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 10/19/2020] [Accepted: 10/21/2020] [Indexed: 12/16/2022]
Abstract
Selection of biologically relevant genes from high-dimensional expression data is a key research problem in gene expression genomics. Most of the available gene selection methods are either based on relevancy or redundancy measure, which are usually adjudged through post selection classification accuracy. Through these methods the ranking of genes was conducted on a single high-dimensional expression data, which led to the selection of spuriously associated and redundant genes. Hence, we developed a statistical approach through combining a support vector machine with Maximum Relevance and Minimum Redundancy under a sound statistical setup for the selection of biologically relevant genes. Here, the genes were selected through statistical significance values and computed using a nonparametric test statistic under a bootstrap-based subject sampling model. Further, a systematic and rigorous evaluation of the proposed approach with nine existing competitive methods was carried on six different real crop gene expression datasets. This performance analysis was carried out under three comparison settings, i.e., subject classification, biological relevant criteria based on quantitative trait loci and gene ontology. Our analytical results showed that the proposed approach selects genes which are more biologically relevant as compared to the existing methods. Moreover, the proposed approach was also found to be better with respect to the competitive existing methods. The proposed statistical approach provides a framework for combining filter and wrapper methods of gene selection.
Collapse
Affiliation(s)
- Samarendra Das
- Division of Statistical Genetics, Indian Council of Agricultural Research (ICAR)-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India;
- Netaji Subhas-Indian Council of Agricultural Research (ICAR) International Fellow, Indian Council of Agricultural Research, Krishi Bhawan, New Delhi 110001, India
- Biostatistics and Bioinformatics Facility, JG Brown Cancer Center, University of Louisville, Louisville, KY 40292, USA
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY 40292, USA
| | - Shesh N. Rai
- Biostatistics and Bioinformatics Facility, JG Brown Cancer Center, University of Louisville, Louisville, KY 40292, USA
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY 40292, USA
- Alcohol Research Center, University of Louisville, Louisville, KY 40292, USA
- Department of Hepatobiology and Toxicology, University of Louisville, Louisville, KY 40292, USA
- Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY 40292, USA
- Wendell Cherry Chair in Clinical Trial Research, University of Louisville, Louisville, KY 40292, USA
| |
Collapse
|