1
|
Rosati D, Palmieri M, Brunelli G, Morrione A, Iannelli F, Frullanti E, Giordano A. Differential gene expression analysis pipelines and bioinformatic tools for the identification of specific biomarkers: A review. Comput Struct Biotechnol J 2024; 23:1154-1168. [PMID: 38510977 PMCID: PMC10951429 DOI: 10.1016/j.csbj.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 02/20/2024] [Accepted: 02/20/2024] [Indexed: 03/22/2024] Open
Abstract
In recent years, the role of bioinformatics and computational biology together with omics techniques and transcriptomics has gained tremendous importance in biomedicine and healthcare, particularly for the identification of biomarkers for precision medicine and drug discovery. Differential gene expression (DGE) analysis is one of the most used techniques for RNA-sequencing (RNA-seq) data analysis. This tool, which is typically used in various RNA-seq data processing applications, allows the identification of differentially expressed genes across two or more sample sets. Functional enrichment analyses can then be performed to annotate and contextualize the resulting gene lists. These studies provide valuable information about disease-causing biological processes and can help in identifying molecular targets for novel therapies. This review focuses on differential gene expression (DGE) analysis pipelines and bioinformatic techniques commonly used to identify specific biomarkers and discuss the advantages and disadvantages of these techniques.
Collapse
Affiliation(s)
- Diletta Rosati
- Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Maria Palmieri
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Giulia Brunelli
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Andrea Morrione
- Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA
| | - Francesco Iannelli
- Laboratory of Molecular Microbiology and Biotechnology, Department of Medical Biotechnologies, University of Siena, Siena, Italy
| | - Elisa Frullanti
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Antonio Giordano
- Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA
| |
Collapse
|
2
|
Li T, Mani M. A physically inspired approach to coarse-graining transcriptomes reveals the dynamics of aging. PLoS One 2024; 19:e0301159. [PMID: 39471158 PMCID: PMC11521254 DOI: 10.1371/journal.pone.0301159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 08/12/2024] [Indexed: 11/01/2024] Open
Abstract
Single-cell RNA sequencing has enabled the study of aging at a molecular scale. While substantial progress has been made in measuring age-related gene expression, the underlying patterns and mechanisms of aging transcriptomes remain poorly understood. To address this gap, we propose a physics-inspired, data-analysis approach to extract additional insights from single-cell RNA sequencing data. By considering the genome as a many-body interacting system, we leverage central idea of the Renormalization Group to construct an approach to hierarchically describe aging across a spectrum of scales for the gene expresion. This framework provides a quantitative language to study the multiscale patterns of aging transcriptomes. Overall, our study demonstrates the value of leveraging theoretical physics concepts like the Renormalization Group to gain new biological insights from complex high-dimensional single-cell data.
Collapse
Affiliation(s)
- Tao Li
- Department of Engineering Science and Applied Mathematics, Northwestern University, Evanston, IL, United States of America
| | - Madhav Mani
- Department of Engineering Science and Applied Mathematics, Northwestern University, Evanston, IL, United States of America
- NSF-Simons Center for Quantitative Biology, Northwestern University, Evanston, IL, United States of America
| |
Collapse
|
3
|
Curtis L, Piggins HD. Diverse genetic alteration dysregulates neuropeptide and intracellular signalling in the suprachiasmatic nuclei. Eur J Neurosci 2024; 60:3921-3945. [PMID: 38924215 DOI: 10.1111/ejn.16443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 05/12/2024] [Accepted: 05/31/2024] [Indexed: 06/28/2024]
Abstract
In mammals, intrinsic 24 h or circadian rhythms are primarily generated by the suprachiasmatic nuclei (SCN). Rhythmic daily changes in the transcriptome and proteome of SCN cells are controlled by interlocking transcription-translation feedback loops (TTFLs) of core clock genes and their proteins. SCN cells function as autonomous circadian oscillators, which synchronize through intercellular neuropeptide signalling. Physiological and behavioural rhythms can be severely disrupted by genetic modification of a diverse range of genes and proteins in the SCN. With the advent of next generation sequencing, there is unprecedented information on the molecular profile of the SCN and how it is affected by genetically targeted alteration. However, whether the expression of some genes is more readily affected by genetic alteration of the SCN is unclear. Here, using publicly available datasets from recent RNA-seq assessments of the SCN from genetically altered and control mice, we evaluated whether there are commonalities in transcriptome dysregulation. This was completed for four different phases across the 24 h cycle and was augmented by Gene Ontology Molecular Function (GO:MF) and promoter analysis. Common differentially expressed genes (DEGs) and/or enriched GO:MF terms included signalling molecules, their receptors, and core clock components. Finally, examination of the JASPAR database indicated that E-box and CRE elements in the promoter regions of several commonly dysregulated genes. From this analysis, we identify differential expression of genes coding for molecules involved in SCN intra- and intercellular signalling as a potential cause of abnormal circadian rhythms.
Collapse
Affiliation(s)
- Lucy Curtis
- School of Biological Sciences, University of Bristol, Bristol, UK
- School of Physiology, Pharmacology, and Neuroscience, University of Bristol, Bristol, UK
| | - Hugh D Piggins
- School of Physiology, Pharmacology, and Neuroscience, University of Bristol, Bristol, UK
| |
Collapse
|
4
|
Jiang G, Zheng JY, Ren SN, Yin W, Xia X, Li Y, Wang HL. A comprehensive workflow for optimizing RNA-seq data analysis. BMC Genomics 2024; 25:631. [PMID: 38914930 PMCID: PMC11197194 DOI: 10.1186/s12864-024-10414-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 05/15/2024] [Indexed: 06/26/2024] Open
Abstract
BACKGROUND Current RNA-seq analysis software for RNA-seq data tends to use similar parameters across different species without considering species-specific differences. However, the suitability and accuracy of these tools may vary when analyzing data from different species, such as humans, animals, plants, fungi, and bacteria. For most laboratory researchers lacking a background in information science, determining how to construct an analysis workflow that meets their specific needs from the array of complex analytical tools available poses a significant challenge. RESULTS By utilizing RNA-seq data from plants, animals, and fungi, it was observed that different analytical tools demonstrate some variations in performance when applied to different species. A comprehensive experiment was conducted specifically for analyzing plant pathogenic fungal data, focusing on differential gene analysis as the ultimate goal. In this study, 288 pipelines using different tools were applied to analyze five fungal RNA-seq datasets, and the performance of their results was evaluated based on simulation. This led to the establishment of a relatively universal and superior fungal RNA-seq analysis pipeline that can serve as a reference, and certain standards for selecting analysis tools were derived for reference. Additionally, we compared various tools for alternative splicing analysis. The results based on simulated data indicated that rMATS remained the optimal choice, although consideration could be given to supplementing with tools such as SpliceWiz. CONCLUSION The experimental results demonstrate that, in comparison to the default software parameter configurations, the analysis combination results after tuning can provide more accurate biological insights. It is beneficial to carefully select suitable analysis software based on the data, rather than indiscriminately choosing tools, in order to achieve high-quality analysis results more efficiently.
Collapse
Affiliation(s)
- Gao Jiang
- School of Information Science and Technology, School of Artificial Intelligence, Beijing Forestry University, Beijing, 100083, People's Republic of China
| | - Juan-Yu Zheng
- School of Information Science and Technology, School of Artificial Intelligence, Beijing Forestry University, Beijing, 100083, People's Republic of China
| | - Shu-Ning Ren
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, People's Republic of China
| | - Weilun Yin
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, People's Republic of China
| | - Xinli Xia
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, People's Republic of China
| | - Yun Li
- School of Information Science and Technology, School of Artificial Intelligence, Beijing Forestry University, Beijing, 100083, People's Republic of China.
| | - Hou-Ling Wang
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, People's Republic of China.
| |
Collapse
|
5
|
Pu X, Ma S, Zhao B, Tang S, Lu Q, Liu W, Wang Y, Cen Y, Wu C, Fu X. Transcriptome meta-analysis reveals the hair genetic rules in six animal breeds and genes associated with wool fineness. Front Genet 2024; 15:1401369. [PMID: 38948362 PMCID: PMC11211574 DOI: 10.3389/fgene.2024.1401369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 05/24/2024] [Indexed: 07/02/2024] Open
Abstract
Wool plays an irreplaceable role in the lives of livestock and the textile industry. The variety of hair quality and shape leads to the diversity of its functions and applications, and the finer wool has a higher economic value. In this study, 10 coarse and 10 fine ordos fine wool sheep skin samples were collected for RNA-seq, and coarse and fine skin/hair follicle RNA-seq datasets of other five animal breeds were obtained from NCBI. Weighted gene co-expression network analysis showed that the common genes were clustered into eight modules. Similar gene expression patterns in sheep and rabbits with the same wool types, different gene expression patterns in animal species with different hair types, and brown modules were significantly correlated with species and breeds. GO and KEGG enrichment analyses showed that, most genes in the brown module associated with hair follicle development. Hence, gene expression patterns in skin tissues may determine hair morphology in animal. The analysis of differentially expressed genes revealed that 32 highly expressed candidate genes associated with the wool fineness of Ordos fine wool sheep. Among them, KAZALD1 (grey module), MYOC (brown module), C1QTNF6 (brown module), FOS (tan module), ITGAM, MX2, MX1, and IFI6 genes have been reported to be involved in the regulation of the hair follicle cycle or hair loss. Additionally, 12 genes, including KAZALD1, MYOC, C1QTNF6, and FOS, are differentially expressed across various animal breeds and species. The above results suggest that different sheep breeds share a similar molecular regulatory basis of wool fineness. Finally, the study provides a theoretical reference for molecular breeding of sheep breeds as well as for the investigation of the origin and evolution of animal hair.
Collapse
Affiliation(s)
- Xue Pu
- Key Laboratory of Special Environments Biodiversity Application and Regulation in Xinjiang, Xinjiang Key Laboratory of Special Species Conservation and Regulatory Biology, College of Life Sciences, Xinjiang Normal University, Urumqi, Xinjiang, China
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
| | - Shengchao Ma
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
- College of Animal Science, Xinjiang Agricultural University, Urumqi, Xinjiang, China
| | - Bingru Zhao
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
- Jiangsu Livestock Embryo Engineering Laboratory, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Sen Tang
- Key Laboratory of Herbivorous Livestock Genetics, Ministry of Agriculture, Institute of Biotechnology, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
| | - Qingwei Lu
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
- College of Animal Science, Xinjiang Agricultural University, Urumqi, Xinjiang, China
| | - Wenna Liu
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
- College of Animal Science, Xinjiang Agricultural University, Urumqi, Xinjiang, China
| | - Yaqian Wang
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
- College of Animal Science, Xinjiang Agricultural University, Urumqi, Xinjiang, China
| | - Yunlin Cen
- Key Laboratory of Special Environments Biodiversity Application and Regulation in Xinjiang, Xinjiang Key Laboratory of Special Species Conservation and Regulatory Biology, College of Life Sciences, Xinjiang Normal University, Urumqi, Xinjiang, China
| | - Cuiling Wu
- Key Laboratory of Special Environments Biodiversity Application and Regulation in Xinjiang, Xinjiang Key Laboratory of Special Species Conservation and Regulatory Biology, College of Life Sciences, Xinjiang Normal University, Urumqi, Xinjiang, China
| | - Xuefeng Fu
- Key Laboratory of Genetics Breeding and Reproduction of Xinjiang Wool-Sheep Cashmere-Goat (XJYS1105), Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, Xinjiang, China
| |
Collapse
|
6
|
Rostami Nejad M, Farahani M, Razzaghi Z, Arjmand B, Montazer F, Bandarian F, Razi F, Rezaei Tavirani M. Evaluation of Near-Infrared Laser Effects on 143B Cells: A System Biology Approach. J Lasers Med Sci 2024; 15:e14. [PMID: 39051000 PMCID: PMC11266825 DOI: 10.34172/jlms.2024.14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Accepted: 02/19/2024] [Indexed: 07/27/2024]
Abstract
Introduction: Photothermal therapy (PTT) by using a near-infrared (NIR) laser, as a successful treatment of cancer, has attracted extensive attention of researchers. Its advantages as a noninvasive and suitable method have been confirmed. Discovery of the NIR laser molecular mechanism at the cellular level via system biology assessment to identify the crucial targeted genes is the aim of this study. Methods: RNA-seq series of six samples were retrieved from Gene Expression Omnibus (GEO) and pre-evaluated by the GEO2R program for more analysis. The significant differentially expressed genes (DEGs) were determined and studied via gene expression analysis, protein-protein interaction (PPI) network assessment, action map evaluation, and gene ontology enrichment. Results: HSPA5, DDIT3, TRIB3, PTGS2, HMOX1, ASNS, GDF15, SLC7A11, and SQSTM1 were identified as central genes. Comparing the central genes and the determined crucial genes via gene expression analysis, actin map results, and gene ontology enrichment led to the introduction of HSPA5, DDIT3, PTGS2, HMOX1, and GDF15 as critical genes in response to the NIR laser. Conclusion: The results indicated that the principle biological process "Endoplasmic reticulum unfolded protein response" and HSPA5, DDIT3, PTGS2, HMOX1, and GDF15 are the crucial targets of the NIR laser. The results also showed that the NIR laser induces stress conditions in the irradiated cells.
Collapse
Affiliation(s)
- Mohammad Rostami Nejad
- Celiac Disease and Gluten Related Disorders Research Center, Research Institute for Gastroenterology and Liver Disease, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Masoumeh Farahani
- Skin Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Zahra Razzaghi
- Laser Application in Medical Sciences Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Babak Arjmand
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
- Iranian Cancer Control Center (MACSA), Tehran, Iran
| | - Fatemeh Montazer
- Department of Pathology, Firoozabadi Hospital, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Fatemeh Bandarian
- Endocrinology and Metabolism Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Farideh Razi
- Diabetes Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Mostafa Rezaei Tavirani
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
7
|
Makino M, Shimizu K, Kadota K. Enhanced clustering-based differential expression analysis method for RNA-seq data. MethodsX 2024; 12:102518. [PMID: 38179066 PMCID: PMC10764243 DOI: 10.1016/j.mex.2023.102518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 12/10/2023] [Indexed: 01/06/2024] Open
Abstract
RNA-seq is a tool for measuring gene expression and is commonly used to identify differentially expressed genes (DEGs). Gene clustering has been widely used to classify DEGs with similar expression patterns, but rarely used to identify DEGs themselves. We recently reported that the clustering-based method (called MBCdeg1 and 2) for identifying DEGs has great potential. However, these methods left room for improvement. This study reports on the improvement (named MBCdeg3). We compared a total of six competing methods: three conventional R packages (edgeR, DESeq2, and TCC) and three versions of MBCdeg (i.e., MBCdeg1, 2, and 3) corresponding to three different normalization algorithms. As MBCdeg3 performs well in many simulation scenarios of RNA-seq count data, MBCdeg3 replaces MBCdeg1 and 2 in our previous report. •MBCdeg3 is a method for both identification and classification of DEGs from RNA-seq count data.•MBCdeg3 is available as a function of R, which is common in the field of expression analysis.•MBCdeg3 performs well in a variety of simulation scenarios for RNA-seq count data.
Collapse
Affiliation(s)
- Manon Makino
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| | - Kentaro Shimizu
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| | - Koji Kadota
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
- Interfaculty Initiative in Information Studies, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| |
Collapse
|
8
|
Ochoa-Alejo N, Reyes-Valdés MH, Martínez O. Estimating Transcriptome Diversity and Specialization in Capsicum annuum L. PLANTS (BASEL, SWITZERLAND) 2024; 13:983. [PMID: 38611513 PMCID: PMC11013594 DOI: 10.3390/plants13070983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 03/22/2024] [Accepted: 03/26/2024] [Indexed: 04/14/2024]
Abstract
Chili pepper fruits of the genus Capsicum represent excellent experimental models to study the growth, development, and ripening processes in a non-climacteric species at the physiological, biochemical, and molecular levels. Fruit growth, development, and ripening involve a complex, harmonious, and finely controlled regulation of gene expression. The purpose of this study was to estimate the changes in transcriptome diversity and specialization, as well as gene specificities during fruit development in this crop, and to illustrate the advantages of estimating these parameters. To achieve these aims, we programmed and made publicly available an R package. In this study, we applied these methods to a set of 179 RNA-Seq libraries from a factorial experiment that includes 12 different genotypes at various stages of fruit development. We found that the diversity of the transcriptome decreases linearly from the flower to the mature fruit, while its specialization follows a complex and non-linear behavior during this process. Additionally, by defining sets of genes with different degrees of specialization and applying Gene Ontology enrichment analysis, we identified processes, functions, and components that play a central role in particular fruit development stages. In conclusion, the estimation of diversity, specialization, and specificity summarizes the global properties of the transcriptomes, providing insights that are difficult to achieve by other means.
Collapse
Affiliation(s)
- Neftalí Ochoa-Alejo
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Irapuato 36824, Guanajuato, Mexico;
| | - M. Humberto Reyes-Valdés
- Department of Plant Breeding, Universidad Autónoma Agraria Antonio Narro, Saltillo 25315, Coahuila, Mexico;
| | - Octavio Martínez
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav), Irapuato 36824, Guanajuato, Mexico
| |
Collapse
|
9
|
Zayakin P. sRNAflow: A Tool for the Analysis of Small RNA-Seq Data. Noncoding RNA 2024; 10:6. [PMID: 38250806 PMCID: PMC10801628 DOI: 10.3390/ncrna10010006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 12/29/2023] [Accepted: 01/15/2024] [Indexed: 01/23/2024] Open
Abstract
The analysis of small RNA sequencing data across a range of biofluids is a significant research area, given the diversity of RNA types that hold potential diagnostic, prognostic, and predictive value. The intricate task of segregating the complex mixture of small RNAs from both human and other species, including bacteria, fungi, and viruses, poses one of the most formidable challenges in the analysis of small RNA sequencing data, currently lacking satisfactory solutions. This study introduces sRNAflow, a user-friendly bioinformatic tool with a web interface designed for the analysis of small RNAs obtained from biological fluids. Tailored to the unique requirements of such samples, the proposed pipeline addresses various challenges, including filtering potential RNAs from reagents and environment, classifying small RNA types, managing small RNA annotation overlap, conducting differential expression assays, analysing isomiRs, and presenting an approach to identify the sources of small RNAs within samples. sRNAflow also encompasses an alternative alignment-free analysis of RNA-seq data, featuring clustering and initial RNA source identification using BLAST. This comprehensive approach facilitates meaningful comparisons of results between different analytical methods.
Collapse
Affiliation(s)
- Pawel Zayakin
- Latvian Biomedical Research and Study Centre, LV-1067 Riga, Latvia;
- European Bioinformatics Institute, EMBL-EBI, Hinxton CB10 1SD, UK
| |
Collapse
|
10
|
Chaturvedi A, Som A. Inference of Dynamic Growth Regulatory Network in Cancer Using High-Throughput Transcriptomic Data. Methods Mol Biol 2024; 2719:51-77. [PMID: 37803112 DOI: 10.1007/978-1-0716-3461-5_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/08/2023]
Abstract
Growth is regulated by gene expression variation at different developmental stages of biological processes such as cell differentiation, disease progression, or drug response. In cancer, a stage-specific regulatory model constructed to infer the dynamic expression changes in genes contributing to tissue growth or proliferation is referred as a dynamic growth regulatory network (dGRN). Over the past decade, gene expression data has been widely used for reconstructing dGRN by computing correlations between the differentially expressed genes (DEGs). A wide variety of pipelines are available to construct the GRNs using DEGs and the choice of a particular method or tool depends on the nature of the study. In this protocol, we have outlined a step-by-step guide for the analysis of DEGs using RNA-Seq data, beginning from data acquisition, pre-processing, mapping to reference genome, and construction of a correlation-based co-expression network to further downstream analysis. We have also outlined the steps for the inclusion of publicly available interaction/regulation information into the dGRN followed by relevant topological inferences. This tutorial has been designed in a way that early researchers can refer to for an easy and comprehensive glimpse of methodologies used in the inference of dGRN using transcriptomics data.
Collapse
Affiliation(s)
- Aparna Chaturvedi
- Centre of Bioinformatics, Institute of Interdisciplinary Studies, University of Allahabad, Prayagraj, India
| | - Anup Som
- Centre of Bioinformatics, Institute of Interdisciplinary Studies, University of Allahabad, Prayagraj, India
| |
Collapse
|
11
|
Kozlova A, Sarygina E, Deinichenko K, Radko S, Ptitsyn K, Khmeleva S, Kurbatov L, Spirin P, Prassolov V, Ilgisonis E, Lisitsa A, Ponomarenko E. Comparison of Alternative Splicing Landscapes Revealed by Long-Read Sequencing in Hepatocyte-Derived HepG2 and Huh7 Cultured Cells and Human Liver Tissue. BIOLOGY 2023; 12:1494. [PMID: 38132320 PMCID: PMC10740679 DOI: 10.3390/biology12121494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 11/17/2023] [Accepted: 11/25/2023] [Indexed: 12/23/2023]
Abstract
The long-read RNA sequencing developed by Oxford Nanopore Technologies provides a direct quantification of transcript isoforms, thereby making it possible to present alternative splicing (AS) profiles as arrays of single splice variants with different abundances. Additionally, AS profiles can be presented as arrays of genes characterized by the degree of alternative splicing (the DAS-the number of detected splice variants per gene). Here, we successfully utilized the DAS to reveal biological pathways influenced by the alterations in AS in human liver tissue and the hepatocyte-derived malignant cell lines HepG2 and Huh7, thus employing the mathematical algorithm of gene set enrichment analysis. Furthermore, analysis of the AS profiles as abundances of single splice variants by using the graded tissue specificity index τ provided the selection of the groups of genes expressing particular splice variants specifically in liver tissue, HepG2 cells, and Huh7 cells. The majority of these splice variants were translated into proteins products and appeal to be in focus regarding further insights into the mechanisms underlying cell malignization. The used metrics are intrinsically suitable for transcriptome-wide AS profiling using long-read sequencing.
Collapse
Affiliation(s)
- Anna Kozlova
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Elizaveta Sarygina
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Kseniia Deinichenko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Sergey Radko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Konstantin Ptitsyn
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Svetlana Khmeleva
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Leonid Kurbatov
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Pavel Spirin
- Department of Cancer Cell Biology, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova 32, 119991 Moscow, Russia; (P.S.); (V.P.)
| | - Vladimir Prassolov
- Department of Cancer Cell Biology, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova 32, 119991 Moscow, Russia; (P.S.); (V.P.)
| | - Ekaterina Ilgisonis
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Andrey Lisitsa
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Elena Ponomarenko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| |
Collapse
|