Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li Y, Chen L. Big biological data: challenges and opportunities. Genomics Proteomics Bioinformatics 2014;12:187-9. [PMID: 25462151 PMCID: PMC4411415 DOI: 10.1016/j.gpb.2014.10.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Received: 09/01/2014] [Revised: 10/01/2014] [Accepted: 10/01/2014] [Indexed: 11/17/2022]

Cited by Other Article(s)

Bi X, Qiu M, Li D, Zhang Y, Zhan W, Wang Z, Lv Z, Li H, Chen G. Transcriptomic and metabolomic analysis of the mechanisms underlying stress responses of the freshwater snail, Pomacea canaliculata, exposed to different levels of arsenic. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2024;267:106835. [PMID: 38219501 DOI: 10.1016/j.aquatox.2024.106835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/05/2023] [Revised: 12/12/2023] [Accepted: 01/09/2024] [Indexed: 01/16/2024]

Affiliation(s)

Xiaoyang Bi Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
Mingxin Qiu Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
Danni Li Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
Yujing Zhang Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
Wenhui Zhan Guangdong Testing Institute of Product Quality Supervision, Foshan 528300, China
Zhixiong Wang Guangdong Testing Institute of Product Quality Supervision, Foshan 528300, China
Zhaowei Lv Guangdong Testing Institute of Product Quality Supervision, Foshan 528300, China
Huashou Li Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China
Guikui Chen Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Provincial Key Laboratory of Agricultural & Rural Pollution Abatement and Environmental Safety, College of Natural Resources and Environment, South China Agricultural University, Guangzhou 510642, China.

Feser M, König P, Fiebig A, Arend D, Lange M, Scholz U. On the way to plant data commons - a genotyping use case. J Integr Bioinform 2022;19:jib-2022-0033. [PMID: 36065132 PMCID: PMC9800039 DOI: 10.1515/jib-2022-0033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/15/2022] [Revised: 08/04/2022] [Accepted: 08/11/2022] [Indexed: 01/09/2023] Open

Diakou I, Papakonstantinou E, Papageorgiou L, Pierouli K, Dragoumani K, Spandidos DA, Bacopoulou F, Chrousos GP, Goulielmos GN, Eliopoulos E, Vlachakis D. Multiple sclerosis and computational biology (Review). Biomed Rep 2022;17:96. [PMID: 36382258 PMCID: PMC9634047 DOI: 10.3892/br.2022.1579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2022] [Accepted: 09/27/2022] [Indexed: 12/02/2022] Open

Affiliation(s)

Io Diakou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Eleni Papakonstantinou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Louis Papageorgiou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Katerina Pierouli Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Konstantina Dragoumani Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Demetrios A. Spandidos Laboratory of Clinical Virology, School of Medicine, University of Crete, 71003 Heraklion, Greece
Flora Bacopoulou University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
George P. Chrousos University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
Georges N. Goulielmos Section of Molecular Pathology and Human Genetics, Department of Internal Medicine, School of Medicine, University of Crete, 71003 Heraklion, Greece
Elias Eliopoulos Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Dimitrios Vlachakis Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece; University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece; Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of The Academy of Athens, 11527 Athens, Greece

Yan M, Nie H, Wang Y, Wang X, Jarret R, Zhao J, Wang H, Yang J. Exploring and exploiting genetics and genomics for sweetpotato improvement: Status and perspectives. PLANT COMMUNICATIONS 2022;3:100332. [PMID: 35643086 PMCID: PMC9482988 DOI: 10.1016/j.xplc.2022.100332] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Received: 10/26/2021] [Revised: 04/17/2022] [Accepted: 05/02/2022] [Indexed: 05/14/2023]

Applications of dynamic feature selection and clustering methods to medical diagnosis. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 12/12/2022]

Combining metabolome and clinical indicators with machine learning provides some promising diagnostic markers to precisely detect smear-positive/negative pulmonary tuberculosis. BMC Infect Dis 2022;22:707. [PMID: 36008772 PMCID: PMC9403968 DOI: 10.1186/s12879-022-07694-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 01/07/2022] [Accepted: 08/22/2022] [Indexed: 11/30/2022] Open

Abstract

Background

Tuberculosis (TB) was the leading lethal infectious disease worldwide for a long time (2014–2019) until the COVID-19 global pandemic, and it is still one of the top 10 causes of death worldwide. One important reason for the large number of TB patients and deaths is the difficulty of precisely diagnosing TB with common detection methods, especially in some smear-negative pulmonary tuberculosis (SNPT) cases. The rapid development of metabolomics and machine learning offers a great opportunity for precision diagnosis of TB. However, the metabolite biomarkers for the precision diagnosis of smear-positive and smear-negative pulmonary tuberculosis (SPPT/SNPT) remain to be uncovered. In this study, we combined metabolomics and clinical indicators with machine learning to screen out new diagnostic biomarkers for the precise identification of SPPT and SNPT patients.

Methods

Untargeted plasma metabolomic profiling was performed for 27 SPPT patients, 37 SNPT patients and controls. Orthogonal partial least squares-discriminant analysis (OPLS-DA) was then conducted to screen differential metabolites among the three groups. Metabolite pathway enrichment, random forest (RF), support vector machine (SVM) and multilayer perceptron neural network (MLP) analyses were performed using MetaboAnalyst 5.0, the “caret” R package, the “e1071” R package and the “TensorFlow” Python package, respectively.

Results

Metabolomic analysis revealed significant enrichment of fatty acid and amino acid metabolites in the plasma of SPPT and SNPT patients, where SPPT samples showed a more serious dysfunction in fatty acid and amino acid metabolism. Further RF analysis revealed four optimized diagnostic biomarker combinations including ten features (two lipid/lipid-like molecules, seven organic acids/derivatives, and one clinical indicator) for the identification of SPPT patients, SNPT patients and controls with high accuracy (83–93%), which were further verified by SVM and MLP. Among them, MLP displayed the best classification performance in simultaneously identifying the three groups (94.74%), suggesting the advantage of MLP over RF/SVM to some extent.

Conclusions

Our findings reveal plasma metabolomic characteristics of SPPT and SNPT patients, provide some novel promising diagnostic markers for precision diagnosis of various types of TB, and show the potential of machine learning in screening out biomarkers from big data.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12879-022-07694-8.

Adams DC, Collyer ML. Consilience of methods for phylogenetic analysis of variance. Evolution 2022;76:1406-1419. [PMID: 35522593 PMCID: PMC9544334 DOI: 10.1111/evo.14512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2021] [Accepted: 03/22/2022] [Indexed: 01/21/2023]

Arend D, Psaroudakis D, Memon JA, Rey-Mazón E, Schüler D, Szymanski JJ, Scholz U, Junker A, Lange M. From data to knowledge - big data needs stewardship, a plant phenomics perspective. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022;111:335-347. [PMID: 35535481 DOI: 10.1111/tpj.15804] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 01/24/2022] [Revised: 05/02/2022] [Accepted: 05/06/2022] [Indexed: 06/14/2023]

Youn J, Rai N, Tagkopoulos I. Knowledge integration and decision support for accelerated discovery of antibiotic resistance genes. Nat Commun 2022;13:2360. [PMID: 35487919 PMCID: PMC9055065 DOI: 10.1038/s41467-022-29993-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/14/2021] [Accepted: 03/04/2022] [Indexed: 11/09/2022] Open

Monteiro HS, Leifer I, Reis SDS, Andrade JS, Makse HA. Fast algorithm to identify minimal patterns of synchrony through fibration symmetries in large directed networks. CHAOS (WOODBURY, N.Y.) 2022;32:033120. [PMID: 35364841 PMCID: PMC8933057 DOI: 10.1063/5.0066741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/12/2021] [Accepted: 02/24/2022] [Indexed: 06/14/2023]

Sankara Narayanan P, Runthala A. Accurate computational evolution of proteins and its dependence on deep learning and machine learning strategies. BIOCATAL BIOTRANSFOR 2022. [DOI: 10.1080/10242422.2022.2030317] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 02/05/2023]

Silva-Costa LC, Smith BJ. Post-translational Modifications in Brain Diseases: A Future for Biomarkers. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022;1382:129-141. [DOI: 10.1007/978-3-031-05460-0_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/28/2022]

Chen J, Guo Y, Huang S, Zhan H, Zhang M, Wang J, Shu Y. Integration of transcriptome and proteome reveals molecular mechanisms underlying stress responses of the cutworm, Spodoptera litura, exposed to different levels of lead (Pb). CHEMOSPHERE 2021;283:131205. [PMID: 34147986 DOI: 10.1016/j.chemosphere.2021.131205] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 03/14/2021] [Revised: 06/08/2021] [Accepted: 06/09/2021] [Indexed: 06/12/2023]

Abstract

Heavy metals are major environmental pollutants that affect organisms across different trophic levels. Herbivorous insects play an important role in the bioaccumulation, and eventually, biomagnification of these metals. Although effects of heavy metal stress on insects have been well-studied, the molecular mechanisms underlying their effects remain poorly understood. Here, we used RNA-Seq profiling and isobaric tags for relative and absolute quantitation (iTRAQ) approaches to unravel these mechanisms in the polyphagous pest Spodoptera litura exposed to lead (Pb) at two different concentrations (12.5 and 100 mg Pb/kg; PbL and PbH, respectively). Altogether, 1392 and 1630 differentially expressed genes (DEGs) and 58 and 114 differentially expressed proteins (DEPs) were identified in larvae exposed to PbL and PbH, respectively. After exposure to PbL, the main up-regulated gene clusters and proteins in S. litura larvae were associated with their metabolic processes, including carbohydrate, protein, and lipid metabolism, but the levels of cytochrome P450 associated with the pathway of xenobiotic biodegradation and metabolism were found to be decreased. In contrast, the main up-regulated gene clusters and proteins in larvae exposed to PbH were enriched in the metabolism of xenobiotics by cytochrome P450, drug metabolism-cytochrome P450, and other drug metabolism enzymes, while the down-regulated genes and proteins were found to be closely related to lipid (lipase) and protein (serine protease, trypsin) metabolism and growth processes (cuticular protein). These findings indicate that S. litura larvae exposed to PbL could enhance food digestion and absorption to prioritize growth rather than detoxification, whereas S. litura larvae exposed to PbH reduced food digestion and absorption and channeled their limited energy into detoxification rather than growth. These contrasting results explain the dose-dependent effects of heavy metal stress on insect life-history traits, wherein low levels of heavy metal stress induce stimulation, while high levels of heavy metal stress cause inhibition at the transcriptome and proteome levels.

Affiliation(s)

Jin Chen Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
Yeshan Guo Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
Shimin Huang Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
Huiru Zhan Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
Meifang Zhang Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
Jianwu Wang Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China.
Yinghua Shu Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture, South China Agricultural University, Guangzhou, 510642, China; Guangdong Engineering Research Centre for Modern Eco-agriculture, Guangzhou, 510642, China; Department of Ecology, College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China.

Vitorino R, Choudhury M, Guedes S, Ferreira R, Thongboonkerd V, Sharma L, Amado F, Srivastava S. Peptidomics and proteogenomics: background, challenges and future needs. Expert Rev Proteomics 2021;18:643-659. [PMID: 34517741 DOI: 10.1080/14789450.2021.1980388] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 10/20/2022]

Pegg TJ, Gladish DK, Baker RL. Algae to angiosperms: Autofluorescence for rapid visualization of plant anatomy among diverse taxa. APPLICATIONS IN PLANT SCIENCES 2021;9:e11437. [PMID: 34268017 PMCID: PMC8272585 DOI: 10.1002/aps3.11437] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 03/19/2021] [Accepted: 05/19/2021] [Indexed: 05/22/2023]

Kang DS, Kim HS, Jung JH, Lee CM, Ahn YS, Seo YR. Formaldehyde exposure and leukemia risk: a comprehensive review and network-based toxicogenomic approach. Genes Environ 2021;43:13. [PMID: 33845901 PMCID: PMC8042688 DOI: 10.1186/s41021-021-00183-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Received: 08/16/2020] [Accepted: 03/19/2021] [Indexed: 12/20/2022] Open

JSOM: Jointly-evolving self-organizing maps for alignment of biological datasets and identification of related clusters. PLoS Comput Biol 2021;17:e1008804. [PMID: 33724985 PMCID: PMC7963045 DOI: 10.1371/journal.pcbi.1008804] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/03/2020] [Accepted: 02/15/2021] [Indexed: 11/19/2022] Open

Abstract

With the rapid advances of various single-cell technologies, an increasing number of single-cell datasets are being generated, and the computational tools for aligning the datasets, which make subsequent integration or meta-analysis possible, have become critical. Typically, single-cell datasets from different technologies cannot be directly combined or concatenated, due to innate differences in the data, such as the number of measured parameters and the distributions. Even datasets generated by the same technology are often affected by the batch effect. A computational approach for aligning different datasets and hence identifying related clusters will be useful for data integration and interpretation in large-scale single-cell experiments. Our proposed algorithm, called JSOM, a variation of the self-organizing map, aligns two related datasets that contain similar clusters by constructing two maps (low-dimensional discretized representations of the datasets) that jointly evolve according to both datasets. Here we applied the JSOM algorithm to flow cytometry, mass cytometry, and single-cell RNA sequencing datasets. The resulting JSOM maps not only align the related clusters in the two datasets but also preserve the topology of the datasets so that the maps could be used for further analysis, such as clustering.

Biological datasets are now generated more than ever as many data acquisition technologies have been developed over the years, especially single-cell technologies. With increasing amounts of datasets available for larger scale studies, robust computational tools that could align datasets are needed for data integration and interpretation. We present a new algorithm that can align two biological datasets and demonstrated that the algorithm can work with data generated from different data acquisition technologies. Our proposed algorithm produces low dimensional representations of two datasets to align them in a way that preserves the topology of the respective datasets. Such aligned maps facilitate further analysis, such as clustering. The proposed algorithm showed promising results when applied to different combinations of datasets, i.e., flow cytometry to flow cytometry, flow cytometry to mass cytometry, and two different single-cell RNA sequencing technologies. Therefore, our newly developed algorithm could potentially lead to new discoveries that were once difficult to obtain.

Mahmud M, Kaiser MS, McGinnity TM, Hussain A. Deep Learning in Mining Biological Data. Cognit Comput 2021;13:1-33. [PMID: 33425045 PMCID: PMC7783296 DOI: 10.1007/s12559-020-09773-x] [Citation(s) in RCA: 100] [Impact Index Per Article: 33.3] [Received: 05/01/2020] [Accepted: 09/28/2020] [Indexed: 02/06/2023]

Wu L, Han L, Li Q, Wang G, Zhang H, Li L. Using Interactome Big Data to Crack Genetic Mysteries and Enhance Future Crop Breeding. MOLECULAR PLANT 2021;14:77-94. [PMID: 33340690 DOI: 10.1016/j.molp.2020.12.012] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Received: 09/14/2020] [Revised: 12/11/2020] [Accepted: 12/14/2020] [Indexed: 05/27/2023]

Multi-assignment clustering: Machine learning from a biological perspective. J Biotechnol 2020;326:1-10. [PMID: 33285150 DOI: 10.1016/j.jbiotec.2020.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/17/2020] [Accepted: 12/03/2020] [Indexed: 11/21/2022]

Scott JK, Breden F. The adaptive immune receptor repertoire community as a model for FAIR stewardship of big immunology data. CURRENT OPINION IN SYSTEMS BIOLOGY 2020;24:71-77. [PMID: 33073065 PMCID: PMC7547575 DOI: 10.1016/j.coisb.2020.10.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 11/25/2022]

Lung PY, Zhong D, Pang X, Li Y, Zhang J. Maximizing the reusability of gene expression data by predicting missing metadata. PLoS Comput Biol 2020;16:e1007450. [PMID: 33156882 PMCID: PMC7673503 DOI: 10.1371/journal.pcbi.1007450] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/28/2019] [Revised: 11/18/2020] [Accepted: 10/09/2020] [Indexed: 11/18/2022] Open

Kruchten AE. A Curricular Bioinformatics Approach to Teaching Undergraduates to Analyze Metagenomic Datasets Using R. Front Microbiol 2020;11:578600. [PMID: 33013816 PMCID: PMC7511545 DOI: 10.3389/fmicb.2020.578600] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 06/30/2020] [Accepted: 08/12/2020] [Indexed: 01/06/2023] Open

Al-Harazi O, El Allali A, Colak D. Biomolecular Databases and Subnetwork Identification Approaches of Interest to Big Data Community: An Expert Review. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020;23:138-151. [PMID: 30883301 DOI: 10.1089/omi.2018.0205] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Indexed: 12/15/2022]

Abstract

Next-generation sequencing approaches and genome-wide studies have become essential for characterizing the mechanisms of human diseases. Consequently, many researchers have applied these approaches to discover the genetic/genomic causes of common complex and rare human diseases, generating multiomics big data that span the continuum of genomics, proteomics, metabolomics, and many other system science fields. Therefore, there is a significant and unmet need for biological databases and tools that enable and empower researchers to analyze, integrate, and make sense of big data. There are currently a large number of databases that offer different types of biological information. In particular, the integration of gene expression profiles and protein-protein interaction networks provides a deeper understanding of the complex multilayered molecular architecture of human diseases. Therefore, there has been a growing interest in developing methodologies that integrate and contextualize big data from molecular interaction networks to identify biomarkers of human diseases at a subnetwork resolution as well. In this expert review, we provide a comprehensive summary of the most popular biomolecular databases for molecular interactions (e.g., Biological General Repository for Interaction Datasets, Kyoto Encyclopedia of Genes and Genomes and Search Tool for The Retrieval of Interacting Genes/Proteins), gene-disease associations (e.g., Online Mendelian Inheritance in Man, Disease-Gene Network, MalaCards), and population-specific databases (e.g., Human Genetic Variation Database), and describe some examples of their usage and potential applications. We also present the most recent subnetwork identification approaches and discuss their main advantages and limitations. As the field of data science continues to emerge, the present analysis offers a deeper and contextualized understanding of the available databases in molecular biomedicine.

The Translational Status of Cancer Liquid Biopsies. REGENERATIVE ENGINEERING AND TRANSLATIONAL MEDICINE 2019. [DOI: 10.1007/s40883-019-00141-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Indexed: 12/24/2022]

Abstract

Precision oncology aims to tailor clinical decisions specifically to patients with the objective of improving treatment outcomes. This can be achieved by leveraging omics information for accurate molecular characterization of tumors. Tumor tissue biopsies are currently the main source of information for molecular profiling. However, biopsies are invasive and limited in resolving spatiotemporal heterogeneity in tumor tissues. Alternative non-invasive liquid biopsies can exploit patient’s body fluids to access multiple layers of tumor-specific biological information (genomes, epigenomes, transcriptomes, proteomes, metabolomes, circulating tumor cells, and exosomes). Analysis and integration of these large and diverse datasets using statistical and machine learning approaches can yield important insights into tumor biology and lead to discovery of new diagnostic, predictive, and prognostic biomarkers. Translation of these new diagnostic tools into standard clinical practice could transform oncology, as demonstrated by a number of liquid biopsy assays already entering clinical use. In this review, we highlight successes and challenges facing the rapidly evolving field of cancer biomarker research.

Lay Summary

Precision oncology aims to tailor clinical decisions specifically to patients with the objective of improving treatment outcomes. The discovery of biomarkers for precision oncology has been accelerated by high-throughput experimental and computational methods, which can inform fine-grained characterization of tumors for clinical decision-making. Moreover, advances in the liquid biopsy field allow non-invasive sampling of patient’s body fluids with the aim of analyzing circulating biomarkers, obviating the need for invasive tumor tissue biopsies. In this review, we highlight successes and challenges facing the rapidly evolving field of liquid biopsy cancer biomarker research.

Mabvakure BM, Rott R, Dobrowsky L, Van Heusden P, Morris L, Scheepers C, Moore PL. Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings. Bioinform Biol Insights 2019;13:1177932219882347. [PMID: 35173421 PMCID: PMC8842485 DOI: 10.1177/1177932219882347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Accepted: 09/21/2019] [Indexed: 11/17/2022] Open

Simões T, Novais SC, Natal-da-Luz T, Devreese B, de Boer T, Roelofs D, Sousa JP, van Straalen NM, Lemos MFL. Using time-lapse omics correlations to integrate toxicological pathways of a formulated fungicide in a soil invertebrate. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2019;246:845-854. [PMID: 30623841 DOI: 10.1016/j.envpol.2018.12.069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 12/18/2018] [Accepted: 12/22/2018] [Indexed: 06/09/2023]

Sun S, Miao Z, Ratcliffe B, Campbell P, Pasch B, El-Kassaby YA, Balasundaram B, Chen C. SNP variable selection by generalized graph domination. PLoS One 2019;14:e0203242. [PMID: 30677030 PMCID: PMC6345469 DOI: 10.1371/journal.pone.0203242] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 01/08/2019] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

High-throughput sequencing technology has revolutionized both medical and biological research by generating exceedingly large numbers of genetic variants. The resulting datasets share a number of common characteristics that might lead to poor generalization capacity. Concerns include noise accumulated due to the large number of predictors, sparse information regarding the p≫n problem, and overfitting and model mis-identification resulting from spurious collinearity. Additionally, complex correlation patterns are present among variables. As a consequence, reliable variable selection techniques play a pivotal role in predictive analysis, generalization capability, and robustness in clustering, as well as interpretability of the derived models.

METHODS AND FINDINGS

The k-dominating set, a parameterized graph-theoretic generalization model, was used to model SNP (single nucleotide polymorphism) data as a similarity network and to search for representative SNP variables. In particular, each SNP was represented as a vertex in the graph, and (dis)similarity measures such as correlation coefficients or pairwise linkage disequilibrium were estimated to describe the relationship between each pair of SNPs; a pair of vertices is adjacent, i.e. joined by an edge, if the pairwise similarity measure exceeds a user-specified threshold. A minimum k-dominating set in the SNP graph was then defined as the smallest subset such that every SNP excluded from the subset has at least k neighbors among the selected ones. The strength of k-dominating set selection in identifying independent variables, and in culling representative variables that are highly correlated with others, was demonstrated on a simulated dataset. The advantages of k-dominating set variable selection were also illustrated in two applications: pedigree reconstruction using SNP profiles of 1,372 Douglas-fir trees, and species delineation for 226 grasshopper mouse samples. C++ source code that implements SNP-SELECT and uses the Gurobi optimization solver for the k-dominating set variable selection is available (https://github.com/transgenomicsosu/SNP-SELECT).
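
The graph construction and the k-domination condition described above can be illustrated with a small sketch. This is not the SNP-SELECT implementation: the abstract states that SNP-SELECT solves the minimum k-dominating set problem with the Gurobi solver, whereas the code below swaps in a simple greedy heuristic that returns a valid, but not necessarily minimum, k-dominating set. The function names, the toy similarity matrix, and the threshold of 0.5 are all assumptions made for illustration.

```python
from itertools import combinations


def build_similarity_graph(similarity, threshold):
    """Join two SNP indices by an edge when their similarity exceeds the threshold."""
    n = len(similarity)
    neighbors = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if similarity[i][j] > threshold:
            neighbors[i].add(j)
            neighbors[j].add(i)
    return neighbors


def greedy_k_dominating_set(neighbors, k):
    """Greedy heuristic (an assumption, not the exact solver used by SNP-SELECT).

    On return, every vertex outside the selected set has at least k
    neighbors inside it.
    """
    selected = set()
    # need[v]: how many more selected neighbors v requires while it stays unselected
    need = {v: k for v in neighbors}

    def is_unmet(v):
        return v not in selected and need[v] > 0

    while any(is_unmet(v) for v in neighbors):
        def gain(v):
            # Selecting v settles v itself and helps each neighbor that is still unmet.
            return need[v] + sum(1 for u in neighbors[v] if is_unmet(u))

        candidates = [v for v in neighbors if v not in selected]
        # Ties are broken in favor of the smallest vertex index.
        best = max(candidates, key=lambda v: (gain(v), -v))
        selected.add(best)
        for u in neighbors[best]:
            if need[u] > 0:
                need[u] -= 1
    return selected


# Toy data: SNPs 0-2 are mutually correlated, as are SNPs 3-4.
sim = [
    [1.0, 0.9, 0.9, 0.1, 0.1],
    [0.9, 1.0, 0.9, 0.1, 0.1],
    [0.9, 0.9, 1.0, 0.1, 0.1],
    [0.1, 0.1, 0.1, 1.0, 0.8],
    [0.1, 0.1, 0.1, 0.8, 1.0],
]
graph = build_similarity_graph(sim, 0.5)
selected = greedy_k_dominating_set(graph, 1)
print(sorted(selected))  # one representative per correlated group: [0, 3]
```

With k = 1 the sketch keeps one representative SNP per correlated cluster; raising k forces each discarded SNP to be backed by more retained neighbors, so SNPs with fewer than k neighbors are always retained.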


Gauthier J, Vincent AT, Charette SJ, Derome N. A brief history of bioinformatics. Brief Bioinform 2018;20:1981-1996. [DOI: 10.1093/bib/bby063] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 06/22/2018] [Indexed: 02/06/2023] Open

Ng S, Strunk T, Jiang P, Muk T, Sangild PT, Currie A. Precision Medicine for Neonatal Sepsis. Front Mol Biosci 2018;5:70. [PMID: 30094238 PMCID: PMC6070631 DOI: 10.3389/fmolb.2018.00070] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 07/06/2018] [Indexed: 11/24/2022] Open

Ham S, Kim TK, Hong H, Kim YS, Tang YP, Im HI. Big Data Analysis of Genes Associated With Neuropsychiatric Disorders in an Alzheimer's Disease Animal Model. Front Neurosci 2018;12:407. [PMID: 29962931 PMCID: PMC6013555 DOI: 10.3389/fnins.2018.00407] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 05/25/2018] [Indexed: 11/13/2022] Open

Abstract

Alzheimer's disease is a neurodegenerative disease characterized by the impairment of cognitive function and loss of memory, affecting millions of individuals worldwide. With the dramatic increase in the prevalence of Alzheimer's disease, it is expected to impose extensive public health and economic burden. However, this burden is particularly heavy on the caregivers of Alzheimer's disease patients eliciting neuropsychiatric symptoms that include mood swings, hallucinations, and depression. Interestingly, these neuropsychiatric symptoms are shared across symptoms of bipolar disorder, schizophrenia, and major depression disorder. Despite the similarities in symptomatology, comorbidities of Alzheimer's disease and these neuropsychiatric disorders have not been studied in the Alzheimer's disease model. Here, we explore the comprehensive changes in expression of genes that are associated with bipolar disorder, schizophrenia, and major depression disorder through the microarray of an Alzheimer's disease animal model, the forebrain specific PSEN double knockout mouse. To analyze the genes related to these three neuropsychiatric disorders within the scope of our microarray data, we selected 1207 of a total of 45,037 genes that satisfied our selection criteria. These genes were selected on the basis of 14 Gene Ontology terms significantly relevant to the three disorders, which were identified by previous research conducted by the Psychiatric Genomics Consortium. Our study revealed that the forebrain specific deletion of Alzheimer's disease genes can significantly alter neuropsychiatric disorder associated genes. Most importantly, most of these significantly altered genes were found to be involved with schizophrenia. Taken together, we suggest that the synaptic dysfunction by mutation of Alzheimer's disease genes can lead to the manifestation of not only memory loss and impairments in cognition, but also neuropsychiatric symptoms.


Mahmud M, Kaiser MS, Hussain A, Vassanelli S. Applications of Deep Learning and Reinforcement Learning to Biological Data. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018;29:2063-2079. [PMID: 29771663 DOI: 10.1109/tnnls.2018.2790388] [Citation(s) in RCA: 230] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]

Qiu X, Feng JR, Qiu J, Liu L, Xie Y, Zhang YP, Liu J, Zhao Q. ITGBL1 promotes migration, invasion and predicts a poor prognosis in colorectal cancer. Biomed Pharmacother 2018;104:172-180. [PMID: 29772438 DOI: 10.1016/j.biopha.2018.05.033] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 05/08/2018] [Accepted: 05/08/2018] [Indexed: 12/27/2022] Open

Hu T, Oksanen K, Zhang W, Randell E, Furey A, Sun G, Zhai G. An evolutionary learning and network approach to identifying key metabolites for osteoarthritis. PLoS Comput Biol 2018;14:e1005986. [PMID: 29494586 PMCID: PMC5849325 DOI: 10.1371/journal.pcbi.1005986] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 03/13/2018] [Accepted: 01/06/2018] [Indexed: 12/20/2022] Open

Abstract

Metabolomics studies use quantitative analyses of metabolites from body fluids or tissues in order to investigate a sequence of cellular processes and biological systems in response to genetic and environmental influences. This promises an immense potential for a better understanding of the pathogenesis of complex diseases. Most conventional metabolomics analysis methods examine one metabolite at a time and may overlook the synergistic effect of combining multiple metabolites. In this article, we proposed a new bioinformatics framework that infers the non-linear synergy among multiple metabolites using a symbolic model and subsequently identifies key metabolites using network analysis. Such a symbolic model is able to represent a complex non-linear relationship among a set of metabolites associated with osteoarthritis (OA) and is automatically learned using an evolutionary algorithm. Applied to the Newfoundland Osteoarthritis Study (NFOAS) dataset, our methodology was able to identify nine key metabolites including some known osteoarthritis-associated metabolites and some novel metabolic markers that have never been reported before. The results demonstrate the effectiveness of our methodology and more importantly, with further investigations, propose new hypotheses that can help better understand the OA disease.

Biomedical research has entered a new era where a large number of molecules and different components in biological systems can be quantitatively examined to investigate the causes of common human diseases. However, given the complexity of biological systems, those causes may not contribute to diseases individually but through interactions. The identification of those interactions, or the synergy of multiple factors, is a very challenging task due to the computational limitation, as well as the lack of effective methodologies for investigating multiple factors simultaneously. In this study, we proposed to model such an interaction effect through a self-learning algorithm using mechanisms inspired by natural evolution. Moreover, by constructing a synergy network using those evolved models, we were able to identify a set of interacting factors associated with a particular disease.


Hufsky F, Ibrahim B, Beer M, Deng L, Mercier PL, McMahon DP, Palmarini M, Thiel V, Marz M. Virologists-Heroes need weapons. PLoS Pathog 2018;14:e1006771. [PMID: 29420617 PMCID: PMC5805341 DOI: 10.1371/journal.ppat.1006771] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Shahjaman M, Kumar N, Ahmed MS, Begum A, Islam SMS, Mollah MNH. Robust Feature Selection Approach for Patient Classification using Gene Expression Data. Bioinformation 2017;13:327-332. [PMID: 29162964 PMCID: PMC5680713 DOI: 10.6026/97320630013327] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Revised: 09/11/2017] [Accepted: 09/12/2017] [Indexed: 11/23/2022] Open

Omae K, Komori O, Eguchi S. Quasi-linear score for capturing heterogeneous structure in biomarkers. BMC Bioinformatics 2017;18:308. [PMID: 28629325 PMCID: PMC5477283 DOI: 10.1186/s12859-017-1721-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 06/09/2017] [Indexed: 01/01/2023] Open

Salazar BM, Balczewski EA, Ung CY, Zhu S. Neuroblastoma, a Paradigm for Big Data Science in Pediatric Oncology. Int J Mol Sci 2016;18:E37. [PMID: 28035989 PMCID: PMC5297672 DOI: 10.3390/ijms18010037] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 12/14/2016] [Accepted: 12/17/2016] [Indexed: 12/13/2022] Open

Williamson ED. Life sciences today and tomorrow: emerging biotechnologies. Crit Rev Biotechnol 2016;37:553-565. [PMID: 27373876 DOI: 10.1080/07388551.2016.1201455] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Li TS, Bravo À, Furlong LI, Good BM, Su AI. A crowdsourcing workflow for extracting chemical-induced disease relations from free text. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016;2016:baw051. [PMID: 27087308 PMCID: PMC4834205 DOI: 10.1093/database/baw051] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2015] [Accepted: 03/17/2016] [Indexed: 01/05/2023]

Zeng T, Zhang W, Yu X, Liu X, Li M, Chen L. Big-data-based edge biomarkers: study on dynamical drug sensitivity and resistance in individuals. Brief Bioinform 2015;17:576-92. [DOI: 10.1093/bib/bbv078] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Indexed: 12/21/2022] Open

Ow GS, Kuznetsov VA. Multiple signatures of a disease in potential biomarker space: Getting the signatures consensus and identification of novel biomarkers. BMC Genomics 2015;16 Suppl 7:S2. [PMID: 26100469 PMCID: PMC4474413 DOI: 10.1186/1471-2164-16-s7-s2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Abstract

Background

The lack of consensus among reported gene signature subsets (GSSs) in multi-gene biomarker discovery studies is often a concern for researchers and clinicians. Subsequently, it discourages larger scale prospective studies, prevents the translation of such knowledge into a practical clinical setting and ultimately hinders the progress of the field of biomarker-based disease classification, prognosis and prediction.

Methods

We define all "gene identificators" (gIDs) as constituents of the entire potential disease biomarker space. For each gID in a GSS of interest ("tested GSS"/tGSS), our method counts the empirical frequency of gID co-occurrences/overlaps in other reference GSSs (rGSSs) and compares it with the expected frequency generated via implementation of a randomized sampling procedure. Comparison of the empirical frequency distribution (EFD) with the expected background frequency distribution (BFD) allows dichotomization of statistically novel (SN) and common (SC) gIDs within the tGSS.
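
The comparison of empirical and background co-occurrence frequencies can be sketched as follows. The abstract does not specify the randomized sampling procedure or the decision rule, so everything below is an assumption made for illustration: the background replaces each reference signature by a random gene set of the same size drawn from the whole biomarker space, and a gene is labeled statistically common (SC) when the background probability of reaching its observed co-occurrence count is below a hypothetical cutoff `alpha`, and statistically novel (SN) otherwise. All function names and the toy data are illustrative.

```python
import random
from collections import Counter


def empirical_counts(tested, references):
    """For each tested gene, count how many reference signatures contain it."""
    return {g: sum(g in ref for ref in references) for g in tested}


def background_histogram(universe, references, n_draws, rng):
    """Histogram of co-occurrence counts for a random gene when every reference
    signature is replaced by a random gene set of the same size (assumed null model)."""
    pool = sorted(universe)
    hist = Counter()
    for _ in range(n_draws):
        shuffled = [set(rng.sample(pool, len(ref))) for ref in references]
        gene = rng.choice(pool)
        hist[sum(gene in s for s in shuffled)] += 1
    return hist


def split_novel_common(tested, references, universe, alpha=0.05, n_draws=2000, seed=0):
    """Split tested genes into (novel, common) sets using an assumed tail-probability rule."""
    rng = random.Random(seed)
    hist = background_histogram(universe, references, n_draws, rng)
    novel, common = set(), set()
    for gene, count in empirical_counts(tested, references).items():
        # Fraction of background draws with a count at least as large as observed.
        tail = sum(n for c, n in hist.items() if c >= count) / n_draws
        (common if tail < alpha else novel).add(gene)
    return novel, common


# Toy data: 100 genes, five reference signatures of 10 genes that all contain "g0".
universe = {f"g{i}" for i in range(100)}
references = [{"g0"} | {f"g{10 * r + j}" for j in range(1, 10)} for r in range(5)]
novel, common = split_novel_common({"g0", "g99"}, references, universe)
print(sorted(novel), sorted(common))
```

In this toy setting "g0" appears in all five reference signatures, far more often than the random background allows, while "g99" appears in none.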

Results

We identify SN or SC biomarkers for tGSSs obtained from previous studies of high-grade serous ovarian cancer (HG-SOC) and breast cancer (BC). For each tGSS, the EFD of gID co-occurrences/overlaps with other rGSSs is characterized by a scale- and context-dependent Pareto-like frequency distribution function. Our results indicate that while there is little overlap between our tGSS and any individual rGSS, comparison of the EFD with the BFD suggests that beyond a confidence threshold, tested gIDs become more common in rGSSs than expected. This validates the use of our tGSS as individual or combined prognostic factors. Our method identifies SN and SC genes of a 36-gene prognostic signature that stratify HG-SOC patients into subgroups with low, intermediate or high risk of the disease outcome. Using 70 BC rGSSs, the method also predicted SN and SC BC prognostic genes from the tested obesity and IGF1 pathway GSSs.

Conclusions

Our method provides a strategy that identifies/predicts, within a tGSS of interest, gID subsets that are either SN or SC when compared to other rGSSs. Practically, our results suggest that there is a stronger association of the IGF1 signature genes with the 70 BC rGSSs than for the obesity-associated signature. Furthermore, both SC and SN genes in both signatures could be considered as prospective prognostic biomarkers of BCs that stratify the patients into low or high risk of cancer development.


Agarwal M, Adhil M, Talukder AK. Multi-omics Multi-scale Big Data Analytics for Cancer Genomics. BIG DATA ANALYTICS 2015. [DOI: 10.1007/978-3-319-27057-9_16] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open