Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Tripathi R, Sharma P, Chakraborty P, Varadwaj PK. Next-generation sequencing revolution through big data analytics. Frontiers in Life Science 2016. [DOI: 10.1080/21553769.2016.1178180] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 01/01/2023]


Cited by Other Article(s)

Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024;74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]

Abstract

Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. 
Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.


Niya B, Yaakoubi K, Beraich FZ, Arouch M, Meftah Kadmiri I. Current status and future developments of assessing microbiome composition and dynamics in anaerobic digestion systems using metagenomic approaches. Heliyon 2024;10:e28221. [PMID: 38560681 PMCID: PMC10979216 DOI: 10.1016/j.heliyon.2024.e28221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2023] [Revised: 03/12/2024] [Accepted: 03/13/2024] [Indexed: 04/04/2024] Open

Murillo Carrasco AG, Furuya TK, Uno M, Citrangulo Tortelli T, Chammas R. deltaXpress (ΔXpress): a tool for mapping differentially correlated genes using single-cell qPCR data. BMC Bioinformatics 2023;24:402. [PMID: 37884889 PMCID: PMC10605457 DOI: 10.1186/s12859-023-05541-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/16/2023] [Accepted: 10/20/2023] [Indexed: 10/28/2023] Open

Singla D, Yadav IS. GAAP: A GUI-based Genome Assembly and Annotation Package. Curr Genomics 2022;23:77-82. [PMID: 36778979 PMCID: PMC9878834 DOI: 10.2174/1389202923666220128155537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/12/2021] [Revised: 12/13/2021] [Accepted: 12/23/2021] [Indexed: 11/22/2022] Open

Sinha R, Pal RK, De RK. GenSeg and MR-GenSeg: A Novel Segmentation Algorithm and its Parallel MapReduce Based Approach for Identifying Genomic Regions With Copy Number Variations. IEEE/ACM Trans Comput Biol Bioinform 2022;19:443-454. [PMID: 32750860 DOI: 10.1109/tcbb.2020.3000661] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 06/11/2023]

Herrando‐Pérez S, Tobler R, Huber CD. smartsnp, an R package for fast multivariate analyses of big genomic data. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13684] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 01/04/2023]

Jain S, Saxena A, Hesarur S, Bhadhadhara K, Bharti N, Kasibhatla SM, Sonavane U, Joshi R. GenoVault: a cloud based genomics repository. BioData Min 2021;14:36. [PMID: 34325724 PMCID: PMC8319889 DOI: 10.1186/s13040-021-00268-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2020] [Accepted: 07/02/2021] [Indexed: 11/15/2022] Open

Abstract

GenoVault is a cloud-based repository for handling Next Generation Sequencing (NGS) data. It is developed using OpenStack-based private cloud with various services like keystone for authentication, cinder for block storage, neutron for networking and nova for managing compute instances for the Cloud. GenoVault uses object-based storage, which enables data to be stored as objects instead of files or blocks for faster retrieval from different distributed object nodes. Along with a web-based interface, a JavaFX-based desktop client has also been developed to meet the requirements of large file uploads that are usually seen in NGS datasets. Users can store files in their respective object-based storage areas and the metadata provided by the user during file uploads is used for querying the database. GenoVault repository is designed taking into account future needs and hence can scale both vertically and horizontally using OpenStack-based cloud features. Users have an option to make the data shareable to the public or restrict the access as private. Data security is ensured as every container is a separate entity in object-based storage architecture which is also supported by Secure File Transfer Protocol (SFTP) for data upload and download. The data is uploaded by the user in individual containers that include raw read files (fastq), processed alignment files (bam, sam, bed) and the output of variation detection (vcf). GenoVault architecture allows verification of the data in terms of integrity and authentication before making it available to collaborators as per the user’s permissions. GenoVault is useful for maintaining the organization-wide NGS data generated in various labs which is not yet published and submitted to public repositories like NCBI. GenoVault also provides support to share NGS data among the collaborating institutions. GenoVault can thus manage vast volumes of NGS data on any OpenStack-based private cloud.


Prasad A, Bhargava H, Gupta A, Shukla N, Rajagopal S, Gupta S, Sharma A, Valadi J, Nigam V, Suravajhala P. Next Generation Sequencing. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/24/2022] Open

Tripathi P, Singh J, Lal JA, Tripathi V. Next-Generation Sequencing: An Emerging Tool for Drug Designing. Curr Pharm Des 2020;25:3350-3357. [PMID: 31544713 DOI: 10.2174/1381612825666190911155508] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 08/06/2019] [Accepted: 09/05/2019] [Indexed: 12/14/2022]

Yukselen O, Turkyilmaz O, Ozturk AR, Garber M, Kucukural A. DolphinNext: a distributed data processing platform for high throughput genomics. BMC Genomics 2020;21:310. [PMID: 32306927 PMCID: PMC7168977 DOI: 10.1186/s12864-020-6714-x] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Received: 10/22/2019] [Accepted: 04/01/2020] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

The emergence of high throughput technologies that produce vast amounts of genomic data, such as next-generation sequencing (NGS), is transforming biological research. The dramatic increase in the volume of data, the variety and continuous change of data processing tools, algorithms and databases make analysis the main bottleneck for scientific discovery. The processing of high throughput datasets typically involves many different computational programs, each of which performs a specific step in a pipeline. Given the wide range of applications and organizational infrastructures, there is a great need for highly parallel, flexible, portable, and reproducible data processing frameworks. Several platforms currently exist for the design and execution of complex pipelines. Unfortunately, current platforms lack the necessary combination of parallelism, portability, flexibility and/or reproducibility that are required by the current research environment. To address these shortcomings, workflow frameworks that provide a platform to develop and share portable pipelines have recently arisen. We complement these new platforms by providing a graphical user interface to create, maintain, and execute complex pipelines. Such a platform will simplify robust and reproducible workflow creation for non-technical users as well as provide a robust platform to maintain pipelines for large organizations.

RESULTS

To simplify development, maintenance, and execution of complex pipelines we created DolphinNext. DolphinNext facilitates building and deployment of complex pipelines using a modular approach implemented in a graphical interface that relies on the powerful Nextflow workflow framework by providing:

1. A drag and drop user interface that visualizes pipelines and allows users to create pipelines without familiarity in underlying programming languages.
2. Modules to execute and monitor pipelines in distributed computing environments such as high-performance clusters and/or cloud.
3. Reproducible pipelines with version tracking and stand-alone versions that can be run independently.
4. Modular process design with process revisioning support to increase reusability and pipeline development efficiency.
5. Pipeline sharing with GitHub and automated testing.
6. Extensive reports with R-markdown and shiny support for interactive data visualization and analysis.

CONCLUSION

DolphinNext is a flexible, intuitive, web-based data processing and analysis platform that enables creating, deploying, sharing, and executing complex Nextflow pipelines with extensive revisioning and interactive reporting to enhance reproducible results.


Tripathi R, Aier I, Chakraborty P, Varadwaj PK. Unravelling the role of long non-coding RNA - LINC01087 in breast cancer. Noncoding RNA Res 2019;5:1-10. [PMID: 31989062 PMCID: PMC6965516 DOI: 10.1016/j.ncrna.2019.12.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 10/11/2019] [Revised: 12/17/2019] [Accepted: 12/17/2019] [Indexed: 02/09/2023] Open

Calarco L, Barratt J, Ellis J. Detecting sequence variants in clinically important protozoan parasites. Int J Parasitol 2019;50:1-18. [PMID: 31857072 DOI: 10.1016/j.ijpara.2019.10.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 06/19/2019] [Revised: 09/29/2019] [Accepted: 10/01/2019] [Indexed: 02/06/2023]

FastqCleaner: an interactive Bioconductor application for quality-control, filtering and trimming of FASTQ files. BMC Bioinformatics 2019;20:361. [PMID: 31253077 PMCID: PMC6599294 DOI: 10.1186/s12859-019-2961-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 09/05/2018] [Accepted: 06/20/2019] [Indexed: 11/10/2022] Open

Leivada E, D’Alessandro R, Grohmann KK. Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges. Front Psychol 2019;10:313. [PMID: 30837922 PMCID: PMC6382742 DOI: 10.3389/fpsyg.2019.00313] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 10/05/2018] [Accepted: 02/01/2019] [Indexed: 11/17/2022] Open

Workflow Development for the Functional Characterization of ncRNAs. Methods Mol Biol 2019;1912:111-132. [PMID: 30635892 DOI: 10.1007/978-1-4939-8982-9_5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Indexed: 12/16/2022]

Gao Y, Yurkovich JT, Seo SW, Kabimoldayev I, Dräger A, Chen K, Sastry AV, Fang X, Mih N, Yang L, Eichner J, Cho BK, Kim D, Palsson BO. Systematic discovery of uncharacterized transcription factors in Escherichia coli K-12 MG1655. Nucleic Acids Res 2018;46:10682-10696. [PMID: 30137486 PMCID: PMC6237786 DOI: 10.1093/nar/gky752] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Received: 04/13/2018] [Revised: 07/11/2018] [Accepted: 08/08/2018] [Indexed: 02/03/2023] Open

Affiliation(s)

Ye Gao Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093, USA Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
James T Yurkovich Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
Sang Woo Seo School of Chemical and Biological Engineering, Seoul National University, Seoul, Republic of Korea
Ilyas Kabimoldayev Department of Genetic Engineering and Graduate School of Biotechnology, College of Life Sciences, Kyung Hee University, Yongin, Republic of Korea
Andreas Dräger Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Center for Bioinformatics Tübingen (ZBIT), 72076 Tübingen, Germany Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany
Ke Chen Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
Anand V Sastry Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
Xin Fang Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
Nathan Mih Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
Laurence Yang Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
Johannes Eichner Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Center for Bioinformatics Tübingen (ZBIT), 72076 Tübingen, Germany
Byung-Kwan Cho Novo Nordisk Foundation Center for Biosustainability, 2800 Kongens Lyngby, Denmark Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
Donghyuk Kim Department of Genetic Engineering and Graduate School of Biotechnology, College of Life Sciences, Kyung Hee University, Yongin, Republic of Korea School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Republic of Korea School of Biological Sciences, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Republic of Korea
Bernhard O Palsson Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA Novo Nordisk Foundation Center for Biosustainability, 2800 Kongens Lyngby, Denmark Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA


Demography in the Big Data Revolution: Changing the Culture to Forge New Frontiers. Popul Res Policy Rev 2018. [DOI: 10.1007/s11113-018-9464-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 10/17/2022]

Samaddar S, Sinha R, De RK. A Model for Distributed Processing and Analyses of NGS Data under Map-Reduce Paradigm. IEEE/ACM Trans Comput Biol Bioinform 2018;16:827-840. [PMID: 29993814 DOI: 10.1109/tcbb.2018.2816022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 06/08/2023]

Gut Dysbiosis and Muscle Aging: Searching for Novel Targets against Sarcopenia. Mediators Inflamm 2018;2018:7026198. [PMID: 29686533 PMCID: PMC5893006 DOI: 10.1155/2018/7026198] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Received: 08/03/2017] [Revised: 11/28/2017] [Accepted: 12/05/2017] [Indexed: 12/12/2022] Open

Tripathi R, Chakraborty P, Varadwaj PK. Unraveling long non-coding RNAs through analysis of high-throughput RNA-sequencing data. Noncoding RNA Res 2017;2:111-118. [PMID: 30159428 PMCID: PMC6096414 DOI: 10.1016/j.ncrna.2017.06.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Received: 06/01/2017] [Revised: 06/19/2017] [Accepted: 06/21/2017] [Indexed: 01/01/2023] Open

Tripathi R, Patel S, Kumari V, Chakraborty P, Varadwaj PK. DeepLNC, a long non-coding RNA prediction tool using deep neural network. Netw Model Anal Health Inform Bioinforma 2016. [DOI: 10.1007/s13721-016-0129-2] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Indexed: 11/28/2022]