Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kim GB, Gao Y, Palsson BO, Lee SY. DeepTFactor: A deep learning-based tool for the prediction of transcription factors. Proc Natl Acad Sci U S A 2021;118:e2021171118. [PMID: 33372147 DOI: 10.1073/pnas.2021171118] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

For:	Kim GB, Gao Y, Palsson BO, Lee SY. DeepTFactor: A deep learning-based tool for the prediction of transcription factors. Proc Natl Acad Sci U S A 2021;118:e2021171118. [PMID: 33372147 DOI: 10.1073/pnas.2021171118] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Number

Cited by Other Article(s)

Lu Z, Xiao X, Zheng Q, Wang X, Xu L. Assessing next-generation sequencing-based computational methods for predicting transcriptional regulators with query gene sets. Brief Bioinform 2024;25:bbae366. [PMID: 39082650 PMCID: PMC11289684 DOI: 10.1093/bib/bbae366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 06/21/2024] [Accepted: 07/18/2024] [Indexed: 08/03/2024] Open

Patiyal S, Tiwari P, Ghai M, Dhapola A, Dhall A, Raghava GPS. A hybrid approach for predicting transcription factors. FRONTIERS IN BIOINFORMATICS 2024;4:1425419. [PMID: 39119181 PMCID: PMC11306938 DOI: 10.3389/fbinf.2024.1425419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 07/03/2024] [Indexed: 08/10/2024] Open

Abstract

Transcription factors are essential DNA-binding proteins that regulate the transcription rate of several genes and control the expression of genes inside a cell. The prediction of transcription factors with high precision is important for understanding biological processes such as cell differentiation, intracellular signaling, and cell-cycle control. In this study, we developed a hybrid method that combines alignment-based and alignment-free methods for predicting transcription factors with higher accuracy. All models have been trained, tested, and evaluated on a large dataset that contains 19,406 transcription factors and 523,560 non-transcription factor protein sequences. To avoid biases in evaluation, the datasets were divided into training and validation/independent datasets, where 80% of the data was used for training, and the remaining 20% was used for external validation. In the case of alignment-free methods, models were developed using machine learning techniques and the composition-based features of a protein. Our best alignment-free model obtained an AUC of 0.97 on an independent dataset. In the case of the alignment-based method, we used BLAST at different cut-offs to predict the transcription factors. Although the alignment-based method demonstrated excellent performance, it was unable to cover all transcription factors due to instances of no hits. To combine the strengths of both methods, we developed a hybrid method that combines alignment-free and alignment-based methods. In the hybrid method, we added the scores of the alignment-free and alignment-based methods and achieved a maximum AUC of 0.99 on the independent dataset. The method proposed in this study performs better than existing methods. We incorporated the best models in the webserver/Python Package Index/standalone package of "TransFacPred" (https://webs.iiitd.edu.in/raghava/transfacpred).

Collapse

McReynolds E, Elshahed MS, Youssef NH. An ecological-evolutionary perspective on the genomic diversity and habitat preferences of the Acidobacteriota. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.05.601421. [PMID: 39005473 PMCID: PMC11245096 DOI: 10.1101/2024.07.05.601421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]

Joshi SHN, Jenkins C, Ulaeto D, Gorochowski TE. Accelerating Genetic Sensor Development, Scale-up, and Deployment Using Synthetic Biology. BIODESIGN RESEARCH 2024;6:0037. [PMID: 38919711 PMCID: PMC11197468 DOI: 10.34133/bdr.0037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 04/23/2024] [Indexed: 06/27/2024] Open

Nuhamunada M, Mohite OS, Phaneuf P, Palsson B, Weber T. BGCFlow: systematic pangenome workflow for the analysis of biosynthetic gene clusters across large genomic datasets. Nucleic Acids Res 2024;52:5478-5495. [PMID: 38686794 PMCID: PMC11162802 DOI: 10.1093/nar/gkae314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 03/22/2024] [Accepted: 04/11/2024] [Indexed: 05/02/2024] Open

Chen L, Li C, Li B, Zhou X, Bai Y, Zou X, Zhou Z, He Q, Chen B, Wang M, Xue Y, Jiang Z, Feng J, Zhou T, Liu Z, Xu P. Evolutionary divergence of subgenomes in common carp provides insights into speciation and allopolyploid success. FUNDAMENTAL RESEARCH 2024;4:589-602. [PMID: 38933191 PMCID: PMC11197550 DOI: 10.1016/j.fmre.2023.06.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 06/29/2023] [Accepted: 06/30/2023] [Indexed: 06/28/2024] Open

Affiliation(s)

Lin Chen State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Chengyu Li State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Bijun Li State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Xiaofan Zhou Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou 510642, China
Yulin Bai State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Xiaoqing Zou State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Zhixiong Zhou State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Qian He State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Baohua Chen State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Mei Wang State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Yaguo Xue College of Fisheries, Henan Normal University, Xinxiang 453007, China
Zhou Jiang State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Jianxin Feng Henan Academy of Fishery Science, Zhengzhou 450044, China
Tao Zhou State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China
Zhanjiang Liu Department of Biology, College of Arts and Sciences, Syracuse University, Syracuse 13244, USA
Peng Xu State Key Laboratory of Mariculture Breeding, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China

Collapse

Ko YJ, Lee ME, Cho BH, Kim M, Hyeon JE, Han JH, Han SO. Bioproduction of porphyrins, phycobilins, and their proteins using microbial cell factories: engineering, metabolic regulations, challenges, and perspectives. Crit Rev Biotechnol 2024;44:373-387. [PMID: 36775664 DOI: 10.1080/07388551.2023.2168512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 11/21/2022] [Accepted: 01/03/2023] [Indexed: 02/14/2023]

Ledesma-Dominguez L, Carbajal-Degante E, Moreno-Hagelsieb G, Perez-Rueda E. DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes. Sci Rep 2024;14:9155. [PMID: 38644393 DOI: 10.1038/s41598-024-59487-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 04/11/2024] [Indexed: 04/23/2024] Open

Pandey U, Behara SM, Sharma S, Patil RS, Nambiar S, Koner D, Bhukya H. DeePNAP: A Deep Learning Method to Predict Protein-Nucleic Acid Binding Affinity from Their Sequences. J Chem Inf Model 2024;64:1806-1815. [PMID: 38458968 DOI: 10.1021/acs.jcim.3c01151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/10/2024]

Martinez GS, Perez-Rueda E, Kumar A, Dutt M, Maya CR, Ledesma-Dominguez L, Casa PL, Kumar A, de Avila e Silva S, Kelvin DJ. CDBProm: the Comprehensive Directory of Bacterial Promoters. NAR Genom Bioinform 2024;6:lqae018. [PMID: 38385146 PMCID: PMC10880602 DOI: 10.1093/nargab/lqae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/12/2024] [Accepted: 01/29/2024] [Indexed: 02/23/2024] Open

Affiliation(s)

Gustavo Sganzerla Martinez Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Ernesto Perez-Rueda Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autonóma de México, Unidad Académica del Estado de Yucatán, Mérida 97302, Yucatán, Mexico
Anuj Kumar Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Mansi Dutt Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada
Cinthia Rodríguez Maya Facultad de Ciencias e Ingeniería, Universidad Nacional Autonoma de Mexico, Mexico City 04510, Mexico
Leonardo Ledesma-Dominguez Instituto de Investigaciones en Matematicas Aplicadas y en Sistemas, Universidad Nacional Autonoma de Mexico, Mexico City 04510, Mexico
Pedro Lenz Casa Biotechnology Institute, Universidade de Caxias do Sul, Caxias do Sul, Rio Grande do Sul 95070-560, Brazil
Aditya Kumar Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
Scheila de Avila e Silva Biotechnology Institute, Universidade de Caxias do Sul, Caxias do Sul, Rio Grande do Sul 95070-560, Brazil
David J Kelvin Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia B3H 4H7, Canada Pediatrics, Izaak Walton Killam (IWK) Health Center. Canadian Center for Vaccinology (CCfV), Halifax, Nova Scotia B3H 4H7, Canada BioForge Canada Limited, Halifax, Nova Scotia B3N 3B9, Canada

Collapse

Brungardt J, Alarcon Y, Shiller J, Young C, Monteros MJ, Randall JJ, Bock CH. Transcriptome profile of pecan scab resistant and susceptible trees from a pecan provenance collection. BMC Genomics 2024;25:180. [PMID: 38355402 PMCID: PMC10868059 DOI: 10.1186/s12864-024-10010-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 01/12/2024] [Indexed: 02/16/2024] Open

Abstract

Pecan scab is a devastating disease that causes damage to pecan (Carya illinoinensis (Wangenh.) K. Koch) fruit and leaves. The disease is caused by the fungus Venturia effusa (G. Winter) and the main management practice for controlling the disease is by application of fungicides at 2-to-3-week intervals throughout the growing season. Besides disease-related yield loss, application of fungicides can result in considerable cost and increases the likelihood of fungicide resistance developing in the pathogen. Resistant cultivars are available for pecan growers; although, in several cases resistance has been overcome as the pathogen adapts to infect resistant hosts. Despite the importance of host resistance in scab management, there is little information regarding the molecular basis of genetic resistance to pecan scab.The purpose of this study was to elucidate mechanisms of natural pecan scab resistance by analyzing transcripts that are differentially expressed in pecan leaf samples from scab resistant and susceptible trees. The leaf samples were collected from trees in a provenance collection orchard that represents the natural range of pecan in the US and Mexico. Trees in the orchard have been exposed to natural scab infections since planting in 1989, and scab ratings were collected over three seasons. Based on this data, ten susceptible trees and ten resistant trees were selected for analysis. RNA-seq data was collected and analyzed for diseased and non-diseased parts of susceptible trees as well as for resistant trees. A total of 313 genes were found to be differentially expressed when comparing resistant and susceptible trees without disease. For susceptible samples showing scab symptoms, 1,454 genes were identified as differentially expressed compared to non-diseased susceptible samples. Many genes involved in pathogen recognition, defense responses, and signal transduction were up-regulated in diseased samples of susceptible trees, whereas differentially expressed genes in pecan scab resistant samples were generally down-regulated compared to non-diseased susceptible samples.Our results provide the first account of candidate genes involved in resistance/susceptibility to pecan scab under natural conditions in a pecan orchard. This information can be used to aid pecan breeding programs and development of biotechnology-based approaches for generating pecan cultivars with more durable scab resistance.

Collapse

Zhang J, Li F, Liu D, Liu Q, Song H. Engineering extracellular electron transfer pathways of electroactive microorganisms by synthetic biology for energy and chemicals production. Chem Soc Rev 2024;53:1375-1446. [PMID: 38117181 DOI: 10.1039/d3cs00537b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]

Abstract

The excessive consumption of fossil fuels causes massive emission of CO2, leading to climate deterioration and environmental pollution. The development of substitutes and sustainable energy sources to replace fossil fuels has become a worldwide priority. Bio-electrochemical systems (BESs), employing redox reactions of electroactive microorganisms (EAMs) on electrodes to achieve a meritorious combination of biocatalysis and electrocatalysis, provide a green and sustainable alternative approach for bioremediation, CO2 fixation, and energy and chemicals production. EAMs, including exoelectrogens and electrotrophs, perform extracellular electron transfer (EET) (i.e., outward and inward EET), respectively, to exchange energy with the environment, whose rate determines the efficiency and performance of BESs. Therefore, we review the synthetic biology strategies developed in the last decade for engineering EAMs to enhance the EET rate in cell-electrode interfaces for facilitating the production of electricity energy and value-added chemicals, which include (1) progress in genetic manipulation and editing tools to achieve the efficient regulation of gene expression, knockout, and knockdown of EAMs; (2) synthetic biological engineering strategies to enhance the outward EET of exoelectrogens to anodes for electricity power production and anodic electro-fermentation (AEF) for chemicals production, including (i) broadening and strengthening substrate utilization, (ii) increasing the intracellular releasable reducing equivalents, (iii) optimizing c-type cytochrome (c-Cyts) expression and maturation, (iv) enhancing conductive nanowire biosynthesis and modification, (v) promoting electron shuttle biosynthesis, secretion, and immobilization, (vi) engineering global regulators to promote EET rate, (vii) facilitating biofilm formation, and (viii) constructing cell-material hybrids; (3) the mechanisms of inward EET, CO2 fixation pathway, and engineering strategies for improving the inward EET of electrotrophic cells for CO2 reduction and chemical production, including (i) programming metabolic pathways of electrotrophs, (ii) rewiring bioelectrical circuits for enhancing inward EET, and (iii) constructing microbial (photo)electrosynthesis by cell-material hybridization; (4) perspectives on future challenges and opportunities for engineering EET to develop highly efficient BESs for sustainable energy and chemical production. We expect that this review will provide a theoretical basis for the future development of BESs in energy harvesting, CO2 fixation, and chemical synthesis.

Collapse

Zhang J, Basu S, Kurgan L. HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins. Nucleic Acids Res 2024;52:e10. [PMID: 38048333 PMCID: PMC10810184 DOI: 10.1093/nar/gkad1131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 11/10/2023] [Indexed: 12/06/2023] Open

Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024;2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]

Al-Tohamy A, Grove A. Targeting bacterial transcription factors for infection control: opportunities and challenges. Transcription 2023:1-28. [PMID: 38126125 DOI: 10.1080/21541264.2023.2293523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 12/07/2023] [Indexed: 12/23/2023] Open

Jin P, Zhu B, Jia Y, Zhang Y, Wang W, Shen Y, Zhong Y, Zheng Y, Wang Y, Tong Y, Zhang W, Li S. Single-cell transcriptomics reveals the brain evolution of web-building spiders. Nat Ecol Evol 2023;7:2125-2142. [PMID: 37919396 PMCID: PMC10697844 DOI: 10.1038/s41559-023-02238-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 09/29/2023] [Indexed: 11/04/2023]

Affiliation(s)

Pengyu Jin Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
Bingyue Zhu Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yinjun Jia School of Life Sciences, IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing, China Tsinghua-Peking Center for Life Sciences, Beijing, China
Yiming Zhang Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Wei Wang Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China Guangxi Normal University, Guilin, China
Yunxiao Shen Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yu Zhong Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yami Zheng Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yang Wang Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yan Tong Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Wei Zhang School of Life Sciences, IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing, China Tsinghua-Peking Center for Life Sciences, Beijing, China
Shuqiang Li Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.

Collapse

Kim GB, Kim JY, Lee JA, Norsigian CJ, Palsson BO, Lee SY. Functional annotation of enzyme-encoding genes using deep learning with transformer layers. Nat Commun 2023;14:7370. [PMID: 37963869 PMCID: PMC10645960 DOI: 10.1038/s41467-023-43216-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 11/03/2023] [Indexed: 11/16/2023] Open

Affiliation(s)

Gi Bae Kim Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Ji Yeon Kim Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Jong An Lee Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Charles J Norsigian Division of Biological Sciences, University of California San Diego, La Jolla, CA, 92093, USA Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
Bernhard O Palsson Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, 92093, USA Novo Nordisk Foundation Center for Biosustainability, 2800, Kongens Lyngby, Denmark
Sang Yup Lee Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea. Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea. KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea. BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, Daejeon, 34141, Republic of Korea.

Collapse

Lamoureux CR, Decker KT, Sastry AV, Rychel K, Gao Y, McConn J, Zielinski D, Palsson BO. A multi-scale expression and regulation knowledge base for Escherichia coli. Nucleic Acids Res 2023;51:10176-10193. [PMID: 37713610 PMCID: PMC10602906 DOI: 10.1093/nar/gkad750] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/02/2023] [Accepted: 09/05/2023] [Indexed: 09/17/2023] Open

Glasscock CJ, Pecoraro R, McHugh R, Doyle LA, Chen W, Boivin O, Lonnquist B, Na E, Politanska Y, Haddox HK, Cox D, Norn C, Coventry B, Goreshnik I, Vafeados D, Lee GR, Gordan R, Stoddard BL, DiMaio F, Baker D. Computational design of sequence-specific DNA-binding proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.20.558720. [PMID: 37790440 PMCID: PMC10542524 DOI: 10.1101/2023.09.20.558720] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]

Affiliation(s)

Cameron J. Glasscock Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Robert Pecoraro Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA Department of Physics, University of Washington, Seattle, WA, USA
Ryan McHugh Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Lindsey A. Doyle Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
Wei Chen Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Olivier Boivin Program in Genetics and Genomic, Duke University, Durham, NC, USA Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA
Beau Lonnquist Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA Department of Bioengineering, University of Washington, Seattle, WA, USA
Emily Na Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Yuliya Politanska Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Hugh K. Haddox Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
David Cox Department of Biochemistry, Stanford University School of Medicine, Palo Alto, CA USA Department of Medicine, Division of Hematology, Stanford University, Stanford, CA, USA
Christoffer Norn Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA BioInnovation Institute, DK2200 Copenhagen N, Denmark
Brian Coventry Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Inna Goreshnik Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Dionne Vafeados Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Gyu Rie Lee Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA USA
Raluca Gordan Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Biostatistics and Bioinformatics, Department of Computer Science, Department of Molecular Genetics and Microbiology, Duke University, Durham, NC, USA
Barry L. Stoddard Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
Frank DiMaio Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
David Baker Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA BioInnovation Institute, DK2200 Copenhagen N, Denmark

Collapse

Chen RJ, Wang JJ, Williamson DFK, Chen TY, Lipkova J, Lu MY, Sahai S, Mahmood F. Algorithmic fairness in artificial intelligence for medicine and healthcare. Nat Biomed Eng 2023;7:719-742. [PMID: 37380750 PMCID: PMC10632090 DOI: 10.1038/s41551-023-01056-8] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 04/13/2023] [Indexed: 06/30/2023]

Affiliation(s)

Richard J Chen Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA Cancer Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
Judy J Wang Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Boston University School of Medicine, Boston, MA, USA
Drew F K Williamson Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA
Tiffany Y Chen Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA
Jana Lipkova Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA
Ming Y Lu Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA Cancer Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
Sharifa Sahai Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Faisal Mahmood Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA. Cancer Program, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA, USA. Cancer Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA. Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA. Harvard Data Science Initiative, Harvard University, Cambridge, MA, USA.

Collapse

Barbero-Aparicio JA, Olivares-Gil A, Díez-Pastor JF, García-Osorio C. Deep learning and support vector machines for transcription start site identification. PeerJ Comput Sci 2023;9:e1340. [PMID: 37346545 PMCID: PMC10280436 DOI: 10.7717/peerj-cs.1340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 03/21/2023] [Indexed: 06/23/2023]

Abstract

Recognizing transcription start sites is key to gene identification. Several approaches have been employed in related problems such as detecting translation initiation sites or promoters, many of the most recent ones based on machine learning. Deep learning methods have been proven to be exceptionally effective for this task, but their use in transcription start site identification has not yet been explored in depth. Also, the very few existing works do not compare their methods to support vector machines (SVMs), the most established technique in this area of study, nor provide the curated dataset used in the study. The reduced amount of published papers in this specific problem could be explained by this lack of datasets. Given that both support vector machines and deep neural networks have been applied in related problems with remarkable results, we compared their performance in transcription start site predictions, concluding that SVMs are computationally much slower, and deep learning methods, specially long short-term memory neural networks (LSTMs), are best suited to work with sequences than SVMs. For such a purpose, we used the reference human genome GRCh38. Additionally, we studied two different aspects related to data processing: the proper way to generate training examples and the imbalanced nature of the data. Furthermore, the generalization performance of the models studied was also tested using the mouse genome, where the LSTM neural network stood out from the rest of the algorithms. To sum up, this article provides an analysis of the best architecture choices in transcription start site identification, as well as a method to generate transcription start site datasets including negative instances on any species available in Ensembl. We found that deep learning methods are better suited than SVMs to solve this problem, being more efficient and better adapted to long sequences and large amounts of data. We also create a transcription start site (TSS) dataset large enough to be used in deep learning experiments.

Collapse

Cho C, Lee D, Jeong D, Kim S, Kim MK, Srinivasan S. Characterization of radiation-resistance mechanism in Spirosoma montaniterrae DY10^T in terms of transcriptional regulatory system. Sci Rep 2023;13:4739. [PMID: 36959250 PMCID: PMC10036542 DOI: 10.1038/s41598-023-31509-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 03/13/2023] [Indexed: 03/25/2023] Open

Genomic Features Predict Bacterial Life History Strategies in Soil, as Identified by Metagenomic Stable Isotope Probing. mBio 2023;14:e0358422. [PMID: 36877031 PMCID: PMC10128055 DOI: 10.1128/mbio.03584-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023] Open

Flores-Díaz A, Escoto-Sandoval C, Cervantes-Hernández F, Ordaz-Ortiz JJ, Hayano-Kanashiro C, Reyes-Valdés H, Garcés-Claver A, Ochoa-Alejo N, Martínez O. Gene Functional Networks from Time Expression Profiles: A Constructive Approach Demonstrated in Chili Pepper (Capsicum annuum L.). PLANTS (BASEL, SWITZERLAND) 2023;12:1148. [PMID: 36904008 PMCID: PMC10005043 DOI: 10.3390/plants12051148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/20/2023] [Accepted: 02/27/2023] [Indexed: 06/18/2023]

Abstract

Gene co-expression networks are powerful tools to understand functional interactions between genes. However, large co-expression networks are difficult to interpret and do not guarantee that the relations found will be true for different genotypes. Statistically verified time expression profiles give information about significant changes in expressions through time, and genes with highly correlated time expression profiles, which are annotated in the same biological process, are likely to be functionally connected. A method to obtain robust networks of functionally related genes will be useful to understand the complexity of the transcriptome, leading to biologically relevant insights. We present an algorithm to construct gene functional networks for genes annotated in a given biological process or other aspects of interest. We assume that there are genome-wide time expression profiles for a set of representative genotypes of the species of interest. The method is based on the correlation of time expression profiles, bound by a set of thresholds that assure both, a given false discovery rate, and the discard of correlation outliers. The novelty of the method consists in that a gene expression relation must be repeatedly found in a given set of independent genotypes to be considered valid. This automatically discards relations particular to specific genotypes, assuring a network robustness, which can be set a priori. Additionally, we present an algorithm to find transcription factors candidates for regulating hub genes within a network. The algorithms are demonstrated with data from a large experiment studying gene expression during the development of the fruit in a diverse set of chili pepper genotypes. The algorithm is implemented and demonstrated in a new version of the publicly available R package "Salsa" (version 1.0).

Collapse

Du Z, Huang T, Uversky VN, Li J. Predicting TF Proteins by Incorporating Evolution Information Through PSSM. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:1319-1326. [PMID: 35981062 DOI: 10.1109/tcbb.2022.3199758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Tellechea-Luzardo J, Martín Lázaro H, Moreno López R, Carbonell P. Sensbio: an online server for biosensor design. BMC Bioinformatics 2023;24:71. [PMID: 36855083 PMCID: PMC9972687 DOI: 10.1186/s12859-023-05201-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 02/22/2023] [Indexed: 03/02/2023] Open

Tellechea-Luzardo J, Stiebritz MT, Carbonell P. Transcription factor-based biosensors for screening and dynamic regulation. Front Bioeng Biotechnol 2023;11:1118702. [PMID: 36814719 PMCID: PMC9939652 DOI: 10.3389/fbioe.2023.1118702] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 01/26/2023] [Indexed: 02/09/2023] Open

Sieow BFL, De Sotto R, Seet ZRD, Hwang IY, Chang MW. Synthetic Biology Meets Machine Learning. Methods Mol Biol 2023;2553:21-39. [PMID: 36227537 DOI: 10.1007/978-1-0716-2617-7_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Affiliation(s)

Brendan Fu-Long Sieow NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore NUS Graduate School for Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, Singapore
Ryan De Sotto NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Zhi Ren Darren Seet NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
In Young Hwang NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Matthew Wook Chang NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore. Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore. Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.

Collapse

Volk MJ, Tran VG, Tan SI, Mishra S, Fatma Z, Boob A, Li H, Xue P, Martin TA, Zhao H. Metabolic Engineering: Methodologies and Applications. Chem Rev 2022;123:5521-5570. [PMID: 36584306 DOI: 10.1021/acs.chemrev.2c00403] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Affiliation(s)

Michael J Volk Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Vinh G Tran Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Shih-I Tan Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical Engineering, National Cheng Kung University, Tainan 70101, Taiwan
Shekhar Mishra Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Zia Fatma Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Aashutosh Boob Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Hongxiang Li Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Pu Xue Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Teresa A Martin Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
Huimin Zhao Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States

Collapse

Singh D, Roy J. A large-scale benchmark study of tools for the classification of protein-coding and non-coding RNAs. Nucleic Acids Res 2022;50:12094-12111. [PMID: 36420898 PMCID: PMC9757047 DOI: 10.1093/nar/gkac1092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 10/22/2022] [Accepted: 10/28/2022] [Indexed: 11/27/2022] Open

Qin R, Mahal LK, Bojar D. Deep learning explains the biology of branched glycans from single-cell sequencing data. iScience 2022;25:105163. [PMID: 36217547 PMCID: PMC9547197 DOI: 10.1016/j.isci.2022.105163] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 09/06/2022] [Accepted: 09/16/2022] [Indexed: 11/03/2022] Open

Dai Z, Zhang Z, Zhu L, Zhu Z, Jiang L. Complete Genome Sequencing Analysis of Deinococcus wulumuqiensis R12, an Extremely Radiation-Resistant Strain. Curr Microbiol 2022;79:292. [PMID: 35972568 DOI: 10.1007/s00284-022-02984-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 07/20/2022] [Indexed: 11/03/2022]

Liu Q, Wang F, Shuai Y, Huang L, Zhang X. Integrated Analysis of Single-Molecule Real-Time Sequencing and Next-Generation Sequencing Eveals Insights into Drought Tolerance Mechanism of Lolium multiflorum. Int J Mol Sci 2022;23:ijms23147921. [PMID: 35887272 PMCID: PMC9320196 DOI: 10.3390/ijms23147921] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/13/2022] [Accepted: 07/14/2022] [Indexed: 02/01/2023] Open

Fu X, Bates PA. Application of deep learning methods: From molecular modelling to patient classification. Exp Cell Res 2022;418:113278. [PMID: 35810775 DOI: 10.1016/j.yexcr.2022.113278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 06/16/2022] [Accepted: 07/05/2022] [Indexed: 11/28/2022]

Wang L, Zhang J, Wang D, Song C. Membrane contact probability: An essential and predictive character for the structural and functional studies of membrane proteins. PLoS Comput Biol 2022;18:e1009972. [PMID: 35353812 PMCID: PMC9000120 DOI: 10.1371/journal.pcbi.1009972] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 04/11/2022] [Accepted: 02/25/2022] [Indexed: 11/20/2022] Open

Abstract

One of the unique traits of membrane proteins is that a significant fraction of their hydrophobic amino acids is exposed to the hydrophobic core of lipid bilayers rather than being embedded in the protein interior, which is often not explicitly considered in the protein structure and function predictions. Here, we propose a characteristic and predictive quantity, the membrane contact probability (MCP), to describe the likelihood of the amino acids of a given sequence being in direct contact with the acyl chains of lipid molecules. We show that MCP is complementary to solvent accessibility in characterizing the outer surface of membrane proteins, and it can be predicted for any given sequence with a machine learning-based method by utilizing a training dataset extracted from MemProtMD, a database generated from molecular dynamics simulations for the membrane proteins with a known structure. As the first of many potential applications, we demonstrate that MCP can be used to systematically improve the prediction precision of the protein contact maps and structures.

The distribution of residues on protein surfaces is largely determined by the surrounding environment. For soluble proteins, most of the residues on the outer surface are hydrophilic, and people use the quantity “solvent accessibility” to describe and predict these surface residues. In contrast, for membrane proteins that are embedded in a lipid bilayer, many of their surface residues are hydrophobic and membrane-contacting, but there is yet a widely-accepted quantity for the description or prediction of this characteristic property. Here, we propose a new quantity termed “membrane contact probability (MCP)”, which can be used to describe and predict the membrane-contacting surface residues of proteins. We also propose a machine learning-based method to predict MCP from protein sequences, utilizing the dataset generated by physics-based computer simulations. We demonstrate that a quantity such as MCP is helpful for protein structure prediction, and we believe that it will find broad applications in the structure and function studies of membrane proteins.

Collapse

Barrows JK, Van Dyke MW. Biolayer interferometry for DNA-protein interactions. PLoS One 2022;17:e0263322. [PMID: 35108320 PMCID: PMC8809612 DOI: 10.1371/journal.pone.0263322] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 01/14/2022] [Indexed: 11/18/2022] Open

Oliveira Monteiro LM, Saraiva JP, Brizola Toscan R, Stadler PF, Silva-Rocha R, Nunes da Rocha U. PredicTF: prediction of bacterial transcription factors in complex microbial communities using deep learning. ENVIRONMENTAL MICROBIOME 2022;17:7. [PMID: 35135629 PMCID: PMC8822659 DOI: 10.1186/s40793-021-00394-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 12/03/2021] [Indexed: 06/14/2023]

Ledesma L, Hernandez-Guerrero R, Perez-Rueda E. Prediction of DNA-Binding Transcription Factors in Bacteria and Archaea Genomes. Methods Mol Biol 2022;2516:103-112. [PMID: 35922624 DOI: 10.1007/978-1-0716-2413-5_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Wan X, Saltepe B, Yu L, Wang B. Programming living sensors for environment, health and biomanufacturing. Microb Biotechnol 2021;14:2334-2342. [PMID: 33960658 PMCID: PMC8601174 DOI: 10.1111/1751-7915.13820] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 04/05/2021] [Accepted: 04/11/2021] [Indexed: 01/10/2023] Open

Gao Y, Lim HG, Verkler H, Szubin R, Quach D, Rodionova I, Chen K, Yurkovich JT, Cho BK, Palsson BO. Unraveling the functions of uncharacterized transcription factors in Escherichia coli using ChIP-exo. Nucleic Acids Res 2021;49:9696-9710. [PMID: 34428301 PMCID: PMC8464067 DOI: 10.1093/nar/gkab735] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 08/08/2021] [Accepted: 08/11/2021] [Indexed: 02/07/2023] Open

Escherichia coli as a platform microbial host for systems metabolic engineering. Essays Biochem 2021;65:225-246. [PMID: 33956149 DOI: 10.1042/ebc20200172] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 04/12/2021] [Accepted: 04/14/2021] [Indexed: 12/19/2022]

Boob A, Zhao H. Can Deep Learning Solve the Cas9 Dilemma? CRISPR J 2021;4:13-15. [PMID: 33616444 DOI: 10.1089/crispr.2020.29117.hzh] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open

Transforming traditional nutrition paradigms with synthetic biology driven microbial production platforms. CURRENT RESEARCH IN BIOTECHNOLOGY 2021. [DOI: 10.1016/j.crbiot.2021.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open