Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Padilha VA, Alkhnbashi OS, Shah SA, de Carvalho ACPLF, Backofen R. CRISPRcasIdentifier: Machine learning for accurate identification and classification of CRISPR-Cas systems. Gigascience 2020;9:giaa062. [PMID: 32556168 PMCID: PMC7298778 DOI: 10.1093/gigascience/giaa062] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 04/27/2020] [Accepted: 05/15/2020] [Indexed: 12/26/2022] Open

For:	Padilha VA, Alkhnbashi OS, Shah SA, de Carvalho ACPLF, Backofen R. CRISPRcasIdentifier: Machine learning for accurate identification and classification of CRISPR-Cas systems. Gigascience 2020;9:giaa062. [PMID: 32556168 PMCID: PMC7298778 DOI: 10.1093/gigascience/giaa062] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 04/27/2020] [Accepted: 05/15/2020] [Indexed: 12/26/2022] Open

Number

Cited by Other Article(s)

Motoche-Monar C, Andrade D, Pijal WD, Hidrobo F, Armas R, Sánchez-Real E, Rocha-Chauca G, Castillo JA. CRISPRals: A Web Database for Assessing the CRISPR Defense System in the Ralstonia solanacearum Species Complex to Avoid Phage Resistance. PHYTOPATHOLOGY 2024;114:1462-1465. [PMID: 38427684 DOI: 10.1094/phyto-01-24-0010-sc] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/03/2024]

Madugula SS, Pujar P, Nammi B, Wang S, Jayasinghe-Arachchige VM, Pham T, Mashburn D, Artiles M, Liu J. Identification of Family-Specific Features in Cas9 and Cas12 Proteins: A Machine Learning Approach Using Complete Protein Feature Spectrum. J Chem Inf Model 2024;64:4897-4911. [PMID: 38838358 DOI: 10.1021/acs.jcim.4c00625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]

Abstract

The recent development of CRISPR-Cas technology holds promise to correct gene-level defects for genetic diseases. The key element of the CRISPR-Cas system is the Cas protein, a nuclease that can edit the gene of interest assisted by guide RNA. However, these Cas proteins suffer from inherent limitations such as large size, low cleavage efficiency, and off-target effects, hindering their widespread application as a gene editing tool. Therefore, there is a need to identify novel Cas proteins with improved editing properties, for which it is necessary to understand the underlying features governing the Cas families. In this study, we aim to elucidate the unique protein features associated with Cas9 and Cas12 families and identify the features distinguishing each family from non-Cas proteins. Here, we built Random Forest (RF) binary classifiers to distinguish Cas12 and Cas9 proteins from non-Cas proteins, respectively, using the complete protein feature spectrum (13,494 features) encoding various physiochemical, topological, constitutional, and coevolutionary information on Cas proteins. Furthermore, we built multiclass RF classifiers differentiating Cas9, Cas12, and non-Cas proteins. All the models were evaluated rigorously on the test and independent data sets. The Cas12 and Cas9 binary models achieved a high overall accuracy of 92% and 95% on their respective independent data sets, while the multiclass classifier achieved an F1 score of close to 0.98. We observed that Quasi-Sequence-Order (QSO) descriptors like Schneider.lag and Composition descriptors like charge, volume, and polarizability are predominant in the Cas12 family. Conversely Amino Acid Composition descriptors, especially Tripeptide Composition (TPC), predominate the Cas9 family. Four of the top 10 descriptors identified in Cas9 classification are tripeptides PWN, PYY, HHA, and DHI, which are seen to be conserved across all Cas9 proteins and located within different catalytically important domains of the Streptococcus pyogenes Cas9 (SpCas9) structure. Among these, DHI and HHA are well-known to be involved in the DNA cleavage activity of the SpCas9 protein. Mutation studies have highlighted the significance of the PWN tripeptide in PAM recognition and DNA cleavage activity of SpCas9, while Y450 from the PYY tripeptide plays a crucial role in reducing off-target effects and improving the specificity in SpCas9. Leveraging our machine learning (ML) pipeline, we identified numerous Cas9 and Cas12 family-specific features. These features offer valuable insights for future experimental and computational studies aiming at designing Cas systems with enhanced gene-editing properties. These features suggest plausible structural modifications that can effectively guide the development of Cas proteins with improved editing capabilities.

Collapse

Affiliation(s)

Sita Sirisha Madugula Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Pranav Pujar Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Bharani Nammi Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Shouyi Wang Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Vindi M Jayasinghe-Arachchige Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Tyler Pham School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Dominic Mashburn Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Maria Artiles School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Jin Liu Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States

Collapse

Wang JH, Huang PT, Huang YT, Mao YC, Lai CH, Yeh TK, Tseng CH, Kao CC. Characterization of CRISPR-Cas Systems in Shewanella algae and Shewanella haliotis: Insights into the Adaptation and Survival of Marine Pathogens. Pathogens 2024;13:439. [PMID: 38921737 PMCID: PMC11207072 DOI: 10.3390/pathogens13060439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 04/25/2024] [Accepted: 05/15/2024] [Indexed: 06/27/2024] Open

Madugula SS, Pujar P, Bharani N, Wang S, Jayasinghe-Arachchige VM, Pham T, Mashburn D, Artilis M, Liu J. Identification of Family-Specific Features in Cas9 and Cas12 Proteins: A Machine Learning Approach Using Complete Protein Feature Spectrum. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.22.576286. [PMID: 38328240 PMCID: PMC10849529 DOI: 10.1101/2024.01.22.576286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]

Abstract

The recent development of CRISPR-Cas technology holds promise to correct gene-level defects for genetic diseases. The key element of the CRISPR-Cas system is the Cas protein, a nuclease that can edit the gene of interest assisted by guide RNA. However, these Cas proteins suffer from inherent limitations like large size, low cleavage efficiency, and off-target effects, hindering their widespread application as a gene editing tool. Therefore, there is a need to identify novel Cas proteins with improved editing properties, for which it is necessary to understand the underlying features governing the Cas families. In the current study, we aim to elucidate the unique protein attributes associated with Cas9 and Cas12 families and identify the features that distinguish each family from the other. Here, we built Random Forest (RF) binary classifiers to distinguish Cas12 and Cas9 proteins from non-Cas proteins, respectively, using the complete protein feature spectrum (13,495 features) encoding various physiochemical, topological, constitutional, and coevolutionary information of Cas proteins. Furthermore, we built multiclass RF classifiers differentiating Cas9, Cas12, and Non-Cas proteins. All the models were evaluated rigorously on the test and independent datasets. The Cas12 and Cas9 binary models achieved a high overall accuracy of 95% and 97% on their respective independent datasets, while the multiclass classifier achieved a high F1 score of 0.97. We observed that Quasi-sequence-order descriptors like Schneider-lag descriptors and Composition descriptors like charge, volume, and polarizability are essential for the Cas12 family. More interestingly, we discovered that Amino Acid Composition descriptors, especially the Tripeptide Composition (TPC) descriptors, are important for the Cas9 family. Four of the identified important descriptors of Cas9 classification are tripeptides PWN, PYY, HHA, and DHI, which are seen to be conserved across all the Cas9 proteins and were located within different catalytically important domains of the Cas9 protein structure. Among these four tripeptides, tripeptides DHI and HHA are well-known to be involved in the DNA cleavage activity of the Cas9 protein. We therefore propose the the other two tripeptides, PWN and PYY, may also be essential for the Cas9 family. Our identified important descriptors enhanced the understanding of the catalytic mechanisms of Cas9 and Cas12 proteins and provide valuable insights into design of novel Cas systems to achieve enhanced gene-editing properties.

Collapse

Backofen R, Gorodkin J, Hofacker IL, Stadler PF. Comparative RNA Genomics. Methods Mol Biol 2024;2802:347-393. [PMID: 38819565 DOI: 10.1007/978-1-0716-3838-5_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]

Muhammad N, Avila F, Nedashkovskaya OI, Kim SG. Three novel marine species of the genus Reichenbachiella exhibiting degradation of complex polysaccharides. Front Microbiol 2023;14:1265676. [PMID: 38156005 PMCID: PMC10752948 DOI: 10.3389/fmicb.2023.1265676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 11/23/2023] [Indexed: 12/30/2023] Open

Abstract

Three novel strains designated ABR2-5T, BKB1-1T, and WSW4-B4T belonging to the genus Reichenbachiella of the phylum Bacteroidota were isolated from algae and mud samples collected in the West Sea, Korea. All three strains were enriched for genes encoding up to 216 carbohydrate-active enzymes (CAZymes), which participate in the degradation of agar, alginate, carrageenan, laminarin, and starch. The 16S rRNA sequence similarities among the three novel isolates were 94.0%-94.7%, and against all three existing species in the genus Reichenbachiella they were 93.6%-97.2%. The genome sizes of the strains ABR2-5T, BKB1-1T, and WSW4-B4T were 5.5, 4.4, and 5.0 Mb, respectively, and the GC content ranged from 41.1%-42.0%. The average nucleotide identity and the digital DNA-DNA hybridization values of each novel strain within the isolates and all existing species in the genus Reichenbachiella were in a range of 69.2%-75.5% and 17.7-18.9%, respectively, supporting the creation of three new species. The three novel strains exhibited a distinctive fatty acid profile characterized by elevated levels of iso-C15:0 (37.7%-47.4%) and C16:1 ω5c (14.4%-22.9%). Specifically, strain ABR2-5T displayed an additional higher proportion of C16:0 (13.0%). The polar lipids were phosphatidylethanolamine, unidentified lipids, aminolipids, and glycolipids. Menaquinone-7 was identified as the respiratory quinone of the isolates. A comparative genome analysis was performed using the KEGG, RAST, antiSMASH, CRISPRCasFinder, dbCAN, and dbCAN-PUL servers and CRISPRcasIdentifier software. The results revealed that the isolates harbored many key genes involved in central metabolism for the synthesis of essential amino acids and vitamins, hydrolytic enzymes, carotenoid pigments, and antimicrobial compounds. The KEGG analysis showed that the three isolates possessed a complete pathway of dissimilatory nitrate reduction to ammonium (DNRA), which is involved in the conservation of bioavailable nitrogen within the ecosystem. Moreover, all the strains possessed genes that participated in the metabolism of heavy metals, including arsenic, copper, cobalt, ferrous, and manganese. All three isolated strains contain the class 2 type II subtype C1 CRISPR-Cas system in their genomes. The distinguished phenotypic, chemotaxonomic, and genomic characteristics led us to propose that the three strains represent three novel species in the genus Reichenbachiella: R. ulvae sp. nov. (ABR2-5T = KCTC 82990T = JCM 35839T), R. agarivorans sp. nov. (BKB1-1T = KCTC 82964T = JCM 35840T), and R. carrageenanivorans sp. nov. (WSW4-B4T = KCTC 82706T = JCM 35841T).

Collapse

Booker AE, D'Angelo T, Adams-Beyea A, Brown JM, Nigro O, Rappé MS, Stepanauskas R, Orcutt BN. Life strategies for Aminicenantia in subseafloor oceanic crust. THE ISME JOURNAL 2023;17:1406-1415. [PMID: 37328571 PMCID: PMC10432499 DOI: 10.1038/s41396-023-01454-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 04/11/2023] [Accepted: 04/17/2023] [Indexed: 06/18/2023]

Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023;62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]

Abstract

Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology when its potential is manifesting through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago has initiated the generation of large omics-datasets which has helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering that has occurred based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trail-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuits designing, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high-accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering have guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering which will guide the researchers toward novel techniques to overcome the limitations.

Collapse

Mitrofanov A, Ziemann M, Alkhnbashi OS, Hess WR, Backofen R. CRISPRtracrRNA: robust approach for CRISPR tracrRNA detection. Bioinformatics 2022;38:ii42-ii48. [PMID: 36124799 PMCID: PMC9486595 DOI: 10.1093/bioinformatics/btac466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Unraveling the Genomic Potential of the Thermophilic Bacterium Anoxybacillus flavithermus from an Antarctic Geothermal Environment. Microorganisms 2022;10:microorganisms10081673. [PMID: 36014090 PMCID: PMC9413872 DOI: 10.3390/microorganisms10081673] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 08/12/2022] [Accepted: 08/16/2022] [Indexed: 11/25/2022] Open

Genomes of six viruses that infect Asgard archaea from deep-sea sediments. Nat Microbiol 2022;7:953-961. [PMID: 35760837 DOI: 10.1038/s41564-022-01150-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 05/16/2022] [Indexed: 12/25/2022]

A closed Candidatus Odinarchaeum chromosome exposes Asgard archaeal viruses. Nat Microbiol 2022;7:948-952. [PMID: 35760836 PMCID: PMC9246712 DOI: 10.1038/s41564-022-01122-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 04/06/2022] [Indexed: 12/11/2022]

Mattiello L, Rütgers M, Sua-Rojas MF, Tavares R, Soares JS, Begcy K, Menossi M. Molecular and Computational Strategies to Increase the Efficiency of CRISPR-Based Techniques. FRONTIERS IN PLANT SCIENCE 2022;13:868027. [PMID: 35712599 PMCID: PMC9194676 DOI: 10.3389/fpls.2022.868027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 04/27/2022] [Indexed: 06/15/2023]

Wandera KG, Alkhnbashi OS, Bassett HVI, Mitrofanov A, Hauns S, Migur A, Backofen R, Beisel CL. Anti-CRISPR prediction using deep learning reveals an inhibitor of Cas13b nucleases. Mol Cell 2022;82:2714-2726.e4. [PMID: 35649413 DOI: 10.1016/j.molcel.2022.05.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/25/2022] [Accepted: 05/03/2022] [Indexed: 11/28/2022]

Tesson F, Hervé A, Mordret E, Touchon M, d'Humières C, Cury J, Bernheim A. Systematic and quantitative view of the antiviral arsenal of prokaryotes. Nat Commun 2022;13:2561. [PMID: 35538097 PMCID: PMC9090908 DOI: 10.1038/s41467-022-30269-9] [Citation(s) in RCA: 178] [Impact Index Per Article: 89.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 04/22/2022] [Indexed: 12/16/2022] Open

Spacer prioritization in CRISPR-Cas9 immunity is enabled by the leader RNA. Nat Microbiol 2022;7:530-541. [PMID: 35314780 PMCID: PMC7612570 DOI: 10.1038/s41564-022-01074-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Accepted: 02/01/2022] [Indexed: 11/08/2022]

Santana de Carvalho D, Trovatti Uetanabaro AP, Kato RB, Aburjaile FF, Jaiswal AK, Profeta R, De Oliveira Carvalho RD, Tiwar S, Cybelle Pinto Gomide A, Almeida Costa E, Kukharenko O, Orlovska I, Podolich O, Reva O, Ramos PIP, De Carvalho Azevedo VA, Brenig B, Andrade BS, de Vera JPP, Kozyrovska NO, Barh D, Góes-Neto A. The Space-Exposed Kombucha Microbial Community Member Komagataeibacter oboediens Showed Only Minor Changes in Its Genome After Reactivation on Earth. Front Microbiol 2022;13:782175. [PMID: 35369445 PMCID: PMC8970348 DOI: 10.3389/fmicb.2022.782175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 02/01/2022] [Indexed: 11/23/2022] Open

Affiliation(s)

Daniel Santana de Carvalho Laboratory of Molecular and Computational Biology of Fungi, Department of Microbiology, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Ana Paula Trovatti Uetanabaro Laboratory of Molecular and Computational Biology of Fungi, Department of Microbiology, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil Postgraduate Program in Biology and Biotechnology of Microorganisms, Department of Biological Sciences, State University of Santa Cruz, Ilhéus, Brazil
Rodrigo Bentes Kato Laboratory of Molecular and Computational Biology of Fungi, Department of Microbiology, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Flávia Figueira Aburjaile Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Arun Kumar Jaiswal Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Rodrigo Profeta Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Rodrigo Dias De Oliveira Carvalho Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Sandeep Tiwar Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Anne Cybelle Pinto Gomide Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Eduardo Almeida Costa Computational Biology and Biotechnological Information Management Center (NBCGIB), State University of Santa Cruz, Ilhéus, Brazil
Olga Kukharenko Institute of Molecular Biology and Genetics of NASU, Kyiv, Ukraine
Iryna Orlovska Institute of Molecular Biology and Genetics of NASU, Kyiv, Ukraine
Olga Podolich Institute of Molecular Biology and Genetics of NASU, Kyiv, Ukraine
Oleg Reva Department of Biochemistry, Genetics and Microbiology, Centre for Bioinformatics and Computational Biology, University of Pretoria, Pretoria, South Africa
Pablo Ivan P. Ramos Center for Data and Knowledge Integration for Health (CIDACS), Institute Gonçalo Moniz, Oswaldo Cruz Foundation (FIOCRUZ-Bahia), Salvador, Brazil
Vasco Ariston De Carvalho Azevedo Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Bertram Brenig Institute of Veterinary Medicine, Burckhardtweg, University of Göttingen, Göttingen, Germany
Bruno Silva Andrade Laboratory of Bioinformatics and Computational Chemistry, Department of Biological Sciences, State University of Southwest Bahia (UESB), Jequié, Brazil
Jean-Pierre P. de Vera German Aerospace Center (DLR) Berlin, Institute of Planetary Research, Planetary Laboratories, Astrobiological Laboratories, Berlin, Germany
Natalia O. Kozyrovska Institute of Molecular Biology and Genetics of NASU, Kyiv, Ukraine
Debmalya Barh Laboratory of Cellular and Molecular Genetics, Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology, Purba Medinipur, India
Aristóteles Góes-Neto Laboratory of Molecular and Computational Biology of Fungi, Department of Microbiology, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil

Collapse

Payne LJ, Todeschini TC, Wu Y, Perry BJ, Ronson C, Fineran P, Nobrega F, Jackson S. Identification and classification of antiviral defence systems in bacteria and archaea with PADLOC reveals new system types. Nucleic Acids Res 2021;49:10868-10878. [PMID: 34606606 PMCID: PMC8565338 DOI: 10.1093/nar/gkab883] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 09/13/2021] [Accepted: 09/17/2021] [Indexed: 11/14/2022] Open

Yang S, Huang J, He B. CASPredict: a web service for identifying Cas proteins. PeerJ 2021;9:e11887. [PMID: 34395100 PMCID: PMC8327967 DOI: 10.7717/peerj.11887] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 07/09/2021] [Indexed: 12/16/2022] Open

Alkhnbashi OS, Mitrofanov A, Bonidia R, Raden M, Tran V, Eggenhofer F, Shah S, Öztürk E, Padilha V, Sanches D, de Carvalho A, Backofen R. CRISPRloci: comprehensive and accurate annotation of CRISPR-Cas systems. Nucleic Acids Res 2021;49:W125-W130. [PMID: 34133710 PMCID: PMC8265192 DOI: 10.1093/nar/gkab456] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/28/2021] [Accepted: 05/17/2021] [Indexed: 11/17/2022] Open

Padilha VA, Alkhnbashi OS, Tran VD, Shah SA, Carvalho ACPLF, Backofen R. Casboundary: automated definition of integral Cas cassettes. Bioinformatics 2021;37:1352-1359. [PMID: 33226067 PMCID: PMC8208735 DOI: 10.1093/bioinformatics/btaa984] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 10/28/2020] [Accepted: 11/11/2020] [Indexed: 11/13/2022] Open

Mitrofanov A, Alkhnbashi OS, Shmakov SA, Makarova K, Koonin E, Backofen R. CRISPRidentify: identification of CRISPR arrays using machine learning approach. Nucleic Acids Res 2021;49:e20. [PMID: 33290505 PMCID: PMC7913763 DOI: 10.1093/nar/gkaa1158] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 11/09/2020] [Accepted: 11/11/2020] [Indexed: 02/02/2023] Open

Tan X, Letendre JH, Collins JJ, Wong WW. Synthetic biology in the clinic: engineering vaccines, diagnostics, and therapeutics. Cell 2021;184:881-898. [PMID: 33571426 PMCID: PMC7897318 DOI: 10.1016/j.cell.2021.01.017] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/17/2022]

Padilha VA, Alkhnbashi OS, Shah SA, de Carvalho ACPLF, Backofen R. CRISPRcasIdentifier: Machine learning for accurate identification and classification of CRISPR-Cas systems. Gigascience 2020;9:giaa062. [PMID: 32556168 PMCID: PMC7298778 DOI: 10.1093/gigascience/giaa062] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 04/27/2020] [Accepted: 05/15/2020] [Indexed: 12/26/2022] Open