Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mukherjee S, Stamatis D, Bertsch J, Ovchinnikova G, Verezemska O, Isbandi M, Thomas AD, Ali R, Sharma K, Kyrpides NC, Reddy TBK. Genomes OnLine Database (GOLD) v.6: data updates and feature enhancements. Nucleic Acids Res 2017;45:D446-D456. [PMID: 27794040 PMCID: PMC5210664 DOI: 10.1093/nar/gkw992] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 10/11/2016] [Accepted: 10/19/2016] [Indexed: 01/28/2023] Open

For:	Mukherjee S, Stamatis D, Bertsch J, Ovchinnikova G, Verezemska O, Isbandi M, Thomas AD, Ali R, Sharma K, Kyrpides NC, Reddy TBK. Genomes OnLine Database (GOLD) v.6: data updates and feature enhancements. Nucleic Acids Res 2017;45:D446-D456. [PMID: 27794040 PMCID: PMC5210664 DOI: 10.1093/nar/gkw992] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 10/11/2016] [Accepted: 10/19/2016] [Indexed: 01/28/2023] Open

Number

Cited by Other Article(s)

Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024;74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]

Abstract

Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.

Collapse

Bell KL, Turo KJ, Lowe A, Nota K, Keller A, Encinas‐Viso F, Parducci L, Richardson RT, Leggett RM, Brosi BJ, Burgess KS, Suyama Y, de Vere N. Plants, pollinators and their interactions under global ecological change: The role of pollen DNA metabarcoding. Mol Ecol 2023;32:6345-6362. [PMID: 36086900 PMCID: PMC10947134 DOI: 10.1111/mec.16689] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 08/18/2022] [Accepted: 08/30/2022] [Indexed: 11/28/2022]

Khan A, Sohail S, Yaseen S, Fatima S, Wisal A, Ahmed S, Nasir M, Irfan M, Karim A, Basharat Z, Khan Y, Aurongzeb M, Raza SK, Alshahrani MY, Morel CM, Hassan SS. Exploring and targeting potential druggable antimicrobial resistance targets ArgS, SecY, and MurA in Staphylococcus sciuri with TCM inhibitors through a subtractive genomics strategy. Funct Integr Genomics 2023;23:254. [PMID: 37495774 DOI: 10.1007/s10142-023-01179-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 07/14/2023] [Accepted: 07/14/2023] [Indexed: 07/28/2023]

Abstract

Staphylococcus sciuri (also currently Mammaliicoccus sciuri) are anaerobic facultative and non-motile bacteria that cause significant human pathogenesis such as endocarditis, wound infections, peritonitis, UTI, and septic shock. Methicillin-resistant S. sciuri (MRSS) strains also infects animals that include healthy broilers, cattle, dogs, and pigs. The emergence of MRSS strains thereby poses a serious health threat and thrives the scientific community towards novel treatment options. Herein, we investigated the druggable genome of S. sciuri by employing subtractive genomics that resulted in seven genes/proteins where only three of them were predicted as final targets. Further mining the literature showed that the ArgS (WP_058610923), SecY (WP_058611897), and MurA (WP_058612677) are involved in the multi-drug resistance phenomenon. After constructing and verifying the 3D protein homology models, a screening process was carried out using a library of Traditional Chinese Medicine compounds (consisting of 36,043 compounds). The molecular docking and simulation studies revealed the physicochemical stability parameters of the docked TCM inhibitors in the druggable cavities of each protein target by identifying their druggability potential and maximum hydrogen bonding interactions. The simulated receptor-ligand complexes showed the conformational changes and stability index of the secondary structure elements. The root mean square deviation (RMSD) graph showed fluctuations due to structural changes in the helix-coil-helix and beta-turn-beta changes at specific points where the pattern of the RMSD and root mean square fluctuation (RMSF) (< 1.0 Å) support any major domain shifts within the structural framework of the protein-ligand complex and placement of ligand was well complemented within the binding site. The β-factor values demonstrated instability at few points while the radius of gyration for structural compactness as a time function for the 100-ns simulation of protein-ligand complexes showed favorable average values and denoted the stability of all complexes. It is assumed that such findings might facilitate researchers to robustly discover and develop effective therapeutics against S. sciuri alongside other enteric infections.

Collapse

Affiliation(s)

Aafareen Khan Department of Chemistry, Islamia College Peshawar, Peshawar, 25000, KP, Pakistan
Saman Sohail Department of Chemistry, Islamia College Peshawar, Peshawar, 25000, KP, Pakistan
Seerat Yaseen Abbasi Shaheed Hospital, Karachi Medical and Dental College, Karachi, Pakistan
Sareen Fatima Department of Microbiology, University of Balochistan, Quetta, Balochistan, Pakistan
Ayesha Wisal Department of Chemistry, Islamia College Peshawar, Peshawar, 25000, KP, Pakistan
Sufyan Ahmed Abbasi Shaheed Hospital, Karachi Medical and Dental College, Karachi, Pakistan
Mahrukh Nasir Dr. Panjwani Center for Molecular Medicine, International Center for Chemical and Biological Sciences (ICCBS-PCMD), University of Karachi, Karachi, 75270, Pakistan
Muhammad Irfan Dr. Panjwani Center for Molecular Medicine, International Center for Chemical and Biological Sciences (ICCBS-PCMD), University of Karachi, Karachi, 75270, Pakistan
Asad Karim Dr. Panjwani Center for Molecular Medicine, International Center for Chemical and Biological Sciences (ICCBS-PCMD), University of Karachi, Karachi, 75270, Pakistan
Zarrin Basharat Alpha Genomics (Private) Limited, Islamabad, 44710, Pakistan
Yasmin Khan Dr. Panjwani Center for Molecular Medicine, International Center for Chemical and Biological Sciences (ICCBS-PCMD), University of Karachi, Karachi, 75270, Pakistan
Muhammad Aurongzeb Faculty of Engineering Sciences & Technology, Hamdard University, Karachi, 74600, Pakistan
Syed Kashif Raza Faculty of Rehabilitation and Allied Health Sciences (FRAHS), Riphah International University, Faisalabad, Pakistan
Mohammad Y Alshahrani Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, King Khalid University, P.O. Box 61413, Abha, 9088, Saudi Arabia
Carlos M Morel Centre for Technological Development in Health (CDTS), Oswaldo Cruz Foundation (Fiocruz), Building "Expansão", 8Th Floor Room 814, Av. Brasil 4036 - Manguinhos, Rio de Janeiro, RJ, 21040-361, Brazil.
Syed S Hassan Dr. Panjwani Center for Molecular Medicine, International Center for Chemical and Biological Sciences (ICCBS-PCMD), University of Karachi, Karachi, 75270, Pakistan. Centre for Technological Development in Health (CDTS), Oswaldo Cruz Foundation (Fiocruz), Building "Expansão", 8Th Floor Room 814, Av. Brasil 4036 - Manguinhos, Rio de Janeiro, RJ, 21040-361, Brazil.

Collapse

Mukherjee S, Stamatis D, Li C, Ovchinnikova G, Bertsch J, Sundaramurthi J, Kandimalla M, Nicolopoulos P, Favognano A, Chen IM, Kyrpides N, Reddy TBK. Twenty-five years of Genomes OnLine Database (GOLD): data updates and new features in v.9. Nucleic Acids Res 2023;51:D957-D963. [PMID: 36318257 PMCID: PMC9825498 DOI: 10.1093/nar/gkac974] [Citation(s) in RCA: 36] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/05/2022] [Accepted: 10/16/2022] [Indexed: 01/09/2023] Open

Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023;62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]

Abstract

Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology when its potential is manifesting through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago has initiated the generation of large omics-datasets which has helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering that has occurred based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trail-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuits designing, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high-accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering have guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering which will guide the researchers toward novel techniques to overcome the limitations.

Collapse

Montero-Calasanz MDC, Yaramis A, Rohde M, Schumann P, Klenk HP, Meier-Kolthoff JP. Genotype-phenotype correlations within the Geodermatophilaceae. Front Microbiol 2022;13:975365. [PMID: 36439792 PMCID: PMC9686282 DOI: 10.3389/fmicb.2022.975365] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 10/11/2022] [Indexed: 11/11/2022] Open

Abstract

The integration of genomic information into microbial systematics along with physiological and chemotaxonomic parameters provides for a reliable classification of prokaryotes. In silico analysis of chemotaxonomic traits is now being introduced to replace characteristics traditionally determined in the laboratory with the dual goal of both increasing the speed of the description of taxa and the accuracy and consistency of taxonomic reports. Genomics has already successfully been applied in the taxonomic rearrangement of Geodermatophilaceae (Actinomycetota) but in the light of new genomic data the taxonomy of the family needs to be revisited. In conjunction with the taxonomic characterisation of four strains phylogenetically located within the family, we conducted a phylogenetic analysis of the whole proteomes of the sequenced type strains and established genotype-phenotype correlations for traits related to chemotaxonomy, cell morphology and metabolism. Results indicated that the four isolates under study represent four novel species within the genus Blastococcus. Additionally, the genera Blastococcus, Geodermatophilus and Modestobacter were shown to be paraphyletic. Consequently, the new genera Trujillonella, Pleomorpha and Goekera were proposed within the Geodermatophilaceae and Blastococcus endophyticus was reclassified as Trujillonella endophytica comb. nov., Geodermatophilus daqingensis as Pleomorpha daqingensis comb. nov. and Modestobacter deserti as Goekera deserti comb. nov. Accordingly, we also proposed emended descriptions of Blastococcus aggregatus, Blastococcus jejuensis, Blastococcus saxobsidens and Blastococcus xanthilyniticus. In silico chemotaxonomic results were overall consistent with wet-lab results. Even though in silico discriminatory levels varied depending on the respective chemotaxonomic trait, this approach is promising for effectively replacing and/or complementing chemotaxonomic analyses at taxonomic ranks above the species level. Finally, interesting but previously overlooked insights regarding morphology and ecology were revealed by the presence of a repertoire of genes related to flagellum synthesis, chemotaxis, spore production and pilus assembly in all representatives of the family. A rich carbon metabolism including four different CO2 fixation pathways and a battery of enzymes able to degrade complex carbohydrates were also identified in Blastococcus genomes.

Collapse

Di Carlo P, Serra N, Alduina R, Guarino R, Craxì A, Giammanco A, Fasciana T, Cascio A, Sergi CM. A systematic review on omics data (metagenomics, metatranscriptomics, and metabolomics) in the role of microbiome in gallbladder disease. Front Physiol 2022;13:888233. [PMID: 36111147 PMCID: PMC9468903 DOI: 10.3389/fphys.2022.888233] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 07/11/2022] [Indexed: 12/04/2022] Open

Abstract

Microbiotas are the range of microorganisms (mainly bacteria and fungi) colonizing multicellular, macroscopic organisms. They are crucial for several metabolic functions affecting the health of the host. However, difficulties hamper the investigation of microbiota composition in cultivating microorganisms in standard growth media. For this reason, our knowledge of microbiota can benefit from the analysis of microbial macromolecules (DNA, transcripts, proteins, or by-products) present in various samples collected from the host. Various omics technologies are used to obtain different data. Metagenomics provides a taxonomical profile of the sample. It can also be used to obtain potential functional information. At the same time, metatranscriptomics can characterize members of a microbiome responsible for specific functions and elucidate genes that drive the microbiotas relationship with its host. Thus, while microbiota refers to microorganisms living in a determined environment (taxonomy of microorganisms identified), microbiome refers to the microorganisms and their genes living in a determined environment and, of course, metagenomics focuses on the genes and collective functions of identified microorganisms. Metabolomics completes this framework by determining the metabolite fluxes and the products released into the environment. The gallbladder is a sac localized under the liver in the human body and is difficult to access for bile and tissue sampling. It concentrates the bile produced in the hepatocytes, which drains into bile canaliculi. Bile promotes fat digestion and is released from the gallbladder into the upper small intestine in response to food. Considered sterile originally, recent data indicate that bile microbiota is associated with the biliary tract’s inflammation and carcinogenesis. The sample size is relevant for omic studies of rare diseases, such as gallbladder carcinoma. Although in its infancy, the study of the biliary microbiota has begun taking advantage of several omics strategies, mainly based on metagenomics, metabolomics, and mouse models. Here, we show that omics analyses from the literature may provide a more comprehensive image of the biliary microbiota. We review studies performed in this environmental niche and focus on network-based approaches for integrative studies.

Collapse

Li X, Ren W, Li Y, Shi Y, Sun H, Wang L, Wu L, Xie Y, Du Y, Jiang Z, Hong B. Production of chain-extended cinnamoyl compounds by overexpressing two adjacent cluster-situated LuxR regulators in Streptomyces globisporus C-1027. Front Microbiol 2022;13:931180. [PMID: 35992673 PMCID: PMC9381841 DOI: 10.3389/fmicb.2022.931180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 07/04/2022] [Indexed: 11/17/2022] Open

MiDSystem: A comprehensive online system for de novo assembly and analysis of microbial genomes. N Biotechnol 2021;65:42-52. [PMID: 34411700 DOI: 10.1016/j.nbt.2021.08.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 08/13/2021] [Accepted: 08/14/2021] [Indexed: 12/12/2022]

Vieira AZ, Raittz RT, Faoro H. Origin and evolution of nonulosonic acid synthases and their relationship with bacterial pathogenicity revealed by a large-scale phylogenetic analysis. Microb Genom 2021;7:000563. [PMID: 33848237 PMCID: PMC8208679 DOI: 10.1099/mgen.0.000563] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 03/16/2021] [Indexed: 12/28/2022] Open

Abstract

Nonulosonic acids (NulOs) are a group of nine-carbon monosaccharides with different functions in nature. N-acetylneuraminic acid (Neu5Ac) is the most common NulO. It covers the membrane surface of all human cells and is a central molecule in the process of self-recognition via SIGLECS receptors. Some pathogenic bacteria escape the immune system by copying the sialylation of the host cell membrane. Neu5Ac production in these bacteria is catalysed by the enzyme NeuB. Some bacteria can also produce other NulOs named pseudaminic and legionaminic acids, through the NeuB homologues PseI and LegI, respectively. In Opisthokonta eukaryotes, the biosynthesis of Neu5Ac is catalysed by the enzyme NanS. In this study, we used publicly available data of sequences of NulOs synthases to investigate its distribution within the three domains of life and its relationship with pathogenic bacteria. We mined the KEGG database and found 425 NeuB sequences. Most NeuB sequences (58.74 %) from the KEGG orthology database were classified as from environmental bacteria; however, sequences from pathogenic bacteria showed higher conservation and prevalence of a specific domain named SAF. Using the HMM profile we identified 13 941 NulO synthase sequences in UniProt. Phylogenetic analysis of these sequences showed that the synthases were divided into three main groups that can be related to the lifestyle of these bacteria: (I) predominantly environmental, (II) intermediate and (III) predominantly pathogenic. NeuB was widely distributed in the groups. However, LegI and PseI were more concentrated in groups II and III, respectively. We also found that PseI appeared later in the evolutionary process, derived from NeuB. We use this same methodology to retrieve sialic acid synthase sequences from Archaea and Eukarya. A large-scale phylogenetic analysis showed that while the Archaea sequences are spread across the tree, the eukaryotic NanS sequences were grouped in a specific branch in group II. None of the bacterial NanS sequences grouped with the eukaryotic branch. The analysis of conserved residues showed that the synthases of Archaea and Eukarya present a mutation in one of the three catalytic residues, an E134D change, related to a Neisseria meningitidis reference sequence. We also found that the conservation profile is higher between NeuB of pathogenic bacteria and NanS of eukaryotes than between NeuB of environmental bacteria and NanS of eukaryotes. Our large-scale analysis brings new perspectives on the evolution of NulOs synthases, suggesting their presence in the last common universal ancestor.

Collapse

Xavier JC, Gerhards RE, Wimmer JLE, Brueckner J, Tria FDK, Martin WF. The metabolic network of the last bacterial common ancestor. Commun Biol 2021;4:413. [PMID: 33772086 PMCID: PMC7997952 DOI: 10.1038/s42003-021-01918-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 02/26/2021] [Indexed: 02/03/2023] Open

Thorsen J, Stokholm J, Rasmussen MA, Mortensen MS, Brejnrod AD, Hjelmsø M, Shah S, Chawes B, Bønnelykke K, Sørensen SJ, Bisgaard H. The Airway Microbiota Modulates Effect of Azithromycin Treatment for Episodes of Recurrent Asthma-like Symptoms in Preschool Children: A Randomized Clinical Trial. Am J Respir Crit Care Med 2021;204:149-158. [PMID: 33730519 DOI: 10.1164/rccm.202008-3226oc] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Abstract

Rationale: Childhood asthma is often preceded by recurrent episodes of asthma-like symptoms, which can be triggered by both viral and bacterial agents. Recent randomized controlled trials have shown that azithromycin treatment reduces episode duration and severity through yet undefined mechanisms. Objectives: To study the influence of the airway microbiota on the effect of azithromycin treatment during acute episodes of asthma-like symptoms. Methods: Children from the COPSAC₂₀₁₀ (Copenhagen Prospective Studies on Asthma in Childhood 2010) cohort with recurrent asthma-like symptoms aged 12-36 months were randomized during acute episodes to azithromycin or placebo as previously reported. Before randomization, hypopharyngeal aspirates were collected and examined by 16S ribosomal RNA gene amplicon sequencing. Measurements and Main Results: In 139 airway samples from 68 children, episode duration after randomization was associated with microbiota richness (7.5% increased duration per 10 additional operational taxonomic units [OTUs]; 95% confidence interval, 1-14%; P = 0.025), with 15 individual OTUs (including several Neisseria and Veillonella), and with microbial pneumotypes defined from weighted UniFrac distances (longest durations in a Neisseria-dominated pneumotype). Microbiota richness before treatment increased the effect of azithromycin by 10% per 10 additional OTUs, and more OTUs were positively versus negatively associated with an increased azithromycin effect (82 vs. 58; P = 0.0032). Furthermore, effect modification of azithromycin was found for five individual OTUs (three OTUs increased and two OTUs decreased the effect; q < 0.05). Conclusions: The airway microbiota in acute episodes of asthma-like symptoms is associated with episode duration and modifies the effect of azithromycin treatment of the episodes in preschool children with recurrent asthma-like symptoms. Clinical trial registered with www.clinicaltrials.gov (NCT01233297).

Collapse

Mukherjee S, Stamatis D, Bertsch J, Ovchinnikova G, Sundaramurthi J, Lee J, Kandimalla M, Chen IMA, Kyrpides NC, Reddy TBK. Genomes OnLine Database (GOLD) v.8: overview and updates. Nucleic Acids Res 2021;49:D723-D733. [PMID: 33152092 PMCID: PMC7778979 DOI: 10.1093/nar/gkaa983] [Citation(s) in RCA: 109] [Impact Index Per Article: 36.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/08/2020] [Accepted: 10/19/2020] [Indexed: 12/28/2022] Open

Helmy M, Smith D, Selvarajoo K. Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering. Metab Eng Commun 2020;11:e00149. [PMID: 33072513 PMCID: PMC7546651 DOI: 10.1016/j.mec.2020.e00149] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/01/2020] [Accepted: 10/07/2020] [Indexed: 12/05/2022] Open

Canon F, Mariadassou M, Maillard MB, Falentin H, Parayre S, Madec MN, Valence F, Henry G, Laroute V, Daveran-Mingot ML, Cocaign-Bousquet M, Thierry A, Gagnaire V. Function-Driven Design of Lactic Acid Bacteria Co-cultures to Produce New Fermented Food Associating Milk and Lupin. Front Microbiol 2020;11:584163. [PMID: 33329449 PMCID: PMC7717992 DOI: 10.3389/fmicb.2020.584163] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 10/13/2020] [Indexed: 11/17/2022] Open

Reyes-Prieto M, Vargas-Chávez C, Llabrés M, Palmer P, Latorre A, Moya A. An update on the Symbiotic Genomes Database (SymGenDB): a collection of metadata, genomic, genetic and protein sequences, orthologs and metabolic networks of symbiotic organisms. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020;2020:5735476. [PMID: 32055857 PMCID: PMC7018611 DOI: 10.1093/database/baz160] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Revised: 07/20/2019] [Accepted: 12/31/2019] [Indexed: 11/14/2022]

Phase separation by ssDNA binding protein controlled via protein-protein and protein-DNA interactions. Proc Natl Acad Sci U S A 2020;117:26206-26217. [PMID: 33020264 PMCID: PMC7584906 DOI: 10.1073/pnas.2000761117] [Citation(s) in RCA: 73] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Abstract

Cells must rapidly and efficiently react to DNA damage to avoid its harmful consequences. Here we report a molecular mechanism that gives rise to a model of how bacterial cells mobilize DNA repair proteins for timely response to genomic stress and initiation of DNA repair upon exposure of single-stranded DNA. We found that bacterial single-stranded DNA binding protein (SSB), a central player in genome metabolism, can undergo dynamic phase separation under physiological conditions. SSB condensates can store a wide array of DNA repair proteins that specifically interact with SSB. However, elevated levels of single-stranded DNA during genomic stress can dissolve SSB condensates, enabling rapid mobilization of SSB and SSB-interacting proteins to sites of DNA damage.

Bacterial single-stranded (ss)DNA-binding proteins (SSB) are essential for the replication and maintenance of the genome. SSBs share a conserved ssDNA-binding domain, a less conserved intrinsically disordered linker (IDL), and a highly conserved C-terminal peptide (CTP) motif that mediates a wide array of protein−protein interactions with DNA-metabolizing proteins. Here we show that the Escherichia coli SSB protein forms liquid−liquid phase-separated condensates in cellular-like conditions through multifaceted interactions involving all structural regions of the protein. SSB, ssDNA, and SSB-interacting molecules are highly concentrated within the condensates, whereas phase separation is overall regulated by the stoichiometry of SSB and ssDNA. Together with recent results on subcellular SSB localization patterns, our results point to a conserved mechanism by which bacterial cells store a pool of SSB and SSB-interacting proteins. Dynamic phase separation enables rapid mobilization of this protein pool to protect exposed ssDNA and repair genomic loci affected by DNA damage.

Collapse

Phylogeny resolved, metabolism revealed: functional radiation within a widespread and divergent clade of sponge symbionts. ISME JOURNAL 2020;15:503-519. [PMID: 33011742 DOI: 10.1038/s41396-020-00791-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 09/09/2020] [Accepted: 09/21/2020] [Indexed: 01/17/2023]

Neubauer V, Petri RM, Humer E, Kröger I, Reisinger N, Baumgartner W, Wagner M, Zebeli Q. Starch-Rich Diet Induced Rumen Acidosis and Hindgut Dysbiosis in Dairy Cows of Different Lactations. Animals (Basel) 2020;10:ani10101727. [PMID: 32977653 PMCID: PMC7598178 DOI: 10.3390/ani10101727] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 09/17/2020] [Accepted: 09/19/2020] [Indexed: 01/23/2023] Open

Abstract

Simple Summary

High-producing dairy cows receive high-energy diets for maintenance and production. This study showed that 60% concentrate in the diet, containing 27.7% starch, changed the fecal-microbial community and lowered its diversity, suggesting hindgut dysbiosis. Both ruminal and fecal pH decreased with high-starch feeding, which suggests further investigations in fecal pH as rumen- and hindgut-acidosis diagnostic tool. Cows in the third lactation spent more time below the threshold for subacute-ruminal acidosis (pH 6.0) than second or fourth-or-below lactation cows. Their higher susceptibility was caused by their high dry matter intake but missing counter-regulation by increased rumination activity. Further, we suggest that body weight and rumen size might play a role in the absorptive capacity of short-chain fatty acids. The study also identified indicator-bacterial phylotypes that changed with starch-rich diet and lactation number. In conclusion, we suggest including lactation number as a factor in practical feeding management for identification of high risk-cows for acidosis, and in dairy cow research.

Abstract

Starch-rich diets can cause subacute ruminal acidosis (SARA) in dairy cows with potentially different susceptibility according to lactation number. We wanted to evaluate the bacterial community and the fermentation end products in feces to study susceptibility to hindgut acidosis and dysbiosis. Sixteen dairy cows received a medium-concentrate diet (MC, 40% concentrate, 18.8% starch) for one week and a high-concentrate diet (HC, 60% concentrate, 27.7% starch, DM) for four weeks. Milk yield, dry-matter intake, chewing activity, ruminal pH, milk constituents, and fecal samples for short-chain fatty acids (SCFA), pH, and 16S rRNA-gene sequencing were investigated. The HC feeding caused a reduction in fecal pH, bacterial diversity and richness, an increase in total SCFA, and a separate phylogenetic clustering of MC and HC samples. Ruminal and fecal pH had fair correlation (r = 0.5). Cows in the second lactation (2ndL) had lower dry matter intake (DMI) than cows of third or fourth or more lactations (3rdL; ≥4 L), whereas DMI/kg body weight was lower for ≥4 L than for 2ndL and 3rdL cows. The mean ruminal pH was highest in ≥4 L, whereas the time spent below the SARA threshold was highest for 3rdL cows. The latter also had higher total SCFA in the feces. Our results suggest that hindgut dysbiosis is caused by increased substrate flow to the hindgut, but further investigations are needed to define hindgut acidosis. The 3rdL cows were most susceptible to rumen acidosis and hindgut dysbiosis due to high DMI level, but missing counter regulations, as suggested happening in 2ndL and ≥4 L cows.

Collapse

Schmiedová L, Kreisinger J, Požgayová M, Honza M, Martin JF, Procházka P. Gut microbiota in a host-brood parasite system: insights from common cuckoos raised by two warbler species. FEMS Microbiol Ecol 2020;96:5872480. [PMID: 32672792 DOI: 10.1093/femsec/fiaa143] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 07/15/2020] [Indexed: 11/13/2022] Open

Moi D, Kilchoer L, Aguilar PS, Dessimoz C. Scalable phylogenetic profiling using MinHash uncovers likely eukaryotic sexual reproduction genes. PLoS Comput Biol 2020;16:e1007553. [PMID: 32697802 PMCID: PMC7423146 DOI: 10.1371/journal.pcbi.1007553] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 08/12/2020] [Accepted: 05/18/2020] [Indexed: 01/09/2023] Open

Abstract

Phylogenetic profiling is a computational method to predict genes involved in the same biological process by identifying protein families which tend to be jointly lost or retained across the tree of life. Phylogenetic profiling has customarily been more widely used with prokaryotes than eukaryotes, because the method is thought to require many diverse genomes. There are now many eukaryotic genomes available, but these are considerably larger, and typical phylogenetic profiling methods require at least quadratic time as a function of the number of genes. We introduce a fast, scalable phylogenetic profiling approach entitled HogProf, which leverages hierarchical orthologous groups for the construction of large profiles and locality-sensitive hashing for efficient retrieval of similar profiles. We show that the approach outperforms Enhanced Phylogenetic Tree, a phylogeny-based method, and use the tool to reconstruct networks and query for interactors of the kinetochore complex as well as conserved proteins involved in sexual reproduction: Hap2, Spo11 and Gex1. HogProf enables large-scale phylogenetic profiling across the three domains of life, and will be useful to predict biological pathways among the hundreds of thousands of eukaryotic species that will become available in the coming few years. HogProf is available at https://github.com/DessimozLab/HogProf.

Genes that are involved in the same biological process tend to co-evolve. This property is exploited by the technique of phylogenetic profiling, which identifies co-evolving (and therefore likely functionally related) genes through patterns of correlated gene retention and loss in evolution and across species. However, conventional methods to computing and clustering these correlated genes do not scale with increasing numbers of genomes. HogProf is a novel phylogenetic profiling tool built on probabilistic data structures. It allows the user to construct searchable databases containing the evolutionary history of hundreds of thousands of protein families. Such fast detection of coevolution takes advantage of the rapidly increasing amount of genomic data publicly available, and can uncover unknown biological networks and guide in-vivo research and experimentation. We have applied our tool to describe the biological networks underpinning sexual reproduction in eukaryotes.

Collapse

Villar E, Cabrol L, Heimbürger-Boavida LE. Widespread microbial mercury methylation genes in the global ocean. ENVIRONMENTAL MICROBIOLOGY REPORTS 2020. [PMID: 32090489 DOI: 10.1101/648329] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]

Chen IMA, Chu K, Palaniappan K, Pillay M, Ratner A, Huang J, Huntemann M, Varghese N, White JR, Seshadri R, Smirnova T, Kirton E, Jungbluth SP, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res 2020;47:D666-D677. [PMID: 30289528 PMCID: PMC6323987 DOI: 10.1093/nar/gky901] [Citation(s) in RCA: 547] [Impact Index Per Article: 136.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 09/24/2018] [Indexed: 11/12/2022] Open

Paez-Espino D, Roux S, Chen IMA, Palaniappan K, Ratner A, Chu K, Huntemann M, Reddy TBK, Pons JC, Llabrés M, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. IMG/VR v.2.0: an integrated data management and analysis system for cultivated and environmental viral genomes. Nucleic Acids Res 2020;47:D678-D686. [PMID: 30407573 PMCID: PMC6323928 DOI: 10.1093/nar/gky1127] [Citation(s) in RCA: 114] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Accepted: 10/31/2018] [Indexed: 01/06/2023] Open

Pathogenomics and Management of Fusarium Diseases in Plants. Pathogens 2020;9:pathogens9050340. [PMID: 32369942 PMCID: PMC7281180 DOI: 10.3390/pathogens9050340] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 04/25/2020] [Accepted: 04/28/2020] [Indexed: 12/16/2022] Open

The Great Oxidation Event expanded the genetic repertoire of arsenic metabolism and cycling. Proc Natl Acad Sci U S A 2020;117:10414-10421. [PMID: 32350143 PMCID: PMC7229686 DOI: 10.1073/pnas.2001063117] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Zoller R, Zehavi M, Ziv-Ukelson M. A New Paradigm for Identifying Reconciliation-Scenario Altering Mutations Conferring Environmental Adaptation. J Comput Biol 2020;27:1561-1580. [PMID: 32250165 DOI: 10.1089/cmb.2019.0472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Blumer-Schuette SE. Insights into Thermophilic Plant Biomass Hydrolysis from Caldicellulosiruptor Systems Biology. Microorganisms 2020;8:E385. [PMID: 32164310 PMCID: PMC7142884 DOI: 10.3390/microorganisms8030385] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2020] [Revised: 03/06/2020] [Accepted: 03/07/2020] [Indexed: 11/16/2022] Open

Adeyemi JA, Peters SO, De Donato M, Cervantes AP, Ogunade IM. Effects of a blend of Saccharomyces cerevisiae-based direct-fed microbial and fermentation products on plasma carbonyl-metabolome and fecal bacterial community of beef steers. J Anim Sci Biotechnol 2020;11:14. [PMID: 32095237 PMCID: PMC7025411 DOI: 10.1186/s40104-019-0419-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 12/22/2019] [Indexed: 01/08/2023] Open

Abstract

BACKGROUND

Previous studies have evaluated the metabolic status of animals fed direct-fed microbial (DFM) using enzyme-based assays which are time-consuming and limited to a few metabolites. In addition, little emphasis has been placed on investigating the effects of DFM on hindgut microbiota. We examined the effects of dietary supplementation of a blend of Saccharomyces cerevisiae-based DFM and fermentation products on the plasma concentrations of carbonyl-containing metabolites via a metabolomics approach, and fecal bacterial community, via 16S rRNA gene sequencing, of beef steers during a 42-day receiving period. Forty newly weaned steers were randomly assigned to receive a basal diet with no additive (CON; n = 20) or a basal diet supplemented with 19 g of Commence™ (PROB; n = 20) for a 42-day period. Commence™ (PMI, Arden Hills, MN) is a blend of 6.2 × 1011 cfu/g of S. cerevisiae, 3.5 × 1010 cfu/g of a mixture of Enterococcus lactis, Bacillus subtilis, Enterococcus faecium, and Lactobacillus casei, and the fermentation products of these aforementioned microorganisms and those of Aspergillus oryzae and Aspergillus niger. On d 0 and 40, rectal fecal samples were collected randomly from 10 steers from each treatment group. On d 42, blood was collected for plasma preparation.

RESULTS

A total number of 812 plasma metabolites were detected. Up to 305 metabolites [fold change (FC) ≥ 1.5, FDR ≤ 0.01] including glucose, hippuric acid, and 5-hydroxykynurenamine were increased by PROB supplementation, whereas 199 metabolites (FC ≤ 0.63, FDR ≤ 0.01) including acetoacetate were reduced. Supplementation of PROB increased (P ≤ 0.05) the relative abundance of Prevotellaceae UCG-003, Megasphaera, Dorea, Acetitomaculum, and Blautia. In contrast, the relative abundance of Elusimicrobium, Moheibacter, Stenotrophomonas, Comamonas, and uncultured bacterium belonging to family p-2534-18B5 gut group (phylum Bacteroidetes) were reduced (P ≤ 0.05).

CONCLUSIONS

The results of this study demonstrated that supplementation of PROB altered both the plasma carbonyl metabolome towards increased glucose concentration suggesting an improved energy status, and fecal bacterial community, suggesting an increased hindgut fermentation of the beef steers.

Collapse

Pérez-Losada M, Arenas M, Galán JC, Bracho MA, Hillung J, García-González N, González-Candelas F. High-throughput sequencing (HTS) for the analysis of viral populations. INFECTION GENETICS AND EVOLUTION 2020;80:104208. [PMID: 32001386 DOI: 10.1016/j.meegid.2020.104208] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 01/21/2020] [Accepted: 01/24/2020] [Indexed: 12/12/2022]

Tracking microbial evolution in the human gut using Hi-C reveals extensive horizontal gene transfer, persistence and adaptation. Nat Microbiol 2019;5:343-353. [PMID: 31873203 PMCID: PMC6992475 DOI: 10.1038/s41564-019-0625-0] [Citation(s) in RCA: 80] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 10/30/2019] [Indexed: 12/15/2022]

Paez-Espino D, Zhou J, Roux S, Nayfach S, Pavlopoulos GA, Schulz F, McMahon KD, Walsh D, Woyke T, Ivanova NN, Eloe-Fadrosh EA, Tringe SG, Kyrpides NC. Diversity, evolution, and classification of virophages uncovered through global metagenomics. MICROBIOME 2019;7:157. [PMID: 31823797 PMCID: PMC6905037 DOI: 10.1186/s40168-019-0768-5] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 11/11/2019] [Indexed: 05/19/2023]

Sarsaiya S, Shi J, Chen J. Bioengineering tools for the production of pharmaceuticals: current perspective and future outlook. Bioengineered 2019;10:469-492. [PMID: 31656120 PMCID: PMC6844412 DOI: 10.1080/21655979.2019.1682108] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 09/08/2019] [Accepted: 10/11/2019] [Indexed: 01/18/2023] Open

Infant airway microbiota and topical immune perturbations in the origins of childhood asthma. Nat Commun 2019;10:5001. [PMID: 31676759 PMCID: PMC6825176 DOI: 10.1038/s41467-019-12989-7] [Citation(s) in RCA: 93] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 10/14/2019] [Indexed: 12/24/2022] Open

Cruz F, Lagoa D, Mendes J, Rocha I, Ferreira EC, Rocha M, Dias O. SamPler - a novel method for selecting parameters for gene functional annotation routines. BMC Bioinformatics 2019;20:454. [PMID: 31488049 PMCID: PMC6727554 DOI: 10.1186/s12859-019-3038-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 08/21/2019] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

As genome sequencing projects grow rapidly, the diversity of organisms with recently assembled genome sequences peaks at an unprecedented scale, thereby highlighting the need to make gene functional annotations fast and efficient. However, the (high) quality of such annotations must be guaranteed, as this is the first indicator of the genomic potential of every organism. Automatic procedures help accelerating the annotation process, though decreasing the confidence and reliability of the outcomes. Manually curating a genome-wide annotation of genes, enzymes and transporter proteins function is a highly time-consuming, tedious and impractical task, even for the most proficient curator. Hence, a semi-automated procedure, which balances the two approaches, will increase the reliability of the annotation, while speeding up the process. In fact, a prior analysis of the annotation algorithm may leverage its performance, by manipulating its parameters, hastening the downstream processing and the manual curation of assigning functions to genes encoding proteins.

RESULTS

Here SamPler, a novel strategy to select parameters for gene functional annotation routines is presented. This semi-automated method is based on the manual curation of a randomly selected set of genes/proteins. Then, in a multi-dimensional array, this sample is used to assess the automatic annotations for all possible combinations of the algorithm's parameters. These assessments allow creating an array of confusion matrices, for which several metrics are calculated (accuracy, precision and negative predictive value) and used to reach optimal values for the parameters.

CONCLUSIONS

The potential of this methodology is demonstrated with four genome functional annotations performed in merlin, an in-house user-friendly computational framework for genome-scale metabolic annotation and model reconstruction. For that, SamPler was implemented as a new plugin for the merlin tool.

Collapse

Mier P, Andrade-Navarro MA. Toward completion of the Earth's proteome: an update a decade later. Brief Bioinform 2019;20:463-470. [PMID: 29040399 DOI: 10.1093/bib/bbx127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Revised: 09/08/2017] [Indexed: 12/13/2022] Open

Garcia AK, Kaçar B. How to resurrect ancestral proteins as proxies for ancient biogeochemistry. Free Radic Biol Med 2019;140:260-269. [PMID: 30951835 DOI: 10.1016/j.freeradbiomed.2019.03.033] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 02/11/2019] [Accepted: 03/26/2019] [Indexed: 10/27/2022]

Chen KT, Lu CL. CSAR-web: a web server of contig scaffolding using algebraic rearrangements. Nucleic Acids Res 2019;46:W55-W59. [PMID: 29733393 PMCID: PMC6030906 DOI: 10.1093/nar/gky337] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 04/19/2018] [Indexed: 01/23/2023] Open

Klemetsen T, Raknes IA, Fu J, Agafonov A, Balasundaram SV, Tartari G, Robertsen E, Willassen NP. The MAR databases: development and implementation of databases specific for marine metagenomics. Nucleic Acids Res 2019;46:D692-D699. [PMID: 29106641 PMCID: PMC5753341 DOI: 10.1093/nar/gkx1036] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 10/18/2017] [Indexed: 12/03/2022] Open

Graells T, Ishak H, Larsson M, Guy L. The all-intracellular order Legionellales is unexpectedly diverse, globally distributed and lowly abundant. FEMS Microbiol Ecol 2019;94:5110392. [PMID: 30973601 PMCID: PMC6167759 DOI: 10.1093/femsec/fiy185] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 09/08/2018] [Indexed: 12/14/2022] Open

Roux S, Krupovic M, Daly RA, Borges AL, Nayfach S, Schulz F, Sharrar A, Matheus Carnevali PB, Cheng JF, Ivanova NN, Bondy-Denomy J, Wrighton KC, Woyke T, Visel A, Kyrpides NC, Eloe-Fadrosh EA. Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth's biomes. Nat Microbiol 2019;4:1895-1906. [PMID: 31332386 PMCID: PMC6813254 DOI: 10.1038/s41564-019-0510-x] [Citation(s) in RCA: 153] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 06/05/2019] [Indexed: 01/02/2023]

Abstract

Bacteriophages from the Inoviridae family (inoviruses) are characterized by their unique morphology, genome content and infection cycle. One of the most striking features of inoviruses is their ability to establish a chronic infection whereby the viral genome resides within the cell in either an exclusively episomal state or integrated into the host chromosome and virions are continuously released without killing the host. To date, a relatively small number of inovirus isolates have been extensively studied, either for biotechnological applications, such as phage display, or because of their effect on the toxicity of known bacterial pathogens including Vibrio cholerae and Neisseria meningitidis. Here, we show that the current 56 members of the Inoviridae family represent a minute fraction of a highly diverse group of inoviruses. Using a machine learning approach leveraging a combination of marker gene and genome features, we identified 10,295 inovirus-like sequences from microbial genomes and metagenomes. Collectively, our results call for reclassification of the current Inoviridae family into a viral order including six distinct proposed families associated with nearly all bacterial phyla across virtually every ecosystem. Putative inoviruses were also detected in several archaeal genomes, suggesting that, collectively, members of this supergroup infect hosts across the domains Bacteria and Archaea. Finally, we identified an expansive diversity of inovirus-encoded toxin–antitoxin and gene expression modulation systems, alongside evidence of both synergistic (CRISPR evasion) and antagonistic (superinfection exclusion) interactions with co-infecting viruses, which we experimentally validated in a Pseudomonas model. Capturing this previously obscured component of the global virosphere may spark new avenues for microbial manipulation approaches and innovative biotechnological applications.

A machine learning approach was used to recover over 10,000 inovirus-like sequences from existing microbial genomes and metagenomes, consequently proposing the reclassification of the Inoviridae family to a viral order, and uncover the previously unrecognized diversity of these viruses across hosts and environments.

Collapse

Lund JB, List M, Baumbach J. Interactive microbial distribution analysis using BioAtlas. Nucleic Acids Res 2019;45:W509-W513. [PMID: 28460071 PMCID: PMC5570126 DOI: 10.1093/nar/gkx304] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2017] [Accepted: 04/12/2017] [Indexed: 02/01/2023] Open

diCenzo GC, Mengoni A, Perrin E. Chromids Aid Genome Expansion and Functional Diversification in the Family Burkholderiaceae. Mol Biol Evol 2019;36:562-574. [PMID: 30608550 DOI: 10.1093/molbev/msy248] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Abstract

Multipartite genomes, containing at least two large replicons, are found in diverse bacteria; however, the advantage of this genome structure remains incompletely understood. Here, we perform comparative genomics of hundreds of finished β-proteobacterial genomes to gain insights into the role and emergence of multipartite genomes. Almost all essential secondary replicons (chromids) of the β-proteobacteria are found in the family Burkholderiaceae. These replicons arose from just two plasmid acquisition events, and they were likely stabilized early in their evolution by the presence of core genes. On average, Burkholderiaceae genera with multipartite genomes had a larger total genome size, but smaller chromosome, than genera without secondary replicons. Pangenome-level functional enrichment analyses suggested that interreplicon functional biases are partially driven by the enrichment of secondary replicons in the accessory pangenome fraction. Nevertheless, the small overlap in orthologous groups present in each replicon's pangenome indicated a clear functional separation of the replicons. Chromids appeared biased to environmental adaptation, as the functional categories enriched on chromids were also overrepresented on the chromosomes of the environmental genera (Paraburkholderia and Cupriavidus) compared with the pathogenic genera (Burkholderia and Ralstonia). Using ancestral state reconstruction, it was predicted that the rate of accumulation of modern-day genes by chromids was more rapid than the rate of gene accumulation by the chromosomes. Overall, the data are consistent with a model where the primary advantage of secondary replicons is in facilitating increased rates of gene acquisition through horizontal gene transfer, consequently resulting in replicons enriched in genes associated with adaptation to novel environments.

Collapse

Dunivin TK, Yeh SY, Shade A. A global survey of arsenic-related genes in soil microbiomes. BMC Biol 2019;17:45. [PMID: 31146755 PMCID: PMC6543643 DOI: 10.1186/s12915-019-0661-5] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 05/02/2019] [Indexed: 01/21/2023] Open

Abstract

BACKGROUND

Environmental resistomes include transferable microbial genes. One important resistome component is resistance to arsenic, a ubiquitous and toxic metalloid that can have negative and chronic consequences for human and animal health. The distribution of arsenic resistance and metabolism genes in the environment is not well understood. However, microbial communities and their resistomes mediate key transformations of arsenic that are expected to impact both biogeochemistry and local toxicity.

RESULTS

We examined the phylogenetic diversity, genomic location (chromosome or plasmid), and biogeography of arsenic resistance and metabolism genes in 922 soil genomes and 38 metagenomes. To do so, we developed a bioinformatic toolkit that includes BLAST databases, hidden Markov models and resources for gene-targeted assembly of nine arsenic resistance and metabolism genes: acr3, aioA, arsB, arsC (grx), arsC (trx), arsD, arsM, arrA, and arxA. Though arsenic-related genes were common, they were not universally detected, contradicting the common conjecture that all organisms have them. From major clades of arsenic-related genes, we inferred their potential for horizontal and vertical transfer. Different types and proportions of genes were detected across soils, suggesting microbial community composition will, in part, determine local arsenic toxicity and biogeochemistry. While arsenic-related genes were globally distributed, particular sequence variants were highly endemic (e.g., acr3), suggesting dispersal limitation. The gene encoding arsenic methylase arsM was unexpectedly abundant in soil metagenomes (median 48%), suggesting that it plays a prominent role in global arsenic biogeochemistry.

CONCLUSIONS

Our analysis advances understanding of arsenic resistance, metabolism, and biogeochemistry, and our approach provides a roadmap for the ecological investigation of environmental resistomes.

Collapse

Vinatzer BA, Heath LS, Almohri HMJ, Stulberg MJ, Lowe C, Li S. Cyberbiosecurity Challenges of Pathogen Genome Databases. Front Bioeng Biotechnol 2019;7:106. [PMID: 31157218 PMCID: PMC6529814 DOI: 10.3389/fbioe.2019.00106] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 04/25/2019] [Indexed: 01/21/2023] Open

Presnell KV, Alper HS. Systems Metabolic Engineering Meets Machine Learning: A New Era for Data-Driven Metabolic Engineering. Biotechnol J 2019;14:e1800416. [PMID: 30927499 DOI: 10.1002/biot.201800416] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Revised: 02/20/2019] [Indexed: 12/30/2022]

Dutta A, Peoples LM, Gupta A, Bartlett DH, Sar P. Exploring the piezotolerant/piezophilic microbial community and genomic basis of piezotolerance within the deep subsurface Deccan traps. Extremophiles 2019;23:421-433. [PMID: 31049708 DOI: 10.1007/s00792-019-01094-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 04/23/2019] [Indexed: 01/22/2023]

Mathema VB, Dondorp AM, Imwong M. OSTRFPD: Multifunctional Tool for Genome-Wide Short Tandem Repeat Analysis for DNA, Transcripts, and Amino Acid Sequences with Integrated Primer Designer. Evol Bioinform Online 2019;15:1176934319843130. [PMID: 31040636 PMCID: PMC6482647 DOI: 10.1177/1176934319843130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/15/2019] [Indexed: 01/18/2023] Open

Corel E, Méheust R, Watson AK, McInerney JO, Lopez P, Bapteste E. Bipartite Network Analysis of Gene Sharings in the Microbial World. Mol Biol Evol 2019;35:899-913. [PMID: 29346651 PMCID: PMC5888944 DOI: 10.1093/molbev/msy001] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Nguyen TTH, Myrold DD, Mueller RS. Distributions of Extracellular Peptidases Across Prokaryotic Genomes Reflect Phylogeny and Habitat. Front Microbiol 2019;10:413. [PMID: 30891022 PMCID: PMC6411800 DOI: 10.3389/fmicb.2019.00413] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 02/18/2019] [Indexed: 11/19/2022] Open