1
|
Targeted genomic sequencing with probe capture for discovery and surveillance of coronaviruses in bats. eLife 2022; 11:79777. [DOI: 10.7554/elife.79777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 10/18/2022] [Indexed: 11/11/2022] Open
Abstract
Public health emergencies like SARS, MERS, and COVID-19 have prioritized surveillance of zoonotic coronaviruses, resulting in extensive genomic characterization of coronavirus diversity in bats. Sequencing viral genomes directly from animal specimens remains a laboratory challenge, however, and most bat coronaviruses have been characterized solely by PCR amplification of small regions from the best-conserved gene. This has resulted in limited phylogenetic resolution and left viral genetic factors relevant to threat assessment undescribed. In this study, we evaluated whether a technique called hybridization probe capture can achieve more extensive genome recovery from surveillance specimens. Using a custom panel of 20,000 probes, we captured and sequenced coronavirus genomic material in 21 swab specimens collected from bats in the Democratic Republic of the Congo. For 15 of these specimens, probe capture recovered more genome sequence than had been previously generated with standard amplicon sequencing protocols, providing a median 6.1-fold improvement (ranging up to 69.1-fold). Probe capture data also identified five novel alpha- and betacoronaviruses in these specimens, and their full genomes were recovered with additional deep sequencing. Based on these experiences, we discuss how probe capture could be effectively operationalized alongside other sequencing technologies for high-throughput, genomics-based discovery and surveillance of bat coronaviruses.
Collapse
|
2
|
Wildlife in Cameroon harbor diverse coronaviruses, including many closely related to human coronavirus 229E. Virus Evol 2022; 8:veab110. [PMID: 35233291 PMCID: PMC8867583 DOI: 10.1093/ve/veab110] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 12/05/2021] [Accepted: 01/07/2022] [Indexed: 11/13/2022] Open
Abstract
Zoonotic spillover of animal viruses into human populations is a continuous and increasing public health risk. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) highlights the global impact of emergence. Considering the history and diversity of coronaviruses (CoVs), especially in bats, SARS-CoV-2 will likely not be the last to spillover from animals into human populations. We sampled and tested wildlife in the Central African country Cameroon to determine which CoVs are circulating and how they relate to previously detected human and animal CoVs. We collected animal and ecological data at sampling locations and used family-level consensus PCR combined with amplicon sequencing for virus detection. Between 2003 and 2018, samples were collected from 6,580 animals of several different orders. CoV RNA was detected in 175 bats, a civet, and a shrew. The CoV RNAs detected in the bats represented 17 different genetic clusters, coinciding with alpha (n = 8) and beta (n = 9) CoVs. Sequences resembling human CoV-229E (HCoV-229E) were found in 40 Hipposideridae bats. Phylogenetic analyses place the human-derived HCoV-229E isolates closest to those from camels in terms of the S and N genes but closest to isolates from bats for the envelope, membrane, and RNA-dependent RNA polymerase genes. The CoV RNA positivity rate in bats varied significantly (P < 0.001) between the wet (8.2 per cent) and dry seasons (4.5 per cent). Most sampled species accordingly had a wet season high and dry season low, while for some the opposite was found. Eight of the suspected CoV species of which we detected RNA appear to be entirely novel CoV species, which suggests that CoV diversity in African wildlife is still rather poorly understood. The detection of multiple different variants of HCoV-229E-like viruses supports the bat reservoir hypothesis for this virus, with the phylogenetic results casting some doubt on camels as an intermediate host. The findings also support the previously proposed influence of ecological factors on CoV circulation, indicating a high level of underlying complexity to the viral ecology. These results indicate the importance of investing in surveillance activities among wild animals to detect all potential threats as well as sentinel surveillance among exposed humans to determine emerging threats.
Collapse
|
3
|
Coronavirus surveillance in wildlife from two Congo basin countries detects RNA of multiple species circulating in bats and rodents. PLoS One 2021; 16:e0236971. [PMID: 34106949 PMCID: PMC8189465 DOI: 10.1371/journal.pone.0236971] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Accepted: 05/19/2021] [Indexed: 12/21/2022] Open
Abstract
Coronaviruses play an important role as pathogens of humans and animals, and the emergence of epidemics like SARS, MERS and COVID-19 is closely linked to zoonotic transmission events primarily from wild animals. Bats have been found to be an important source of coronaviruses with some of them having the potential to infect humans, with other animals serving as intermediate or alternate hosts or reservoirs. Host diversity may be an important contributor to viral diversity and thus the potential for zoonotic events. To date, limited research has been done in Africa on this topic, in particular in the Congo Basin despite frequent contact between humans and wildlife in this region. We sampled and, using consensus coronavirus PCR-primers, tested 3,561 wild animals for coronavirus RNA. The focus was on bats (38%), rodents (38%), and primates (23%) that posed an elevated risk for contact with people, and we found coronavirus RNA in 121 animals, of which all but two were bats. Depending on the taxonomic family, bats were significantly more likely to be coronavirus RNA-positive when sampled either in the wet (Pteropodidae and Rhinolophidae) or dry season (Hipposideridae, Miniopteridae, Molossidae, and Vespertilionidae). The detected RNA sequences correspond to 15 alpha- and 6 betacoronaviruses, with some of them being very similar (>95% nucleotide identities) to known coronaviruses and others being more unique and potentially representing novel viruses. In seven of the bats, we detected RNA most closely related to sequences of the human common cold coronaviruses 229E or NL63 (>80% nucleotide identities). The findings highlight the potential for coronavirus spillover, especially in regions with a high diversity of bats and close human contact, and reinforces the need for ongoing surveillance.
Collapse
|
4
|
Evaluation of bat adenoviruses suggests co-evolution and host roosting behaviour as drivers for diversity. Microb Genom 2021; 7:000561. [PMID: 33871330 PMCID: PMC8208681 DOI: 10.1099/mgen.0.000561] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Accepted: 03/15/2021] [Indexed: 11/19/2022] Open
Abstract
Adenoviruses (AdVs) are diverse pathogens of humans and animals, with several dozen bat AdVs already identified. Considering that over 100 human AdVs are known, and the huge diversity of bat species, many bat AdVs likely remain undiscovered. To learn more about AdV prevalence, diversity and evolution, we sampled and tested bats in Cameroon using several PCR assays for viral and host DNA. AdV DNA was detected in 14 % of the 671 sampled animals belonging to 37 different bat species. There was a correlation between species roosting in larger groups and AdV DNA detection. The detected AdV DNA belonged to between 28 and 44 different, mostly previously unknown, mastadenovirus species. The novel isolates are phylogenetically diverse and while some cluster with known viruses, others appear to form divergent new clusters. The phylogenetic tree of novel and previously known bat AdVs does not mirror that of the various host species, but does contain structures consistent with a degree of virus-host co-evolution. Given that closely related isolates were found in different host species, it seems likely that at least some bat AdVs have jumped species barriers, probably in the more recent past; however, the tree is also consistent with such events having taken place throughout bat AdV evolution. AdV diversity was highest in bat species roosting in large groups. The study significantly increased the diversity of AdVs known to be harboured by bats, and suggests that host behaviours, such as roosting size, may be what limits some AdVs to one species rather than an inability of AdVs to infect other related hosts.
Collapse
|
5
|
Coronavirus surveillance of wildlife in the Lao People's Democratic Republic detects viral RNA in rodents. Arch Virol 2020; 165:1869-1875. [PMID: 32488616 PMCID: PMC7265875 DOI: 10.1007/s00705-020-04683-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 04/27/2020] [Indexed: 12/16/2022]
Abstract
Coronaviruses can become zoonotic, as in the case of COVID-19, and hunting, sale, and consumption of wild animals in Southeast Asia increases the risk for such incidents. We sampled and tested rodents (851) and other mammals and found betacoronavirus RNA in 12 rodents. The sequences belong to two separate genetic clusters and are closely related to those of known rodent coronaviruses detected in the region and distantly related to those of human coronaviruses OC43 and HKU1. Considering the close human-wildlife contact with many species in and beyond the region, a better understanding of virus diversity is urgently needed for the mitigation of future risks.
Collapse
|
6
|
Evidence of Low-Level Dengue Virus Circulation in the South Region of Cameroon in 2018. Vector Borne Zoonotic Dis 2020; 20:314-317. [PMID: 31934826 DOI: 10.1089/vbz.2019.2531] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Dengue fever is an understudied disease in many parts of Africa and little is known about its prevalence in Cameroon. We tested blood from 629 individuals from the South Region of Cameroon, collected over the course of one year, for flavivirus RNA using conventional broad range PCR. Flavivirus RNA corresponding to dengue virus (DENV) serotype 1 was identified in two individuals who were also diagnosed with malaria. This finding confirms previous reports that indicate the presence of low-level circulation of DENV in Cameroon and supports the concern that dengue fever may be underdiagnosed due to more prevalent diseases that have similar symptomology and insufficient diagnostic capacity.
Collapse
|
7
|
Analysis of adenovirus DNA detected in rodent species from the Democratic Republic of the Congo indicates potentially novel adenovirus types. New Microbes New Infect 2019; 34:100640. [PMID: 32025309 PMCID: PMC6997563 DOI: 10.1016/j.nmni.2019.100640] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 12/16/2019] [Accepted: 12/24/2019] [Indexed: 11/25/2022] Open
Abstract
Different species of adenoviruses (AdVs) infect humans and animals and are known for their role as pathogens, especially in humans, with animals, primarily rodents, often serving as model systems. However, although we know over 100 types of human AdVs, we know comparatively little about the diversity of animal AdVs. Due to the fact that rodents are the most diverse family of mammals and a standard model system for human disease, we set out to sample African rodents native to the Democratic Republic of the Congo and test them for AdV DNA using a semi-nested consensus PCR. A total of 775 animals were tested, and viral DNA was detected in four of them. The AdV DNA found belongs to three different AdVs, all being closely related to murine adenovirus 2 (MAdV-2). Considering the genetic differences of the amplicon were 9%, 11% and 19% from MAdV-2 and at least 10% from each other, they seem to belong to up to three different novel types within the Murine mastadenovirus B species. This evidence of genetic diversity highlights the opportunities to isolate and study additional AdVs that infect rodents as models for AdV biology and pathology.
Collapse
|
8
|
CRAGE enables rapid activation of biosynthetic gene clusters in undomesticated bacteria. Nat Microbiol 2019; 4:2498-2510. [PMID: 31611640 DOI: 10.1038/s41564-019-0573-8] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 08/27/2019] [Indexed: 12/11/2022]
Abstract
It is generally believed that exchange of secondary metabolite biosynthetic gene clusters (BGCs) among closely related bacteria is an important driver of BGC evolution and diversification. Applying this idea may help researchers efficiently connect many BGCs to their products and characterize the products' roles in various environments. However, existing genetic tools support only a small fraction of these efforts. Here, we present the development of chassis-independent recombinase-assisted genome engineering (CRAGE), which enables single-step integration of large, complex BGC constructs directly into the chromosomes of diverse bacteria with high accuracy and efficiency. To demonstrate the efficacy of CRAGE, we expressed three known and six previously identified but experimentally elusive non-ribosomal peptide synthetase (NRPS) and NRPS-polyketide synthase (PKS) hybrid BGCs from Photorhabdus luminescens in 25 diverse γ-Proteobacteria species. Successful activation of six BGCs identified 22 products for which diversity and yield were greater when the BGCs were expressed in strains closely related to the native strain than when they were expressed in either native or more distantly related strains. Activation of these BGCs demonstrates the feasibility of exploiting their underlying catalytic activity and plasticity, and provides evidence that systematic approaches based on CRAGE will be useful for discovering and identifying previously uncharacterized metabolites.
Collapse
|
9
|
DNA of diverse adenoviruses detected in Cameroonian rodent and shrew species. Arch Virol 2019; 164:2359-2366. [PMID: 31240484 DOI: 10.1007/s00705-019-04323-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Accepted: 05/22/2019] [Indexed: 11/28/2022]
Abstract
Rodent adenoviruses are important models for human disease. In contrast to the over 70 adenovirus types isolated from humans, few rodent adenoviruses are known, despite the vast diversity of rodent species. PCR and Sanger sequencing were used to investigate adenovirus diversity in wild rodents and shrews in Cameroon. Adenovirus DNA was detected in 13.8% of animals (n = 218). All detected sequences differ from known adenovirus types by more than 10% at the amino acid level, thus indicating up to 14 novel adenovirus species. These results highlight the diversity of rodent adenoviruses, their phylogeny, and opportunities for studying alternative adenovirus rodent models.
Collapse
|
10
|
Author Correction: Rhizosphere microbiome structure alters to enable wilt resistance in tomato. Nat Biotechnol 2019; 36:1117. [PMID: 30412196 DOI: 10.1038/nbt1118-1117] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
High Herpesvirus Diversity in Wild Rodent and Shrew Species in Central Africa. Intervirology 2018; 61:155-165. [PMID: 30448834 DOI: 10.1159/000493796] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Accepted: 09/16/2018] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE Herpesviruses belong to a diverse order of large DNA viruses that can cause diseases in humans and animals. With the goal of gathering information about the distribution and diversity of herpesviruses in wild rodent and shrew species in central Africa, animals in Cameroon and the Democratic Republic of the Congo were sampled and tested by PCR for the presence of herpesvirus DNA. METHODS A broad range PCRs targeting either the Polymerase or the terminase gene were used for virus detection. Amplified products from PCR were sequenced and isolates analysed for phylogenetic placement. RESULTS Overall, samples of 1,004 animals of various rodent and shrew species were tested and 24 were found to be positive for herpesvirus DNA. Six of these samples contained strains of known viruses, while the other positive samples revealed DNA sequences putatively belonging to 11 previously undescribed herpesviruses. The new isolates are beta- and gammaherpesviruses and the shrew isolates appear to form a separate cluster within the Betaherpesvirinae subfamily. CONCLUSION The diversity of viruses detected is higher than in similar studies in Europe and Asia. The high diversity of rodent and shrew species occurring in central Africa may be the reason for a higher diversity in herpesviruses in this area.
Collapse
|
12
|
Rhizosphere microbiome structure alters to enable wilt resistance in tomato. Nat Biotechnol 2018; 36:nbt.4232. [PMID: 30295674 DOI: 10.1038/nbt.4232] [Citation(s) in RCA: 307] [Impact Index Per Article: 51.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Accepted: 08/01/2018] [Indexed: 11/09/2022]
Abstract
Tomato variety Hawaii 7996 is resistant to the soil-borne pathogen Ralstonia solanacearum, whereas the Moneymaker variety is susceptible to the pathogen. To evaluate whether plant-associated microorganisms have a role in disease resistance, we analyzed the rhizosphere microbiomes of both varieties in a mesocosm experiment. Microbiome structures differed between the two cultivars. Transplantation of rhizosphere microbiota from resistant plants suppressed disease symptoms in susceptible plants. Comparative analyses of rhizosphere metagenomes from resistant and susceptible plants enabled the identification and assembly of a flavobacterial genome that was far more abundant in the resistant plant rhizosphere microbiome than in that of the susceptible plant. We cultivated this flavobacterium, named TRM1, and found that it could suppress R. solanacearum-disease development in a susceptible plant in pot experiments. Our findings reveal a role for native microbiota in protecting plants from microbial pathogens, and our approach charts a path toward the development of probiotics to ameliorate plant diseases.
Collapse
|
13
|
The discovery of Bombali virus adds further support for bats as hosts of ebolaviruses. Nat Microbiol 2018; 3:1084-1089. [PMID: 30150734 PMCID: PMC6557442 DOI: 10.1038/s41564-018-0227-2] [Citation(s) in RCA: 211] [Impact Index Per Article: 35.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 07/25/2018] [Indexed: 11/08/2022]
Abstract
Here we describe the complete genome of a new ebolavirus, Bombali virus (BOMV) detected in free-tailed bats in Sierra Leone (little free-tailed (Chaerephon pumilus) and Angolan free-tailed (Mops condylurus)). The bats were found roosting inside houses, indicating the potential for human transmission. We show that the viral glycoprotein can mediate entry into human cells. However, further studies are required to investigate whether exposure has actually occurred or if BOMV is pathogenic in humans.
Collapse
|
14
|
DNA indicative of human bocaviruses detected in non-human primates in the Democratic Republic of the Congo. J Gen Virol 2018; 99:676-681. [PMID: 29583115 DOI: 10.1099/jgv.0.001048] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Bocaparvoviruses are members of the family Parvovirinae and human bocaviruses have been found to be associated with respiratory and gastrointestinal disease. There are four known human bocaviruses, as well as several distinct ones in great apes. The goal of the presented study was to detect other non-human primate (NHP) bocaviruses in NHP species in the Democratic Republic of the Congo using conventional broad-range PCR. We found bocavirus DNA in blood and tissues samples in 6 out of 620 NHPs, and all isolates showed very high identity (>97 %) with human bocaviruses 2 or 3. These findings suggest cross-species transmission of bocaviruses between humans and NHPs.
Collapse
|
15
|
Building a global atlas of zoonotic viruses. Bull World Health Organ 2018; 96:292-294. [PMID: 29695886 PMCID: PMC5872013 DOI: 10.2471/blt.17.205005] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Revised: 12/04/2017] [Accepted: 02/07/2018] [Indexed: 01/12/2023] Open
|
16
|
|
17
|
Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software. Nat Methods 2017; 14:1063-1071. [PMID: 28967888 DOI: 10.1101/099127] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 08/25/2017] [Indexed: 05/25/2023]
Abstract
Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.
Collapse
|
18
|
Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software. Nat Methods 2017; 14:1063-1071. [PMID: 28967888 DOI: 10.1038/nmeth.4458] [Citation(s) in RCA: 430] [Impact Index Per Article: 61.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 08/25/2017] [Indexed: 12/12/2022]
Abstract
Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.
Collapse
|
19
|
Transfer RNAs with novel cloverleaf structures. Nucleic Acids Res 2017; 45:2776-2785. [PMID: 28076288 PMCID: PMC5389517 DOI: 10.1093/nar/gkw898] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 09/30/2016] [Indexed: 01/16/2023] Open
Abstract
We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter T-stem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function as missense and nonsense suppressor tRNAs and/or regulatory noncoding RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species in Escherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.
Collapse
|
20
|
Gene and transcript abundances of bacterial type III secretion systems from the rumen microbiome are correlated with methane yield in sheep. BMC Res Notes 2017; 10:367. [PMID: 28789673 PMCID: PMC5549432 DOI: 10.1186/s13104-017-2671-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Accepted: 07/22/2017] [Indexed: 11/28/2022] Open
Abstract
Background Ruminants are important contributors to global methane emissions via microbial fermentation in their reticulo-rumens. This study is part of a larger program, characterising the rumen microbiomes of sheep which vary naturally in methane yield (g CH4/kg DM/day) and aims to define differences in microbial communities, and in gene and transcript abundances that can explain the animal methane phenotype. Methods Rumen microbiome metagenomic and metatranscriptomic data were analysed by Gene Set Enrichment, sparse partial least squares regression and the Wilcoxon Rank Sum test to estimate correlations between specific KEGG bacterial pathways/genes and high methane yield in sheep. KEGG genes enriched in high methane yield sheep were reassembled from raw reads and existing contigs and analysed by MEGAN to predict their phylogenetic origin. Protein coding sequences from Succinivibrio dextrinosolvens strains were analysed using Effective DB to predict bacterial type III secreted proteins. The effect of S. dextrinosolvens strain H5 growth on methane formation by rumen methanogens was explored using co-cultures. Results Detailed analysis of the rumen microbiomes of high methane yield sheep shows that gene and transcript abundances of bacterial type III secretion system genes are positively correlated with methane yield in sheep. Most of the bacterial type III secretion system genes could not be assigned to a particular bacterial group, but several genes were affiliated with the genus Succinivibrio, and searches of bacterial genome sequences found that strains of S. dextrinosolvens were part of a small group of rumen bacteria that encode this type of secretion system. In co-culture experiments, S. dextrinosolvens strain H5 showed a growth-enhancing effect on a methanogen belonging to the order Methanomassiliicoccales, and inhibition of a representative of the Methanobrevibacter gottschalkii clade. Conclusions This is the first report of bacterial type III secretion system genes being associated with high methane emissions in ruminants, and identifies these secretions systems as potential new targets for methane mitigation research. The effects of S. dextrinosolvens on the growth of rumen methanogens in co-cultures indicate that bacteria-methanogen interactions are important modulators of methane production in ruminant animals. Electronic supplementary material The online version of this article (doi:10.1186/s13104-017-2671-0) contains supplementary material, which is available to authorized users.
Collapse
|
21
|
Progressive Loss of Function in a Limb Enhancer during Snake Evolution. Cell 2016; 167:633-642.e11. [PMID: 27768887 DOI: 10.1016/j.cell.2016.09.028] [Citation(s) in RCA: 190] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2016] [Revised: 08/07/2016] [Accepted: 09/15/2016] [Indexed: 01/08/2023]
Abstract
The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. We identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologs caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution. PAPERCLIP.
Collapse
|
22
|
Rumen metagenome and metatranscriptome analyses of low methane yield sheep reveals a Sharpea-enriched microbiome characterised by lactic acid formation and utilisation. MICROBIOME 2016; 4:56. [PMID: 27760570 PMCID: PMC5069950 DOI: 10.1186/s40168-016-0201-2] [Citation(s) in RCA: 178] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Accepted: 10/06/2016] [Indexed: 05/25/2023]
Abstract
BACKGROUND Enteric fermentation by farmed ruminant animals is a major source of methane and constitutes the second largest anthropogenic contributor to global warming. Reducing methane emissions from ruminants is needed to ensure sustainable animal production in the future. Methane yield varies naturally in sheep and is a heritable trait that can be used to select animals that yield less methane per unit of feed eaten. We previously demonstrated elevated expression of hydrogenotrophic methanogenesis pathway genes of methanogenic archaea in the rumens of high methane yield (HMY) sheep compared to their low methane yield (LMY) counterparts. Methane production in the rumen is strongly connected to microbial hydrogen production through fermentation processes. In this study, we investigate the contribution that rumen bacteria make to methane yield phenotypes in sheep. RESULTS Using deep sequence metagenome and metatranscriptome datasets in combination with 16S rRNA gene amplicon sequencing from HMY and LMY sheep, we show enrichment of lactate-producing Sharpea spp. in LMY sheep bacterial communities. Increased gene and transcript abundances for sugar import and utilisation and production of lactate, propionate and butyrate were also observed in LMY animals. Sharpea azabuensis and Megasphaera spp. act as important drivers of lactate production and utilisation according to phylogenetic analysis and read mappings. CONCLUSIONS Our findings show that the rumen microbiome in LMY animals supports a rapid heterofermentative growth, leading to lactate production. We postulate that lactate is subsequently metabolised mainly to butyrate in LMY animals, producing 2 mol of hydrogen and 0.5 mol of methane per mol hexose, which represents 24 % less than the 0.66 mol of methane formed from the 2.66 mol of hydrogen produced if hexose fermentation was directly to acetate and butyrate. These findings are consistent with the theory that a smaller rumen size with a higher turnover rate, where rapid heterofermentative growth would be an advantage, results in lower hydrogen production and lower methane formation. Together with previous methanogen gene expression data, this builds a strong concept of how animal traits and microbial communities shape the methane phenotype in sheep.
Collapse
|
23
|
Genome-wide compendium and functional assessment of in vivo heart enhancers. Nat Commun 2016; 7:12923. [PMID: 27703156 PMCID: PMC5059478 DOI: 10.1038/ncomms12923] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 08/16/2016] [Indexed: 12/04/2022] Open
Abstract
Whole-genome sequencing is identifying growing numbers of non-coding variants in human disease studies, but the lack of accurate functional annotations prevents their interpretation. We describe the genome-wide landscape of distant-acting enhancers active in the developing and adult human heart, an organ whose impairment is a predominant cause of mortality and morbidity. Using integrative analysis of >35 epigenomic data sets from mouse and human pre- and postnatal hearts we created a comprehensive reference of >80,000 putative human heart enhancers. To illustrate the importance of enhancers in the regulation of genes involved in heart disease, we deleted the mouse orthologs of two human enhancers near cardiac myosin genes. In both cases, we observe in vivo expression changes and cardiac phenotypes consistent with human heart disease. Our study provides a comprehensive catalogue of human heart enhancers for use in clinical whole-genome sequencing studies and highlights the importance of enhancers for cardiac function. Identification of non-coding variants has outstripped our ability to annotate and interpret them. Dickel et al. present a compendium of over 80,000 putative human heart enhancers and demonstrate that two conserved enhancers are required for proper cardiac function in mice.
Collapse
|
24
|
Abstract
Abstract
High throughput next generation sequencing technologies have enabled cultivation-independent approaches to study microbial
communities in environmental samples. To date much of functional metagenomics has been limited to the gene or pathway
level. Recent breakthroughs in metagenome binning have made it feasible to reconstruct high quality, individual microbial
genomes from complex communities with thousands of species. In this review we aim to compare several automated metagenome
binning software tools for their performance, and provide a practical guide for the metagenomics research community to
carry out successful binning analyses.
Collapse
|
25
|
Abstract
Selenocysteine (Sec or U) is encoded by UGA, a stop codon reassigned by a Sec-specific elongation factor and a distinctive RNA structure. To discover possible code variations in extant organisms we analyzed 6.4 trillion base pairs of metagenomic sequences and 24 903 microbial genomes for tRNA(Sec) species. As expected, UGA is the predominant Sec codon in use. We also found tRNA(Sec) species that recognize the stop codons UAG and UAA, and ten sense codons. Selenoprotein synthesis programmed by UAG in Geodermatophilus and Blastococcus, and by the Cys codon UGU in Aeromonas salmonicida was confirmed by metabolic labeling with (75) Se or mass spectrometry. Other tRNA(Sec) species with different anticodons enabled E. coli to synthesize active formate dehydrogenase H, a selenoenzyme. This illustrates the ease by which the genetic code may evolve new coding schemes, possibly aiding organisms to adapt to changing environments, and show the genetic code is much more flexible than previously thought.
Collapse
|
26
|
|
27
|
Abstract
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems. DNA methylation is a chemical modification of DNA present in many prokaryotic genomes. The best-known role of DNA methylation is as a component of restriction-modification systems. In these systems, restriction enzymes target foreign DNA for cleavage, while DNA methylation protects the host genome from destruction. Studies in a handful of organisms show that DNA methylation may also act independently of restriction systems and function in genome regulation. However, a lack of technologies has limited the study of DNA methylation to a small number of organisms, and the broader patterns and functions of DNA methylation remain unknown. Here we use SMRT-sequencing to determine the genome wide DNA methylation patterns of more than 200 diverse bacteria and archaea. We show that DNA methylation is pervasive and present in more than 90% of studied organisms. Analysis of this data enabled annotation of the specific DNA binding sites of more than 600 restriction systems, revealing their extraordinary diversity. Strikingly, we observed widespread DNA methylation in the absence of restriction systems. Analyses of these patterns reveal that they are conserved through evolution, and likely function in genome regulation. Thus DNA methylation may play a far wider function in prokaryotic genome biology than was previously supposed.
Collapse
|
28
|
Global metagenomic survey reveals a new bacterial candidate phylum in geothermal springs. Nat Commun 2016; 7:10476. [PMID: 26814032 PMCID: PMC4737851 DOI: 10.1038/ncomms10476] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Accepted: 12/07/2015] [Indexed: 01/01/2023] Open
Abstract
Analysis of the increasing wealth of metagenomic data collected from diverse environments can lead to the discovery of novel branches on the tree of life. Here we analyse 5.2 Tb of metagenomic data collected globally to discover a novel bacterial phylum (‘Candidatus Kryptonia') found exclusively in high-temperature pH-neutral geothermal springs. This lineage had remained hidden as a taxonomic ‘blind spot' because of mismatches in the primers commonly used for ribosomal gene surveys. Genome reconstruction from metagenomic data combined with single-cell genomics results in several high-quality genomes representing four genera from the new phylum. Metabolic reconstruction indicates a heterotrophic lifestyle with conspicuous nutritional deficiencies, suggesting the need for metabolic complementarity with other microbes. Co-occurrence patterns identifies a number of putative partners, including an uncultured Armatimonadetes lineage. The discovery of Kryptonia within previously studied geothermal springs underscores the importance of globally sampled metagenomic data in detection of microbial novelty, and highlights the extraordinary diversity of microbial life still awaiting discovery. The analysis of existing metagenomic data can lead to discovery of new microorganisms. Here, Eloe-Fadrosh et al. perform a large-scale analysis of global metagenomic data, followed by genome reconstruction and single-cell genomics, to describe a new bacterial phylum that inhabits geothermal springs.
Collapse
|
29
|
Comparative metagenomics reveals impact of contaminants on groundwater microbiomes. Front Microbiol 2015; 6:1205. [PMID: 26583008 PMCID: PMC4628106 DOI: 10.3389/fmicb.2015.01205] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Accepted: 10/16/2015] [Indexed: 11/15/2022] Open
Abstract
To understand patterns of geochemical cycling in pristine versus contaminated groundwater ecosystems, pristine shallow groundwater (FW301) and contaminated groundwater (FW106) samples from the Oak Ridge Integrated Field Research Center (OR-IFRC) were sequenced and compared to each other to determine phylogenetic and metabolic difference between the communities. Proteobacteria (e.g., Burkholderia, Pseudomonas) are the most abundant lineages in the pristine community, though a significant proportion ( >55%) of the community is composed of poorly characterized low abundance (individually <1%) lineages. The phylogenetic diversity of the pristine community contributed to a broader diversity of metabolic networks than the contaminated community. In addition, the pristine community encodes redundant and mostly complete geochemical cycles distributed over multiple lineages and appears capable of a wide range of metabolic activities. In contrast, many geochemical cycles in the contaminated community appear truncated or minimized due to decreased biodiversity and dominance by Rhodanobacter populations capable of surviving the combination of stresses at the site. These results indicate that the pristine site contains more robust and encodes more functional redundancy than the stressed community, which contributes to more efficient nutrient cycling and adaptability than the stressed community.
Collapse
|
30
|
|
31
|
Phylogenomically guided identification of industrially relevant GH1 β-glucosidases through DNA synthesis and nanostructure-initiator mass spectrometry. ACS Chem Biol 2014; 9:2082-91. [PMID: 24984213 PMCID: PMC4168791 DOI: 10.1021/cb500244v] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Harnessing the biotechnological potential of the large number of proteins available in sequence databases requires scalable methods for functional characterization. Here we propose a workflow to address this challenge by combining phylogenomic guided DNA synthesis with high-throughput mass spectrometry and apply it to the systematic characterization of GH1 β-glucosidases, a family of enzymes necessary for biomass hydrolysis, an important step in the conversion of lignocellulosic feedstocks to fuels and chemicals. We synthesized and expressed 175 GH1s, selected from over 2000 candidate sequences to cover maximum sequence diversity. These enzymes were functionally characterized over a range of temperatures and pHs using nanostructure-initiator mass spectrometry (NIMS), generating over 10,000 data points. When combined with HPLC-based sugar profiling, we observed GH1 enzymes active over a broad temperature range and toward many different β-linked disaccharides. For some GH1s we also observed activity toward laminarin, a more complex oligosaccharide present as a major component of macroalgae. An area of particular interest was the identification of GH1 enzymes compatible with the ionic liquid 1-ethyl-3-methylimidazolium acetate ([C2mim][OAc]), a next-generation biomass pretreatment technology. We thus searched for GH1 enzymes active at 70 °C and 20% (v/v) [C2mim][OAc] over the course of a 24-h saccharification reaction. Using our unbiased approach, we identified multiple enzymes of different phylogentic origin with such activities. Our approach of characterizing sequence diversity through targeted gene synthesis coupled to high-throughput screening technologies is a broadly applicable paradigm for a wide range of biological problems.
Collapse
|
32
|
Tissue-specific RNA expression marks distant-acting developmental enhancers. PLoS Genet 2014; 10:e1004610. [PMID: 25188404 PMCID: PMC4154669 DOI: 10.1371/journal.pgen.1004610] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2014] [Accepted: 07/16/2014] [Indexed: 12/11/2022] Open
Abstract
Short non-coding transcripts can be transcribed from distant-acting transcriptional enhancer loci, but the prevalence of such enhancer RNAs (eRNAs) within the transcriptome, and the association of eRNA expression with tissue-specific enhancer activity in vivo remain poorly understood. Here, we investigated the expression dynamics of tissue-specific non-coding RNAs in embryonic mouse tissues via deep RNA sequencing. Overall, approximately 80% of validated in vivo enhancers show tissue-specific RNA expression that correlates with tissue-specific enhancer activity. Globally, we identified thousands of tissue-specifically transcribed non-coding regions (TSTRs) displaying various genomic hallmarks of bona fide enhancers. In transgenic mouse reporter assays, over half of tested TSTRs functioned as enhancers with reproducible activity in the predicted tissue. Together, our results demonstrate that tissue-specific eRNA expression is a common feature of in vivo enhancers, as well as a major source of extragenic transcription, and that eRNA expression signatures can be used to predict tissue-specific enhancers independent of known epigenomic enhancer marks. Up to 80% of mammalian genomes are actively transcribed, producing large numbers of non-coding RNAs without known functions. One particularly exciting category of such non-coding transcripts are the recently discovered enhancer RNAs (eRNAs) transcribed from distant-acting enhancer elements. Studies in cell-based paradigms suggest a functional requirement for such eRNA in enhancer-mediated gene regulation. In this study, we explored the in vivo expression dynamics of tissue-specific non-coding RNAs in embryonic mouse tissues via in-depth transcriptome profiling. Our results suggest that enhancers may be a predominant function associated with differentially expressed non-coding loci across developing tissues, and that differential eRNA expression signatures from total RNA-Seq can be used to identify uncharacterized tissue-specific in vivo enhancers independent of known epigenomic marks. Our results highlight the widespread and potentially important role of eRNAs in orchestrating gene expression and the necessity for functional studies in interpreting genome-wide enhancer predictions.
Collapse
|
33
|
Abstract
The canonical genetic code is assumed to be deeply conserved across all domains of life with very few exceptions. By scanning 5.6 trillion base pairs of metagenomic data for stop codon reassignment events, we detected recoding in a substantial fraction of the >1700 environmental samples examined. We observed extensive opal and amber stop codon reassignments in bacteriophages and of opal in bacteria. Our data indicate that bacteriophages can infect hosts with a different genetic code and demonstrate phage-host antagonism based on code differences. The abundance and diversity of genetic codes present in environmental organisms should be considered in the design of engineered organisms with altered genetic codes in order to preclude the exchange of genetic information with naturally occurring species.
Collapse
|
34
|
Abstract
Ruminant livestock represent the single largest anthropogenic source of the potent greenhouse gas methane, which is generated by methanogenic archaea residing in ruminant digestive tracts. While differences between individual animals of the same breed in the amount of methane produced have been observed, the basis for this variation remains to be elucidated. To explore the mechanistic basis of this methane production, we measured methane yields from 22 sheep, which revealed that methane yields are a reproducible, quantitative trait. Deep metagenomic and metatranscriptomic sequencing demonstrated a similar abundance of methanogens and methanogenesis pathway genes in high and low methane emitters. However, transcription of methanogenesis pathway genes was substantially increased in sheep with high methane yields. These results identify a discrete set of rumen methanogens whose methanogenesis pathway transcription profiles correlate with methane yields and provide new targets for CH4 mitigation at the levels of microbiota composition and transcriptional regulation.
Collapse
|
35
|
Abstract
The SMARCA4 (also known as BRG1 in humans) chromatin remodeling factor is critical for establishing lineage-specific chromatin states during early mammalian development. However, the role of SMARCA4 in tissue-specific gene regulation during embryogenesis remains poorly defined. To investigate the genome-wide binding landscape of SMARCA4 in differentiating tissues, we engineered a Smarca4FLAG knock-in mouse line. Using ChIP-seq, we identified ∼51,000 SMARCA4-associated regions across six embryonic mouse tissues (forebrain, hindbrain, neural tube, heart, limb, and face) at mid-gestation (E11.5). The majority of these regions was distal from promoters and showed dynamic occupancy, with most distal SMARCA4 sites (73%) confined to a single or limited subset of tissues. To further characterize these regions, we profiled active and repressive histone marks in the same tissues and examined the intersection of informative chromatin states and SMARCA4 binding. This revealed distinct classes of distal SMARCA4-associated elements characterized by activating and repressive chromatin signatures that were associated with tissue-specific up- or down-regulation of gene expression and relevant active/repressed biological pathways. We further demonstrate the predicted active regulatory properties of SMARCA4-associated elements by retrospective analysis of tissue-specific enhancers and direct testing of SMARCA4-bound regions in transgenic mouse assays. Our results indicate a dual active/repressive function of SMARCA4 at distal regulatory sequences in vivo and support its role in tissue-specific gene regulation during embryonic development.
Collapse
|
36
|
Rapid and pervasive changes in genome-wide enhancer usage during mammalian development. Cell 2014; 155:1521-31. [PMID: 24360275 DOI: 10.1016/j.cell.2013.11.033] [Citation(s) in RCA: 260] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 10/28/2013] [Accepted: 11/22/2013] [Indexed: 12/26/2022]
Abstract
Enhancers are distal regulatory elements that can activate tissue-specific gene expression and are abundant throughout mammalian genomes. Although substantial progress has been made toward genome-wide annotation of mammalian enhancers, their temporal activity patterns and global contributions in the context of developmental in vivo processes remain poorly explored. Here we used epigenomic profiling for H3K27ac, a mark of active enhancers, coupled to transgenic mouse assays to examine the genome-wide utilization of enhancers in three different mouse tissues across seven developmental stages. The majority of the ∼90,000 enhancers identified exhibited tightly temporally restricted predicted activity windows and were associated with stage-specific biological functions and regulatory pathways in individual tissues. Comparative genomic analysis revealed that evolutionary conservation of enhancers decreases following midgestation across all tissues examined. The dynamic enhancer activities uncovered in this study illuminate rapid and pervasive temporal in vivo changes in enhancer usage that underlie processes central to development and disease.
Collapse
|
37
|
Abstract
Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics.
Collapse
|
38
|
Abstract
The shape of the human face and skull is largely genetically determined. However, the genomic basis of craniofacial morphology is incompletely understood and hypothesized to involve protein-coding genes, as well as gene regulatory sequences. We used a combination of epigenomic profiling, in vivo characterization of candidate enhancer sequences in transgenic mice, and targeted deletion experiments to examine the role of distant-acting enhancers in craniofacial development. We identified complex regulatory landscapes consisting of enhancers that drive spatially complex developmental expression patterns. Analysis of mouse lines in which individual craniofacial enhancers had been deleted revealed significant alterations of craniofacial shape, demonstrating the functional importance of enhancers in defining face and skull morphology. These results demonstrate that enhancers are involved in craniofacial development and suggest that enhancer sequence variation contributes to the diversity of human facial morphology.
Collapse
|
39
|
A high-resolution enhancer atlas of the developing telencephalon. Cell 2013; 152:895-908. [PMID: 23375746 DOI: 10.1016/j.cell.2012.12.041] [Citation(s) in RCA: 181] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Revised: 10/31/2012] [Accepted: 12/20/2012] [Indexed: 11/25/2022]
Abstract
The mammalian telencephalon plays critical roles in cognition, motor function, and emotion. Though many of the genes required for its development have been identified, the distant-acting regulatory sequences orchestrating their in vivo expression are mostly unknown. Here, we describe a digital atlas of in vivo enhancers active in subregions of the developing telencephalon. We identified more than 4,600 candidate embryonic forebrain enhancers and studied the in vivo activity of 329 of these sequences in transgenic mouse embryos. We generated serial sets of histological brain sections for 145 reproducible forebrain enhancers, resulting in a publicly accessible web-based data collection comprising more than 32,000 sections. We also used epigenomic analysis of human and mouse cortex tissue to directly compare the genome-wide enhancer architecture in these species. These data provide a primary resource for investigating gene regulatory mechanisms of telencephalon development and enable studies of the role of distant-acting enhancers in neurodevelopmental disorders.
Collapse
|
40
|
Abstract
Owing to the investigative power of classical mouse genetics and the development of transgenic and gene-targeting technologies, the mouse has increasingly been used to study atherogenesis. Investigators have manipulated the plasma levels of several proteins involved in lipid metabolism through the engineering of mice and assessed the impact of these changes on the progression of murine atherogenesis. These studies have established causal relationships between the expression of defined genetic elements and atherosclerosis in a manner not feasible in humans. Studies of transgenic mice with genetically altered levels of apolipoproteins A-I, (a), and E have provided important insights into the in vivo role of these proteins in atherogenesis. These studies illustrate the potential to dissect the complex genetics of atherosclerosis susceptibility with the use of transgenic and gene-targeted mice.
Collapse
|
41
|
Metagenome, metatranscriptome and single-cell sequencing reveal microbial response to Deepwater Horizon oil spill. ISME JOURNAL 2012; 6:1715-27. [PMID: 22717885 PMCID: PMC3498917 DOI: 10.1038/ismej.2012.59] [Citation(s) in RCA: 337] [Impact Index Per Article: 28.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The Deepwater Horizon oil spill in the Gulf of Mexico resulted in a deep-sea hydrocarbon plume that caused a shift in the indigenous microbial community composition with unknown ecological consequences. Early in the spill history, a bloom of uncultured, thus uncharacterized, members of the Oceanospirillales was previously detected, but their role in oil disposition was unknown. Here our aim was to determine the functional role of the Oceanospirillales and other active members of the indigenous microbial community using deep sequencing of community DNA and RNA, as well as single-cell genomics. Shotgun metagenomic and metatranscriptomic sequencing revealed that genes for motility, chemotaxis and aliphatic hydrocarbon degradation were significantly enriched and expressed in the hydrocarbon plume samples compared with uncontaminated seawater collected from plume depth. In contrast, although genes coding for degradation of more recalcitrant compounds, such as benzene, toluene, ethylbenzene, total xylenes and polycyclic aromatic hydrocarbons, were identified in the metagenomes, they were expressed at low levels, or not at all based on analysis of the metatranscriptomes. Isolation and sequencing of two Oceanospirillales single cells revealed that both cells possessed genes coding for n-alkane and cycloalkane degradation. Specifically, the near-complete pathway for cyclohexane oxidation in the Oceanospirillales single cells was elucidated and supported by both metagenome and metatranscriptome data. The draft genome also included genes for chemotaxis, motility and nutrient acquisition strategies that were also identified in the metagenomes and metatranscriptomes. These data point towards a rapid response of members of the Oceanospirillales to aliphatic hydrocarbons in the deep sea.
Collapse
|
42
|
A vast collection of microbial genes that are toxic to bacteria. Genome Res 2012; 22:802-9. [PMID: 22300632 PMCID: PMC3317161 DOI: 10.1101/gr.133850.111] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 01/25/2012] [Indexed: 11/25/2022]
Abstract
In the process of clone-based genome sequencing, initial assemblies frequently contain cloning gaps that can be resolved using cloning-independent methods, but the reason for their occurrence is largely unknown. By analyzing 9,328,693 sequencing clones from 393 microbial genomes, we systematically mapped more than 15,000 genes residing in cloning gaps and experimentally showed that their expression products are toxic to the Escherichia coli host. A subset of these toxic sequences was further evaluated through a series of functional assays exploring the mechanisms of their toxicity. Among these genes, our assays revealed novel toxins and restriction enzymes, and new classes of small, non-coding toxic RNAs that reproducibly inhibit E. coli growth. Further analyses also revealed abundant, short, toxic DNA fragments that were predicted to suppress E. coli growth by interacting with the replication initiator DnaA. Our results show that cloning gaps, once considered the result of technical problems, actually serve as a rich source for the discovery of biotechnologically valuable functions, and suggest new modes of antimicrobial interventions.
Collapse
MESH Headings
- Anti-Bacterial Agents/metabolism
- Anti-Bacterial Agents/pharmacology
- Bacterial Proteins/genetics
- Bacterial Proteins/metabolism
- Base Sequence
- Binding Sites/genetics
- Cloning, Molecular
- DNA, Bacterial/genetics
- DNA, Bacterial/metabolism
- DNA, Bacterial/pharmacology
- DNA-Binding Proteins/genetics
- DNA-Binding Proteins/metabolism
- Escherichia coli/genetics
- Escherichia coli/metabolism
- Gene Expression Regulation, Bacterial
- Genes, Bacterial/genetics
- Genome, Bacterial/genetics
- Microbial Viability/drug effects
- Microbial Viability/genetics
- Molecular Sequence Data
- Protein Binding
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Bacterial/pharmacology
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- RNA, Transfer/pharmacology
- Sequence Homology, Nucleic Acid
- Transcription, Genetic
Collapse
|
43
|
Titelbild: High-Throughput In Vitro Glycoside Hydrolase (HIGH) Screening for Enzyme Discovery (Angew. Chem. 47/2011). Angew Chem Int Ed Engl 2011. [DOI: 10.1002/ange.201106146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
44
|
Cover Picture: High-Throughput In Vitro Glycoside Hydrolase (HIGH) Screening for Enzyme Discovery (Angew. Chem. Int. Ed. 47/2011). Angew Chem Int Ed Engl 2011. [DOI: 10.1002/anie.201106146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
45
|
Metagenomic analysis of a permafrost microbial community reveals a rapid response to thaw. Nature 2011; 480:368-71. [PMID: 22056985 DOI: 10.1038/nature10576] [Citation(s) in RCA: 371] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Accepted: 09/20/2011] [Indexed: 02/06/2023]
Abstract
Permafrost contains an estimated 1672 Pg carbon (C), an amount roughly equivalent to the total currently contained within land plants and the atmosphere. This reservoir of C is vulnerable to decomposition as rising global temperatures cause the permafrost to thaw. During thaw, trapped organic matter may become more accessible for microbial degradation and result in greenhouse gas emissions. Despite recent advances in the use of molecular tools to study permafrost microbial communities, their response to thaw remains unclear. Here we use deep metagenomic sequencing to determine the impact of thaw on microbial phylogenetic and functional genes, and relate these data to measurements of methane emissions. Metagenomics, the direct sequencing of DNA from the environment, allows the examination of whole biochemical pathways and associated processes, as opposed to individual pieces of the metabolic puzzle. Our metagenome analyses reveal that during transition from a frozen to a thawed state there are rapid shifts in many microbial, phylogenetic and functional gene abundances and pathways. After one week of incubation at 5 °C, permafrost metagenomes converge to be more similar to each other than while they are frozen. We find that multiple genes involved in cycling of C and nitrogen shift rapidly during thaw. We also construct the first draft genome from a complex soil metagenome, which corresponds to a novel methanogen. Methane previously accumulated in permafrost is released during thaw and subsequently consumed by methanotrophic bacteria. Together these data point towards the importance of rapid cycling of methane and nitrogen in thawing permafrost.
Collapse
|
46
|
High-Throughput In Vitro Glycoside Hydrolase (HIGH) Screening for Enzyme Discovery. Angew Chem Int Ed Engl 2011. [DOI: 10.1002/ange.201104685] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
47
|
High-Throughput In Vitro Glycoside Hydrolase (HIGH) Screening for Enzyme Discovery. Angew Chem Int Ed Engl 2011; 50:11215-8. [DOI: 10.1002/anie.201104685] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2011] [Indexed: 11/06/2022]
|
48
|
Abstract
The paucity of enzymes that efficiently deconstruct plant polysaccharides represents a major bottleneck for industrial-scale conversion of cellulosic biomass into biofuels. Cow rumen microbes specialize in degradation of cellulosic plant material, but most members of this complex community resist cultivation. To characterize biomass-degrading genes and genomes, we sequenced and analyzed 268 gigabases of metagenomic DNA from microbes adherent to plant fiber incubated in cow rumen. From these data, we identified 27,755 putative carbohydrate-active genes and expressed 90 candidate proteins, of which 57% were enzymatically active against cellulosic substrates. We also assembled 15 uncultured microbial genomes, which were validated by complementary methods including single-cell genome sequencing. These data sets provide a substantially expanded catalog of genes and genomes participating in the deconstruction of cellulosic biomass.
Collapse
|
49
|
ChIP-Seq identification of weakly conserved heart enhancers. Nat Genet 2010; 42:806-10. [PMID: 20729851 DOI: 10.1038/ng.650] [Citation(s) in RCA: 337] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2010] [Accepted: 07/22/2010] [Indexed: 01/29/2023]
Abstract
Accurate control of tissue-specific gene expression plays a pivotal role in heart development, but few cardiac transcriptional enhancers have thus far been identified. Extreme noncoding-sequence conservation has successfully predicted enhancers that are active in many tissues but has failed to identify substantial numbers of heart-specific enhancers. Here, we used ChIP-Seq with the enhancer-associated protein p300 from mouse embryonic day 11.5 heart tissue to identify over 3,000 candidate heart enhancers genome wide. Compared to enhancers active in other tissues we studied at this time point, most candidate heart enhancers were less deeply conserved in vertebrate evolution. Nevertheless, transgenic mouse assays of 130 candidate regions revealed that most function reproducibly as enhancers active in the heart, irrespective of their degree of evolutionary constraint. These results provide evidence for a large population of poorly conserved heart enhancers and suggest that the evolutionary conservation of embryonic enhancers can vary depending on tissue type.
Collapse
|
50
|
Genomic approaches for enhancer identification. FASEB J 2010. [DOI: 10.1096/fasebj.24.1_supplement.414.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|