26
|
Cheng JL, Li J, Qiu YM, Wei CL, Yang LQ, Fu JJ. Development of novel SCAR markers for genetic characterization of Lonicera japonica from high GC-RAMP-PCR and DNA cloning. GENETICS AND MOLECULAR RESEARCH 2016; 15:gmr7737. [PMID: 27173286 DOI: 10.4238/gmr.15027737] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Sequence-characterized amplified region (SCAR) markers were further developed from high-GC primer RAMP-PCR-amplified fragments from Lonicera japonica DNA by molecular cloning. The four DNA fragments from three high-GC primers (FY-27, FY-28, and FY-29) were successfully cloned into a pGM-T vector. The positive clones were sequenced; their names, sizes, and GenBank numbers were JYHGC1-1, 345 bp, KJ620024; YJHGC2-1, 388 bp, KJ620025; JYHGC7-2, 1036 bp, KJ620026; and JYHGC6-2, 715 bp, KJ620027, respectively. Four novel SCAR markers were developed by designing specific primers, optimizing conditions, and PCR validation. The developed SCAR markers were used for the genetic authentication of L. japonica from its substitutes. This technique provides another means of developing DNA markers for the characterization and authentication of various organisms including medicinal plants and their substitutes.
Collapse
|
27
|
van Baren MJ, Bachy C, Reistetter EN, Purvine SO, Grimwood J, Sudek S, Yu H, Poirier C, Deerinck TJ, Kuo A, Grigoriev IV, Wong CH, Smith RD, Callister SJ, Wei CL, Schmutz J, Worden AZ. Evidence-based green algal genomics reveals marine diversity and ancestral characteristics of land plants. BMC Genomics 2016; 17:267. [PMID: 27029936 PMCID: PMC4815162 DOI: 10.1186/s12864-016-2585-6] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2015] [Accepted: 03/11/2016] [Indexed: 01/26/2023] Open
Abstract
Background Prasinophytes are widespread marine green algae that are related to plants. Cellular abundance of the prasinophyte Micromonas has reportedly increased in the Arctic due to climate-induced changes. Thus, studies of these unicellular eukaryotes are important for marine ecology and for understanding Viridiplantae evolution and diversification. Results We generated evidence-based Micromonas gene models using proteomics and RNA-Seq to improve prasinophyte genomic resources. First, sequences of four chromosomes in the 22 Mb Micromonas pusilla (CCMP1545) genome were finished. Comparison with the finished 21 Mb genome of Micromonas commoda (RCC299; named herein) shows they share ≤8,141 of ~10,000 protein-encoding genes, depending on the analysis method. Unlike RCC299 and other sequenced eukaryotes, CCMP1545 has two abundant repetitive intron types and a high percent (26 %) GC splice donors. Micromonas has more genus-specific protein families (19 %) than other genome sequenced prasinophytes (11 %). Comparative analyses using predicted proteomes from other prasinophytes reveal proteins likely related to scale formation and ancestral photosynthesis. Our studies also indicate that peptidoglycan (PG) biosynthesis enzymes have been lost in multiple independent events in select prasinophytes and plants. However, CCMP1545, polar Micromonas CCMP2099 and prasinophytes from other classes retain the entire PG pathway, like moss and glaucophyte algae. Surprisingly, multiple vascular plants also have the PG pathway, except the Penicillin-Binding Protein, and share a unique bi-domain protein potentially associated with the pathway. Alongside Micromonas experiments using antibiotics that halt bacterial PG biosynthesis, the findings highlight unrecognized phylogenetic complexity in PG-pathway retention and implicate a role in chloroplast structure or division in several extant Viridiplantae lineages. Conclusions Extensive differences in gene loss and architecture between related prasinophytes underscore their divergence. PG biosynthesis genes from the cyanobacterial endosymbiont that became the plastid, have been selectively retained in multiple plants and algae, implying a biological function. Our studies provide robust genomic resources for emerging model algae, advancing knowledge of marine phytoplankton and plant evolution. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2585-6) contains supplementary material, which is available to authorized users.
Collapse
|
28
|
Yang WC, Zhu L, Qiu YM, Zhou BX, Cheng JL, Wei CL, Chen HC, Li LY, Fu XD, Fu JJ. Isolation and analysis of cell-free fetal DNA from maternal peripheral blood in Chinese women. GENETICS AND MOLECULAR RESEARCH 2015; 14:18078-89. [PMID: 26782455 DOI: 10.4238/2015.december.22.34] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Non-invasive prenatal diagnosis is used to detect the genetic material of the fetus by isolating the cell-free fetal DNA (cffDNA) from maternal peripheral blood. In order to establish an isolation method for cffDNA from maternal peripheral blood in Chinese women, the cffDNA was acquired with a two-step centrifugation using a QlAamp DNA Blood mini kit. The SRY gene of plasma DNA was amplified by polymerase chain reaction (PCR). Real-time quantitative PCR was used to measure the concentration of cffDNA in maternal peripheral blood in different pregnant women. The results of the SRY gene amplification of plasma DNA from pregnant women was the same as that of the amniocyte DNA. The average concentration of cffDNA in maternal peripheral blood of pregnant women in different gestational stages was 0.98 ng/mL (0.26-1.49 ng/mL), 1.43 ng/mL (0.46- 2.34 ng/mL), and 1.95 ng/mL (0.65-6.81 ng/mL) from early, middle, and late gestational stages, respectively. The mean of cffDNA from total DNA in plasma in different stages of gestation was 22.28% (9.86-27.81%). The lowest concentration of DNA amplified by nested-PCR in our research was 10-4-10-3 ng/μL. The isolation method for cffDNA from maternal peripheral blood was successfully established and further research into its applications will be conducted.
Collapse
|
29
|
Tang Z, Luo OJ, Li X, Zheng M, Zhu JJ, Szalaj P, Trzaskoma P, Magalska A, Wlodarczyk J, Ruszczycki B, Michalski P, Piecuch E, Wang P, Wang D, Tian SZ, Penrad-Mobayed M, Sachs LM, Ruan X, Wei CL, Liu ET, Wilczynski GM, Plewczynski D, Li G, Ruan Y. CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription. Cell 2015; 163:1611-27. [PMID: 26686651 PMCID: PMC4734140 DOI: 10.1016/j.cell.2015.11.024] [Citation(s) in RCA: 645] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2015] [Revised: 09/12/2015] [Accepted: 11/10/2015] [Indexed: 01/09/2023]
Abstract
Spatial genome organization and its effect on transcription remains a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights in the topological mechanism of human variations and diseases.
Collapse
|
30
|
Wei CL, Cheng JL, Yang WC, Li LY, Cheng HC, Fu JJ. Identification of the origin of marker chromosomes by two-color fluorescence in situ hybridization and polymerase chain reaction in azoospermic patients. GENETICS AND MOLECULAR RESEARCH 2015; 14:14488-95. [PMID: 26600507 DOI: 10.4238/2015.november.18.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Y chromosomal microdeletions at the azoospermia factor locus and chromosome abnormalities have been implicated as the major causes of idiopathic male infertility. A marker chromosome is a structurally abnormal chromosome in which no part can be identified by cytogenetics. In this study, to identify the origin of the marker chromosomes and to perform a genetic diagnosis of patients with azoospermia, two-color fluorescence in situ hybridization (FISH) and polymerase chain reaction (PCR) techniques were carried out. The marker chromosomes for the two patients with azoospermia originated in the Y chromosome; it was ascertained that the karyotype of both patients was 46,X, ish del(Y)(q11)(DYZ3+, DXZ1-). The combination of two-color FISH and PCR techniques is an important method for the identification of the origin of marker chromosomes. Thus, genetic counseling and a clear genetic diagnosis of patients with azoospermia before intracytoplasmic sperm injection or other clinical managements are important.
Collapse
|
31
|
Evans J, Crisovan E, Barry K, Daum C, Jenkins J, Kunde-Ramamoorthy G, Nandety A, Ngan CY, Vaillancourt B, Wei CL, Schmutz J, Kaeppler SM, Casler MD, Buell CR. Diversity and population structure of northern switchgrass as revealed through exome capture sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2015; 84:800-15. [PMID: 26426343 DOI: 10.1111/tpj.13041] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 08/31/2015] [Accepted: 09/03/2015] [Indexed: 05/11/2023]
Abstract
Panicum virgatum L. (switchgrass) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuel feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype, composed of tetraploid and octoploid accessions, and a southern lowland ecotype, composed of primarily tetraploid accessions. We employed high-coverage exome capture sequencing (~2.4 Tb) to genotype 537 individuals from 45 upland and 21 lowland populations. From these data, we identified ~27 million single-nucleotide polymorphisms (SNPs), of which 1 590 653 high-confidence SNPs were used in downstream analyses of diversity within and between the populations. From the 66 populations, we identified five primary population groups within the upland and lowland ecotypes, a result that was further supported through genetic distance analysis. We identified conserved, ecotype-restricted, non-synonymous SNPs that are predicted to affect the protein function of CONSTANS (CO) and EARLY HEADING DATE 1 (EHD1), key genes involved in flowering, which may contribute to the phenotypic differences between the two ecotypes. We also identified, relative to the near-reference Kanlow population, 17 228 genes present in more copies than in the reference genome (up-CNVs), 112 630 genes present in fewer copies than in the reference genome (down-CNVs) and 14 430 presence/absence variants (PAVs), affecting a total of 9979 genes, including two upland-specific CNV clusters. In total, 45 719 genes were affected by an SNP, CNV, or PAV across the panel, providing a firm foundation to identify functional variation associated with phenotypic traits of interest for biofuel feedstock production.
Collapse
|
32
|
Ngan CY, Wong CH, Choi C, Yoshinaga Y, Louie K, Jia J, Chen C, Bowen B, Cheng H, Leonelli L, Kuo R, Baran R, García-Cerdán JG, Pratap A, Wang M, Lim J, Tice H, Daum C, Xu J, Northen T, Visel A, Bristow J, Niyogi KK, Wei CL. Lineage-specific chromatin signatures reveal a regulator of lipid metabolism in microalgae. NATURE PLANTS 2015; 1:15107. [PMID: 27250540 DOI: 10.1038/nplants.2015.107] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 06/22/2015] [Indexed: 05/09/2023]
Abstract
Alga-derived lipids represent an attractive potential source of biofuels. However, lipid accumulation in algae is a stress response tightly coupled to growth arrest, thereby imposing a major limitation on productivity. To identify transcriptional regulators of lipid accumulation, we performed an integrative chromatin signature and transcriptomic analysis to decipher the regulation of lipid biosynthesis in the alga Chlamydomonas reinhardtii. Genome-wide histone modification profiling revealed remarkable differences in functional chromatin states between the algae and higher eukaryotes and uncovered regulatory components at the core of lipid accumulation pathways. We identified the transcription factor, PSR1, as a pivotal switch that triggers cytosolic lipid accumulation. Dissection of the PSR1-induced lipid profiles corroborates its role in coordinating multiple lipid-inducing stress responses. The comprehensive maps of functional chromatin signatures in a major clade of eukaryotic life and the discovery of a transcriptional regulator of algal lipid metabolism will facilitate targeted engineering strategies to mediate high lipid production in microalgae.
Collapse
|
33
|
Tsai YS, Jou YC, Tsai YP, Liu BD, Lin HI, Wei CL, Chen SY, Tsai HT, Ou CH, Yang WH, Tzai TS. Development of 3-Hydroxyanthranilic acid-based Integrated Non-invasive Biosensor for Bladder Cancer Detection. UROLOGICAL SCIENCE 2015. [DOI: 10.1016/j.urols.2015.06.200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
34
|
Deyle DR, Hansen RS, Cornea AM, Li LB, Burt AA, Alexander IE, Sandstrom RS, Stamatoyannopoulos JA, Wei CL, Russell DW. A genome-wide map of adeno-associated virus-mediated human gene targeting. Nat Struct Mol Biol 2014; 21:969-75. [PMID: 25282150 PMCID: PMC4405182 DOI: 10.1038/nsmb.2895] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 08/27/2014] [Indexed: 02/03/2023]
Abstract
To determine which genomic features promote homologous recombination, we created a genome-wide map of gene targeting sites. We used an adeno-associated virus vector to target identical loci introduced as transcriptionally active retroviral vectors. A comparison of ~2,000 targeted and untargeted sites showed that targeting occurred throughout the human genome and was not influenced by the presence of nearby CpG islands, sequence repeats or DNase I-hypersensitive sites. Targeted sites were preferentially located within transcription units, especially when the target loci were transcribed in the opposite orientation to their surrounding chromosomal genes. We determined the impact of DNA replication by mapping replication forks, which revealed a preference for recombination at target loci transcribed toward an incoming fork. Our results constitute the first genome-wide screen of gene targeting in mammalian cells and demonstrate a strong recombinogenic effect of colliding polymerases.
Collapse
|
35
|
Walz S, Lorenzin F, Morton J, Wiese KE, von Eyss B, Herold S, Rycak L, Dumay-Odelot H, Karim S, Bartkuhn M, Roels F, Wüstefeld T, Fischer M, Teichmann M, Zender L, Wei CL, Sansom O, Wolf E, Eilers M. Activation and repression by oncogenic MYC shape tumour-specific gene expression profiles. Nature 2014; 511:483-7. [PMID: 25043018 PMCID: PMC6879323 DOI: 10.1038/nature13473] [Citation(s) in RCA: 349] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2013] [Accepted: 05/13/2014] [Indexed: 12/26/2022]
Abstract
In mammalian cells, the MYC oncoprotein binds to thousands of promoters. During mitogenic stimulation of primary lymphocytes, MYC promotes an increase in the expression of virtually all genes. In contrast, MYC-driven tumour cells differ from normal cells in the expression of specific sets of up- and downregulated genes that have considerable prognostic value. To understand this discrepancy, we studied the consequences of inducible expression and depletion of MYC in human cells and murine tumour models. Changes in MYC levels activate and repress specific sets of direct target genes that are characteristic of MYC-transformed tumour cells. Three factors account for this specificity. First, the magnitude of response parallels the change in occupancy by MYC at each promoter. Functionally distinct classes of target genes differ in the E-box sequence bound by MYC, suggesting that different cellular responses to physiological and oncogenic MYC levels are controlled by promoter affinity. Second, MYC both positively and negatively affects transcription initiation independent of its effect on transcriptional elongation. Third, complex formation with MIZ1 (also known as ZBTB17) mediates repression of multiple target genes by MYC and the ratio of MYC and MIZ1 bound to each promoter correlates with the direction of response.
Collapse
|
36
|
Reeve W, Ardley J, Tian R, De Meyer S, Terpolilli J, Melino V, Tiwari R, Yates R, O’Hara G, Howieson J, Ninawi M, Zhang X, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Szeto E, Ivanova N, Pagani I, Pati A, Goodwin L, Woyke T, Kyrpides N. Genome sequence of the Listia angolensis microsymbiont Microvirga lotononidis strain WSM3557(T.). Stand Genomic Sci 2014; 9:540-50. [PMID: 25197439 PMCID: PMC4149032 DOI: 10.4056/sigs.4548266] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Microvirga lotononidis is a recently described species of root-nodule bacteria that is an effective nitrogen- (N2) fixing microsymbiont of the symbiotically specific African legume Listia angolensis (Welw. ex Bak.) B.-E. van Wyk & Boatwr. M. lotononidis possesses several properties that are unusual in root-nodule bacteria, including pigmentation and the ability to grow at temperatures of up to 45°C. Strain WSM3557(T) is an aerobic, motile, Gram-negative, non-spore-forming rod isolated from a L. angolensis root nodule collected in Chipata, Zambia in 1963. This is the first report of a complete genome sequence for the genus Microvirga. Here we describe the features of Microvirga lotononidis strain WSM3557(T), together with genome sequence information and annotation. The 7,082,538 high-quality-draft genome is arranged in 18 scaffolds of 104 contigs, contains 6,956 protein-coding genes and 84 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
37
|
Martin JA, Johnson NV, Gross SM, Schnable J, Meng X, Wang M, Coleman-Derr D, Lindquist E, Wei CL, Kaeppler S, Chen F, Wang Z. A near complete snapshot of the Zea mays seedling transcriptome revealed from ultra-deep sequencing. Sci Rep 2014; 4:4519. [PMID: 24682209 PMCID: PMC3970191 DOI: 10.1038/srep04519] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 02/26/2014] [Indexed: 02/04/2023] Open
Abstract
RNA-sequencing (RNA-seq) enables in-depth exploration of transcriptomes, but typical sequencing depth often limits its comprehensiveness. In this study, we generated nearly 3 billion RNA-Seq reads, totaling 341 Gb of sequence, from a Zea mays seedling sample. At this depth, a near complete snapshot of the transcriptome was observed consisting of over 90% of the annotated transcripts, including lowly expressed transcription factors. A novel hybrid strategy combining de novo and reference-based assemblies yielded a transcriptome consisting of 126,708 transcripts with 88% of expressed known genes assembled to full-length. We improved current annotations by adding 4,842 previously unannotated transcript variants and many new features, including 212 maize transcripts, 201 genes, 10 genes with undocumented potential roles in seedlings as well as maize lineage specific gene fusion events. We demonstrated the power of deep sequencing for large transcriptome studies by generating a high quality transcriptome, which provides a rich resource for the research community.
Collapse
|
38
|
Kieffer-Kwon KR, Tang Z, Mathe E, Qian J, Sung MH, Li G, Resch W, Baek S, Pruett N, Grøntved L, Vian L, Nelson S, Zare H, Hakim O, Reyon D, Yamane A, Nakahashi H, Kovalchuk AL, Zou J, Joung JK, Sartorelli V, Wei CL, Ruan X, Hager GL, Ruan Y, Casellas R. Interactome maps of mouse gene regulatory domains reveal basic principles of transcriptional regulation. Cell 2014; 155:1507-20. [PMID: 24360274 DOI: 10.1016/j.cell.2013.11.039] [Citation(s) in RCA: 221] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Revised: 11/01/2013] [Accepted: 11/25/2013] [Indexed: 12/31/2022]
Abstract
A key finding of the ENCODE project is that the enhancer landscape of mammalian cells undergoes marked alterations during ontogeny. However, the nature and extent of these changes are unclear. As part of the NIH Mouse Regulome Project, we here combined DNaseI hypersensitivity, ChIP-seq, and ChIA-PET technologies to map the promoter-enhancer interactomes of pluripotent ES cells and differentiated B lymphocytes. We confirm that enhancer usage varies widely across tissues. Unexpectedly, we find that this feature extends to broadly transcribed genes, including Myc and Pim1 cell-cycle regulators, which associate with an entirely different set of enhancers in ES and B cells. By means of high-resolution CpG methylomes, genome editing, and digital footprinting, we show that these enhancers recruit lineage-determining factors. Furthermore, we demonstrate that the turning on and off of enhancers during development correlates with promoter activity. We propose that organisms rely on a dynamic enhancer landscape to control basic cellular functions in a tissue-specific manner.
Collapse
|
39
|
Reeve W, De Meyer S, Terpolilli J, Melino V, Ardley J, Tian R, Tiwari R, Howieson J, Yates R, O'Hara G, Ninawi M, Lu M, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Ivanova N, Pagani I, Pati A, Goodwin L, Woyke T, Kyrpides N. Genome sequence of the Ornithopus/Lupinus-nodulating Bradyrhizobium sp. strain WSM471. Stand Genomic Sci 2013; 9:254-63. [PMID: 24976882 PMCID: PMC4062639 DOI: 10.4056/sigs.4498256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Bradyrhizobium sp. strain WSM471 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen- (N2) fixing root nodule formed on the annual legume Ornithopus pinnatus (Miller) Druce growing at Oyster Harbour, Albany district, Western Australia in 1982. This strain is in commercial production as an inoculant for Lupinus and Ornithopus. Here we describe the features of Bradyrhizobium sp. strain WSM471, together with genome sequence information and annotation. The 7,784,016 bp high-quality-draft genome is arranged in 1 scaffold of 2 contigs, contains 7,372 protein-coding genes and 58 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
40
|
Reeve W, De Meyer S, Terpolilli J, Melino V, Ardley J, Rui T, Tiwari R, Howieson J, Yates R, O’Hara G, Lu M, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Szeto E, Ivanova N, Mikhailova N, Ovchinnikova G, Pagani I, Pati A, Goodwin L, Peters L, Pitluck S, Woyke T, Kyrpides N. Genome sequence of the Lebeckia ambigua-nodulating "Burkholderia sprentiae" strain WSM5005(T.). Stand Genomic Sci 2013; 9:385-94. [PMID: 24976894 PMCID: PMC4062628 DOI: 10.4056/sigs.4558268] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
"Burkholderia sprentiae" strain WSM5005(T) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated in Australia from an effective N2-fixing root nodule of Lebeckia ambigua collected in Klawer, Western Cape of South Africa, in October 2007. Here we describe the features of "Burkholderia sprentiae" strain WSM5005(T), together with the genome sequence and its annotation. The 7,761,063 bp high-quality-draft genome is arranged in 8 scaffolds of 236 contigs, contains 7,147 protein-coding genes and 76 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
41
|
Reeve W, Terpolilli J, Melino V, Ardley J, Tian R, De Meyer S, Tiwari R, Yates R, O'Hara G, Howieson J, Ninawi M, Teshima H, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavrommatis K, Markowitz V, Ivanova N, Ovchinnikova G, Pagani I, Pati A, Goodwin L, Peters L, Woyke T, Kyrpides N. Genome sequence of the lupin-nodulating Bradyrhizobium sp. strain WSM1417. Stand Genomic Sci 2013; 9:273-82. [PMID: 24976884 PMCID: PMC4062640 DOI: 10.4056/sigs.4518260] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Bradyrhizobium sp. strain WSM1417 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen (N2) fixing root nodule of Lupinus sp. collected in Papudo, Chile, in 1995. However, this microsymbiont is a poorly effective N2 fixer with the legume host Lupinus angustifolius L.; a lupin species of considerable economic importance in both Chile and Australia. The symbiosis formed with L. angustifolius produces less than half of the dry matter achieved by the symbioses with commercial inoculant strains such as Bradyrhizobium sp. strain WSM471. Therefore, WSM1417 is an important candidate strain with which to investigate the genetics of effective N2 fixation in the lupin-bradyrhizobia symbioses. Here we describe the features of Bradyrhizobium sp. strain WSM1417, together with genome sequence information and annotation. The 8,048,963 bp high-quality-draft genome is arranged in a single scaffold of 2 contigs, contains 7,695 protein-coding genes and 77 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
42
|
Reeve W, Melino V, Ardley J, Tian R, De Meyer S, Terpolilli J, Tiwari R, Yates R, O'Hara G, Howieson J, Ninawi M, Held B, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Szeto E, Ivanova N, Mikhailova N, Pagani I, Pati A, Goodwin L, Woyke T, Kyrpides N. Genome sequence of the Trifolium rueppellianum -nodulating Rhizobium leguminosarum bv. trifolii strain WSM2012. Stand Genomic Sci 2013; 9:283-93. [PMID: 24976885 PMCID: PMC4062638 DOI: 10.4056/sigs.4528262] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Rhizobium leguminosarum bv. trifolii WSM2012 (syn. MAR1468) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an ineffective root nodule recovered from the roots of the annual clover Trifolium rueppellianum Fresen growing in Ethiopia. WSM2012 has a narrow, specialized host range for N2-fixation. Here we describe the features of R. leguminosarum bv. trifolii strain WSM2012, together with genome sequence information and annotation. The 7,180,565 bp high-quality-draft genome is arranged into 6 scaffolds of 68 contigs, contains 7,080 protein-coding genes and 86 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
43
|
Reeve W, Tian R, De Meyer S, Melino V, Terpolilli J, Ardley J, Tiwari R, Howieson J, Yates R, O'Hara G, Ninawi M, Teshima H, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Ivanova N, Ovchinnikova G, Pagani I, Pati A, Goodwin L, Pitluck S, Woyke T, Kyrpides N. Genome sequence of the clover-nodulating Rhizobium leguminosarum bv. trifolii strain TA1. Stand Genomic Sci 2013; 9:243-53. [PMID: 24976881 PMCID: PMC4062637 DOI: 10.4056/sigs.4488254] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Rhizobium leguminosarum bv. trifolii strain TA1 is an aerobic, motile, Gram-negative, non-spore-forming rod that is an effective nitrogen fixing microsymbiont on the perennial clovers originating from Europe and the Mediterranean basin. TA1 however is ineffective with many annual and perennial clovers originating from Africa and America. Here we describe the features of R. leguminosarum bv. trifolii strain TA1, together with genome sequence information and annotation. The 8,618,824 bp high-quality-draft genome is arranged in a 6 scaffold of 32 contigs, contains 8,493 protein-coding genes and 83 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
44
|
Reeve W, Terpolilli J, Melino V, Ardley J, Tian R, De Meyer S, Tiwari R, Yates R, O'Hara G, Howieson J, Ninawi M, Held B, Bruce D, Detter C, Tapia R, Han C, Wei CL, Huntemann M, Han J, Chen IM, Mavromatis K, Markowitz V, Ivanova N, Ovchinnikova G, Pagani I, Pati A, Goodwin L, Woyke T, Kyrpides N. Genome sequence of the South American clover-nodulating Rhizobium leguminosarum bv. trifolii strain WSM597. Stand Genomic Sci 2013; 9:264-72. [PMID: 24976883 PMCID: PMC4062625 DOI: 10.4056/sigs.4508258] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Rhizobium leguminosarum bv. trifolii strain WSM597 is an aerobic, motile, Gram-negative, non-spore-forming rod isolated from a root nodule of the annual clover Trifolium pallidum L. growing at Glencoe Research Station near Tacuarembó, Uruguay. This strain is generally ineffective for nitrogen (N2) fixation with clovers of Mediterranean, North American and African origin, but is effective on the South American perennial clover T. polymorphum Poir. Here we describe the features of R. leguminosarum bv. trifolii strain WSM597, together with genome sequence information and annotation. The 7,634,384 bp high-quality-draft genome is arranged in 2 scaffolds of 53 contigs, contains 7,394 protein-coding genes and 87 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.
Collapse
|
45
|
Kruse T, Maillard J, Goodwin L, Woyke T, Teshima H, Bruce D, Detter C, Tapia R, Han C, Huntemann M, Wei CL, Han J, Chen A, Kyrpides N, Szeto E, Markowitz V, Ivanova N, Pagani I, Pati A, Pitluck S, Nolan M, Holliger C, Smidt H. Complete genome sequence of Dehalobacter restrictus PER-K23(T.). Stand Genomic Sci 2013; 8:375-88. [PMID: 24501624 PMCID: PMC3910700 DOI: 10.4056/sigs.3787426] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Dehalobacter restrictus strain PER-K23 (DSM 9455) is the type strain of the species Dehalobacter restrictus. D. restrictus strain PER-K23 grows by organohalide respiration, coupling the oxidation of H2 to the reductive dechlorination of tetra- or trichloroethene. Growth has not been observed with any other electron donor or acceptor, nor has fermentative growth been shown. Here we introduce the first full genome of a pure culture within the genus Dehalobacter. The 2,943,336 bp long genome contains 2,826 protein coding and 82 RNA genes, including 5 16S rRNA genes. Interestingly, the genome contains 25 predicted reductive dehalogenase genes, the majority of which appear to be full length. The reductive dehalogenase genes are mainly located in two clusters, suggesting a much larger potential for organohalide respiration than previously anticipated.
Collapse
|
46
|
Wall LG, Beauchemin N, Cantor MN, Chaia E, Chen A, Detter JC, Furnholm T, Ghodhbane-Gtari F, Goodwin L, Gtari M, Han C, Han J, Huntemann M, Hua SX, Ivanova N, Kyrpides N, Markowitz V, Mavrommatis K, Mikhailova N, Nordberg HP, Nouioui I, Ovchinnikova G, Pagani I, Pati A, Sen A, Sur S, Szeto E, Thakur S, Wei CL, Woyke T, Tisa LS. Draft Genome Sequence of Frankia sp. Strain BCU110501, a Nitrogen-Fixing Actinobacterium Isolated from Nodules of Discaria trinevis. GENOME ANNOUNCEMENTS 2013; 1:e00503-13. [PMID: 23846281 PMCID: PMC3709158 DOI: 10.1128/genomea.00503-13] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/07/2013] [Accepted: 06/07/2013] [Indexed: 11/20/2022]
Abstract
Frankia forms a nitrogen-fixing symbiosis with actinorhizal plants. We report a draft genome sequence for Frankia sp. strain BCU110501, a nitrogen-fixing actinobacterium isolated from nodules of Discaria trinevis grown in the Patagonia region of Argentina.
Collapse
|
47
|
Murugapiran SK, Huntemann M, Wei CL, Han J, Detter JC, Han C, Erkkila TH, Teshima H, Chen A, Kyrpides N, Mavrommatis K, Markowitz V, Szeto E, Ivanova N, Pagani I, Pati A, Goodwin L, Peters L, Pitluck S, Lam J, McDonald AI, Dodsworth JA, Woyke T, Hedlund BP. Thermus oshimai JL-2 and T. thermophilus JL-18 genome analysis illuminates pathways for carbon, nitrogen, and sulfur cycling. Stand Genomic Sci 2013; 7:449-68. [PMID: 24019992 PMCID: PMC3764938 DOI: 10.4056/sigs.3667269] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
The complete genomes of Thermus oshimai JL-2 and T. thermophilus JL-18 each consist of a circular chromosome, 2.07 Mb and 1.9 Mb, respectively, and two plasmids ranging from 0.27 Mb to 57.2 kb. Comparison of the T. thermophilus JL-18 chromosome with those from other strains of T. thermophilus revealed a high degree of synteny, whereas the megaplasmids from the same strains were highly plastic. The T. oshimai JL-2 chromosome and megaplasmids shared little or no synteny with other sequenced Thermus strains. Phylogenomic analyses using a concatenated set of conserved proteins confirmed the phylogenetic and taxonomic assignments based on 16S rRNA phylogenetics. Both chromosomes encode a complete glycolysis, tricarboxylic acid (TCA) cycle, and pentose phosphate pathway plus glucosidases, glycosidases, proteases, and peptidases, highlighting highly versatile heterotrophic capabilities. Megaplasmids of both strains contained a gene cluster encoding enzymes predicted to catalyze the sequential reduction of nitrate to nitrous oxide; however, the nitrous oxide reductase required for the terminal step in denitrification was absent, consistent with their incomplete denitrification phenotypes. A sox gene cluster was identified in both chromosomes, suggesting a mode of chemolithotrophy. In addition, nrf and psr gene clusters in T. oshmai JL-2 suggest respiratory nitrite ammonification and polysulfide reduction as possible modes of anaerobic respiration.
Collapse
|
48
|
Lim JQ, Tennakoon C, Li G, Wong E, Ruan Y, Wei CL, Sung WK. BatMeth: improved mapper for bisulfite sequencing reads on DNA methylation. Genome Biol 2012; 13:R82. [PMID: 23034162 PMCID: PMC3491410 DOI: 10.1186/gb-2012-13-10-r82] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Accepted: 10/03/2012] [Indexed: 12/20/2022] Open
Abstract
DNA methylation plays a crucial role in higher organisms. Coupling bisulfite treatment with next generation sequencing enables the interrogation of 5-methylcytosine sites in the genome. However, bisulfite conversion introduces mismatches between the reads and the reference genome, which makes mapping of Illumina and SOLiD reads slow and inaccurate. BatMeth is an algorithm that integrates novel Mismatch Counting, List Filtering, Mismatch Stage Filtering and Fast Mapping onto Two Indexes components to improve unique mapping rate, speed and precision. Experimental results show that BatMeth is faster and more accurate than existing tools. BatMeth is freely available at http://code.google.com/p/batmeth/.
Collapse
|
49
|
Yao F, Ariyaratne PN, Hillmer AM, Lee WH, Li G, Teo ASM, Woo XY, Zhang Z, Chen JP, Poh WT, Zawack KFB, Chan CS, Leong ST, Neo SC, Choi PSD, Gao S, Nagarajan N, Thoreau H, Shahab A, Ruan X, Cacheux-Rataboul V, Wei CL, Bourque G, Sung WK, Liu ET, Ruan Y. Long span DNA paired-end-tag (DNA-PET) sequencing strategy for the interrogation of genomic structural mutations and fusion-point-guided reconstruction of amplicons. PLoS One 2012; 7:e46152. [PMID: 23029419 PMCID: PMC3461012 DOI: 10.1371/journal.pone.0046152] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 08/28/2012] [Indexed: 01/23/2023] Open
Abstract
Structural variations (SVs) contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. While genomic DNA paired-end-tag (DNA-PET) sequencing is an attractive approach to identify genomic SVs, the current application of PET sequencing with short insert size DNA can be insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We employed a recently developed procedure to generate PET sequencing data using large DNA inserts of 10–20 kb and compared their characteristics with short insert (1 kb) libraries for their ability to identify SVs. Our results suggest that although short insert libraries bear an advantage in identifying small deletions, they do not provide significantly better breakpoint resolution. In contrast, large inserts are superior to short inserts in providing higher physical genome coverage for the same sequencing cost and achieve greater sensitivity, in practice, for the identification of several classes of SVs, such as copy number neutral and complex events. Furthermore, our results confirm that large insert libraries allow for the identification of SVs within repetitive sequences, which cannot be spanned by short inserts. This provides a key advantage in studying rearrangements in cancer, and we show how it can be used in a fusion-point-guided-concatenation algorithm to study focally amplified regions in cancer.
Collapse
|
50
|
Heisig J, Weber D, Englberger E, Winkler A, Kneitz S, Sung WK, Wolf E, Eilers M, Wei CL, Gessler M. Target gene analysis by microarrays and chromatin immunoprecipitation identifies HEY proteins as highly redundant bHLH repressors. PLoS Genet 2012; 8:e1002728. [PMID: 22615585 PMCID: PMC3355086 DOI: 10.1371/journal.pgen.1002728] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2011] [Accepted: 04/05/2012] [Indexed: 01/03/2023] Open
Abstract
HEY bHLH transcription factors have been shown to regulate multiple key steps in cardiovascular development. They can be induced by activated NOTCH receptors, but other upstream stimuli mediated by TGFß and BMP receptors may elicit a similar response. While the basic and helix-loop-helix domains exhibit strong similarity, large parts of the proteins are still unique and may serve divergent functions. The striking overlap of cardiac defects in HEY2 and combined HEY1/HEYL knockout mice suggested that all three HEY genes fulfill overlapping function in target cells. We therefore sought to identify target genes for HEY proteins by microarray expression and ChIPseq analyses in HEK293 cells, cardiomyocytes, and murine hearts. HEY proteins were found to modulate expression of their target gene to a rather limited extent, but with striking functional interchangeability between HEY factors. Chromatin immunoprecipitation revealed a much greater number of potential binding sites that again largely overlap between HEY factors. Binding sites are clustered in the proximal promoter region especially of transcriptional regulators or developmental control genes. Multiple lines of evidence suggest that HEY proteins primarily act as direct transcriptional repressors, while gene activation seems to be due to secondary or indirect effects. Mutagenesis of putative DNA binding residues supports the notion of direct DNA binding. While class B E-box sequences (CACGYG) clearly represent preferred target sequences, there must be additional and more loosely defined modes of DNA binding since many of the target promoters that are efficiently bound by HEY proteins do not contain an E-box motif. These data clearly establish the three HEY bHLH factors as highly redundant transcriptional repressors in vitro and in vivo, which explains the combinatorial action observed in different tissues with overlapping expression. NOTCH signaling is a central developmental pathway that influences a multitude of cell fate decisions and differentiation steps as well as later tissue homeostasis and regeneration. The three HEY genes encode basic helix-loop-helix transcription factors that are critical effectors to convey signaling by NOTCH receptors and similar signaling systems. This is underscored by the multitude of developmental defects observed in HEY single- and double-mutant mice. The mode of action of HEY proteins remained largely unexplored, however. By gene expression analysis and chromatin immunoprecipitation we have now identified a large set of HEY target genes. While only 500–2,000 mRNAs are regulated by HEY1 or HEY2, there are around 10,000 binding sites in the genome. HEY proteins act as transcriptional repressors that bind close to transcriptional start sites in all cases tested. In contrast, gene activation seems to be mediated by indirect/secondary mechanisms. The extent of regulation is rather limited, implicating HEY genes in modulating rather than switching on or off target gene expression. All our in vitro and in vivo data point to a high degree of redundancy between the three HEY genes, suggesting that tissue specific patterns and expression levels determine the final outcome of HEY induced cellular responses.
Collapse
|