101
|
Zhang Y, Du L, Liu A, Chen J, Wu L, Hu W, Zhang W, Kim K, Lee SC, Yang TJ, Wang Y. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses. FRONTIERS IN PLANT SCIENCE 2016; 7:306. [PMID: 27014326 PMCID: PMC4791396 DOI: 10.3389/fpls.2016.00306] [Citation(s) in RCA: 130] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2016] [Accepted: 02/26/2016] [Indexed: 05/18/2023]
Abstract
Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants.
Collapse
|
102
|
Ma C, Ma H, Zhang H, Feng C, Wei H, Wang W, Chen W, Zhang F, Ma L. The complete mitochondrial genome sequence and gene organization of the rainbow runner (Elagatis bipinnulata) (Perciformes: Carangidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 28:5-6. [PMID: 26709669 DOI: 10.3109/19401736.2015.1106516] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The complete mitochondrial genome information can play an important role in species identification, phylogeny, evolution research, genetic differentiation, and diversity. Here we determined the complete mitochondrial genome sequence of Elagatis bipinnulata (Perciformes: Carangidae). This circular genome was 16 542 bp in length, and included all 37 typical mitochondrial genes, containing 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and a putative control region. The gene order of E. bipinnulata was identical to that observed in most other vertebrates. Of 37 genes, 28 were encoded by heavy strand, while the other ones were encoded by light strand. According to the phylogenetic analysis based on 13 concatenated protein-coding genes, E. bipinnulata was genetically closer to the species of genus Seriola compared with any other species within Perciformes. This work can provide helpful data for further studies on population genetic diversity and molecular evolution.
Collapse
|
103
|
Ma H, Ma C, Zhang H, Zhang X, Feng C, Zhang Y, Wang W, Chen W, Ma L. Characterization of the complete mitochondrial genome and phylogenetic relationship of Caranx tille (Perciformes: Carangidae). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4704-4705. [PMID: 26643511 DOI: 10.3109/19401736.2015.1106510] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In the present study, we determined the complete mitochondrial genome sequence of Caranx tille by using Illumina and Sanger sequencing technology that was 16 593 bp in size. This mitochondrial genome was made up of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and a putative control region. Of these 37 genes, 28 were encoded by heavy strand, while nine were encoded by light strand. Three types of start codons and four kinds of stop codons were separately detected in 13 protein-coding genes. The overall base composition of this genome was 28.95% for A, 15.84% for G, 26.26% for T, and 28.95% for C, with a slight higher A + T content of 55.21%. From the phylogenetic tree constructed based on 13 concatenated protein-coding genes, we can see that C. tille, C. melampygus and C. ignobilis grouped together and formed a single cluster. In addition, C. tille was genetically closest to C. melampygus. This work should facilitate the studies on molecular evolution and population genetic structure for fishes under family Carangidae.
Collapse
|
104
|
Wei H, Ma H, Ma C, Zhang H, Zhang F, Wang W, Chen W, Meng Y, Cheng J, Li S, Jiang Y, Ma L. Isolation and characterization of the complete mitochondrial genome of Taenioides anguillaris (Gobiidae: Amblyopinae) with phylogenetic consideration. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4681-4682. [PMID: 26642858 DOI: 10.3109/19401736.2015.1106500] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Here we first isolated and characterized the complete mitochondrial genome DNA of Taenioides anguillaris. It was 16 974 bp in length, and contained 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and a putative control region. The structure and the gene arrangement of this genome were identical to those isolated from other Gobiidae fishes. Twenty-eight genes were encoded by heavy strand, while nine genes were encoded by light strand. The total nucleotide composition of this genome was 28.7% for A, 15.6% for G, 28.6% for C, and 27.1% for T, with a high A + T content of 55.8%. From the neighbor-joining (NJ) phylogenetic tree, we can find that T. anguillaris was genetically closest to species Odontamblyopus rubicundus among 20 species within suborder Gobioidei. This work should be helpful for the studies on population genetic diversity and molecular evolution in T. anguillaris and related fish species.
Collapse
|
105
|
Chen F, Ma H, Ma C, Ni Y, Zhang X, Wei H, Meng Y, Zhao M, Ma L. The complete mitochondrial genome sequence and gene organization of Amblychaeturichthys hexanema (Gobiidae: Amblychaeturichthys) with phylogenetic consideration. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4687-4688. [PMID: 26642097 DOI: 10.3109/19401736.2015.1106503] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
We sequenced and characterized the complete mitochondrial genome of Amblychaeturichthys hexanema and uncovered its molecular phylogenetic relationship. This genome was 18 642 bp in size, and contained 37 classic genes, including 13 protein-coding genes, 22 transfer RNA genes and 2 ribosomal RNA genes. The gene organization and nucleotide composition were the same with those found in most other Gobiidae fishes. In contrast, gene NADH 6 was obviously larger than the same gene identified from other Gobiidae species. Two big repeat regions were detected on both sides of D-loop region. The A + T content (71.2%) of D-loop region was higher than any other region in this genome. Among 37 genes, 28 were encoded by the heavy strand, while nine were encoded by the light strand. The total nucleotide composition of this genome was 28.6% for A, 15.8% for G, 27.2% for C, and 28.3% for T, with a high A + T content of 56.9%. Phylogeny analysis of 22 fish species within Gobioidei showed that A. hexanema and Chaeturichthys stigmatias first formed a single cluster, then they together with other 17 species grouped a big branch. This work should be helpful for the studies on population genetic diversity and molecular evolution of Gobiidae fish species.
Collapse
|
106
|
AlMomin S, Kumar V, Al-Amad S, Al-Hussaini M, Dashti T, Al-Enezi K, Akbar A. Draft genome sequence of the silver pomfret fish, Pampus argenteus. Genome 2015; 59:51-8. [PMID: 26692342 DOI: 10.1139/gen-2015-0056] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Silver pomfret, Pampus argenteus, is a fish species from coastal waters. Despite its high commercial value, this edible fish has not been sequenced. Hence, its genetic and genomic studies have been limited. We report the first draft genome sequence of the silver pomfret obtained using a Next Generation Sequencing (NGS) technology. We assembled 38.7 Gb of nucleotides into scaffolds of 350 Mb with N50 of about 1.5 kb, using high quality paired end reads. These scaffolds represent 63.7% of the estimated silver pomfret genome length. The newly sequenced and assembled genome has 11.06% repetitive DNA regions, and this percentage is comparable to that of the tilapia genome. The genome analysis predicted 16 322 genes. About 91% of these genes showed homology with known proteins. Many gene clusters were annotated to protein and fatty-acid metabolism pathways that may be important in the context of the meat texture and immune system developmental processes. The reference genome can pave the way for the identification of many other genomic features that could improve breeding and population-management strategies, and it can also help characterize the genetic diversity of P. argenteus.
Collapse
|
107
|
Kinaro ZO, Xue L, Nyaundi KJ, Shen J. The mitochondrial genome of an endangered native Singidia tilapia, Oreochromis esculentus: genome organization and control region polymorphism. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4344-4346. [PMID: 26462597 DOI: 10.3109/19401736.2015.1089493] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Singidia tilapia (Oreochromis esculentus) is a native Cichlid fish of important commercial value, distributed in Lake Victoria, East Africa. Due to its declining population levels in its natural habitat, this species has now been classified as a Critically Endangerd by the International Union for the Conservation of nature (IUCN). In the present study the complete nucleotide sequence of the mitochondrial genome (mtDNA) of O. esculentus was determined. In addition, polymorphism analysis based on the mtDNA's control region sequence was investigated on two of its remaining populations of Yala and Borabu as well as a phylogenetic consideration using 16S rRNA mtDNA genes to explore its position and relationship within Cichlidae fish. The length of the complete mitogenome of O. esculentus is 16 622 bp, containing the same order and an identical number of genes and regions with the other reported Cichlid fishes, which consists of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes, and a putative non-coding region. The phylogenetic analysis shows O. esculentus being clustered within the Oreochromini sub-tribe of the Cichlidae. The high genetic diversity and low genetic differentiation between the two populations indicated the need for conservation of both the refuge ecosystems and the fish species under study.
Collapse
|
108
|
Zhao M, Ma H, Ma C, Zhang H, Zhang X, Meng Y, Wei H, Chen F, Ma L. The complete mitochondrial genome and gene organization of Cubiceps squamiceps (Perciformes: nomeidae) with phylogenetic consideration. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4296-4297. [PMID: 26404517 DOI: 10.3109/19401736.2015.1082103] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this study, we reported the complete mitochondrial DNA sequence of Cubiceps squamiceps, and its phylogenetic relationship. The full length of this mitogenome was 16 510 bp, including 22 transfer RNA genes, two ribosomal RNA genes, 13 protein-coding genes, and a putative control region. The location of genes in mitochondrial genome was similar to other fish species within family Nomeidae. Twenty-eight genes were located on the heavy strand, while nine genes were located on the light strand. The nucleotide composition of this mitogenome was 27.5% for A, 28.5% for C, 17.5% for G, and 26.5% for T. From the NJ phylogenetic tree, we can find that C. squamiceps was genetically closest to C. pauciradiatus among 20 species within suborder Stromateoidei. This study could lay the basis for the future studies in population genetic diversity, taxonomic status, molecular systematics, and conservation genetics in C. squamiceps.
Collapse
|
109
|
Rogers RL. Chromosomal Rearrangements as Barriers to Genetic Homogenization between Archaic and Modern Humans. Mol Biol Evol 2015; 32:3064-78. [PMID: 26399483 DOI: 10.1093/molbev/msv204] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Chromosomal rearrangements, which shuffle DNA throughout the genome, are an important source of divergence across taxa. Using a paired-end read approach with Illumina sequence data for archaic humans, I identify changes in genome structure that occurred recently in human evolution. Hundreds of rearrangements indicate genomic trafficking between the sex chromosomes and autosomes, raising the possibility of sex-specific changes. Additionally, genes adjacent to genome structure changes in Neanderthals are associated with testis-specific expression, consistent with evolutionary theory that new genes commonly form with expression in the testes. I identify one case of new-gene creation through transposition from the Y chromosome to chromosome 10 that combines the 5'-end of the testis-specific gene Fank1 with previously untranscribed sequence. This new transcript experienced copy number expansion in archaic genomes, indicating rapid genomic change. Among rearrangements identified in Neanderthals, 13% are transposition of selfish genetic elements, whereas 32% appear to be ectopic exchange between repeats. In Denisovan, the pattern is similar but numbers are significantly higher with 18% of rearrangements reflecting transposition and 40% ectopic exchange between distantly related repeats. There is an excess of divergent rearrangements relative to polymorphism in Denisovan, which might result from nonuniform rates of mutation, possibly reflecting a burst of transposable element activity in the lineage that led to Denisovan. Finally, loci containing genome structure changes show diminished rates of introgression from Neanderthals into modern humans, consistent with the hypothesis that rearrangements serve as barriers to gene flow during hybridization. Together, these results suggest that this previously unidentified source of genomic variation has important biological consequences in human evolution.
Collapse
|
110
|
Wei H, Ma H, Ma C, Zhang F, Wang W, Chen W, Ma L. The complete mitochondrial genome sequence and gene organization of Tridentiger trigonocephalus (Gobiidae: Gobionellinae) with phylogenetic consideration. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3725-6. [PMID: 26370266 DOI: 10.3109/19401736.2015.1079876] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The complete mitochondrial genome plays an important role in studies of genome-level characteristics and phylogenetic relationships. Here we determined the complete mitogenome sequence of Tridentiger trigonocephalus (Perciformes, Gobiidae), and discovered its phylogenetic relationship. This circular genome was 16 662 bp in length, and consisted of 37 typical genes, including 13 protein-coding genes, 22 tRNA genes, and two rRNA genes. The gene order of T. trigonocephalus mitochondrial genome was identical to those observed in most other vertebrates. Of 37 genes, 28 were encoded by heavy strand, while the others were encoded by light strand. The phylogenetic tree constructed by 13 concatenated protein-coding genes showed that T. trigonocephalus was closest to T. bifasciatus, and then to T. barbatus among the 20 species within suborder Gobioidei. This work should facilitate the studies on population genetic diversity, and molecular evolution in Gobioidei fishes.
Collapse
|
111
|
Singh S, Gupta SK, Nischal A, Pant KK, Seth PK. HDVDB: a data warehouse for hepatitis delta virus. INTERNATIONAL JOURNAL OF BIOINFORMATICS RESEARCH AND APPLICATIONS 2015; 11:162-170. [PMID: 25786795 DOI: 10.1504/ijbra.2015.068091] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2023]
Abstract
Hepatitis Delta Virus (HDV) is an RNA virus and causes delta hepatitis in humans. Although a lot of data is available for HDV, but retrieval of information is a complicated task. Current web database 'HDVDB' provides a comprehensive web-resource for HDV. The database is basically concerned with basic information about HDV and disease caused by this virus, genome structure, pathogenesis, epidemiology, symptoms and prevention, etc. Database also supplies sequence data and bibliographic information about HDV. A tool 'siHDV Predict' to design the effective siRNA molecule to control the activity of HDV, is also integrated in database. It is a user friendly information system available at public domain and provides annotated information about HDV for research scholars, scientists, pharma industry people for further study.
Collapse
|
112
|
Behura SK, Severson DW. Bidirectional promoters of insects: genome-wide comparison, evolutionary implication and influence on gene expression. J Mol Biol 2014; 427:521-36. [PMID: 25463441 DOI: 10.1016/j.jmb.2014.11.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Revised: 10/31/2014] [Accepted: 11/07/2014] [Indexed: 10/24/2022]
Abstract
Bidirectional promoters are widespread in insect genomes. By analyzing 23 insect genomes we show that the frequency of bidirectional gene pairs varies according to genome compactness and density of genes among the species. The density of bidirectional genes expected based on number of genes per megabase of genome explains the observed density suggesting that bidirectional pairing of genes may be due to random event. We identified specific transcription factor binding motifs that are enriched in bidirectional promoters across insect species. Furthermore, we observed that bidirectional promoters may act as transcriptional hotspots in insect genomes where protein coding genes tend to aggregate in significantly biased (p < 0.001) manner compared to unidirectional promoters. Natural selection seems to have an association with the extent of bidirectionality of genes among the species. The rate of non-synonymous-to-synonymous changes (dN/dS) shows a second-order polynomial distribution with bidirectionality between species indicating that bidirectionality is dependent upon evolutionary pressure acting on the genomes. Analysis of genome-wide microarray expression data of multiple insect species suggested that bidirectionality has a similar association with transcriptome variation across species. Furthermore, bidirectional promoters show significant association with correlated expression of the divergent gene pairs depending upon their motif composition. Analysis of gene ontology showed that bidirectional genes tend to have a common association with functions related to "binding" (including ion binding, nucleotide binding and protein binding) across genomes. Such functional constraint of bidirectional genes may explain their widespread persistence in genome of diverse insect species.
Collapse
|
113
|
Ottolini B, Hornsby MJ, Abujaber R, MacArthur JAL, Badge RM, Schwarzacher T, Albertson DG, Bevins CL, Solnick JV, Hollox EJ. Evidence of convergent evolution in humans and macaques supports an adaptive role for copy number variation of the β-defensin-2 gene. Genome Biol Evol 2014; 6:3025-38. [PMID: 25349268 PMCID: PMC4255768 DOI: 10.1093/gbe/evu236] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
β-defensins are a family of important peptides of innate immunity, involved in host defense, immunomodulation, reproduction, and pigmentation. Genes encoding β-defensins show evidence of birth-and-death evolution, adaptation by amino acid sequence changes, and extensive copy number variation (CNV) within humans and other species. The role of CNV in the adaptation of β-defensins to new functions remains unclear, as does the adaptive role of CNV in general. Here, we fine-map CNV of a cluster of β-defensins in humans and rhesus macaques. Remarkably, we found that the structure of the CNV is different between primates, with distinct mutational origins and CNV boundaries defined by retroviral long terminal repeat elements. Although the human β-defensin CNV region is 322 kb and encompasses several genes, including β-defensins, a long noncoding RNA gene, and testes-specific zinc-finger transcription factors, the orthologous region in the rhesus macaque shows CNV of a 20-kb region, containing only a single gene, the ortholog of the human β-defensin-2 gene. Despite its independent origins, the range of gene copy numbers in the rhesus macaque is similar to humans. In addition, the rhesus macaque gene has been subject to divergent positive selection at the amino acid level following its initial duplication event between 3 and 9.5 Ma, suggesting adaptation of this gene as the macaque successfully colonized novel environments outside Africa. Therefore, the molecular phenotype of β-defensin-2 CNV has undergone convergent evolution, and this gene shows evidence of adaptation at the amino acid level in rhesus macaques.
Collapse
|
114
|
Faino L, Thomma BPHJ. Get your high-quality low-cost genome sequence. TRENDS IN PLANT SCIENCE 2014; 19:288-91. [PMID: 24636622 DOI: 10.1016/j.tplants.2014.02.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Revised: 01/31/2014] [Accepted: 02/06/2014] [Indexed: 05/03/2023]
Abstract
The study of whole-genome sequences has become essential for almost all branches of biological research. Next-generation sequencing (NGS) has revolutionized the scalability, speed, and resolution of sequencing and brought genomic science within reach of academic laboratories that study non-model organisms. Here, we show that a high-quality draft genome of a eukaryote can be obtained at relatively low cost by exploiting a hybrid combination of sequencing strategies.
Collapse
|
115
|
Holwerda S, de Laat W. Chromatin loops, gene positioning, and gene expression. Front Genet 2012; 3:217. [PMID: 23087710 PMCID: PMC3473233 DOI: 10.3389/fgene.2012.00217] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2012] [Accepted: 10/01/2012] [Indexed: 01/09/2023] Open
Abstract
Technological developments and intense research over the last years have led to a better understanding of the 3D structure of the genome and its influence on genome function inside the cell nucleus. We will summarize topological studies performed on four model gene loci: the α- and β-globin gene loci, the antigen receptor loci, the imprinted H19-Igf2 locus and the Hox gene clusters. Collectively, these studies show that regulatory DNA sequences physically contact genes to control their transcription. Proteins set up the 3D configuration of the genome and we will discuss the roles of the key structural organizers CTCF and cohesin, the nuclear lamina and the transcription machinery. Finally, genes adopt non-random positions in the nuclear interior. We will review studies on gene positioning and propose that cell-specific genome conformations can juxtapose a regulatory sequence on one chromosome to a responsive gene on another chromosome to cause altered gene expression in subpopulations of cells.
Collapse
|
116
|
Incidence of genome structure, DNA asymmetry, and cell physiology on T-DNA integration in chromosomes of the phytopathogenic fungus Leptosphaeria maculans. G3-GENES GENOMES GENETICS 2012; 2:891-904. [PMID: 22908038 PMCID: PMC3411245 DOI: 10.1534/g3.112.002048] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2012] [Accepted: 06/07/2012] [Indexed: 11/18/2022]
Abstract
The ever-increasing generation of sequence data is accompanied by unsatisfactory functional annotation, and complex genomes, such as those of plants and filamentous fungi, show a large number of genes with no predicted or known function. For functional annotation of unknown or hypothetical genes, the production of collections of mutants using Agrobacterium tumefaciens–mediated transformation (ATMT) associated with genotyping and phenotyping has gained wide acceptance. ATMT is also widely used to identify pathogenicity determinants in pathogenic fungi. A systematic analysis of T-DNA borders was performed in an ATMT-mutagenized collection of the phytopathogenic fungus Leptosphaeria maculans to evaluate the features of T-DNA integration in its particular transposable element-rich compartmentalized genome. A total of 318 T-DNA tags were recovered and analyzed for biases in chromosome and genic compartments, existence of CG/AT skews at the insertion site, and occurrence of microhomologies between the T-DNA left border (LB) and the target sequence. Functional annotation of targeted genes was done using the Gene Ontology annotation. The T-DNA integration mainly targeted gene-rich, transcriptionally active regions, and it favored biological processes consistent with the physiological status of a germinating spore. T-DNA integration was strongly biased toward regulatory regions, and mainly promoters. Consistent with the T-DNA intranuclear-targeting model, the density of T-DNA insertion correlated with CG skew near the transcription initiation site. The existence of microhomologies between promoter sequences and the T-DNA LB flanking sequence was also consistent with T-DNA integration to host DNA mediated by homologous recombination based on the microhomology-mediated end-joining pathway.
Collapse
|
117
|
Aoki R, Takeda T, Omata T, Ihara K, Fujita Y. MarR-type transcriptional regulator ChlR activates expression of tetrapyrrole biosynthesis genes in response to low-oxygen conditions in cyanobacteria. J Biol Chem 2012; 287:13500-7. [PMID: 22375005 PMCID: PMC3339928 DOI: 10.1074/jbc.m112.346205] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2012] [Revised: 02/22/2012] [Indexed: 01/17/2023] Open
Abstract
Oxygen is required for three enzyme reactions in chlorophyll and bilin biosynthesis pathways: coproporphyrinogen III oxidase (HemF), heme oxygenase (HO1), and Mg-protoporphyrin IX monomethylester cyclase (ChlA(I)). The cyanobacterium Synechocystis sp. PCC 6803 has alternative enzymes, HemN, HO2, and ChlA(II), to supply chlorophyll/bilins even under low-oxygen environments. The three genes form an operon, chlA(II)-ho2-hemN, that is induced in response to low-oxygen conditions to bypass the oxygen-dependent reactions. Here we identified a transcriptional regulator for the induction of the operon in response to low-oxygen conditions. A pseudorevertant, Δho1R, was isolated from a HO1-lacking mutant Δho1 that is lethal under aerobic conditions. Δho1R grew well even under aerobic conditions. In Δho1R, HO2 that is induced only under low-oxygen conditions was anomalously expressed under aerobic conditions to complement the loss of HO1. A G-to-C transversion in sll1512 causing the amino acid change from aspartate 35 to histidine was identified as the relevant mutation by resequencing of the Δho1R genome. Sll1512 is a MarR-type transcriptional regulator. An sll1512-lacking mutant grew poorly under low-oxygen conditions with a remarked decrease in Chl content that would be caused by the suppressed induction of the chlA(II) and hemN genes in Chl biosynthesis under low-oxygen conditions. These results demonstrated that Sll1512 is an activator in response to low-oxygen environments and that the D35H variant becomes a constitutive activator. This hypothesis was supported by a gel shift assay showing that the Sll1512-D35H variant binds to the DNA fragment upstream of the operon. We propose to name sll1512 chlR.
Collapse
|
118
|
Integration of the Draft Sequence and Physical Map as a Framework for Genomic Research in Soybean (Glycine max (L.) Merr.) and Wild Soybean (Glycine soja Sieb. and Zucc.). G3-GENES GENOMES GENETICS 2012; 2:321-9. [PMID: 22413085 PMCID: PMC3291501 DOI: 10.1534/g3.111.001834] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2011] [Accepted: 12/21/2011] [Indexed: 11/21/2022]
Abstract
Soybean is a model for the legume research community because of its importance as a crop, densely populated genetic maps, and the availability of a genome sequence. Even though a whole-genome shotgun sequence and bacterial artificial chromosome (BAC) libraries are available, a high-resolution, chromosome-based physical map linked to the sequence assemblies is still needed for whole-genome alignments and to facilitate map-based gene cloning. Three independent G. max BAC libraries combined with genetic and gene-based markers were used to construct a minimum tiling path (MTP) of BAC clones. A total of 107,214 clones were assembled into 1355 FPC (FingerPrinted Contigs) contigs, incorporating 4628 markers and aligned to the G. max reference genome sequence using BAC end-sequence information. Four different MTPs were made for G. max that covered from 92.6% to 95.0% of the soybean draft genome sequence (gmax1.01). Because our purpose was to pick the most reliable and complete MTP, and not the MTP with the minimal number of clones, the FPC map and draft sequence were integrated and clones with unpaired BES were added to build a high-quality physical map with the fewest gaps possible (http://soybase.org). A physical map was also constructed for the undomesticated ancestor (G. soja) of soybean to explore genome variation between G. max and G. soja. 66,028 G. soja clones were assembled into 1053 FPC contigs covering approximately 547 Mbp of the G. max genome sequence. These physical maps for G. max and its undomesticated ancestor, G. soja, will serve as a framework for ordering sequence fragments, comparative genomics, cloning genes, and evolutionary analyses of legume genomes.
Collapse
|
119
|
Abstract
Over the past 10 years, the development of chromosome conformation capture (3C) technology and the subsequent genomic variants thereof have enabled the analysis of nuclear organization at an unprecedented resolution and throughput. The technology relies on the original and, in hindsight, remarkably simple idea that digestion and religation of fixed chromatin in cells, followed by the quantification of ligation junctions, allows for the determination of DNA contact frequencies and insight into chromosome topology. Here we evaluate and compare the current 3C-based methods (including 4C [chromosome conformation capture-on-chip], 5C [chromosome conformation capture carbon copy], HiC, and ChIA-PET), summarize their contribution to our current understanding of genome structure, and discuss how shape influences genome function.
Collapse
|
120
|
Tyler S, Severini A, Black D, Walker M, Eberle R. Structure and sequence of the saimiriine herpesvirus 1 genome. Virology 2011; 410:181-91. [PMID: 21130483 PMCID: PMC3017652 DOI: 10.1016/j.virol.2010.11.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2010] [Revised: 10/25/2010] [Accepted: 11/03/2010] [Indexed: 01/24/2023]
Abstract
We report here the complete genome sequence of the squirrel monkey α-herpesvirus saimiriine herpesvirus 1 (HVS1). Unlike the simplexviruses of other primate species, only the unique short region of the HVS1 genome is bounded by inverted repeats. While all Old World simian simplexviruses characterized to date lack the herpes simplex virus RL1 (γ34.5) gene, HVS1 has an RL1 gene. HVS1 lacks several genes that are present in other primate simplexviruses (US8.5, US10-12, UL43/43.5 and UL49A). Although the overall genome structure appears more like that of varicelloviruses, the encoded HVS1 proteins are most closely related to homologous proteins of the primate simplexviruses. Phylogenetic analyses confirm that HVS1 is a simplexvirus. Limited comparison of two HVS1 strains revealed a very low degree of sequence variation more typical of varicelloviruses. HVS1 is thus unique among the primate α-herpesviruses in that its genome has properties of both simplexviruses and varicelloviruses.
Collapse
|
121
|
Abstract
Sequence data of entire eukaryotic genomes and their detailed comparison have provided new evidence on genome evolution. The major mechanisms involved in the increase of genome sizes are polyploidization and gene duplication. Subsequent gene silencing or mutations, preferentially in regulatory sequences of genes, modify the genome and permit the development of genes with new properties. Mechanisms such as lateral gene transfer, exon shuffling or the creation of new genes by transposition contribute to the evolution of a genome, but remain of relatively restricted relevance. Mechanisms to decrease genome sizes and, in particular, to remove specific DNA sequences, such as blocks of satellite DNAs, appear to involve the action of RNA interference (RNAi). RNAi mechanisms have been proven to be involved in chromatin packaging related with gene inactivation as well as in DNA excision during the macronucleus development in ciliates.
Collapse
|
122
|
Feuillet C, Keller B. Comparative genomics in the grass family: molecular characterization of grass genome structure and evolution. ANNALS OF BOTANY 2002; 89:3-10. [PMID: 12096816 PMCID: PMC4233775 DOI: 10.1093/aob/mcf008] [Citation(s) in RCA: 102] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The genomes of grasses are very different in terms of size, ploidy level and chromosome number. Despite these significant differences, it was found by comparative mapping that the linear order (colinearity) of genetic markers and genes is very well conserved between different grass genomes. The potential of such conservation has been exploited in several directions, e.g. in defining rice as a model genome for grasses and in designing better strategies for positional cloning in large genomes. Recently, the development of large insert libraries in species such as maize, rice, barley and diploid wheat has allowed the study of large stretches of DNA sequence and has provided insight into gene organization in grasses. It was found that genes are not distributed randomly along the chromosomes and that there are clusters of high gene density in species with large genomes. Comparative analysis performed at the DNA sequence level has demonstrated that colinearity between the grass genomes is retained at the molecular level (microcolinearity) in most cases. However, detailed analysis has also revealed a number of exceptions to microcolinearity, which have given insight into mechanisms that are involved in grass-genome evolution. In some cases, the use of rice as a model to support gene isolation from other grass genomes will be complicated by local rearrangements. In this Botanical Briefing, we present recent progress and future prospects of comparative genomics in grasses.
Collapse
|
123
|
Rasschaert D, Gelfi J, Laude H. Enteric coronavirus TGEV: partial sequence of the genomic RNA, its organization and expression. Biochimie 1987; 69:591-600. [PMID: 2825819 PMCID: PMC7130962 DOI: 10.1016/0300-9084(87)90178-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The sequence of the 3'-most 8300 nucleotides of the genome RNA of the Purdue-115 strain of the transmissible gastroenteritis virus TGEV, a porcine coronavirus, was determined from cDNA clones. The available sequence corresponds to the part of the genome (total length greater than 20 kb) expressed through subgenomic mRNAs. The 5 subgenomic and the genomic RNA species detected in TGEV-infected cells form a 3'-coterminal 'nested' structure, a unique feature of Coronaviridae. The transcription initiation site of the TGEV subgenomic RNAs appears to involve the hexameric sequence 5'CTAAAC, which is present upstream from each coding region. In addition to the previously identified genes encoding the three structural proteins, E2, E1 and N, two regions, X1 and X2, corresponding to the non-overlapping portion of mRNAs 4 and 3, may code for so far unidentified non-structural polypeptides. The predicted X1 polypeptide (9.2 kDa) is highly hydrophobic. The sequence of the X2 region allows the translation of two non-overlapping products, i.e., X2a (7.7 kDa) and X2b (18.8 kDa). No RNA species liable to express the extreme 3' open reading frame X3 was found.
Collapse
|