Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pauws E, van Kampen AH, van de Graaf SA, de Vijlder JJ, Ris-Stalpers C. Heterogeneity in polyadenylation cleavage sites in mammalian mRNA sequences: implications for SAGE analysis. Nucleic Acids Res 2001;29:1690-4. [PMID: 11292841 PMCID: PMC31324 DOI: 10.1093/nar/29.8.1690] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

For:	Pauws E, van Kampen AH, van de Graaf SA, de Vijlder JJ, Ris-Stalpers C. Heterogeneity in polyadenylation cleavage sites in mammalian mRNA sequences: implications for SAGE analysis. Nucleic Acids Res 2001;29:1690-4. [PMID: 11292841 PMCID: PMC31324 DOI: 10.1093/nar/29.8.1690] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Number

Cited by Other Article(s)

Fine gene expression regulation by minor sequence variations downstream of the polyadenylation signal. Mol Biol Rep 2021;48:1539-1547. [PMID: 33517473 DOI: 10.1007/s11033-021-06160-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 01/12/2021] [Indexed: 12/22/2022]

de la Fuente L, Arzalluz-Luque Á, Tardáguila M, Del Risco H, Martí C, Tarazona S, Salguero P, Scott R, Lerma A, Alastrue-Agudo A, Bonilla P, Newman JRB, Kosugi S, McIntyre LM, Moreno-Manzano V, Conesa A. tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing. Genome Biol 2020;21:119. [PMID: 32423416 PMCID: PMC7236505 DOI: 10.1186/s13059-020-02028-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 04/23/2020] [Indexed: 12/26/2022] Open

Affiliation(s)

Lorena de la Fuente Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain Present Address: Bioinformatics Unit, IIS Fundación Jiménez Díaz, Madrid, Spain
Ángeles Arzalluz-Luque Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
Manuel Tardáguila Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
Héctor Del Risco Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
Cristina Martí Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
Sonia Tarazona Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
Pedro Salguero Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
Raymond Scott Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
Alberto Lerma Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
Ana Alastrue-Agudo Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
Pablo Bonilla Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
Jeremy R B Newman Genetics Institute, University of Florida, Gainesville, FL, USA Department of Pathology, University of Florida, Gainesville, FL, USA
Shunichi Kosugi Genetics Institute, University of Florida, Gainesville, FL, USA Laboratory for Statistical and Translational Genetics, Center for Integrative Medical Sciences, RIKEN, Wako, Japan
Lauren M McIntyre Genetics Institute, University of Florida, Gainesville, FL, USA Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
Victoria Moreno-Manzano Neural Regeneration Laboratory, Prince Felipe Research Center, Valencia, Spain
Ana Conesa Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA. Genetics Institute, University of Florida, Gainesville, FL, USA.

Collapse

Stolyarenko AD. Nuclear Argonaute Piwi Gene Mutation Affects rRNA by Inducing rRNA Fragment Accumulation, Antisense Expression, and Defective Processing in Drosophila Ovaries. Int J Mol Sci 2020;21:ijms21031119. [PMID: 32046213 PMCID: PMC7037970 DOI: 10.3390/ijms21031119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Revised: 01/27/2020] [Accepted: 02/04/2020] [Indexed: 12/26/2022] Open

Genome-Wide Profiling of Polyadenylation Events in Maize Using High-Throughput Transcriptomic Sequences. G3-GENES GENOMES GENETICS 2019;9:2749-2760. [PMID: 31239292 PMCID: PMC6686930 DOI: 10.1534/g3.119.400196] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

The Transcriptional Landscape of Marek's Disease Virus in Primary Chicken B Cells Reveals Novel Splice Variants and Genes. Viruses 2019;11:v11030264. [PMID: 30884829 PMCID: PMC6466439 DOI: 10.3390/v11030264] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 03/12/2019] [Accepted: 03/13/2019] [Indexed: 12/14/2022] Open

Majerciak V, Yang W, Zheng J, Zhu J, Zheng ZM. A Genome-Wide Epstein-Barr Virus Polyadenylation Map and Its Antisense RNA to EBNA. J Virol 2019;93:e01593-18. [PMID: 30355690 PMCID: PMC6321932 DOI: 10.1128/jvi.01593-18] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Accepted: 10/17/2018] [Indexed: 12/14/2022] Open

Abstract

Epstein-Barr virus (EBV) is a ubiquitous human pathogen associated with Burkitt's lymphoma and nasopharyngeal carcinoma. Although the EBV genome harbors more than a hundred genes, a full transcription map with EBV polyadenylation profiles remains unknown. To elucidate the 3' ends of all EBV transcripts genome-wide, we performed the first comprehensive analysis of viral polyadenylation sites (pA sites) using our previously reported polyadenylation sequencing (PA-seq) technology. We identified that EBV utilizes a total of 62 pA sites in JSC-1, 60 in Raji, and 53 in Akata cells for the expression of EBV genes from both plus and minus DNA strands; 42 of these pA sites are commonly used in all three cell lines. The majority of identified pA sites were mapped to the intergenic regions downstream of previously annotated EBV open reading frames (ORFs) and viral promoters. pA sites lacking an association with any known EBV genes were also identified, mostly for the minus DNA strand within the EBNA locus, a major locus responsible for maintenance of viral latency and cell transformation. The expression of these novel antisense transcripts to EBNA were verified by 3' rapid amplification of cDNA ends (RACE) and Northern blot analyses in several EBV-positive (EBV+) cell lines. In contrast to EBNA RNA expressed during latency, expression of EBNA-antisense transcripts, which is restricted in latent cells, can be significantly induced by viral lytic infection, suggesting potential regulation of viral gene expression by EBNA-antisense transcription during lytic EBV infection. Our data provide the first evidence that EBV has an unrecognized mechanism that regulates EBV reactivation from latency.IMPORTANCE Epstein-Barr virus represents an important human pathogen with an etiological role in the development of several cancers. By elucidation of a genome-wide polyadenylation landscape of EBV in JSC-1, Raji, and Akata cells, we have redefined the EBV transcriptome and mapped individual polymerase II (Pol II) transcripts of viral genes to each one of the mapped pA sites at single-nucleotide resolution as well as the depth of expression. By unveiling a new class of viral lytic RNA transcripts antisense to latent EBNAs, we provide a novel mechanism of how EBV might control the expression of viral latent genes and lytic infection. Thus, this report takes another step closer to understanding EBV gene structure and expression and paves a new path for antiviral approaches.

Collapse

Freitas N, Lukash T, Gunewardena S, Chappell B, Slagle BL, Gudima SO. Relative Abundance of Integrant-Derived Viral RNAs in Infected Tissues Harvested from Chronic Hepatitis B Virus Carriers. J Virol 2018;92:e02221-17. [PMID: 29491161 PMCID: PMC5923063 DOI: 10.1128/jvi.02221-17] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Accepted: 02/17/2018] [Indexed: 02/07/2023] Open

Abstract

Five matching sets of nonmalignant liver tissues and hepatocellular carcinoma (HCC) samples from individuals chronically infected with hepatitis B virus (HBV) were examined. The HBV genomic sequences were determined by using overlapping PCR amplicons covering the entire viral genome. Four pairs of tissues were infected with HBV genotype C, while one pair was infected with HBV genotype B. HBV replication markers were found in all tissues. In the majority of HCC samples, the levels of pregenomic/precore RNA (pgRNA) and covalently closed circular DNA (cccDNA) were lower than those in liver tissue counterparts. Regardless of the presence of HBV replication markers, (i) integrant-derived HBV RNAs (id-RNAs) were found in all tissues by reverse transcription-PCR (RT-PCR) analysis and were considerably abundant or predominant in 6/10 tissue samples (2 liver and 4 HCC samples), (ii) RNAs that were polyadenylated using the cryptic HBV polyadenylation signal and therefore could be produced by HBV replication or derived from integrated HBV DNA were found in 5/10 samples (3 liver and 2 HCC samples) and were considerably abundant species in 3/10 tissues (2 livers and 1 HCC), and (iii) cccDNA-transcribed RNAs polyadenylated near position 1931 were not abundant in 7/10 tissues (2 liver and 5 HCC samples) and were predominant in only two liver samples. Subsequent RNA sequencing analysis of selected liver/HCC samples also showed relative abundance of id-RNAs in most of the examined tissues. Our findings suggesting that id-RNAs could represent a significant source of HBV envelope proteins, which is independent of viral replication, are discussed in the context of the possible contribution of id-RNAs to the HBV life cycle.IMPORTANCE The relative abundance of integrant-derived HBV RNAs (id-RNAs) in chronically infected tissues suggest that id-RNAs coding for the envelope proteins may facilitate the production of a considerable fraction of surface antigens (HBsAg) in infected cells bearing HBV integrants. If the same cells support HBV replication, then a significant fraction of assembled HBV virions could bear id-RNA-derived HBsAg as a major component of their envelopes. Therefore, the infectivity of these HBV virions and their ability to facilitate virus cell-to-cell spread could be determined mainly by the properties of id-RNA-derived envelope proteins and not by the properties of replication-derived HBsAg. These interpretations suggest that id-RNAs may play a role in the maintenance of chronic HBV infection and therefore contribute to the HBV life cycle. Furthermore, the production of HBsAg from id-RNAs independently of viral replication may explain at least in part why treatment with interferon or nucleos(t)ides in most cases fails to achieve a loss of serum HBsAg.

Collapse

Targeting the Polyadenylation Signal of Pre-mRNA: A New Gene Silencing Approach for Facioscapulohumeral Dystrophy. Int J Mol Sci 2018;19:ijms19051347. [PMID: 29751519 PMCID: PMC5983732 DOI: 10.3390/ijms19051347] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 04/27/2018] [Accepted: 04/30/2018] [Indexed: 02/07/2023] Open

Rot G, Wang Z, Huppertz I, Modic M, Lenče T, Hallegger M, Haberman N, Curk T, von Mering C, Ule J. High-Resolution RNA Maps Suggest Common Principles of Splicing and Polyadenylation Regulation by TDP-43. Cell Rep 2018;19:1056-1067. [PMID: 28467899 PMCID: PMC5437728 DOI: 10.1016/j.celrep.2017.04.028] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Revised: 03/06/2017] [Accepted: 04/06/2017] [Indexed: 11/05/2022] Open

Feng L, Yuen YL, Xu J, Liu X, Chan MYC, Wang K, Fong WP, Cheung WT, Lee SST. Identification and characterization of a novel PPARα-regulated and 7α-hydroxyl bile acid-preferring cytosolic sulfotransferase mL-STL (Sult2a8). J Lipid Res 2017;58:1114-1131. [PMID: 28442498 DOI: 10.1194/jlr.m074302] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Revised: 04/19/2017] [Indexed: 12/25/2022] Open

Prediction of Poly(A) Sites by Poly(A) Read Mapping. PLoS One 2017;12:e0170914. [PMID: 28135292 PMCID: PMC5279776 DOI: 10.1371/journal.pone.0170914] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Accepted: 01/12/2017] [Indexed: 11/19/2022] Open

Abstract

RNA-seq reads containing part of the poly(A) tail of transcripts (denoted as poly(A) reads) provide the most direct evidence for the position of poly(A) sites in the genome. However, due to reduced coverage of poly(A) tails by reads, poly(A) reads are not routinely identified during RNA-seq mapping. Nevertheless, recent studies for several herpesviruses successfully employed mapping of poly(A) reads to identify herpesvirus poly(A) sites using different strategies and customized programs. To more easily allow such analyses without requiring additional programs, we integrated poly(A) read mapping and prediction of poly(A) sites into our RNA-seq mapping program ContextMap 2. The implemented approach essentially generalizes previously used poly(A) read mapping approaches and combines them with the context-based approach of ContextMap 2 to take into account information provided by other reads aligned to the same location. Poly(A) read mapping using ContextMap 2 was evaluated on real-life data from the ENCODE project and compared against a competing approach based on transcriptome assembly (KLEAT). This showed high positive predictive value for our approach, evidenced also by the presence of poly(A) signals, and considerably lower runtime than KLEAT. Although sensitivity is low for both methods, we show that this is in part due to a high extent of spurious results in the gold standard set derived from RNA-PET data. Sensitivity improves for poly(A) sites of known transcripts or determined with a more specific poly(A) sequencing protocol and increases with read coverage on transcript ends. Finally, we illustrate the usefulness of the approach in a high read coverage scenario by a re-analysis of published data for herpes simplex virus 1. Thus, with current trends towards increasing sequencing depth and read length, poly(A) read mapping will prove to be increasingly useful and can now be performed automatically during RNA-seq mapping with ContextMap 2.

Collapse

Wang X, Zheng ZM. Construction of a Transcription Map for Papillomaviruses using RACE, RNase Protection, and Primer Extension Assays. ACTA ACUST UNITED AC 2016;40:14B.6.1-14B.6.29. [PMID: 26855281 DOI: 10.1002/9780471729259.mc14b06s40] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

The UVS9 gene of Chlamydomonas encodes an XPG homolog with a new conserved domain. DNA Repair (Amst) 2015;37:33-42. [PMID: 26658142 DOI: 10.1016/j.dnarep.2015.11.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Revised: 11/06/2015] [Accepted: 11/16/2015] [Indexed: 11/20/2022]

Extraction of poly(A) sites from large-scale RNA-Seq data. Methods Mol Biol 2015;1255:25-37. [PMID: 25487201 DOI: 10.1007/978-1-4939-2175-1_3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

You L, Wu J, Feng Y, Fu Y, Guo Y, Long L, Zhang H, Luan Y, Tian P, Chen L, Huang G, Huang S, Li Y, Li J, Chen C, Zhang Y, Chen S, Xu A. APASdb: a database describing alternative poly(A) sites and selection of heterogeneous cleavage sites downstream of poly(A) signals. Nucleic Acids Res 2014;43:D59-67. [PMID: 25378337 PMCID: PMC4383914 DOI: 10.1093/nar/gku1076] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Affiliation(s)

Leiming You State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China School of Basic Medical Sciences, Beijing University of Chinese Medicine, Beijing 100029, People's Republic of China
Jiexin Wu State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yuchao Feng State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yonggui Fu State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yanan Guo State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Liyuan Long State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Hui Zhang State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yijie Luan State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Peng Tian State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Liangfu Chen State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Guangrui Huang State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Shengfeng Huang State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yuxin Li State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Jie Li State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Chengyong Chen State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Yaqing Zhang State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Shangwu Chen State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China
Anlong Xu State Key Laboratory of Biocontrol, Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, School of Life Sciences, Sun Yat-Sen University, Higher Education Mega Center, Guangzhou 510006, People's Republic of China School of Basic Medical Sciences, Beijing University of Chinese Medicine, Beijing 100029, People's Republic of China

Collapse

Janbon G, Ormerod KL, Paulet D, Byrnes EJ, Yadav V, Chatterjee G, Mullapudi N, Hon CC, Billmyre RB, Brunel F, Bahn YS, Chen W, Chen Y, Chow EWL, Coppée JY, Floyd-Averette A, Gaillardin C, Gerik KJ, Goldberg J, Gonzalez-Hilarion S, Gujja S, Hamlin JL, Hsueh YP, Ianiri G, Jones S, Kodira CD, Kozubowski L, Lam W, Marra M, Mesner LD, Mieczkowski PA, Moyrand F, Nielsen K, Proux C, Rossignol T, Schein JE, Sun S, Wollschlaeger C, Wood IA, Zeng Q, Neuvéglise C, Newlon CS, Perfect JR, Lodge JK, Idnurm A, Stajich JE, Kronstad JW, Sanyal K, Heitman J, Fraser JA, Cuomo CA, Dietrich FS. Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation. PLoS Genet 2014;10:e1004261. [PMID: 24743168 PMCID: PMC3990503 DOI: 10.1371/journal.pgen.1004261] [Citation(s) in RCA: 276] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 02/07/2014] [Indexed: 02/07/2023] Open

Abstract

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Cryptococcus neoformans var. grubii is a major human pathogen responsible for deadly meningoencephalitis in immunocompromised patients. Here, we report the sequencing and annotation of its genome. Evidence for extensive intron splicing, antisense transcription, non-coding RNAs, and alternative polyadenylation indicates the potential for highly intricate regulation of gene expression in this opportunistic pathogen. In addition, detailed molecular, genetic, and genomic studies were performed to characterize structural features of the genome, including centromeres and origins of replication. Finally, the phenotypic and genome re-sequencing analysis of a collection of isolates of the reference H99 strain resulting from laboratory passage revealed that microevolutionary processes during in vitro culturing of pathogenic fungi can impact virulence.

Collapse

Affiliation(s)

Guilhem Janbon Institut Pasteur, Unité Biologie et Pathogénicité Fongiques, Département Génomes et Génétique, Paris, France INRA, USC2019, Paris, France * E-mail: (GJ); (JH); (CAC); (FSD)
Kate L. Ormerod University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
Damien Paulet Institut Pasteur, Plate-forme Transcriptome et Epigénome, Département Génomes et Génétique, Paris, France
Edmond J. Byrnes Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
Vikas Yadav Jawaharlal Nehru Centre for Advanced Scientific Research, Molecular Biology and Genetics Unit, Bangalore, India
Gautam Chatterjee Jawaharlal Nehru Centre for Advanced Scientific Research, Molecular Biology and Genetics Unit, Bangalore, India
Nandita Mullapudi Genotypic Technology Private Limited, Bangalore, India
Chung-Chau Hon Institut Pasteur, Unité Biologie Cellulaire du Parasitisme, Département Biologie Cellulaire et Infection, Paris, France
R. Blake Billmyre Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
François Brunel INRA, UMR 1319 Micalis, Jouy-en-Josas, France
Yong-Sun Bahn Yonsei University, Center for Fungal Pathogenesis, Department of Biotechnology, Seoul, Republic of Korea
Weidong Chen Rutgers New Jersey Medical School, Department of Microbiology and Molecular Genetics, Newark, New Jersey, United States of America
Yuan Chen Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
Eve W. L. Chow University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
Jean-Yves Coppée Institut Pasteur, Plate-forme Transcriptome et Epigénome, Département Génomes et Génétique, Paris, France
Anna Floyd-Averette Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
Claude Gaillardin INRA, UMR 1319 Micalis, Jouy-en-Josas, France
Kimberly J. Gerik Washington University School of Medicine, Department of Molecular Microbiology, St. Louis, Missouri, United States of America
Jonathan Goldberg Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Sara Gonzalez-Hilarion Institut Pasteur, Unité Biologie et Pathogénicité Fongiques, Département Génomes et Génétique, Paris, France INRA, USC2019, Paris, France
Sharvari Gujja Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Joyce L. Hamlin University of Virginia, Department of Biochemistry and Molecular Genetics, Charlottesville, Virginia, United States of America
Yen-Ping Hsueh Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America California Institute of Technology, Division of Biology, Pasadena, California, United States of America
Giuseppe Ianiri University of Missouri-Kansas City, School of Biological Sciences, Division of Cell Biology and Biophysics, Kansas City, Missouri, United States of America
Steven Jones Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia, Canada
Chinnappa D. Kodira Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Lukasz Kozubowski Clemson University, Department of Genetics and Biochemistry, Clemson, South Carolina, United States of America
Woei Lam Washington University School of Medicine, Department of Molecular Microbiology, St. Louis, Missouri, United States of America
Marco Marra Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia, Canada
Larry D. Mesner University of Virginia, Department of Biochemistry and Molecular Genetics, Charlottesville, Virginia, United States of America
Piotr A. Mieczkowski University of North Carolina, Department of Genetics, Chapel Hill, North Carolina, United States of America
Frédérique Moyrand Institut Pasteur, Unité Biologie et Pathogénicité Fongiques, Département Génomes et Génétique, Paris, France INRA, USC2019, Paris, France
Kirsten Nielsen Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America University of Minnesota, Microbiology Department, Minneapolis, Minnesota, United States of America
Caroline Proux Institut Pasteur, Plate-forme Transcriptome et Epigénome, Département Génomes et Génétique, Paris, France
Tristan Rossignol INRA, UMR 1319 Micalis, Jouy-en-Josas, France
Jacqueline E. Schein Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia, Canada
Sheng Sun Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
Carolin Wollschlaeger Institut Pasteur, Unité Biologie et Pathogénicité Fongiques, Département Génomes et Génétique, Paris, France INRA, USC2019, Paris, France
Ian A. Wood University of Queensland, School of Mathematics and Physics, Brisbane, Queensland, Australia
Qiandong Zeng Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Cécile Neuvéglise INRA, UMR 1319 Micalis, Jouy-en-Josas, France
Carol S. Newlon Rutgers New Jersey Medical School, Department of Microbiology and Molecular Genetics, Newark, New Jersey, United States of America
John R. Perfect Duke University Medical Center, Duke Department of Medicine and Molecular Genetics and Microbiology, Durham, North Carolina, United States of America
Jennifer K. Lodge Washington University School of Medicine, Department of Molecular Microbiology, St. Louis, Missouri, United States of America
Alexander Idnurm University of Missouri-Kansas City, School of Biological Sciences, Division of Cell Biology and Biophysics, Kansas City, Missouri, United States of America
Jason E. Stajich Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America University of California, Department of Plant Pathology & Microbiology, Riverside, California, United States of America
James W. Kronstad Michael Smith Laboratories, Department of Microbiology and Immunology, Vancouver, British Columbia, Canada
Kaustuv Sanyal Jawaharlal Nehru Centre for Advanced Scientific Research, Molecular Biology and Genetics Unit, Bangalore, India
Joseph Heitman Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America * E-mail: (GJ); (JH); (CAC); (FSD)
James A. Fraser University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
Christina A. Cuomo Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America * E-mail: (GJ); (JH); (CAC); (FSD)
Fred S. Dietrich Duke University Medical Center, Department of Molecular Genetics and Microbiology, Durham, North Carolina, United States of America * E-mail: (GJ); (JH); (CAC); (FSD)

Collapse

Delineating the structural blueprint of the pre-mRNA 3'-end processing machinery. Mol Cell Biol 2014;34:1894-910. [PMID: 24591651 DOI: 10.1128/mcb.00084-14] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Siegel TN, Hon CC, Zhang Q, Lopez-Rubio JJ, Scheidig-Benatar C, Martins RM, Sismeiro O, Coppée JY, Scherf A. Strand-specific RNA-Seq reveals widespread and developmentally regulated transcription of natural antisense transcripts in Plasmodium falciparum. BMC Genomics 2014;15:150. [PMID: 24559473 PMCID: PMC4007998 DOI: 10.1186/1471-2164-15-150] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Accepted: 02/06/2014] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Advances in high-throughput sequencing have led to the discovery of widespread transcription of natural antisense transcripts (NATs) in a large number of organisms, where these transcripts have been shown to play important roles in the regulation of gene expression. Likewise, the existence of NATs has been observed in Plasmodium but our understanding towards their genome-wide distribution remains incomplete due to the limited depth and uncertainties in the level of strand specificity of previous datasets.

RESULTS

To gain insights into the genome-wide distribution of NATs in P. falciparum, we performed RNA-ligation based strand-specific RNA sequencing at unprecedented depth. Our data indicate that 78.3% of the genome is transcribed during blood-stage development. Moreover, our analysis reveals significant levels of antisense transcription from at least 24% of protein-coding genes and that while expression levels of NATs change during the intraerythrocytic developmental cycle (IDC), they do not correlate with the corresponding mRNA levels. Interestingly, antisense transcription is not evenly distributed across coding regions (CDSs) but strongly clustered towards the 3'-end of CDSs. Furthermore, for a significant subset of NATs, transcript levels correlate with mRNA levels of neighboring genes.Finally, we were able to identify the polyadenylation sites (PASs) for a subset of NATs, demonstrating that at least some NATs are polyadenylated. We also mapped the PASs of 3443 coding genes, yielding an average 3' untranslated region length of 523 bp.

CONCLUSIONS

Our strand-specific analysis of the P. falciparum transcriptome expands and strengthens the existing body of evidence that antisense transcription is a substantial phenomenon in P. falciparum. For a subset of neighboring genes we find that sense and antisense transcript levels are intricately linked while other NATs appear to be regulated independently of mRNA transcription. Our deep strand-specific dataset will provide a valuable resource for the precise determination of expression levels as it separates sense from antisense transcript levels, which we find to often significantly differ. In addition, the extensive novel data on 3' UTR length will allow others to perform searches for regulatory motifs in the UTRs and help understand post-translational regulation in P. falciparum.

Collapse

Zheng D, Tian B. RNA-binding proteins in regulation of alternative cleavage and polyadenylation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014;825:97-127. [PMID: 25201104 DOI: 10.1007/978-1-4939-1221-6_3] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Majerciak V, Ni T, Yang W, Meng B, Zhu J, Zheng ZM. A viral genome landscape of RNA polyadenylation from KSHV latent to lytic infection. PLoS Pathog 2013;9:e1003749. [PMID: 24244170 PMCID: PMC3828183 DOI: 10.1371/journal.ppat.1003749] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2013] [Accepted: 09/20/2013] [Indexed: 11/30/2022] Open

Abstract

RNA polyadenylation (pA) is one of the major steps in regulation of gene expression at the posttranscriptional level. In this report, a genome landscape of pA sites of viral transcripts in B lymphocytes with Kaposi sarcoma-associated herpesvirus (KSHV) infection was constructed using a modified PA-seq strategy. We identified 67 unique pA sites, of which 55 could be assigned for expression of annotated ∼90 KSHV genes. Among the assigned pA sites, twenty are for expression of individual single genes and the rest for multiple genes (average 2.7 genes per pA site) in cluster-gene loci of the genome. A few novel viral pA sites that could not be assigned to any known KSHV genes are often positioned in the antisense strand to ORF8, ORF21, ORF34, K8 and ORF50, and their associated antisense mRNAs to ORF21, ORF34 and K8 could be verified by 3′RACE. The usage of each mapped pA site correlates to its peak size, the larger (broad and wide) peak size, the more usage and thus, the higher expression of the pA site-associated gene(s). Similar to mammalian transcripts, KSHV RNA polyadenylation employs two major poly(A) signals, AAUAAA and AUUAAA, and is regulated by conservation of cis-elements flanking the mapped pA sites. Moreover, we found two or more alternative pA sites downstream of ORF54, K2 (vIL6), K9 (vIRF1), K10.5 (vIRF3), K11 (vIRF2), K12 (Kaposin A), T1.5, and PAN genes and experimentally validated the alternative polyadenylation for the expression of KSHV ORF54, K11, and T1.5 transcripts. Together, our data provide not only a comprehensive pA site landscape for understanding KSHV genome structure and gene expression, but also the first evidence of alternative polyadenylation as another layer of posttranscriptional regulation in viral gene expression.

A genome-wide polyadenylation landscape in the expression of human herpesviruses has not been reported. In this study, we provide the first genome landscape of viral RNA polyadenylation sites in B cells from KSHV latent to lytic infection by using a modified PA-seq protocol and selectively validated by 3′ RACE. We found that KSHV genome contains 67 active pA sites for the expression of its ∼90 genes and a few antisense transcripts. Among the mapped pA sites, a large fraction of them are for the expression of cluster genes and the production of bicistronic or polycistronic transcripts from KSHV genome and only one-third are used for the expression of single genes. We found that the size of individual PA peaks is positively correlated with the usage of corresponding pA site, which is determined by the number of reads within the PA peak from latent to lytic KSHV infection, and the strength of cis-elements surrounding KSHV pA site determines the expression level of viral genes. Lastly, we identified and experimentally validated alternative polyadenylation of KSHV ORF54, T1.5, and K11 during viral lytic infection. To our knowledge, this is the first report on alternative polyadenylation events in KSHV infection.

Collapse

Genomewide mapping and screening of Kaposi's sarcoma-associated herpesvirus (KSHV) 3' untranslated regions identify bicistronic and polycistronic viral transcripts as frequent targets of KSHV microRNAs. J Virol 2013;88:377-92. [PMID: 24155407 DOI: 10.1128/jvi.02689-13] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

Sheppard S, Lawson ND, Zhu LJ. Accurate identification of polyadenylation sites from 3' end deep sequencing using a naive Bayes classifier. ACTA ACUST UNITED AC 2013;29:2564-71. [PMID: 23962617 DOI: 10.1093/bioinformatics/btt446] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Pereira-Castro I, Costa AMS, Oliveira MJ, Barbosa I, Rocha AS, Azevedo L, da Costa LT. Characterization of human NLZ1/ZNF703 identifies conserved domains essential for proper subcellular localization and transcriptional repression. J Cell Biochem 2013;114:120-33. [PMID: 22886885 DOI: 10.1002/jcb.24309] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Accepted: 07/26/2012] [Indexed: 11/06/2022]

Franzén O, Jerlström-Hultqvist J, Einarsson E, Ankarklev J, Ferella M, Andersson B, Svärd SG. Transcriptome profiling of Giardia intestinalis using strand-specific RNA-seq. PLoS Comput Biol 2013;9:e1003000. [PMID: 23555231 PMCID: PMC3610916 DOI: 10.1371/journal.pcbi.1003000] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2012] [Accepted: 02/02/2013] [Indexed: 01/08/2023] Open

Abstract

Giardia intestinalis is a common cause of diarrheal disease and it consists of eight genetically distinct genotypes or assemblages (A-H). Only assemblages A and B infect humans and are suggested to represent two different Giardia species. Correlations exist between assemblage type and host-specificity and to some extent symptoms. Phenotypical differences have been documented between assemblages and genome sequences are available for A, B and E. We have characterized and compared the polyadenylated transcriptomes of assemblages A, B and E. Four genetically different isolates were studied (WB (AI), AS175 (AII), P15 (E) and GS (B)) using paired-end, strand-specific RNA-seq. Most of the genome was transcribed in trophozoites grown in vitro, but at vastly different levels. RNA-seq confirmed many of the present annotations and refined the current genome annotation. Gene expression divergence was found to recapitulate the known phylogeny, and uncovered lineage-specific differences in expression. Polyadenylation sites were mapped for over 70% of the genes and revealed many examples of conserved and unexpectedly long 3′ UTRs. 28 open reading frames were found in a non-transcribed gene cluster on chromosome 5 of the WB isolate. Analysis of allele-specific expression revealed a correlation between allele-dosage and allele expression in the GS isolate. Previously reported cis-splicing events were confirmed and global mapping of cis-splicing identified only one novel intron. These observations can possibly explain differences in host-preference and symptoms, and it will be the basis for further studies of Giardia pathogenesis and biology.

Giardia is a single cell intestinal parasite and a common cause of diarrhea in humans and animals. Giardia is an unusual eukaryote by possessing two nuclei, a highly reduced genome and simple transcriptional apparatus. We have characterized the transcriptome of Giardia at single nucleotide resolution, which allowed the calculation of digital gene expression values for the complete set of genes. We performed a comparison of gene expression divergence across three genotypes. Most of the genes were transcribed, and the data were used to refine and correct gene models. Several gene expression differences were identified between the genotypes. A non-transcribed cluster of genes was detected on chromosome 5, likely representing a silenced region. The data also allowed mapping of transcript termini, which provided the first global view of 3′ untranslated regions in this parasite. This study also gives the first genome-wide evidence of transcription of allelic variants in Giardia. In this study, we provide novel insights into the transcriptome of an important human pathogen and model eukaryote. The findings reported here likely relate to the lifestyle of this parasite and its adaptation to parasitism. The data provide starting points for functional investigation of Giardia's biology and diplomonads generally.

Collapse

Rigault C, Le Borgne F, Tazir B, Benani A, Demarquoy J. A high-fat diet increases L-carnitine synthesis through a differential maturation of the Bbox1 mRNAs. BIOCHIMICA ET BIOPHYSICA ACTA 2013;1831:370-7. [PMID: 23127966 DOI: 10.1016/j.bbalip.2012.10.007] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Revised: 10/15/2012] [Accepted: 10/26/2012] [Indexed: 12/30/2022]

Rehfeld A, Plass M, Krogh A, Friis-Hansen L. Alterations in polyadenylation and its implications for endocrine disease. Front Endocrinol (Lausanne) 2013;4:53. [PMID: 23658553 PMCID: PMC3647115 DOI: 10.3389/fendo.2013.00053] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2013] [Accepted: 04/22/2013] [Indexed: 12/17/2022] Open

Hon CC, Weber C, Sismeiro O, Proux C, Koutero M, Deloger M, Das S, Agrahari M, Dillies MA, Jagla B, Coppee JY, Bhattacharya A, Guillen N. Quantification of stochastic noise of splicing and polyadenylation in Entamoeba histolytica. Nucleic Acids Res 2012;41:1936-52. [PMID: 23258700 PMCID: PMC3561952 DOI: 10.1093/nar/gks1271] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Shi Y. Alternative polyadenylation: new insights from global analyses. RNA (NEW YORK, N.Y.) 2012;18:2105-17. [PMID: 23097429 PMCID: PMC3504663 DOI: 10.1261/rna.035899.112] [Citation(s) in RCA: 168] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]

Lin Y, Li Z, Ozsolak F, Kim SW, Arango-Argoty G, Liu TT, Tenenbaum SA, Bailey T, Monaghan AP, Milos PM, John B. An in-depth map of polyadenylation sites in cancer. Nucleic Acids Res 2012;40:8460-71. [PMID: 22753024 PMCID: PMC3458571 DOI: 10.1093/nar/gks637] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Revised: 05/16/2012] [Accepted: 06/06/2012] [Indexed: 12/22/2022] Open

Affiliation(s)

Yuefeng Lin Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Zhihua Li Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Fatih Ozsolak Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Sang Woo Kim Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Gustavo Arango-Argoty Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Teresa T. Liu Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Scott A. Tenenbaum Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Timothy Bailey Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
A. Paula Monaghan Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Patrice M. Milos Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA
Bino John Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, Helicos BioSciences Corporation, One Kendall Square, Cambridge, MA 02139, College of Nanoscale Science and Engineering, University at Albany-Suny, Albany, NY, USA, Institute for Molecular Bioscience, the University of Queensland, Queensland, Australia and Department of Neurobiology, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15260, USA

Collapse

de Klerk E, Venema A, Anvar SY, Goeman JJ, Hu O, Trollet C, Dickson G, den Dunnen JT, van der Maarel SM, Raz V, 't Hoen PAC. Poly(A) binding protein nuclear 1 levels affect alternative polyadenylation. Nucleic Acids Res 2012;40:9089-101. [PMID: 22772983 PMCID: PMC3467053 DOI: 10.1093/nar/gks655] [Citation(s) in RCA: 126] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Lopes-Marques M, Pereira-Castro I, Amorim A, Azevedo L. Characterization of the human ornithine transcarbamylase 3' untranslated regulatory region. DNA Cell Biol 2011;31:427-33. [PMID: 22054066 DOI: 10.1089/dna.2011.1391] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Shepard PJ, Choi EA, Lu J, Flanagan LA, Hertel KJ, Shi Y. Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. RNA (NEW YORK, N.Y.) 2011;17:761-72. [PMID: 21343387 PMCID: PMC3062186 DOI: 10.1261/rna.2581711] [Citation(s) in RCA: 327] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2010] [Accepted: 01/11/2011] [Indexed: 05/20/2023]

Liu X, Jiang Y, Russell JE. A potential regulatory role for mRNA secondary structures within the prothrombin 3'UTR. Thromb Res 2010;126:130-6. [PMID: 20553951 DOI: 10.1016/j.thromres.2010.04.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2010] [Revised: 03/10/2010] [Accepted: 04/20/2010] [Indexed: 11/20/2022]

Chan S, Choi EA, Shi Y. Pre-mRNA 3'-end processing complex assembly and function. WILEY INTERDISCIPLINARY REVIEWS-RNA 2010;2:321-35. [PMID: 21957020 DOI: 10.1002/wrna.54] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

The transcriptome of the human pathogen Trypanosoma brucei at single-nucleotide resolution. PLoS Pathog 2010;6:e1001090. [PMID: 20838601 PMCID: PMC2936537 DOI: 10.1371/journal.ppat.1001090] [Citation(s) in RCA: 225] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Accepted: 08/06/2010] [Indexed: 12/30/2022] Open

Abstract

The genome of Trypanosoma brucei, the causative agent of African trypanosomiasis, was published five years ago, yet identification of all genes and their transcripts remains to be accomplished. Annotation is challenged by the organization of genes transcribed by RNA polymerase II (Pol II) into long unidirectional gene clusters with no knowledge of how transcription is initiated. Here we report a single-nucleotide resolution genomic map of the T. brucei transcriptome, adding 1,114 new transcripts, including 103 non-coding RNAs, confirming and correcting many of the annotated features and revealing an extensive heterogeneity of 5′ and 3′ ends. Some of the new transcripts encode polypeptides that are either conserved in T. cruzi and Leishmania major or were previously detected in mass spectrometry analyses. High-throughput RNA sequencing (RNA-Seq) was sensitive enough to detect transcripts at putative Pol II transcription initiation sites. Our results, as well as recent data from the literature, indicate that transcription initiation is not solely restricted to regions at the beginning of gene clusters, but may occur at internal sites. We also provide evidence that transcription at all putative initiation sites in T. brucei is bidirectional, a recently recognized fundamental property of eukaryotic promoters. Our results have implications for gene expression patterns in other important human pathogens with similar genome organization (Trypanosoma cruzi, Leishmania sp.) and revealed heterogeneity in pre-mRNA processing that could potentially contribute to the survival and success of the parasite population in the insect vector and the mammalian host.

Identifying genes essential for survival in the host is fundamental to unraveling the biology of human pathogens and understanding mechanisms of pathogenesis. The protozoan parasite Trypanosoma brucei causes devastating diseases in humans and animals in sub-Saharan Africa, and the publication in 2005 of the genome sequence provided the first glance at the coding potential of this organism. Although at present there is a catalogue of predicted protein coding genes, the challenge remains to identify all authentic genes, including their boundaries. We used next generation RNA sequencing (RNA-Seq) to map transcribed regions and RNA polymerase II transcription initiation sites on a genome-wide scale. This approach allowed us to improve and correct the current annotation, to reveal a widespread heterogeneity of RNA processing sites (trans-splicing and polyadenylation) and to estimate that most genes are expressed at levels corresponding to 1 to 10 mRNAs per cell. Our data indicate that different transcript forms representing the same gene are present stochastically within the mRNA population. This unanticipated scenario may contribute to determining gene expression landscapes to adapt to different environments in the parasite life cycle.

Collapse

Transcriptional and structural analyses of Amsacta moorei entomopoxvirus protein kinase gene (AMV197, pk). ANN MICROBIOL 2010. [DOI: 10.1007/s13213-010-0082-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open

Zaretzki RL, Gilchrist MA, Briggs WM, Armagan A. Bias correction and Bayesian analysis of aggregate counts in SAGE libraries. BMC Bioinformatics 2010;11:72. [PMID: 20128916 PMCID: PMC2829012 DOI: 10.1186/1471-2105-11-72] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2009] [Accepted: 02/03/2010] [Indexed: 12/02/2022] Open

Abstract

Background

Tag-based techniques, such as SAGE, are commonly used to sample the mRNA pool of an organism's transcriptome. Incomplete digestion during the tag formation process may allow for multiple tags to be generated from a given mRNA transcript. The probability of forming a tag varies with its relative location. As a result, the observed tag counts represent a biased sample of the actual transcript pool. In SAGE this bias can be avoided by ignoring all but the 3' most tag but will discard a large fraction of the observed data. Taking this bias into account should allow more of the available data to be used leading to increased statistical power.

Results

Three new hierarchical models, which directly embed a model for the variation in tag formation probability, are proposed and their associated Bayesian inference algorithms are developed. These models may be applied to libraries at both the tag and aggregate level. Simulation experiments and analysis of real data are used to contrast the accuracy of the various methods. The consequences of tag formation bias are discussed in the context of testing differential expression. A description is given as to how these algorithms can be applied in that context.

Conclusions

Several Bayesian inference algorithms that account for tag formation effects are compared with the DPB algorithm providing clear evidence of superior performance. The accuracy of inferences when using a particular non-informative prior is found to depend on the expression level of a given gene. The multivariate nature of the approach easily allows both univariate and joint tests of differential expression. Calculations demonstrate the potential for false positive and negative findings due to variation in tag formation probabilities across samples when testing for differential expression.

Collapse

Wang P, Yu P, Gao P, Shi T, Ma D. Discovery of novel human transcript variants by analysis of intronic single-block EST with polyadenylation site. BMC Genomics 2009;10:518. [PMID: 19906316 PMCID: PMC2784480 DOI: 10.1186/1471-2164-10-518] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Accepted: 11/12/2009] [Indexed: 01/24/2023] Open

't Hoen PAC, Ariyurek Y, Thygesen HH, Vreugdenhil E, Vossen RHAM, de Menezes RX, Boer JM, van Ommen GJB, den Dunnen JT. Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms. Nucleic Acids Res 2008;36:e141. [PMID: 18927111 PMCID: PMC2588528 DOI: 10.1093/nar/gkn705] [Citation(s) in RCA: 560] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Characterization of a nonclassical class I MHC gene in a reptile, the Galápagos marine iguana (Amblyrhynchus cristatus). PLoS One 2008;3:e2859. [PMID: 18682845 PMCID: PMC2483932 DOI: 10.1371/journal.pone.0002859] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2008] [Accepted: 06/24/2008] [Indexed: 11/19/2022] Open

Abstract

Squamates are a diverse order of vertebrates, representing more than 7,000 species. Yet, descriptions of full-length major histocompatibility complex (MHC) genes in this group are nearly absent from the literature, while the number of MHC studies continues to rise in other vertebrate taxa. The lack of basic information about MHC organization in squamates inhibits investigation into the relationship between MHC polymorphism and disease, and leaves a large taxonomic gap in our understanding of amniote MHC evolution. Here, we use both cDNA and genomic sequence data to characterize a class I MHC gene (Amcr-UA) from the Galápagos marine iguana, a member of the squamate subfamily Iguaninae. Amcr-UA appears to be functional since it is expressed in the blood and contains many of the conserved peptide-binding residues that are found in classical class I genes of other vertebrates. In addition, comparison of Amcr-UA to homologous sequences from other iguanine species shows that the antigen-binding portion of this gene is under purifying selection, rather than balancing selection, and therefore may have a conserved function. A striking feature of Amcr-UA is that both the cDNA and genomic sequences lack the transmembrane and cytoplasmic domains that are necessary to anchor the class I receptor molecule into the cell membrane, suggesting that the product of this gene is secreted and consequently not involved in classical class I antigen-presentation. The truncated and conserved character of Amcr-UA lead us to define it as a nonclassical gene that is related to the few available squamate class I sequences. However, phylogenetic analysis placed Amcr-UA in a basal position relative to other published classical MHC genes from squamates, suggesting that this gene diverged near the beginning of squamate diversification.

Collapse

Carpentier SC, Coemans B, Podevin N, Laukens K, Witters E, Matsumura H, Terauchi R, Swennen R, Panis B. Functional genomics in a non-model crop: transcriptomics or proteomics? PHYSIOLOGIA PLANTARUM 2008;133:117-30. [PMID: 18312499 DOI: 10.1111/j.1399-3054.2008.01069.x] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Zhu J, He F, Wang J, Yu J. Modeling transcriptome based on transcript-sampling data. PLoS One 2008;3:e1659. [PMID: 18286206 PMCID: PMC2243018 DOI: 10.1371/journal.pone.0001659] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2007] [Accepted: 01/21/2008] [Indexed: 01/10/2023] Open

Abstract

Background

Newly-evolved multiplex sequencing technology has been bringing transcriptome sequencing into an unprecedented depth. Millions of transcript tags now can be acquired in a single experiment through parallelization. The significant increase in throughput and reduction in cost required us to address some fundamental questions, such as how many transcript tags do we have to sequence for a given transcriptome? How could we estimate the total number of unique transcripts for different cell types (transcriptome diversity) and the distribution of their copy numbers (transcriptome dynamics)? What is the probability that a transcript with a given expression level to be detected at a certain sampling depth?

Methodology/Principal Findings

We developed a statistical model to evaluate these parameters based on transcriptome-sampling data. Three mixture models were exploited for their potentials to model the sampling frequencies. We demonstrated that relative abundances of all transcripts in a transcriptome follow the generalized inverse Gaussian distribution. The widely known beta and gamma distributions failed to fulfill the singular characteristics of relative abundance distribution, i.e., highly skewed toward zero and with a long tail. An estimator of transcriptome diversity and an analytical form of sampling growth curve were proposed in a coherent framework. Experimental data fitted this model very well and Monte Carlo simulations based on this model replicated sampling experiments in a remarkable precision.

Conclusions

Taking human embryonic stem cell as a prototype, we demonstrated that sequencing tens of thousands of transcript tags in an ordinary EST/SAGE experiment was far from sufficient. In order to fully characterize a human transcriptome, millions of transcript tags had to be sequenced. This model lays a statistical basis for transcriptome-sampling experiments and in essence can be used in all sampling-based data.

Collapse

Lee JY, Park JY, Tian B. Identification of mRNA polyadenylation sites in genomes using cDNA sequences, expressed sequence tags, and Trace. Methods Mol Biol 2008;419:23-37. [PMID: 18369973 DOI: 10.1007/978-1-59745-033-1_2] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Bioinformatics detection of alternative splicing. Methods Mol Biol 2008;452:179-97. [PMID: 18566765 DOI: 10.1007/978-1-60327-159-2_9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Liu F, Jenssen TK, Trimarchi J, Punzo C, Cepko CL, Ohno-Machado L, Hovig E, Patrick Kuo W. Comparison of hybridization-based and sequencing-based gene expression technologies on biological replicates. BMC Genomics 2007;8:153. [PMID: 17555589 PMCID: PMC1899500 DOI: 10.1186/1471-2164-8-153] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2006] [Accepted: 06/07/2007] [Indexed: 02/06/2023] Open

Gain and loss of polyadenylation signals during evolution of green algae. BMC Evol Biol 2007;7:65. [PMID: 17442103 PMCID: PMC1868727 DOI: 10.1186/1471-2148-7-65] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2006] [Accepted: 04/18/2007] [Indexed: 11/24/2022] Open

Abstract

Background

The Viridiplantae (green algae and land plants) consist of two monophyletic lineages: the Chlorophyta and the Streptophyta. Most green algae belong to the Chlorophyta, while the Streptophyta include all land plants and a small group of freshwater algae known as Charophyceae. Eukaryotes attach a poly-A tail to the 3' ends of most nuclear-encoded mRNAs. In embryophytes, animals and fungi, the signal for polyadenylation contains an A-rich sequence (often AAUAAA or related sequence) 13 to 30 nucleotides upstream from the cleavage site, which is commonly referred to as the near upstream element (NUE). However, it has been reported that the pentanucleotide UGUAA is used as polyadenylation signal for some genes in volvocalean algae.

Results

We set out to investigate polyadenylation signal differences between streptophytes and chlorophytes that may have emerged shortly after the evolutionary split between Streptophyta and Chlorophyta. We therefore analyzed expressed genes (ESTs) from three streptophyte algae, Mesostigma viride, Klebsormidium subtile and Coleochaete scutata, and from two early-branching chlorophytes, Pyramimonas parkeae and Scherffelia dubia. In addition, to extend the database, our analyses included ESTs from six other chlorophytes (Acetabularia acetabulum, Chlamydomonas reinhardtii, Helicosporidium sp. ex Simulium jonesii, Prototheca wickerhamii, Scenedesmus obliquus and Ulva linza) and one streptophyte (Closterium peracerosum). Our results indicate that polyadenylation signals in green algae vary widely. The UGUAA motif is confined to late-branching Chlorophyta. Most streptophyte algae do not have an A-rich sequence motif like that in embryophytes, animals and fungi. We observed polyadenylation signals similar to those of Arabidopsis and other land plants only in Mesostigma.

Conclusion

Polyadenylation signals in green algae show considerable variation. A new NUE (UGUAA) was invented in derived chlorophytes and replaced not only the A-rich NUE but the complete poly(A) signal in all chlorophytes investigated except Scherffelia (only NUE replaced) and Pyramimonas (UGUAA completely missing). The UGUAA element is completely absent from streptophytes. However, the structure of the poly(A) signal was often modified in streptophyte algae. In most species investigated, an A-rich NUE is missing; instead, these species seem to rely mainly on U-rich elements.

Collapse

Moucadel V, Lopez F, Ara T, Benech P, Gautheret D. Beyond the 3' end: experimental validation of extended transcript isoforms. Nucleic Acids Res 2007;35:1947-57. [PMID: 17339231 PMCID: PMC1874610 DOI: 10.1093/nar/gkm062] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Gilat R, Shweiki D. A novel function for alternative polyadenylation as a rescue pathway from NMD surveillance. Biochem Biophys Res Commun 2007;353:487-92. [PMID: 17188645 DOI: 10.1016/j.bbrc.2006.12.052] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 12/07/2006] [Indexed: 10/23/2022]

Gilat R, Goncharov S, Esterman N, Shweiki D. Under-representation of PolyA/PolyT tailed ESTs in human ESTdb: an obstacle to alternative polyadenylation inference. Bioinformation 2006;1:220-4. [PMID: 17597892 PMCID: PMC1891686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 10/02/2006] [Accepted: 10/02/2006] [Indexed: 11/21/2022] Open

Majerciak V, Yamanegi K, Zheng ZM. Gene structure and expression of Kaposi's sarcoma-associated herpesvirus ORF56, ORF57, ORF58, and ORF59. J Virol 2006;80:11968-81. [PMID: 17020939 PMCID: PMC1676266 DOI: 10.1128/jvi.01394-06] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

Though similar to those of herpesvirus saimiri and Epstein-Barr virus (EBV), the Kaposi's sarcoma-associated herpesvirus (KSHV) genome features more splice genes and encodes many genes with bicistronic or polycistronic transcripts. In the present study, the gene structure and expression of KSHV ORF56 (primase), ORF57 (MTA), ORF58 (EBV BMRF2 homologue), and ORF59 (DNA polymerase processivity factor) were analyzed in butyrate-activated KSHV(+) JSC-1 cells. ORF56 was expressed at low abundance as a bicistronic ORF56/57 transcript that utilized the same intron, with two alternative branch points, as ORF57 for its RNA splicing. ORF56 was transcribed from two transcription start sites, nucleotides (nt) 78994 (minor) and 79075 (major), but selected the same poly(A) signal as ORF57 for RNA polyadenylation. The majority of ORF56 and ORF57 transcripts were cleaved at nt 83628, although other nearby cleavage sites were selectable. On the opposite strand of the viral genome, colinear ORF58 and ORF59 were transcribed from different transcription start sites, nt 95821 (major) or 95824 (minor) for ORF58 and nt 96790 (minor) or 96794 (major) for ORF59, but shared overlapping poly(A) signals at nt 94492 and 94488. Two cleavage sites, at nt 94477 and nt 94469, could be equally selected for ORF59 polyadenylation, but only the cleavage site at nt 94469 could be selected for ORF58 polyadenylation without disrupting the ORF58 stop codon immediately upstream. ORF58 was expressed in low abundance as a monocistronic transcript, with a long 5' untranslated region (UTR) but a short 3' UTR, whereas ORF59 was expressed in high abundance as a bicistronic transcript, with a short 5' UTR and a long 3' UTR similar to those of polycistronic ORF60 and ORF62. Both ORF56 and ORF59 are targets of ORF57 and were up-regulated significantly in the presence of ORF57, a posttranscriptional regulator.

Collapse