1
|
Zou Z, Zhang Z, Wang Y, Han K, Fu M, Lin P, Xiwei J. EST analysis on the gonad development related organs and microarray screen for differentially expressed genes in mature ovary and testis of Scylla paramamosain. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2011; 6:150-7. [PMID: 21262594 DOI: 10.1016/j.cbd.2010.12.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2010] [Revised: 12/18/2010] [Accepted: 12/27/2010] [Indexed: 11/24/2022]
Abstract
A total of 5160 high quality ESTs (expressed sequence tags) averaging 357 bp were collected from normalized cDNA libraries created from testis, ovary and mixed organs of mud crab Scylla paramamosain. Clustering and assembly of these ESTs resulted in a total of 3837 unique sequences with 576 overlapping contigs and 3261 singletons. Comparisons with the GenBank non-redundant (Nr) protein database (BLASTx, e-values <10(-5)) revealed putative functions or matched homologs from other organisms for 847 (22%) of the ESTs. Several gonad development related genes such as cathepsin C, thioredoxin peroxidase, vitellogenin receptor precursor, 50S ribosomal protein L24 and ubiquitin-conjugating enzyme E2 isoform 2 were identified from this EST project and demonstrated as gonad differential expression genes by rqRT-PCR. Sixty five different types of SSRs (simple sequence repeats) were identified from the total 411 EST-SSR motifs. A home-made cDNA microarray containing 5664 spots was developed and the hybridization results indicated that 39 unique transcripts were differentially expressed in testis and ovaries (P<0.05). The expression levels of eleven unique transcripts examined by rqRT-PCR were matched with microarray fairly. These results will provide a useful resource for functional genomic studies on the biology of reproduction of mud crab.
Collapse
Affiliation(s)
- Zhihua Zou
- The Key Laboratory of Science and Technology for Aquaculture and Food Safety, Fisheries College, Jimei University, Xiamen, China
| | | | | | | | | | | | | |
Collapse
|
2
|
Abstract
Recently, it has become possible to mobilize the Tc1/mariner transposon, Sleeping Beauty (SB), in mouse somatic cells at frequencies high enough to induce cancer. Tumours result from SB insertional mutagenesis of cancer genes, thus facilitating the identification of the genes and signalling pathways that drive tumour formation. A conditional SB transposition system has also been developed that makes it possible to limit where SB mutagenesis occurs, providing a means to selectively model many types of human cancer. SB mutagenesis has already identified a large collection of known cancer genes in addition to a plethora of new candidate cancer genes and potential drug targets.
Collapse
Affiliation(s)
- Neal G Copeland
- Genomics and Genetics Division, Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, 61 Biopolis Drive, Proteos, Singapore 138673
| | | |
Collapse
|
3
|
Dupuy AJ, Rogers LM, Kim J, Nannapaneni K, Starr TK, Liu P, Largaespada DA, Scheetz TE, Jenkins NA, Copeland NG. A modified sleeping beauty transposon system that can be used to model a wide variety of human cancers in mice. Cancer Res 2009; 69:8150-6. [PMID: 19808965 DOI: 10.1158/0008-5472.can-09-1135] [Citation(s) in RCA: 135] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Recent advances in cancer therapeutics stress the need for a better understanding of the molecular mechanisms driving tumor formation. This can be accomplished by obtaining a more complete description of the genes that contribute to cancer. We previously described an approach using the Sleeping Beauty (SB) transposon system to model hematopoietic malignancies in mice. Here, we describe modifications of the SB system that provide additional flexibility in generating mouse models of cancer. First, we describe a Cre-inducible SBase allele, RosaSBase(LsL), that allows the restriction of transposon mutagenesis to a specific tissue of interest. This allele was used to generate a model of germinal center B-cell lymphoma by activating SBase expression with an Aid-Cre allele. In a second approach, a novel transposon was generated, T2/Onc3, in which the CMV enhancer/chicken beta-actin promoter drives oncogene expression. When combined with ubiquitous SBase expression, the T2/Onc3 transposon produced nearly 200 independent tumors of more than 20 different types in a cohort of 62 mice. Analysis of transposon insertion sites identified novel candidate genes, including Zmiz1 and Rian, involved in squamous cell carcinoma and hepatocellular carcinoma, respectively. These novel alleles provide additional tools for the SB system and provide some insight into how this mutagenesis system can be manipulated to model cancer in mice.
Collapse
Affiliation(s)
- Adam J Dupuy
- Department of Anatomy and Cell Biology, Center for Bioinformatics, Computational Biology and Biomedical Engineering, Roy J and Lucille A Carver College of Medicine, University of Iowa, Iowa City, Iowa 52242, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
TAYLOR DLEE, BOOTH MICHAELG, MCFARLAND JACKW, HERRIOTT IANC, LENNON NIALLJ, NUSBAUM CHAD, MARR THOMASG. Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach. Mol Ecol Resour 2008; 8:742-52. [DOI: 10.1111/j.1755-0998.2008.02094.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
5
|
Kullberg M, Hallström BM, Arnason U, Janke A. Phylogenetic analysis of 1.5 Mbp and platypus EST data refute the Marsupionta hypothesis and unequivocally support Monotremata as sister group to Marsupialia/Placentalia. ZOOL SCR 2008. [DOI: 10.1111/j.1463-6409.2007.00319.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
6
|
TAYLOR DLEE, BOOTH MICHAELG, MCFARLAND JACKW, HERRIOTT IANC, LENNON NIALLJ, NUSBAUM CHAD, MARR THOMASG. Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach. Mol Ecol Resour 2008. [DOI: 10.1111/j.1471-8286.2008.02094.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
7
|
Govoroun M, Le Gac F, Guiguen Y. Generation of a large scale repertoire of Expressed Sequence Tags (ESTs) from normalised rainbow trout cDNA libraries. BMC Genomics 2006; 7:196. [PMID: 16887034 PMCID: PMC1564016 DOI: 10.1186/1471-2164-7-196] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2006] [Accepted: 08/03/2006] [Indexed: 11/12/2022] Open
Abstract
Background Within the framework of a genomics project on livestock species (AGENAE), we initiated a high-throughput DNA sequencing program of Expressed Sequence Tags (ESTs) in rainbow trout, Oncorhynchus mykiss. Results We constructed three cDNA libraries including one highly complex pooled-tissue library. These libraries were normalized and subtracted to reduce clone redundancy. ESTs sequences were produced, and 96 472 ESTs corresponding to high quality sequence reads were released on the international database, currently representing 42.5% of the overall sequence knowledge in this species. All these EST sequences and other publicly available ESTs in rainbow trout have been included on a publicly available Website (SIGENAE) and have been clustered into a total of 52 930 clusters of putative transcripts groups, including 24 616 singletons. 57.1% of these 52 930 clusters are represented by at least one Agenae EST and 14 343 clusters (27.1%) are only composed by Agenae ESTs. Sequence analysis also reveals that normalization and especially subtraction were effective in decreasing redundancy, and that the pooled-tissue library was representative of the initial tissue complexity. Conclusion Due to present work on the construction of rainbow trout normalized cDNA libraries and their extensive sequencing, along with other large scale sequencing programs, rainbow trout is now one of the major fish models in term of EST sequences available in a public database, just after Zebrafish, Danio rerio. This information is now used for the selection of a non redundant set of clones for producing DNA micro-arrays in order to examine global gene expression.
Collapse
Affiliation(s)
- Marina Govoroun
- Institut National de la Recherche Agronomique, Station Commune de Recherches en Ichtyophysiologie, Biodiversité et Environnement (SCRIBE), INRA-SCRIBE, IFR 140, Campus de Beaulieu, 35 042 Rennes Cedex, France
- Station INRA de Recherches avicoles, 37380 Nouzilly, France
| | - Florence Le Gac
- Institut National de la Recherche Agronomique, Station Commune de Recherches en Ichtyophysiologie, Biodiversité et Environnement (SCRIBE), INRA-SCRIBE, IFR 140, Campus de Beaulieu, 35 042 Rennes Cedex, France
| | - Yann Guiguen
- Institut National de la Recherche Agronomique, Station Commune de Recherches en Ichtyophysiologie, Biodiversité et Environnement (SCRIBE), INRA-SCRIBE, IFR 140, Campus de Beaulieu, 35 042 Rennes Cedex, France
| |
Collapse
|
8
|
Chun CK, Scheetz TE, Bonaldo MDF, Brown B, Clemens A, Crookes-Goodson WJ, Crouch K, DeMartini T, Eyestone M, Goodson MS, Janssens B, Kimbell JL, Koropatnick TA, Kucaba T, Smith C, Stewart JJ, Tong D, Troll JV, Webster S, Winhall-Rice J, Yap C, Casavant TL, McFall-Ngai MJ, Soares MB. An annotated cDNA library of juvenile Euprymna scolopes with and without colonization by the symbiont Vibrio fischeri. BMC Genomics 2006; 7:154. [PMID: 16780587 PMCID: PMC1574308 DOI: 10.1186/1471-2164-7-154] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2006] [Accepted: 06/16/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biologists are becoming increasingly aware that the interaction of animals, including humans, with their coevolved bacterial partners is essential for health. This growing awareness has been a driving force for the development of models for the study of beneficial animal-bacterial interactions. In the squid-vibrio model, symbiotic Vibrio fischeri induce dramatic developmental changes in the light organ of host Euprymna scolopes over the first hours to days of their partnership. We report here the creation of a juvenile light-organ specific EST database. RESULTS We generated eleven cDNA libraries from the light organ of E. scolopes at developmentally significant time points with and without colonization by V. fischeri. Single pass 3' sequencing efforts generated 42,564 expressed sequence tags (ESTs) of which 35,421 passed our quality criteria and were then clustered via the UIcluster program into 13,962 nonredundant sequences. The cDNA clones representing these nonredundant sequences were sequenced from the 5' end of the vector and 58% of these resulting sequences overlapped significantly with the associated 3' sequence to generate 8,067 contigs with an average sequence length of 1,065 bp. All sequences were annotated with BLASTX (E-value < -03) and Gene Ontology (GO). CONCLUSION Both the number of ESTs generated from each library and GO categorizations are reflective of the activity state of the light organ during these early stages of symbiosis. Future analyses of the sequences identified in these libraries promise to provide valuable information not only about pathways involved in colonization and early development of the squid light organ, but also about pathways conserved in response to bacterial colonization across the animal kingdom.
Collapse
Affiliation(s)
- Carlene K Chun
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Todd E Scheetz
- Department of Ophthalmology and Visual Science, University of Iowa, Iowa City, IA 52242, USA
- Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA
| | | | - Bartley Brown
- Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA
| | - Anik Clemens
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Wendy J Crookes-Goodson
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Keith Crouch
- Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
| | - Tad DeMartini
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Mari Eyestone
- Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
| | - Michael S Goodson
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Bernadette Janssens
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Jennifer L Kimbell
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Tanya A Koropatnick
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Tamara Kucaba
- Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
| | - Christina Smith
- Children's Memorial Research Center, Northwestern University, Chicago, IL, 60614, USA
| | - Jennifer J Stewart
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Deyan Tong
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Joshua V Troll
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Sarahrose Webster
- Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
| | - Jane Winhall-Rice
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Cory Yap
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - Thomas L Casavant
- Department of Ophthalmology and Visual Science, University of Iowa, Iowa City, IA 52242, USA
- Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA
- Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA
| | - Margaret J McFall-Ngai
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
| | - M Bento Soares
- Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
- Department of Biochemistry, University of Iowa, Iowa City, IA 52242, USA
- Department of Orthopaedics, University of Iowa, Iowa City, IA 52242, USA
- Physiology and Biophysics, University of Iowa, Iowa City, IA 52242, USA
- Children's Memorial Research Center, Northwestern University, Chicago, IL, 60614, USA
| |
Collapse
|
9
|
Bonaldo MF, Bair TB, Scheetz TE, Snir E, Akabogu I, Bair JL, Berger B, Crouch K, Davis A, Eyestone ME, Keppel C, Kucaba TA, Lebeck M, Lin JL, de Melo AIR, Rehmann J, Reiter RS, Schaefer K, Smith C, Tack D, Trout K, Sheffield VC, Lin JJC, Casavant TL, Soares MB. 1274 full-open reading frames of transcripts expressed in the developing mouse nervous system. Genome Res 2004; 14:2053-63. [PMID: 15489326 PMCID: PMC528920 DOI: 10.1101/gr.2601304] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
As part of the trans-National Institutes of Health (NIH) Mouse Brain Molecular Anatomy Project (BMAP), and in close coordination with the NIH Mammalian Gene Collection Program (MGC), we initiated a large-scale project to clone, identify, and sequence the complete open reading frame (ORF) of transcripts expressed in the developing mouse nervous system. Here we report the analysis of the ORF sequence of 1274 cDNAs, obtained from 47 full-length-enriched cDNA libraries, constructed by using a novel approach, herein described. cDNA libraries were derived from size-fractionated cytoplasmic mRNA isolated from brain and eye tissues obtained at several embryonic stages and postnatal days. Altogether, including the full-ORF MGC sequences derived from these libraries by the MGC sequencing team, NIH_BMAP full-ORF sequences correspond to approximately 20% of all transcripts currently represented in mouse MGC. We show that NIH_BMAP clones comprise 68% of mouse MGC cDNAs > or =5 kb, and 54% of those > or =4 kb, as of March 15, 2004. Importantly, we identified transcripts, among the 1274 full-ORF sequences, that are exclusively or predominantly expressed in brain and eye tissues, many of which encode yet uncharacterized proteins.
Collapse
Affiliation(s)
- Maria F Bonaldo
- Department of Pediatrics, The University of Iowa, Iowa City, Iowa 52242, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Schlicht M, Matysiak B, Brodzeller T, Wen X, Liu H, Zhou G, Dhir R, Hessner MJ, Tonellato P, Suckow M, Pollard M, Datta MW. Cross-species global and subset gene expression profiling identifies genes involved in prostate cancer response to selenium. BMC Genomics 2004; 5:58. [PMID: 15318950 PMCID: PMC516028 DOI: 10.1186/1471-2164-5-58] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2003] [Accepted: 08/20/2004] [Indexed: 11/25/2022] Open
Abstract
Background Gene expression technologies have the ability to generate vast amounts of data, yet there often resides only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pathways or transcriptional regulatory grouping to sort genes for further study. In this paper we demonstrate a comparative genomics based method to leverage data from animal models to prioritize genes for validation. This approach allows one to develop a disease-based focus for the prioritization of gene data, a process that is essential for systems that lack significant functional pathway data yet have defined animal models. This method is made possible through the use of highly controlled spotted cDNA slide production and the use of comparative bioinformatics databases without the use of cross-species slide hybridizations. Results Using gene expression profiling we have demonstrated a similar whole transcriptome gene expression patterns in prostate cancer cells from human and rat prostate cancer cell lines both at baseline expression levels and after treatment with physiologic concentrations of the proposed chemopreventive agent Selenium. Using both the human PC3 and rat PAII prostate cancer cell lines have gone on to identify a subset of one hundred and fifty-four genes that demonstrate a similar level of differential expression to Selenium treatment in both species. Further analysis and data mining for two genes, the Insulin like Growth Factor Binding protein 3, and Retinoic X Receptor alpha, demonstrates an association with prostate cancer, functional pathway links, and protein-protein interactions that make these genes prime candidates for explaining the mechanism of Selenium's chemopreventive effect in prostate cancer. These genes are subsequently validated by western blots showing Selenium based induction and using tissue microarrays to demonstrate a significant association between downregulated protein expression and tumorigenesis, a process that is the reverse of what is seen in the presence of Selenium. Conclusions Thus the outlined process demonstrates similar baseline and selenium induced gene expression profiles between rat and human prostate cancers, and provides a method for identifying testable functional pathways for the action of Selenium's chemopreventive properties in prostate cancer.
Collapse
Affiliation(s)
- Michael Schlicht
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Department of Pathology, Winship Cancer Institute, Emory University School of Medicine, 1365-B Clifton Road NE, Atlanta, GA, 30322, USA
| | - Brian Matysiak
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Tracy Brodzeller
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Xinyu Wen
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Bioinformatics Program and Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Hang Liu
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Bioinformatics Program and Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Guohui Zhou
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Department of Pathology, Winship Cancer Institute, Emory University School of Medicine, 1365-B Clifton Road NE, Atlanta, GA, 30322, USA
| | - Rajiv Dhir
- Department of Pathology, University of Pittsburgh Medical Center, 200 Lothrop Street, Pittsburgh, PA, 15242, USA
| | - Martin J Hessner
- Department of Pediatrics and Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Peter Tonellato
- Bioinformatics Program and Human and Molecular Genetics Center, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
| | - Mark Suckow
- Walther Cancer Center, Lobund Laboratories, 400 Freiman Life Science Center, Notre Dame University, Notre Dame, IN, 46556, USA
| | - Morris Pollard
- Walther Cancer Center, Lobund Laboratories, 400 Freiman Life Science Center, Notre Dame University, Notre Dame, IN, 46556, USA
| | - Milton W Datta
- Department of Pathology, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI, 53226, USA
- Department of Pathology, Winship Cancer Institute, Emory University School of Medicine, 1365-B Clifton Road NE, Atlanta, GA, 30322, USA
| |
Collapse
|
11
|
Scheetz TE, Laffin JJ, Berger B, Holte S, Baumes SA, Brown R, Chang S, Coco J, Conklin J, Crouch K, Donohue M, Doonan G, Estes C, Eyestone M, Fishler K, Gardiner J, Guo L, Johnson B, Keppel C, Kreger R, Lebeck M, Marcelino R, Miljkovich V, Perdue M, Qui L, Rehmann J, Reiter RS, Rhoads B, Schaefer K, Smith C, Sunjevaric I, Trout K, Wu N, Birkett CL, Bischof J, Gackle B, Gavin A, Grundstad AJ, Mokrzycki B, Moressi C, O'Leary B, Pedretti K, Roberts C, Robinson NL, Smith M, Tack D, Trivedi N, Kucaba T, Freeman T, Lin JJC, Bonaldo MF, Casavant TL, Sheffield VC, Soares MB. High-throughput gene discovery in the rat. Genome Res 2004; 14:733-41. [PMID: 15060017 PMCID: PMC383320 DOI: 10.1101/gr.1414204] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The rat is an important animal model for human diseases and is widely used in physiology. In this article we present a new strategy for gene discovery based on the production of ESTs from serially subtracted and normalized cDNA libraries, and we describe its application for the development of a comprehensive nonredundant collection of rat ESTs. Our new strategy appears to yield substantially more EST clusters per ESTs sequenced than do previous approaches that did not use serial subtraction. However, multiple rounds of library subtraction resulted in high frequencies of otherwise rare internally primed cDNAs, defining the limits of this powerful approach. To date, we have generated >200,000 3' ESTs from >100 cDNA libraries representing a wide range of tissues and developmental stages of the laboratory rat. Most importantly, we have contributed to approximately 50,000 rat UniGene clusters. We have identified, arrayed, and derived 5' ESTs from >30,000 unique rat cDNA clones. Complete information, including radiation hybrid mapping data, is also maintained locally at http://genome.uiowa.edu/clcg.html. All of the sequences described in this article have been submitted to the dbEST division of the NCBI.
Collapse
Affiliation(s)
- Todd E Scheetz
- Center for Bioinformatics and Computational Biology, The University of Iowa, Iowa City, Iowa 52242, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Laffin JJS, Scheetz TE, Bonaldo MDF, Reiter RS, Chang S, Eyestone M, Abdulkawy H, Brown B, Roberts C, Tack D, Kucaba T, Lin JJC, Sheffield VC, Casavant TL, Soares MB. A comprehensive nonredundant expressed sequence tag collection for the developing Rattus norvegicus heart. Physiol Genomics 2004; 17:245-52. [PMID: 14762174 DOI: 10.1152/physiolgenomics.00186.2003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Congenital heart defects affect ∼1,000,000 people in the United States, with 40,000 new births contributing to that number every year. A large percentage of these defects can be attributed to septal defects. We assembled a nonredundant collection of over 12,000 expressed sequence tags (ESTs) from a total of 30,000 ESTs, with the ultimate goal of identifying spatially and/or temporally regulated genes during heart septation. These ESTs were compiled from nonnormalized, normalized, and serially subtracted cDNA libraries derived from two sets of tissue samples. The first includes microdissected rat hearts from embryonic (E) days E13, E15, and E16.5–E18.5 and adult heart. The second includes hearts from embryonic days E17, E19, and E21 and postnatal (P) days P1, P12, P74, and P200. Over 6,000 novel ESTs were identified in the libraries derived from these two sets of tissues, all of which have been contributed to the NCBI rat UniGene collection. It is anticipated that such EST and cDNA clone resources will prove invaluable to gene expression studies aimed at the understanding of the molecular mechanisms underlying heart septation defects.
Collapse
Affiliation(s)
- Jennifer J S Laffin
- Department of Pediatrics and Interdepartmental-Genetics Graduate Program, The University of Iowa, Iowa City, Iowa 52242, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Scheetz TE, Zabner J, Welsh MJ, Coco J, Eyestone MDF, Bonaldo M, Kucaba T, Casavant TL, Soares MB, McCray PB. Large-scale gene discovery in human airway epithelia reveals novel transcripts. Physiol Genomics 2004; 17:69-77. [PMID: 14701920 DOI: 10.1152/physiolgenomics.00188.2003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The airway epithelium represents an important barrier between the host and the environment. It is a first site of contact with pathogens, particulates, and other stimuli, and has evolved the means to dynamically respond to these challenges. In an effort to define the transcript profile of airway epithelia, we created and sequenced cDNA libraries from cystic fibrosis (CF) and non-CF epithelia and from human lung tissue. Sequencing of these libraries produced approximately 53,000 3'-expressed sequence tags (3'-ESTs). From these, a nonredundant UniGene set of more than 19,000 sequences was generated. Despite the relatively small contribution of airway epithelia to the total mass of the lung, focused gene discovery in this tissue yielded novel results. The ESTs included several thousand transcripts (6,416) not previously identified from cDNA sequences as expressed in the lung. Among the abundant transcripts were several genes involved in host defense. Most importantly, the set also included 879 3'-ESTs that appear to be novel sequences not previously represented in the National Center for Biotechnology Information UniGene collection. This UniGene set should be useful for studies of pulmonary diseases involving the airway epithelium including cystic fibrosis, respiratory infections and asthma. It also provides a reagent for large-scale expression profiling.
Collapse
Affiliation(s)
- Todd E Scheetz
- Department of Pediatrics, Roy J. and Lucille A. Carver College of Medicine, University of Iowa, Iowa City, Iowa 52242, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Tuggle CK, Green JA, Fitzsimmons C, Woods R, Prather RS, Malchenko S, Soares BM, Kucaba T, Crouch K, Smith C, Tack D, Robinson N, O'Leary B, Scheetz T, Casavant T, Pomp D, Edeal BJ, Zhang Y, Rothschild MF, Garwood K, Beavis W. EST-based gene discovery in pig: virtual expression patterns and comparative mapping to human. Mamm Genome 2003; 14:565-79. [PMID: 12925889 DOI: 10.1007/s00335-002-2263-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2003] [Accepted: 04/03/2003] [Indexed: 10/26/2022]
Abstract
A molecular understanding of porcine reproduction is of biological interest and economic importance. Our Midwest Consortium has produced cDNA libraries containing the majority of genes expressed in major female reproductive tissues, and we have deposited into public databases 21,499 expressed sequence tag (EST) gene sequences from the 3' end of clones from these libraries. These sequences represent 10,574 different genes, based on sequence comparison among these data, and comparison with existing porcine ESTs and genes indicate as many as 4652 of these EST clusters are novel. In silico analysis identified sequences that are expressed in specific pig tissues or organs and confirmed the broad expression in pig for many genes ubiquitously expressed in human tissues. Furthermore, we have developed computer software to identify sequence similarity of these pig genes with their human counterparts, and to extract the mapping information of these human homologues from genome databases. We demonstrate the utility of this software for comparative mapping by localizing 61 genes on the porcine physical map for Chromosomes (Chrs) 5, 10, and 14.
Collapse
Affiliation(s)
- Christopher K Tuggle
- Center for Integrated Animal Genomics and Department of Animal Science, Iowa State University, 2255 Kildee Hall, Ames, Iowa 50011, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Sorek R, Safer HM. A novel algorithm for computational identification of contaminated EST libraries. Nucleic Acids Res 2003; 31:1067-74. [PMID: 12560505 PMCID: PMC149192 DOI: 10.1093/nar/gkg170] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A key goal of the Human Genome Project was to understand the complete set of human proteins, the proteome. Since the genome sequence by itself is not sufficient for predicting new genes and alternative splicing events that lead to new proteins, expressed sequence tags (ESTs) are used as the primary tool for these purposes. The high prevalence of artifacts in dbEST, however, often leads to invalid predictions. Here we describe a novel method for recognizing genomic DNA contamination and other artifacts that cannot be identified using current EST cleaning techniques. Our method uses the alignment of the entire set of ESTs to the human genome to identify highly contaminated EST libraries. We discovered 53 highly contaminated libraries and a subset of 24 766 ESTs from these libraries that probably represent contamination with genomic DNA, pre-mRNA, and ESTs that span non-canonical introns. Although this is only a small fraction of the entire EST dataset, each contaminating sequence could create a spurious transcript prediction. Indeed, in the clustering and assembly tool that we used, these sequences would have caused incorrect inference of 9575 new splice variants and 6370 new genes. Conclusions based on EST analysis, including prediction of alternative splicing, should be re-evaluated in light of these results. Our method, along with the identified set of contaminated sequences, will be essential for applications that depend on large EST datasets.
Collapse
Affiliation(s)
- Rotem Sorek
- Compugen Ltd, 72 Pinchas Rosen Street, Tel Aviv 69512, Israel.
| | | |
Collapse
|
16
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2003. [PMCID: PMC2447381 DOI: 10.1002/cfg.226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|