1
|
Pardo I, Lillemoe HA, Blosser RJ, Choi M, Sauder CAM, Doxey DK, Mathieson T, Hancock BA, Baptiste D, Atale R, Hickenbotham M, Zhu J, Glasscock J, Storniolo AMV, Zheng F, Doerge RW, Liu Y, Badve S, Radovich M, Clare SE. Next-generation transcriptome sequencing of the premenopausal breast epithelium using specimens from a normal human breast tissue bank. Breast Cancer Res 2014; 16:R26. [PMID: 24636070 PMCID: PMC4053088 DOI: 10.1186/bcr3627] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2013] [Accepted: 03/10/2014] [Indexed: 12/12/2022] Open
Abstract
Introduction Our efforts to prevent and treat breast cancer are significantly impeded by a lack of knowledge of the biology and developmental genetics of the normal mammary gland. In order to provide the specimens that will facilitate such an understanding, The Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center (KTB) was established. The KTB is, to our knowledge, the only biorepository in the world prospectively established to collect normal, healthy breast tissue from volunteer donors. As a first initiative toward a molecular understanding of the biology and developmental genetics of the normal mammary gland, the effect of the menstrual cycle and hormonal contraceptives on DNA expression in the normal breast epithelium was examined. Methods Using normal breast tissue from 20 premenopausal donors to KTB, the changes in the mRNA of the normal breast epithelium as a function of phase of the menstrual cycle and hormonal contraception were assayed using next-generation whole transcriptome sequencing (RNA-Seq). Results In total, 255 genes representing 1.4% of all genes were deemed to have statistically significant differential expression between the two phases of the menstrual cycle. The overwhelming majority (221; 87%) of the genes have higher expression during the luteal phase. These data provide important insights into the processes occurring during each phase of the menstrual cycle. There was only a single gene significantly differentially expressed when comparing the epithelium of women using hormonal contraception to those in the luteal phase. Conclusions We have taken advantage of a unique research resource, the KTB, to complete the first-ever next-generation transcriptome sequencing of the epithelial compartment of 20 normal human breast specimens. This work has produced a comprehensive catalog of the differences in the expression of protein-coding genes as a function of the phase of the menstrual cycle. These data constitute the beginning of a reference data set of the normal mammary gland, which can be consulted for comparison with data developed from malignant specimens, or to mine the effects of the hormonal flux that occurs during the menstrual cycle.
Collapse
|
2
|
Radovich M, Clare SE, Atale R, Pardo I, Hancock BA, Solzak JP, Kassem N, Mathieson T, Storniolo AMV, Rufenbarger C, Lillemoe HA, Blosser RJ, Choi MR, Sauder CA, Doxey D, Henry JE, Hilligoss EE, Sakarya O, Hyland FC, Hickenbotham M, Zhu J, Glasscock J, Badve S, Ivan M, Liu Y, Sledge GW, Schneider BP. Characterizing the heterogeneity of triple-negative breast cancers using microdissected normal ductal epithelium and RNA-sequencing. Breast Cancer Res Treat 2013; 143:57-68. [PMID: 24292813 DOI: 10.1007/s10549-013-2780-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Accepted: 11/15/2013] [Indexed: 12/31/2022]
Abstract
Triple-negative breast cancers (TNBCs) are a heterogeneous set of tumors defined by an absence of actionable therapeutic targets (ER, PR, and HER-2). Microdissected normal ductal epithelium from healthy volunteers represents a novel comparator to reveal insights into TNBC heterogeneity and to inform drug development. Using RNA-sequencing data from our institution and The Cancer Genome Atlas (TCGA) we compared the transcriptomes of 94 TNBCs, 20 microdissected normal breast tissues from healthy volunteers from the Susan G. Komen for the Cure Tissue Bank, and 10 histologically normal tissues adjacent to tumor. Pathway analysis comparing TNBCs to optimized normal controls of microdissected normal epithelium versus classic controls composed of adjacent normal tissue revealed distinct molecular signatures. Differential gene expression of TNBC compared with normal comparators demonstrated important findings for TNBC-specific clinical trials testing targeted agents; lack of over-expression for negative studies and over-expression in studies with drug activity. Next, by comparing each individual TNBC to the set of microdissected normals, we demonstrate that TNBC heterogeneity is attributable to transcriptional chaos, is associated with non-silent DNA mutational load, and explains transcriptional heterogeneity in addition to known molecular subtypes. Finally, chaos analysis identified 146 core genes dysregulated in >90 % of TNBCs revealing an over-expressed central network. In conclusion, use of microdissected normal ductal epithelium from healthy volunteers enables an optimized approach for studying TNBC and uncovers biological heterogeneity mediated by transcriptional chaos.
Collapse
Affiliation(s)
- Milan Radovich
- Division of General Surgery, Department of Surgery, Indiana University School of Medicine, Indianapolis, IN, USA,
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
|
4
|
|
5
|
Radovich M, Hancock BA, Kassem N, Zhu J, Glasscock J, Badve S, Liu Y, Kesler KA, Loehrer PJ, Schneider BP. Abstract 4858: Next-generation whole transcriptome sequencing of thymic malignancies. Cancer Res 2011. [DOI: 10.1158/1538-7445.am2011-4858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Thymomas & Thymic Carcinomas are rare malignancies with approximately 500 cases in the US per year. Apart from standard chemotherapy, treatment options are limited for those patients who become refractory to therapy or present with distant metastasis. Further, histological subtyping of these tumors has proven challenging, resulting in substantial discordance between pathologists and hindering the development of targeted therapy. A major impediment to therapeutic advancement is an inadequate understanding of the transcriptional biology of these cancers. Using next-generation sequencing, we embarked on a study to survey the transcriptomes of thymic malignancies to comprehensively identify novel biology by analyzing all full length transcripts expressed in these tissues.
Methods: Frozen thymomas, thymic carcinoma, and normal tissues were obtained from the Indiana University Simon Cancer Center. The WHO (2004 classification) subtypes represented in our sample set include: (4) type A, (2) A/B, (1) B2, (5) B3, (1) C, and (3) normal tissues. Tissues were reviewed and classified by one pathologist (S.B.) who was blinded to subsequent analyses. cDNA libraries were prepared and sequenced on an Applied Biosystems (AB) SOLiD3+ sequencer using a 50bp fragment run. For gene expression, mapping of reads to the genome was performed using the AB BioScope 1.0 Pipeline and outputs imported into Partek Genomics Suite for analysis. In Partek, mapped reads were cross-referenced against known genes from the UCSC database followed by statistical comparison of RPKM values for each gene between subtypes. Dimensionality reduction analyses (PCA & hierarchical clustering) were also performed in Partek.
Results: RNA sequencing of the 16 tissues produced 736 million reads equaling 37GB of data of which 24.4GB (66%) mapped to the human genome. These initial sequencing outputs represent only a portion, as additional paired-end sequencing of these samples is ongoing. In our preliminary data analysis, unsupervised hierarchical clustering of RPKM values from 20,600 RefSeq genes revealed 100% concordance between gene expression clusters and WHO subtype. A subsequent unsupervised clustering of 705 pre-miRNAs also showed substantial concordance between clusters and subtype. When analyzing differential gene expression between the three subtypes most represented in our data set (A, A/B, B3), we report 318 genes to be differentially expressed between A vs. A/B, 799 genes for A/B vs. B3, and 1524 genes for A vs. B3.
Discussion: We report preliminary data from RNA-sequencing of thymic malignancies. Initial analyses reveal that this technology could be used to accurately subtype these tumors. Further paired-end sequencing of these samples and additional tumors is ongoing. Subsequent analyses of these data include identifying gene fusions, mutations, alternative splicing, noncoding RNAs, novel transcribed regions, and potential viral genomes.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 102nd Annual Meeting of the American Association for Cancer Research; 2011 Apr 2-6; Orlando, FL. Philadelphia (PA): AACR; Cancer Res 2011;71(8 Suppl):Abstract nr 4858. doi:10.1158/1538-7445.AM2011-4858
Collapse
Affiliation(s)
| | | | - Nawal Kassem
- 1Indiana Univ. School of Medicine, Indianapolis, IN
| | - Jin Zhu
- 2Cofactor Genomics, LLC, St. Louis, MO
| | | | - Sunil Badve
- 1Indiana Univ. School of Medicine, Indianapolis, IN
| | - Yunlong Liu
- 1Indiana Univ. School of Medicine, Indianapolis, IN
| | | | | | | |
Collapse
|
6
|
Radovich M, Clare SE, Sledge GW, Pardo I, Mathieson T, Kassem N, Hancock BA, Storniolo AMV, Rufenbarger C, Lillemoe HA, Sun J, Henry JE, Goulet R, Hilligoss EE, Siddiqui AS, Breu H, Sakarya O, Hyland FC, Muller MW, Popescu L, Zhu J, Hickenbotham M, Glasscock J, Ivan M, Liu Y, Schneider BP. Abstract PD01-08: Decoding the Transcriptional Landscape of Triple-Negative Breast Cancer Using Next-Generation Whole Transcriptome Sequencing. Cancer Res 2010. [DOI: 10.1158/0008-5472.sabcs10-pd01-08] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Triple-negative breast cancer (TNBC) has been plagued by the absence of targeted therapies. Discovery of therapeutic targets in TNBC has in part, been hampered by an inadequate understanding of the transcriptional biology of the normal breast as an optimal comparator. Using next-generation sequencing, we embarked on a study to compare the transcriptomes of TNBC and normal breast to comprehensively identify novel targets by analyzing all full length transcripts expressed in these tissues.
Methods: Normal breast tissues from healthy pre-menopausal volunteers with no history of disease were procured from the Susan G. Komen for the Cure® Tissue Bank at the IU Simon Cancer Center. To eliminate bias from stromal tissue, normal tissues were laser capture microdissected for ductal epithelium. cDNA libraries from 10 TNBC tumors and 10 normal breast tissues were sequenced on an Applied Biosystems (AB) SOLiD3 sequencer using 50bp fragment runs. For gene expression, mapping of reads to the genome was performed using the AB BioScope 1.2 Pipeline and outputs imported into Partek Genomics Suite for analysis. In Partek, mapped reads were cross-referenced against known genes from the UCSC database followed by statistical comparison of RPKM values for each gene between TNBC and normal. Dimensionality reduction analyses (PCA & Hierarchical clustering) and identification of Novel Transcribed Regions were also performed in Partek, whereas construction of gene networks was performed using Ingenuity Pathway Analysis. To identify gene fusions, partially mapped reads were interrogated utilizing a novel algorithm that searched for reads spanning exons from two different genes. Fusions that were supported by at least 3 reads (of which 2 had to be unique) were considered candidates and were subsequently validated. Results/Discussion: Sequencing produced 1.1 billion reads equaling 57.3GB of data of which 36.0GB (63%) mapped to the human genome. In comparing RPKM values between TNBC and Normal, we report 7140 RefSeq Genes, 22 pre-miRNAs, 109 lincRNA exons, and 15 ultraconserved regions that were differentially expressed between these tissues (FDR<0.01). Biological interpretation of these results reveals upregulation of genes and miRNAs involved in DNA repair, angiogenesis, and inhibitors of Estrogen Receptor-alpha. Some previous drug targets (e.g. EGFR and c-kit) were not found to be upregulated here which may explain lack of clinical success to date. Conversely, PARP was significantly upregulated and early trial results suggest a strong signal for efficacy with inhibition of PARP. We also surveyed the genome for Novel Transcribed Regions (NTRs), defined as areas of significant transcription where no annotated gene is present. When comparing between TNBC and Normal, we report 6408 NTRs to be differentially expressed (FDR<0.01). Lastly, when analyzing the dataset for gene fusions, we identified several gene fusions in the TNBC samples, though no individual fusion was present in more than one sample.
Conclusion: We report an extensive comparison of the transcriptomes of TNBC and normal ductal epithelium. We identified numerous genes previously unknown to be dysregulated in TNBC that can be utilized for therapeutic discovery.
Citation Information: Cancer Res 2010;70(24 Suppl):Abstract nr PD01-08.
Collapse
Affiliation(s)
- M Radovich
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - SE Clare
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - GW Sledge
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - I Pardo
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - T Mathieson
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - N Kassem
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - BA Hancock
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - AMV Storniolo
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - C Rufenbarger
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - HA Lillemoe
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Sun
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - JE Henry
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - R Goulet
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - EE Hilligoss
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - AS Siddiqui
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - H Breu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - O Sakarya
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - FC Hyland
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - MW Muller
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - L Popescu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Zhu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - M Hickenbotham
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Glasscock
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - M Ivan
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - Y Liu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - BP. Schneider
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| |
Collapse
|
7
|
Radovich M, Clare SE, Pardo I, Hancock BA, Kassem N, Sledge GW, Rufenbarger C, Storniolo AMV, Mathieson T, Sun J, Henry JE, Lillemoe HA, Hilligoss EE, Elliott JS, Richt R, Hickenbotham M, Glasscock J, Liu Y, Schneider BP. Abstract 2216: Next-generation whole transcriptome sequencing of triple-negative breast tumors and normal tissues. Cancer Res 2010. [DOI: 10.1158/1538-7445.am10-2216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Triple-negative breast cancer (TNBC) disproportionally affects pre-menopausal women and women of African-American descent, and has been plagued by the absence of targeted therapies leading to poor survival. The paucity of therapeutic targets in TNBC impels us to utilize new technologies that can determine novel targets on a global scale. Using next-generation sequencing, we embarked on a study to analyze the whole transcriptomes of TNBC tumors compared to normal breast tissues in order to comprehensively identify novel targets by analyzing all full length transcripts expressed in these tissues.
Methods: Normal breast tissues from healthy pre-menopausal volunteers with no history of disease were procured from the Susan G. Komen for the Cure® Tissue Bank at the IU Simon Cancer Center. To eliminate bias from stromal tissue, epithelial cells were laser capture microdissected and RNA extracted from captured cells. cDNA libraries from 10 TNBC tumors and 10 normal breast tissues were subsequently sequenced on an ABI SOLiD3 sequencer using a 50bp fragment run. For gene expression, mapping of reads to the human genome was performed using the ABI Whole Transcriptome Pipeline and outputs were imported into Partek Genomics Suite for analysis. To analyze for gene fusions, reads were mapped to the genome using the SOLiD Analysis Pipeline Tool, followed by an alignment to Refseq to map reads crossing exon-exon junctions. A composite transcriptome was formed from areas of the genome with significant expression (17% of the genome sequence) and served as a concise search space for identifying fusions. Reads not mapping to the genome or to RefSeq (a rich source of fusion reads) were then mapped to the composite transcriptome using BLAT to facilitate a highly accurate split-read alignment. Using a custom developed pipeline, reads that spanned transcribed regions from two different chromosomes, or to loci farther than 200kb apart on the same chromosome, were considered as candidate fusions.
Results/Discussion: Sequencing of the 10 TNBC tumors and 10 normal samples produced 1.1 billion reads equaling 58.15GB of data. Mapping of the reads to the genome revealed 1.6 million transcribed regions (exons) of significant expression. A preliminary analysis of gene expression shows 55.2% of the transcribed loci to have significant differential expression between tumor and normal. Network-node, non-coding RNA, and statistical analyses are currently ongoing. In a further interim analysis, we bioinformatically identified several interchromosomal fusions that were present in a majority of the tumors but were absent in the normal samples. RT-PCR validation of these candidate fusions in a larger validation cohort of TNBC tumors and normal breast tissues is ongoing. A multitude of additional analyses including but not limited to: novel transcripts, alternative splicing, and presence of viral genes are also planned.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 101st Annual Meeting of the American Association for Cancer Research; 2010 Apr 17-21; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2010;70(8 Suppl):Abstract nr 2216.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Jie Sun
- 1Indiana Univ., Indianapolis, IN
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Radovich M, Clare S, Clare S, Pardo I, Hancock B, Sledge G, Rufenbarger C, Rufenbarger C, Storniolo A, Storniolo A, Mathieson T, Mathieson T, Sun J, Sun J, Henry J, Henry J, Hilligoss E, Elliott J, Richt R, Hickenbotham M, Glasscock J, Liu Y, Schneider B. Next-Generation Whole Transcriptome Sequencing of Triple-Negative Breast Tumors and Normal Tissues. Cancer Res 2009. [DOI: 10.1158/0008-5472.sabcs-09-6134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Triple-negative breast cancer predominately affects pre-menopausal women and women of African-American descent and has been plagued by the absence of targeted therapies leading to poor survival. Using a new cutting edge technology, next-generation sequencing, we embarked on a study to analyze the whole transcriptomes of triple-negative tumors and normal tissues from pre-menopausal women in order to comprehensively identify new targets by analyzing all full length transcripts expressed in these tissues. This approach is independent of pre-determined gene selection as is common with microarrays, and allows for the analysis of RNA species that have not been previously profiled in breast cancer.Methods: cDNA libraries were created from RNA isolated from 8 triple-negative tumors and 2 normal breast tissues. Triple negative tumors were procured from Origene Technologies and normal breast tissues were procured from the Susan G. Komen for the Cure tissue bank at Indiana University. Normal samples were from healthy pre-menopausal volunteers with no history of disease. In order to eliminate bias from stromal tissue, normal samples were laser capture microdissected for ductal cells and RNA extracted from the excised tissue. cDNA libraries were prepared and subsequently sequenced on an Applied Biosystems (ABI) SOLiD3 sequencer using a 50bp fragment run. Mapping of whole reads to the human genome was performed using the SOLiD Analysis Pipeline Tool software (ABI) followed by a split-read alignment in order to map reads crossing exon-exon junctions. Gene expression profiles for each sample were then created and statistically compared to identify the most differentially expressed genes. In order to analyze for fusion genes, a split-read alignment of non-mapping reads to a composite transcriptome formed from previously mapped reads (clusters) was performed.Results: Sequencing of the 10 samples produced 513 million filtered reads equaling 25.66GB of data. Mapping of the reads to the genome revealed 1.14 million transcribed regions (exons). A preliminary analysis of gene expression shows 55.2% of the transcribed loci to have significant differential expression between tumor and normal. In a further analysis for gene fusions, several candidate fusions were bioinformatically detected. These are currently being reviewed and validated.Discussion: Herein we present a preliminary analysis of the transcriptomes of triple-negative breast cancers in comparison to normal tissues. A multitude of analyses are ongoing, including but not limited to: gene fusions, differentially expressed novel genes, novel transcripts, alternative splicing, intrinsic subtyping, and presence of viral genes. In addition 2 more triple-negative tumors and 8 normal samples will also be sequenced. In the current analysis, differentially expressed non-coding RNAs was highly pervasive among the samples indicating a major role of this RNA species in tumorigenesis. In addition, triple-negative breast cancers may contain fusion genes that could be “drivers” of this malignancy. Further validation of these differentially expressed RNAs and fusion genes in a larger set of samples with subsequent functional studies is planned.
Citation Information: Cancer Res 2009;69(24 Suppl):Abstract nr 6134.
Collapse
Affiliation(s)
| | - S. Clare
- 1Indiana University School of Medicine, IN,
| | - S. Clare
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | - I. Pardo
- 1Indiana University School of Medicine, IN,
| | - B. Hancock
- 1Indiana University School of Medicine, IN,
| | - G. Sledge
- 1Indiana University School of Medicine, IN,
| | - C. Rufenbarger
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | | | | | - A. Storniolo
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | | | - T. Mathieson
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | - J. Sun
- 1Indiana University School of Medicine, IN,
| | - J. Sun
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | - J. Henry
- 1Indiana University School of Medicine, IN,
| | - J. Henry
- 2Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center, IN,
| | | | | | | | | | | | - Y. Liu
- 1Indiana University School of Medicine, IN,
| | | |
Collapse
|
9
|
Romanov MN, Tuttle EM, Houck ML, Modi WS, Chemnick LG, Korody ML, Mork EMS, Otten CA, Renner T, Jones KC, Dandekar S, Papp JC, Da Y, Green ED, Magrini V, Hickenbotham MT, Glasscock J, McGrath S, Mardis ER, Ryder OA. The value of avian genomics to the conservation of wildlife. BMC Genomics 2009; 10 Suppl 2:S10. [PMID: 19607652 PMCID: PMC2966331 DOI: 10.1186/1471-2164-10-s2-s10] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Background Genomic studies in non-domestic avian models, such as the California condor and white-throated sparrow, can lead to more comprehensive conservation plans and provide clues for understanding mechanisms affecting genetic variation, adaptation and evolution. Developing genomic tools and resources including genomic libraries and a genetic map of the California condor is a prerequisite for identification of candidate loci for a heritable embryonic lethal condition. The white-throated sparrow exhibits a stable genetic polymorphism (i.e. chromosomal rearrangements) associated with variation in morphology, physiology, and behavior (e.g., aggression, social behavior, sexual behavior, parental care). In this paper we outline the utility of these species as well as report on recent advances in the study of their genomes. Results Genotyping of the condor resource population at 17 microsatellite loci provided a better assessment of the current population's genetic variation. Specific New World vulture repeats were found in the condor genome. Using condor BAC library and clones, chicken-condor comparative maps were generated. A condor fibroblast cell line transcriptome was characterized using the 454 sequencing technology. Our karyotypic analyses of the sparrow in combination with other studies indicate that the rearrangements in both chromosomes 2m and 3a are complex and likely involve multiple inversions, interchromosomal linkage, and pleiotropy. At least a portion of the rearrangement in chromosome 2m existed in the common ancestor of the four North American species of Zonotrichia, but not in the one South American species, and that the 2m form, originally thought to be the derived condition, might actually be the ancestral one. Conclusion Mining and characterization of candidate loci in the California condor using molecular genetic and genomic techniques as well as linkage and comparative genomic mapping will eventually enable the identification of carriers of the chondrodystrophy allele, resulting in improved genetic management of this disease. In the white-throated sparrow, genomic studies, combined with ecological data, will help elucidate the basis of genic selection in a natural population. Morphs of the sparrow provide us with a unique opportunity to study intraspecific genomic differences, which have resulted from two separate yet linked evolutionary trajectories. Such results can transform our understanding of evolutionary and conservation biology.
Collapse
Affiliation(s)
- Michael N Romanov
- Genetics Division, San Diego Zoo's Institute for Conservation Research, Zoological Society of San Diego, Arnold and Mabel Beckman Center for Conservation Research, 15600 San Pasqual Valley Rd., Escondido, CA 92027, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Glasscock J, Richt R, Hickenbotham M. Flipping NextGen: using biological systems to characterize NextGen sequencing technologies. BMC Bioinformatics 2009. [PMCID: PMC3313264 DOI: 10.1186/1471-2105-10-s7-a18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
11
|
Zody MC, Jiang Z, Fung HC, Antonacci F, Hillier LW, Cardone MF, Graves TA, Kidd JM, Cheng Z, Abouelleil A, Chen L, Wallis J, Glasscock J, Wilson RK, Reily AD, Duckworth J, Ventura M, Hardy J, Warren WC, Eichler EE. Evolutionary toggling of the MAPT 17q21.31 inversion region. Nat Genet 2009; 40:1076-83. [PMID: 19165922 DOI: 10.1038/ng.193] [Citation(s) in RCA: 142] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Using comparative sequencing approaches, we investigated the evolutionary history of the European-enriched 17q21.31 MAPT inversion polymorphism. We present a detailed, BAC-based sequence assembly of the inverted human H2 haplotype and compare it to the sequence structure and genetic variation of the corresponding 1.5-Mb region for the noninverted H1 human haplotype and that of chimpanzee and orangutan. We found that inversion of the MAPT region is similarly polymorphic in other great ape species, and we present evidence that the inversions occurred independently in chimpanzees and humans. In humans, the inversion breakpoints correspond to core duplications with the LRRC37 gene family. Our analysis favors the H2 configuration and sequence haplotype as the likely great ape and human ancestral state, with inversion recurrences during primate evolution. We show that the H2 architecture has evolved more extensive sequence homology, perhaps explaining its tendency to undergo microdeletion associated with mental retardation in European populations.
Collapse
Affiliation(s)
- Michael C Zody
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Warren WC, Hillier LW, Marshall Graves JA, Birney E, Ponting CP, Grützner F, Belov K, Miller W, Clarke L, Chinwalla AT, Yang SP, Heger A, Locke DP, Miethke P, Waters PD, Veyrunes F, Fulton L, Fulton B, Graves T, Wallis J, Puente XS, López-Otín C, Ordóñez GR, Eichler EE, Chen L, Cheng Z, Deakin JE, Alsop A, Thompson K, Kirby P, Papenfuss AT, Wakefield MJ, Olender T, Lancet D, Huttley GA, Smit AFA, Pask A, Temple-Smith P, Batzer MA, Walker JA, Konkel MK, Harris RS, Whittington CM, Wong ESW, Gemmell NJ, Buschiazzo E, Vargas Jentzsch IM, Merkel A, Schmitz J, Zemann A, Churakov G, Kriegs JO, Brosius J, Murchison EP, Sachidanandam R, Smith C, Hannon GJ, Tsend-Ayush E, McMillan D, Attenborough R, Rens W, Ferguson-Smith M, Lefèvre CM, Sharp JA, Nicholas KR, Ray DA, Kube M, Reinhardt R, Pringle TH, Taylor J, Jones RC, Nixon B, Dacheux JL, Niwa H, Sekita Y, Huang X, Stark A, Kheradpour P, Kellis M, Flicek P, Chen Y, Webber C, Hardison R, Nelson J, Hallsworth-Pepin K, Delehaunty K, Markovic C, Minx P, Feng Y, Kremitzki C, Mitreva M, Glasscock J, Wylie T, Wohldmann P, Thiru P, Nhan MN, Pohl CS, Smith SM, Hou S, Nefedov M, de Jong PJ, Renfree MB, Mardis ER, Wilson RK. Genome analysis of the platypus reveals unique signatures of evolution. Nature 2008; 453:175-83. [PMID: 18464734 PMCID: PMC2803040 DOI: 10.1038/nature06936] [Citation(s) in RCA: 475] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2007] [Accepted: 03/25/2008] [Indexed: 12/18/2022]
Abstract
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.
Collapse
Affiliation(s)
- Wesley C Warren
- Genome Sequencing Center, Washington University School of Medicine, Campus Box 8501, 4444 Forest Park Avenue, St Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Schmidt CJ, Romanov M, Ryder O, Magrini V, Hickenbotham M, Glasscock J, McGrath S, Mardis E, Stein LD. Gallus GBrowse: a unified genomic database for the chicken. Nucleic Acids Res 2007; 36:D719-23. [PMID: 17933775 PMCID: PMC2238981 DOI: 10.1093/nar/gkm783] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Gallus GBrowse (http://birdbase.net/cgi-bin/gbrowse/gallus/) provides online access to genomic and other information about the chicken, Gallus gallus. The information provided by this resource includes predicted genes and Gene Ontology (GO) terms, links to Gallus In Situ Hybridization Analysis (GEISHA), Unigene and Reactome, the genomic positions of chicken genetic markers, SNPs and microarray probes, and mappings from turkey, condor and zebra finch DNA and EST sequences to the chicken genome. We also provide a BLAT server (http://birdbase.net/cgi-bin/webBlat) for matching user-provided sequences to the chicken genome. These tools make the Gallus GBrowse server a valuable resource for researchers seeking genomic information regarding the chicken and other avian species.
Collapse
Affiliation(s)
- Carl J Schmidt
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19706, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, Venter JC, Wilson RK, Batzer MA, Bustamante CD, Eichler EE, Hahn MW, Hardison RC, Makova KD, Miller W, Milosavljevic A, Palermo RE, Siepel A, Sikela JM, Attaway T, Bell S, Bernard KE, Buhay CJ, Chandrabose MN, Dao M, Davis C, Delehaunty KD, Ding Y, Dinh HH, Dugan-Rocha S, Fulton LA, Gabisi RA, Garner TT, Godfrey J, Hawes AC, Hernandez J, Hines S, Holder M, Hume J, Jhangiani SN, Joshi V, Khan ZM, Kirkness EF, Cree A, Fowler RG, Lee S, Lewis LR, Li Z, Liu YS, Moore SM, Muzny D, Nazareth LV, Ngo DN, Okwuonu GO, Pai G, Parker D, Paul HA, Pfannkoch C, Pohl CS, Rogers YH, Ruiz SJ, Sabo A, Santibanez J, Schneider BW, Smith SM, Sodergren E, Svatek AF, Utterback TR, Vattathil S, Warren W, White CS, Chinwalla AT, Feng Y, Halpern AL, Hillier LW, Huang X, Minx P, Nelson JO, Pepin KH, Qin X, Sutton GG, Venter E, Walenz BP, Wallis JW, Worley KC, Yang SP, Jones SM, Marra MA, Rocchi M, Schein JE, Baertsch R, Clarke L, Csürös M, Glasscock J, Harris RA, Havlak P, Jackson AR, Jiang H, Liu Y, Messina DN, Shen Y, Song HXZ, Wylie T, Zhang L, Birney E, Han K, Konkel MK, Lee J, Smit AFA, Ullmer B, Wang H, Xing J, Burhans R, Cheng Z, Karro JE, Ma J, Raney B, She X, Cox MJ, Demuth JP, Dumas LJ, Han SG, Hopkins J, Karimpour-Fard A, Kim YH, Pollack JR, Vinar T, Addo-Quaye C, Degenhardt J, Denby A, Hubisz MJ, Indap A, Kosiol C, Lahn BT, Lawson HA, Marklein A, Nielsen R, Vallender EJ, Clark AG, Ferguson B, Hernandez RD, Hirani K, Kehrer-Sawatzki H, Kolb J, Patil S, Pu LL, Ren Y, Smith DG, Wheeler DA, Schenck I, Ball EV, Chen R, Cooper DN, Giardine B, Hsu F, Kent WJ, Lesk A, Nelson DL, O'brien WE, Prüfer K, Stenson PD, Wallace JC, Ke H, Liu XM, Wang P, Xiang AP, Yang F, Barber GP, Haussler D, Karolchik D, Kern AD, Kuhn RM, Smith KE, Zwieg AS. Evolutionary and biomedical insights from the rhesus macaque genome. Science 2007; 316:222-34. [PMID: 17431167 DOI: 10.1126/science.1139247] [Citation(s) in RCA: 989] [Impact Index Per Article: 58.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The rhesus macaque (Macaca mulatta) is an abundant primate species that diverged from the ancestors of Homo sapiens about 25 million years ago. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used nonhuman primate in basic and applied biomedical research. We determined the genome sequence of an Indian-origin Macaca mulatta female and compared the data with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families. A comparison of sequences from individual animals was used to investigate their underlying genetic diversity. The complete description of the macaque genome blueprint enhances the utility of this animal model for biomedical research and improves our understanding of the basic biology of the species.
Collapse
|
15
|
Messina DN, Glasscock J, Gish W, Lovett M. An ORFeome-based analysis of human transcription factor genes and the construction of a microarray to interrogate their expression. Genome Res 2004; 14:2041-7. [PMID: 15489324 PMCID: PMC528918 DOI: 10.1101/gr.2584104] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Transcription factors (TFs) are essential regulators of gene expression, and mutated TF genes have been shown to cause numerous human genetic diseases. Yet to date, no single, comprehensive database of human TFs exists. In this work, we describe the collection of an essentially complete set of TF genes from one depiction of the human ORFeome, and the design of a microarray to interrogate their expression. Taking 1468 known TFs from TRANSFAC, InterPro, and FlyBase, we used this seed set to search the ScriptSure human transcriptome database for additional genes. ScriptSure's genome-anchored transcript clusters allowed us to work with a nonredundant high-quality representation of the human transcriptome. We used a high-stringency similarity search by using BLASTN, and a protein motif search of the human ORFeome by using hidden Markov models of DNA-binding domains known to occur exclusively or primarily in TFs. Four hundred ninety-four additional TF genes were identified in the overlap between the two searches, bringing our estimate of the total number of human TFs to 1962. Zinc finger genes are by far the most abundant family (762 members), followed by homeobox (199 members) and basic helix-loop-helix genes (117 members). We designed a microarray of 50-mer oligonucleotide probes targeted to a unique region of the coding sequence of each gene. We have successfully used this microarray to interrogate TF gene expression in species as diverse as chickens and mice, as well as in humans.
Collapse
Affiliation(s)
- David N Messina
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | | | | | | |
Collapse
|
16
|
Kan Z, Gish W, Rouchka E, Glasscock J, States D. UTR reconstruction and analysis using genomically aligned EST sequences. Proc Int Conf Intell Syst Mol Biol 2001; 8:218-27. [PMID: 10977083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Untranslated regions (UTR) play important roles in the posttranscriptional regulation of mRNA processing. There is a wealth of UTR-related information to be mined from the rapidly accumulating EST collections. A computational tool, UTR-extender, has been developed to infer UTR sequences from genomically aligned ESTs. It can completely and accurately reconstruct 72% of the 3' UTRs and 15% of the 5' UTRs when tested using 908 functionally cloned transcripts. In addition, it predicts extensions for 11% of the 5' UTRs and 28% of the 3' UTRs. These extension regions are validated by examining splicing frequencies and conservation levels. We also developed a method called polyadenylation site scan (PASS) to precisely map polyadenylation sites in human genomic sequences. A PASS analysis of 908 genic regions estimates that 40-50% of human genes undergo alternative polyadenylation. Using EST redundancy to assess expression levels, we also find that genes with short 3' UTRs tend to be highly expressed.
Collapse
Affiliation(s)
- Z Kan
- Institute for Biomedical Computing, Washington University, St. Louis, MO 63110, USA.
| | | | | | | | | |
Collapse
|
17
|
|