Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Church DM, DiCuccio M, Edgar R, Federhen S, Helmberg W, Kenton DL, Khovayko O, Lipman DJ, Madden TL, Maglott DR, Ostell J, Pontius JU, Pruitt KD, Schuler GD, Schriml LM, Sequeira E, Sherry ST, Sirotkin K, Starchenko G, Suzek TO, Tatusov R, Tatusova TA, Wagner L, Yaschenko E. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2005;33:D39-45. [PMID: 15608222 PMCID: PMC540016 DOI: 10.1093/nar/gki062] [Citation(s) in RCA: 338] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

For:	Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Church DM, DiCuccio M, Edgar R, Federhen S, Helmberg W, Kenton DL, Khovayko O, Lipman DJ, Madden TL, Maglott DR, Ostell J, Pontius JU, Pruitt KD, Schuler GD, Schriml LM, Sequeira E, Sherry ST, Sirotkin K, Starchenko G, Suzek TO, Tatusov R, Tatusova TA, Wagner L, Yaschenko E. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2005;33:D39-45. [PMID: 15608222 PMCID: PMC540016 DOI: 10.1093/nar/gki062] [Citation(s) in RCA: 338] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

101

Winsor GL, Van Rossum T, Lo R, Khaira B, Whiteside MD, Hancock REW, Brinkman FSL. Pseudomonas Genome Database: facilitating user-friendly, comprehensive comparisons of microbial genomes. Nucleic Acids Res 2008;37:D483-8. [PMID: 18978025 PMCID: PMC2686508 DOI: 10.1093/nar/gkn861] [Citation(s) in RCA: 193] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

102

Czerwoniec A, Dunin-Horkawicz S, Purta E, Kaminska KH, Kasprzak JM, Bujnicki JM, Grosjean H, Rother K. MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res 2008;37:D118-21. [PMID: 18854352 PMCID: PMC2686465 DOI: 10.1093/nar/gkn710] [Citation(s) in RCA: 175] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Affiliation(s)

Anna Czerwoniec Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Stanislaw Dunin-Horkawicz Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Elzbieta Purta Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Katarzyna H. Kaminska Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Joanna M. Kasprzak Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Janusz M. Bujnicki Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Henri Grosjean Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
Kristian Rother Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France *To whom correspondence should be addressed. Tel: +48-22 597 0752; Fax: +48 22 597 0715;

Collapse

103

Vernot B, Stolzer M, Goldman A, Durand D. Reconciliation with non-binary species trees. J Comput Biol 2008;15:981-1006. [PMID: 18808330 PMCID: PMC3205801 DOI: 10.1089/cmb.2008.0092] [Citation(s) in RCA: 120] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Abstract

Reconciliation extracts information from the topological incongruence between gene and species trees to infer duplications and losses in the history of a gene family. The inferred duplication-loss histories provide valuable information for a broad range of biological applications, including ortholog identification, estimating gene duplication times, and rooting and correcting gene trees. While reconciliation for binary trees is a tractable and well studied problem, there are no algorithms for reconciliation with non-binary species trees. Yet a striking proportion of species trees are non-binary. For example, 64% of branch points in the NCBI taxonomy have three or more children. When applied to non-binary species trees, current algorithms overestimate the number of duplications because they cannot distinguish between duplication and incomplete lineage sorting. We present the first algorithms for reconciling binary gene trees with non-binary species trees under a duplication-loss parsimony model. Our algorithms utilize an efficient mapping from gene to species trees to infer the minimum number of duplications in O(|V(G) | x (k(S) + h(S))) time, where |V(G)| is the number of nodes in the gene tree, h(S) is the height of the species tree and k(S) is the size of its largest polytomy. We present a dynamic programming algorithm which also minimizes the total number of losses. Although this algorithm is exponential in the size of the largest polytomy, it performs well in practice for polytomies with outdegree of 12 or less. We also present a heuristic which estimates the minimal number of losses in polynomial time. In empirical tests, this algorithm finds an optimal loss history 99% of the time. Our algorithms have been implemented in NOTUNG, a robust, production quality, tree-fitting program, which provides a graphical user interface for exploratory analysis and also supports automated, high-throughput analysis of large data sets.

Collapse

104

Müller H, Mancuso F. Identification and analysis of co-occurrence networks with NetCutter. PLoS One 2008;3:e3178. [PMID: 18781200 PMCID: PMC2526157 DOI: 10.1371/journal.pone.0003178] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2008] [Accepted: 08/07/2008] [Indexed: 01/25/2023] Open

Abstract

Background

Co-occurrence analysis is a technique often applied in text mining, comparative genomics, and promoter analysis. The methodologies and statistical models used to evaluate the significance of association between co-occurring entities are quite diverse, however.

Methodology/Principal Findings

We present a general framework for co-occurrence analysis based on a bipartite graph representation of the data, a novel co-occurrence statistic, and software performing co-occurrence analysis as well as generation and analysis of co-occurrence networks. We show that the overall stringency of co-occurrence analysis depends critically on the choice of the null-model used to evaluate the significance of co-occurrence and find that random sampling from a complete permutation set of the bipartite graph permits co-occurrence analysis with optimal stringency. We show that the Poisson-binomial distribution is the most natural co-occurrence probability distribution when vertex degrees of the bipartite graph are variable, which is usually the case. Calculation of Poisson-binomial P-values is difficult, however. Therefore, we propose a fast bi-binomial approximation for calculation of P-values and show that this statistic is superior to other measures of association such as the Jaccard coefficient and the uncertainty coefficient. Furthermore, co-occurrence analysis of more than two entities can be performed using the same statistical model, which leads to increased signal-to-noise ratios, robustness towards noise, and the identification of implicit relationships between co-occurring entities. Using NetCutter, we identify a novel protein biosynthesis related set of genes that are frequently coordinately deregulated in human cancer related gene expression studies. NetCutter is available at http://bio.ifom-ieo-campus.it/NetCutter/).

Conclusion

Our approach can be applied to any set of categorical data where co-occurrence analysis might reveal functional relationships such as clinical parameters associated with cancer subtypes or SNPs associated with disease phenotypes. The stringency of our approach is expected to offer an advantage in a variety of applications.

Collapse

105

Chen H, Kihara D. Estimating quality of template-based protein models by alignment stability. Proteins 2008;71:1255-74. [PMID: 18041762 DOI: 10.1002/prot.21819] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Abstract

The error in protein tertiary structure prediction is unavoidable, but it is not explicitly shown in most of the current prediction algorithms. Estimated error of a predicted structure is crucial information for experimental biologists to use the prediction model for design and interpretation of experiments. Here, we propose a method to estimate errors in predicted structures based on the stability of the optimal target-template alignment when compared with a set of suboptimal alignments. The stability of the optimal alignment is quantified by an index named the SuboPtimal Alignment Diversity (SPAD). We implemented SPAD in a profile-based threading algorithm and investigated how well SPAD can indicate errors in threading models using a large benchmark dataset of 5232 alignments. SPAD shows a very good correlation not only to alignment shift errors but also structure-level errors, the root mean square deviation (RMSD) of predicted structure models to the native structures (i.e. global errors), and local errors at each residue position. We have further compared SPAD with seven other quality measures, six from sequence alignment-based measures and one atomic statistical potential, discrete optimized protein energy (DOPE), in terms of the correlation coefficient to the global and local structure-level errors. In terms of the correlation to the RMSD of structure models, when a target and a template are in the same SCOP family, the sequence identity showed a best correlation to the RMSD; in the superfamily level, SPAD was the best; and in the fold level, DOPE was best. However, in a head-to-head comparison, SPAD wins over the other measures. Next, SPAD is compared with three other measures of local errors. In this comparison, SPAD was best in all of the family, the superfamily and the fold levels. Using the discovered correlation, we have also predicted the global and local error of our predicted structures of CASP7 targets by the SPAD. Finally, we proposed a sausage representation of predicted tertiary structures which intuitively indicate the predicted structure and the estimated error range of the structure simultaneously.

Collapse

106

Khan AM, Miotto O, Nascimento EJM, Srinivasan KN, Heiny AT, Zhang GL, Marques ET, Tan TW, Brusic V, Salmon J, August JT. Conservation and variability of dengue virus proteins: implications for vaccine design. PLoS Negl Trop Dis 2008;2:e272. [PMID: 18698358 PMCID: PMC2491585 DOI: 10.1371/journal.pntd.0000272] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2008] [Accepted: 07/10/2008] [Indexed: 12/27/2022] Open

Abstract

Background

Genetic variation and rapid evolution are hallmarks of RNA viruses, the result of high mutation rates in RNA replication and selection of mutants that enhance viral adaptation, including the escape from host immune responses. Variability is uneven across the genome because mutations resulting in a deleterious effect on viral fitness are restricted. RNA viruses are thus marked by protein sites permissive to multiple mutations and sites critical to viral structure-function that are evolutionarily robust and highly conserved. Identification and characterization of the historical dynamics of the conserved sites have relevance to multiple applications, including potential targets for diagnosis, and prophylactic and therapeutic purposes.

Methodology/Principal Findings

We describe a large-scale identification and analysis of evolutionarily highly conserved amino acid sequences of the entire dengue virus (DENV) proteome, with a focus on sequences of 9 amino acids or more, and thus immune-relevant as potential T-cell determinants. DENV protein sequence data were collected from the NCBI Entrez protein database in 2005 (9,512 sequences) and again in 2007 (12,404 sequences). Forty-four (44) sequences (pan-DENV sequences), mainly those of nonstructural proteins and representing ∼15% of the DENV polyprotein length, were identical in 80% or more of all recorded DENV sequences. Of these 44 sequences, 34 (∼77%) were present in ≥95% of sequences of each DENV type, and 27 (∼61%) were conserved in other Flaviviruses. The frequencies of variants of the pan-DENV sequences were low (0 to ∼5%), as compared to variant frequencies of ∼60 to ∼85% in the non pan-DENV sequence regions. We further showed that the majority of the conserved sequences were immunologically relevant: 34 contained numerous predicted human leukocyte antigen (HLA) supertype-restricted peptide sequences, and 26 contained T-cell determinants identified by studies with HLA-transgenic mice and/or reported to be immunogenic in humans.

Conclusions/Significance

Forty-four (44) pan-DENV sequences of at least 9 amino acids were highly conserved and identical in 80% or more of all recorded DENV sequences, and the majority were found to be immune-relevant by their correspondence to known or putative HLA-restricted T-cell determinants. The conservation of these sequences through the entire recorded DENV genetic history supports their possible value for diagnosis, prophylactic and/or therapeutic applications. The combination of bioinformatics and experimental approaches applied herein provides a framework for large-scale and systematic analysis of conserved and variable sequences of other pathogens, in particular, for rapidly mutating viruses, such as influenza A virus and HIV.

Dengue viruses (DENVs) circulate in nature as a population of 4 distinct types, each with multiple genotypes and variants, and represent an increasing global public health issue with no prophylactic and therapeutic formulations currently available. Viral genomes contain sites that are evolutionarily stable and therefore highly conserved, presumably because changes in these sites have deleterious effects on viral fitness and survival. The identification and characterization of the historical dynamics of these sites in DENV have relevance to several applications such as diagnosis and drug and vaccine development. In this study, we have identified sequence fragments that were conserved across the majority of available DENV sequences, analyzed their historical dynamics, and evaluated their relevance as candidate vaccine targets, using various bioinformatics-based methods and immune assay in human leukocyte antigen (HLA) transgenic mice. This approach provides a framework for large-scale and systematic analysis of other human pathogens.

Collapse

Affiliation(s)

Asif M. Khan Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
Olivo Miotto Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore Institute of Systems Science, National University of Singapore, Singapore
Eduardo J. M. Nascimento Department of Medicine, Division of Infectious Diseases, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
K. N. Srinivasan Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America Product Evaluation and Registration Division, Centre for Drug Administration, Health Sciences Authority, Singapore
A. T. Heiny Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
Guang Lan Zhang Cancer Vaccine Center, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
E. T. Marques Department of Medicine, Division of Infectious Diseases, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
Tin Wee Tan Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
Vladimir Brusic Cancer Vaccine Center, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
Jerome Salmon Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
J. Thomas August Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America * E-mail:

Collapse

107

Bourgogne A, Garsin DA, Qin X, Singh KV, Sillanpaa J, Yerrapragada S, Ding Y, Dugan-Rocha S, Buhay C, Shen H, Chen G, Williams G, Muzny D, Maadani A, Fox KA, Gioia J, Chen L, Shang Y, Arias CA, Nallapareddy SR, Zhao M, Prakash VP, Chowdhury S, Jiang H, Gibbs RA, Murray BE, Highlander SK, Weinstock GM. Large scale variation in Enterococcus faecalis illustrated by the genome analysis of strain OG1RF. Genome Biol 2008;9:R110. [PMID: 18611278 PMCID: PMC2530867 DOI: 10.1186/gb-2008-9-7-r110] [Citation(s) in RCA: 217] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2008] [Revised: 05/08/2008] [Accepted: 07/08/2008] [Indexed: 11/18/2022] Open

108

Budowle B, Aranda XG, Lagace RE, Hennessy LK, Planz JV, Rodriguez M, Eisenberg AJ. Null allele sequence structure at the DYS448 locus and implications for profile interpretation. Int J Legal Med 2008;122:421-7. [DOI: 10.1007/s00414-008-0258-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2008] [Accepted: 05/27/2008] [Indexed: 11/24/2022]

109

Jelinsky SA, Choe SE, Crabtree JS, Cotreau MM, Wilson E, Saraf K, Dorner AJ, Brown EL, Peano BJ, Zhang X, Winneker RC, Harris HA. Molecular analysis of the vaginal response to estrogens in the ovariectomized rat and postmenopausal woman. BMC Med Genomics 2008;1:27. [PMID: 18578861 PMCID: PMC2453134 DOI: 10.1186/1755-8794-1-27] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2008] [Accepted: 06/25/2008] [Indexed: 11/21/2022] Open

Abstract

Background

Vaginal atrophy (VA) is the thinning of the vaginal epithelial lining, typically the result of lowered estrogen levels during menopause. Some of the consequences of VA include increased susceptibility to bacterial infection, pain during sexual intercourse, and vaginal burning or itching. Although estrogen treatment is highly effective, alternative therapies are also desired for women who are not candidates for post-menopausal hormone therapy (HT). The ovariectomized (OVX) rat is widely accepted as an appropriate animal model for many estrogen-dependent responses in humans; however, since reproductive biology can vary significantly between mammalian systems, this study examined how well the OVX rat recapitulates human biology.

Methods

We analyzed 19 vaginal biopsies from human subjects pre and post 3-month 17β-estradiol treated by expression profiling. Data were compared to transcriptional profiling generated from vaginal samples obtained from ovariectomized rats treated with 17β-estradiol for 6 hrs, 3 days or 5 days. The level of differential expression between pre- vs. post- estrogen treatment was calculated for each of the human and OVX rat datasets. Probe sets corresponding to orthologous rat and human genes were mapped to each other using NCBI Homologene.

Results

A positive correlation was observed between the rat and human responses to estrogen. Genes belonging to several biological pathways and GO categories were similarly differentially expressed in rat and human. A large number of the coordinately regulated biological processes are already known to be involved in human VA, such as inflammation, epithelial development, and EGF pathway activation.

Conclusion

At the transcriptional level, there is evidence of significant overlap of the effects of estrogen treatment between the OVX rat and human VA samples.

Collapse

110

Moon S, Cho S, Kim H. Organization and evolution of mitochondrial gene clusters in human. Genomics 2008;92:85-93. [PMID: 18559289 DOI: 10.1016/j.ygeno.2008.01.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2007] [Revised: 01/07/2008] [Accepted: 01/08/2008] [Indexed: 11/29/2022]

111

WANG SHU, HU ROUHMEI, HSIAO HANCW, HECHT DAVIDA, NG KALOK, CHEN RONGMING, SHEU PHILLIPCY, TSAI JEFFREYJP. USING SCDL FOR INTEGRATING TOOLS AND DATA FOR COMPLEX BIOMEDICAL APPLICATIONS. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING 2008. [DOI: 10.1142/s1793351x08000476] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

112

Cheng D, Knox C, Young N, Stothard P, Damaraju S, Wishart DS. PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites. Nucleic Acids Res 2008;36:W399-405. [PMID: 18487273 PMCID: PMC2447794 DOI: 10.1093/nar/gkn296] [Citation(s) in RCA: 155] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

113

Blenkiron C, Goldstein LD, Thorne NP, Spiteri I, Chin SF, Dunning MJ, Barbosa-Morais NL, Teschendorff AE, Green AR, Ellis IO, Tavaré S, Caldas C, Miska EA. MicroRNA expression profiling of human breast cancer identifies new markers of tumor subtype. Genome Biol 2008;8:R214. [PMID: 17922911 PMCID: PMC2246288 DOI: 10.1186/gb-2007-8-10-r214] [Citation(s) in RCA: 720] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2007] [Revised: 08/22/2007] [Accepted: 10/08/2007] [Indexed: 12/19/2022] Open

114

Navratil V, Penel S, Delmotte S, Mouchiroud D, Gautier C, Aouacheria A. DigiPINS: A database for vertebrate exonic single nucleotide polymorphisms and its application to cancer association studies. Biochimie 2008;90:563-9. [DOI: 10.1016/j.biochi.2007.09.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2007] [Accepted: 09/21/2007] [Indexed: 11/28/2022]

115

Taswell C. DOORS to the Semantic Web and Grid With a PORTAL for Biomedical Computing. ACTA ACUST UNITED AC 2008;12:191-204. [DOI: 10.1109/titb.2007.905861] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

116

Merging microarray data from separate breast cancer studies provides a robust prognostic test. BMC Bioinformatics 2008;9:125. [PMID: 18304324 PMCID: PMC2409450 DOI: 10.1186/1471-2105-9-125] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 02/27/2008] [Indexed: 11/15/2022] Open

Abstract

Background

There is an urgent need for new prognostic markers of breast cancer metastases to ensure that newly diagnosed patients receive appropriate therapy. Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of developing distant metastases. However, due to the small sample sizes of individual studies, the overlap among signatures is almost zero and their predictive power is often limited. Integrating microarray data from multiple studies in order to increase sample size is therefore a promising approach to the development of more robust prognostic tests.

Results

In this study, by using a highly stable data aggregation procedure based on expression comparisons, we have integrated three independent microarray gene expression data sets for breast cancer and identified a structured prognostic signature consisting of 112 genes organized into 80 pair-wise expression comparisons. A classical likelihood ratio test based on these comparisons, essentially weighted voting, achieves 88.6% sensitivity and 54.6% specificity in an independent external test set of 154 samples. The test is highly informative in assessing the risk of developing distant metastases within five years (hazard ratio 9.3 with 95% CI 2.9–29.9).

Conclusion

Rank-based features provide a stable way to integrate patient data from separate microarray studies due to invariance to data normalization, and such features can be combined into a useful predictor of distant metastases in breast cancer within a statistical modeling framework which begins to capture gene-gene interactions. Upon further confirmation on large-scale independent data, such prognostic signatures and tests could provide a powerful tool to guide adjuvant systemic treatment that could greatly reduce the cost of breast cancer treatment, both in terms of toxic side effects and health care expenditures.

Collapse

117

Pontius JU, Mullikin JC, Smith DR, Lindblad-Toh K, Gnerre S, Clamp M, Chang J, Stephens R, Neelam B, Volfovsky N, Schäffer AA, Agarwala R, Narfström K, Murphy WJ, Giger U, Roca AL, Antunes A, Menotti-Raymond M, Yuhki N, Pecon-Slattery J, Johnson WE, Bourque G, Tesler G, O'Brien SJ. Initial sequence and comparative analysis of the cat genome. Genome Res 2008;17:1675-89. [PMID: 17975172 DOI: 10.1101/gr.6380007] [Citation(s) in RCA: 251] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

118

Kosinski J, Kubareva E, Bujnicki JM. A model of restriction endonuclease MvaI in complex with DNA: a template for interpretation of experimental data and a guide for specificity engineering. Proteins 2007;68:324-36. [PMID: 17407166 DOI: 10.1002/prot.21460] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

119

Kim E, Goren A, Ast G. Insights into the connection between cancer and alternative splicing. Trends Genet 2007;24:7-10. [PMID: 18054115 DOI: 10.1016/j.tig.2007.10.001] [Citation(s) in RCA: 131] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2007] [Revised: 10/21/2007] [Accepted: 10/22/2007] [Indexed: 01/14/2023]

120

Frenz CM. Deafness mutation mining using regular expression based pattern matching. BMC Med Inform Decis Mak 2007;7:32. [PMID: 17961241 PMCID: PMC2180167 DOI: 10.1186/1472-6947-7-32] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 10/25/2007] [Indexed: 11/16/2022] Open

121

Rava P, Hussain MM. Acquisition of triacylglycerol transfer activity by microsomal triglyceride transfer protein during evolution. Biochemistry 2007;46:12263-74. [PMID: 17924655 DOI: 10.1021/bi700762z] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

122

Rahman FA, Ainscough JFX, Copeland N, Coverley D. Cancer-associated missplicing of exon 4 influences the subnuclear distribution of the DNA replication factor CIZ1. Hum Mutat 2007;28:993-1004. [PMID: 17508423 DOI: 10.1002/humu.20550] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

123

Gioia J, Yerrapragada S, Qin X, Jiang H, Igboeli OC, Muzny D, Dugan-Rocha S, Ding Y, Hawes A, Liu W, Perez L, Kovar C, Dinh H, Lee S, Nazareth L, Blyth P, Holder M, Buhay C, Tirumalai MR, Liu Y, Dasgupta I, Bokhetache L, Fujita M, Karouia F, Eswara Moorthy P, Siefert J, Uzman A, Buzumbo P, Verma A, Zwiya H, McWilliams BD, Olowu A, Clinkenbeard KD, Newcombe D, Golebiewski L, Petrosino JF, Nicholson WL, Fox GE, Venkateswaran K, Highlander SK, Weinstock GM. Paradoxical DNA repair and peroxide resistance gene conservation in Bacillus pumilus SAFR-032. PLoS One 2007;2:e928. [PMID: 17895969 PMCID: PMC1976550 DOI: 10.1371/journal.pone.0000928] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2007] [Accepted: 08/31/2007] [Indexed: 11/25/2022] Open

Affiliation(s)

Jason Gioia Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Shailaja Yerrapragada Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Xiang Qin Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Huaiyang Jiang Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Okezie C. Igboeli Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Donna Muzny Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Shannon Dugan-Rocha Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Yan Ding Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Alicia Hawes Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Wen Liu Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Lesette Perez Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Christie Kovar Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Huyen Dinh Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Sandra Lee Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Lynne Nazareth Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Peter Blyth Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Michael Holder Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Christian Buhay Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
Madhan R. Tirumalai Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Yamei Liu Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Indrani Dasgupta Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Lina Bokhetache Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Masaya Fujita Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Fathi Karouia Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Prahathees Eswara Moorthy Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Johnathan Siefert Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Akif Uzman Department of Natural Sciences, University of Houston‐Downtown, Houston, Texas, United States of America
Prince Buzumbo Department of Natural Sciences, University of Houston‐Downtown, Houston, Texas, United States of America
Avani Verma Department of Natural Sciences, University of Houston‐Downtown, Houston, Texas, United States of America
Hiba Zwiya Department of Natural Sciences, University of Houston‐Downtown, Houston, Texas, United States of America
Brian D. McWilliams Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, United States of America
Adeola Olowu University of St. Thomas, Houston Texas, United States of America
Kenneth D. Clinkenbeard Department of Veterinary Pathobiology, Center for Veterinary Health Sciences, Oklahoma State University, Stillwater, Oklahoma, United States of America
David Newcombe University of Idaho Coeur d'Alene, Coeur d'Alene, Idaho, United States of America Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, United States of America
Lisa Golebiewski Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, United States of America
Joseph F. Petrosino Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, United States of America
Wayne L. Nicholson Department of Microbiology and Cell Science, University of Florida Space Life Sciences Laboratory, Kennedy Space Center, Florida, United States of America
George E. Fox Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
Kasthuri Venkateswaran Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, United States of America
Sarah K. Highlander Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, United States of America
George M. Weinstock Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, United States of America * To whom correspondence should be addressed. E-mail:

Collapse

124

Zhang Z, Chen D, Fenstermacher DA. Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome. BMC Genomics 2007;8:331. [PMID: 17883867 PMCID: PMC2064937 DOI: 10.1186/1471-2164-8-331] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2006] [Accepted: 09/20/2007] [Indexed: 11/10/2022] Open

125

Li H, Guan L, Liu T, Guo Y, Zheng WM, Wong GKS, Wang J. A cross-species alignment tool (CAT). BMC Bioinformatics 2007;8:349. [PMID: 17880681 PMCID: PMC2082505 DOI: 10.1186/1471-2105-8-349] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2007] [Accepted: 09/19/2007] [Indexed: 01/06/2023] Open

Affiliation(s)

Heng Li Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100080, China Graduate University of the Chinese Academy of Sciences, Yuquan Road 19A, Beijing 100039, China
Liang Guan Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China Graduate University of the Chinese Academy of Sciences, Yuquan Road 19A, Beijing 100039, China Institute of Computing Technology, Chinese Academy of Science, Beijing 100080, China
Tao Liu Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China
Yiran Guo Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China Graduate University of the Chinese Academy of Sciences, Yuquan Road 19A, Beijing 100039, China
Wei-Mou Zheng Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100080, China
Gane Ka-Shu Wong Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China UW Genome Center, Department of Medicine, University of Washington, Seattle, WA 98195, USA
Jun Wang Beijing Institute of Genomics of Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China James D. Watson Institute of Genome Sciences of Zhejiang University, Hangzhou 310008, China The Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230, Odense M, Denmark

Collapse

126

Vider-Shalit T, Fishbain V, Raffaeli S, Louzoun Y. Phase-dependent immune evasion of herpesviruses. J Virol 2007;81:9536-45. [PMID: 17609281 PMCID: PMC1951411 DOI: 10.1128/jvi.02636-06] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2006] [Accepted: 06/22/2007] [Indexed: 12/14/2022] Open

Abstract

Viruses employ various modes to evade immune detection. Two possible evasion modes are a reduction of the number of epitopes presented and the mimicry of host epitopes. The immune evasion efforts are not uniform among viral proteins. The number of epitopes in a given viral protein and the similarity of the epitopes to host peptides can be used as a measure of the viral attempts to hide this protein. Using bioinformatics tools, we here present a genomic analysis of the attempts of four human herpesviruses (herpes simplex virus type 1-human herpesvirus 1, Epstein-Barr virus-human herpesvirus 4, human cytomegalovirus-human herpesvirus 5, and Kaposi's sarcoma-associated herpesvirus-human herpesvirus 8) and one murine herpesvirus (murine herpesvirus 68) to escape from immune detection. We determined the full repertoire of CD8 T-lymphocyte epitopes presented by each viral protein and show that herpesvirus proteins present many fewer epitopes than expected. Furthermore, the epitopes that are presented are more similar to host epitopes than are random viral epitopes, minimizing the immune response. We defined a score for the size of the immune repertoire (the SIR score) based on the number of epitopes in a protein. The numbers of epitopes in proteins expressed in the latent and early phases of infection were significantly smaller than those in proteins expressed in the lytic phase in all tested viruses. The latent and immediate-early epitopes were also more similar to host epitopes than were lytic epitopes. A clear trend emerged from the analysis. In general, herpesviruses demonstrated an effort to evade immune detection. However, within a given herpesvirus, proteins expressed in phases critical to the fate of infection (e.g., early lytic and latent) evaded immune detection more than all others. The application of the SIR score to specific proteins allows us to quantify the importance of immune evasion and to detect optimal targets for immunotherapy and vaccine development.

Collapse

127

Ingsriswang S, Pacharawongsakda E. sMOL Explorer: an open source, web-enabled database and exploration tool for Small MOLecules datasets. Bioinformatics 2007;23:2498-500. [PMID: 17660205 DOI: 10.1093/bioinformatics/btm363] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

128

A survey of orphan enzyme activities. BMC Bioinformatics 2007;8:244. [PMID: 17623104 PMCID: PMC1940265 DOI: 10.1186/1471-2105-8-244] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Accepted: 07/10/2007] [Indexed: 11/10/2022] Open

129

Davies L, Anderson IP, Turner PC, Shirras AD, Rees HH, Rigden DJ. An unsuspected ecdysteroid/steroid phosphatase activity in the key T-cell regulator, Sts-1: surprising relationship to insect ecdysteroid phosphate phosphatase. Proteins 2007;67:720-31. [PMID: 17348005 DOI: 10.1002/prot.21357] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

130

Cahan P, Rovegno F, Mooney D, Newman JC, Laurent GS, McCaffrey TA. Meta-analysis of microarray results: challenges, opportunities, and recommendations for standardization. Gene 2007;401:12-8. [PMID: 17651921 PMCID: PMC2111172 DOI: 10.1016/j.gene.2007.06.016] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2007] [Revised: 06/06/2007] [Accepted: 06/12/2007] [Indexed: 12/31/2022]

131

Jain M, Khurana P, Tyagi AK, Khurana JP. Genome-wide analysis of intronless genes in rice and Arabidopsis. Funct Integr Genomics 2007;8:69-78. [PMID: 17578610 DOI: 10.1007/s10142-007-0052-9] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2007] [Revised: 04/07/2007] [Accepted: 05/06/2007] [Indexed: 10/23/2022]

132

Zhou L, Florea L. Designing sensitive and specific spaced seeds for cross-species mRNA-to-genome alignment. J Comput Biol 2007;14:113-30. [PMID: 17456011 DOI: 10.1089/cmb.2006.0130] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

133

Wishart DS. In Silico Drug Exploration and Discovery Using DrugBank. ACTA ACUST UNITED AC 2007;Chapter 14:Unit 14.4. [DOI: 10.1002/0471250953.bi1404s18] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

134

Gowri VS, Tina KG, Krishnadev O, Srinivasan N. Strategies for the effective identification of remotely related sequences in multiple PSSM search approach. Proteins 2007;67:789-94. [PMID: 17380509 DOI: 10.1002/prot.21356] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

135

Wilkerson MD, Schlueter SD, Brendel V. yrGATE: a web-based gene-structure annotation tool for the identification and dissemination of eukaryotic genes. Genome Biol 2007;7:R58. [PMID: 16859520 PMCID: PMC1779557 DOI: 10.1186/gb-2006-7-7-r58] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2006] [Revised: 06/08/2006] [Accepted: 07/05/2006] [Indexed: 11/10/2022] Open

136

PCR-based landmark unique gene (PLUG) markers effectively assign homoeologous wheat genes to A, B and D genomes. BMC Genomics 2007;8:135. [PMID: 17535443 PMCID: PMC1904201 DOI: 10.1186/1471-2164-8-135] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2007] [Accepted: 05/30/2007] [Indexed: 01/06/2023] Open

Abstract

BACKGROUND

EST-PCR markers normally represent specific products from target genes, and are therefore effective tools for genetic analysis. However, because wheat is an allohexaploid plant, PCR products derived from homoeologous genes are often simultaneously amplified. Such products may be easier to differentiate if they include intron sequences, which are more polymorphic than exon sequences. However, genomic sequence data for wheat are limited; therefore it is difficult to predict the location of introns. By using the similarities in gene structures between rice and wheat, we developed a system called PLUG (PCR-based Landmark Unique Gene) to design primers so that PCR products include intron sequences. We then investigated whether products amplified using such primers could serve as markers able to distinguish multiple products derived from homoeologous genes.

RESULTS

The PLUG system consists of the following steps: (1) Single-copy rice genes (Landmark Unique Gene loci; LUGs) exhibiting high degrees of homology to wheat UniGene sequences are extracted; (2) Alignment analysis is carried out using the LUGs and wheat UniGene sequences to predict exon-exon junctions, and LUGs which can be used to design wheat primers flanking introns (TaEST-LUGs) are extracted; and (3) Primers are designed in an interactive manner. From a total of 4,312 TaEST-LUGs, 24 loci were randomly selected and used to design primers. With all of these primer sets, we obtained specific, intron-containing products from the target genes. These markers were assigned to chromosomes using wheat nullisomic-tetrasomic lines. By PCR-RFLP analysis using agarose gel electrophoresis, 19 of the 24 markers were located on at least one chromosome.

CONCLUSION

In the development of wheat EST-PCR markers capable of efficiently sorting products derived from homoeologous genes, it is important to design primers able to amplify products that include intron sequences with insertion/deletion polymorphisms. Using the PLUG system, wheat EST sequences that can be used for marker development are selected based on comparative genomics with rice, and then primer sets flanking intron sequences are prepared in an interactive, semi-automatic manner. Hence, the PLUG system is an effective tool for large-scale marker development.

Collapse

137

Mazumder R, Hu ZZ, Vinayaka CR, Sagripanti JL, Frost SDW, Kosakovsky Pond SL, Wu CH. Computational analysis and identification of amino acid sites in dengue E proteins relevant to development of diagnostics and vaccines. Virus Genes 2007;35:175-86. [PMID: 17508277 DOI: 10.1007/s11262-007-0103-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2007] [Accepted: 04/11/2007] [Indexed: 10/23/2022]

138

Kowalska A, Bozsaky E, Ramsauer T, Rieder D, Bindea G, Lörch T, Trajanoski Z, Ambros PF. A new platform linking chromosomal and sequence information. Chromosome Res 2007;15:327-39. [PMID: 17406992 DOI: 10.1007/s10577-007-1129-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2006] [Revised: 01/24/2007] [Accepted: 01/24/2007] [Indexed: 10/23/2022]

139

Montgomery SB, Griffith OL, Schuetz JM, Brooks-Wilson A, Jones SJM. A survey of genomic properties for the detection of regulatory polymorphisms. PLoS Comput Biol 2007;3:e106. [PMID: 17559298 PMCID: PMC1892352 DOI: 10.1371/journal.pcbi.0030106] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2006] [Accepted: 04/25/2007] [Indexed: 11/18/2022] Open

140

Welsch C, Albrecht M, Maydt J, Herrmann E, Welker MW, Sarrazin C, Scheidig A, Lengauer T, Zeuzem S. Structural and functional comparison of the non-structural protein 4B in flaviviridae. J Mol Graph Model 2007;26:546-57. [PMID: 17507273 DOI: 10.1016/j.jmgm.2007.03.012] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2006] [Revised: 03/23/2007] [Accepted: 03/28/2007] [Indexed: 12/27/2022]

141

Pritham EJ, Putliwala T, Feschotte C. Mavericks, a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses. Gene 2007;390:3-17. [PMID: 17034960 DOI: 10.1016/j.gene.2006.08.008] [Citation(s) in RCA: 157] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2006] [Accepted: 08/02/2006] [Indexed: 11/23/2022]

Abstract

We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we speculate that Mavericks represent an evolutionary missing link between seemingly disparate invasive DNA elements that include bacteriophages, adenoviruses and eukaryotic linear plasmids.

Collapse

142

Gene function in early mouse embryonic stem cell differentiation. BMC Genomics 2007;8:85. [PMID: 17394647 PMCID: PMC1851713 DOI: 10.1186/1471-2164-8-85] [Citation(s) in RCA: 113] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2006] [Accepted: 03/29/2007] [Indexed: 12/20/2022] Open

Abstract

Background

Little is known about the genes that drive embryonic stem cell differentiation. However, such knowledge is necessary if we are to exploit the therapeutic potential of stem cells. To uncover the genetic determinants of mouse embryonic stem cell (mESC) differentiation, we have generated and analyzed 11-point time-series of DNA microarray data for three biologically equivalent but genetically distinct mESC lines (R1, J1, and V6.5) undergoing undirected differentiation into embryoid bodies (EBs) over a period of two weeks.

Results

We identified the initial 12 hour period as reflecting the early stages of mESC differentiation and studied probe sets showing consistent changes of gene expression in that period. Gene function analysis indicated significant up-regulation of genes related to regulation of transcription and mRNA splicing, and down-regulation of genes related to intracellular signaling. Phylogenetic analysis indicated that the genes showing the largest expression changes were more likely to have originated in metazoans. The probe sets with the most consistent gene changes in the three cell lines represented 24 down-regulated and 12 up-regulated genes, all with closely related human homologues. Whereas some of these genes are known to be involved in embryonic developmental processes (e.g. Klf4, Otx2, Smn1, Socs3, Tagln, Tdgf1), our analysis points to others (such as transcription factor Phf21a, extracellular matrix related Lama1 and Cyr61, or endoplasmic reticulum related Sc4mol and Scd2) that have not been previously related to mESC function. The majority of identified functions were related to transcriptional regulation, intracellular signaling, and cytoskeleton. Genes involved in other cellular functions important in ESC differentiation such as chromatin remodeling and transmembrane receptors were not observed in this set.

Conclusion

Our analysis profiles for the first time gene expression at a very early stage of mESC differentiation, and identifies a functional and phylogenetic signature for the genes involved. The data generated constitute a valuable resource for further studies. All DNA microarray data used in this study are available in the StemBase database of stem cell gene expression data [1] and in the NCBI's GEO database.

Collapse

143

Clavel T, Lippman R, Gavini F, Doré J, Blaut M. Clostridium saccharogumia sp. nov. and Lactonifactor longoviformis gen. nov., sp. nov., two novel human faecal bacteria involved in the conversion of the dietary phytoestrogen secoisolariciresinol diglucoside. Syst Appl Microbiol 2007;30:16-26. [PMID: 17196483 DOI: 10.1016/j.syapm.2006.02.003] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2006] [Indexed: 10/24/2022]

144

A comparative genomics approach to identifying the plasticity transcriptome. BMC Neurosci 2007;8:20. [PMID: 17355637 PMCID: PMC1831778 DOI: 10.1186/1471-2202-8-20] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2006] [Accepted: 03/13/2007] [Indexed: 02/04/2023] Open

145

Wood V. How to get the most from fission yeast genome data: a report from the 2006 European Fission Yeast Meeting computing workshop. Yeast 2007;23:905-12. [PMID: 17072881 DOI: 10.1002/yea.1419] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open

146

Holloway DT, Kon M, DeLisi C. Machine learning for regulatory analysis and transcription factor target prediction in yeast. SYSTEMS AND SYNTHETIC BIOLOGY 2007;1:25-46. [PMID: 19003435 PMCID: PMC2533145 DOI: 10.1007/s11693-006-9003-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Abstract

High throughput technologies, including array-based chromatin immunoprecipitation, have rapidly increased our knowledge of transcriptional maps-the identity and location of regulatory binding sites within genomes. Still, the full identification of sites, even in lower eukaryotes, remains largely incomplete. In this paper we develop a supervised learning approach to site identification using support vector machines (SVMs) to combine 26 different data types. A comparison with the standard approach to site identification using position specific scoring matrices (PSSMs) for a set of 104 Saccharomyces cerevisiae regulators indicates that our SVM-based target classification is more sensitive (73 vs. 20%) when specificity and positive predictive value are the same. We have applied our SVM classifier for each transcriptional regulator to all promoters in the yeast genome to obtain thousands of new targets, which are currently being analyzed and refined to limit the risk of classifier over-fitting. For the purpose of illustration we discuss several results, including biochemical pathway predictions for Gcn4 and Rap1. For both transcription factors SVM predictions match well with the known biology of control mechanisms, and possible new roles for these factors are suggested, such as a function for Rap1 in regulating fermentative growth. We also examine the promoter melting temperature curves for the targets of YJR060W, and show that targets of this TF have potentially unique physical properties which distinguish them from other genes. The SVM output automatically provides the means to rank dataset features to identify important biological elements. We use this property to rank classifying k-mers, thereby reconstructing known binding sites for several TFs, and to rank expression experiments, determining the conditions under which Fhl1, the factor responsible for expression of ribosomal protein genes, is active. We can see that targets of Fhl1 are differentially expressed in the chosen conditions as compared to the expression of average and negative set genes. SVM-based classifiers provide a robust framework for analysis of regulatory networks. Processing of classifier outputs can provide high quality predictions and biological insight into functions of particular transcription factors. Future work on this method will focus on increasing the accuracy and quality of predictions using feature reduction and clustering strategies. Since predictions have been made on only 104 TFs in yeast, new classifiers will be built for the remaining 100 factors which have available binding data.

Collapse

147

Zhu S, Okuno Y, Tsujimoto G, Mamitsuka H. Application of a new probabilistic model for mining implicit associated cancer genes from OMIM and medline. Cancer Inform 2007;2:361-71. [PMID: 19458778 PMCID: PMC2675505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

148

Marsh S, McLeod HL. Pharmacogenetics and oncology treatment for breast cancer. Expert Opin Pharmacother 2007;8:119-27. [PMID: 17257083 DOI: 10.1517/14656566.8.2.119] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

149

Grow M, Neff AW, Mescher AL, King MW. Global analysis of gene expression in Xenopus hindlimbs during stage-dependent complete and incomplete regeneration. Dev Dyn 2007;235:2667-85. [PMID: 16871633 DOI: 10.1002/dvdy.20897] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

150

Murphy AM, MacHugh DE, Park SDE, Scraggs E, Haley CS, Lynn DJ, Boland MP, Doherty ML. Linkage mapping of the locus for inherited ovine arthrogryposis (IOA) to sheep chromosome 5. Mamm Genome 2007;18:43-52. [PMID: 17242863 DOI: 10.1007/s00335-006-0016-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2006] [Accepted: 09/21/2006] [Indexed: 11/30/2022]