Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Journal Articles

Rank	Citation Analysis	Article Type	Number of Years	Citation(s) in RCA
1	Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform 2009;42:377-81. [PMID: 18929686 PMCID: PMC2700030 DOI: 10.1016/j.jbi.2008.08.010] [Citation(s) in RCA: 36093] [Impact Index Per Article: 2255.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2008] [Accepted: 08/26/2008] [Indexed: 02/06/2023] Abstract Research electronic data capture (REDCap) is a novel workflow methodology and software solution designed for rapid development and deployment of electronic data capture tools to support clinical and translational research. We present: (1) a brief description of the REDCap metadata-driven software toolset; (2) detail concerning the capture and use of study-related metadata from scientific research teams; (3) measures of impact for REDCap; (4) details concerning a consortium network of domestic and international institutions collaborating on the project; and (5) strengths and limitations of the REDCap system. REDCap is currently supporting 286 translational research projects in a growing collaborative network including 27 active partner institutions. Collapse Key Words medical informatics electronic data capture clinical research translational research Collapse MESH Headings Biomedical Research Clinical Trials as Topic Data Collection/methods Humans Internet Medical Informatics/methods Software Collapse Grants M01 RR000058 NCRR NIH HHS TL1 RR024978 NCRR NIH HHS G12 RR003051 NCRR NIH HHS 5M01RR000058-45 NCRR NIH HHS 1UL1 RR024975 NCRR NIH HHS G12RR03051 NCRR NIH HHS M01 RR000095 NCRR NIH HHS 5M01-RR00095 NCRR NIH HHS KL2 RR024977 NCRR NIH HHS UL1 RR024975 NCRR NIH HHS G12 MD007600 NIMHD NIH HHS Collapse Collaborators Collapse	Research Support, N.I.H., Extramural	16	36093
2	Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2004;13:2498-504. [PMID: 14597658 PMCID: PMC403769 DOI: 10.1101/gr.1239303] [Citation(s) in RCA: 32961] [Impact Index Per Article: 1569.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Abstract Cytoscape is an open source software project for integrating biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework. Although applicable to any system of molecular components and interactions, Cytoscape is most powerful when used in conjunction with large databases of protein-protein, protein-DNA, and genetic interactions that are increasingly available for humans and model organisms. Cytoscape's software Core provides basic functionality to layout and query the network; to visually integrate the network with expression profiles, phenotypes, and other molecular states; and to link the network to databases of functional annotations. The Core is extensible through a straightforward plug-in architecture, allowing rapid development of additional computational analyses and features. Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models. Collapse Key Words Collapse MESH Headings Algorithms Archaeal Proteins/chemistry Archaeal Proteins/metabolism Bacteriophage lambda/physiology Computational Biology/methods Halobacterium/chemistry Halobacterium/cytology Halobacterium/physiology Internet Models, Biological Neural Networks, Computer Phenotype Software/trends Software Design Stochastic Processes Collapse Grants P20 GM064361 NIGMS NIH HHS P20 GM64361 NIGMS NIH HHS Collapse Collaborators Collapse	Research Support, U.S. Gov't, P.H.S.	21	32961
3	Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004;32:1792-7. [PMID: 15034147 PMCID: PMC390337 DOI: 10.1093/nar/gkh340] [Citation(s) in RCA: 29781] [Impact Index Per Article: 1418.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open Abstract We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5. com/muscle. Collapse Key Words Collapse MESH Headings Algorithms Amino Acid Motifs Amino Acid Sequence Internet Molecular Sequence Data Reproducibility of Results Sequence Alignment/methods Sequence Analysis, Protein/methods Software Time Factors Collapse Grants Collapse Collaborators Collapse	Journal Article	21	29781
4	Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res 2000;28:235-42. [PMID: 10592235 PMCID: PMC102472 DOI: 10.1093/nar/28.1.235] [Citation(s) in RCA: 26950] [Impact Index Per Article: 1078.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/1999] [Revised: 10/17/1999] [Accepted: 10/17/1999] [Indexed: 11/14/2022] Open Abstract The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource. Collapse Key Words Collapse MESH Headings Databases, Factual Information Storage and Retrieval Internet Magnetic Resonance Spectroscopy Protein Conformation Proteins/chemistry Collapse Grants Collapse Collaborators Collapse	research-article	25	26950
5	Tamura K, Dudley J, Nei M, Kumar S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 2007;24:1596-9. [PMID: 17488738 DOI: 10.1093/molbev/msm092] [Citation(s) in RCA: 19658] [Impact Index Per Article: 1092.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open Abstract We announce the release of the fourth version of MEGA software, which expands on the existing facilities for editing DNA sequence data from autosequencers, mining Web-databases, performing automatic and manual sequence alignment, analyzing sequence alignments to estimate evolutionary distances, inferring phylogenetic trees, and testing evolutionary hypotheses. Version 4 includes a unique facility to generate captions, written in figure legend format, in order to provide natural language descriptions of the models and methods used in the analyses. This facility aims to promote a better understanding of the underlying assumptions used in analyses, and of the results generated. Another new feature is the Maximum Composite Likelihood (MCL) method for estimating evolutionary distances between all pairs of sequences simultaneously, with and without incorporating rate variation among sites and substitution pattern heterogeneities among lineages. This MCL method also can be used to estimate transition/transversion bias and nucleotide substitution pattern without knowledge of the phylogenetic tree. This new version is a native 32-bit Windows application with multi-threading and multi-user supports, and it is also available to run in a Linux desktop environment (via the Wine compatibility layer) and on Intel-based Macintosh computers under the Parallels program. The current version of MEGA is available free of charge at (http://www.megasoftware.net). Collapse Key Words Collapse MESH Headings Databases, Genetic Evolution, Molecular Internet Phylogeny Sequence Alignment/methods Sequence Analysis, DNA Sequence Analysis, Protein Software Collapse Grants Collapse Collaborators Collapse	Research Support, Non-U.S. Gov't	18	19658
6	Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 2013;41. [PMID: 23193283 PMCID: PMC3531112 DOI: 10.1093/nar/gks1219] [Citation(s) in RCA: 17774] [Impact Index Per Article: 1481.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open Abstract SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive web resource for up to date, quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. The referred database release 111 (July 2012) contains 3 194 778 small subunit and 288 717 large subunit rRNA gene sequences. Since the initial description of the project, substantial new features have been introduced, including advanced quality control procedures, an improved rRNA gene aligner, online tools for probe and primer evaluation and optimized browsing, searching and downloading on the website. Furthermore, the extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches. Collapse Key Words Collapse MESH Headings Archaea/classification Archaea/genetics Bacteria/classification Bacteria/genetics Databases, Nucleic Acid Eukaryota/genetics Genes, rRNA High-Throughput Nucleotide Sequencing Internet Software Collapse Grants Collapse Collaborators Collapse	research-article	12	17774
7	Posada D, Crandall KA. MODELTEST: testing the model of DNA substitution. Bioinformatics 1999;14:817-8. [PMID: 9918953 DOI: 10.1093/bioinformatics/14.9.817] [Citation(s) in RCA: 12299] [Impact Index Per Article: 473.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open Abstract SUMMARY The program MODELTEST uses log likelihood scores to establish the model of DNA evolution that best fits the data. AVAILABILITY The MODELTEST package, including the source code and some documentation is available at http://bioag.byu. edu/zoology/crandall_lab/modeltest.html. Collapse Key Words Collapse MESH Headings Algorithms DNA/genetics DNA/metabolism Evolution, Molecular Internet Models, Biological Online Systems Software Collapse Grants Collapse Collaborators Collapse		26	12299
8	Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2004;21:263-5. [PMID: 15297300 DOI: 10.1093/bioinformatics/bth457] [Citation(s) in RCA: 11830] [Impact Index Per Article: 563.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open Abstract UNLABELLED Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the context of medical genetic association studies, is becoming a routine research activity. Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface. AVAILABILITY http://www.broad.mit.edu/mpg/haploview/ CONTACT jcbarret@broad.mit.edu Collapse Key Words Collapse MESH Headings Algorithms Chromosome Mapping/methods Haplotypes/genetics Internet Linkage Disequilibrium/genetics Programming Languages Sequence Alignment/methods Sequence Analysis, DNA/methods Software User-Computer Interface Collapse Grants Collapse Collaborators Collapse	Journal Article	21	11830
9	Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, Sun Y, Jacobsen A, Sinha R, Larsson E, Cerami E, Sander C, Schultz N. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal 2013;6:pl1. [PMID: 23550210 PMCID: PMC4160307 DOI: 10.1126/scisignal.2004088] [Citation(s) in RCA: 11117] [Impact Index Per Article: 926.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Abstract The cBioPortal for Cancer Genomics (http://cbioportal.org) provides a Web resource for exploring, visualizing, and analyzing multidimensional cancer genomics data. The portal reduces molecular profiling data from cancer tissues and cell lines into readily understandable genetic, epigenetic, gene expression, and proteomic events. The query interface combined with customized data storage enables researchers to interactively explore genetic alterations across samples, genes, and pathways and, when available in the underlying data, to link these to clinical outcomes. The portal provides graphical summaries of gene-level data from multiple platforms, network visualization and analysis, survival analysis, patient-centric queries, and software programmatic access. The intuitive Web interface of the portal makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries. Here, we provide a practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics. Collapse Key Words Collapse MESH Headings Gene Expression Profiling Gene Regulatory Networks Genetic Predisposition to Disease/genetics Genomics Humans Information Storage and Retrieval/methods Internet Neoplasms/genetics Neoplasms/pathology Reproducibility of Results Software Collapse Grants U24CA143840 NCI NIH HHS P41 RR031228 NCRR NIH HHS R01 CA132744 NCI NIH HHS P41 GM103504 NIGMS NIH HHS R21 CA135870 NCI NIH HHS U24 CA143840 NCI NIH HHS GM103504 NIGMS NIH HHS R21CA135870 NCI NIH HHS Collapse Collaborators Collapse	Research Support, N.I.H., Extramural	12	11117
10	Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative genomics viewer. Nat Biotechnol 2011;29:24-6. [PMID: 21221095 PMCID: PMC3346182 DOI: 10.1038/nbt.1754] [Citation(s) in RCA: 10535] [Impact Index Per Article: 752.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Abstract Collapse Key Words Collapse MESH Headings Chromosome Mapping/methods Computational Biology/methods Computer Graphics Gene Dosage Gene Expression Profiling Genomics/methods Glioblastoma/genetics Humans Information Storage and Retrieval/methods Internet Neoplasms/genetics Oligonucleotide Array Sequence Analysis Online Systems Polymorphism, Single Nucleotide Software Collapse Grants R01 GM074024 NIGMS NIH HHS U54 HG003067 NHGRI NIH HHS U54HG003067 NHGRI NIH HHS R21 CA135827 NCI NIH HHS R21CA135827 NCI NIH HHS R01GM074024 NIGMS NIH HHS Collapse Collaborators Collapse	Letter	14	10535
11	Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, Sivertsson Å, Kampf C, Sjöstedt E, Asplund A, Olsson I, Edlund K, Lundberg E, Navani S, Szigyarto CAK, Odeberg J, Djureinovic D, Takanen JO, Hober S, Alm T, Edqvist PH, Berling H, Tegel H, Mulder J, Rockberg J, Nilsson P, Schwenk JM, Hamsten M, von Feilitzen K, Forsberg M, Persson L, Johansson F, Zwahlen M, von Heijne G, Nielsen J, Pontén F. Tissue-based map of the human proteome. Science 2015;347:1260419. [PMID: 25613900 DOI: 10.1126/science.1260419] [Citation(s) in RCA: 10347] [Impact Index Per Article: 1034.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Abstract Collapse Key Words Collapse MESH Headings Alternative Splicing Cell Line Databases, Protein Female Genes Genetic Code Humans Internet Male Membrane Proteins/genetics Membrane Proteins/metabolism Mitochondrial Proteins/genetics Mitochondrial Proteins/metabolism Neoplasms/genetics Neoplasms/metabolism Protein Array Analysis Protein Isoforms/genetics Protein Isoforms/metabolism Proteome/genetics Proteome/metabolism Tissue Distribution Transcription, Genetic Collapse Grants Collapse Collaborators Collapse		10	10347
12	Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 2003;31:3406-15. [PMID: 12824337 PMCID: PMC169194 DOI: 10.1093/nar/gkg595] [Citation(s) in RCA: 10092] [Impact Index Per Article: 458.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2003] [Revised: 04/07/2003] [Accepted: 04/07/2003] [Indexed: 11/13/2022] Open Abstract The abbreviated name, 'mfold web server', describes a number of closely related software applications available on the World Wide Web (WWW) for the prediction of the secondary structure of single stranded nucleic acids. The objective of this web server is to provide easy access to RNA and DNA folding and hybridization software to the scientific community at large. By making use of universally available web GUIs (Graphical User Interfaces), the server circumvents the problem of portability of this software. Detailed output, in the form of structure plots with or without reliability information, single strand frequency plots and 'energy dot plots', are available for the folding of single sequences. A variety of 'bulk' servers give less information, but in a shorter time and for up to hundreds of sequences at once. The portal for the mfold web server is http://www.bioinfo.rpi.edu/applications/mfold. This URL will be referred to as 'MFOLDROOT'. Collapse Key Words Collapse MESH Headings Base Sequence Computer Graphics DNA/chemistry DNA, Single-Stranded/chemistry Databases, Nucleic Acid Internet Models, Molecular Nucleic Acid Conformation Nucleic Acid Denaturation Nucleic Acid Hybridization RNA/chemistry Software Thermodynamics User-Computer Interface Collapse Grants R01 GM054250 NIGMS NIH HHS GM54250 NIGMS NIH HHS Collapse Collaborators Collapse	research-article	22	10092
13	Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 2002;30:207-10. [PMID: 11752295 PMCID: PMC99122 DOI: 10.1093/nar/30.1.207] [Citation(s) in RCA: 9622] [Impact Index Per Article: 418.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open Abstract The Gene Expression Omnibus (GEO) project was initiated in response to the growing demand for a public repository for high-throughput gene expression data. GEO provides a flexible and open design that facilitates submission, storage and retrieval of heterogeneous data sets from high-throughput gene expression and genomic hybridization experiments. GEO is not intended to replace in house gene expression databases that benefit from coherent data sets, and which are constructed to facilitate a particular analytic method, but rather complement these by acting as a tertiary, central data distribution hub. The three central data entities of GEO are platforms, samples and series, and were designed with gene expression and genomic hybridization experiments in mind. A platform is, essentially, a list of probes that define what set of molecules may be detected. A sample describes the set of molecules that are being probed and references a single platform used to generate its molecular abundance data. A series organizes samples into the meaningful data sets which make up an experiment. The GEO repository is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo. Collapse Key Words Collapse MESH Headings Animals Communication Database Management Systems Databases, Genetic Forecasting Gene Expression Profiling Genome Humans Information Storage and Retrieval Internet National Library of Medicine (U.S.) Oligonucleotide Array Sequence Analysis United States Collapse Grants Collapse Collaborators Collapse	research-article	23	9622
14	Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JYH, Zhang J. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004;5:R80. [PMID: 15461798 PMCID: PMC545600 DOI: 10.1186/gb-2004-5-10-r80] [Citation(s) in RCA: 9575] [Impact Index Per Article: 456.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2004] [Revised: 07/01/2004] [Accepted: 08/03/2004] [Indexed: 12/12/2022] Open Abstract The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering collaborative development and widespread use of innovative software, reducing barriers to entry into interdisciplinary scientific research, and promoting the achievement of remote reproducibility of research results. We describe details of our aims and methods, identify current challenges, compare Bioconductor to other open bioinformatics projects, and provide working examples. Collapse Key Words Collapse MESH Headings Computational Biology/instrumentation Computational Biology/methods Internet Reproducibility of Results Software Collapse Grants Collapse Collaborators Collapse	research-article	21	9575
15	Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome Res 2004;14:1188-90. [PMID: 15173120 PMCID: PMC419797 DOI: 10.1101/gr.849004] [Citation(s) in RCA: 9564] [Impact Index Per Article: 455.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2002] [Accepted: 01/06/2004] [Indexed: 11/25/2022] Abstract WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. Collapse Key Words Collapse MESH Headings Amino Acid Sequence Base Sequence Computer Graphics Cyclic AMP Receptor Protein/chemistry DNA, Bacterial/chemistry Escherichia coli Proteins/chemistry Internet Nucleic Acid Conformation Protein Structure, Quaternary Software Collapse Grants K22 HG000056 NHGRI NIH HHS P50 GM062412 NIGMS NIH HHS 1-K22-HG00056 NHGRI NIH HHS 1-P50-GM62412 NIGMS NIH HHS Collapse Collaborators Collapse	research-article	21	9564
16	Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 2001;305:567-80. [PMID: 11152613 DOI: 10.1006/jmbi.2000.4315] [Citation(s) in RCA: 9553] [Impact Index Per Article: 398.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Abstract We describe and validate a new membrane protein topology prediction method, TMHMM, based on a hidden Markov model. We present a detailed analysis of TMHMM's performance, and show that it correctly predicts 97-98 % of the transmembrane helices. Additionally, TMHMM can discriminate between soluble and membrane proteins with both specificity and sensitivity better than 99 %, although the accuracy drops when signal peptides are present. This high degree of accuracy allowed us to predict reliably integral membrane proteins in a large collection of genomes. Based on these predictions, we estimate that 20-30 % of all genes in most genomes encode membrane proteins, which is in agreement with previous estimates. We further discovered that proteins with N(in)-C(in) topologies are strongly preferred in all examined organisms, except Caenorhabditis elegans, where the large number of 7TM receptors increases the counts for N(out)-C(in) topologies. We discuss the possible relevance of this finding for our understanding of membrane protein assembly mechanisms. A TMHMM prediction service is available at http://www.cbs.dtu.dk/services/TMHMM/. Collapse Key Words Collapse MESH Headings Animals Bacterial Proteins/chemistry Computational Biology/methods Databases as Topic Fungal Proteins/chemistry Genome Internet Markov Chains Membrane Proteins/chemistry Plant Proteins/chemistry Porins/chemistry Protein Sorting Signals Protein Structure, Secondary Reproducibility of Results Research Design Sensitivity and Specificity Software Solubility Collapse Grants Collapse Collaborators Collapse		24	9553
17	Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005;21:3674-6. [PMID: 16081474 DOI: 10.1093/bioinformatics/bti610] [Citation(s) in RCA: 8340] [Impact Index Per Article: 417.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open Abstract SUMMARY We present here Blast2GO (B2G), a research tool designed with the main purpose of enabling Gene Ontology (GO) based data mining on sequence data for which no GO annotation is yet available. B2G joints in one application GO annotation based on similarity searches with statistical analysis and highlighted visualization on directed acyclic graphs. This tool offers a suitable platform for functional genomics research in non-model species. B2G is an intuitive and interactive desktop application that allows monitoring and comprehension of the whole annotation and analysis process. AVAILABILITY Blast2GO is freely available via Java Web Start at http://www.blast2go.de. SUPPLEMENTARY MATERIAL http://www.blast2go.de -> Evaluation. Collapse Key Words Collapse MESH Headings Algorithms Computational Biology/methods Computer Graphics Database Management Systems Databases, Protein Gene Expression Profiling/methods Genome Genomics Information Storage and Retrieval/methods Internet Oligonucleotide Array Sequence Analysis Sequence Alignment Sequence Analysis, Protein Software User-Computer Interface Collapse Grants Collapse Collaborators Collapse	Research Support, Non-U.S. Gov't	20	8340
18	Kumar S, Tamura K, Nei M. MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform 2005;5:150-63. [PMID: 15260895 DOI: 10.1093/bib/5.2.150] [Citation(s) in RCA: 8079] [Impact Index Per Article: 404.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open Abstract With its theoretical basis firmly established in molecular evolutionary and population genetics, the comparative DNA and protein sequence analysis plays a central role in reconstructing the evolutionary histories of species and multigene families, estimating rates of molecular evolution, and inferring the nature and extent of selective forces shaping the evolution of genes and genomes. The scope of these investigations has now expanded greatly owing to the development of high-throughput sequencing techniques and novel statistical and computational methods. These methods require easy-to-use computer programs. One such effort has been to produce Molecular Evolutionary Genetics Analysis (MEGA) software, with its focus on facilitating the exploration and analysis of the DNA and protein sequence variation from an evolutionary perspective. Currently in its third major release, MEGA3 contains facilities for automatic and manual sequence alignment, web-based mining of databases, inference of the phylogenetic trees, estimation of evolutionary distances and testing evolutionary hypotheses. This paper provides an overview of the statistical methods, computational tools, and visual exploration modules for data input and the results obtainable in MEGA. Collapse Key Words Collapse MESH Headings Databases, Genetic Evolution, Molecular Internet Phylogeny Sequence Alignment/methods Sequence Analysis, DNA Sequence Analysis, Protein Software Collapse Grants Collapse Collaborators Collapse	Research Support, U.S. Gov't, P.H.S.	20	8079
19	Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res 2017;45:W98-W102. [PMID: 28407145 PMCID: PMC5570223 DOI: 10.1093/nar/gkx247] [Citation(s) in RCA: 6980] [Impact Index Per Article: 872.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2017] [Revised: 03/27/2017] [Accepted: 04/05/2017] [Indexed: 12/11/2022] Open Abstract Tremendous amount of RNA sequencing data have been produced by large consortium projects such as TCGA and GTEx, creating new opportunities for data mining and deeper understanding of gene functions. While certain existing web servers are valuable and widely used, many expression analysis functions needed by experimental biologists are still not adequately addressed by these tools. We introduce GEPIA (Gene Expression Profiling Interactive Analysis), a web-based tool to deliver fast and customizable functionalities based on TCGA and GTEx data. GEPIA provides key interactive and customizable functions including differential expression analysis, profiling plotting, correlation analysis, patient survival analysis, similar gene detection and dimensionality reduction analysis. The comprehensive expression analyses with simple clicking through GEPIA greatly facilitate data mining in wide research areas, scientific discussion and the therapeutic discovery process. GEPIA fills in the gap between cancer genomics big data and the delivery of integrated information to end users, thus helping unleash the value of the current data resources. GEPIA is available at http://gepia.cancer-pku.cn/. Collapse Key Words Collapse MESH Headings Data Mining Gene Expression Profiling Gene Expression Regulation, Neoplastic Humans Internet Multifactor Dimensionality Reduction Neoplasm Proteins/genetics Neoplasm Proteins/metabolism Neoplasms/genetics Neoplasms/mortality Neoplasms/pathology Survival Analysis User-Computer Interface Collapse Grants Collapse Collaborators Collapse	research-article	8	6980
20	Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 2009;37:W202-8. [PMID: 19458158 PMCID: PMC2703892 DOI: 10.1093/nar/gkp335] [Citation(s) in RCA: 6816] [Impact Index Per Article: 426.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2009] [Revised: 04/10/2009] [Accepted: 04/21/2009] [Indexed: 11/13/2022] Open Abstract The MEME Suite web server provides a unified portal for online discovery and analysis of sequence motifs representing features such as DNA binding sites and protein interaction domains. The popular MEME motif discovery algorithm is now complemented by the GLAM2 algorithm which allows discovery of motifs containing gaps. Three sequence scanning algorithms--MAST, FIMO and GLAM2SCAN--allow scanning numerous DNA and protein sequence databases for motifs discovered by MEME and GLAM2. Transcription factor motifs (including those discovered using MEME) can be compared with motifs in many popular motif databases using the motif database scanning algorithm TOMTOM. Transcription factor motifs can be further analyzed for putative function by association with Gene Ontology (GO) terms using the motif-GO term association tool GOMO. MEME output now contains sequence LOGOS for each discovered motif, as well as buttons to allow motifs to be conveniently submitted to the sequence and motif database scanning algorithms (MAST, FIMO and TOMTOM), or to GOMO, for further analysis. GLAM2 output similarly contains buttons for further analysis using GLAM2SCAN and for rerunning GLAM2 with different parameters. All of the motif-based tools are now implemented as web services via Opal. Source code, binaries and a web server are freely available for noncommercial use at http://meme.nbcr.net. Collapse Key Words Collapse MESH Headings Algorithms Binding Sites Databases, Genetic Internet Regulatory Elements, Transcriptional Sequence Analysis, DNA Sequence Analysis, Protein Software Transcription Factors/metabolism Collapse Grants P41 RR008605 NCRR NIH HHS R01 RR021692 NCRR NIH HHS P41 RR08605 NCRR NIH HHS Collapse Collaborators Collapse	Research Support, N.I.H., Extramural	16	6816
21	Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 2000;16:276-7. [PMID: 10827456 DOI: 10.1016/s0168-9525(00)02024-2] [Citation(s) in RCA: 6688] [Impact Index Per Article: 267.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Abstract Collapse Key Words Collapse MESH Headings Internet Molecular Biology Sequence Alignment/methods Software User-Computer Interface Collapse Grants Collapse Collaborators Collapse		25	6688
22	Baba T, Ara T, Hasegawa M, Takai Y, Okumura Y, Baba M, Datsenko KA, Tomita M, Wanner BL, Mori H. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol 2006;2:2006.0008. [PMID: 16738554 PMCID: PMC1681482 DOI: 10.1038/msb4100050] [Citation(s) in RCA: 5898] [Impact Index Per Article: 310.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2005] [Accepted: 12/07/2005] [Indexed: 11/17/2022] Open Abstract We have systematically made a set of precisely defined, single-gene deletions of all nonessential genes in Escherichia coli K-12. Open-reading frame coding regions were replaced with a kanamycin cassette flanked by FLP recognition target sites by using a one-step method for inactivation of chromosomal genes and primers designed to create in-frame deletions upon excision of the resistance cassette. Of 4288 genes targeted, mutants were obtained for 3985. To alleviate problems encountered in high-throughput studies, two independent mutants were saved for every deleted gene. These mutants—the ‘Keio collection'—provide a new resource not only for systematic analyses of unknown gene functions and gene regulatory networks but also for genome-wide testing of mutational effects in a common strain background, E. coli K-12 BW25113. We were unable to disrupt 303 genes, including 37 of unknown function, which are candidates for essential genes. Distribution is being handled via GenoBase (http://ecoli.aist-nara.ac.jp/). Collapse Key Words Collapse MESH Headings Escherichia coli/genetics Gene Deletion Internet Mutation Organisms, Genetically Modified Collapse Grants R01 GM062662 NIGMS NIH HHS GM62662 NIGMS NIH HHS Collapse Collaborators Collapse	Research Support, Non-U.S. Gov't	19	5898
23	Jo S, Kim T, Iyer VG, Im W. CHARMM-GUI: a web-based graphical user interface for CHARMM. J Comput Chem 2008;29:1859-65. [PMID: 18351591 DOI: 10.1002/jcc.20945] [Citation(s) in RCA: 5696] [Impact Index Per Article: 335.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Abstract CHARMM is an academic research program used widely for macromolecular mechanics and dynamics with versatile analysis and manipulation tools of atomic coordinates and dynamics trajectories. CHARMM-GUI, http://www.charmm-gui.org, has been developed to provide a web-based graphical user interface to generate various input files and molecular systems to facilitate and standardize the usage of common and advanced simulation techniques in CHARMM. The web environment provides an ideal platform to build and validate a molecular model system in an interactive fashion such that, if a problem is found through visual inspection, one can go back to the previous setup and regenerate the whole system again. In this article, we describe the currently available functional modules of CHARMM-GUI Input Generator that form a basis for the advanced simulation techniques. Future directions of the CHARMM-GUI development project are also discussed briefly together with other features in the CHARMM-GUI website, such as Archive and Movie Gallery. Collapse Key Words Collapse MESH Headings Computer Graphics Internet Protein Conformation Proteins/chemistry Solvents/chemistry Static Electricity User-Computer Interface Collapse Grants Collapse Collaborators Collapse	Research Support, Non-U.S. Gov't	17	5696
24	Pfaffl MW, Horgan GW, Dempfle L. Relative expression software tool (REST) for group-wise comparison and statistical analysis of relative expression results in real-time PCR. Nucleic Acids Res 2002;30:e36. [PMID: 11972351 PMCID: PMC113859 DOI: 10.1093/nar/30.9.e36] [Citation(s) in RCA: 5632] [Impact Index Per Article: 244.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open Abstract Real-time reverse transcription followed by polymerase chain reaction (RT-PCR) is the most suitable method for the detection and quantification of mRNA. It offers high sensitivity, good reproducibility and a wide quantification range. Today, relative expression is increasingly used, where the expression of a target gene is standardised by a non-regulated reference gene. Several mathematical algorithms have been developed to compute an expression ratio, based on real-time PCR efficiency and the crossing point deviation of an unknown sample versus a control. But all published equations and available models for the calculation of relative expression ratio allow only for the determination of a single transcription difference between one control and one sample. Therefore a new software tool was established, named REST (relative expression software tool), which compares two groups, with up to 16 data points in a sample and 16 in a control group, for reference and up to four target genes. The mathematical model used is based on the PCR efficiencies and the mean crossing point deviation between the sample and control group. Subsequently, the expression ratio results of the four investigated transcripts are tested for significance by a randomisation test. Herein, development and application of REST is explained and the usefulness of relative expression in real-time PCR using REST is discussed. The latest software version of REST and examples for the correct use can be downloaded at http://www.wzw.tum.de/gene-quantification/. Collapse Key Words Collapse MESH Headings Animals Internet Models, Statistical RNA Probes RNA, Messenger/analysis Rats Reference Standards Reproducibility of Results Reverse Transcriptase Polymerase Chain Reaction Sensitivity and Specificity Software Time Factors Transcription, Genetic Collapse Grants Collapse Collaborators Collapse	research-article	23	5632
25	Arnold K, Bordoli L, Kopp J, Schwede T. The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. ACTA ACUST UNITED AC 2005;22:195-201. [PMID: 16301204 DOI: 10.1093/bioinformatics/bti770] [Citation(s) in RCA: 5574] [Impact Index Per Article: 278.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Abstract MOTIVATION Homology models of proteins are of great interest for planning and analysing biological experiments when no experimental three-dimensional structures are available. Building homology models requires specialized programs and up-to-date sequence and structural databases. Integrating all required tools, programs and databases into a single web-based workspace facilitates access to homology modelling from a computer with web connection without the need of downloading and installing large program packages and databases. RESULTS SWISS-MODEL workspace is a web-based integrated service dedicated to protein structure homology modelling. It assists and guides the user in building protein homology models at different levels of complexity. A personal working environment is provided for each user where several modelling projects can be carried out in parallel. Protein sequence and structure databases necessary for modelling are accessible from the workspace and are updated in regular intervals. Tools for template selection, model building and structure quality evaluation can be invoked from within the workspace. Workflow and usage of the workspace are illustrated by modelling human Cyclin A1 and human Transmembrane Protease 3. AVAILABILITY The SWISS-MODEL workspace can be accessed freely at http://swissmodel.expasy.org/workspace/ Collapse Key Words Collapse MESH Headings Algorithms Amino Acid Sequence Computer Simulation Databases, Protein Internet Models, Chemical Models, Molecular Molecular Sequence Data Online Systems Protein Conformation Proteins/analysis Proteins/chemistry Proteins/classification Sequence Alignment/methods Sequence Analysis, Protein/methods Sequence Homology, Amino Acid Software Systems Integration User-Computer Interface Collapse Grants Collapse Collaborators Collapse	Research Support, Non-U.S. Gov't	20	5574

Please SIGN IN to browse more articles.