Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 33 (from Reference Citation Analysis)

Article PDFs: 9

Cited by > 0: 26

Searched Name: Databases, Nucleic Acid/trends

Number	Citation Analysis
1	The 2024 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2024;52:D1-D9. [PMID: 38035367 PMCID: PMC10767945 DOI: 10.1093/nar/gkad1173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2023] [Accepted: 11/23/2023] [Indexed: 12/02/2023] Abstract: The 2024 Nucleic Acids Research database issue contains 180 papers from across biology and neighbouring disciplines. There are 90 papers reporting on new databases and 83 updates from resources previously published in the Issue. Updates from databases most recently published elsewhere account for a further seven. Nucleic acid databases include the new NAKB for structural information and updates from Genbank, ENA, GEO, Tarbase and JASPAR. The Issue's Breakthrough Article concerns NMPFamsDB for novel prokaryotic protein families and the AlphaFold Protein Structure Database has an important update. Metabolism is covered by updates from Reactome, Wikipathways and Metabolights. Microbes are covered by RefSeq, UNITE, SPIRE and P10K; viruses by ViralZone and PhageScope. Medically-oriented databases include the familiar COSMIC, Drugbank and TTD. Genomics-related resources include Ensembl, UCSC Genome Browser and Monarch. New arrivals cover plant imaging (OPIA and PlantPAD) and crop plants (SoyMD, TCOD and CropGS-Hub). The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Over the last year the NAR online Molecular Biology Database Collection has been updated, reviewing 1060 entries, adding 97 new resources and eliminating 388 discontinued URLs bringing the current total to 1959 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
MeSH Headings: Computational Biology; Databases, Genetic; Databases, Nucleic Acid/trends; Genomics; Internet; Molecular Biology/trends. Grants: Oxford University Press.
2	RefSeq and the prokaryotic genome annotation pipeline in the age of metagenomes. Nucleic Acids Res 2024;52:D762-D769. [PMID: 37962425 PMCID: PMC10767926 DOI: 10.1093/nar/gkad988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2023] [Revised: 10/13/2023] [Accepted: 10/18/2023] [Indexed: 11/15/2023] Abstract: The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) contains over 315 000 bacterial and archaeal genomes and 236 million proteins with up-to-date and consistent annotation. In the past 3 years, we have expanded the diversity of the RefSeq collection by including the best quality metagenome-assembled genomes (MAGs) submitted to INSDC (DDBJ, ENA and GenBank), while maintaining its quality by adding validation checks. Assemblies are now more stringently evaluated for contamination and for completeness of annotation prior to acceptance into RefSeq. MAGs now account for over 17 000 assemblies in RefSeq, split over 165 orders and 362 families. Changes in the Prokaryotic Genome Annotation Pipeline (PGAP), which is used to annotate nearly all RefSeq assemblies, include better detection of protein-coding genes. Nearly 83% of RefSeq proteins are now named by a curated Protein Family Model, a 4.7% increase in the past three years. In addition to literature citations, Enzyme Commission numbers, and gene symbols, Gene Ontology terms are now assigned to 48% of RefSeq proteins, allowing for easier multi-genome comparison. RefSeq is found at https://www.ncbi.nlm.nih.gov/refseq/. PGAP is available as a stand-alone tool able to produce GenBank-ready files at https://github.com/ncbi/pgap.
MeSH Headings: Archaea/genetics; Bacteria/genetics; Databases, Nucleic Acid/standards; Databases, Nucleic Acid/trends; Genome, Archaeal/genetics; Genome, Bacterial/genetics; Internet; Metagenome; Molecular Sequence Annotation; Proteins/genetics. Grants: NLM NIH HHS; NIH HHS; National Library of Medicine; National Institutes of Health.
3	GenBank. Nucleic Acids Res 2019;47:D94-D99. [PMID: 30365038 PMCID: PMC6323954 DOI: 10.1093/nar/gky989] [Citation(s) in RCA: 253] [Impact Index Per Article: 50.6] [Received: 09/18/2018] [Revised: 10/03/2018] [Accepted: 10/18/2018] [Indexed: 11/14/2022] Abstract: GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for 420 000 formally described species. Most GenBank submissions are made using BankIt, the NCBI Submission Portal, or the tool tbl2asn, and are obtained from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include an expansion of sequence identifier formats to accommodate expected database growth, submission wizards for ribosomal RNA, and the transfer of Expressed Sequence Tag (EST) and Genome Survey Sequence (GSS) data into the Nucleotide database.
MeSH Headings: Computational Biology/methods; Databases, Nucleic Acid/trends; Genomics/methods; Humans; Information Storage and Retrieval; Software Design; Web Browser.
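The "Impact Index Per Article" values in this listing appear consistent with citations divided by the number of years since publication, taking 2024 as the reference year (for example, 253 citations for a 2019 article gives 50.6). The page itself does not state the formula, so the sketch below is an assumption inferred from the listed numbers. The function name `impact_index`, the `reference_year` default, and the handling of articles published in the reference year are all hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def impact_index(citations: int, year_published: int, reference_year: int = 2024) -> Decimal:
    """Citations per elapsed year, rounded half-up to one decimal place.

    Assumption: this mirrors how the listing's index seems to be derived;
    it is not a documented formula.
    """
    years = reference_year - year_published
    if years <= 0:
        # Assumed convention: no elapsed years means an index of 0.
        return Decimal("0")
    ratio = Decimal(citations) / Decimal(years)
    # Decimal with ROUND_HALF_UP avoids binary-float surprises at x.x5 boundaries.
    return ratio.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Values taken from entries in this listing.
    print(impact_index(253, 2019))   # GenBank entry
    print(impact_index(3202, 2003))  # COG database entry
```

Exact decimal arithmetic is used so that borderline values such as 5 citations over 20 years (0.25) round up to 0.3 rather than down.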
4	The 2018 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2018;46:D1-D7. [PMID: 29316735 PMCID: PMC5753253 DOI: 10.1093/nar/gkx1235] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Received: 11/28/2017] [Accepted: 11/29/2017] [Indexed: 12/20/2022] Abstract: The 2018 Nucleic Acids Research Database Issue contains 181 papers spanning molecular biology. Among them, 82 are new and 84 are updates describing resources that appeared in the Issue previously. The remaining 15 cover databases most recently published elsewhere. Databases in the area of nucleic acids include 3DIV for visualisation of data on genome 3D structure and RNArchitecture, a hierarchical classification of RNA families. Protein databases include the established SMART, ELM and MEROPS while GPCRdb and the newcomer STCRDab cover families of biomedical interest. In the area of metabolism, HMDB and Reactome both report new features while PULDB appears in NAR for the first time. This issue also contains reports on genomics resources including Ensembl, the UCSC Genome Browser and ENCODE. Update papers from the IUPHAR/BPS Guide to Pharmacology and DrugBank are highlights of the drug and drug target section while a number of proteomics databases including proteomicsDB are also covered. The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). The NAR online Molecular Biology Database Collection has been updated, reviewing 138 entries, adding 88 new resources and eliminating 47 discontinued URLs, bringing the current total to 1737 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
MeSH Headings: Animals; Computational Biology; Databases, Nucleic Acid/trends; Databases, Protein; Genomics; Humans; Internet; Molecular Biology; Proteomics.
5	The UNITE database for molecular identification of fungi--recent updates and future perspectives. New Phytologist 2010;186:281-5. [PMID: 20409185 DOI: 10.1111/j.1469-8137.2009.03160.x] [Citation(s) in RCA: 972] [Impact Index Per Article: 69.4] [Indexed: 05/20/2023]
MeSH Headings: Base Sequence; Databases, Nucleic Acid/statistics & numerical data; Databases, Nucleic Acid/trends; Fungi/classification; Fungi/genetics; Information Storage and Retrieval; International Cooperation; Sequence Analysis, DNA.
6	Data sharing: making headway in a competitive research milieu. Ann Neurol 2008;64:A13-6. [PMID: 18668610 DOI: 10.1002/ana.21478] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/11/2022]
MeSH Headings: Access to Information/ethics; Access to Information/legislation & jurisprudence; Animals; Biomedical Research/economics; Biomedical Research/ethics; Biomedical Research/trends; Computer Security/economics; Computer Security/ethics; Computer Security/trends; Databases as Topic/economics; Databases as Topic/ethics; Databases as Topic/trends; Databases, Nucleic Acid/economics; Databases, Nucleic Acid/ethics; Databases, Nucleic Acid/trends; Human Genome Project/economics; Human Genome Project/ethics; Human Genome Project/legislation & jurisprudence; Humans; National Institutes of Health (U.S.)/economics; National Institutes of Health (U.S.)/ethics; National Institutes of Health (U.S.)/legislation & jurisprudence; Peer Review, Research/ethics; Peer Review, Research/standards; Peer Review, Research/trends; Periodicals as Topic/ethics; Private Sector/economics; Private Sector/ethics; Private Sector/trends; Research Support as Topic/economics; Research Support as Topic/ethics; Research Support as Topic/trends; United States.
7	[International collaboration among DDBJ, EMBL Bank and GenBank]. Tanpakushitsu Kakusan Koso (Protein, Nucleic Acid, Enzyme) 2008;53:182-189. [PMID: 18240597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/25/2023]
MeSH Headings: Computational Biology; Databases, Nucleic Acid/trends; Genome/genetics; Humans; International Cooperation.
8	EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res 2006;35:D16-20. [PMID: 17148479 PMCID: PMC1897316 DOI: 10.1093/nar/gkl913] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.3] [Indexed: 11/12/2022] Abstract: The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl) at the EMBL European Bioinformatics Institute, UK, offers a large and freely accessible collection of nucleotide sequences and accompanying annotation. The database is maintained in collaboration with DDBJ and GenBank. Data are exchanged between the collaborating databases on a daily basis to achieve optimal synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation, alignments and bulk data. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. In 2006, the volume of data has continued to grow exponentially. Access to the data is provided via SRS, ftp and a variety of other methods. Extensive external and internal cross-references enable users to search for related information across other databases and within the database. All available resources can be accessed via the EBI home page at http://www.ebi.ac.uk/. Changes over the past year include changes to the file format, further development of the EMBLCDS dataset and developments to the XML format.
MeSH Headings: Base Sequence; Databases, Nucleic Acid/trends; Internet; User-Computer Interface.
9	From hype to mothballs in four years: troubles in the development of large-scale DNA biobanks in Europe. Public Health Genomics 2006;9:184-9. [PMID: 16741348 DOI: 10.1159/000092655] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Indexed: 11/19/2022] Abstract: This paper analyses the difficulties experienced by three large European DNA biobanks. The first, Icelandic-based deCode, generated immense commercial interest and intense ethical controversy. As a biotechnology company, deCode succeeded, but the Icelandic Health Sector Data Base failed. The second firm, Swedish UmanGenomics, marketed itself as the 'ethical' biotech company. Management problems including the inadequate recognition of intellectual property issues led to the company failing to secure adequate investment. The third and largest, UK Biobank, has, as a non-profit organization, not experienced these problems. But when the product, bio information, is marketed, the issue of ethically acceptable purchasers could well become contentious.
MeSH Headings: Computational Biology; Databases, Genetic/trends; Databases, Nucleic Acid/trends; Europe; Humans; Pharmacogenetics.
10	Biobanks and registries for HSCT research: potential for future individualized medicine. Int J Immunogenet 2006;33:153-4. [PMID: 16712643 DOI: 10.1111/j.1744-313x.2006.00592.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Indexed: 11/27/2022]
MeSH Headings: Age Factors; Autoimmune Diseases/genetics; Autoimmune Diseases/immunology; Autoimmune Diseases/therapy; Blood Banks/legislation & jurisprudence; Blood Banks/trends; Databases, Nucleic Acid/legislation & jurisprudence; Databases, Nucleic Acid/trends; Europe; Female; Graft vs Host Disease/genetics; Graft vs Host Disease/immunology; Graft vs Host Disease/prevention & control; Hematopoietic Stem Cell Transplantation/trends; Humans; Male; Neoplasms/genetics; Neoplasms/immunology; Neoplasms/therapy; Polymorphism, Single Nucleotide; Promoter Regions, Genetic/genetics; Promoter Regions, Genetic/immunology; Registries; Specimen Handling; Transplantation, Homologous.
11	Pointing the way. Int J Immunogenet 2006;33:151. [PMID: 16712642 DOI: 10.1111/j.1744-313x.2006.00601.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]
MeSH Headings: Autoimmune Diseases/genetics; Autoimmune Diseases/immunology; DNA Mutational Analysis/instrumentation; DNA Mutational Analysis/methods; DNA Mutational Analysis/trends; Databases, Nucleic Acid/trends; Female; Genetic Testing/methods; Genetic Testing/trends; Genome, Human/genetics; Genome, Human/immunology; Humans; Male; Periodicals as Topic/trends; Quantitative Trait, Heritable.
12	Pokemon expression in malignant glioma: an application of bioinformatics methods. Neurosurg Focus 2005;19:E8. [PMID: 16241110 DOI: 10.3171/foc.2005.19.4.9] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Indexed: 11/06/2022] Abstract: OBJECT: In this report the authors review the role of bioinformatics in the design of a research project in which the molecular genetics of malignant glioma were studied. A project to characterize Pokemon expression in malignant glioma was developed, refined, and implemented using bioinformatics methods. METHODS: Using the resources available from the National Center for Biotechnology Information, the messenger RNA (mRNA) sequence for Pokemon was determined. With this information and online primer design tools, novel primers were designed that would specifically amplify Pokemon mRNA by using reverse transcription-polymerase chain reaction assays. CONCLUSIONS: The promise of bioinformatics is in the rapid and widespread dissemination and analysis of genomic information. This information is then used in research investigating the genetic basis of disease. In this paper the authors review the bioinformatics methods used in their study of Pokemon expression in malignant glioma.
MeSH Headings: Base Sequence/genetics; Brain Neoplasms/diagnosis; Brain Neoplasms/genetics; Cell Transformation, Neoplastic/genetics; Chromosomes, Human, Pair 19/genetics; Computational Biology/methods; Computational Biology/trends; DNA Mutational Analysis; DNA Primers/genetics; DNA-Binding Proteins/analysis; DNA-Binding Proteins/genetics; Databases, Nucleic Acid/trends; Gene Expression Regulation, Neoplastic/genetics; Gene Silencing/physiology; Genetic Markers/genetics; Genomic Library; Glioma/diagnosis; Glioma/genetics; Humans; RNA, Messenger/analysis; RNA, Messenger/genetics; Repressor Proteins/analysis; Repressor Proteins/genetics; Reverse Transcriptase Polymerase Chain Reaction; Transcription Factors/analysis; Transcription Factors/genetics; Tumor Suppressor Protein p14ARF/genetics.
13	Nucleic acid sequence data turns 100,000,000,000 and looks to the future. J Clin Invest 2005;115:2588. [PMID: 16200189 PMCID: PMC1236709 DOI: 10.1172/jci26755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/17/2022]
MeSH Headings: Animals; Base Sequence/genetics; Databases, Nucleic Acid/trends; Humans; Molecular Sequence Data; Sequence Analysis, DNA/methods; Sequence Analysis, DNA/trends.
14	DDBJ in collaboration with mass-sequencing teams on annotation. Nucleic Acids Res 2005;33:D25-8. [PMID: 15608189 PMCID: PMC539974 DOI: 10.1093/nar/gki020] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.7] [Indexed: 11/13/2022] Abstract: In the past year, we at DDBJ (DNA Data Bank of Japan; http://www.ddbj.nig.ac.jp) collected and released 1 066 084 entries or 718 072 425 bases including the whole chromosome 22 of chimpanzee, the whole-genome shotgun sequences of silkworm and various others. On the other hand, we hosted workshops for human full-length cDNA annotation and participated in jamborees of mouse full-length cDNA annotation. The annotated data are made public at DDBJ. We are also in collaboration with a RIKEN team to accept and release the CAGE (Cap Analysis Gene Expression) data under a new category, MGA (Mass Sequences for Genome Annotation). The data will be useful for studying gene expression control in many aspects.
MeSH Headings: Animals; Cooperative Behavior; Databases, Nucleic Acid/trends; Gene Expression; Genome; Genomics; Humans; Internet; Sequence Analysis, DNA.
15	The EMBL Nucleotide Sequence Database. Nucleic Acids Res 2005;33:D29-33. [PMID: 15608199 PMCID: PMC540052 DOI: 10.1093/nar/gki098] [Citation(s) in RCA: 171] [Impact Index Per Article: 9.0] [Indexed: 11/14/2022] Abstract: The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl), maintained at the European Bioinformatics Institute (EBI) near Cambridge, UK, is a comprehensive collection of nucleotide sequences and annotation from available public sources. The database is part of an international collaboration with DDBJ (Japan) and GenBank (USA). Data are exchanged daily between the collaborating institutes to achieve swift synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation (TPA) and alignments. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. New and updated data records are distributed daily and the whole EMBL Nucleotide Sequence Database is released four times a year. Access to the sequence data is provided via ftp and several WWW interfaces. With the web-based Sequence Retrieval System (SRS) it is also possible to link nucleotide data to other specialist molecular biology databases maintained at the EBI. Other tools are available for sequence similarity searching (e.g. FASTA and BLAST). Changes over the past year include the removal of the sequence length limit, the launch of the EMBLCDSs dataset, extension of the Sequence Version Archive functionality and the revision of quality rules for TPA data.
MeSH Headings: Base Sequence; Databases, Nucleic Acid/trends; Internet; User-Computer Interface.
16	Textmining in support of knowledge discovery for vaccine development. Methods 2005;34:488-95. [PMID: 15542375 DOI: 10.1016/j.ymeth.2004.06.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Accepted: 06/21/2004] [Indexed: 11/25/2022] Abstract: Complete genome data of infectious microorganisms permit systematic computational sequence-based predictions and experimental testing of candidate vaccine epitopes. Both predictions and the interpretation of experiments rely on existing information in the literature which is mostly manually extracted and curated. The growing amount of data and literature information has created a major bottleneck for the interpretation of results and maintenance of curated databases. The lack of suitable free-text information extraction, processing, and reporting tools prompted us to develop a knowledge discovery support system that enhances the understanding of immune response and vaccine development. The current prototype system, Gene expression/epitopes/protein interaction (GEpi), focuses on molecular functions of HIV-infected T-cells and HIV epitope information, using textmining, and interrelation of biomolecular data from domain-specific databases with MEDLINE abstract-inferred information. Results showed that extraction and processing of molecular interaction, disease associations, and gene ontology-derived functional information generate intuitive knowledge reports that aid the interpretation of host-pathogen interaction. In contrast, epitope (word and sequence) information in MEDLINE abstracts is surprisingly sparse and often lacks necessary context information, such as HLA-restriction. Since the majority of epitope information is found in tables, figures, and legends of full-text articles, its extraction may not require sophisticated natural language processing techniques. Support of vaccine development through textmining therefore requires the timely development of domain-specific extraction rules for full-text articles, and a knowledge model for epitope-related information.
MeSH Headings: Animals; Computational Biology/methods; Computational Biology/trends; Database Management Systems; Databases, Nucleic Acid/trends; Humans; Vaccines/chemical synthesis; Vaccines/genetics; Vaccines/therapeutic use.
17	Genomic resources for ascidians: sequence/expression databases and genome projects. Methods Cell Biol 2005;74:759-74. [PMID: 15575630 DOI: 10.1016/s0091-679x(04)74031-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/01/2023]
MeSH Headings: Animals; Ciona intestinalis/embryology; Ciona intestinalis/genetics; Databases, Nucleic Acid/trends; Gene Expression Regulation, Developmental/genetics; Gene Library; Genomic Library; Oligonucleotides, Antisense/genetics; Urochordata/embryology; Urochordata/genetics.
18	RAG: RNA-As-Graphs web resource. BMC Bioinformatics 2004;5:88. [PMID: 15238163 PMCID: PMC471545 DOI: 10.1186/1471-2105-5-88] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Received: 03/18/2004] [Accepted: 07/06/2004] [Indexed: 11/10/2022] Abstract: BACKGROUND: The proliferation of structural and functional studies of RNA has revealed an increasing range of RNA's structural repertoire. Toward the objective of systematic cataloguing of RNA's structural repertoire, we have recently described the basis of a graphical approach for organizing RNA secondary structures, including existing and hypothetical motifs. DESCRIPTION: We now present an RNA motif database based on graph theory, termed RAG for RNA-As-Graphs, to catalogue and rank all theoretically possible, including existing, candidate and hypothetical, RNA secondary motifs. The candidate motifs are predicted using a clustering algorithm that classifies RNA graphs into RNA-like and non-RNA groups. All RNA motifs are filed according to their graph vertex number (RNA length) and ranked by topological complexity. CONCLUSIONS: RAG's quantitative cataloguing allows facile retrieval of all classes of RNA secondary motifs, assists identification of structural and functional properties of user-supplied RNA sequences, and helps stimulate the search for novel RNAs based on predicted candidate motifs.
MeSH Headings: Computational Biology/methods; Computer Graphics/trends; Databases, Nucleic Acid/trends; Internet/trends; RNA/chemistry; Software; Software Design.
19	A plea for “Omics” research in complex diseases such as multiple sclerosis--a change of mind is needed. J Neurol Sci 2004;222:3-5. [PMID: 15240188 DOI: 10.1016/j.jns.2004.02.013] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Received: 02/24/2004] [Accepted: 02/24/2004] [Indexed: 11/26/2022]
MeSH Headings: Biomedical Research/methods; Biomedical Research/standards; Computational Biology/standards; Computational Biology/trends; Databases, Nucleic Acid/standards; Databases, Nucleic Acid/trends; Drug Evaluation, Preclinical/methods; Drug Evaluation, Preclinical/standards; Genomics/methods; Genomics/standards; Humans; Meta-Analysis as Topic; Multiple Sclerosis/drug therapy; Multiple Sclerosis/genetics; Multiple Sclerosis/metabolism; Proteomics/methods; Proteomics/standards.
20	Carpe diem. Retooling the publish or perish model into the share and survive model. Plant Physiology 2004;134:543-7. [PMID: 14966244 PMCID: PMC523887 DOI: 10.1104/pp.103.035907] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Received: 11/06/2003] [Revised: 11/14/2003] [Accepted: 11/14/2003] [Indexed: 05/18/2023]
MeSH Headings: Community-Institutional Relations/trends; Databases, Nucleic Acid/trends; International Agencies/trends; Internet/trends; Peer Review, Research/trends; Periodicals as Topic/trends; Plants/genetics; Technology Transfer; United States.
21	The Impact of Structural Genomics on the Protein Data Bank. Am J Pharmacogenomics 2004;4:247-52. [PMID: 15287818 DOI: 10.2165/00129785-200404040-00004] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Indexed: 11/02/2022] Abstract: The advent of structural genomics presents new challenges to the archive of biomacromolecular structures--the Protein Data Bank (PDB). As technologies involved in structure determination have advanced, both the number and size of structures available in the PDB have increased rapidly. The structural genomics initiatives are creating a large amount of data that needs to be tracked, archived, and made easily available. The PDB has developed tools to facilitate the rapid deposition of data produced by the structural genomics initiatives and has created databases to track the progress of the work.
MeSH Headings: Animals; Database Management Systems; Databases, Nucleic Acid/trends; Genomics/trends; Human Genome Project; Humans.
22	Genomic Resources for the Study of Sea Urchin Development. Methods Cell Biol 2004;74:733-57. [PMID: 15575629 DOI: 10.1016/s0091-679x(04)74030-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 05/01/2023]
MeSH Headings: Animals; Databases, Nucleic Acid/trends; Embryo, Nonmammalian/metabolism; Gene Expression Regulation, Developmental/genetics; Gene Library; Genome; Internet/trends; Molecular Biology/methods; Oligonucleotide Array Sequence Analysis; Sea Urchins/embryology; Sea Urchins/genetics. Grants: GM-61005 NIGMS NIH HHS; HD-37105 NICHD NIH HHS; RR-06591 NCRR NIH HHS.
23	The COG database: an updated version includes eukaryotes. BMC Bioinformatics 2003;4:41. [PMID: 12969510 PMCID: PMC222959 DOI: 10.1186/1471-2105-4-41] [Citation(s) in RCA: 3202] [Impact Index Per Article: 152.5] [Received: 05/20/2003] [Accepted: 09/11/2003] [Indexed: 11/10/2022] Abstract: BACKGROUND: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. RESULTS: We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. CONCLUSION: The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.
MeSH Headings: Animals; Databases, Nucleic Acid/trends; Databases, Protein/trends; Eukaryotic Cells/chemistry; Eukaryotic Cells/physiology; Evolution, Molecular; Humans; National Institutes of Health (U.S.); Proteins/classification; Proteins/genetics; Proteins/physiology; Terminology as Topic; United States.
24	MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences. BMC Bioinformatics 2003;4:29. [PMID: 12854978 PMCID: PMC166169 DOI: 10.1186/1471-2105-4-29] [Citation(s) in RCA: 649] [Impact Index Per Article: 30.9] [Received: 05/09/2003] [Accepted: 07/10/2003] [Indexed: 12/03/2022] Abstract: BACKGROUND: The rapid increase in the amount of protein and DNA sequence information available has become almost overwhelming to researchers. So much information is now accessible that high-quality, functional gene analysis and categorization has become a major goal for many laboratories. To aid in this categorization, there is a need for non-commercial software that is able to both align sequences and also calculate pairwise levels of similarity/identity. RESULTS: We have developed MatGAT (Matrix Global Alignment Tool), a simple, easy to use computer application that generates similarity/identity matrices for DNA or protein sequences without needing pre-alignment of the data. CONCLUSIONS: The advantages of this program over other software are that it is open-source freeware, can analyze a large number of sequences simultaneously, can visualize both sequence alignment and similarity/identity values concurrently, employs global alignment in calculations, and has been formatted to run under both the Unix and the Microsoft Windows Operating Systems. We are presently completing the Macintosh-based version of the program. Key Words: nucleic acid protein sequence alignment pairwise analysis similarity matrix. MESH Headings: Animals; Computational Biology/methods; Computational Biology/trends; Computer Graphics/trends; Databases, Nucleic Acid/trends; Humans; Internet; Sequence Alignment/trends; Sequence Analysis, DNA/methods; Sequence Analysis, Protein/methods; Sequence Homology, Amino Acid; Sequence Homology, Nucleic Acid; Software/trends.
25	The nucleic acid database. Methods Biochem Anal 2003;44:199-216. [PMID: 12688301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/01/2023] MESH Headings: Computational Biology; Data Display; Databases, Nucleic Acid/history; Databases, Nucleic Acid/trends; Electronic Data Processing; History, 20th Century; History, 21st Century; Models, Molecular; Molecular Structure; Nucleic Acid Conformation; Nucleic Acids/chemistry.
26	Other structure-based databases. Methods Biochem Anal 2003;44:217-36. [PMID: 12647388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/01/2023] MESH Headings: Computational Biology; Crystallography, X-Ray; Databases, Nucleic Acid/trends; Databases, Protein/trends; Genomics; Molecular Structure; Proteins/chemistry.
27	Big gene banks: nuggets for drug discovery or fool's gold? Drug Discov Today 2003;8:100-1. [PMID: 12568772 DOI: 10.1016/s1359-6446(02)02586-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 10/27/2022] Abstract: Gene-mapping studies that look for complex traits among human populations have deepened our understanding of disease causes, but do they hold promise for identifying drug targets? MESH Headings: Databases, Nucleic Acid/trends; Humans; Pharmacogenetics/methods; Pharmacogenetics/trends; Technology, Pharmaceutical/methods; Technology, Pharmaceutical/trends.
28	SeqVISTA: a graphical tool for sequence feature visualization and comparison. BMC Bioinformatics 2003;4:1. [PMID: 12513700 PMCID: PMC140037 DOI: 10.1186/1471-2105-4-1] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.1] [Received: 09/15/2002] [Accepted: 01/04/2003] [Indexed: 11/12/2022] Abstract: BACKGROUND: Many readers will sympathize with the following story. You are viewing a gene sequence in Entrez, and you want to find whether it contains a particular sequence motif. You reach for the browser's "find in page" button, but those darn spaces every 10 bp get in the way. And what if the motif is on the opposite strand? Subsequently, your favorite sequence analysis software informs you that there is an interesting feature at position 13982-14013. By painstakingly counting the 10 bp blocks, you are able to examine the sequence at this location. But now you want to see what other features have been annotated close by, and this information is buried several screenfuls higher up the web page. RESULTS: SeqVISTA presents a holistic, graphical view of features annotated on nucleotide or protein sequences. This interactive tool highlights the residues in the sequence that correspond to features chosen by the user, and allows easy searching for sequence motifs or extraction of particular subsequences. SeqVISTA is able to display results from diverse sequence analysis tools in an integrated fashion, and aims to provide much-needed unity to the bioinformatics resources scattered around the Internet. Our viewer may be launched on a GenBank record by a single click of a button installed in the web browser. CONCLUSION: SeqVISTA allows insights to be gained by viewing the totality of sequence annotations and predictions, which may be more revealing than the sum of their parts. SeqVISTA runs on any operating system with a Java 1.4 virtual machine. It is freely available to academic users at http://zlab.bu.edu/SeqVISTA. MESH Headings: Amino Acid Sequence; Base Sequence; Computational Biology/methods; Computer Graphics; Databases, Nucleic Acid/trends; Genome, Viral; Humans; Molecular Sequence Data; Sequence Alignment/methods; Simian virus 40/genetics; Software/classification; Software/standards; User-Computer Interface. Grants: P20 GM066401 NIGMS NIH HHS; IP20GM066401-01 NIGMS NIH HHS.
29	The EMBL Nucleotide Sequence Database: major new developments. Nucleic Acids Res 2003;31:17-22. [PMID: 12519939 PMCID: PMC165468 DOI: 10.1093/nar/gkg021] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.6] [Indexed: 11/13/2022] Abstract: The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) incorporates, organizes and distributes nucleotide sequences from all available public sources. The database is located and maintained at the European Bioinformatics Institute (EBI) near Cambridge, UK. In an international collaboration with DDBJ (Japan) and GenBank (USA), data are exchanged amongst the collaborating databases on a daily basis to achieve optimal synchronization. Webin is the preferred web-based submission system for individual submitters, while automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via FTP, Email and World Wide Web interfaces. EBI's Sequence Retrieval System (SRS) integrates and links the main nucleotide and protein databases plus many other specialized molecular biology databases. For sequence similarity searching, a variety of tools (e.g. Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT. All resources can be accessed via the EBI home page at http://www.ebi.ac.uk. MESH Headings: Animals; Base Sequence; Data Collection; Databases, Nucleic Acid/trends; Genomics; Information Storage and Retrieval; Internet; Sequence Analysis, DNA.
30	DNA testing for all. Nature 2002;418:585-6. [PMID: 12167832 DOI: 10.1038/418585a] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Indexed: 11/08/2022] Key Words: Genetics and Reproduction; Legal Approach. MESH Headings: Accreditation; Civil Rights/standards; Crime/legislation & jurisprudence; DNA Fingerprinting/standards; DNA Fingerprinting/trends; Databases, Nucleic Acid/standards; Databases, Nucleic Acid/trends; Fear; Genetic Privacy/standards; Genetic Testing/standards; Genetic Testing/trends; Humans; Male; Polymerase Chain Reaction/standards.
31	The future of publishing microarray data. Brief Bioinform 2001;2:316-8. [PMID: 11808743 DOI: 10.1093/bib/2.4.316] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Indexed: 11/12/2022] MESH Headings: Databases, Nucleic Acid/standards; Databases, Nucleic Acid/trends; Forecasting; Oligonucleotide Array Sequence Analysis/trends; Publishing.
32	Exploiting big biology: integrating large-scale biological data for function inference. Brief Bioinform 2001;2:363-74. [PMID: 11808748 DOI: 10.1093/bib/2.4.363] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.3] [Indexed: 11/14/2022] Abstract: The amount of data produced by molecular biologists is growing at an exponential rate. Some of the fastest growing sets of data are measurements of gene expression, comparable in quantity only to gene sequences and the vast biological literature. Both gene expression data and sequence data offer hints as to the functions of thousands of newly discovered genes, but neither give complete answers. Therefore, much effort is being focused on integrating these large data sets and combining them with all available functional data to draw inferences about the functions of uncharacterised genes. This review discusses the most pertinent functional data for genome-wide functional inference and describes several methods by which these disparate data types are being integrated. MESH Headings: Data Collection; Data Interpretation, Statistical; Databases, Nucleic Acid/trends; Databases, Protein/trends; Genomics/trends; Information Storage and Retrieval; Sequence Analysis, DNA.
33	Risking ethical insolvency: a survey of trends in criminal DNA databanking. J Law Med Ethics 2000;28:209-223. [PMID: 11210371 DOI: 10.1111/j.1748-720x.2000.tb00661.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Indexed: 05/23/2023] Abstract: Over ten years have elapsed since Virginia passed the nation's first criminal DNA banking law, which authorized law enforcement authorities to collect DNA samples from certain categories of offenders for the purposes of performing profile analysis. Within nine years, Rhode Island became the fiftieth state to enact a similar statute. The passage of a decade since the first enactment provides a convenient opportunity to assess the strengths and weaknesses of ethical safeguards under present law as well as predict the likely direction of future developments. DNA forensics are merely the latest in a long line of biologically based identifying law enforcement technologies that include fingerprints and serotyping. Nevertheless, DNA has properties that make it significantly different than its predecessors with respect to the ethical and social concerns it raises. Key Words: Genetics and Reproduction; Legal Approach. MESH Headings: Canada; Confidentiality/legislation & jurisprudence; Criminal Law; DNA Fingerprinting/statistics & numerical data; DNA Fingerprinting/trends; Databases, Nucleic Acid/legislation & jurisprudence; Databases, Nucleic Acid/trends; England; Ethics, Medical; Government Regulation; Humans; Internationality; Juvenile Delinquency/legislation & jurisprudence; State Government; Technology Assessment, Biomedical; United States.