1
|
Gal-Oz ST, Haiat N, Eliyahu D, Shani G, Shay T. DoChaP: the domain change presenter. Nucleic Acids Res 2021; 49:W162-W168. [PMID: 33988713 PMCID: PMC8262731 DOI: 10.1093/nar/gkab357] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 04/18/2021] [Accepted: 04/23/2021] [Indexed: 12/18/2022] Open
Abstract
Alternative splicing results in multiple transcripts of the same gene, possibly encoding for different protein isoforms with different domains. Whereas it is possible to manually determine the effect of alternative splicing on the domain composition for a single event, the process requires the tedious integration of several data sources; it is error prone and not feasible for genome-wide characterization of domains affected by differential splicing. To fulfill the need for an automated solution, we developed the Domain Change Presenter (DoChaP, https://dochap.bgu.ac.il/), a web server for the visualization of exon-domain associations. DoChaP visualizes all transcripts of a given gene, the encoded proteins and their domains, and enables a comparison between the transcripts and between their protein products. The colors and organization make the structural effect of alternative splicing events on protein structures easily identified. To enable the study of the conservation of exons structure, alternative splicing, and the effect of alternative splicing on protein domains, DoChaP also provides a two-species comparison of exon-domain associations. DoChaP thus provides a unique and easy-to-use visualization of the exon-domain association and conservation, and will facilitate the study of the structural effects of alternative splicing in health and disease.
Collapse
Affiliation(s)
- Shani T Gal-Oz
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| | - Nimrod Haiat
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| | - Dana Eliyahu
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| | - Guy Shani
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| | - Tal Shay
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel.,Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel
| |
Collapse
|
2
|
Moyer DC, Larue GE, Hershberger CE, Roy SW, Padgett RA. Comprehensive database and evolutionary dynamics of U12-type introns. Nucleic Acids Res 2020; 48:7066-7078. [PMID: 32484558 PMCID: PMC7367187 DOI: 10.1093/nar/gkaa464] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/19/2020] [Accepted: 05/20/2020] [Indexed: 12/16/2022] Open
Abstract
During nuclear maturation of most eukaryotic pre-messenger RNAs and long non-coding RNAs, introns are removed through the process of RNA splicing. Different classes of introns are excised by the U2-type or the U12-type spliceosomes, large complexes of small nuclear ribonucleoprotein particles and associated proteins. We created intronIC, a program for assigning intron class to all introns in a given genome, and used it on 24 eukaryotic genomes to create the Intron Annotation and Orthology Database (IAOD). We then used the data in the IAOD to revisit several hypotheses concerning the evolution of the two classes of spliceosomal introns, finding support for the class conversion model explaining the low abundance of U12-type introns in modern genomes.
Collapse
Affiliation(s)
- Devlin C Moyer
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Graham E Larue
- Department of Molecular and Cell Biology, University of California, Merced, Merced, CA 95343, USA
| | - Courtney E Hershberger
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Scott W Roy
- Department of Biology, San Francisco State University, San Francisco, CA 94132, USA
| | - Richard A Padgett
- Department of Cardiovascular and Metabolic Sciences, Lerner Research Institute, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic and Department of Molecular Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
3
|
Medvedeva IV, Demenkov PS, Ivanisenko VA. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa. BMC Genomics 2015; 16 Suppl 13:S2. [PMID: 26693737 PMCID: PMC4686782 DOI: 10.1186/1471-2164-16-s13-s2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. RESULTS One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. CONCLUSIONS These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Collapse
|
4
|
Piwowar M, Krzysztof P, Piotr P. ExonVisualiser - application for visualization exon units in 2D and 3D protein structures. Bioinformation 2012; 8:1280-2. [PMID: 23275735 PMCID: PMC3532015 DOI: 10.6026/97320630081280] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2012] [Accepted: 11/14/2012] [Indexed: 11/23/2022] Open
Abstract
UNLABELLED The web application oriented on identification and visualization of protein regions encoded by exons is presented. The Exon Visualiser can be used for visualisation on different levels of protein structure: at the primary (sequence) level and secondary structures level, as well as at the level of tertiary protein structure. The programme is suitable for processing data for all genes which have protein expressions deposited in the PDB database. The procedure steps implemented in the application: I) loading exons sequences and theirs coordinates from GenBank file as well as protein sequences: CDS from GenBank and aminoacid sequence from PDB II) consensus sequence creation (comparing amino acid sequences form PDB file with the CDS sequence from GenBank file) III) matching exon coordinates IV) visualisation in 2D and 3D protein structures. Presented web-tool among others provides the color-coded graphical display of protein sequences and chains in three dimensional protein structures which are correlated with the corresponding exons. AVAILABILITY http://149.156.12.53/ExonVisualiser/
Collapse
Affiliation(s)
- Monika Piwowar
- Department of Bioinformatics and Telemedicine, Collegium Medicum, Jagiellonian University, Lazarza 16, 31-530 Krakow, Poland
| | - Porembski Krzysztof
- Department of Bioinformatics and Telemedicine, Collegium Medicum, Jagiellonian University, Lazarza 16, 31-530 Krakow, Poland
| | - Piwowar Piotr
- Department of Measurement and Electronics, AGH University of Science and Technology, al. A. Mickiewicza 30, 30-059 Krakow, Poland
| |
Collapse
|
5
|
Fernández-Suárez XM, Galperin MY. The 2013 Nucleic Acids Research Database Issue and the online molecular biology database collection. Nucleic Acids Res 2012. [PMID: 23203983 PMCID: PMC3531151 DOI: 10.1093/nar/gks1297] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The 20th annual Database Issue of Nucleic Acids Research includes 176 articles, half of which describe new online molecular biology databases and the other half provide updates on the databases previously featured in NAR and other journals. This year’s highlights include two databases of DNA repeat elements; several databases of transcriptional factors and transcriptional factor-binding sites; databases on various aspects of protein structure and protein–protein interactions; databases for metagenomic and rRNA sequence analysis; and four databases specifically dedicated to Escherichia coli. The increased emphasis on using the genome data to improve human health is reflected in the development of the databases of genomic structural variation (NCBI’s dbVar and EBI’s DGVa), the NIH Genetic Testing Registry and several other databases centered on the genetic basis of human disease, potential drugs, their targets and the mechanisms of protein–ligand binding. Two new databases present genomic and RNAseq data for monkeys, providing wealth of data on our closest relatives for comparative genomics purposes. The NAR online Molecular Biology Database Collection, available at http://www.oxfordjournals.org/nar/database/a/, has been updated and currently lists 1512 online databases. The full content of the Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/).
Collapse
|
6
|
Wang Y, You FM, Lazo GR, Luo MC, Thilmony R, Gordon S, Kianian SF, Gu YQ. PIECE: a database for plant gene structure comparison and evolution. Nucleic Acids Res 2012. [PMID: 23180792 PMCID: PMC3531150 DOI: 10.1093/nar/gks1109] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Gene families often show degrees of differences in terms of exon–intron structures depending on their distinct evolutionary histories. Comparative analysis of gene structures is important for understanding their evolutionary and functional relationships within plant species. Here, we present a comparative genomics database named PIECE (http://wheat.pw.usda.gov/piece) for Plant Intron and Exon Comparison and Evolution studies. The database contains all the annotated genes extracted from 25 sequenced plant genomes. These genes were classified based on Pfam motifs. Phylogenetic trees were pre-constructed for each gene category. PIECE provides a user-friendly interface for different types of searches and a graphical viewer for displaying a gene structure pattern diagram linked to the resulting bootstrapped dendrogram for each gene family. The gene structure evolution of orthologous gene groups was determined using the GLOOME, Exalign and GECA software programs that can be accessed within the database. PIECE also provides a web server version of the software, GSDraw, for drawing schematic diagrams of gene structures. PIECE is a powerful tool for comparing gene sequences and provides valuable insights into the evolution of gene structure in plant genomes.
Collapse
Affiliation(s)
- Yi Wang
- USDA-Agriculture Research Service, Western Regional Research Center, Albany, CA 94710, USA
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Medvedeva I, Demenkov P, Kolchanov N, Ivanisenko V. SitEx: a computer system for analysis of projections of protein functional sites on eukaryotic genes. Nucleic Acids Res 2011; 40:D278-83. [PMID: 22139920 PMCID: PMC3245165 DOI: 10.1093/nar/gkr1187] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Search of interrelationships between the structural-functional protein organization and exon structure of encoding gene provides insights into issues concerned with the function, origin and evolution of genes and proteins. The functions of proteins and their domains are defined mostly by functional sites. The relation of the exon-intron structure of the gene to the protein functional sites has been little studied. Development of resources containing data on projections of protein functional sites on eukaryotic genes is needed. We have developed SitEx, a database that contains information on functional site amino acid positions in the exon structure of encoding gene. SitEx is integrated with the BLAST and 3DExonScan programs. BLAST is used for searching sequence similarity between the query protein and polypeptides encoded by single exons stored in SitEx. The 3DExonScan program is used for searching for structural similarity of the given protein with these polypeptides using superimpositions. The developed computer system allows users to analyze the coding features of functional sites by taking into account the exon structure of the gene, to detect the exons involved in shuffling in protein evolution, also to design protein-engineering experiments. SitEx is accessible at http://www-bionet.sscc.ru/sitex/. Currently, it contains information about 9994 functional sites presented in 2021 proteins described in proteomes of 17 organisms.
Collapse
Affiliation(s)
- Irina Medvedeva
- Computer Proteomics Laboratory, Institute of Cytology and Genetics SB RAS, 10 Lavrentyeva Avenue, 630090 Novosibirsk, Russia
| | | | | | | |
Collapse
|
8
|
Bhasi A, Senalik D, Simon PW, Kumar B, Manikandan V, Philip P, Senapathy P. RoBuST: an integrated genomics resource for the root and bulb crop families Apiaceae and Alliaceae. BMC PLANT BIOLOGY 2010; 10:161. [PMID: 20691054 PMCID: PMC3017783 DOI: 10.1186/1471-2229-10-161] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2010] [Accepted: 08/06/2010] [Indexed: 05/29/2023]
Abstract
BACKGROUND Root and bulb vegetables (RBV) include carrots, celeriac (root celery), parsnips (Apiaceae), onions, garlic, and leek (Alliaceae)--food crops grown globally and consumed worldwide. Few data analysis platforms are currently available where data collection, annotation and integration initiatives are focused on RBV plant groups. Scientists working on RBV include breeders, geneticists, taxonomists, plant pathologists, and plant physiologists who use genomic data for a wide range of activities including the development of molecular genetic maps, delineation of taxonomic relationships, and investigation of molecular aspects of gene expression in biochemical pathways and disease responses. With genomic data coming from such diverse areas of plant science, availability of a community resource focused on these RBV data types would be of great interest to this scientific community. DESCRIPTION The RoBuST database has been developed to initiate a platform for collecting and organizing genomic information useful for RBV researchers. The current release of RoBuST contains genomics data for 294 Alliaceae and 816 Apiaceae plant species and has the following features: (1) comprehensive sequence annotations of 3663 genes 5959 RNAs, 22,723 ESTs and 11,438 regulatory sequence elements from Apiaceae and Alliaceae plant families; (2) graphical tools for visualization and analysis of sequence data; (3) access to traits, biosynthetic pathways, genetic linkage maps and molecular taxonomy data associated with Alliaceae and Apiaceae plants; and (4) comprehensive plant splice signal repository of 659,369 splice signals collected from 6015 plant species for comparative analysis of plant splicing patterns. CONCLUSIONS RoBuST, available at http://robust.genome.com, provides an integrated platform for researchers to effortlessly explore and analyze genomic data associated with root and bulb vegetables.
Collapse
Affiliation(s)
- Ashwini Bhasi
- Genome International Corporation, 8000 Excelsior Drive, Suite 202, Madison, Wisconsin 53717, USA
| | - Doug Senalik
- United States Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706, USA
| | - Philipp W Simon
- United States Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706, USA
| | - Brajendra Kumar
- International Center for Advanced Genomics and Proteomics, 83, 1st Cross Street, Nehru Nagar, Kottivakkam, Chennai 600096, India
| | - Vinu Manikandan
- International Center for Advanced Genomics and Proteomics, 83, 1st Cross Street, Nehru Nagar, Kottivakkam, Chennai 600096, India
| | - Philge Philip
- International Center for Advanced Genomics and Proteomics, 83, 1st Cross Street, Nehru Nagar, Kottivakkam, Chennai 600096, India
| | - Periannan Senapathy
- Genome International Corporation, 8000 Excelsior Drive, Suite 202, Madison, Wisconsin 53717, USA
- International Center for Advanced Genomics and Proteomics, 83, 1st Cross Street, Nehru Nagar, Kottivakkam, Chennai 600096, India
| |
Collapse
|