1
|
Rani I, Kalsi A, Kaur G, Sharma P, Gupta S, Gautam RK, Chopra H, Bibi S, Ahmad SU, Singh I, Dhawan M, Emran TB. Modern drug discovery applications for the identification of novel candidates for COVID-19 infections. Ann Med Surg (Lond) 2022; 80:104125. [PMID: 35845863 PMCID: PMC9273307 DOI: 10.1016/j.amsu.2022.104125] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 07/05/2022] [Accepted: 07/06/2022] [Indexed: 11/23/2022] Open
Abstract
In early December 2019, a large pneumonia epidemic occurred in Wuhan, China. The World Health Organization is concerned about the outbreak of another coronavirus with the powerful, rapid, and contagious transmission. Anyone with minor symptoms like fever and cough or travel history to contaminated places might be suspected of having COVID-19. COVID-19 therapy focuses on treating the disease's symptoms. So far, no such therapeutic molecule has been shown effective in treating this condition. So the treatment is mostly supportive and plasma. Globally, numerous studies and researchers have recently started fighting this virus. Vaccines and chemical compounds are also being investigated against infection. COVID-19 was successfully diagnosed using RNA detection and very sensitive RT-PCR (reverse transcription-polymerase chain reaction). The evolution of particular vaccinations is required to reduce illness severity and spread. Numerous computational analyses and molecular docking have predicted various target compounds that might stop this condition. This paper examines the main characteristics of coronavirus and the computational analyses necessary to avoid infection.
Collapse
Affiliation(s)
- Isha Rani
- MM College of Pharmacy, Maharishi Markandeshwar (Deemed to be University), Mullana, Haryana, India
| | - Avjit Kalsi
- MM School of Pharmacy, MM University, Sadopur, Ambala, Haryana, India
| | - Gagandeep Kaur
- Chitkara School of Pharmacy, Chitkara University-Baddi, Himachal Pradesh, India
| | - Pankaj Sharma
- Apotex Research Pvt. Ltd, Bangalore, Karnataka, India
| | - Sumeet Gupta
- MM College of Pharmacy, Maharishi Markandeshwar (Deemed to be University), Mullana, Haryana, India
| | - Rupesh K. Gautam
- MM School of Pharmacy, MM University, Sadopur, Ambala, Haryana, India
| | - Hitesh Chopra
- Department of Pharmaceutics, Chitkara College of Pharmacy, Chitkara University, Punjab, India
| | - Shabana Bibi
- Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming, 650091, Yunnan, China
- The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming, 650091, Yunnan, China
| | - Syed Umair Ahmad
- Department of Bioinformatics, Hazara University, Mansehra, Pakistan
| | - Inderbir Singh
- Department of Pharmaceutics, Chitkara College of Pharmacy, Chitkara University, Punjab, India
| | - Manish Dhawan
- Department of Microbiology, Punjab Agricultural University, Ludhiana, 141004, Punjab, India
- Trafford College, Altrincham, Manchester, WA14 5PQ, UK
| | - Talha Bin Emran
- Department of Pharmacy, BGC Trust University Bangladesh, Chittagong, 4381, Bangladesh
- Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, 1207, Bangladesh
| |
Collapse
|
2
|
Nehme R, Hallal R, El Dor M, Kobeissy F, Gouilleux F, Mazurier F, Zibara K. Repurposing of Acriflavine to Target Chronic Myeloid Leukemia Treatment. Curr Med Chem 2021; 28:2218-2233. [PMID: 32900342 DOI: 10.2174/0929867327666200908114411] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 06/10/2020] [Accepted: 06/25/2020] [Indexed: 11/22/2022]
Abstract
Drug repurposing has lately received increasing interest in several diseases especially in cancers, due to its advantages in facilitating the development of new therapeutic strategies, by adopting a cost-friendly approach and avoiding the strict Food and Drug Administration (FDA) regulations. Acriflavine (ACF) is an FDA approved molecule that has been extensively studied since 1912 with antiseptic, trypanocidal, anti-viral, anti-bacterial and anti-cancer effects. ACF has been shown to block the growth of solid and hematopoietic tumor cells. Indeed, ACF acts as an inhibitor of various proteins, including DNA-dependent protein kinases C (DNA-PKcs), topoisomerase I and II, hypoxia-inducible factor 1α (HIF-1α), in addition to its recent discovery as an inhibitor of the signal transducer and activator of transcription (STAT). Chronic myeloid leukemia (CML) is a clonal myeloproliferative disorder characterized by the expression of the constitutively active tyrosine kinase BCR-ABL. This protein allows the activation of several signaling pathways known for their role in cell proliferation and survival, such as the JAK/STAT pathway. CML therapy, based on tyrosine kinase inhibitors (TKIs), such as imatinib (IM), is highly effective. However, 15% of patients are refractory to IM, where in some cases, 20-30% of patients become resistant. Thus, we suggest the repurposing of ACF in CML after IM failure or in combination with IM to improve the anti-tumor effects of IM. In this review, we present the different pharmacological properties of ACF along with its anti-leukemic effects in the hope of its repurposing in CML therapy.
Collapse
Affiliation(s)
- Rawan Nehme
- Universite de Tours, EA7501 GICC, Tours, France
| | | | - Maya El Dor
- Universite de Tours, EA7501 GICC, Tours, France
| | - Firas Kobeissy
- Department of Biochemistry and Molecular Genetics, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| | | | | | | |
Collapse
|
3
|
Kumar R, Harilal S, Gupta SV, Jose J, Thomas Parambi DG, Uddin MS, Shah MA, Mathew B. Exploring the new horizons of drug repurposing: A vital tool for turning hard work into smart work. Eur J Med Chem 2019; 182:111602. [PMID: 31421629 PMCID: PMC7127402 DOI: 10.1016/j.ejmech.2019.111602] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 08/07/2019] [Accepted: 08/07/2019] [Indexed: 02/07/2023]
Abstract
Drug discovery and development are long and financially taxing processes. On an average it takes 12-15 years and costs 1.2 billion USD for successful drug discovery and approval for clinical use. Many lead molecules are not developed further and their potential is not tapped to the fullest due to lack of resources or time constraints. In order for a drug to be approved by FDA for clinical use, it must have excellent therapeutic potential in the desired area of target with minimal toxicities as supported by both pre-clinical and clinical studies. The targeted clinical evaluations fail to explore other potential therapeutic applications of the candidate drug. Drug repurposing or repositioning is a fast and relatively cheap alternative to the lengthy and expensive de novo drug discovery and development. Drug repositioning utilizes the already available clinical trials data for toxicity and adverse effects, at the same time explores the drug's therapeutic potential for a different disease. This review addresses recent developments and future scope of drug repositioning strategy.
Collapse
Affiliation(s)
- Rajesh Kumar
- Department of Pharmacy, Kerala University of Health Sciences, Thrissur, Kerala, India
| | - Seetha Harilal
- Department of Pharmacy, Kerala University of Health Sciences, Thrissur, Kerala, India
| | - Sheeba Varghese Gupta
- Department of Pharmaceutical Sciences, College of Pharmacy, University of South Florida, Tampa, FL, 33612, USA
| | - Jobin Jose
- Department of Pharmaceutics, NGSM Institute of Pharmaceutical Science, NITTE Deemed to be University, Manglore, 575018, India
| | - Della Grace Thomas Parambi
- Department of Pharmaceutical Chemistry, College of Pharmacy, Jouf University, Sakaka, Al Jouf, 2014, Saudi Arabia
| | - Md Sahab Uddin
- Department of Pharmacy, Southeast University, Dhaka, Bangladesh; Pharmakon Neuroscience Research Network, Dhaka, Bangladesh
| | - Muhammad Ajmal Shah
- Department of Pharmacogonosy, Faculty of Pharmaceutical Sciences, Government College University, Faisalabad, Pakistan
| | - Bijo Mathew
- Division of Drug Design and Medicinal Chemistry Research Lab, Department of Pharmaceutical Chemistry, Ahalia School of Pharmacy, Palakkad, 678557, Kerala, India.
| |
Collapse
|
4
|
Xue H, Li J, Xie H, Wang Y. Review of Drug Repositioning Approaches and Resources. Int J Biol Sci 2018; 14:1232-1244. [PMID: 30123072 PMCID: PMC6097480 DOI: 10.7150/ijbs.24612] [Citation(s) in RCA: 327] [Impact Index Per Article: 54.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2017] [Accepted: 06/12/2018] [Indexed: 12/23/2022] Open
Abstract
Drug discovery is a time-consuming, high-investment, and high-risk process in traditional drug development. Drug repositioning has become a popular strategy in recent years. Different from traditional drug development strategies, the strategy is efficient, economical and riskless. There are usually three kinds of approaches: computational approaches, biological experimental approaches, and mixed approaches, all of which are widely used in drug repositioning. In this paper, we reviewed computational approaches and highlighted their characteristics to provide references for researchers to develop more powerful approaches. At the same time, the important findings obtained using these approaches are listed. Furthermore, we summarized 76 important resources about drug repositioning. Finally, challenges and opportunities in drug repositioning are discussed from multiple perspectives, including technology, commercial models, patents and investment.
Collapse
Affiliation(s)
- Hanqing Xue
- School of Computer Science and Technology, Harbin Institute of Technology, 150001, Harbin, China
| | - Jie Li
- School of Computer Science and Technology, Harbin Institute of Technology, 150001, Harbin, China
| | - Haozhe Xie
- School of Computer Science and Technology, Harbin Institute of Technology, 150001, Harbin, China
| | - Yadong Wang
- School of Computer Science and Technology, Harbin Institute of Technology, 150001, Harbin, China
| |
Collapse
|
5
|
Kimura Y, Fujino K, Ogawa K, Masuda K. Localization of Daucus carota NMCP1 to the nuclear periphery: the role of the N-terminal region and an NLS-linked sequence motif, RYNLRR, in the tail domain. FRONTIERS IN PLANT SCIENCE 2014; 5:62. [PMID: 24616728 PMCID: PMC3935212 DOI: 10.3389/fpls.2014.00062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Accepted: 02/07/2014] [Indexed: 05/24/2023]
Abstract
Recent ultrastructural studies revealed that a structure similar to the vertebrate nuclear lamina exists in the nuclei of higher plants. However, plant genomes lack genes for lamins and intermediate-type filament proteins, and this suggests that plant-specific nuclear coiled-coil proteins make up the lamina-like structure in plants. NMCP1 is a protein, first identified in Daucus carota cells, that localizes exclusively to the nuclear periphery in interphase cells. It has a tripartite structure comprised of head, rod, and tail domains, and includes putative nuclear localization signal (NLS) motifs. We identified the functional NLS of DcNMCP1 (carrot NMCP1) and determined the protein regions required for localizing to the nuclear periphery using EGFP-fused constructs transiently expressed in Apium graveolens epidermal cells. Transcription was driven under a CaMV35S promoter, and the genes were introduced into the epidermal cells by a DNA-coated microprojectile delivery system. Of the NLS motifs, KRRRK and RRHK in the tail domain were highly functional for nuclear localization. Addition of the N-terminal 141 amino acids from DcNMCP1 shifted the localization of a region including these NLSs from the entire nucleus to the nuclear periphery. Using this same construct, the replacement of amino acids in RRHK or its preceding sequence, YNL, with alanine residues abolished localization to the nuclear periphery, while replacement of KRRRK did not affect localization. The sequence R/Q/HYNLRR/H, including YNL and the first part of the sequence of RRHK, is evolutionarily conserved in a subclass of NMCP1 sequences from many plant species. These results show that NMCP1 localizes to the nuclear periphery by a combined action of a sequence composed of R/Q/HYNLRR/H, NLS, and the N-terminal region including the head and a portion of the rod domain, suggesting that more than one binding site is implicated in localization of NMCP1.
Collapse
Affiliation(s)
- Yuta Kimura
- Laboratory of Plant Functional Biology, Chair of Botany and Agronomy, Graduate School of Agriculture, Hokkaido UniversityHokkaido, Japan
| | - Kaien Fujino
- Laboratory of Crop Physiology, Chair of Botany and Agronomy, Graduate School of Agriculture, Hokkaido UniversityHokkaido, Japan
| | - Kana Ogawa
- Laboratory of Plant Functional Biology, Chair of Botany and Agronomy, Graduate School of Agriculture, Hokkaido UniversityHokkaido, Japan
| | - Kiyoshi Masuda
- Laboratory of Plant Functional Biology, Chair of Botany and Agronomy, Graduate School of Agriculture, Hokkaido UniversityHokkaido, Japan
| |
Collapse
|
6
|
KURAMOCHI MICHIHIRO, KARYPIS GEORGE. GENE CLASSIFICATION USING EXPRESSION PROFILES: A FEASIBILITY STUDY. INT J ARTIF INTELL T 2011. [DOI: 10.1142/s0218213005002302] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
As various genome sequencing projects have already been completed or are near completion, genome researchers are shifting their focus to functional genomics. Functional genomics represents the next phase, that expands the biological investigation to studying the functionality of genes of a single organism as well as studying and correlating the functionality of genes across many different organisms. Recently developed methods for monitoring genome-wide mRNA expression changes hold the promise of allowing us to inexpensively gain insights into the function of unknown genes. In this paper we focus on evaluating the feasibility of using supervised machine learning methods for determining the function of genes based solely on their expression profiles. We experimentally evaluate the performance of traditional classification algorithms such as support vector machines and k-nearest neighbors on the yeast genome, and present new approaches for classification that improve the overall recall with moderate reductions in precision. Our experiments show that the accuracies achieved for different classes varies dramatically. In analyzing these results we show that the achieved accuracy is highly dependent on whether or not the genes of that class were significantly active during the various experimental conditions, suggesting that gene expression profiles can become a viable alternative to sequence similarity searches provided that the genes are observed under a wide range of experimental conditions.
Collapse
Affiliation(s)
- MICHIHIRO KURAMOCHI
- University of Minnesota, Department of Computer Science & Engineering/Army HPC Research Center/Digital Technology Center, 4-192 EE/CS Building, 200 Union St SE, Minneapolis, MN 55455, USA
| | - GEORGE KARYPIS
- University of Minnesota, Department of Computer Science & Engineering/Army HPC Research Center/Digital Technology Center, 4-192 EE/CS Building, 200 Union St SE, Minneapolis, MN 55455, USA
| |
Collapse
|
7
|
Wang J, Li C, Wang E, Wang X. An FPT approach for predicting protein localization from yeast genomic data. PLoS One 2011; 6:e14449. [PMID: 21283516 PMCID: PMC3023707 DOI: 10.1371/journal.pone.0014449] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 12/01/2010] [Indexed: 11/18/2022] Open
Abstract
Accurately predicting the localization of proteins is of paramount importance in the quest to determine their respective functions within the cellular compartment. Because of the continuous and rapid progress in the fields of genomics and proteomics, more data are available now than ever before. Coincidentally, data mining methods been developed and refined in order to handle this experimental windfall, thus allowing the scientific community to quantitatively address long-standing questions such as that of protein localization. Here, we develop a frequent pattern tree (FPT) approach to generate a minimum set of rules (mFPT) for predicting protein localization. We acquire a series of rules according to the features of yeast genomic data. The mFPT prediction accuracy is benchmarked against other commonly used methods such as Bayesian networks and logistic regression under various statistical measures. Our results show that mFPT gave better performance than other approaches in predicting protein localization. Meanwhile, setting 0.65 as the minimum hit-rate, we obtained 138 proteins that mFPT predicted differently than the simple naive bayesian method (SNB). In our analysis of these 138 proteins, we present novel predictions for the location for 17 proteins, which currently do not have any defined localization. These predictions can serve as putative annotations and should provide preliminary clues for experimentalists. We also compared our predictions against the eukaryotic subcellular localization database and related predictions by others on protein localization. Our method is quite generalized and can thus be applied to discover the underlying rules for protein-protein interactions, genomic interactions, and structure-function relationships, as well as those of other fields of research.
Collapse
Affiliation(s)
- Jin Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
- Departments of Chemistry, Physics and Astronomy, State University of New York at Stony Brook, Stony Brook, New York, United States of America
| | - Chunhe Li
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Erkang Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
| | | |
Collapse
|
8
|
Jaimovich A, Friedman N. From large-scale assays to mechanistic insights: computational analysis of interactions. Curr Opin Biotechnol 2010; 22:87-93. [PMID: 21109421 DOI: 10.1016/j.copbio.2010.10.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2010] [Accepted: 10/27/2010] [Indexed: 01/17/2023]
Abstract
The activity in the living cell is carried out by a myriad network of interactions between macromolecules. These include interactions between proteins that form a functional complex, a protein modifying another protein in a transient interaction, a transcription factor that binds a specific DNA locus triggering a change in chromatin or transcription, and so on. Characterization of these interactions in terms of timing, context, and function is crucial for understanding how cells carry out basic biological processes. The recent years have led to the introduction of many assays for probing these interactions in a systematic and large-scale manner. However, there is a large gap between assay results and understanding of biological systems. The challenge for computational methods is to bridge this gap by combining results of different assays and introducing statistical methodologies. In this review we discuss recent advances in approaches dealing with these challenges, and key directions for the future.
Collapse
Affiliation(s)
- Ariel Jaimovich
- School of Computer Science & Engineering, Hebrew University of Jerusalem, Jerusalem, Israel
| | | |
Collapse
|
9
|
Meier I, Brkljacic J. The Arabidopsis nuclear pore and nuclear envelope. THE ARABIDOPSIS BOOK 2010; 8:e0139. [PMID: 22303264 PMCID: PMC3244964 DOI: 10.1199/tab.0139] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The nuclear envelope is a double membrane structure that separates the eukaryotic cytoplasm from the nucleoplasm. The nuclear pores embedded in the nuclear envelope are the sole gateways for macromolecular trafficking in and out of the nucleus. The nuclear pore complexes assembled at the nuclear pores are large protein conglomerates composed of multiple units of about 30 different nucleoporins. Proteins and RNAs traffic through the nuclear pore complexes, enabled by the interacting activities of nuclear transport receptors, nucleoporins, and elements of the Ran GTPase cycle. In addition to directional and possibly selective protein and RNA nuclear import and export, the nuclear pore gains increasing prominence as a spatial organizer of cellular processes, such as sumoylation and desumoylation. Individual nucleoporins and whole nuclear pore subcomplexes traffic to specific mitotic locations and have mitotic functions, for example at the kinetochores, in spindle assembly, and in conjunction with the checkpoints. Mutants of nucleoporin genes and genes of nuclear transport components lead to a wide array of defects from human diseases to compromised plant defense responses. The nuclear envelope acts as a repository of calcium, and its inner membrane is populated by functionally unique proteins connected to both chromatin and-through the nuclear envelope lumen-the cytoplasmic cytoskeleton. Plant nuclear pore and nuclear envelope research-predominantly focusing on Arabidopsis as a model-is discovering both similarities and surprisingly unique aspects compared to the more mature model systems. This chapter gives an overview of our current knowledge in the field and of exciting areas awaiting further exploration.
Collapse
Affiliation(s)
- Iris Meier
- Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center, The Ohio State University, 520 Aronoff Laboratory, 318 W 12th Avenue, Columbus, OH 43210
- Address correspondence to
| | - Jelena Brkljacic
- Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center, The Ohio State University, 520 Aronoff Laboratory, 318 W 12th Avenue, Columbus, OH 43210
| |
Collapse
|
10
|
Ulitsky I, Krogan NJ, Shamir R. Towards accurate imputation of quantitative genetic interactions. Genome Biol 2009; 10:R140. [PMID: 20003301 PMCID: PMC2812947 DOI: 10.1186/gb-2009-10-12-r140] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Revised: 11/08/2009] [Accepted: 12/10/2009] [Indexed: 12/25/2022] Open
Abstract
A new method for calculating quantitative genetic interactions allows for the inference of 190,000 new genetic interactions in Saccharomyces cerevisae. Recent technological breakthroughs have enabled high-throughput quantitative measurements of hundreds of thousands of genetic interactions among hundreds of genes in Saccharomyces cerevisiae. However, these assays often fail to measure the genetic interactions among up to 40% of the studied gene pairs. Here we present a novel method, which combines genetic interaction data together with diverse genomic data, to quantitatively impute these missing interactions. We also present data on almost 190,000 novel interactions.
Collapse
Affiliation(s)
- Igor Ulitsky
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.
| | | | | |
Collapse
|
11
|
Chen Y, Dokholyan NV. Natural selection against protein aggregation on self-interacting and essential proteins in yeast, fly, and worm. Mol Biol Evol 2008; 25:1530-3. [PMID: 18503047 DOI: 10.1093/molbev/msn122] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Protein aggregation is the phenomenon of protein self-association potentially leading to detrimental effects on physiology, which is closely related to numerous human diseases such as Alzheimer's and Parkinson's disease. Despite progress in understanding the mechanism of protein aggregation, how natural selection against protein aggregation acts on subunits of protein complexes and on proteins with different contributions to organism fitness remains largely unknown. Here, we perform a proteome-wide analysis by using an experimentally validated algorithm TANGO and utilizing sequence, interactomic and phenotype-based functional genomic data from yeast, fly, and nematode. We find that proteins that are capable of forming homooligomeric complex have lower aggregation propensity compared with proteins that do not function as homooligomer. Further, proteins that are essential to the fitness of an organism have lower aggregation propensity compared with nonessential ones. Our finding suggests that the selection force against protein aggregation acts across different hierarchies of biological system.
Collapse
|
12
|
Rohl C, Price Y, Fischer TB, Paczkowski M, Zettel MF, Tsai J. Cataloging the relationships between proteins: a review of interaction databases. Mol Biotechnol 2006; 34:69-93. [PMID: 16943573 DOI: 10.1385/mb:34:1:69] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/1999] [Revised: 11/30/1999] [Accepted: 11/30/1999] [Indexed: 11/11/2022]
Abstract
By organizing and making widely accessible the increasing amounts of data from high-throughput analyses, protein interaction databases have become an integral resource for the biological community in relating sequence data with higher-order function. To provide a sense of the use and applicability of these databases, we describe each of the major comprehensive interaction databases as well as some of the more specialized ones. Content description, search/browse functionalities, and data presentation are discussed. A succinct explanation of database contents helps the user quickly identify whether the database contains applicable information to their research interest. Broad levels of search/browse functions as well as descriptions/examples allow users to quickly find and access pertinent data. At this point, clear presentation of search results as well as the primary content is necessary. Many databases display information graphically or divided into smaller digestible parts over a number of tabbed/linked pages. In addition, cross-linking between the databases promotes interconnectivity of the data and is an added layer of relational data for the user. Overall, although these protein interaction databases are under continual improvement, their current state shows that much time and effort has gone into organizing and presenting these large sets of data-describing protein interactions.
Collapse
Affiliation(s)
- Carol Rohl
- Rosetta Inpharmatics LLC, Seattle, WA, USA
| | | | | | | | | | | |
Collapse
|
13
|
Ausmees N. Intermediate Filament-Like Cytoskeleton of Caulobacter crescentus. J Mol Microbiol Biotechnol 2006; 11:152-8. [PMID: 16983192 DOI: 10.1159/000094051] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Eukaryotic cytoskeleton consists of three main types of filaments: actin microfilaments, microtubules and intermediate filaments (IFs). Actin and tubulin-like proteins are also found in bacteria where they perform diverse cytoskeletal functions. IFs, however, are considered to be a characteristic constituent of metazoan cells only, where they (among other functions) are involved in determination and maintenance of cell shape and cellular integrity. Surprisingly, a coiled coil-rich protein called crescentin was recently shown to play a key role in determining the complex curved and helical cell shapes of the bacterium Caulobacter crescentus, and to exhibit several characteristic properties of animal IF proteins. First, the arrangement of the coiled coil domains of crescentin closely resembles the tripartite molecular architecture of IF proteins. Second, crescentin also possesses the defining biochemical property of IF proteins to assemble into 10-nm-wide filaments in vitro without cofactors. Furthermore, crescentin forms a higher-order helical structure in vivo, which is localized asymmetrically along the concave side of the cell. In close association with the cell membrane, the crescentin structure promotes the helical growth of the cell and thereby determines a curved or a helical shape, depending on the length of the cell. The unexpected finding of an IF-like element in a bacterium raises several interesting questions concerning, for example, the molecular mechanisms whereby complex and asymmetric cell shapes are generated by different bacteria, or the functional and evolutionary relatedness of crescentin to animal IF proteins.
Collapse
Affiliation(s)
- Nora Ausmees
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.
| |
Collapse
|
14
|
Pribylová L, Papousková K, Zavrel M, Souciet JL, Sychrová H. Exploration of yeast alkali metal cation/H+ antiporters: Sequence and structure comparison. Folia Microbiol (Praha) 2006; 51:413-24. [PMID: 17176761 DOI: 10.1007/bf02931585] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Saccharomyces cerevisiae genome contains three genes encoding alkali metal cation/H+ antiporters (Nha1p, Nhx1p, Kha1p) that differ in cell localization, substrate specificity and physiological function. Systematic genome sequencing of other yeast species revealed highly conserved homologous ORFs in all of them. We compared the yeast sequences both at DNA and protein levels. The subfamily of yeast endosomal/prevacuolar Nhx1 antiporters is closely related to mammalian plasma membrane NHE proteins and to both plasma membrane and vacuolar plant antiporters. The high sequence conservation within this subfamily of yeast antiporters suggests that Nhx1p is of great importance in cell physiology. Yeast Kha1 proteins probably belong to the same subfamily as bacterial antiporters, whereas Nhal proteins form a distinct subfamily.
Collapse
Affiliation(s)
- L Pribylová
- Department of Membrane Transport, Institute of Physiology, Academy of Sciences of the Czech Republic, 142 20 Prague, Czechia
| | | | | | | | | |
Collapse
|
15
|
Jaimovich A, Elidan G, Margalit H, Friedman N. Towards an integrated protein-protein interaction network: a relational Markov network approach. J Comput Biol 2006; 13:145-64. [PMID: 16597232 DOI: 10.1089/cmb.2006.13.145] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein-protein interactions play a major role in most cellular processes. Thus, the challenge of identifying the full repertoire of interacting proteins in the cell is of great importance and has been addressed both experimentally and computationally. Today, large scale experimental studies of protein interactions, while partial and noisy, allow us to characterize properties of interacting proteins and develop predictive algorithms. Most existing algorithms, however, ignore possible dependencies between interacting pairs and predict them independently of one another. In this study, we present a computational approach that overcomes this drawback by predicting protein-protein interactions simultaneously. In addition, our approach allows us to integrate various protein attributes and explicitly account for uncertainty of assay measurements. Using the language of relational Markov networks, we build a unified probabilistic model that includes all of these elements. We show how we can learn our model properties and then use it to predict all unobserved interactions simultaneously. Our results show that by modeling dependencies between interactions, as well as by taking into account protein attributes and measurement noise, we achieve a more accurate description of the protein interaction network. Furthermore, our approach allows us to gain new insights into the properties of interacting proteins.
Collapse
Affiliation(s)
- Ariel Jaimovich
- School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel
| | | | | | | |
Collapse
|
16
|
Sun N, Carroll RJ, Zhao H. Bayesian error analysis model for reconstructing transcriptional regulatory networks. Proc Natl Acad Sci U S A 2006; 103:7988-93. [PMID: 16702552 PMCID: PMC1472417 DOI: 10.1073/pnas.0600164103] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Transcription regulation is a fundamental biological process, and extensive efforts have been made to dissect its mechanisms through direct biological experiments and regulation modeling based on physical-chemical principles and mathematical formulations. Despite these efforts, transcription regulation is yet not well understood because of its complexity and limitations in biological experiments. Recent advances in high throughput technologies have provided substantial amounts and diverse types of genomic data that reveal valuable information on transcription regulation, including DNA sequence data, protein-DNA binding data, microarray gene expression data, and others. In this article, we propose a Bayesian error analysis model to integrate protein-DNA binding data and gene expression data to reconstruct transcriptional regulatory networks. There are two unique aspects to this proposed model. First, transcription is modeled as a set of biochemical reactions, and a linear system model with clear biological interpretation is developed. Second, measurement errors in both protein-DNA binding data and gene expression data are explicitly considered in a Bayesian hierarchical model framework. Model parameters are inferred through Markov chain Monte Carlo. The usefulness of this approach is demonstrated through its application to infer transcriptional regulatory networks in the yeast cell cycle.
Collapse
Affiliation(s)
- Ning Sun
- *Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT 06520; and
| | - Raymond J. Carroll
- Department of Statistics, Texas A&M University, College Station, TX 77843
| | - Hongyu Zhao
- *Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT 06520; and
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
17
|
Guo X, Tatsuoka K, Liu R. Histone acetylation and transcriptional regulation in the genome of Saccharomyces cerevisiae. Bioinformatics 2005; 22:392-9. [PMID: 16339282 DOI: 10.1093/bioinformatics/bti823] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
MOTIVATION In eukaryotic genomes, histone acetylation and thereafter departure from the chromatin are essential for gene transcription initiation. Because gene transcription is tightly regulated by transcription factors, there are some speculations on the cooperation of histone acetylation and transcription factor binding. However, systematic statistical analyses of this relationship on a genomic scale have not been reported. RESULTS We apply several statistical methods to explore this relationship on two recent genomic datasets: acetylation levels on 11 histone lysines and binding activities of 203 transcription factors, both in promoter regions across the yeast genome. By canonical correlation analysis, we find that a histone acetylation pattern is correlated with a certain profile of transcription factor binding in the genome. Furthermore, after clustering the genes by their acetylation levels on the 11 histone lysines, the genes within clusters show distinct transcription factor binding profiles, as discovered by principle component analysis. Even after applying fairly stringent statistical measurement, most of these clusters have transcription factors with binding activities significantly deviated from the overall genome. We conclude that in the yeast genome, there is a strong correlation between histone acetylation and transcription factor binding in the promoter regions.
Collapse
Affiliation(s)
- Xiang Guo
- GlaxoSmithKline, Bioinformatics Division 709 Swedeland Road, King of Prussia, PA 19406, USA
| | | | | |
Collapse
|
18
|
|
19
|
Goh CS, Lan N, Douglas SM, Wu B, Echols N, Smith A, Milburn D, Montelione GT, Zhao H, Gerstein M. Mining the structural genomics pipeline: identification of protein properties that affect high-throughput experimental analysis. J Mol Biol 2004; 336:115-30. [PMID: 14741208 DOI: 10.1016/j.jmb.2003.11.053] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Structural genomics projects represent major undertakings that will change our understanding of proteins. They generate unique datasets that, for the first time, present a standardized view of proteins in terms of their physical and chemical properties. By analyzing these datasets here, we are able to discover correlations between a protein's characteristics and its progress through each stage of the structural genomics pipeline, from cloning, expression, purification, and ultimately to structural determination. First, we use tree-based analyses (decision trees and random forest algorithms) to discover the most significant protein features that influence a protein's amenability to high-throughput experimentation. Based on this, we identify potential bottlenecks in various stages of the structural genomics process through specialized "pipeline schematics". We find that the properties of a protein that are most significant are: (i.) whether it is conserved across many organisms; (ii). the percentage composition of charged residues; (iii). the occurrence of hydrophobic patches; (iv). the number of binding partners it has; and (v). its length. Conversely, a number of other properties that might have been thought to be important, such as nuclear localization signals, are not significant. Thus, using our tree-based analyses, we are able to identify combinations of features that best differentiate the small group of proteins for which a structure has been determined from all the currently selected targets. This information may prove useful in optimizing high-throughput experimentation. Further information is available from http://mining.nesg.org/.
Collapse
Affiliation(s)
- Chern-Sing Goh
- Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave, New Haven, CT 06520, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Ausmees N, Kuhn JR, Jacobs-Wagner C. The bacterial cytoskeleton: an intermediate filament-like function in cell shape. Cell 2004; 115:705-13. [PMID: 14675535 DOI: 10.1016/s0092-8674(03)00935-8] [Citation(s) in RCA: 272] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Various cell shapes are encountered in the prokaryotic world, but how they are achieved is poorly understood. Intermediate filaments (IFs) of the eukaryotic cytoskeleton play an important role in cell shape in higher organisms. No such filaments have been found in prokaryotes. Here, we describe a bacterial equivalent to IF proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteristic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane. We propose that IF-like filaments of crescentin assemble into a helical structure, which by applying its geometry to the cell, generates a vibrioid or helical cell shape depending on the length of the cell.
Collapse
Affiliation(s)
- Nora Ausmees
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA
| | | | | |
Collapse
|
21
|
|
22
|
Kitagawa E, Momose Y, Iwahashi H. Correlation of the structures of agricultural fungicides to gene expression in Saccharomyces cerevisiae upon exposure to toxic doses. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2003; 37:2788-2793. [PMID: 12854720 DOI: 10.1021/es026156b] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Correlations between the chemical structures of agricultural fungicides and mRNA expression levels following exposure of Saccharomyces cerevisiae to toxic doses of thiuram, zineb, maneb, TPN, and PCP were examined. Structurally, thiuram, zineb, and maneb are dithiocarbamate fungicides, whereas TPN and PCP are not. To characterize chemical toxicity, genes expression was classified according to the functional groups used by the MIPS database. However, no correlations between the classification scheme and chemical structures were found. Hierarchical clustering of gene expression profiles was performed to characterize the effects of the five chemicals. According to this analysis the similarity of gene expression profiles depended on the similarity of chemical structures. These results suggest that DNA microarray technology has potential for predicting the major chemicals which will cause environmental toxicity and will provide information on new biomonitoring methods.
Collapse
Affiliation(s)
- Emiko Kitagawa
- National Institute of Advanced Industrial Science and Technology AIST), Central-6, Higashi 1-1-1, Tukuba, Ibaraki 305-8566, Japan
| | | | | |
Collapse
|
23
|
Jiménez JL, Mitchell MP, Sgouros JG. Microarray analysis of orthologous genes: conservation of the translational machinery across species at the sequence and expression level. Genome Biol 2002; 4:R4. [PMID: 12537549 PMCID: PMC151285 DOI: 10.1186/gb-2002-4-1-r4] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2002] [Revised: 08/28/2002] [Accepted: 10/31/2002] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Genome projects have provided a vast amount of sequence information. Sequence comparison between species helps to establish functional catalogues within organisms and to study how they are maintained and modified across phylogenetic groups during evolution. Microarray studies allow us to determine groups of genes with similar temporal regulation and perhaps also common regulatory upstream regions for binding of transcription factors. The integration of sequence and expression data is expected to refine our current annotations and provide some insight into the evolution of gene regulation across organisms. RESULTS We have investigated how well the protein subcellular localization and functional categories established from clustering of orthologous genes agree with gene-expression data in Saccharomyces cerevisiae. An increase in the resolution of biologically meaningful classes is observed upon the combination of experiments under different conditions. The functional categories deduced by sequence comparison approaches are, in general, preserved at the level of expression and can sometimes interact into larger co-regulated networks, such as the protein translation process. Differences and similarities in the expression between cytoplasmic-mitochondrial and interspecies translation machineries complement evolutionary information from sequence similarity. CONCLUSIONS Combination of several microarray experiments is a powerful tool for the identification of upstream regulatory motifs of yeast genes involved in protein synthesis. Comparison of these yeast co-regulated genes against the archaeal and bacterial operons indicates that the components of the protein translation process are conserved across organisms at the expression level with minor specific adaptations.
Collapse
Affiliation(s)
- Jose L Jiménez
- Computational Genome Analysis Laboratory, Cancer Research UK, 44 Lincoln's Inn Fields, London WC2A 3PX, UK.
| | | | | |
Collapse
|
24
|
Abstract
Structural analyses on a small number of protein families have shown that residues in protein interfaces are more conserved than average amino acid residues. This is also true of other ligand-binding and active site residues. This raises the question whether protein interactions place additional constraints on sequence divergence beyond this general background of functional restrictions on all different types of proteins. In order to investigate this, the sequence identities of Saccharomyces cerevisiae (SC) proteins to their Schizosaccharomyces pombe (SP) orthologues were used as a measure of sequence divergence. The SC proteins were divided into those in stable complexes, those that participate in transient interactions and the remaining proteins. All types of proteins can undergo extensive divergence: all three sequence identity distributions range from less than 20 to over 90%. However, overall, protein interactions do place additional constraints on sequence divergence and the distributions differ significantly: proteins not known to be involved in interactions have an average sequence identity of 38% while this value is 46% for proteins in stable complexes. Proteins that have transient interactions are intermediate between the two, with an average sequence identity of 41%. This trend is independent of whether the proteins are involved in informational functions (transcription, translation and replication) or not and of protein dispensability.
Collapse
Affiliation(s)
- Sarah A Teichmann
- MRC Laboratory of Molecular Biology, Hills Road, CB2 2QH, Cambridge, UK.
| |
Collapse
|
25
|
McClain DL, Gurnon DG, Oakley MG. Importance of potential interhelical salt-bridges involving interior residues for coiled-coil stability and quaternary structure. J Mol Biol 2002; 324:257-70. [PMID: 12441105 DOI: 10.1016/s0022-2836(02)01072-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Coiled coils are formed by two or more alpha-helices that align in a parallel or an antiparallel relative orientation. Polar interactions involving residues at the interior a and d positions are important for determining the quaternary structure of coiled coils. In the model heterodimeric coiled-coil Acid-a1-Base-a1, a buried a-d' Asn-Asn interaction is sufficient to specify both a dimeric structure and an antiparallel relative helix orientation. Although the equivalent a-a' interaction is found in parallel coiled coils, there is no example of an a-d' Asn-Asn interaction in structurally characterized, naturally occurring antiparallel coiled coils. Instead, interior charged residues form interhelical salt-bridges with residues at the adjacent e or g positions. Using a model coiled-coil heterodimer, we have explored the role of a potential interhelical interaction between an Arg at an interior d position and a Glu at the adjacent g' position. Our results demonstrate that this potentially attractive interhelical Coulombic interaction has little or no influence on helix orientation. Instead, we show that burying a single Arg residue at an interior position is sufficient to specify a dimeric state at a significantly lower thermodynamic cost than burial of two interacting Asn residues.
Collapse
Affiliation(s)
- Diana L McClain
- Department of Chemistry, Indiana University, 800 East Kirkwood Avenue, Bloomington, IN 47405-7102, USA
| | | | | |
Collapse
|
26
|
Kim BM, Oakley MG. A general method for selection and screening of coiled coils on the basis of relative helix orientation. J Am Chem Soc 2002; 124:8237-44. [PMID: 12105901 DOI: 10.1021/ja020275+] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We have developed a method for selecting coiled coils that associate with a given relative helix orientation from a randomized pool of proteins. To select for antiparallel dimers, we have designed a model basic region-leucine zipper (bZip) heterodimer capable of binding DNA only when the coiled coil associates with an antiparallel relative helix alignment. The dimerization domain for this bZip heterodimer is the model antiparallel coiled coil Acid-a1-Base-a1 (Oakley, M. G.; Kim, P. S. Biochemistry 1998, 37, 12603), and both monomers contain the GCN4 basic region. Although the basic regions in naturally occurring bZip proteins are located N-terminal to the leucine zipper, we have attached the GCN4 basic region to the C-terminus of Acid-a1 to allow both basic regions to contact DNA in an antiparallel heterodimer. The resulting heterodimer, BR-Base-a1-Acid-a1-BR, can bind to a direct repeat of the GCN4 half-site in vivo, leading to spectinomycin resistance in the transcription interference assay of Elledge et al. (Elledge, S. J.; Sugiono, P.; Guarente, L.; Davis, R. W. Proc. Natl. Acad. Sci. U.S.A. 1989, 86, 3689). A buried interhelical polar interaction between two Asn residues in the Acid-a1-Base-a1 heterodimer is known to specify an antiparallel helix orientation. The position of one of these buried Asn residues was randomized, and bZip heterodimers containing antiparallel coiled coils were selected using the transcription interference assay. All of the selected colonies contained Asn at the randomized position, suggesting that the selection is specific for antiparallel coiled coils.
Collapse
Affiliation(s)
- Byung-Moon Kim
- Department of Chemistry, Indiana University, 800 East Kirkwood Avenue, Bloomington, Indiana 47405-7102, USA
| | | |
Collapse
|
27
|
Gindullis F, Rose A, Patel S, Meier I. Four signature motifs define the first class of structurally related large coiled-coil proteins in plants. BMC Genomics 2002; 3:9. [PMID: 11972898 PMCID: PMC102765 DOI: 10.1186/1471-2164-3-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2002] [Accepted: 04/09/2002] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Animal and yeast proteins containing long coiled-coil domains are involved in attaching other proteins to the large, solid-state components of the cell. One subgroup of long coiled-coil proteins are the nuclear lamins, which are involved in attaching chromatin to the nuclear envelope and have recently been implicated in inherited human diseases. In contrast to other eukaryotes, long coiled-coil proteins have been barely investigated in plants. RESULTS We have searched the completed Arabidopsis genome and have identified a family of structurally related long coiled-coil proteins. Filament-like plant proteins (FPP) were identified by sequence similarity to a tomato cDNA that encodes a coiled-coil protein which interacts with the nuclear envelope-associated protein, MAF1. The FPP family is defined by four novel unique sequence motifs and by two clusters of long coiled-coil domains separated by a non-coiled-coil linker. All family members are expressed in a variety of Arabidopsis tissues. A homolog sharing the structural features was identified in the monocot rice, indicating conservation among angiosperms. CONCLUSION Except for myosins, this is the first characterization of a family of long coiled-coil proteins in plants. The tomato homolog of the FPP family binds in a yeast two-hybrid assay to a nuclear envelope-associated protein. This might suggest that FPP family members function in nuclear envelope biology. Because the full Arabidopsis genome does not appear to contain genes for lamins, it is of interest to investigate other long coiled-coil proteins, which might functionally replace lamins in the plant kingdom.
Collapse
Affiliation(s)
- Frank Gindullis
- CellTec Biotechnologie GmbH, Frohmestrasse 110, D-22459 Hamburg, Germany
| | - Annkatrin Rose
- Plant Biotechnology Center and Department of Plant Biology, Ohio State University, 210 Rightmire Hall, 1060 Carmack Road, Columbus, Ohio 43210, USA
| | - Shalaka Patel
- Plant Biotechnology Center and Department of Plant Biology, Ohio State University, 210 Rightmire Hall, 1060 Carmack Road, Columbus, Ohio 43210, USA
| | - Iris Meier
- Plant Biotechnology Center and Department of Plant Biology, Ohio State University, 210 Rightmire Hall, 1060 Carmack Road, Columbus, Ohio 43210, USA
| |
Collapse
|
28
|
Hinnebusch AG, Natarajan K. Gcn4p, a master regulator of gene expression, is controlled at multiple levels by diverse signals of starvation and stress. EUKARYOTIC CELL 2002; 1:22-32. [PMID: 12455968 PMCID: PMC118051 DOI: 10.1128/ec.01.1.22-32.2002] [Citation(s) in RCA: 259] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Affiliation(s)
- Alan G Hinnebusch
- Laboratory of Gene Regulation and Development, National Institute of Child Health and Human Development, Bethesda, Maryland 20892, USA.
| | | |
Collapse
|
29
|
McClain DL, Binfet JP, Oakley MG. Evaluation of the energetic contribution of interhelical Coulombic interactions for coiled coil helix orientation specificity. J Mol Biol 2001; 313:371-83. [PMID: 11800563 DOI: 10.1006/jmbi.2001.5044] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Coiled coils are formed by two or more alpha-helices that align in a parallel or an antiparallel relative orientation. The factors that determine a preference for a given relative helix orientation are incompletely understood. The helix orientation preference for the designed coiled coil, Acid-a1-Base-a1, was measured previously. This model system therefore provides a means for the experimental determination of the energetic contribution of a variety of interactions to helix orientation specificity. The antiparallel preference for Acid-a1-Base-a1 is imparted by a single buried polar interaction. Interhelical Coulombic interactions between residues at the e and g positions have been proposed to influence helix orientation preference. In the Acid-a1-Base-a1 heterodimer, potentially attractive Coulombic interactions are expected in both orientations. To determine the energetic consequences of Coulombic interactions for helix orientation preference, we have positioned a single charged residue in each peptide such that exclusively favorable interhelical Coulombic interactions can occur only in the parallel orientation. In contrast, two potentially repulsive interactions are expected in the antiparallel orientation. Because the buried polar interaction can occur only in the antiparallel orientation, interhelical Coulombic interactions favor the parallel orientation and the potential to form a buried polar interaction favors the antiparallel orientation. We find no clear preference for an antiparallel orientation in the resulting heterodimer, Acid-Ke-Base-Eg, suggesting that interhelical Coulombic interactions and a buried polar interaction are of approximately equal importance for helix orientation specificity. Stability measurements indicate that maintenance of all favorable electrostatic interactions and/or avoidance of two potentially repulsive interactions contributes approximately 2.1 kcal/mol to helix orientation preference.
Collapse
Affiliation(s)
- D L McClain
- Department of Chemistry, Indiana University, 800 East Kirkwood Avenue, Bloomington, IN 47405-7102, USA
| | | | | |
Collapse
|
30
|
Natarajan K, Meyer MR, Jackson BM, Slade D, Roberts C, Hinnebusch AG, Marton MJ. Transcriptional profiling shows that Gcn4p is a master regulator of gene expression during amino acid starvation in yeast. Mol Cell Biol 2001; 21:4347-4368. [PMID: 11390663 DOI: 10.1128/mcb.21.13.4347-4368] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023] Open
Abstract
Starvation for amino acids induces Gcn4p, a transcriptional activator of amino acid biosynthetic genes in Saccharomyces cerevisiae. In an effort to identify all genes regulated by Gcn4p during amino acid starvation, we performed cDNA microarray analysis. Data from 21 pairs of hybridization experiments using two different strains derived from S288c revealed that more than 1,000 genes were induced, and a similar number were repressed, by a factor of 2 or more in response to histidine starvation imposed by 3-aminotriazole (3AT). Profiling of a gcn4Delta strain and a constitutively induced mutant showed that Gcn4p is required for the full induction by 3AT of at least 539 genes, termed Gcn4p targets. Genes in every amino acid biosynthetic pathway except cysteine and genes encoding amino acid precursors, vitamin biosynthetic enzymes, peroxisomal components, mitochondrial carrier proteins, and autophagy proteins were all identified as Gcn4p targets. Unexpectedly, genes involved in amino acid biosynthesis represent only a quarter of the Gcn4p target genes. Gcn4p also activates genes involved in glycogen homeostasis, and mutant analysis showed that Gcn4p suppresses glycogen levels in amino acid-starved cells. Numerous genes encoding protein kinases and transcription factors were identified as targets, suggesting that Gcn4p is a master regulator of gene expression. Interestingly, expression profiles for 3AT and the alkylating agent methyl methanesulfonate (MMS) overlapped extensively, and MMS induced GCN4 translation. Thus, the broad transcriptional response evoked by Gcn4p is produced by diverse stress conditions. Finally, profiling of a gcn4Delta mutant uncovered an alternative induction pathway operating at many Gcn4p target genes in histidine-starved cells.
Collapse
Affiliation(s)
- K Natarajan
- Laboratory of Gene Regulation and Development, National Institute of Child Health and Human Development, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | |
Collapse
|
31
|
Natarajan K, Meyer MR, Jackson BM, Slade D, Roberts C, Hinnebusch AG, Marton MJ. Transcriptional profiling shows that Gcn4p is a master regulator of gene expression during amino acid starvation in yeast. Mol Cell Biol 2001; 21:4347-68. [PMID: 11390663 PMCID: PMC87095 DOI: 10.1128/mcb.21.13.4347-4368.2001] [Citation(s) in RCA: 551] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2001] [Accepted: 04/03/2001] [Indexed: 11/20/2022] Open
Abstract
Starvation for amino acids induces Gcn4p, a transcriptional activator of amino acid biosynthetic genes in Saccharomyces cerevisiae. In an effort to identify all genes regulated by Gcn4p during amino acid starvation, we performed cDNA microarray analysis. Data from 21 pairs of hybridization experiments using two different strains derived from S288c revealed that more than 1,000 genes were induced, and a similar number were repressed, by a factor of 2 or more in response to histidine starvation imposed by 3-aminotriazole (3AT). Profiling of a gcn4Delta strain and a constitutively induced mutant showed that Gcn4p is required for the full induction by 3AT of at least 539 genes, termed Gcn4p targets. Genes in every amino acid biosynthetic pathway except cysteine and genes encoding amino acid precursors, vitamin biosynthetic enzymes, peroxisomal components, mitochondrial carrier proteins, and autophagy proteins were all identified as Gcn4p targets. Unexpectedly, genes involved in amino acid biosynthesis represent only a quarter of the Gcn4p target genes. Gcn4p also activates genes involved in glycogen homeostasis, and mutant analysis showed that Gcn4p suppresses glycogen levels in amino acid-starved cells. Numerous genes encoding protein kinases and transcription factors were identified as targets, suggesting that Gcn4p is a master regulator of gene expression. Interestingly, expression profiles for 3AT and the alkylating agent methyl methanesulfonate (MMS) overlapped extensively, and MMS induced GCN4 translation. Thus, the broad transcriptional response evoked by Gcn4p is produced by diverse stress conditions. Finally, profiling of a gcn4Delta mutant uncovered an alternative induction pathway operating at many Gcn4p target genes in histidine-starved cells.
Collapse
Affiliation(s)
- K Natarajan
- Laboratory of Gene Regulation and Development, National Institute of Child Health and Human Development, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | |
Collapse
|
32
|
Stanchi F, Bertocco E, Toppo S, Dioguardi R, Simionati B, Cannata N, Zimbello R, Lanfranchi G, Valle G. Characterization of 16 novel human genes showing high similarity to yeast sequences. Yeast 2001; 18:69-80. [PMID: 11124703 DOI: 10.1002/1097-0061(200101)18:1<69::aid-yea647>3.0.co;2-h] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The entire set of open reading frames (ORFs) of Saccharomyces cerevisiae has been used to perform systematic similarity searches against nucleic acid and protein databases: with the aim of identifying interesting homologies between yeast and mammalian genes. Many similarities were detected: mostly with known genes. However: several yeast ORFs were only found to match human partial sequence tags: indicating the presence of human transcripts still uncharacterized that have a homologous counterpart in yeast. About 30 such transcripts were further studied and named HUSSY (human sequence similar to yeast). The 16 most interesting are presented in this paper along with their sequencing and mapping data. As expected: most of these genes seem to be involved in basic metabolic and cellular functions (lipoic acid biosynthesis: ribulose-5-phosphate-3-epimerase: glycosyl transferase: beta-transducin: serine-threonine-kinase: ABC proteins: cation transporters). Genes related to RNA maturation were also found (homologues to DIM1: ROK1-RNA-elicase and NFS1). Furthermore: five novel human genes were detected (HUSSY-03: HUSSY-22: HUSSY-23: HUSSY-27: HUSSY-29) that appear to be homologous to yeast genes whose function is still undetermined. More information on this work can be obtained at the website http://grup.bio.unipd.it/hussy
Collapse
Affiliation(s)
- F Stanchi
- CRIBI Biotechnology Centre, Università di Padova, via G. Colombo 3, Padova 35121, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Tsuji G, Kenmochi Y, Takano Y, Sweigard J, Farrall L, Furusawa I, Horino O, Kubo Y. Novel fungal transcriptional activators, Cmr1p of Colletotrichum lagenarium and pig1p of Magnaporthe grisea, contain Cys2His2 zinc finger and Zn(II)2Cys6 binuclear cluster DNA-binding motifs and regulate transcription of melanin biosynthesis genes in a developmentally specific manner. Mol Microbiol 2000; 38:940-54. [PMID: 11123670 DOI: 10.1046/j.1365-2958.2000.02181.x] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Colletotrichum lagenarium and Magnaporthe grisea are plant pathogenic fungi that produce melanin during the appressorial differentiation stage of conidial germination and during the late stationary phase of mycelial growth. Here, we report the identification of genes for two unique transcription factors, CMR1 (Colletotrichum melanin regulation) and PIG1 (pigment of Magnaporthe), that are involved in melanin biosynthesis. Both Cmr1p and Pig1p contain two distinct DNA-binding motifs, a Cys2His2 zinc finger motif and a Zn(II)2Cys6 binuclear cluster motif. The presence of both these motifs in a single transcriptional regulatory protein is unique among known eukaryotic transcription factors. Deletion of CMR1 in C. lagenarium caused a defect in mycelial melanization, but not in appressorial melanization. Also, cmr1Delta mutants do not express the melanin biosynthetic structural genes SCD1 and THR1 during mycelial melanization, although the expression of these two genes was not affected during appressorial melanization.
Collapse
Affiliation(s)
- G Tsuji
- Laboratory of Plant Pathology, Faculty of Agriculture, Kyoto Prefectural University, Kyoto 606-8522, Japan
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Drawid A, Gerstein M. A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome. J Mol Biol 2000; 301:1059-75. [PMID: 10966805 DOI: 10.1006/jmbi.2000.3968] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
We develop a probabilistic system for predicting the subcellular localization of proteins and estimating the relative population of the various compartments in yeast. Our system employs a Bayesian approach, updating a protein's probability of being in a compartment, based on a diverse range of 30 features. These range from specific motifs (e.g. signal sequences or the HDEL motif) to overall properties of a sequence (e.g. surface composition or isoelectric point) to whole-genome data (e.g. absolute mRNA expression levels or their fluctuations). The strength of our approach is the easy integration of many features, particularly the whole-genome expression data. We construct a training and testing set of approximately 1300 yeast proteins with an experimentally known localization from merging, filtering, and standardizing the annotation in the MIPS, Swiss-Prot and YPD databases, and we achieve 75 % accuracy on individual protein predictions using this dataset. Moreover, we are able to estimate the relative protein population of the various compartments without requiring a definite localization for every protein. This approach, which is based on an analogy to formalism in quantum mechanics, gives better accuracy in determining relative compartment populations than that obtained by simply tallying the localization predictions for individual proteins (on the yeast proteins with known localization, 92% versus 74%). Our training and testing also highlights which of the 30 features are informative and which are redundant (19 being particularly useful). After developing our system, we apply it to the 4700 yeast proteins with currently unknown localization and estimate the relative population of the various compartments in the entire yeast genome. An unbiased prior is essential to this extrapolated estimate; for this, we use the MIPS localization catalogue, and adapt recent results on the localization of yeast proteins obtained by Snyder and colleagues using a minitransposon system. Our final localizations for all approximately 6000 proteins in the yeast genome are available over the web at: http://bioinfo.mbb.yale. edu/genome/localize.
Collapse
Affiliation(s)
- A Drawid
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | | |
Collapse
|
35
|
Galat A. Sequence diversification of the FK506-binding proteins in several different genomes. EUROPEAN JOURNAL OF BIOCHEMISTRY 2000; 267:4945-59. [PMID: 10931176 DOI: 10.1046/j.1432-1327.2000.01509.x] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Sequences of FK506-binding proteins (FKBPs) from four genomes of the following organisms were compared: the prokaryote Escherichia coli, the lower eukaryote Saccharomyces cerevisiae, the plant Arabidopsis thaliana, the nematode Caenorhabditis elegans and a composite of 14 unique FKBPs from two mammalian organisms Homo sapiens (man) and Mus musculus (domestic mouse). A singular FK506-like binding domain (FKBD) has about 12 kDa and occurs in the form of archetypal FKBP-12 and as a part of different proteins ranging in size from 13 to 135 kDa. Some organisms may contain a variable number of proteins which consist from two to four consecutively fused FKBDs. In the 12-kDa subgroup of archetypal FKBPs sequence identity (ID) varies from 100 to 83% (mammalian FKBPs-12), 75-50% in mammalian vs. invertebrate FKBPs-12, and fall to about 30% for pairwise sequence comparisons of mammalian and bacterial FKBPs-12 which suggests that their sequences are divergent. Multiple sequence alignment of FKBPs from the four genomes and a set of unique mammalian FKBPs does not contain any explicit consensus sequence but certain sequence positions have conserved physico-chemical characteristics. Variations of hydrophobicity and bulkiness in the multiple sequence alignment are nonsymmetrical because the physico-chemical properties of the aligned sequences changed during evolution. These variations at the sequence positions which are crucial for binding the immunosuppressive macrolide FK506 and peptidyl-prolyl cis/trans isomerase (PPIase) activity are small.
Collapse
Affiliation(s)
- A Galat
- Département d'Ingénierie et d'Etudes des Protéines, DSV/CEA, CE-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
36
|
Galperin MY, Koonin EV. Who's your neighbor? New computational approaches for functional genomics. Nat Biotechnol 2000; 18:609-13. [PMID: 10835597 DOI: 10.1038/76443] [Citation(s) in RCA: 246] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Several recently developed computational approaches in comparative genomics go beyond sequence comparison. By analyzing phylogenetic profiles of protein families, domain fusions, gene adjacency in genomes, and expression patterns, these methods predict many functional interactions between proteins and help deduce specific functions for numerous proteins. Although some of the resultant predictions may not be highly specific, these developments herald a new era in genomics in which the benefits of comparative analysis of the rapidly growing collection of complete genomes will become increasingly obvious.
Collapse
Affiliation(s)
- M Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD 20894, USA
| | | |
Collapse
|
37
|
Wilson CA, Kreychman J, Gerstein M. Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores. J Mol Biol 2000; 297:233-49. [PMID: 10704319 DOI: 10.1006/jmbi.2000.3550] [Citation(s) in RCA: 277] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Measuring in a quantitative, statistical sense the degree to which structural and functional information can be "transferred" between pairs of related protein sequences at various levels of similarity is an essential prerequisite for robust genome annotation. To this end, we performed pairwise sequence, structure and function comparisons on approximately 30,000 pairs of protein domains with known structure and function. Our domain pairs, which are constructed according to the SCOP fold classification, range in similarity from just sharing a fold, to being nearly identical. Our results show that traditional scores for sequence and structure similarity have the same basic exponential relationship as observed previously, with structural divergence, measured in RMS, being exponentially related to sequence divergence, measured in percent identity. However, as the scale of our survey is much larger than any previous investigations, our results have greater statistical weight and precision. We have been able to express the relationship of sequence and structure similarity using more "modern scores," such as Smith-Waterman alignment scores and probabilistic P-values for both sequence and structure comparison. These modern scores address some of the problems with traditional scores, such as determining a conserved core and correcting for length dependency; they enable us to phrase the sequence-structure relationship in more precise and accurate terms. We found that the basic exponential sequence-structure relationship is very general: the same essential relationship is found in the different secondary-structure classes and is evident in all the scoring schemes. To relate function to sequence and structure we assigned various levels of functional similarity to the domain pairs, based on a simple functional classification scheme. This scheme was constructed by combining and augmenting annotations in the enzyme and fly functional classifications and comparing subsets of these to the Escherichia coli and yeast classifications. We found sigmoidal relationships between similarity in function and sequence, with clear thresholds for different levels of functional conservation. For pairs of domains that share the same fold, precise function appears to be conserved down to approximately 40 % sequence identity, whereas broad functional class is conserved to approximately 25 %. Interestingly, percent identity is more effective at quantifying functional conservation than the more modern scores (e.g. P-values). Results of all the pairwise comparisons and our combined functional classification scheme for protein structures can be accessed from a web database at http://bioinfo.mbb.yale.edu/alignCopyright 2000 Academic Press.
Collapse
Affiliation(s)
- C A Wilson
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | | | | |
Collapse
|
38
|
Jansen R, Gerstein M. Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins. Nucleic Acids Res 2000; 28:1481-8. [PMID: 10684945 PMCID: PMC111042 DOI: 10.1093/nar/28.6.1481] [Citation(s) in RCA: 122] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We analyzed 10 genome expression data sets by large-scale cross-referencing against broad structural and functional categories. The data sets, generated by different techniques (e.g. SAGE and gene chips), provide various representations of the yeast transcriptome (the set of all yeast genes, weighted by transcript abundance). Our analysis enabled us to determine features more prevalent in the transcriptome than the genome: i.e. those that are common to highly expressed proteins. Starting with simplest categories, we find that, relative to the genome, the transcriptome is enriched in Ala and Gly and depleted in Asn and very long proteins. We find, furthermore, that protein length and maximum expression level have a roughly inverse relationship. To relate expression level and protein structure, we assigned transmembrane helices and known folds (using PSI-blast) to each protein in the genome; this allowed us to determine that the transcriptome is enriched in mixed alpha-beta structures and depleted in membrane proteins relative to the genome. In particular, some enzymatic folds, such as the TIM barrel and the G3P dehydrogenase fold, are much more prevalent in the transcriptome than the genome, whereas others, such as the protein-kinase and leucine-zipper folds, are depleted. The TIM barrel, in fact, is overwhelmingly the 'top fold' in the transcriptome, while it only ranks fifth in the genome. The most highly enriched functional categories in the transcriptome (based on the MIPS system) are energy production and protein synthesis, while categories such as transcription, transport and signaling are depleted. Furthermore, for a given functional category, transcriptome enrichment varies quite substantially between the different expression data sets, with a variation an order of magnitude larger than for the other categories cross-referenced (e.g. amino acids). One can readily see how the enrichment and depletion of the various functional categories relates directly to that of particular folds.
Collapse
Affiliation(s)
- R Jansen
- Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA
| | | |
Collapse
|
39
|
Kraemer C, Enklaar T, Zabel B, Schmidt ER. Mapping and structure of DMXL1, a human homologue of the DmX gene from Drosophila melanogaster coding for a WD repeat protein. Genomics 2000; 64:97-101. [PMID: 10708522 DOI: 10.1006/geno.1999.6050] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The DmX gene was recently isolated from the X chromosome of Drosophila melanogaster. TBLASTN searches of the dbEST databases revealed sequences with a high level of similarity to DmX in a variety of different species, including insects, nematodes, and mammals showing that DmX is an evolutionarily highly conserved gene. Here we describe the cloning of the cDNA and the chromosomal localization of one of the human homologues of DmX, Dmx-like 1 (DMXL1). The human DMXL1 gene codes for a large mRNA of 11 kb with an open reading frame of 3027 amino acids. The putative protein belongs to the superfamily of WD repeat proteins, which have mostly regulatory functions. The DMXL1 protein contains an exceptionally large number of WD repeat units. The DMXL1 gene is located on chromosome 5q22 as determined by radiation hybrid mapping and fluorescence in situ hybridization. Although the function of the DMXL1 gene and its homologues in other species remains to be discovered, the high level of evolutionary conservation together with the unusual structure suggests that it probably has an important function.
Collapse
Affiliation(s)
- C Kraemer
- Institute for Molecular Genetics, Johannes Gutenberg-University Mainz, Mainz, D-55099, Germany
| | | | | | | |
Collapse
|
40
|
Hailes HC. Royal Society of Chemistry and The Biochemical Society's international symposium on functional genomics: from gene to commercial reality? September 19-22, 1999, Ambleside, UK. Expert Opin Investig Drugs 2000; 9:173-9. [PMID: 11185013 DOI: 10.1517/13543784.9.1.173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
All aspects of genomics were covered during the meeting from gene identification and characterization to exploitation of genomic targets. Speakers highlighted the strategies used together with recent examples, indicating the advantages together with the disadvantages of the methodologies used.
Collapse
Affiliation(s)
- H C Hailes
- Department of Chemistry, University College London, UK
| |
Collapse
|
41
|
Huang H, Xiao C, Wu CH. ProClass protein family database. Nucleic Acids Res 2000; 28:273-6. [PMID: 10592245 PMCID: PMC102450 DOI: 10.1093/nar/28.1.273] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/1999] [Accepted: 10/07/1999] [Indexed: 11/13/2022] Open
Abstract
ProClass is a protein family database that organizes non-redundant sequence entries into families defined collectively by PIR superfamilies and PROSITE patterns. By combining global similarities and functional motifs into a single classification scheme, ProClass helps to reveal domain and family relationships and classify multi-domain proteins. The database currently consists of >155 000 sequence entries retrieved from both PIR-International and SWISS-PROT databases. Approximately 92 000 or 60% of the ProClass entries are classified into approximately 6000 families, including a large number of new members detected by our GeneFIND family identification system. The ProClass motif collection contains approximately 72 000 motif sequences and >1300 multiple alignments for all PROSITE patterns, including >21 000 matches not listed in PROSITE and mostly detected from unique PIR sequences. To maximize family information retrieval, the database provides links to various protein family, domain, alignment and structural class databases. With its high classification rate and comprehensive family relationships, ProClass can be used to support full-scale genomic annotation. The database, now being implemented in an object-relational database management system, is available for online sequence search and record retrieval from our WWW server at http://pir.georgetown.edu/gfserver/proclass.html
Collapse
Affiliation(s)
- H Huang
- Protein Information Resource, National Biomedical Research Foundation, 3900 Reservoir Road, NW, Washington, DC 20007, USA
| | | | | |
Collapse
|
42
|
Harder PA, Silverstein RA, Meier I. Conservation of matrix attachment region-binding filament-like protein 1 among higher plants. PLANT PHYSIOLOGY 2000; 122:225-34. [PMID: 10631266 PMCID: PMC58861 DOI: 10.1104/pp.122.1.225] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/1999] [Accepted: 10/11/1999] [Indexed: 05/21/2023]
Abstract
The interaction of chromatin with the nuclear matrix via matrix attachment regions (MARs) on the DNA is considered to be of fundamental importance for higher-order chromatin organization and the regulation of gene expression. We have previously isolated a novel nuclear matrix-localized protein (MFP1) from tomato (Lycopersicon esculentum) that preferentially binds to MAR DNA. Tomato MFP1 has a predicted filament-protein-like structure and is associated with the nuclear envelope via an N-terminal targeting domain. Based on the antigenic relationship, we report here that MFP1 is conserved in a large number of dicot and monocot species. Several cDNAs were cloned from tobacco (Nicotiana tabacum) and shown to correspond to two tobacco MFP1 genes. Comparison of the primary and predicted secondary structures of MFP1 from tomato, tobacco, and Arabidopsis indicates a high degree of conservation of the N-terminal targeting domain, the overall putative coiled-coil structure of the protein, and the C-terminal DNA-binding domain. In addition, we show that tobacco MFP1 is regulated in an organ-specific and developmental fashion, and that this regulation occurs at the level of transcription or RNA stability.
Collapse
Affiliation(s)
- P A Harder
- DuPont Central Research and Development, P.O. Box 80402, Wilmington, Delaware 19880-0402, USA
| | | | | |
Collapse
|
43
|
Galat A. Variations of sequences and amino acid compositions of proteins that sustain their biological functions: An analysis of the cyclophilin family of proteins. Arch Biochem Biophys 1999; 371:149-62. [PMID: 10545201 DOI: 10.1006/abbi.1999.1434] [Citation(s) in RCA: 87] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The sequences of the ubiquitous and phylogenetically diversified cyclophilin family of proteins were divided into six groups, namely, vertebrates, invertebrates, other metazoa, plants, fungi, and prokaryotes. These groups of sequences were aligned with the multiple sequence alignment program Clustal-W. The variations of amino acid substitutions and amino acid compositions for these six groups of cyclophilins were calculated using a novel suite of multiple-sequence alignment analysis routines. The cyclophilins from vertebrates can be divided for at least two distinct structural classes that differ from each other by a variable-length amino acid insert within the loop that links alpha-helix II and beta-strand III. A similar structural feature is also present in the other groups of cyclophilins, namely, those from invertebrates, other metazoa, plants, and fungi. The sequences of cyclophilins from fungi and prokaryotes are more diversified than those from vertebrates, and their alterations involve structures other than the amino acid inserts within the loops. Variations of the hydrophobicity and bulkiness of amino acid substitutions of the aligned sequences were calculated for each group of cyclophilins and for the alignment of all the sequences. The variations have clear asymmetry that may signify the need for modification of the physical properties of certain fragments of cyclophilins that are involved in interactions with various cellular components in the evolving environment.
Collapse
Affiliation(s)
- A Galat
- Département d'Ingénierie et d'Etudes des Protéines, DSV/CEA, Gif-sur-Yvette Cedex, CE-Saclay, F-91191, France
| |
Collapse
|
44
|
Marcotte EM, Pellegrini M, Thompson MJ, Yeates TO, Eisenberg D. A combined algorithm for genome-wide prediction of protein function. Nature 1999; 402:83-6. [PMID: 10573421 DOI: 10.1038/47048] [Citation(s) in RCA: 572] [Impact Index Per Article: 22.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The availability of over 20 fully sequenced genomes has driven the development of new methods to find protein function and interactions. Here we group proteins by correlated evolution, correlated messenger RNA expression patterns and patterns of domain fusion to determine functional relationships among the 6,217 proteins of the yeast Saccharomyces cerevisiae. Using these methods, we discover over 93,000 pairwise links between functionally related yeast proteins. Links between characterized and uncharacterized proteins allow a general function to be assigned to more than half of the 2,557 previously uncharacterized yeast proteins. Examples of functional links are given for a protein family of previously unknown function, a protein whose human homologues are implicated in colon cancer and the yeast prion Sup35.
Collapse
Affiliation(s)
- E M Marcotte
- Molecular Biology Institute, UCLA-DOE Laboratory of Structural Biology and Molecular Medicine, University of California, Los Angeles 90095, USA
| | | | | | | | | |
Collapse
|
45
|
Fortes P, Kufel J, Fornerod M, Polycarpou-Schwarz M, Lafontaine D, Tollervey D, Mattaj IW. Genetic and physical interactions involving the yeast nuclear cap-binding complex. Mol Cell Biol 1999; 19:6543-53. [PMID: 10490594 PMCID: PMC84624 DOI: 10.1128/mcb.19.10.6543] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/1999] [Accepted: 07/12/1999] [Indexed: 11/20/2022] Open
Abstract
Yeast strains lacking the yeast nuclear cap-binding complex (yCBC) are viable, although impaired in growth. We have taken advantage of this observation to carry out a genetic screen for components that show synthetic lethality (SL) with a cbp20-Delta cbp80-Delta double mutation. One set of SL interactions was due to mutations that were complemented by components of U1 small nuclear RNP (snRNP) and the yeast splicing commitment complex. These interactions confirm the role of yCBC in commitment complex formation. Physical interaction of yCBC with the commitment complex components Mud10p and Mud2p, which may directly mediate yCBC function, was demonstrated. Unexpectedly, we identified multiple SL mutations that were complemented by Cbf5p and Nop58p. These are components of the two major classes of yeast small nucleolar RNPs, which function in the maturation of rRNA precursors. Mutants lacking yCBC were found to be defective in rRNA processing. Analysis of the yCBC deletion phenotype suggests that this is likely to be due to a defect in the splicing of a subset of ribosomal protein mRNA precursors.
Collapse
Affiliation(s)
- P Fortes
- European Molecular Biology Laboratory, D-69117 Heidelberg, Germany
| | | | | | | | | | | | | |
Collapse
|
46
|
Gindullis F, Peffer NJ, Meier I. MAF1, a novel plant protein interacting with matrix attachment region binding protein MFP1, is located at the nuclear envelope. THE PLANT CELL 1999; 11:1755-68. [PMID: 10488241 PMCID: PMC144308 DOI: 10.1105/tpc.11.9.1755] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The interaction of chromatin with the nuclear matrix via matrix attachment region (MAR) DNA is considered to be of fundamental importance for chromatin organization in all eukaryotic cells. MAR binding filament-like protein 1 (MFP1) from tomato is a novel plant protein that specifically binds to MAR DNA. Its filament protein-like structure makes it a likely candidate for a structural component of the nuclear matrix. MFP1 is located at nuclear matrix-associated, specklelike structures at the nuclear envelope. Here, we report the identification of a novel protein that specifically interacts with MFP1 in yeast two-hybrid and in vitro binding assays. MFP1 associated factor 1 (MAF1) is a small, soluble, serine/threonine-rich protein that is ubiquitously expressed and has no similarity to known proteins. MAF1, like MFP1, is located at the nuclear periphery and is a component of the nuclear matrix. These data suggest that MFP1 and MAF1 are in vivo interaction partners and that both proteins are components of a nuclear substructure, previously undescribed in plants, that connects the nuclear envelope and the internal nuclear matrix.
Collapse
Affiliation(s)
- F Gindullis
- DuPont Central Research and Development, Wilmington, Delaware 19880-0402, USA
| | | | | |
Collapse
|
47
|
Hegemann JH, Klein S, Heck S, Güldener U, Niedenthal RK, Fleig U. A fast method to diagnose chromosome and plasmid loss in Saccharomyces cerevisiae strains. Yeast 1999; 15:1009-19. [PMID: 10407280 DOI: 10.1002/(sici)1097-0061(199907)15:10b<1009::aid-yea396>3.0.co;2-i] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
We have developed a simple, fast and reliable method for the analysis of genetic stability in budding yeast strains. The assay relies on our previous finding that cells expressing the green fluorescent protein (GFP) can be detected and counted by flow cytometric analysis (FACS) (Niedenthal et al., 1996). Expression of a gfp-carrying CEN-plasmid in a wild-type strain resulted in the emission of strong fluorescence from 80% of the cell population. Strong fluorescence and presence of the plasmid, determined by the presence of the URA3 genetic marker, was strictly correlated. Expression of this plasmid in 266 yeast strains, each carrying a complete deletion of a novel, non-essential gene identified in the S. cerevisiae sequencing project, pinpointed 12 strains with an increased level of mitotic plasmid loss. Finally we have shown that measurement of mitotic loss of artificial chromosome fragments equipped with the gfp expression cassette can be performed quantitatively using FACS.
Collapse
Affiliation(s)
- J H Hegemann
- Institut für Mikrobiologie und Molekularbiologie, Justus-Liebig-Universität Giessen, Frankfurter Strasse 107, 35392 Giessen, Germany
| | | | | | | | | | | |
Collapse
|
48
|
Gindullis F, Meier I. Matrix attachment region binding protein MFP1 is localized in discrete domains at the nuclear envelope. THE PLANT CELL 1999; 11:1117-28. [PMID: 10368182 PMCID: PMC144256 DOI: 10.1105/tpc.11.6.1117] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Recently, it has been suggested that nuclear processes, such as replication, transcription, and splicing, are spatially organized and associated with a nuclear framework called the nuclear matrix, a structure of unknown molecular composition. It has been shown that chromatin is attached to the nuclear matrix via specific DNA fragments called matrix attachment regions (MARs). We have begun to dissect the plant nuclear matrix by isolating a DNA binding protein with specific affinity for MARs. Here, it is shown that MAR binding filament-like protein 1 (MFP1) is associated with specklelike structures at the nuclear periphery that are part of isolated nuclei and the nuclear matrix. A predicted N-terminal transmembrane domain is necessary for the specific targeting of MFP1 to the speckles, indicating an association with the nuclear envelope-endoplasmic reticulum continuum. In addition, it is shown that a marker protein for plant microtubule organizing centers, which has been shown to be localized on the outside of the plant nuclear envelope, is also part of the nuclear matrix. These findings indicate a close and previously undescribed connection in plants between the nuclear envelope and the internal nuclear matrix, and they suggest a function for MFP1 in attaching chromatin to specific sites at the nuclear periphery.
Collapse
Affiliation(s)
- F Gindullis
- DuPont Central Research and Development, P.O. Box 80402, Wilmington, Delaware 19880-0402, USA
| | | |
Collapse
|
49
|
Abstract
A novel method was introduced to predict protein subcellular locations from sequences. Using sequence data, this method achieved a prediction accuracy higher than previous methods based on the amino acid composition. For three subcellular locations in a prokaryotic organism, the overall prediction accuracy reached 89.1%. For eukaryotic proteins, prediction accuracies of 73.0% and 78.7% were attained within four and three location categories, respectively. These results demonstrate the applicability of this relative simple method and possible improvement of prediction for the protein subcellular location.
Collapse
Affiliation(s)
- Z Yuan
- National Laboratory of Biomacromolecules, Institute of Biophysics, Academia Sinica, Beijing, China.
| |
Collapse
|
50
|
Rudinger-Thirion J, Giegé R. The peculiar architectural framework of tRNASec is fully recognized by yeast AspRS. RNA (NEW YORK, N.Y.) 1999; 5:495-502. [PMID: 10199566 PMCID: PMC1369776 DOI: 10.1017/s1355838299981955] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The wild-type transcript of Escherichia coli tRNASec, characterized by a peculiar core architecture and a large variable region, was shown to be aspartylatable by yeast AspRS. Similar activities were found for tRNASec mutants with methionine, leucine, and tryptophan anticodons. The charging efficiency of these molecules was found comparable to that of a minihelix derived from tRNAAsp and is accounted for by the presence of the discriminator residue G73, which is a major aspartate identity determinant. Introducing the aspartate identity elements from the anticodon loop (G34, U35, C36, C38) into tRNASec transforms this molecule into an aspartate acceptor with kinetic properties identical to tRNAAsp. Expression of the aspartate identity set in tRNASec is independent of the size of its variable region. The functional study was completed by footprinting experiments with four different nucleases as structural probes. Protection patterns by AspRS of transplanted tRNASec and tRNAAsp were found similar. They are modified, particularly in the anticodon loop, upon changing the aspartate anticodon into that of methionine. Altogether, it appears that recognition of a tRNA by AspRS is more governed by the presence of the aspartate identity set than by the structural framework that carries this set.
Collapse
Affiliation(s)
- J Rudinger-Thirion
- Unité Propre de Recherche 9002 Structure des Macromolécules Biologiques et Mécanismes de Reconnaissance, Institut de Biologie Moléculaire et Cellulaire du Centre National de la Recherche Scientifique, Strasbourg, France
| | | |
Collapse
|