1
|
Using SQL Databases for Sequence Similarity Searching and Analysis. ACTA ACUST UNITED AC 2018; 59:9.4.1-9.4.22. [DOI: 10.1002/cpbi.32] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
2
|
Abstract
In December, 2006, a group of 26 software developers from some of the most widely used life science programming toolkits and phylogenetic software projects converged on Durham, North Carolina, for a Phyloinformatics Hackathon, an intense five-day collaborative software coding event sponsored by the National Evolutionary Synthesis Center (NESCent). The goal was to help researchers to integrate multiple phylogenetic software tools into automated workflows. Participants addressed deficiencies in interoperability between programs by implementing “glue code” and improving support for phylogenetic data exchange standards (particularly NEXUS) across the toolkits. The work was guided by use-cases compiled in advance by both developers and users, and the code was documented as it was developed. The resulting software is freely available for both users and developers through incorporation into the distributions of several widely-used open-source toolkits. We explain the motivation for the hackathon, how it was organized, and discuss some of the outcomes and lessons learned. We conclude that hackathons are an effective mode of solving problems in software interoperability and usability, and are underutilized in scientific software development.
Collapse
|
3
|
Optimization of techniques for multiple platform testing in small, precious samples such as human chorionic villus sampling. Prenat Diagn 2016; 36:1061-1070. [PMID: 27718505 DOI: 10.1002/pd.4936] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Revised: 09/02/2016] [Accepted: 10/05/2016] [Indexed: 12/24/2022]
Abstract
BACKGROUND Multiple testing to understand global changes in gene expression based on genetic and epigenetic modifications is evolving. Chorionic villi, obtained for prenatal testing, is limited, but can be used to understand ongoing human pregnancies. However, optimal storage, processing and utilization of CVS for multiple platform testing have not been established. RESULTS Leftover CVS samples were flash-frozen or preserved in RNAlater. Modifications to standard isolation kits were performed to isolate quality DNA and RNA from samples as small as 2-5 mg. RNAlater samples had significantly higher RNA yields and quality and were successfully used in microarray and RNA-sequencing (RNA-seq). RNA-seq libraries generated using 200 versus 800-ng RNA showed similar biological coefficients of variation. RNAlater samples had lower DNA yields and quality, which improved by heating the elution buffer to 70 °C. Purification of DNA was not necessary for bisulfite-conversion and genome-wide methylation profiling. CVS cells were propagated and continue to express genes found in freshly isolated chorionic villi. CONCLUSIONS CVS samples preserved in RNAlater are superior. Our optimized techniques provide specimens for genetic, epigenetic and gene expression studies from a single small sample which can be used to develop diagnostics and treatments using a systems biology approach in the prenatal period. © 2016 John Wiley & Sons, Ltd.
Collapse
|
4
|
Targeted Deep Sequencing in Multiple-Affected Sibships of European Ancestry Identifies Rare Deleterious Variants in PTPN22 That Confer Risk for Type 1 Diabetes. Diabetes 2016; 65:794-802. [PMID: 26631741 PMCID: PMC4764149 DOI: 10.2337/db15-0322] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/06/2015] [Accepted: 11/24/2015] [Indexed: 01/22/2023]
Abstract
Despite finding more than 40 risk loci for type 1 diabetes (T1D), the causative variants and genes remain largely unknown. Here, we sought to identify rare deleterious variants of moderate-to-large effects contributing to T1D. We deeply sequenced 301 protein-coding genes located in 49 previously reported T1D risk loci in 70 T1D cases of European ancestry. These cases were selected from putatively high-risk families that had three or more siblings diagnosed with T1D at early ages. A cluster of rare deleterious variants in PTPN22 was identified, including two novel frameshift mutations (ss538819444 and rs371865329) and two missense variants (rs74163663 and rs56048322). Genotyping in 3,609 T1D families showed that rs56048322 was significantly associated with T1D and that this association was independent of the T1D-associated common variant rs2476601. The risk allele at rs56048322 affects splicing of PTPN22, resulting in the production of two alternative PTPN22 transcripts and a novel isoform of LYP (the protein encoded by PTPN22). This isoform competes with the wild-type LYP for binding to CSK and results in hyporesponsiveness of CD4(+) T cells to antigen stimulation in T1D subjects. These findings demonstrate that in addition to common variants, rare deleterious variants in PTPN22 exist and can affect T1D risk.
Collapse
|
5
|
Combinatorial drug screening and molecular profiling reveal diverse mechanisms of intrinsic and adaptive resistance to BRAF inhibition in V600E BRAF mutant melanomas. Oncotarget 2016; 7:2734-53. [PMID: 26673621 PMCID: PMC4823068 DOI: 10.18632/oncotarget.6548] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Accepted: 11/21/2015] [Indexed: 12/28/2022] Open
Abstract
Over half of BRAFV600E melanomas display intrinsic resistance to BRAF inhibitors, in part due to adaptive signaling responses. In this communication we ask whether BRAFV600E melanomas share common adaptive responses to BRAF inhibition that can provide clinically relevant targets for drug combinations. We screened a panel of 12 treatment-naïve BRAFV600E melanoma cell lines with MAP Kinase pathway inhibitors in pairwise combination with 58 signaling inhibitors, assaying for synergistic cytotoxicity. We found enormous diversity in the drug combinations that showed synergy, with no two cell lines having an identical profile. Although the 6 lines most resistant to BRAF inhibition showed synergistic benefit from combination with lapatinib, the signaling mechanisms by which this combination generated synergistic cytotoxicity differed between the cell lines. We conclude that adaptive responses to inhibition of the primary oncogenic driver (BRAFV600E) are determined not only by the primary oncogenic driver but also by diverse secondary genetic and epigenetic changes ("back-seat drivers") and hence optimal drug combinations will be variable. Because upregulation of receptor tyrosine kinases is a major source of drug resistance arising from diverse adaptive responses, we propose that inhibitors of these receptors may have substantial clinical utility in combination with inhibitors of the MAP Kinase pathway.
Collapse
|
6
|
Systems Analysis of Adaptive Responses to MAP Kinase Pathway Blockade in BRAF Mutant Melanoma. PLoS One 2015; 10:e0138210. [PMID: 26405815 PMCID: PMC4583389 DOI: 10.1371/journal.pone.0138210] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Accepted: 08/27/2015] [Indexed: 12/31/2022] Open
Abstract
Fifty percent of cutaneous melanomas are driven by activated BRAFV600E, but tumors treated with RAF inhibitors, even when they respond dramatically, rapidly adapt and develop resistance. Thus, there is a pressing need to identify the major mechanisms of intrinsic and adaptive resistance and develop drug combinations that target these resistance mechanisms. In a combinatorial drug screen on a panel of 12 treatment-naïve BRAFV600E mutant melanoma cell lines of varying levels of resistance to mitogen-activated protein kinase (MAPK) pathway inhibition, we identified the combination of PLX4720, a targeted inhibitor of mutated BRaf, and lapatinib, an inhibitor of the ErbB family of receptor tyrosine kinases, as synergistically cytotoxic in the subset of cell lines that displayed the most resistance to PLX4720. To identify potential mechanisms of resistance to PLX4720 treatment and synergy with lapatinib treatment, we performed a multi-platform functional genomics analysis to profile the genome as well as the transcriptional and proteomic responses of these cell lines to treatment with PLX4720. We found modest levels of resistance correlated with the zygosity of the BRAF V600E allele and receptor tyrosine kinase (RTK) mutational status. Layered over base-line resistance was substantial upregulation of many ErbB pathway genes in response to BRaf inhibition, thus generating the vulnerability to combination with lapatinib. The transcriptional responses of ErbB pathway genes are associated with a number of transcription factors, including ETS2 and its associated cofactors that represent a convergent regulatory mechanism conferring synergistic drug susceptibility in the context of diverse mutational landscapes.
Collapse
|
7
|
An Islet-Targeted Genome-Wide Association Scan Identifies Novel Genes Implicated in Cytokine-Mediated Islet Stress in Type 2 Diabetes. Endocrinology 2015; 156:3147-56. [PMID: 26018251 PMCID: PMC4541617 DOI: 10.1210/en.2015-1203] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Genome-wide association studies in human type 2 diabetes (T2D) have renewed interest in the pancreatic islet as a contributor to T2D risk. Chronic low-grade inflammation resulting from obesity is a risk factor for T2D and a possible trigger of β-cell failure. In this study, microarray data were collected from mouse islets after overnight treatment with cytokines at concentrations consistent with the chronic low-grade inflammation in T2D. Genes with a cytokine-induced change of >2-fold were then examined for associations between single nucleotide polymorphisms and the acute insulin response to glucose (AIRg) using data from the Genetics Underlying Diabetes in Hispanics (GUARDIAN) Consortium. Significant evidence of association was found between AIRg and single nucleotide polymorphisms in Arap3 (5q31.3), F13a1 (6p25.3), Klhl6 (3q27.1), Nid1 (1q42.3), Pamr1 (11p13), Ripk2 (8q21.3), and Steap4 (7q21.12). To assess the potential relevance to islet function, mouse islets were exposed to conditions modeling low-grade inflammation, mitochondrial stress, endoplasmic reticulum (ER) stress, glucotoxicity, and lipotoxicity. RT-PCR revealed that one or more forms of stress significantly altered expression levels of all genes except Arap3. Thapsigargin-induced ER stress up-regulated both Pamr1 and Klhl6. Three genes confirmed microarray predictions of significant cytokine sensitivity: F13a1 was down-regulated 3.3-fold by cytokines, Ripk2 was up-regulated 1.5- to 3-fold by all stressors, and Steap4 was profoundly cytokine sensitive (167-fold up-regulation). Three genes were thus closely associated with low-grade inflammation in murine islets and also with a marker for islet function (AIRg) in a diabetes-prone human population. This islet-targeted genome-wide association scan identified several previously unrecognized candidate genes related to islet dysfunction during the development of T2D.
Collapse
|
8
|
Abstract
Rationale The rationale was to utilize a bioinformatics approach to identify miRNA binding sites in genes with single nucleotide mutations (SNPs) to discover pathways in heart failure (HF). Objective The objective was to focus on the genes containing miRNA binding sites with miRNAs that were significantly altered in end-stage HF and in response to a left ventricular assist device (LVAD). Methods and Results BEDTools v2.14.3 was used to discriminate SNPs within predicted 3′UTR miRNA binding sites. A member of the miR-15/107 family, miR-16, was decreased in the circulation of end-stage HF patients and increased in response to a LVAD (p<0.001). MiR-16 decreased Vacuolar Protein Sorting 4a (VPS4a) expression in HEK 293T cells (p<0.01). The SNP rs16958754 was identified in the miR-15/107 family binding site of VPS4a which abolished direct binding of miR-16 to the 3′UTR of VPS4a (p<0.05). VPS4a was increased in the circulation of end-stage HF patients (p<0.001), and led to a decrease in the number of HEK 293T cells in vitro (p<0.001). Conclusions We provide evidence that miR-16 decreases in the circulation of end-stage HF patients and increases with a LVAD. Modeling studies suggest that miR-16 binds to and decreases expression of VPS4a. Overexpression of VPS4a decreases cell number. Together, these experiments suggest that miR-16 and VPS4a expression are altered in end-stage HF and in response to unloading with a LVAD. This signaling pathway may lead to reduced circulating cell number in HF.
Collapse
|
9
|
Abstract B29: Combinatorial screens with targeted inhibitors reveal diverse compensatory responses and mechanisms of adaptive resistance to therapy. Mol Cancer Ther 2014. [DOI: 10.1158/1535-7163.pms-b29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The effectiveness of targeted inhibition of cell signaling can be blunted by compensatory signaling, which generates adaptive resistance mechanisms and reduces therapeutic responses. We have performed high-throughput combinatorial drug screening as a discovery tool to identify compensatory pathways that confer resistance to the cytotoxic effects of targeted therapy. We screened subsets of over 1,000 drug combinations in 14 different epithelial cell lines representing three distinct cancer lineages, and 19 melanoma cell lines, and assessed the ability of each combination to cause synergistic cytotoxicity. We focused on synergistic combinations because they point to mechanistic linkages between the signaling pathways, and also because of the possibility of improved therapeutic index in vivo. Drug substitution studies were used to validate the functionally important drug targets. Of the 84 combinations that caused robust synergy in multiple epithelial cell lines, none were synergistic in more than half of the lines tested, and we observed no pattern with respect to lineage specificity or mutational status of commonly altered oncogenes in the observed synergies. Within the melanoma panel, BRAF mutational status predicted response to single-agent BRAF inhibition, but did not predict synergistic drug combinations, which were different for each cell line. These results reflect the heterogeneity of genetic alterations and the plasticity of cell signaling networks even among cell lines of the same tissue of origin that contain the same predominant driver mutations. We suggest that there is not a sharp dichotomy between “driver” and “passenger” mutations, and that the biological responses to combination therapies are determined by functionally important modifier mutations that we term “back-seat drivers.” This hypothesis is supported by analysis of the transcriptomes and phosphoproteomes of cells treated with drugs singly and in combination, and by exome sequencing. Such analyses also can reveal critical nodes with the potential to function as effective single targets. We found that co-inhibition of EGFR and PI3 Kinase causes synergistic cytotoxicity in some epithelial cancer cell lines, and that phosphoproteomic analysis of signaling pathway responses revealed concordant synergistic inhibition of p70S6 Kinase in KU-7 bladder cancer cells. Using an epistasis paradigm, restoration of p70S6 Kinase signaling by expression of mutationally activated p70S6 Kinase resulted in protection from cytotoxicity, indicating that p70S6 Kinase is a critical node for enhanced cytotoxicity due to combination treatment. AT7867, a potent inhibitor of p70S6 Kinase, was able to inhibit phosphorylation of ribosomal protein S6 and induce cytotoxicity as effectively as the combination drug treatment. We suggest that p70S6 Kinase acts as a functionally important node within the EGFR/PI3 Kinase signaling network and is an attractive target for therapeutic intervention.
Citation Format: Mark J. Axelrod, Devin Roller, Brian Capaldo, Aaron J. Mackey, Mark Conaway, Daniel Gioeli, Michael J. Weber. Combinatorial screens with targeted inhibitors reveal diverse compensatory responses and mechanisms of adaptive resistance to therapy. [abstract]. In: Proceedings of the AACR Precision Medicine Series: Synthetic Lethal Approaches to Cancer Vulnerabilities; May 17-20, 2013; Bellevue, WA. Philadelphia (PA): AACR; Mol Cancer Ther 2013;12(5 Suppl):Abstract nr B29.
Collapse
|
10
|
BAYSIC: a Bayesian method for combining sets of genome variants with improved specificity and sensitivity. BMC Bioinformatics 2014; 15:104. [PMID: 24725768 PMCID: PMC3999887 DOI: 10.1186/1471-2105-15-104] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2013] [Accepted: 03/31/2014] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Accurate genomic variant detection is an essential step in gleaning medically useful information from genome data. However, low concordance among variant-calling methods reduces confidence in the clinical validity of whole genome and exome sequence data, and confounds downstream analysis for applications in genome medicine.Here we describe BAYSIC (BAYeSian Integrated Caller), which combines SNP variant calls produced by different methods (e.g. GATK, FreeBayes, Atlas, SamTools, etc.) into a more accurate set of variant calls. BAYSIC differs from majority voting, consensus or other ad hoc intersection-based schemes for combining sets of genome variant calls. Unlike other classification methods, the underlying BAYSIC model does not require training using a "gold standard" of true positives. Rather, with each new dataset, BAYSIC performs an unsupervised, fully Bayesian latent class analysis to estimate false positive and false negative error rates for each input method. The user specifies a posterior probability threshold according to the user's tolerance for false positive and false negative errors; lowering the posterior probability threshold allows the user to trade specificity for sensitivity while raising the threshold increases specificity in exchange for sensitivity. RESULTS We assessed the performance of BAYSIC in comparison to other variant detection methods using ten low coverage (~5X) samples from The 1000 Genomes Project, a tumor/normal exome pair (40X), and exome sequences (40X) from positive control samples previously identified to contain clinically relevant SNPs. We demonstrated BAYSIC's superior variant-calling accuracy, both for somatic mutation detection and germline variant detection. CONCLUSIONS BAYSIC provides a method for combining sets of SNP variant calls produced by different variant calling programs. The integrated set of SNP variant calls produced by BAYSIC improves the sensitivity and specificity of the variant calls used as input. In addition to combining sets of germline variants, BAYSIC can also be used to combine sets of somatic mutations detected in the context of tumor/normal sequencing experiments.
Collapse
|
11
|
Sequencing of mRNA identifies re-expression of fetal splice variants in cardiac hypertrophy. J Mol Cell Cardiol 2013; 62:99-107. [PMID: 23688780 DOI: 10.1016/j.yjmcc.2013.05.004] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Revised: 05/06/2013] [Accepted: 05/09/2013] [Indexed: 02/07/2023]
Abstract
Cardiac hypertrophy has been well-characterized at the level of transcription. During cardiac hypertrophy, genes normally expressed primarily during fetal heart development are re-expressed, and this fetal gene program is believed to be a critical component of the hypertrophic process. Recently, alternative splicing of mRNA transcripts has been shown to be temporally regulated during heart development, leading us to consider whether fetal patterns of splicing also reappear during hypertrophy. We hypothesized that patterns of alternative splicing occurring during heart development are recapitulated during cardiac hypertrophy. Here we present a study of isoform expression during pressure-overload cardiac hypertrophy induced by 10 days of transverse aortic constriction (TAC) in rats and in developing fetal rat hearts compared to sham-operated adult rat hearts, using high-throughput sequencing of poly(A) tail mRNA. We find a striking degree of overlap between the isoforms expressed differentially in fetal and pressure-overloaded hearts compared to control: forty-four percent of the isoforms with significantly altered expression in TAC hearts are also expressed at significantly different levels in fetal hearts compared to control (P<0.001). The isoforms that are shared between hypertrophy and fetal heart development are significantly enriched for genes involved in cytoskeletal organization, RNA processing, developmental processes, and metabolic enzymes. Our data strongly support the concept that mRNA splicing patterns normally associated with heart development recur as part of the hypertrophic response to pressure overload. These findings suggest that cardiac hypertrophy shares post-transcriptional as well as transcriptional regulatory mechanisms with fetal heart development.
Collapse
|
12
|
Abstract 3400: Small molecule biological screens and global analytical approaches reveal the complexity of cancer cell signaling networks and the challenge of personalized medicine. Cancer Res 2013. [DOI: 10.1158/1538-7445.am2013-3400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
BRAF is mutationally activated in 50% of melanomas with a variety of data indicating its role as a driver of malignancy. However, while patients with BRAF mutations respond to vemurafenib, most responses are not long lasting. Moreover, the 25% of melanomas harboring an NRAS activating mutation are resistant de novo to BRAF inhibitors. In order to identify effective drug combinations that increase the effectiveness of BRAF inhibition, we used combinatorial small molecule biological screens and global analytical approaches to identify compensatory and redundant pathways. We screened a drug library for combinations with BRAF or MEK inhibitors that cause synergistic cytotoxicity in melanoma cell lines. Although BRAF mutational status predicted sensitivity to inhibitors of BRAF, each BRAF mutant line differed in the combination of drugs that induced synergistic cytotoxicity. One robust and clinically relevant pair identified was the BRAF inhibitor PLX4720 and the EGFR/HER2 inhibitor lapatinib. This drug combination was synergistic in half of the BRAF mutant cell lines tested, including a line that was completely resistant to PLX4720. The combination was also effective in the two NRAS mutant lines tested. Xenograft experiments testing the combination of PLX4720 and lapatinib revealed a significant reduction in tumor volume compared to either single agent alone in both PLX4720 sensitive and resistant BRAF mutant tumors, providing preclinical data supporting the efficacy of this drug combination. Synergy of the PLX4720-lapatinib combination did not correlate with EGFR/HER2 expression, activity state, or mutational status. This further indicates that sensitivity of melanoma to drug combinations is not determined by the targeted oncogenic drivers, but by the existence of unique secondary, compensatory survival responses. We hypothesize that mutationally activated BRAF is wired into the constitutive signaling network differently in each of these cells, presumably a consequence of the diverse secondary mutations that characterize melanomas. Consistent with this, whole exome-sequencing data revealed a diversity of somatic variation among the synergistic and non-synergistic melanomas, while gene expression analysis demonstrated both distinct, line-specific basal transcriptional profiles and the lack of common gene expression signatures in melanomas synergistically inhibited by treatment with PLX4720-lapatinib. Collectively, these data point out the extraordinary robustness of cancer cell signaling networks and the challenges of individualizing therapies. This study has uncovered novel functional drug combinations and suggests that the underlying signaling networks that control responses to targeted agents can vary substantially depending on unexplored components of the cell genotype and cell signaling network.
Citation Format: Devin Roller, Brian Capaldo, Aaron J. Mackey, Mark Conaway, Michael J. Weber, Daniel G. Gioeli. Small molecule biological screens and global analytical approaches reveal the complexity of cancer cell signaling networks and the challenge of personalized medicine. [abstract]. In: Proceedings of the 104th Annual Meeting of the American Association for Cancer Research; 2013 Apr 6-10; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2013;73(8 Suppl):Abstract nr 3400. doi:10.1158/1538-7445.AM2013-3400
Collapse
|
13
|
Global changes in gene expression during compatible and incompatible interactions of cowpea (Vigna unguiculata L.) with the root parasitic angiosperm Striga gesnerioides. BMC Genomics 2012; 13:402. [PMID: 22900582 PMCID: PMC3505475 DOI: 10.1186/1471-2164-13-402] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Accepted: 08/03/2012] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Cowpea, Vigna unguiculata L. Walp., is one of the most important food and forage legumes in the semi-arid tropics. While most domesticated forms of cowpea are susceptible to the root parasitic weed Striga gesnerioides, several cultivars have been identified that show race-specific resistance. Cowpea cultivar B301 contains the RSG3-301 gene for resistance to S. gesnerioides race SG3, but is susceptible to race SG4z. When challenged by SG3, roots of cultivar B301 develop a strong resistance response characterized by a hypersensitive reaction and cell death at the site of parasite attachment. In contrast, no visible response occurs in B301 roots parasitized by SG4z. RESULTS Gene expression in the roots of the cowpea cultivar B301 during compatible (susceptible) and incompatible (resistant) interactions with S. gesnerioides races SG4z and SG3, respectively, were investigated at the early (6 days post-inoculation (dpi)) and late (13 dpi) stages of the resistance response using a Nimblegen custom design cowpea microarray. A total of 111 genes were differentially expressed in B301 roots at 6 dpi; this number increased to 2102 genes at 13 dpi. At 13 dpi, a total of 1944 genes were differentially expressed during compatible (susceptible) interactions of B301 with SG4z. Genes and pathways involved in signal transduction, programmed cell death and apoptosis, and defense response to biotic and abiotic stress were differentially expressed in the early resistance response; at the later time point, enrichment was primarily for defense-related gene expression, and genes encoding components of lignifications and secondary wall formation. In compatible interactions (B301-SG4z), multiple defense pathways were repressed, including those involved in lignin biosynthesis and secondary cell wall modifications, while cellular transport processes for nitrogen and sulfur were increased. CONCLUSION Distinct changes in global gene expression profiles occur in host roots following successful and unsuccessful attempted parasitism by Striga. Induction of specific defense related genes and pathways defines components of a unique resistance mechanism. Some genes and pathways up-regulated in the host resistance response to SG3 are repressed in the susceptible interactions, suggesting that the parasite is targeting specific components of the host's defense. These results add to our understanding of plant-parasite interactions and the evolution of resistance to parasitic weeds.
Collapse
|
14
|
Abstract 266: MicroRNA Profiling Identifies Enrichment for Proteoglycan-Genes in Symptomatic Carotid Plaques. Arterioscler Thromb Vasc Biol 2012. [DOI: 10.1161/atvb.32.suppl_1.a266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
MicroRNAs are a class of small non-coding RNAs that regulate gene expression by binding to discrete sites of genes and inhibiting translation. The objective of this study was to identify microRNAs and downstream enriched targets associated with symptomatic carotid plaques. Carotid plaque samples from patients were defined as symptomatic if they were associated with a prior ischemic stroke or transient ischemic attack, or as asymptomatic if there were no prior ischemic events. Digital microRNA profiling was performed on the NanoString nCounter analysis system (NanoString Technologies, Seattle, WA) on carotid plaques from symptomatic (n=6) and asymptomatic (n=6) patients. We used the BioConductor/R package "edgeR" to perform moderated-ANOVA testing for differences in gene abundance between groups, assuming a negative binomial distribution of gene counts, with Benjamini-Hochberg multiple hypothesis correction to control false discovery rate. Analysis revealed five microRNAs that were significantly up-regulated in the symptomatic cohort (see Table 1) including miR-16, which was confirmed by RTqPCR (p=0.02). Predicted downstream targets of miR-16 were enriched for proteoglycan related genes including hyaluronan synthase 2, perlecan and betaglycan. Additionally, RTqPCR demonstrated up-regulation of CathepsinB transcript in symptomatic patients (n=46-50 per cohort, p=0.02) while CathepsinL and heparanase transcripts were not significantly different. CathepsinL enzyme activity was also not significantly different (n=46-50 per cohort). Taken together, these findings may reveal novel proteoglycan mediated signaling pathways, mediated through miR-16, regulating the progression of carotid plaque formation.
Collapse
|
15
|
Leptin protects host cells from Entamoeba histolytica cytotoxicity by a STAT3-dependent mechanism. Infect Immun 2012; 80:1934-43. [PMID: 22331430 PMCID: PMC3347425 DOI: 10.1128/iai.06140-11] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Accepted: 01/29/2012] [Indexed: 11/20/2022] Open
Abstract
The adipocytokine leptin links nutritional status to immune function. Leptin signaling protects from amebiasis, but the molecular mechanism is not understood. We developed an in vitro model of ameba-host cell interaction to test the hypothesis that leptin prevents ameba-induced apoptosis in host epithelial cells. We demonstrated that activation of mammalian leptin signaling increased cellular resistance to amebic cytotoxicity, including caspase-3 activation. Exogenous expression of the leptin receptor conferred resistance in susceptible cells, and leptin stimulation enhanced protection. A series of leptin receptor signaling mutants showed that resistance to amebic cytotoxicity was dependent on activation of STAT3 but not the Src homology-2 domain-containing tyrosine phosphatase (SHP-2) or STAT5. A common polymorphism in the leptin receptor (Q223R) that increases susceptibility to amebiasis in humans and mice was found to increase susceptibility to amebic cytotoxicity in single cells. The Q223R polymorphism also decreased leptin-dependent STAT3 activation by 21% relative to that of the wild-type (WT) receptor (P = 0.035), consistent with a central role of STAT3 signaling in protection. A subset of genes uniquely regulated by STAT3 in response to leptin was identified. Most notable were the TRIB1 and suppressor of cytokine signaling 3 (SOCS3) genes, which have opposing roles in the regulation of apoptosis. Overall apoptotic genes were highly enriched in this gene set (P < 1E-05), supporting the hypothesis that leptin regulation of host apoptotic genes via STAT3 is responsible for protection. This is the first demonstration of a mammalian signaling pathway that restricts amebic pathogenesis and represents an important advance in our mechanistic understanding of how leptin links nutrition and susceptibility to infection.
Collapse
|
16
|
The genome of the model beetle and pest Tribolium castaneum. Nature 2008; 452:949-55. [PMID: 18362917 DOI: 10.1038/nature06784] [Citation(s) in RCA: 976] [Impact Index Per Article: 61.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2007] [Accepted: 02/06/2008] [Indexed: 02/08/2023]
Abstract
Tribolium castaneum is a member of the most species-rich eukaryotic order, a powerful model organism for the study of generalized insect development, and an important pest of stored agricultural products. We describe its genome sequence here. This omnivorous beetle has evolved the ability to interact with a diverse chemical environment, as shown by large expansions in odorant and gustatory receptors, as well as P450 and other detoxification enzymes. Development in Tribolium is more representative of other insects than is Drosophila, a fact reflected in gene content and function. For example, Tribolium has retained more ancestral genes involved in cell-cell communication than Drosophila, some being expressed in the growth zone crucial for axial elongation in short-germ development. Systemic RNA interference in T. castaneum functions differently from that in Caenorhabditis elegans, but nevertheless offers similar power for the elucidation of gene function and identification of targets for selective insect control.
Collapse
|
17
|
Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction. ACTA ACUST UNITED AC 2008; 24:597-605. [PMID: 18187439 DOI: 10.1093/bioinformatics/btn004] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION The increasing diversity and variable quality of evidence relevant to gene annotation argues for a probabilistic framework that automatically integrates such evidence to yield candidate gene models. RESULTS Evigan is an automated gene annotation program for eukaryotic genomes, employing probabilistic inference to integrate multiple sources of gene evidence. The probabilistic model is a dynamic Bayes network whose parameters are adjusted to maximize the probability of observed evidence. Consensus gene predictions are then derived by maximum likelihood decoding, yielding n-best models (with probabilities for each). Evigan is capable of accommodating a variety of evidence types, including (but not limited to) gene models computed by diverse gene finders, BLAST hits, EST matches, and splice site predictions; learned parameters encode the relative quality of evidence sources. Since separate training data are not required (apart from the training sets used by individual gene finders), Evigan is particularly attractive for newly sequenced genomes where little or no reliable manually curated annotation is available. The ability to produce a ranked list of alternative gene models may facilitate identification of alternatively spliced transcripts. Experimental application to ENCODE regions of the human genome, and the genomes of Plasmodium vivax and Arabidopsis thaliana show that Evigan achieves better performance than any of the individual data sources used as evidence. AVAILABILITY The source code is available at http://www.seas.upenn.edu/~strctlrn/evigan/evigan.html.
Collapse
|
18
|
Periplasmic proteins of the extremophile Acidithiobacillus ferrooxidans: a high throughput proteomics analysis. Mol Cell Proteomics 2007; 6:2239-51. [PMID: 17911085 PMCID: PMC4631397 DOI: 10.1074/mcp.m700042-mcp200] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Acidithiobacillus ferrooxidans is a chemolithoautotrophic acidophile capable of obtaining energy by oxidizing ferrous iron or sulfur compounds such as metal sulfides. Some of the proteins involved in these oxidations have been described as forming part of the periplasm of this extremophile. The detailed study of the periplasmic components constitutes an important area to understand the physiology and environmental interactions of microorganisms. Proteomics analysis of the periplasmic fraction of A. ferrooxidans ATCC 23270 was performed by using high resolution linear ion trap-FT MS. We identified a total of 131 proteins in the periplasm of the microorganism grown in thiosulfate. When possible, functional categories were assigned to the proteins: 13.8% were transport and binding proteins, 14.6% were several kinds of cell envelope proteins, 10.8% were involved in energy metabolism, 10% were related to protein fate and folding, 10% were proteins with unknown functions, and 26.1% were proteins without homologues in databases. These last proteins are most likely characteristic of A. ferrooxidans and may have important roles yet to be assigned. The majority of the periplasmic proteins from A. ferrooxidans were very basic compared with those of neutrophilic microorganisms such as Escherichia coli, suggesting a special adaptation of the chemolithoautotrophic bacterium to its very acidic environment. The high throughput proteomics approach used here not only helps to understand the physiology of this extreme acidophile but also offers an important contribution to the functional annotation for the available genomes of biomining microorganisms such as A. ferrooxidans for which no efficient genetic systems are available to disrupt genes by procedures such as homologous recombination.
Collapse
|
19
|
Abstract
ToxoDB (http://ToxoDB.org) is a genome and functional genomic database for the protozoan parasite Toxoplasma gondii. It incorporates the sequence and annotation of the T. gondii ME49 strain, as well as genome sequences for the GT1, VEG and RH (Chr Ia, Chr Ib) strains. Sequence information is integrated with various other genomic-scale data, including community annotation, ESTs, gene expression and proteomics data. ToxoDB has matured significantly since its initial release. Here we outline the numerous updates with respect to the data and increased functionality available on the website.
Collapse
|
20
|
Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS One 2007; 2:e383. [PMID: 17440619 PMCID: PMC1849888 DOI: 10.1371/journal.pone.0000383] [Citation(s) in RCA: 311] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2007] [Accepted: 03/13/2007] [Indexed: 12/02/2022] Open
Abstract
Orthology detection is critically important for accurate functional annotation, and has been widely used to facilitate studies on comparative and evolutionary genomics. Although various methods are now available, there has been no comprehensive analysis of performance, due to the lack of a genomic-scale ‘gold standard’ orthology dataset. Even in the absence of such datasets, the comparison of results from alternative methodologies contains useful information, as agreement enhances confidence and disagreement indicates possible errors. Latent Class Analysis (LCA) is a statistical technique that can exploit this information to reasonably infer sensitivities and specificities, and is applied here to evaluate the performance of various orthology detection methods on a eukaryotic dataset. Overall, we observe a trade-off between sensitivity and specificity in orthology detection, with BLAST-based methods characterized by high sensitivity, and tree-based methods by high specificity. Two algorithms exhibit the best overall balance, with both sensitivity and specificity>80%: INPARANOID identifies orthologs across two species while OrthoMCL clusters orthologs from multiple species. Among methods that permit clustering of ortholog groups spanning multiple genomes, the (automated) OrthoMCL algorithm exhibits better within-group consistency with respect to protein function and domain architecture than the (manually curated) KOG database, and the homolog clustering algorithm TribeMCL as well. By way of using LCA, we are also able to comprehensively assess similarities and statistical dependence between various strategies, and evaluate the effects of parameter settings on performance. In summary, we present a comprehensive evaluation of orthology detection on a divergent set of eukaryotic genomes, thus providing insights and guides for method selection, tuning and development for different applications. Many biological questions have been addressed by multiple tests yielding binary (yes/no) outcomes but no clear definition of truth, making LCA an attractive approach for computational biology.
Collapse
|
21
|
Abstract
A high-quality consensus gene set for the honey bee (Apis mellifera) created using a new algorithm (GLEAN) is described. Background We wished to produce a single reference gene set for honey bee (Apis mellifera). Our motivation was twofold. First, we wished to obtain an improved set of gene models with increased coverage of known genes, while maintaining gene model quality. Second, we wished to provide a single official gene list that the research community could further utilize for consistent and comparable analyses and functional annotation. Results We created a consensus gene set for honey bee (Apis mellifera) using GLEAN, a new algorithm that uses latent class analysis to automatically combine disparate gene prediction evidence in the absence of known genes. The consensus gene models had increased representation of honey bee genes without sacrificing quality compared with any one of the input gene predictions. When compared with manually annotated gold standards, the consensus set of gene models was similar or superior in quality to each of the input sets. Conclusion Most eukaryotic genome projects produce multiple gene sets because of the variety of gene prediction programs. Each of the gene prediction programs has strengths and weaknesses, and so the multiplicity of gene sets offers users a more comprehensive collection of genes to use than is available from a single program. On the other hand, the availability of multiple gene sets is also a cause for uncertainty among users as regards which set they should use. GLEAN proved to be an effective method to combine gene lists into a single reference set.
Collapse
|
22
|
First Time in Man (FTIM) Patient Risk Management Plans (PRMPs): A Review of Small Molecules versus Biologicals. Drug Saf 2007. [DOI: 10.2165/00002018-200730100-00139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
23
|
Review of New Challenges for the Development of Early Phase Biologic Compounds in the Wake of the TGN1412 Clinical Trial. Drug Saf 2007. [DOI: 10.2165/00002018-200730100-00024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
24
|
Abstract
We report the sequence and analysis of the 814-megabase genome of the sea urchin Strongylocentrotus purpuratus, a model for developmental and systems biology. The sequencing strategy combined whole-genome shotgun and bacterial artificial chromosome (BAC) sequences. This use of BAC clones, aided by a pooling strategy, overcame difficulties associated with high heterozygosity of the genome. The genome encodes about 23,300 genes, including many previously thought to be vertebrate innovations or known only outside the deuterostomes. This echinoderm genome provides an evolutionary outgroup for the chordates and yields insights into the evolution of deuterostomes.
Collapse
|
25
|
Common inheritance of chromosome Ia associated with clonal expansion of Toxoplasma gondii. Genes Dev 2006; 16:1119-25. [PMID: 16902086 PMCID: PMC1557770 DOI: 10.1101/gr.5318106] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2006] [Accepted: 06/06/2006] [Indexed: 11/24/2022]
Abstract
Toxoplasma gondii is a globally distributed protozoan parasite that can infect virtually all warm-blooded animals and humans. Despite the existence of a sexual phase in the life cycle, T. gondii has an unusual population structure dominated by three clonal lineages that predominate in North America and Europe, (Types I, II, and III). These lineages were founded by common ancestors approximately10,000 yr ago. The recent origin and widespread distribution of the clonal lineages is attributed to the circumvention of the sexual cycle by a new mode of transmission-asexual transmission between intermediate hosts. Asexual transmission appears to be multigenic and although the specific genes mediating this trait are unknown, it is predicted that all members of the clonal lineages should share the same alleles. Genetic mapping studies suggested that chromosome Ia was unusually monomorphic compared with the rest of the genome. To investigate this further, we sequenced chromosome Ia and chromosome Ib in the Type I strain, RH, and the Type II strain, ME49. Comparative genome analyses of the two chromosomal sequences revealed that the same copy of chromosome Ia was inherited in each lineage, whereas chromosome Ib maintained the same high frequency of between-strain polymorphism as the rest of the genome. Sampling of chromosome Ia sequence in seven additional representative strains from the three clonal lineages supports a monomorphic inheritance, which is unique within the genome. Taken together, our observations implicate a specific combination of alleles on chromosome Ia in the recent origin and widespread success of the clonal lineages of T. gondii.
Collapse
|
26
|
Abstract
UNLABELLED We present SynView, a simple and generic approach to dynamically visualize multi-species comparative genome data. It is a light-weight application based on the popular and configurable web-based GBrowse framework. It can be used with a variety of databases and provides the user with a high degree of interactivity. The tool is written in Perl and runs on top of the GBrowse framework. It is in use in the PlasmoDB (http://www.PlasmoDB.org) and the CryptoDB (http://www.CryptoDB.org) projects and can be easily integrated into other cross-species comparative genome projects. AVAILABILITY The program and instructions are freely available at http://www.ApiDB.org/apps/SynView/ CONTACT jkissing@uga.edu.
Collapse
|
27
|
OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res 2006; 34:D363-8. [PMID: 16381887 PMCID: PMC1347485 DOI: 10.1093/nar/gkj123] [Citation(s) in RCA: 641] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2005] [Revised: 10/20/2005] [Accepted: 10/20/2005] [Indexed: 11/12/2022] Open
Abstract
The OrthoMCL database (http://orthomcl.cbil.upenn.edu) houses ortholog group predictions for 55 species, including 16 bacterial and 4 archaeal genomes representing phylogenetically diverse lineages, and most currently available complete eukaryotic genomes: 24 unikonts (12 animals, 9 fungi, microsporidium, Dictyostelium, Entamoeba), 4 plants/algae and 7 apicomplexan parasites. OrthoMCL software was used to cluster proteins based on sequence similarity, using an all-against-all BLAST search of each species' proteome, followed by normalization of inter-species differences, and Markov clustering. A total of 511,797 proteins (81.6% of the total dataset) were clustered into 70,388 ortholog groups. The ortholog database may be queried based on protein or group accession numbers, keyword descriptions or BLAST similarity. Ortholog groups exhibiting specific phyletic patterns may also be identified, using either a graphical interface or a text-based Phyletic Pattern Expression grammar. Information for ortholog groups includes the phyletic profile, the list of member proteins and a multiple sequence alignment, a statistical summary and graphical view of similarities, and a graphical representation of domain architecture. OrthoMCL software, the entire FASTA dataset employed and clustering results are available for download. OrthoMCL-DB provides a centralized warehouse for orthology prediction among multiple species, and will be updated and expanded as additional genome sequence data become available.
Collapse
|
28
|
Abstract
BACKGROUND Toxoplasma gondii gives rise to toxoplasmosis, among the most prevalent parasitic diseases of animals and man. Transformation of the tachzyoite stage into the latent bradyzoite-cyst form underlies chronic disease and leads to a lifetime risk of recrudescence in individuals whose immune system becomes compromised. Given the importance of tissue cyst formation, there has been intensive focus on the development of methods to study bradyzoite differentiation, although the molecular basis for the developmental switch is still largely unknown. RESULTS We have used serial analysis of gene expression (SAGE) to define the Toxoplasma gondii transcriptome of the intermediate-host life cycle that leads to the formation of the bradyzoite/tissue cyst. A broad view of gene expression is provided by >4-fold coverage from nine distinct libraries (approximately 300,000 SAGE tags) representing key developmental transitions in primary parasite populations and in laboratory strains representing the three canonical genotypes. SAGE tags, and their corresponding mRNAs, were analyzed with respect to abundance, uniqueness, and antisense/sense polarity and chromosome distribution and developmental specificity. CONCLUSION This study demonstrates that phenotypic transitions during parasite development were marked by unique stage-specific mRNAs that accounted for 18% of the total SAGE tags and varied from 1-5% of the tags in each developmental stage. We have also found that Toxoplasma mRNA pools have a unique parasite-specific composition with 1 in 5 transcripts encoding Apicomplexa-specific genes functioning in parasite invasion and transmission. Developmentally co-regulated genes were dispersed across all Toxoplasma chromosomes, as were tags representing each abundance class, and a variety of biochemical pathways indicating that trans-acting mechanisms likely control gene expression in this parasite. We observed distinct similarities in the specificity and expression levels of mRNAs in primary populations (Day-6 post-sporozoite infection) that occur prior to the onset of bradyzoite development that were uniquely shared with the virulent Type I-RH laboratory strain suggesting that development of RH may be arrested. By contrast, strains from Type II-Me49B7 and Type III-VEGmsj contain SAGE tags corresponding to bradyzoite genes, which suggests that priming of developmental expression likely plays a role in the greater capacity of these strains to complete bradyzoite development.
Collapse
|
29
|
Composite genome map and recombination parameters derived from three archetypal lineages of Toxoplasma gondii. Nucleic Acids Res 2005; 33:2980-92. [PMID: 15911631 PMCID: PMC1137028 DOI: 10.1093/nar/gki604] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Toxoplasma gondii is a highly successful protozoan parasite in the phylum Apicomplexa, which contains numerous animal and human pathogens. T.gondii is amenable to cellular, biochemical, molecular and genetic studies, making it a model for the biology of this important group of parasites. To facilitate forward genetic analysis, we have developed a high-resolution genetic linkage map for T.gondii. The genetic map was used to assemble the scaffolds from a 10X shotgun whole genome sequence, thus defining 14 chromosomes with markers spaced at ∼300 kb intervals across the genome. Fourteen chromosomes were identified comprising a total genetic size of ∼592 cM and an average map unit of ∼104 kb/cM. Analysis of the genetic parameters in T.gondii revealed a high frequency of closely adjacent, apparent double crossover events that may represent gene conversions. In addition, we detected large regions of genetic homogeneity among the archetypal clonal lineages, reflecting the relatively few genetic outbreeding events that have occurred since their recent origin. Despite these unusual features, linkage analysis proved to be effective in mapping the loci determining several drug resistances. The resulting genome map provides a framework for analysis of complex traits such as virulence and transmission, and for comparative population genetic studies.
Collapse
|
30
|
Abstract
Cryptosporidium species cause acute gastroenteritis and diarrhoea worldwide. They are members of the Apicomplexa--protozoan pathogens that invade host cells by using a specialized apical complex and are usually transmitted by an invertebrate vector or intermediate host. In contrast to other Apicomplexans, Cryptosporidium is transmitted by ingestion of oocysts and completes its life cycle in a single host. No therapy is available, and control focuses on eliminating oocysts in water supplies. Two species, C. hominis and C. parvum, which differ in host range, genotype and pathogenicity, are most relevant to humans. C. hominis is restricted to humans, whereas C. parvum also infects other mammals. Here we describe the eight-chromosome approximately 9.2-million-base genome of C. hominis. The complement of C. hominis protein-coding genes shows a striking concordance with the requirements imposed by the environmental niches the parasite inhabits. Energy metabolism is largely from glycolysis. Both aerobic and anaerobic metabolisms are available, the former requiring an alternative electron transport system in a simplified mitochondrion. Biosynthesis capabilities are limited, explaining an extensive array of transporters. Evidence of an apicoplast is absent, but genes associated with apical complex organelles are present. C. hominis and C. parvum exhibit very similar gene complements, and phenotypic differences between these parasites must be due to subtle sequence divergence.
Collapse
|
31
|
Using relational databases for improved sequence similarity searching and large-scale genomic analyses. CURRENT PROTOCOLS IN BIOINFORMATICS 2004; Chapter 9:Unit 9.4. [PMID: 18428739 DOI: 10.1002/0471250953.bi0904s7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Relational databases are designed to integrate diverse types of information and manage large sets of search results, greatly simplifying genome-scale analyses. Relational databases are essential for management and analysis of large-scale sequence analyses, and can also be used to improve the statistical significance of similarity searches by focusing on subsets of sequence libraries most likely to contain homologs. This unit describes using relational databases to improve the efficiency of sequence similarity searching and to demonstrate various large-scale genomic analyses of homology-related data. This unit describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. These include basic use of the database to generate a novel sequence library subset, how to extend and use seqdb_demo for the storage of sequence similarity search results and making use of various kinds of stored search results to address aspects of comparative genomic analysis.
Collapse
|
32
|
Abstract
The CRP (Cleavage of Radiolabeled Phosphoproteins) program guides the design and interpretation of experiments to identify protein phosphorylation sites by Edman sequencing of unseparated peptides. Traditionally, phosphorylation sites are determined by cleaving the phosphoprotein and separating the peptides for Edman 32P-phosphate release sequencing. CRP analysis of a phosphoprotein's sequence accelerates this process by omitting the separation step: given a protein sequence of interest, the CRP program performs an in silico proteolytic cleavage of the sequence and reports the predicted Edman cycles in which radioactivity would be observed if a given serine, threonine or tyrosine were phosphorylated. Experimentally observed cycles containing 32P can be compared with CRP predictions to confirm candidate sites and/or explore the ability of additional cleavage experiments to resolve remaining ambiguities. To reduce ambiguity, the phosphorylated residue (P-Tyr, P-Ser or P-Thr) can be determined experimentally, and CRP will ignore sites with alternative residues. CRP also provides simple predictions of likely phosphorylation sites using known kinase recognition motifs. The CRP interface is available at http://fasta.bioch.virginia.edu/crp.
Collapse
|
33
|
Identification of residues in glutathione transferase capable of driving functional diversification in evolution. A novel approach to protein redesign. J Biol Chem 2003; 278:8733-8. [PMID: 12486119 DOI: 10.1074/jbc.m211776200] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Evolution of protein function can be driven by positive selection of advantageous nonsynonymous codon mutations that arise following gene duplication. By observing the presence and degree of site-specific positive selection for change between divergent paralogs, residue positions responsible for functional changes can be identified. We applied this analysis to genes encoding Mu class glutathione transferases, which differ widely in substrate specificities. Approximately 3% of the amino acid residue positions, both near to and distant from the active site, are under statistically significant positive selection for change. Relevant human glutathione transferase (GST) M1-1 and GST M2-2 codons were mutated. A chemically conservative threonine to serine mutation in GST M2-2 elicited a 1,000-fold increase in specific activity with the GST M1-1-specific substrate trans-stilbene oxide and a 30-fold increase with the alternative epoxide substrates styrene oxide and nitrophenyl glycidol. The reverse mutation in GST M1-1 resulted in reciprocal decreases in activity. Thus, identification of hypervariable codon positions can be a powerful aid in the redesign of protein function, lessening the requirement for extensive mutagenesis or structural knowledge and sometimes suggesting mutations that would otherwise be considered functionally conservative.
Collapse
|
34
|
Abstract
The Gal/GalNAc lectin gene of Entamoeba histolytica is a major amebic virulence protein responsible for interaction with host tissues. We investigated sequence differences in the Gal/GalNAc lectin heavy subunit in three isolates from Bangladesh and one isolate from Georgia, each of which was determined to be genetically distinct by SREHP AluI digestion. Interestingly, we observed only slight genetic diversity in the lectin gene as compared with the HM1:IMSS laboratory strain, originally a clinical isolate from Mexico. Genetic conservation of the Gal/GalNAc lectin between isolates may reflect that the lectin is under strong functional selection or possibly, that E. histolytica is a clonal population. Sequence conservation of the lectin indicates that immune responses against it should be cross-protective.
Collapse
|
35
|
Abstract
Edman phosphate ((32)P) release sequencing provides a high sensitivity means of identifying phosphorylation sites in proteins that complements mass spectrometry techniques. We have developed a bioinformatic assessment tool, the cleavage of radiolabeled protein (CRP) program, which enables experimental identification of phosphorylation sites via (32)P labeling and Edman degradation of cleaved proteins obtained at femtomole levels. By observing the Edman cycle(s) in which radioactivity is found, candidate phosphorylation sites are identified by determining which residues occur at the observed number of cycles downstream from a peptide cleavage site. In cases where more than one residue could be responsible for the observed radioactivity, additional experiments with cleavage reagents having alternative specificities may resolve the ambiguity. Given a protein sequence and a cleavage site, CRP performs these experiments in silico, identifying resolved sites based on user-supplied experimental data, as well as suggesting combinations of reagents for additional analyses. Analysis of the PhosphoBase protein sequence database suggests that CRP data from two cleavage experiments can be used to identify unambiguously 60% of known phosphorylation sites. Data from additional cleavage experiments may increase the overall coverage to 70% of known sites. By comparing theoretical data obtained from the CRP program with (32)P release data obtained from an Edman sequencer, a known phosphorylation site was identified unambiguously and correctly. In addition, our results show that in vivo phosphorylation sites can be determined routinely by differential proteolysis analysis and Edman cycling with less than 1 fmol of protein and 1000 cpm.
Collapse
|
36
|
Getting more from less: algorithms for rapid protein identification with multiple short peptide sequences. Mol Cell Proteomics 2002; 1:139-47. [PMID: 12096132 DOI: 10.1074/mcp.m100004-mcp200] [Citation(s) in RCA: 179] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
We describe two novel sequence similarity search algorithms, FASTS and FASTF, that use multiple short peptide sequences to identify homologous sequences in protein or DNA databases. FASTS searches with peptide sequences of unknown order, as obtained by mass spectrometry-based sequencing, evaluating all possible arrangements of the peptides. FASTF searches with mixed peptide sequences, as generated by Edman sequencing of unseparated mixtures of peptides. FASTF deconvolutes the mixture, using a greedy heuristic that allows rapid identification of high scoring alignments while reducing the total number of explored alternatives. Both algorithms use the heuristic FASTA comparison strategy to accelerate the search but use alignment probability, rather than similarity score, as the criterion for alignment optimality. Statistical estimates are calculated using an empirical correction to a theoretical probability. These calculated estimates were accurate within a factor of 10 for FASTS and 1000 for FASTF on our test dataset. FASTS requires only 15-20 total residues in three or four peptides to robustly identify homologues sharing 50% or greater protein sequence identity. FASTF requires about 25% more sequence data than FASTS for equivalent sensitivity, but additional sequence data are usually available from mixed Edman experiments. Thus, both algorithms can identify homologues that diverged 100 to 500 million years ago, allowing proteomic identification from organisms whose genomes have not been sequenced.
Collapse
|
37
|
Abstract
SUMMARY A general system for performing multiple independent database searches in parallel is presented. Run-time addition and removal of clients, robust failure and error trapping and near 100% efficiency with very large numbers of clients are achieved by a flexible asynchronous, client-driven approach.
Collapse
|
38
|
Bactericidal effects of sugar-fed antibiotics on resident midgut bacteria of newly emerged anopheline mosquitoes (Diptera: Culicidae). JOURNAL OF MEDICAL ENTOMOLOGY 2000; 37:246-249. [PMID: 10730495 DOI: 10.1093/jmedent/37.2.246] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
A protocol was developed for significantly reducing resident midgut bacteria in newly emerged anopheline mosquitoes using a combination of antibiotics. Pupa harvested from colony-reared Anopheles gambiae s.l. Giles and Anopheles stephensi (Liston) were placed in cages wiped previously with 70% alcohol and kept under UV light for 24 h. Emerging adult mosquitoes were fed for 3 consecutive days on antibiotic solution, consisting of 0.4% gentamicin sulfate and 1% penicillin-streptomycin solution in a 10% sterile sucrose solution. Bacterial suspensions of Escherichia coli, Klebsiella pneumoniae (Schroeter, 1886), and Pseudomonas stutzeri (Lehmann & Neumann, 1896) isolated from wild-caught anophelines were fed to antibiotic-treated mosquitoes starved for 24 h via either sugar or membrane-feeding. Mosquitoes dissected 1 and 24 h after blood-feeding or sugar-feeding, and plated on trypticase soy agar plates, yielded the same type of bacteria fed originally without evidence of contaminants. There was no residual effect of the antibiotics on introduced single bacteria strains as judged by the presence of bacteria in antibiotic-treated mosquitoes. This experimental reduction of resident midgut bacteria and their replacement with single strains in newly emerged anopheline mosquitoes should facilitate further investigations of the interactions between malaria parasites and bacteria found in the midguts of mosquitoes.
Collapse
|
39
|
Abstract
The membrane cytoskeletal component dystrophin and its associated glycoproteins play a central role in the molecular pathogenesis of several muscular dystrophies, i.e. Duchenne/Becker muscular dystrophy, congenital muscular dystrophy and various forms of limb-girdle muscular dystrophy. Although the most frequent of these disorders, Duchenne muscular dystrophy, is mainly recognized as a disease of skeletal muscle fibers, pathophysiological changes also involve the heart and diaphragm, as well as the peripheral and central nervous system. Thus current research efforts into the elucidation of the molecular mechanisms underlying these genetic diseases are not only directed towards studying skeletal muscle necrosis but also investigate abnormalities of heart and brain dystrophin-glycoprotein complexes in cardiomyopathy and brain deficiencies associated with muscular dystrophy. Furthermore, many isoforms of dystrophin and dystrophin-associated components have been identified in various non-muscle tissues and their function(s) are mostly unknown. With respect to skeletal muscle fibers, the characterization of new dystrophin-associated proteins, such as dystrobrevin, sarcospan and the syntrophins, led to a modified model of the spatial configuration of the dystrophin-glycoprotein complex. However, it is generally accepted now that beta-dystroglycan forms the plasmalemma-spanning linkage between dystrophin and the laminin-binding protein alpha-dystroglycan and that this complex is associated with the sarcoglycan subcomplex of sarcolemmal glycoproteins.
Collapse
|
40
|
Genetic mapping of a murine locus controlling development of T helper 1/T helper 2 type responses. Proc Natl Acad Sci U S A 1996; 93:12467-72. [PMID: 8901605 PMCID: PMC38015 DOI: 10.1073/pnas.93.22.12467] [Citation(s) in RCA: 153] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Genetic background of the T cell can influence T helper (Th) phenotype development, with some murine strains (e.g., B10.D2) favoring Th1 development and others (e.g., BALB/c) favoring Th2 development. Recently we found that B10.D2 exhibit an intrinsically greater capacity to maintain interleukin 12 (IL-12) responsiveness under neutral conditions in vitro compared with BALB/c T cells, allowing for prolonged capacity to undergo IL-12-induced Th1 development. To begin identification of the loci controlling this genetic effect, we used a T-cell antigen receptor-transgenic system for in vitro analysis of intercrosses between BALB/c and B10.D2 mice and have identified a locus on murine chromosome 11 that controls the maintenance of IL-12 responsiveness, and therefore the subsequent Th1/Th2 response. This chromosomal region is syntenic with a locus on human chromosome 5q31.1 shown to be associated with elevated serum IgE levels, suggesting that genetic control of Th1/Th2 differentiation in mouse, and of atopy development in humans, may be expressed through similar mechanisms.
Collapse
|
41
|
Abstract
The genetic background of T lymphocytes influences development of the T helper (TH) phenotype, resulting in either resistance or susceptibility of certain mouse strains to pathogens such as Leishmania major. With an in vitro model system, a difference in maintenance of responsiveness of T cells to interleukin-12 (IL-12) was detected between BALB/c and B10.D2 mice. Although naive T cells from both strains initially responded to IL-12, BALB/c T cells lost IL-12 responsiveness after stimulation with antigen in vitro, even when cocultured with B10.D2 T cells. Thus, susceptibility of BALB/c mice to infection with L. major may derive from the loss of the ability to generate IL-12-induced TH1 responses rather than from an IL-4-induced TH2 response.
Collapse
|