101
|
Lennquist A, Mårtensson Lindblad LGE, Hedberg D, Kristiansson E, Förlin L. Colour and melanophore function in rainbow trout after long term exposure to the new antifoulant medetomidine. CHEMOSPHERE 2010; 80:1050-1055. [PMID: 20538317 DOI: 10.1016/j.chemosphere.2010.05.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2009] [Revised: 05/04/2010] [Accepted: 05/10/2010] [Indexed: 05/29/2023]
Abstract
Medetomidine is a new antifouling agent, and its effects in non-target aquatic organisms have been investigated. Earlier short-term studies in fish have shown a skin lightening response to medetomidine, but effects after chronic exposure have not been studied. In fish, the dark pigment melanin is contained within specialized cells, melanophores. Medetomidine binds to the melanophore alpha2-adrenoceptor, which stimulates pigment aggregation resulting in the light appearance. In the present study, rainbow trout (Oncorhynchus mykiss) was long-term exposed to 0.5 and 5.0 nM of medetomidine via water for 54 d. The fish were then photographed for paleness quantification and the images were analyzed using ImageJ analysis software. Additionally, scales were removed and used for in vitro function studies of the melanophores, monitoring the response to melanophore stimulating hormone (MSH) and subsequent medetomidine addition. The number of melanophores was also investigated. As a result of the medetomidine exposure, fish from the 5 nM treatment were significantly paler than control fish and the melanophores from these fishes were also more aggregated. Melanophores from all the treatments were functional, responding to MSH by dispersion and to subsequent medetomidine by aggregation. However, the results indicate a difference in sensitivity among treatments. The number of melanophores in the scales did not change significantly after long term exposure to medetomidine. These results suggest that the observed paleness may be reversible, even after chronic exposure.
Collapse
|
102
|
Arvidsson Y, Bergström A, Arvidsson L, Kristiansson E, Ahlman H, Nilsson O. Hypoxia stimulates CXCR4 signalling in ileal carcinoids. Endocr Relat Cancer 2010; 17:303-16. [PMID: 20071457 DOI: 10.1677/erc-09-0085] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Tumour hypoxia is associated with increased metastatic potential and resistance to radiotherapy and chemotherapy. Ileal carcinoids are usually metastatic at the time of diagnosis and respond poorly to chemotherapy. The aim of this study was to investigate the extent of hypoxia in ileal carcinoids and the response of tumour cells to induced hypoxia. Vascular endothelial growth factor (VEGF), carbonic anhydrase (CA-IX), hypoxia-inducible factor (HIF)-1alpha and HIF-2alpha were studied by immunohistochemistry in biopsies from 24 patients with ileal carcinoids. All hypoxic markers were shown to be highly expressed in localized areas of the tumours irrespective of tumour location or stage. However, HIF-2alpha expression was significantly higher in distant metastases compared to primary tumours in the same patient. Global gene expression profiling of GOT1 carcinoid cells revealed a marked response to hypoxia. Expression of genes related to epithelial-to-mesenchymal transition and development was altered including increased expression of the C-X-C chemokine receptor type 4 (CXCR4), an important regulator of invasive growth and metastasis formation. High expression of CXCR4 was confirmed by immunohistochemistry in tumour biopsies. Stimulation of GOT1 cells by the CXCR4 ligand, CXCL12 (stromal cell-derived factor 1 (SDF-1)), activated the mitogen-activated protein kinase (MAPK) p42/44 signalling pathway and increased tumour cell migration. We conclude that ileal carcinoids contain hypoxic areas expressing HIF-1alpha, HIF-2alpha and CXCR4. Signalling through the CXCL12-CXCR4 axis may contribute to the metastatic potential of ileal carcinoids. Targeting of HIFs and/or the CXCR4 signalling pathway may offer new therapeutic strategies for carcinoid tumour disease.
Collapse
|
103
|
Lagerstedt KK, Kristiansson E, Lönnroth C, Andersson M, Iresjö BM, Gustafsson A, Hansson E, Kressner U, Nordgren S, Enlund F, Lundholm K. Genes with relevance for early to late progression of colon carcinoma based on combined genomic and transcriptomic information from the same patients. Cancer Inform 2010; 9:79-91. [PMID: 20467480 PMCID: PMC2867635 DOI: 10.4137/cin.s4545] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Genetic and epigenetic alterations in colorectal cancer are numerous. However, it is difficult to judge whether such changes are primary or secondary to the appearance and progression of tumors. Therefore, the aim of the present study was to identify altered DNA regions with significant covariation to transcription alterations along colon cancer progression. METHODS Tumor and normal colon tissue were obtained at primary operations from 24 patients selected by chance. DNA, RNA and microRNAs were extracted from the same biopsy material in all individuals and analyzed by oligo-nucleotide array-based comparative genomic hybridization (CGH), mRNA- and microRNA oligo-arrays. Statistical analyses were performed to assess statistical interactions (correlations, co-variations) between DNA copy number changes and significant alterations in gene and microRNA expression using appropriate parametric and non-parametric statistics. RESULTS Main DNA alterations were located on chromosome 7, 8, 13 and 20. Tumor DNA copy number gain increased with tumor progression, significantly related to increased gene expression. Copy number loss was not observed in Dukes A tumors. There was no significant relationship between expressed genes and tumor progression across Dukes A-D tumors; and no relationship between tumor stage and the number of microRNAs with significantly altered expression. Interaction analyses identified overall 41 genes, which discriminated early Dukes A plus B tumors from late Dukes C plus D tumor; 28 of these genes remained with correlations between genomic and transcriptomic alterations in Dukes C plus D tumors and 17 in Dukes D. One microRNA (microR-663) showed interactions with DNA alterations in all Dukes A-D tumors. CONCLUSIONS Our modeling confirms that colon cancer progression is related to genomic instability and altered gene expression. However, early invasive tumor growth seemed rather related to transcriptomic alterations, where changes in microRNA may be an early phenomenon, and less to DNA copy number changes.
Collapse
|
104
|
Selpi, Bryant CH, Kemp GJL, Sarv J, Kristiansson E, Sunnerhagen P. Predicting functional upstream open reading frames in Saccharomyces cerevisiae. BMC Bioinformatics 2009; 10:451. [PMID: 20042076 PMCID: PMC2813248 DOI: 10.1186/1471-2105-10-451] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 12/30/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Some upstream open reading frames (uORFs) regulate gene expression (i.e., they are functional) and can play key roles in keeping organisms healthy. However, how uORFs are involved in gene regulation is not yet fully understood. In order to get a complete view of how uORFs are involved in gene regulation, it is expected that a large number of experimentally verified functional uORFs are needed. Unfortunately, wet-experiments to verify that uORFs are functional are expensive. RESULTS In this paper, a new computational approach to predicting functional uORFs in the yeast Saccharomyces cerevisiae is presented. Our approach is based on inductive logic programming and makes use of a novel combination of knowledge about biological conservation, Gene Ontology annotations and genes' responses to different conditions. Our method results in a set of simple and informative hypotheses with an estimated sensitivity of 76%. The hypotheses predict 301 further genes to have 398 novel functional uORFs. Three (RPC11, TPK1, and FOL1) of these 301 genes have been hypothesised, following wet-experiments, by a related study to have functional uORFs. A comparison with another related study suggests that eleven of the predicted functional uORFs from genes LDB17, HEM3, CIN8, BCK2, PMC1, FAS1, APP1, ACC1, CKA2, SUR1, and ATH1 are strong candidates for wet-lab experimental studies. CONCLUSIONS Learning based prediction of functional uORFs can be done with a high sensitivity. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help to elucidate the regulatory roles of uORFs.
Collapse
|
105
|
Gunnarsson L, Kristiansson E, Rutgersson C, Sturve J, Fick J, Förlin L, Larsson DGJ. Pharmaceutical industry effluent diluted 1:500 affects global gene expression, cytochrome P450 1A activity, and plasma phosphate in fish. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2009; 28:2639-47. [PMID: 19610678 DOI: 10.1897/09-120.1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2009] [Accepted: 06/30/2009] [Indexed: 05/20/2023]
Abstract
Patancheru, near Hyderabad, India, is a major production site for the global bulk drug market. Approximately 90 manufacturers send their wastewater to a common treatment plant in Patancheru. Extraordinary high levels of a wide range of pharmaceuticals have recently been demonstrated in the treated effluent. As little as 0.2% of this effluent can strongly reduce the growth rate of tadpoles, but the underlying mechanisms of toxicity are not known. To begin addressing how the effluent affects aquatic vertebrates, rainbow trout (Oncorhynchus mykiss) were exposed to 0.2% effluent for 5 d. Several physiological endpoints, together with effects on global hepatic gene expression patterns, were analyzed. The exposed fish showed both an induction of hepatic cytochrome P450 1A (CYP1A) gene expression, as well as enzyme activity. Clinical blood chemistry analyses revealed an increase in plasma phosphate levels, which in humans indicates impaired kidney function. Several oxidative stress-related genes were induced in the livers; however, no significant changes in antioxidant enzyme activities or in the hepatic glutathione levels were found. Furthermore, estrogen-regulated genes were slightly up-regulated following exposure, and moderate levels of estriol were detected in the effluent. The present study identifies changes in gene expression triggered by exposure to a high dilution of the effluent, supporting the hypothesis that these fish are responding to chemical exposure. The pattern of regulated genes may contribute to the identification of mechanisms of sublethal toxicity, as well as illuminate possible causative agents.
Collapse
|
106
|
Kristiansson E, Hugenholtz P, Dalevi D. ShotgunFunctionalizeR: an R-package for functional comparison of metagenomes. ACTA ACUST UNITED AC 2009; 25:2737-8. [PMID: 19696045 DOI: 10.1093/bioinformatics/btp508] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
UNLABELLED Microorganisms are ubiquitous in nature and constitute intrinsic parts of almost every ecosystem. A culture-independent and powerful way to study microbial communities is metagenomics. In such studies, functional analysis is performed on fragmented genetic material from multiple species in the community. The recent advances in high-throughput sequencing have greatly increased the amount of data in metagenomic projects. At present, there is an urgent need for efficient statistical tools to analyse these data. We have created ShotgunFunctionalizeR, an R-package for functional comparison of metagenomes. The package contains tools for importing, annotating and visualizing metagenomic data produced by shotgun high-throughput sequencing. ShotgunFunctionalizeR contains several statistical procedures for assessing functional differences between samples, both for individual genes and for entire pathways. In addition to standard and previously published methods, we have developed and implemented a novel approach based on a Poisson model. This procedure is highly flexible and thus applicable to a wide range of different experimental designs. We demonstrate the potential of ShotgunFunctionalizeR by performing a regression analysis on metagenomes sampled at multiple depths in the Pacific Ocean. AVAILABILITY http://shotgun.zool.gu.se
Collapse
|
107
|
Kristiansson E, Asker N, Förlin L, Larsson DGJ. Characterization of the Zoarces viviparus liver transcriptome using massively parallel pyrosequencing. BMC Genomics 2009; 10:345. [PMID: 19646242 PMCID: PMC2725146 DOI: 10.1186/1471-2164-10-345] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2009] [Accepted: 07/31/2009] [Indexed: 11/16/2022] Open
Abstract
Background The teleost Zoarces viviparus (eelpout) lives along the coasts of Northern Europe and has long been an established model organism for marine ecology and environmental monitoring. The scarce information about this species genome has however restrained the use of efficient molecular-level assays, such as gene expression microarrays. Results In the present study we present the first comprehensive characterization of the Zoarces viviparus liver transcriptome. From 400,000 reads generated by massively parallel pyrosequencing, more than 50,000 pieces of putative transcripts were assembled, annotated and functionally classified. The data was estimated to cover roughly 40% of the total transcriptome and homologues for about half of the genes of Gasterosteus aculeatus (stickleback) were identified. The sequence data was consequently used to design an oligonucleotide microarray for large-scale gene expression analysis. Conclusion Our results show that one run using a Genome Sequencer FLX from 454 Life Science/Roche generates enough genomic information for adequate de novo assembly of a large number of genes in a higher vertebrate. The generated sequence data, including the validated microarray probes, are publicly available to promote genome-wide research in Zoarces viviparus.
Collapse
|
108
|
Nilsson RH, Ryberg M, Abarenkov K, Sjökvist E, Kristiansson E. The ITS region as a target for characterization of fungal communities using emerging sequencing technologies. FEMS Microbiol Lett 2009; 296:97-101. [PMID: 19459974 DOI: 10.1111/j.1574-6968.2009.01618.x] [Citation(s) in RCA: 169] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
The advent of new high-throughput DNA-sequencing technologies promises to redefine the way in which fungi and fungal communities--as well as other groups of organisms--are studied in their natural environment. With read lengths of some few hundred base pairs, massively parallel sequencing (pyrosequencing) stands out among the new technologies as the most apt for large-scale species identification in environmental samples. Although parallel pyrosequencing can generate hundreds of thousands of sequences at an exceptional speed, the limited length of the reads may pose a problem to the species identification process. This study explores whether the discrepancy in read length between parallel pyrosequencing and traditional (Sanger) sequencing will have an impact on the perceived taxonomic affiliation of the underlying species. Based on all 39,200 publicly available fungal environmental DNA sequences representing the nuclear ribosomal internal transcribed spacer (ITS) region, the results show that the two approaches give rise to quite different views of the diversity of the underlying samples. Standardization of which subregion from the ITS region should be sequenced, as well as a recognition that the composition of fungal communities as depicted through different sequencing methods need not be directly comparable, appear crucial to the integration of the new sequencing technologies with current mycological praxis.
Collapse
|
109
|
Thorsen M, Perrone GG, Kristiansson E, Traini M, Ye T, Dawes IW, Nerman O, Tamás MJ. Genetic basis of arsenite and cadmium tolerance in Saccharomyces cerevisiae. BMC Genomics 2009; 10:105. [PMID: 19284616 PMCID: PMC2660369 DOI: 10.1186/1471-2164-10-105] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2008] [Accepted: 03/12/2009] [Indexed: 11/10/2022] Open
Abstract
Background Arsenic and cadmium are widely distributed in nature and pose serious threats to the environment and human health. Exposure to these nonessential toxic metals may result in a variety of human diseases including cancer. However, arsenic and cadmium toxicity targets and the cellular systems contributing to tolerance acquisition are not fully known. Results To gain insight into metal action and cellular tolerance mechanisms, we carried out genome-wide screening of the Saccharomyces cerevisiae haploid and homozygous diploid deletion mutant collections and scored for reduced growth in the presence of arsenite or cadmium. Processes found to be required for tolerance to both metals included sulphur and glutathione biosynthesis, environmental sensing, mRNA synthesis and transcription, and vacuolar/endosomal transport and sorting. We also identified metal-specific defence processes. Arsenite-specific defence functions were related to cell cycle regulation, lipid and fatty acid metabolism, mitochondrial biogenesis, and the cytoskeleton whereas cadmium-specific defence functions were mainly related to sugar/carbohydrate metabolism, and metal-ion homeostasis and transport. Molecular evidence indicated that the cytoskeleton is targeted by arsenite and that phosphorylation of the Snf1p kinase is required for cadmium tolerance. Conclusion This study has pin-pointed core functions that protect cells from arsenite and cadmium toxicity. It also emphasizes the existence of both common and specific defence systems. Since many of the yeast genes that confer tolerance to these agents have homologues in humans, similar biological processes may act in yeast and humans to prevent metal toxicity and carcinogenesis.
Collapse
|
110
|
Kristiansson E, Thorsen M, Tamás MJ, Nerman O. Evolutionary forces act on promoter length: identification of enriched cis-regulatory elements. Mol Biol Evol 2009; 26:1299-307. [PMID: 19258451 DOI: 10.1093/molbev/msp040] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Transcription factors govern gene expression by binding to short DNA sequences called cis-regulatory elements. These sequences are typically located in promoters, which are regions of variable length upstream of the open reading frames of genes. Here, we report that promoter length and gene function are related in yeast, fungi, and plants. In particular, the promoters for stress-responsive genes are in general longer than those of other genes. Essential genes have, on the other hand, relatively short promoters. We utilize these findings in a novel method for identifying relevant cis-regulatory elements in a set of coexpressed genes. The method is shown to generate more accurate results and fewer false positives compared with other common procedures. Our results suggest that genes with complex transcriptional regulation tend to have longer promoters than genes responding to few signals. This phenomenon is present in all investigated species, indicating that evolution adjust promoter length according to gene function. Identification of cis-regulatory elements in Saccharomyces cerevisiae can be done with the web service located at http://enricher.zool.gu.se.
Collapse
|
111
|
Nilsson RH, Bok G, Ryberg M, Kristiansson E, Hallenberg N. A software pipeline for processing and identification of fungal ITS sequences. SOURCE CODE FOR BIOLOGY AND MEDICINE 2009; 4:1. [PMID: 19146660 PMCID: PMC2649129 DOI: 10.1186/1751-0473-4-1] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2008] [Accepted: 01/15/2009] [Indexed: 11/17/2022]
Abstract
BACKGROUND Fungi from environmental samples are typically identified to species level through DNA sequencing of the nuclear ribosomal internal transcribed spacer (ITS) region for use in BLAST-based similarity searches in the International Nucleotide Sequence Databases. These searches are time-consuming and regularly require a significant amount of manual intervention and complementary analyses. We here present software - in the form of an identification pipeline for large sets of fungal ITS sequences - developed to automate the BLAST process and several additional analysis steps. The performance of the pipeline was evaluated on a dataset of 350 ITS sequences from fungi growing as epiphytes on building material. RESULTS The pipeline was written in Perl and uses a local installation of NCBI-BLAST for the similarity searches of the query sequences. The variable subregion ITS2 of the ITS region is extracted from the sequences and used for additional searches of higher sensitivity. Multiple alignments of each query sequence and its closest matches are computed, and query sequences sharing at least 50% of their best matches are clustered to facilitate the evaluation of hypothetically conspecific groups. The pipeline proved to speed up the processing, as well as enhance the resolution, of the evaluation dataset considerably, and the fungi were found to belong chiefly to the Ascomycota, with Penicillium and Aspergillus as the two most common genera. The ITS2 was found to indicate a different taxonomic affiliation than did the complete ITS region for 10% of the query sequences, though this figure is likely to vary with the taxonomic scope of the query sequences. CONCLUSION The present software readily assigns large sets of fungal query sequences to their respective best matches in the international sequence databases and places them in a larger biological context. The output is highly structured to be easy to process, although it still needs to be inspected and possibly corrected for the impact of the incomplete and sometimes erroneously annotated fungal entries in these databases. The open source pipeline is available for UNIX-type platforms, and updated releases of the target database are made available biweekly. The pipeline is easily modified to operate on other molecular regions and organism groups.
Collapse
|
112
|
Ryberg M, Kristiansson E, Sjökvist E, Nilsson RH. An outlook on the fungal internal transcribed spacer sequences in GenBank and the introduction of a web-based tool for the exploration of fungal diversity. THE NEW PHYTOLOGIST 2009; 181:471-477. [PMID: 19121041 DOI: 10.1111/j.1469-8137.2008.02667.x] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The environmental and distributional data associated with fungal internal transcribed spacer (ITS) sequences in GenBank are investigated and a new web-based tool with which these sequences can be explored is introduced. All fungal ITS sequences in GenBank were classified as either identified to species level or insufficiently identified and compared using BLAST. The results are made available as a biweekly updated web service that can be queried to retrieve all insufficiently identified sequences (IIS) associated with any fungal genus. The most commonly available annotation items in GenBank are isolation source (55%); country of origin (50%); and specific host (38%). The molecular sampling of fungi shows a bias towards North America, Europe, China, and Japan whereas vast geographical areas remain effectively unexplored. Mycorrhizal and parasitic genera are on average associated with more IIS than are saprophytic taxa. Glomus, Alternaria, and Tomentella are the genera represented by the highest number of insufficiently identified ITS sequences in GenBank. The web service presented (http://andromeda.botany.gu.se/emerencia.html#genus_search) offers new means, particularly for mycorrhizal and plant pathogenic fungi, to examine the IIS in GenBank in a taxon-oriented framework and to explore their metadata in an easily accessible and time-efficient manner.
Collapse
|
113
|
Gunnarsson L, Jauhiainen A, Kristiansson E, Nerman O, Larsson DGJ. Evolutionary conservation of human drug targets in organisms used for environmental risk assessments. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2008; 42:5807-13. [PMID: 18754513 DOI: 10.1021/es8005173] [Citation(s) in RCA: 376] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Pharmaceuticals are typically found in very low concentrations in the aquatic environment. Accordingly, environmental effects clearly assigned to residual drugs are consistent with high affinity interactions with conserved targets in affected wildlife species rather than with a general toxic effect. Thus, evolutionarily well-conserved targets in a given species are associated with an increased risk. In this study orthologs for 1318 human drug targets were predicted in 16 species of which several are relevant for ecotoxicity testing. The conservation of different functional categories of targets was also analyzed. Zebrafish had orthologs to 86% of the drug targets while only 61% were conserved in Daphnia and 35% in green alga. The predicted presence and absence of orthologs agrees well with published experimental data on the potential for specific drug target interaction in various species. Based on the conservation of targets we propose that aquatic environmental risk assessments for human drugs should always include comprehensive studies on aquatic vertebrates. Furthermore, individual targets, especially enzymes, are well conserved suggesting that tests on evolutionarily distant organisms would be highly relevant for certain drugs. We propose that the results can guide environmental risk assessments by improving the possibilities to identify species sensitive to certain types of pharmaceuticals or to other contaminants that act through well defined mechanisms of action. Moreover, we suggest that the results can be used to interpret the relevance of existing ecotoxicity data.
Collapse
|
114
|
Nilsson RH, Kristiansson E, Ryberg M, Hallenberg N, Larsson KH. Intraspecific ITS variability in the kingdom fungi as expressed in the international sequence databases and its implications for molecular species identification. Evol Bioinform Online 2008; 4:193-201. [PMID: 19204817 PMCID: PMC2614188 DOI: 10.4137/ebo.s653] [Citation(s) in RCA: 546] [Impact Index Per Article: 34.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The internal transcribed spacer (ITS) region of the nuclear ribosomal repeat unit is the most popular locus for species identification and subgeneric phylogenetic inference in sequence-based mycological research. The region is known to show certain variability even within species, although its intraspecific variability is often held to be limited and clearly separated from interspecific variability. The existence of such a divide between intra- and interspecific variability is implicitly assumed by automated approaches to species identification, but whether intraspecific variability indeed is negligible within the fungal kingdom remains contentious. The present study estimates the intraspecific ITS variability in all fungi presently available to the mycological community through the international sequence databases. Substantial differences were found within the kingdom, and the results are not easily correlated to the taxonomic affiliation or nutritional mode of the taxa considered. No single unifying yet stringent upper limit for intraspecific variability, such as the canonical 3% threshold, appears to be applicable with the desired outcome throughout the fungi. Our results caution against simplified approaches to automated ITS-based species delimitation and reiterate the need for taxonomic expertise in the translation of sequence data into species names.
Collapse
|
115
|
Ryberg M, Nilsson RH, Kristiansson E, Töpel M, Jacobsson S, Larsson E. Mining metadata from unidentified ITS sequences in GenBank: a case study in Inocybe (Basidiomycota). BMC Evol Biol 2008; 8:50. [PMID: 18282272 PMCID: PMC2275786 DOI: 10.1186/1471-2148-8-50] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2007] [Accepted: 02/18/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The lack of reference sequences from well-identified mycorrhizal fungi often poses a challenge to the inference of taxonomic affiliation of sequences from environmental samples, and many environmental sequences are thus left unidentified. Such unidentified sequences belonging to the widely distributed ectomycorrhizal fungal genus Inocybe (Basidiomycota) were retrieved from GenBank and divided into species that were identified in a phylogenetic context using a reference dataset from an ongoing study of the genus. The sequence metadata of the unidentified Inocybe sequences stored in GenBank, as well as data from the corresponding original papers, were compiled and used to explore the ecology and distribution of the genus. In addition, the relative occurrence of Inocybe was contrasted to that of other mycorrhizal genera. RESULTS Most species of Inocybe were found to have less than 3% intraspecific variability in the ITS2 region of the nuclear ribosomal DNA. This cut-off value was used jointly with phylogenetic analysis to delimit and identify unidentified Inocybe sequences to species level. A total of 177 unidentified Inocybe ITS sequences corresponding to 98 species were recovered, 32% of which were successfully identified to species level in this study. These sequences account for an unexpectedly large proportion of the publicly available unidentified fungal ITS sequences when compared with other mycorrhizal genera. Eight Inocybe species were reported from multiple hosts and some even from hosts forming arbutoid or orchid mycorrhizae. Furthermore, Inocybe sequences have been reported from four continents and in climate zones ranging from cold temperate to equatorial climate. Out of the 19 species found in more than one study, six were found in both Europe and North America and one was found in both Europe and Japan, indicating that at least many north temperate species have a wide distribution. CONCLUSION Although DNA-based species identification and circumscription are associated with practical and conceptual difficulties, they also offer new possibilities and avenues for research. Metadata assembly holds great potential to synthesize valuable information from community studies for use in a species and taxonomy-oriented framework.
Collapse
|
116
|
Sjögren A, Kristiansson E, Rudemo M, Nerman O. Weighted analysis of general microarray experiments. BMC Bioinformatics 2007; 8:387. [PMID: 17937807 PMCID: PMC2175522 DOI: 10.1186/1471-2105-8-387] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2007] [Accepted: 10/15/2007] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND In DNA microarray experiments, measurements from different biological samples are often assumed to be independent and to have identical variance. For many datasets these assumptions have been shown to be invalid and typically lead to too optimistic p-values. A method called WAME has been proposed where a variance is estimated for each sample and a covariance is estimated for each pair of samples. The current version of WAME is, however, limited to experiments with paired design, e.g. two-channel microarrays. RESULTS The WAME procedure is extended to general microarray experiments, making it capable of handling both one- and two-channel datasets. Two public one-channel datasets are analysed and WAME detects both unequal variances and correlations. WAME is compared to other common methods: fold-change ranking, ordinary linear model with t-tests, LIMMA and weighted LIMMA. The p-value distributions are shown to differ greatly between the examined methods. In a resampling-based simulation study, the p-values generated by WAME are found to be substantially more correct than the alternatives when a relatively small proportion of the genes is regulated. WAME is also shown to have higher power than the other methods. WAME is available as an R-package. CONCLUSION The WAME procedure is generalized and the limitation to paired-design microarray datasets is removed. The examined other methods produce invalid p-values in many cases, while WAME is shown to produce essentially valid p-values when a relatively small proportion of genes is regulated. WAME is also shown to have higher power than the examined alternative methods.
Collapse
|
117
|
Thorsen M, Lagniel G, Kristiansson E, Junot C, Nerman O, Labarre J, Tamás MJ. Quantitative transcriptome, proteome, and sulfur metabolite profiling of the Saccharomyces cerevisiae response to arsenite. Physiol Genomics 2007; 30:35-43. [PMID: 17327492 DOI: 10.1152/physiolgenomics.00236.2006] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Arsenic is ubiquitously present in nature, and various mechanisms have evolved enabling cells to evade toxicity and acquire tolerance. Herein, we explored how Saccharomyces cerevisiae (budding yeast) respond to trivalent arsenic (arsenite) by quantitative transcriptome, proteome, and sulfur metabolite profiling. Arsenite exposure affected transcription of genes encoding functions related to protein biosynthesis, arsenic detoxification, oxidative stress defense, redox maintenance, and proteolytic activity. Importantly, we observed that nearly all components of the sulfate assimilation and glutathione biosynthesis pathways were induced at both gene and protein levels. Kinetic metabolic profiling evidenced a significant increase in the pools of sulfur metabolites as well as elevated cellular glutathione levels. Moreover, the flux in the sulfur assimilation pathway as well as the glutathione synthesis rate strongly increased with a concomitant reduction of sulfur incorporation into proteins. By combining comparative genomics and molecular analyses, we pinpointed transcription factors that mediate the core of the transcriptional response to arsenite. Taken together, our data reveal that arsenite-exposed cells channel a large part of assimilated sulfur into glutathione biosynthesis, and we provide evidence that the transcriptional regulators Yap1p and Met4p control this response in concert.
Collapse
|
118
|
Nilsson RH, Ryberg M, Kristiansson E, Abarenkov K, Larsson KH, Kõljalg U. Taxonomic reliability of DNA sequences in public sequence databases: a fungal perspective. PLoS One 2006; 1:e59. [PMID: 17183689 PMCID: PMC1762357 DOI: 10.1371/journal.pone.0000059] [Citation(s) in RCA: 384] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2006] [Accepted: 10/26/2006] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND DNA sequences are increasingly seen as one of the primary information sources for species identification in many organism groups. Such approaches, popularly known as barcoding, are underpinned by the assumption that the reference databases used for comparison are sufficiently complete and feature correctly and informatively annotated entries. METHODOLOGY/PRINCIPAL FINDINGS The present study uses a large set of fungal DNA sequences from the inclusive International Nucleotide Sequence Database to show that the taxon sampling of fungi is far from complete, that about 20% of the entries may be incorrectly identified to species level, and that the majority of entries lack descriptive and up-to-date annotations. CONCLUSIONS The problems with taxonomic reliability and insufficient annotations in public DNA repositories form a tangible obstacle to sequence-based species identification, and it is manifest that the greatest challenges to biological barcoding will be of taxonomical, rather than technical, nature.
Collapse
|
119
|
Kristiansson E, Sjögren A, Rudemo M, Nerman O. Quality Optimised Analysis of General Paired Microarray Experiments. Stat Appl Genet Mol Biol 2006; 5:Article10. [PMID: 16646864 DOI: 10.2202/1544-6115.1209] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In microarray experiments, several steps may cause sub-optimal quality and the need for quality control is strong. Often the experiments are complex, with several conditions studied simultaneously. A linear model for paired microarray experiments is proposed as a generalisation of the paired two-sample method by Kristiansson et al. (2005). Quality variation is modelled by different variance scales for different (pairs of) arrays, and shared sources of variation are modelled by covariances between arrays. The gene-wise variance estimates are moderated in an empirical Bayes approach. Due to correlations all data is typically used in the inference of any linear combination of parameters. Both real and simulated data are analysed. Unequal variances and strong correlations are found in real data, leading to further examination of the fit of the model and of the nature of the datasets in general. The empirical distributions of the test-statistics are found to have a considerably improved match to the null distribution compared to previous methods, which implies more correct p-values provided that most genes are non-differentially expressed. In fact, assuming independent observations with identical variances typically leads to optimistic p-values. The method is shown to perform better than the alternatives in the simulation study.
Collapse
|
120
|
Kristiansson E, Sjögren A, Rudemo M, Nerman O. Weighted analysis of paired microarray experiments. Stat Appl Genet Mol Biol 2005; 4:Article30. [PMID: 16646849 DOI: 10.2202/1544-6115.1160] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In microarray experiments quality often varies, for example between samples and between arrays. The need for quality control is therefore strong. A statistical model and a corresponding analysis method is suggested for experiments with pairing, including designs with individuals observed before and after treatment and many experiments with two-colour spotted arrays. The model is of mixed type with some parameters estimated by an empirical Bayes method. Differences in quality are modelled by individual variances and correlations between repetitions. The method is applied to three real and several simulated datasets. Two of the real datasets are of Affymetrix type with patients profiled before and after treatment, and the third dataset is of two-colour spotted cDNA type. In all cases, the patients or arrays had different estimated variances, leading to distinctly unequal weights in the analysis. We suggest also plots which illustrate the variances and correlations that affect the weights computed by our analysis method. For simulated data the improvement relative to previously published methods without weighting is shown to be substantial.
Collapse
|
121
|
Nilsson RH, Kristiansson E, Ryberg M, Larsson KH. Approaching the taxonomic affiliation of unidentified sequences in public databases--an example from the mycorrhizal fungi. BMC Bioinformatics 2005; 6:178. [PMID: 16022740 PMCID: PMC1186019 DOI: 10.1186/1471-2105-6-178] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2005] [Accepted: 07/18/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND During the last few years, DNA sequence analysis has become one of the primary means of taxonomic identification of species, particularly so for species that are minute or otherwise lack distinct, readily obtainable morphological characters. Although the number of sequences available for comparison in public databases such as GenBank increases exponentially, only a minuscule fraction of all organisms have been sequenced, leaving taxon sampling a momentous problem for sequence-based taxonomic identification. When querying GenBank with a set of unidentified sequences, a considerable proportion typically lack fully identified matches, forming an ever-mounting pile of sequences that the researcher will have to monitor manually in the hope that new, clarifying sequences have been submitted by other researchers. To alleviate these concerns, a project to automatically monitor select unidentified sequences in GenBank for taxonomic progress through repeated local BLAST searches was initiated. Mycorrhizal fungi--a field where species identification often is prohibitively complex--and the much used ITS locus were chosen as test bed. RESULTS A Perl script package called emerencia is presented. On a regular basis, it downloads select sequences from GenBank, separates the identified sequences from those insufficiently identified, and performs BLAST searches between these two datasets, storing all results in an SQL database. On the accompanying web-service http://emerencia.math.chalmers.se, users can monitor the taxonomic progress of insufficiently identified sequences over time, either through active searches or by signing up for e-mail notification upon disclosure of better matches. Other search categories, such as listing all insufficiently identified sequences (and their present best fully identified matches) publication-wise, are also available. DISCUSSION The ever-increasing use of DNA sequences for identification purposes largely falls back on the assumption that public sequence databases contain a thorough sampling of taxonomically well-annotated sequences. Taxonomy, held by some to be an old-fashioned trade, has accordingly never been more important. emerencia does not automate the taxonomic process, but it does allow researchers to focus their efforts elsewhere than countless manual BLAST runs and arduous sieving of BLAST hit lists. The emerencia system is available on an open source basis for local installation with any organism and gene group as targets.
Collapse
|
122
|
Nelander S, Larsson E, Kristiansson E, Månsson R, Nerman O, Sigvardsson M, Mostad P, Lindahl P. Predictive screening for regulators of conserved functional gene modules (gene batteries) in mammals. BMC Genomics 2005; 6:68. [PMID: 15882449 PMCID: PMC1134656 DOI: 10.1186/1471-2164-6-68] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2004] [Accepted: 05/09/2005] [Indexed: 01/08/2023] Open
Abstract
Background The expression of gene batteries, genomic units of functionally linked genes which are activated by similar sets of cis- and trans-acting regulators, has been proposed as a major determinant of cell specialization in metazoans. We developed a predictive procedure to screen the mouse and human genomes and transcriptomes for cases of gene-battery-like regulation. Results In a screen that covered ~40 per cent of all annotated protein-coding genes, we identified 21 co-expressed gene clusters with statistically supported sharing of cis-regulatory sequence elements. 66 predicted cases of over-represented transcription factor binding motifs were validated against the literature and fell into three categories: (i) previously described cases of gene battery-like regulation, (ii) previously unreported cases of gene battery-like regulation with some support in a limited number of genes, and (iii) predicted cases that currently lack experimental support. The novel predictions include for example Sox 17 and RFX transcription factor binding sites that were detected in ~10% of all testis specific genes, and HNF-1 and 4 binding sites that were detected in ~30% of all kidney specific genes respectively. The results are publicly available at . Conclusion 21 co-expressed gene clusters were enriched for a total of 66 shared cis-regulatory sequence elements. A majority of these predictions represent novel cases of potential co-regulation of functionally coupled proteins. Critical technical parameters were evaluated, and the results and the methods provide a valuable resource for future experimental design.
Collapse
|