1
|
Ban M, Liao W, Baker A, Compston A, Thorpe J, Molyneux P, Fraser M, Khadake J, Jones J, Coles A, Sawcer S. Transcript specific regulation of expression influences susceptibility to multiple sclerosis. Eur J Hum Genet 2020; 28:826-834. [PMID: 31932686 DOI: 10.1038/s41431-019-0569-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Revised: 11/03/2019] [Accepted: 12/17/2019] [Indexed: 01/01/2023] Open
Abstract
Genome-wide association studies (GWAS) have identified over 100 loci containing single nucleotide variants (SNVs) that influence the risk of developing multiple sclerosis (MS). Most of these loci lie in non-coding regulatory regions of the genome that are active in immune cells and are therefore thought to modify risk by altering the expression of key immune genes. To explore this hypothesis we screened genes flanking MS-associated variants for evidence of allele specific expression (ASE) by quantifying the transcription of coding variants in linkage disequilibrium with MS-associated SNVs. In total, we were able to identify and successfully analyse 200 such coding variants (from 112 genes) in both CD4+ and CD8+ T cells from 106 MS patients and 105 controls. Fifty-six of these coding variants (from 43 genes) showed statistically significant evidence of ASE in one or both cell types. In the Lck interacting transmembrane adaptor 1 gene (LIME1), for example, we were able to show that in both cell types, the MS-associated variant rs2256814 increased the expression of some transcripts while simultaneously reducing the expression of other transcripts. In CD4+ cells from an additional independent set of 96 cases and 93 controls we were able to replicate the effect of this SNV on the balance of alternate LIME1 transcripts using qPCR (p = 5 × 10-24). Our data thus indicate that some of the MS-associated SNVs identified by GWAS likely exert their effects on risk by distorting the balance of alternate transcripts rather than by changing the overall level of gene expression.
Collapse
Affiliation(s)
- Maria Ban
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK.
| | - Wenjia Liao
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Amie Baker
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Alastair Compston
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - John Thorpe
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Paul Molyneux
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Mary Fraser
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Jyoti Khadake
- NIHR BioResource for Translational Research, University of Cambridge and Cambridge University Hospitals NHS Foundation Trust Hills Road, Box 299, Cambridge, CB2 0QQ, UK
| | - Joanne Jones
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Alasdair Coles
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| | - Stephen Sawcer
- Department of Clinical Neurosciences, Cambridge Biomedical Campus, University of Cambridge, Box 165, Hills Road, Cambridge, CB2 0QQ, UK
| |
Collapse
|
2
|
Mitrovič M, Patsopoulos NA, Beecham AH, Dankowski T, Goris A, Dubois B, D’hooghe MB, Lemmens R, Van Damme P, Søndergaard HB, Sellebjerg F, Sorensen PS, Ullum H, Thørner LW, Werge T, Saarela J, Cournu-Rebeix I, Damotte V, Fontaine B, Guillot-Noel L, Lathrop M, Vukusik S, Gourraud PA, Andlauer TF, Pongratz V, Buck D, Gasperi C, Bayas A, Heesen C, Kümpfel T, Linker R, Paul F, Stangel M, Tackenberg B, Bergh FT, Warnke C, Wiendl H, Wildemann B, Zettl U, Ziemann U, Tumani H, Gold R, Grummel V, Hemmer B, Knier B, Lill CM, Luessi F, Dardiotis E, Agliardi C, Barizzone N, Mascia E, Bernardinelli L, Comi G, Cusi D, Esposito F, Ferrè L, Comi C, Galimberti D, Leone MA, Sorosina M, Mescheriakova J, Hintzen R, van Duijn C, Teunissen CE, Bos SD, Myhr KM, Celius EG, Lie BA, Spurkland A, Comabella M, Montalban X, Alfredsson L, Stridh P, Hillert J, Jagodic M, Piehl F, Jelčić I, Martin R, Sospedra M, Ban M, Hawkins C, Hysi P, Kalra S, Karpe F, Khadake J, Lachance G, Neville M, Santaniello A, Caillier SJ, Calabresi PA, Cree BA, Cross A, Davis MF, Haines JL, de Bakker PI, Delgado S, Dembele M, Edwards K, Fitzgerald KC, Hakonarson H, Konidari I, Lathi E, Manrique CP, Pericak-Vance MA, Piccio L, Schaefer C, McCabe C, Weiner H, Goldstein J, Olsson T, Hadjigeorgiou G, Taylor B, Tajouri L, Charlesworth J, Booth DR, Harbo HF, Ivinson AJ, Hauser SL, Compston A, Stewart G, Zipp F, Barcellos LF, Baranzini SE, Martinelli-Boneschi F, D’Alfonso S, Ziegler A, Oturai A, McCauley JL, Sawcer SJ, Oksenberg JR, De Jager PL, Kockum I, Hafler DA, Cotsapas C. Low-Frequency and Rare-Coding Variation Contributes to Multiple Sclerosis Risk. Cell 2019; 178:262. [PMID: 31251915 PMCID: PMC6602362 DOI: 10.1016/j.cell.2019.06.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
3
|
Mitrovič M, Patsopoulos NA, Beecham AH, Dankowski T, Goris A, Dubois B, D’hooghe MB, Lemmens R, Van Damme P, Søndergaard HB, Sellebjerg F, Sorensen PS, Ullum H, Thørner LW, Werge T, Saarela J, Cournu-Rebeix I, Damotte V, Fontaine B, Guillot-Noel L, Lathrop M, Vukusik S, Gourraud PA, Andlauer TF, Pongratz V, Buck D, Gasperi C, Bayas A, Heesen C, Kümpfel T, Linker R, Paul F, Stangel M, Tackenberg B, Bergh FT, Warnke C, Wiendl H, Wildemann B, Zettl U, Ziemann U, Tumani H, Gold R, Grummel V, Hemmer B, Knier B, Lill CM, Luessi F, Dardiotis E, Agliardi C, Barizzone N, Mascia E, Bernardinelli L, Comi G, Cusi D, Esposito F, Ferrè L, Comi C, Galimberti D, Leone MA, Sorosina M, Mescheriakova J, Hintzen R, van Duijn C, Teunissen CE, Bos SD, Myhr KM, Celius EG, Lie BA, Spurkland A, Comabella M, Montalban X, Alfredsson L, Stridh P, Hillert J, Jagodic M, Piehl F, Jelčić I, Martin R, Sospedra M, Ban M, Hawkins C, Hysi P, Kalra S, Karpe F, Khadake J, Lachance G, Neville M, Santaniello A, Caillier SJ, Calabresi PA, Cree BA, Cross A, Davis MF, Haines JL, de Bakker PI, Delgado S, Dembele M, Edwards K, Fitzgerald KC, Hakonarson H, Konidari I, Lathi E, Manrique CP, Pericak-Vance MA, Piccio L, Schaefer C, McCabe C, Weiner H, Goldstein J, Olsson T, Hadjigeorgiou G, Taylor B, Tajouri L, Charlesworth J, Booth DR, Harbo HF, Ivinson AJ, Hauser SL, Compston A, Stewart G, Zipp F, Barcellos LF, Baranzini SE, Martinelli-Boneschi F, D’Alfonso S, Ziegler A, Oturai A, McCauley JL, Sawcer SJ, Oksenberg JR, De Jager PL, Kockum I, Hafler DA, Cotsapas C. Low-Frequency and Rare-Coding Variation Contributes to Multiple Sclerosis Risk. Cell 2018; 175:1679-1687.e7. [PMID: 30343897 PMCID: PMC6269166 DOI: 10.1016/j.cell.2018.09.049] [Citation(s) in RCA: 88] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Revised: 08/08/2018] [Accepted: 09/24/2018] [Indexed: 12/21/2022]
Abstract
Multiple sclerosis is a complex neurological disease, with ∼20% of risk heritability attributable to common genetic variants, including >230 identified by genome-wide association studies. Multiple strands of evidence suggest that much of the remaining heritability is also due to additive effects of common variants rather than epistasis between these variants or mutations exclusive to individual families. Here, we show in 68,379 cases and controls that up to 5% of this heritability is explained by low-frequency variation in gene coding sequence. We identify four novel genes driving MS risk independently of common-variant signals, highlighting key pathogenic roles for regulatory T cell homeostasis and regulation, IFNγ biology, and NFκB signaling. As low-frequency variants do not show substantial linkage disequilibrium with other variants, and as coding variants are more interpretable and experimentally tractable than non-coding variation, our discoveries constitute a rich resource for dissecting the pathobiology of MS.
Collapse
|
4
|
Griffin PC, Khadake J, LeMay KS, Lewis SE, Orchard S, Pask A, Pope B, Roessner U, Russell K, Seemann T, Treloar A, Tyagi S, Christiansen JH, Dayalan S, Gladman S, Hangartner SB, Hayden HL, Ho WWH, Keeble-Gagnère G, Korhonen PK, Neish P, Prestes PR, Richardson MF, Watson-Haigh NS, Wyres KL, Young ND, Schneider MV. Best practice data life cycle approaches for the life sciences. F1000Res 2018; 6:1618. [PMID: 30109017 DOI: 10.12688/f1000research.12344.1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/17/2017] [Indexed: 11/20/2022] Open
Abstract
Throughout history, the life sciences have been revolutionised by technological advances; in our era this is manifested by advances in instrumentation for data generation, and consequently researchers now routinely handle large amounts of heterogeneous data in digital formats. The simultaneous transitions towards biology as a data science and towards a 'life cycle' view of research data pose new challenges. Researchers face a bewildering landscape of data management requirements, recommendations and regulations, without necessarily being able to access data management training or possessing a clear understanding of practical approaches that can assist in data management in their particular research domain. Here we provide an overview of best practice data life cycle approaches for researchers in the life sciences/bioinformatics space with a particular focus on 'omics' datasets and computer-based data processing and analysis. We discuss the different stages of the data life cycle and provide practical suggestions for useful tools and resources to improve data management practices.
Collapse
Affiliation(s)
- Philippa C Griffin
- EMBL Australia Bioinformatics Resource, The University of Melbourne, Parkville, VIC, 3010, Australia.,Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Jyoti Khadake
- NIHR BioResource, University of Cambridge and Cambridge University Hospitals NHS Foundation Trust Hills Road, Cambridge , CB2 0QQ, UK
| | - Kate S LeMay
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Suzanna E Lewis
- Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA, 94720, USA
| | - Sandra Orchard
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Cambridge, CB10 1SD, UK
| | - Andrew Pask
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Bernard Pope
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Ute Roessner
- Metabolomics Australia, School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Keith Russell
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Torsten Seemann
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Andrew Treloar
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Sonika Tyagi
- Australian Genome Research Facility Ltd, Parkville, VIC, 3052, Australia.,Monash Bioinformatics Platform, Monash University, Clayton, VIC, 3800, Australia
| | - Jeffrey H Christiansen
- Queensland Cyber Infrastructure Foundation and the University of Queensland Research Computing Centre, St Lucia, QLD, 4072, Australia
| | - Saravanan Dayalan
- Metabolomics Australia, School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Simon Gladman
- EMBL Australia Bioinformatics Resource, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Sandra B Hangartner
- School of Biological Sciences, Monash University, Clayton, VIC, 3800, Australia
| | - Helen L Hayden
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Department of Economic Development, Jobs, Transport and Resources (DEDJTR), Bundoora, VIC, 3083, Australia
| | - William W H Ho
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Gabriel Keeble-Gagnère
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBioscience, Department of Economic Development, Jobs, Transport and Resources (DEDJTR), Bundoora, VIC, 3083, Australia
| | - Pasi K Korhonen
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Peter Neish
- The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Priscilla R Prestes
- Faculty of Science and Engineering, Federation University Australia, Mt Helen , VIC, 3350, Australia
| | - Mark F Richardson
- Bioinformatics Core Research Group & Centre for Integrative Ecology, Deakin University, Geelong, VIC, 3220, Australia
| | - Nathan S Watson-Haigh
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, SA, 5064, Australia
| | - Kelly L Wyres
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Neil D Young
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Maria Victoria Schneider
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia.,The University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
5
|
Warrier V, Grasby KL, Uzefovsky F, Toro R, Smith P, Chakrabarti B, Khadake J, Mawbey-Adamson E, Litterman N, Hottenga JJ, Lubke G, Boomsma DI, Martin NG, Hatemi PK, Medland SE, Hinds DA, Bourgeron T, Baron-Cohen S. Genome-wide meta-analysis of cognitive empathy: heritability, and correlates with sex, neuropsychiatric conditions and cognition. Mol Psychiatry 2018; 23:1402-1409. [PMID: 28584286 PMCID: PMC5656177 DOI: 10.1038/mp.2017.122] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/19/2016] [Revised: 04/08/2017] [Accepted: 04/12/2017] [Indexed: 12/19/2022]
Abstract
We conducted a genome-wide meta-analysis of cognitive empathy using the 'Reading the Mind in the Eyes' Test (Eyes Test) in 88,056 research volunteers of European Ancestry (44,574 females and 43,482 males) from 23andMe Inc., and an additional 1497 research volunteers of European Ancestry (891 females and 606 males) from the Brisbane Longitudinal Twin Study. We confirmed a female advantage on the Eyes Test (Cohen's d=0.21, P<2.2 × 10-16), and identified a locus in 3p26.1 that is associated with scores on the Eyes Test in females (rs7641347, Pmeta=1.58 × 10-8). Common single nucleotide polymorphisms explained 5.8% (95% CI: 4.5%-7.2%; P=1.00 × 10-17) of the total trait variance in both sexes, and we identified a twin heritability of 28% (95% CI: 13%-42%). Finally, we identified significant genetic correlation between the Eyes Test and anorexia nervosa, openness (NEO-Five Factor Inventory), and different measures of educational attainment and cognitive aptitude.
Collapse
Affiliation(s)
- Varun Warrier
- Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridgeshire, United Kingdom,Corresponding authors: Varun Warrier () and Simon Baron-Cohen (). Autism Research Centre, Douglas House, 18B Trumpington Road, Cambridge, CB2 8AH, United Kingdom. Telephone: 0044 (0) 1223 746057
| | | | - Florina Uzefovsky
- Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridgeshire, United Kingdom,Department of Psychology, Ben Gurion University of the Negev, Israel
| | - Roberto Toro
- CNRS UMR 3571: Genes, Synapses and Cognition, Institut Pasteur, Paris, France,Université Paris Diderot, Sorbonne Paris Cité, Human Genetics and Cognitive Functions, Paris, France,Institut Pasteur, 5-28 Rue du Dr Roux, 75015 Paris, France
| | - Paula Smith
- Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridgeshire, United Kingdom
| | - Bhismadev Chakrabarti
- Centre for Integrative Neuroscience and Neurodynamics, School of Psychology and Clinical Language Sciences, University of Reading, Reading, United Kingdom
| | - Jyoti Khadake
- NIHR Cambridge BioResource, Cambridge University and Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Eleanor Mawbey-Adamson
- NIHR Cambridge BioResource, Cambridge University and Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Nadia Litterman
- 23andMe Inc., 899 West Evelyn Ave, Mountain View, California 94041, USA
| | - Jouke-Jan Hottenga
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands,EMGO+ Institute for Health and Care Research, VU Medical Center, Amsterdam, the Netherlands.,Neuroscience Campus Amsterdam, Vrije Universiteit, Amsterdam, the Netherlands
| | - Gitta Lubke
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands,Department of Psychology, University of Notre Dame, United States
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | | | - Peter K Hatemi
- Political Science, Microbiology and Biochemistry, Pennsylvania State University
| | - Sarah E Medland
- QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - David A Hinds
- 23andMe Inc., 899 West Evelyn Ave, Mountain View, California 94041, USA
| | - Thomas Bourgeron
- CNRS UMR 3571: Genes, Synapses and Cognition, Institut Pasteur, Paris, France,Université Paris Diderot, Sorbonne Paris Cité, Human Genetics and Cognitive Functions, Paris, France,Institut Pasteur, 5-28 Rue du Dr Roux, 75015 Paris, France
| | - Simon Baron-Cohen
- Autism Research Centre, Department of Psychiatry, University of Cambridge, Cambridgeshire, United Kingdom,CLASS Clinic, Cambridgeshire and Peterborough NHS Foundation Trust (CPFT), Cambridgeshire, United Kingdom,Corresponding authors: Varun Warrier () and Simon Baron-Cohen (). Autism Research Centre, Douglas House, 18B Trumpington Road, Cambridge, CB2 8AH, United Kingdom. Telephone: 0044 (0) 1223 746057
| |
Collapse
|
6
|
Griffin PC, Khadake J, LeMay KS, Lewis SE, Orchard S, Pask A, Pope B, Roessner U, Russell K, Seemann T, Treloar A, Tyagi S, Christiansen JH, Dayalan S, Gladman S, Hangartner SB, Hayden HL, Ho WWH, Keeble-Gagnère G, Korhonen PK, Neish P, Prestes PR, Richardson MF, Watson-Haigh NS, Wyres KL, Young ND, Schneider MV. Best practice data life cycle approaches for the life sciences. F1000Res 2017; 6:1618. [PMID: 30109017 PMCID: PMC6069748 DOI: 10.12688/f1000research.12344.2] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/29/2018] [Indexed: 11/20/2022] Open
Abstract
Throughout history, the life sciences have been revolutionised by technological advances; in our era this is manifested by advances in instrumentation for data generation, and consequently researchers now routinely handle large amounts of heterogeneous data in digital formats. The simultaneous transitions towards biology as a data science and towards a 'life cycle' view of research data pose new challenges. Researchers face a bewildering landscape of data management requirements, recommendations and regulations, without necessarily being able to access data management training or possessing a clear understanding of practical approaches that can assist in data management in their particular research domain. Here we provide an overview of best practice data life cycle approaches for researchers in the life sciences/bioinformatics space with a particular focus on 'omics' datasets and computer-based data processing and analysis. We discuss the different stages of the data life cycle and provide practical suggestions for useful tools and resources to improve data management practices.
Collapse
Affiliation(s)
- Philippa C Griffin
- EMBL Australia Bioinformatics Resource, The University of Melbourne, Parkville, VIC, 3010, Australia.,Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Jyoti Khadake
- NIHR BioResource, University of Cambridge and Cambridge University Hospitals NHS Foundation Trust Hills Road, Cambridge , CB2 0QQ, UK
| | - Kate S LeMay
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Suzanna E Lewis
- Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA, 94720, USA
| | - Sandra Orchard
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Cambridge, CB10 1SD, UK
| | - Andrew Pask
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Bernard Pope
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Ute Roessner
- Metabolomics Australia, School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Keith Russell
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Torsten Seemann
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Andrew Treloar
- Australian National Data Service, Monash University, Malvern East , VIC, 3145, Australia
| | - Sonika Tyagi
- Australian Genome Research Facility Ltd, Parkville, VIC, 3052, Australia.,Monash Bioinformatics Platform, Monash University, Clayton, VIC, 3800, Australia
| | - Jeffrey H Christiansen
- Queensland Cyber Infrastructure Foundation and the University of Queensland Research Computing Centre, St Lucia, QLD, 4072, Australia
| | - Saravanan Dayalan
- Metabolomics Australia, School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Simon Gladman
- EMBL Australia Bioinformatics Resource, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Sandra B Hangartner
- School of Biological Sciences, Monash University, Clayton, VIC, 3800, Australia
| | - Helen L Hayden
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Department of Economic Development, Jobs, Transport and Resources (DEDJTR), Bundoora, VIC, 3083, Australia
| | - William W H Ho
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Gabriel Keeble-Gagnère
- School of BioSciences, The University of Melbourne, Parkville, VIC, 3010, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBioscience, Department of Economic Development, Jobs, Transport and Resources (DEDJTR), Bundoora, VIC, 3083, Australia
| | - Pasi K Korhonen
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Peter Neish
- The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Priscilla R Prestes
- Faculty of Science and Engineering, Federation University Australia, Mt Helen , VIC, 3350, Australia
| | - Mark F Richardson
- Bioinformatics Core Research Group & Centre for Integrative Ecology, Deakin University, Geelong, VIC, 3220, Australia
| | - Nathan S Watson-Haigh
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, SA, 5064, Australia
| | - Kelly L Wyres
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Neil D Young
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Maria Victoria Schneider
- Melbourne Bioinformatics, The University of Melbourne, Parkville, VIC, 3010, Australia.,The University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
7
|
Lachmann PJ, Lay E, Seilly DJ, Buchberger A, Schwaeble W, Khadake J. Further studies of the down-regulation by Factor I of the C3b feedback cycle using endotoxin as a soluble activator and red cells as a source of CR1 on sera of different complotype. Clin Exp Immunol 2015; 183:150-6. [PMID: 26415566 DOI: 10.1111/cei.12714] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/23/2015] [Indexed: 01/20/2023] Open
Abstract
In this paper we have extended our earlier studies of the action of increasing Factor I concentration on complement activation by using a soluble activator, lipopolysaccharide (LPS) endotoxin, and using human erythrocytes as a source of CR1 - the co-factor needed for the final clip of iC3b to C3dg by Factor I. Using this more physiological system, the results show that we can predict that a quite modest increase in Factor I concentration - 22 µg/ml of extra Factor I - will convert the activity of the highest risk sera to those of the lowest risk. Preliminary experiments have been performed with erythrocytes allotyped for CR1 number. While we have not been able to perform an adequate study of their co-factor activities in our assays, preliminary experiments suggest that when Factor I levels are increased the difference produced by different allotypes of red cells is largely overcome. This suggests that in patients with paroxysmal nocturnal haemoglobinuria (PNH) treated with eculizumab, additional treatment with Factor I may be very useful in reducing the need for blood transfusion. We have also explored the age-related allele frequency for the two polymorphisms of Factor H and the polymorphism of C3. In our population, unlike the 1975 study, we found no age variation in the allele frequency in these polymorphisms. This may, however, reflect that the Cambridge BioResource volunteers do not include many very young or very elderly patients, and in general comprise a population not greatly at risk of death from infectious disease.
Collapse
Affiliation(s)
- P J Lachmann
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - E Lay
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - D J Seilly
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - A Buchberger
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - W Schwaeble
- Department of Infection, Immunity and Inflammation, University of Leicester, Leicester, UK
| | - J Khadake
- Cambridge BioResource, Cambridge University and Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| |
Collapse
|
8
|
López-Sánchez I, Valbuena A, Vázquez-Cedeira M, Khadake J, Sanz-García M, Carrillo-Jiménez A, Lazo PA. VRK1 interacts with p53 forming a basal complex that is activated by UV-induced DNA damage. FEBS Lett 2014; 588:692-700. [PMID: 24492002 DOI: 10.1016/j.febslet.2014.01.040] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Revised: 01/09/2014] [Accepted: 01/19/2014] [Indexed: 01/08/2023]
Abstract
DNA damage immediate cellular response requires the activation of p53 by kinases. We found that p53 forms a basal stable complex with VRK1, a Ser-Thr kinase that responds to UV-induced DNA damage by specifically phosphorylating p53. This interaction takes place through the p53 DNA binding domain, and frequent DNA-contact mutants of p53, such as R273H, R248H or R280K, do not disrupt the complex. UV-induced DNA damage activates VRK1, and is accompanied by phosphorylation of p53 at Thr-18 before it accumulates. We propose that the VRK1-p53 basal complex is an early-warning system for immediate cellular responses to DNA damage.
Collapse
Affiliation(s)
- Inmaculada López-Sánchez
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain
| | - Alberto Valbuena
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain
| | - Marta Vázquez-Cedeira
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain; Instituto de Investigación Biomédica de Salamanca (IBSAL), Hospital Universitario de Salamanca, Salamanca, Spain
| | - Jyoti Khadake
- European Bioinformatics Institute-EMBL, Cambridge, England, United Kingdom
| | - Marta Sanz-García
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain
| | - Alejandro Carrillo-Jiménez
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain
| | - Pedro A Lazo
- Experimental Therapeutics and Translational Oncology Program, Instituto de Biología Molecular y Celular del Cáncer, Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Salamanca, Spain; Instituto de Investigación Biomédica de Salamanca (IBSAL), Hospital Universitario de Salamanca, Salamanca, Spain.
| |
Collapse
|
9
|
Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F, Campbell NH, Chavali G, Chen C, del-Toro N, Duesbury M, Dumousseau M, Galeota E, Hinz U, Iannuccelli M, Jagannathan S, Jimenez R, Khadake J, Lagreid A, Licata L, Lovering RC, Meldal B, Melidoni AN, Milagros M, Peluso D, Perfetto L, Porras P, Raghunath A, Ricard-Blum S, Roechert B, Stutz A, Tognolli M, van Roey K, Cesareni G, Hermjakob H. The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res 2013; 42:D358-63. [PMID: 24234451 PMCID: PMC3965093 DOI: 10.1093/nar/gkt1115] [Citation(s) in RCA: 1305] [Impact Index Per Article: 118.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).
Collapse
Affiliation(s)
- Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, School of Animal and Comparative Biomedical Sciences, The University of Arizona, Tucson, AZ 85721-0036, USA, Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Rue Michel-Servet, CH-1211 Geneva 4, Switzerland, Department of Biology, University of Rome Tor Vergata, Rome 00133, Italy, Ontario Cancer Institute, the Campbell Family Institute for Cancer Research, and Techna Institute, University Health Network, Toronto, Ontario M5G 0A3, Canada, Cardiovascular Gene Annotation Initiative, Centre for Cardiovascular Genetics, Institute of Cardiovascular Science, University College London, London WC1E 6BT, UK, Centre for Microbial Diseases and Immunity Research, University of British Columbia, Vancouver, British Columbia V6T 1Z4 Canada, Mechanobiology Institute, National University of Singapore, T-Lab #05-01, 5A Engineering Drive 1, Singapore 117411, Singapore, Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, 7489 Trondheim, Norway, Research Institute IRCSS "Fondazione Santa Lucia", Rome 00179, Italy, Molecular Connections Pvt. Ltd., Bangalore 560 004, India, Institut de Biologie et Chimie des Protéines, Unité Mixte de Recherche 5086, Centre National de la Recherche Scientifique, Université Lyon 1, Lyon, France and Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, D-69117 Heidelberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Orchard S, Kerrien S, Abbani S, Aranda B, Bhate J, Bidwell S, Bridge A, Briganti L, Brinkman FSL, Brinkman F, Cesareni G, Chatr-aryamontri A, Chautard E, Chen C, Dumousseau M, Goll J, Hancock REW, Hancock R, Hannick LI, Jurisica I, Khadake J, Lynn DJ, Mahadevan U, Perfetto L, Raghunath A, Ricard-Blum S, Roechert B, Salwinski L, Stümpflen V, Tyers M, Uetz P, Xenarios I, Hermjakob H. Protein interaction data curation: the International Molecular Exchange (IMEx) consortium. Nat Methods 2012; 9:345-50. [PMID: 22453911 DOI: 10.1038/nmeth.1931] [Citation(s) in RCA: 430] [Impact Index Per Article: 35.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.
Collapse
Affiliation(s)
- Sandra Orchard
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Gieger C, Radhakrishnan A, Cvejic A, Tang W, Porcu E, Pistis G, Serbanovic-Canic J, Elling U, Goodall AH, Labrune Y, Lopez LM, Mägi R, Meacham S, Okada Y, Pirastu N, Sorice R, Teumer A, Voss K, Zhang W, Ramirez-Solis R, Bis JC, Ellinghaus D, Gögele M, Hottenga JJ, Langenberg C, Kovacs P, O'Reilly PF, Shin SY, Esko T, Hartiala J, Kanoni S, Murgia F, Parsa A, Stephens J, van der Harst P, Ellen van der Schoot C, Allayee H, Attwood A, Balkau B, Bastardot F, Basu S, Baumeister SE, Biino G, Bomba L, Bonnefond A, Cambien F, Chambers JC, Cucca F, D'Adamo P, Davies G, de Boer RA, de Geus EJC, Döring A, Elliott P, Erdmann J, Evans DM, Falchi M, Feng W, Folsom AR, Frazer IH, Gibson QD, Glazer NL, Hammond C, Hartikainen AL, Heckbert SR, Hengstenberg C, Hersch M, Illig T, Loos RJF, Jolley J, Khaw KT, Kühnel B, Kyrtsonis MC, Lagou V, Lloyd-Jones H, Lumley T, Mangino M, Maschio A, Mateo Leach I, McKnight B, Memari Y, Mitchell BD, Montgomery GW, Nakamura Y, Nauck M, Navis G, Nöthlings U, Nolte IM, Porteous DJ, Pouta A, Pramstaller PP, Pullat J, Ring SM, Rotter JI, Ruggiero D, Ruokonen A, Sala C, Samani NJ, Sambrook J, Schlessinger D, Schreiber S, Schunkert H, Scott J, Smith NL, Snieder H, Starr JM, Stumvoll M, Takahashi A, Tang WHW, Taylor K, Tenesa A, Lay Thein S, Tönjes A, Uda M, Ulivi S, van Veldhuisen DJ, Visscher PM, Völker U, Wichmann HE, Wiggins KL, Willemsen G, Yang TP, Hua Zhao J, Zitting P, Bradley JR, Dedoussis GV, Gasparini P, Hazen SL, Metspalu A, Pirastu M, Shuldiner AR, Joost van Pelt L, Zwaginga JJ, Boomsma DI, Deary IJ, Franke A, Froguel P, Ganesh SK, Jarvelin MR, Martin NG, Meisinger C, Psaty BM, Spector TD, Wareham NJ, Akkerman JWN, Ciullo M, Deloukas P, Greinacher A, Jupe S, Kamatani N, Khadake J, Kooner JS, Penninger J, Prokopenko I, Stemple D, Toniolo D, Wernisch L, Sanna S, Hicks AA, Rendon A, Ferreira MA, Ouwehand WH, Soranzo N. New gene functions in megakaryopoiesis and platelet formation. Nature 2011; 480:201-8. [PMID: 22139419 PMCID: PMC3335296 DOI: 10.1038/nature10659] [Citation(s) in RCA: 309] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2010] [Accepted: 10/21/2011] [Indexed: 12/23/2022]
Abstract
Platelets are the second most abundant cell type in blood and are essential for maintaining haemostasis. Their count and volume are tightly controlled within narrow physiological ranges, but there is only limited understanding of the molecular processes controlling both traits. Here we carried out a high-powered meta-analysis of genome-wide association studies (GWAS) in up to 66,867 individuals of European ancestry, followed by extensive biological and functional assessment. We identified 68 genomic loci reliably associated with platelet count and volume mapping to established and putative novel regulators of megakaryopoiesis and platelet formation. These genes show megakaryocyte-specific gene expression patterns and extensive network connectivity. Using gene silencing in Danio rerio and Drosophila melanogaster, we identified 11 of the genes as novel regulators of blood cell formation. Taken together, our findings advance understanding of novel gene functions controlling fate-determining events during megakaryopoiesis and platelet formation, providing a new example of successful translation of GWAS to function.
Collapse
Affiliation(s)
- Christian Gieger
- Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstr 1, 85764 Neuherberg, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Kerrien S, Aranda B, Breuza L, Bridge A, Broackes-Carter F, Chen C, Duesbury M, Dumousseau M, Feuermann M, Hinz U, Jandrasits C, Jimenez RC, Khadake J, Mahadevan U, Masson P, Pedruzzi I, Pfeiffenberger E, Porras P, Raghunath A, Roechert B, Orchard S, Hermjakob H. The IntAct molecular interaction database in 2012. Nucleic Acids Res 2011; 40:D841-6. [PMID: 22121220 PMCID: PMC3245075 DOI: 10.1093/nar/gkr1088] [Citation(s) in RCA: 727] [Impact Index Per Article: 55.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275 000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.
Collapse
Affiliation(s)
- Samuel Kerrien
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Aranda B, Blankenburg H, Kerrien S, Brinkman FSL, Ceol A, Chautard E, Dana JM, De Las Rivas J, Dumousseau M, Galeota E, Gaulton A, Goll J, Hancock REW, Isserlin R, Jimenez RC, Kerssemakers J, Khadake J, Lynn DJ, Michaut M, O'Kelly G, Ono K, Orchard S, Prieto C, Razick S, Rigina O, Salwinski L, Simonovic M, Velankar S, Winter A, Wu G, Bader GD, Cesareni G, Donaldson IM, Eisenberg D, Kleywegt GJ, Overington J, Ricard-Blum S, Tyers M, Albrecht M, Hermjakob H. PSICQUIC and PSISCORE: accessing and scoring molecular interactions. Nat Methods 2011; 8:528-9. [PMID: 21716279 DOI: 10.1038/nmeth.1637] [Citation(s) in RCA: 233] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
14
|
Salwinski L, Licata L, Winter A, Thorneycroft D, Khadake J, Ceol A, Aryamontri AC, Oughtred R, Livstone M, Boucher L, Botstein D, Dolinski K, Berardini T, Huala E, Tyers M, Eisenberg D, Cesareni G, Hermjakob H. Recurated protein interaction datasets. Nat Methods 2009; 6:860-1. [DOI: 10.1038/nmeth1209-860] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
15
|
Aranda B, Achuthan P, Alam-Faruque Y, Armean I, Bridge A, Derow C, Feuermann M, Ghanbarian AT, Kerrien S, Khadake J, Kerssemakers J, Leroy C, Menden M, Michaut M, Montecchi-Palazzi L, Neuhauser SN, Orchard S, Perreau V, Roechert B, van Eijk K, Hermjakob H. The IntAct molecular interaction database in 2010. Nucleic Acids Res 2009; 38:D525-31. [PMID: 19850723 PMCID: PMC2808934 DOI: 10.1093/nar/gkp878] [Citation(s) in RCA: 524] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
IntAct is an open-source, open data molecular interaction database and toolkit. Data is abstracted from the literature or from direct data depositions by expert curators following a deep annotation model providing a high level of detail. As of September 2009, IntAct contains over 200.000 curated binary interaction evidences. In response to the growing data volume and user requests, IntAct now provides a two-tiered view of the interaction data. The search interface allows the user to iteratively develop complex queries, exploiting the detailed annotation with hierarchical controlled vocabularies. Results are provided at any stage in a simplified, tabular view. Specialized views then allows 'zooming in' on the full annotation of interactions, interactors and their properties. IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.
Collapse
Affiliation(s)
- B Aranda
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Chatr-aryamontri A, Kerrien S, Khadake J, Orchard S, Ceol A, Licata L, Castagnoli L, Costa S, Derow C, Huntley R, Aranda B, Leroy C, Thorneycroft D, Apweiler R, Cesareni G, Hermjakob H. MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data. Genome Biol 2008; 9 Suppl 2:S5. [PMID: 18834496 PMCID: PMC2559989 DOI: 10.1186/gb-2008-9-s2-s5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Background In the absence of consolidated pipelines to archive biological data electronically, information dispersed in the literature must be captured by manual annotation. Unfortunately, manual annotation is time consuming and the coverage of published interaction data is therefore far from complete. The use of text-mining tools to identify relevant publications and to assist in the initial information extraction could help to improve the efficiency of the curation process and, as a consequence, the database coverage of data available in the literature. The 2006 BioCreative competition was aimed at evaluating text-mining procedures in comparison with manual annotation of protein-protein interactions. Results To aid the BioCreative protein-protein interaction task, IntAct and MINT (Molecular INTeraction) provided both the training and the test datasets. Data from both databases are comparable because they were curated according to the same standards. During the manual curation process, the major cause of data loss in mining the articles for information was ambiguity in the mapping of the gene names to stable UniProtKB database identifiers. It was also observed that most of the information about interactions was contained only within the full-text of the publication; hence, text mining of protein-protein interaction data will require the analysis of the full-text of the articles and cannot be restricted to the abstract. Conclusion The development of text-mining tools to extract protein-protein interaction information may increase the literature coverage achieved by manual curation. To support the text-mining community, databases will highlight those sentences within the articles that describe the interactions. These will supply data-miners with a high quality dataset for algorithm development. Furthermore, the dictionary of terms created by the BioCreative competitors could enrich the synonym list of the PSI-MI (Proteomics Standards Initiative-Molecular Interactions) controlled vocabulary, which is used by both databases to annotate their data content.
Collapse
Affiliation(s)
- Andrew Chatr-aryamontri
- Department of Biology, University of Rome, Tor Vergata, Via della Ricerca Scientifica, 00133 Rome, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, Kohler C, Khadake J, Leroy C, Liban A, Lieftink C, Montecchi-Palazzi L, Orchard S, Risse J, Robbe K, Roechert B, Thorneycroft D, Zhang Y, Apweiler R, Hermjakob H. IntAct--open source resource for molecular interaction data. Nucleic Acids Res 2006; 35:D561-5. [PMID: 17145710 PMCID: PMC1751531 DOI: 10.1093/nar/gkl958] [Citation(s) in RCA: 630] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
IntAct is an open source database and software suite for modeling, storing and analyzing molecular interaction data. The data available in the database originates entirely from published literature and is manually annotated by expert biologists to a high level of detail, including experimental methods, conditions and interacting domains. The database features over 126 000 binary interactions extracted from over 2100 scientific publications and makes extensive use of controlled vocabularies. The web site provides tools allowing users to search, visualize and download data from the repository. IntAct supports and encourages local installations as well as direct data submission and curation collaborations. IntAct source code and data are freely available from .
Collapse
Affiliation(s)
- S Kerrien
- EMBL Outstation-European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|