1
|
Agrawal S, Buyan A, Severin J, Koido M, Alam T, Abugessaisa I, Chang HY, Dostie J, Itoh M, Kere J, Kondo N, Li Y, Makeev VJ, Mendez M, Okazaki Y, Ramilowski JA, Sigorskikh AI, Strug LJ, Yagi K, Yasuzawa K, Yip CW, Hon CC, Hoffman MM, Terao C, Kulakovskiy IV, Kasukawa T, Shin JW, Carninci P, de Hoon MJL. Annotation of nuclear lncRNAs based on chromatin interactions. PLoS One 2024; 19:e0295971. [PMID: 38709794 PMCID: PMC11073715 DOI: 10.1371/journal.pone.0295971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 12/02/2023] [Indexed: 05/08/2024] Open
Abstract
The human genome is pervasively transcribed and produces a wide variety of long non-coding RNAs (lncRNAs), constituting the majority of transcripts across human cell types. Some specific nuclear lncRNAs have been shown to be important regulatory components acting locally. As RNA-chromatin interaction and Hi-C chromatin conformation data showed that chromatin interactions of nuclear lncRNAs are determined by the local chromatin 3D conformation, we used Hi-C data to identify potential target genes of lncRNAs. RNA-protein interaction data suggested that nuclear lncRNAs act as scaffolds to recruit regulatory proteins to target promoters and enhancers. Nuclear lncRNAs may therefore play a role in directing regulatory factors to locations spatially close to the lncRNA gene. We provide the analysis results through an interactive visualization web portal at https://fantom.gsc.riken.jp/zenbu/reports/#F6_3D_lncRNA.
Collapse
Affiliation(s)
- Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Andrey Buyan
- Autosome.org, Russia
- FANTOM Consortium, Dolgoprudny, Russia
| | - Jessica Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Masaru Koido
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Tanvir Alam
- College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar
| | | | - Howard Y. Chang
- Center for Personal Dynamic Regulome, Stanford University, Stanford, California, United States of America
| | - Josée Dostie
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, Québec, Canada
| | - Masayoshi Itoh
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Japan
| | - Juha Kere
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
- Stem Cells and Metabolism Research Program, University of Helsinki and Folkhälsan Research Center, Helsinki, Finland
| | - Naoto Kondo
- RIKEN Center for Life Science Technologies, Yokohama, Japan
| | - Yunjing Li
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | | | - Mickaël Mendez
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Yasushi Okazaki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Jordan A. Ramilowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Advanced Medical Research Center, Yokohama City University, Yokohama, Japan
| | | | - Lisa J. Strug
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Department of Statistical Sciences, University of Toronto, Ontario, Canada
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ken Yagi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kayoko Yasuzawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Chi Wai Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Chung Chau Hon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Michael M. Hoffman
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Chikashi Terao
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | - Takeya Kasukawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Jay W. Shin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Human Technopole, Milan, Italy
| | | |
Collapse
|
2
|
Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, Ghassemi M, Liu X, Reitsma JB, van Smeden M, Boulesteix AL, Camaradou JC, Celi LA, Denaxas S, Denniston AK, Glocker B, Golub RM, Harvey H, Heinze G, Hoffman MM, Kengne AP, Lam E, Lee N, Loder EW, Maier-Hein L, Mateen BA, McCradden MD, Oakden-Rayner L, Ordish J, Parnell R, Rose S, Singh K, Wynants L, Logullo P. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024; 385:e078378. [PMID: 38626948 PMCID: PMC11019967 DOI: 10.1136/bmj-2023-078378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 04/19/2024]
Affiliation(s)
- Gary S Collins
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Karel G M Moons
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Paula Dhiman
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Andrew L Beam
- Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Science, Leiden University Medical Centre, Leiden, Netherlands
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xiaoxuan Liu
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Johannes B Reitsma
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Anne-Laure Boulesteix
- Department of Medical Information Processing, Biometry and Epidemiology, Ludwig-Maximilians-University of Munich, Munich, Germany
| | - Jennifer Catherine Camaradou
- Patient representative, Health Data Research UK patient and public involvement and engagement group
- Patient representative, University of East Anglia, Faculty of Health Sciences, Norwich Research Park, Norwich, UK
| | - Leo Anthony Celi
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, London, UK
- British Heart Foundation Data Science Centre, London, UK
| | - Alastair K Denniston
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Ben Glocker
- Department of Computing, Imperial College London, London, UK
| | - Robert M Golub
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Georg Heinze
- Section for Clinical Biometrics, Centre for Medical Data Science, Medical University of Vienna, Vienna, Austria
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | | | - Emily Lam
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Naomi Lee
- National Institute for Health and Care Excellence, London, UK
| | - Elizabeth W Loder
- The BMJ, London, UK
- Department of Neurology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Lena Maier-Hein
- Department of Intelligent Medical Systems, German Cancer Research Centre, Heidelberg, Germany
| | - Bilal A Mateen
- Institute of Health Informatics, University College London, London, UK
- Wellcome Trust, London, UK
- Alan Turing Institute, London, UK
| | - Melissa D McCradden
- Department of Bioethics, Hospital for Sick Children Toronto, ON, Canada
- Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
| | - Johan Ordish
- Medicines and Healthcare products Regulatory Agency, London, UK
| | - Richard Parnell
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Sherri Rose
- Department of Health Policy and Center for Health Policy, Stanford University, Stanford, CA, USA
| | - Karandeep Singh
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Laure Wynants
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Patricia Logullo
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| |
Collapse
|
3
|
Reinke A, Tizabi MD, Baumgartner M, Eisenmann M, Heckmann-Nötzel D, Kavur AE, Rädsch T, Sudre CH, Acion L, Antonelli M, Arbel T, Bakas S, Benis A, Buettner F, Cardoso MJ, Cheplygina V, Chen J, Christodoulou E, Cimini BA, Farahani K, Ferrer L, Galdran A, van Ginneken B, Glocker B, Godau P, Hashimoto DA, Hoffman MM, Huisman M, Isensee F, Jannin P, Kahn CE, Kainmueller D, Kainz B, Karargyris A, Kleesiek J, Kofler F, Kooi T, Kopp-Schneider A, Kozubek M, Kreshuk A, Kurc T, Landman BA, Litjens G, Madani A, Maier-Hein K, Martel AL, Meijering E, Menze B, Moons KGM, Müller H, Nichyporuk B, Nickel F, Petersen J, Rafelski SM, Rajpoot N, Reyes M, Riegler MA, Rieke N, Saez-Rodriguez J, Sánchez CI, Shetty S, Summers RM, Taha AA, Tiulpin A, Tsaftaris SA, Van Calster B, Varoquaux G, Yaniv ZR, Jäger PF, Maier-Hein L. Understanding metric-related pitfalls in image analysis validation. Nat Methods 2024; 21:182-194. [PMID: 38347140 DOI: 10.1038/s41592-023-02150-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 12/12/2023] [Indexed: 02/15/2024]
Abstract
Validation metrics are key for tracking scientific progress and bridging the current chasm between artificial intelligence research and its translation into practice. However, increasing evidence shows that, particularly in image analysis, metrics are often chosen inadequately. Although taking into account the individual strengths, weaknesses and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multistage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides a reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Although focused on biomedical image analysis, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. The work serves to enhance global comprehension of a key topic in image analysis validation.
Collapse
Affiliation(s)
- Annika Reinke
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.
| | - Minu D Tizabi
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany.
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany.
| | - Michael Baumgartner
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
| | - Matthias Eisenmann
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
| | - Doreen Heckmann-Nötzel
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany
| | - A Emre Kavur
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Applied Computer Vision Lab, Heidelberg, Germany
| | - Tim Rädsch
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany
| | - Carole H Sudre
- MRC Unit for Lifelong Health and Ageing at UCL and Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
| | - Laura Acion
- Instituto de Cálculo, CONICET - Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Michela Antonelli
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
- Centre for Medical Image Computing, University College London, London, UK
| | - Tal Arbel
- Centre for Intelligent Machines and MILA (Quebec Artificial Intelligence Institute), McGill University, Montréal, Quebec, Canada
| | - Spyridon Bakas
- Division of Computational Pathology, Dept of Pathology & Laboratory Medicine, Indiana University School of Medicine, Indianapolis, IN, USA
- Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Philadelphia, PA, USA
| | - Arriel Benis
- Department of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel
- European Federation for Medical Informatics, Le Mont-sur-Lausanne, Switzerland
| | - Florian Buettner
- German Cancer Consortium (DKTK), partner site Frankfurt/Mainz, a partnership between DKFZ and UCT Frankfurt-Marburg, Frankfurt am Main, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany
- Goethe University Frankfurt, Department of Medicine, Frankfurt am Main, Germany
- Goethe University Frankfurt, Department of Informatics, Frankfurt am Main, Germany
- Frankfurt Cancer Insititute, Frankfurt am Main, Germany
| | - M Jorge Cardoso
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
| | - Veronika Cheplygina
- Department of Computer Science, IT University of Copenhagen, Copenhagen, Denmark
| | - Jianxu Chen
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany
| | - Evangelia Christodoulou
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
| | - Beth A Cimini
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Keyvan Farahani
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Bethesda, MD, USA
| | - Luciana Ferrer
- Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-UBA, Ciudad Autónoma de Buenos Aires, Buenos Aires, Argentina
| | - Adrian Galdran
- Universitat Pompeu Fabra, Barcelona, Spain
- University of Adelaide, Adelaide, South Australia, Australia
| | - Bram van Ginneken
- Fraunhofer MEVIS, Bremen, Germany
- Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Ben Glocker
- Department of Computing, Imperial College London, South Kensington Campus, London, UK
| | - Patrick Godau
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany
| | - Daniel A Hashimoto
- Department of Surgery, Perelman School of Medicine, Philadelphia, PA, USA
- General Robotics Automation Sensing and Perception Laboratory, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
| | - Merel Huisman
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Fabian Isensee
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Applied Computer Vision Lab, Heidelberg, Germany
| | - Pierre Jannin
- Laboratoire Traitement du Signal et de l'Image - UMR_S 1099, Université de Rennes 1, Rennes, France
- INSERM, Paris, France
| | - Charles E Kahn
- Department of Radiology and Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Dagmar Kainmueller
- Max-Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Biomedical Image Analysis and HI Helmholtz Imaging, Berlin, Germany
- University of Potsdam, Digital Engineering Faculty, Potsdam, Germany
| | - Bernhard Kainz
- Department of Computing, Faculty of Engineering, Imperial College London, London, UK
- Department AIBE, Friedrich-Alexander-Universität (FAU), Erlangen-Nürnberg, Germany
| | | | - Jens Kleesiek
- Translational Image-guided Oncology (TIO), Institute for AI in Medicine (IKIM), University Medicine Essen, Essen, Germany
| | | | | | - Annette Kopp-Schneider
- German Cancer Research Center (DKFZ) Heidelberg, Division of Biostatistics, Heidelberg, Germany
| | - Michal Kozubek
- Centre for Biomedical Image Analysis and Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Anna Kreshuk
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Tahsin Kurc
- Department of Biomedical Informatics, Stony Brook University, Health Science Center, Stony Brook, NY, USA
| | | | - Geert Litjens
- Department of Pathology, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Amin Madani
- Department of Surgery, University Health Network, Philadelphia, PA, USA
| | - Klaus Maier-Hein
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Heidelberg, Germany
| | - Anne L Martel
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Physical Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Erik Meijering
- School of Computer Science and Engineering, University of New South Wales, UNSW Sydney, Kensington, New South Wales, Australia
| | - Bjoern Menze
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, UMC Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Henning Müller
- Information Systems Institute, University of Applied Sciences Western Switzerland (HES-SO), Sierre, Switzerland
- Medical Faculty, University of Geneva, Geneva, Switzerland
| | - Brennan Nichyporuk
- MILA (Quebec Artificial Intelligence Institute), Montréal, Quebec, Canada
| | - Felix Nickel
- Department of General, Visceral and Thoracic Surgery, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Jens Petersen
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
| | | | - Nasir Rajpoot
- Tissue Image Analytics Laboratory, Department of Computer Science, University of Warwick, Coventry, UK
| | - Mauricio Reyes
- ARTORG Center for Biomedical Engineering Research, University of Bern, Bern, Switzerland
- Department of Radiation Oncology, University Hospital Bern, University of Bern, Bern, Switzerland
| | - Michael A Riegler
- Simula Metropolitan Center for Digital Engineering, Oslo, Norway
- UiT The Arctic University of Norway, Tromsø, Norway
| | | | - Julio Saez-Rodriguez
- Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
- Faculty of Medicine, Heidelberg University Hospital, Heidelberg, Germany
| | - Clara I Sánchez
- Informatics Institute, Faculty of Science, University of Amsterdam, Amsterdam, the Netherlands
| | | | - Ronald M Summers
- National Institutes of Health Clinical Center, Bethesda, MD, USA
| | - Abdel A Taha
- Institute of Information Systems Engineering, TU Wien, Vienna, Austria
| | - Aleksei Tiulpin
- Research Unit of Health Sciences and Technology, Faculty of Medicine, University of Oulu, Oulu, Finland
- Neurocenter Oulu, Oulu University Hospital, Oulu, Finland
| | | | - Ben Van Calster
- Department of Development and Regeneration and EPI-centre, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| | - Gaël Varoquaux
- Parietal project team, INRIA Saclay-Île de France, Palaiseau, France
| | - Ziv R Yaniv
- National Institute of Allergy and Infectious Diseases, Bethesda, MD, USA
| | - Paul F Jäger
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, Interactive Machine Learning Group, Heidelberg, Germany.
| | - Lena Maier-Hein
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany.
- Faculty of Medicine, Heidelberg University Hospital, Heidelberg, Germany.
| |
Collapse
|
4
|
Maier-Hein L, Reinke A, Godau P, Tizabi MD, Buettner F, Christodoulou E, Glocker B, Isensee F, Kleesiek J, Kozubek M, Reyes M, Riegler MA, Wiesenfarth M, Kavur AE, Sudre CH, Baumgartner M, Eisenmann M, Heckmann-Nötzel D, Rädsch T, Acion L, Antonelli M, Arbel T, Bakas S, Benis A, Blaschko MB, Cardoso MJ, Cheplygina V, Cimini BA, Collins GS, Farahani K, Ferrer L, Galdran A, van Ginneken B, Haase R, Hashimoto DA, Hoffman MM, Huisman M, Jannin P, Kahn CE, Kainmueller D, Kainz B, Karargyris A, Karthikesalingam A, Kofler F, Kopp-Schneider A, Kreshuk A, Kurc T, Landman BA, Litjens G, Madani A, Maier-Hein K, Martel AL, Mattson P, Meijering E, Menze B, Moons KGM, Müller H, Nichyporuk B, Nickel F, Petersen J, Rajpoot N, Rieke N, Saez-Rodriguez J, Sánchez CI, Shetty S, van Smeden M, Summers RM, Taha AA, Tiulpin A, Tsaftaris SA, Van Calster B, Varoquaux G, Jäger PF. Metrics reloaded: recommendations for image analysis validation. Nat Methods 2024; 21:195-212. [PMID: 38347141 DOI: 10.1038/s41592-023-02151-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 12/12/2023] [Indexed: 02/15/2024]
Abstract
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint-a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.
Collapse
Affiliation(s)
- Lena Maier-Hein
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.
- Medical Faculty, Heidelberg University, Heidelberg, Germany.
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany.
| | - Annika Reinke
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.
| | - Patrick Godau
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany
| | - Minu D Tizabi
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany
| | - Florian Buettner
- German Cancer Consortium (DKTK), partner site Frankfurt/Mainz, a partnership between DKFZ and UCT Frankfurt-Marburg, Frankfurt am Main, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany
- Department of Medicine, Goethe University Frankfurt, Frankfurt am Main, Germany
- Department of Informatics, Goethe University Frankfurt, Frankfurt am Main, Germany
- Frankfurt Cancer Insititute, Frankfurt am Main, Germany
| | - Evangelia Christodoulou
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
| | - Ben Glocker
- Department of Computing, Imperial College London, South Kensington Campus, London, UK
| | - Fabian Isensee
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Applied Computer Vision Lab, Heidelberg, Germany
| | - Jens Kleesiek
- Institute for AI in Medicine, University Medicine Essen, Essen, Germany
| | - Michal Kozubek
- Centre for Biomedical Image Analysis and Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Mauricio Reyes
- ARTORG Center for Biomedical Engineering Research, University of Bern, Bern, Switzerland
- Department of Radiation Oncology, University Hospital Bern, University of Bern, Bern, Switzerland
| | - Michael A Riegler
- Simula Metropolitan Center for Digital Engineering, Oslo, Norway
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | - Manuel Wiesenfarth
- German Cancer Research Center (DKFZ) Heidelberg, Division of Biostatistics, Heidelberg, Germany
| | - A Emre Kavur
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Applied Computer Vision Lab, Heidelberg, Germany
| | - Carole H Sudre
- MRC Unit for Lifelong Health and Ageing at UCL and Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
| | - Michael Baumgartner
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
| | - Matthias Eisenmann
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
| | - Doreen Heckmann-Nötzel
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Heidelberg, Germany
| | - Tim Rädsch
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany
| | - Laura Acion
- Instituto de Cálculo, CONICET - Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Michela Antonelli
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
- Centre for Medical Image Computing, University College London, London, UK
| | - Tal Arbel
- Centre for Intelligent Machines and MILA (Québec Artificial Intelligence Institute), McGill University, Montréal, Quebec, Canada
| | - Spyridon Bakas
- Division of Computational Pathology, Department of Pathology & Laboratory Medicine, Indiana University School of Medicine, IU Health Information and Translational Sciences Building, Indianapolis, IN, USA
- Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Philadelphia, PA, USA
| | - Arriel Benis
- Department of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel
- European Federation for Medical Informatics, Le Mont-sur-Lausanne, Switzerland
| | - Matthew B Blaschko
- Center for Processing Speech and Images, Department of Electrical Engineering, KU Leuven, Leuven, Belgium
| | - M Jorge Cardoso
- School of Biomedical Engineering and Imaging Science, King's College London, London, UK
| | - Veronika Cheplygina
- Department of Computer Science, IT University of Copenhagen, Copenhagen, Denmark
| | - Beth A Cimini
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Gary S Collins
- Centre for Statistics in Medicine, University of Oxford, Nuffield Orthopaedic Centre, Oxford, UK
| | - Keyvan Farahani
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Bethesda, MD, USA
| | - Luciana Ferrer
- Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-UBA, Ciudad Autónoma de Buenos Aires, Buenos Aires, Argentina
| | - Adrian Galdran
- BCN Medtech, Universitat Pompeu Fabra, Barcelona, Spain
- Australian Institute for Machine Learning AIML, University of Adelaide, Adelaide, South Australia, Australia
| | - Bram van Ginneken
- Fraunhofer MEVIS, Bremen, Germany
- Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Robert Haase
- Technische Universität (TU) Dresden, DFG Cluster of Excellence 'Physics of Life', Dresden, Germany
- Center for Systems Biology, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig University, Leipzig, Germany
| | - Daniel A Hashimoto
- Department of Surgery, Perelman School of Medicine, Philadelphia, PA, USA
- General Robotics Automation Sensing and Perception Laboratory, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Merel Huisman
- Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Pierre Jannin
- Laboratoire Traitement du Signal et de l'Image - UMR_S 1099, Université de Rennes 1, Rennes, France
- INSERM, Paris, France
| | - Charles E Kahn
- Department of Radiology and Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Dagmar Kainmueller
- Max-Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Biomedical Image Analysis and HI Helmholtz Imaging, Berlin, Germany
- Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | - Bernhard Kainz
- Department of Computing, Faculty of Engineering, Imperial College London, London, UK
- Department AIBE, Friedrich-Alexander-Universität (FAU), Erlangen-Nürnberg, Germany
| | | | | | | | - Annette Kopp-Schneider
- German Cancer Research Center (DKFZ) Heidelberg, Division of Biostatistics, Heidelberg, Germany
| | - Anna Kreshuk
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Tahsin Kurc
- Department of Biomedical Informatics, Stony Brook University, Health Science Center, Stony Brook, NY, USA
| | | | - Geert Litjens
- Department of Pathology, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Amin Madani
- Department of Surgery, University Health Network, Philadelphia, PA, USA
| | - Klaus Maier-Hein
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
- Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Heidelberg, Germany
| | - Anne L Martel
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Physical Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Peter Mattson
- Google, 1600 Amphitheatre Pkwy, Mountain View, CA, USA
| | - Erik Meijering
- School of Computer Science and Engineering, University of New South Wales, UNSW Sydney, Kensington, New South Wales, Australia
| | - Bjoern Menze
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, UMC Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Henning Müller
- Information Systems Institute, University of Applied Sciences Western Switzerland (HES-SO), Sierre, Switzerland
- Medical Faculty, University of Geneva, Geneva, Switzerland
| | - Brennan Nichyporuk
- MILA (Québec Artificial Intelligence Institute), Montréal, Quebec, Canada
| | - Felix Nickel
- Department of General, Visceral and Thoracic Surgery, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Jens Petersen
- German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Heidelberg, Germany
| | - Nasir Rajpoot
- Tissue Image Analytics Laboratory, Department of Computer Science, University of Warwick, Coventry, UK
| | | | - Julio Saez-Rodriguez
- Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany
- Faculty of Medicine, Heidelberg University Hospital, Heidelberg, Germany
| | - Clara I Sánchez
- Informatics Institute, Faculty of Science, University of Amsterdam, Amsterdam, the Netherlands
| | | | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, UMC Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Ronald M Summers
- National Institutes of Health Clinical Center, Bethesda, MD, USA
| | - Abdel A Taha
- Institute of Information Systems Engineering, TU Wien, Vienna, Austria
| | - Aleksei Tiulpin
- Research Unit of Health Sciences and Technology, Faculty of Medicine, University of Oulu, Oulu, Finland
- Neurocenter Oulu, Oulu University Hospital, Oulu, Finland
| | | | - Ben Van Calster
- Department of Development and Regeneration and EPI-centre, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| | - Gaël Varoquaux
- Parietal project team, INRIA Saclay-Île de France, Palaiseau, France
| | - Paul F Jäger
- German Cancer Research Center (DKFZ) Heidelberg, HI Helmholtz Imaging, Heidelberg, Germany.
- German Cancer Research Center (DKFZ) Heidelberg, Interactive Machine Learning Group, Heidelberg, Germany.
| |
Collapse
|
5
|
Viner C, Ishak CA, Johnson J, Walker NJ, Shi H, Sjöberg-Herrera MK, Shen SY, Lardo SM, Adams DJ, Ferguson-Smith AC, De Carvalho DD, Hainer SJ, Bailey TL, Hoffman MM. Modeling methyl-sensitive transcription factor motifs with an expanded epigenetic alphabet. Genome Biol 2024; 25:11. [PMID: 38191487 PMCID: PMC10773111 DOI: 10.1186/s13059-023-03070-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 09/21/2023] [Indexed: 01/10/2024] Open
Abstract
BACKGROUND Transcription factors bind DNA in specific sequence contexts. In addition to distinguishing one nucleobase from another, some transcription factors can distinguish between unmodified and modified bases. Current models of transcription factor binding tend not to take DNA modifications into account, while the recent few that do often have limitations. This makes a comprehensive and accurate profiling of transcription factor affinities difficult. RESULTS Here, we develop methods to identify transcription factor binding sites in modified DNA. Our models expand the standard A/C/G/T DNA alphabet to include cytosine modifications. We develop Cytomod to create modified genomic sequences and we also enhance the MEME Suite, adding the capacity to handle custom alphabets. We adapt the well-established position weight matrix (PWM) model of transcription factor binding affinity to this expanded DNA alphabet. Using these methods, we identify modification-sensitive transcription factor binding motifs. We confirm established binding preferences, such as the preference of ZFP57 and C/EBPβ for methylated motifs and the preference of c-Myc for unmethylated E-box motifs. CONCLUSIONS Using known binding preferences to tune model parameters, we discover novel modified motifs for a wide array of transcription factors. Finally, we validate our binding preference predictions for OCT4 using cleavage under targets and release using nuclease (CUT&RUN) experiments across conventional, methylation-, and hydroxymethylation-enriched sequences. Our approach readily extends to other DNA modifications. As more genome-wide single-base resolution modification data becomes available, we expect that our method will yield insights into altered transcription factor binding affinities across many different modifications.
Collapse
Affiliation(s)
- Coby Viner
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Charles A Ishak
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - James Johnson
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Nicolas J Walker
- Department of Genetics, University of Cambridge, Cambridge, England
| | - Hui Shi
- Department of Genetics, University of Cambridge, Cambridge, England
| | - Marcela K Sjöberg-Herrera
- Wellcome Sanger Institute, Cambridge, England
- Faculty of Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Shu Yi Shen
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Santana M Lardo
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | | | | | - Daniel D De Carvalho
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Sarah J Hainer
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Timothy L Bailey
- Department of Pharmacology, University of Nevada, Reno, Reno, NV, USA
| | - Michael M Hoffman
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada.
| |
Collapse
|
6
|
Karimzadeh M, Arlidge C, Rostami A, Lupien M, Bratman SV, Hoffman MM. Human papillomavirus integration transforms chromatin to drive oncogenesis. Genome Biol 2023; 24:142. [PMID: 37365652 DOI: 10.1186/s13059-023-02926-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 04/07/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND Human papillomavirus (HPV) drives almost all cervical cancers and up to 70% of head and neck cancers. Frequent integration into the host genome occurs predominantly in tumorigenic types of HPV. We hypothesize that changes in chromatin state at the location of integration can result in changes in gene expression that contribute to the tumorigenicity of HPV. RESULTS We find that viral integration events often occur along with changes in chromatin state and expression of genes near the integration site. We investigate whether introduction of new transcription factor binding sites due to HPV integration could invoke these changes. Some regions within the HPV genome, particularly the position of a conserved CTCF binding site, show enriched chromatin accessibility signal. ChIP-seq reveals that the conserved CTCF binding site within the HPV genome binds CTCF in 4 HPV+ cancer cell lines. Significant changes in CTCF binding pattern and increases in chromatin accessibility occur exclusively within 100 kbp of HPV integration sites. The chromatin changes co-occur with out-sized changes in transcription and alternative splicing of local genes. Analysis of The Cancer Genome Atlas (TCGA) HPV+ tumors indicates that HPV integration upregulates genes which have significantly higher essentiality scores compared to randomly selected upregulated genes from the same tumors. CONCLUSIONS Our results suggest that introduction of a new CTCF binding site due to HPV integration reorganizes chromatin state and upregulates genes essential for tumor viability in some HPV+ tumors. These findings emphasize a newly recognized role of HPV integration in oncogenesis.
Collapse
Affiliation(s)
- Mehran Karimzadeh
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Christopher Arlidge
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Ariana Rostami
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Mathieu Lupien
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
| | - Scott V Bratman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
7
|
Boury H, Albert M, Chen RHC, Chow JCL, DaCosta R, Hoffman MM, Keshavarz B, Kontos P, McAndrews MP, Protze S, Gagliardi AR. Exploring the merits of research performance measures that comply with the San Francisco Declaration on Research Assessment and strategies to overcome barriers of adoption: qualitative interviews with administrators and researchers. Health Res Policy Syst 2023; 21:43. [PMID: 37277824 DOI: 10.1186/s12961-023-01001-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 05/16/2023] [Indexed: 06/07/2023] Open
Abstract
BACKGROUND In prior research, we identified and prioritized ten measures to assess research performance that comply with the San Francisco Declaration on Research Assessment, a principle adopted worldwide that discourages metrics-based assessment. Given the shift away from assessment based on Journal Impact Factor, we explored potential barriers to implementing and adopting the prioritized measures. METHODS We identified administrators and researchers across six research institutes, conducted telephone interviews with consenting participants, and used qualitative description and inductive content analysis to derive themes. RESULTS We interviewed 18 participants: 6 administrators (research institute business managers and directors) and 12 researchers (7 on appointment committees) who varied by career stage (2 early, 5 mid, 5 late). Participants appreciated that the measures were similar to those currently in use, comprehensive, relevant across disciplines, and generated using a rigorous process. They also said the reporting template was easy to understand and use. In contrast, a few administrators thought the measures were not relevant across disciplines. A few participants said it would be time-consuming and difficult to prepare narratives when reporting the measures, and several thought that it would be difficult to objectively evaluate researchers from a different discipline without considerable effort to read their work. Strategies viewed as necessary to overcome barriers and support implementation of the measures included high-level endorsement of the measures, an official launch accompanied by a multi-pronged communication strategy, training for both researchers and evaluators, administrative support or automated reporting for researchers, guidance for evaluators, and sharing of approaches across research institutes. CONCLUSIONS While participants identified many strengths of the measures, they also identified a few limitations and offered corresponding strategies to address the barriers that we will apply at our organization. Ongoing work is needed to develop a framework to help evaluators translate the measures into an overall assessment. Given little prior research that identified research assessment measures and strategies to support adoption of those measures, this research may be of interest to other organizations that assess the quality and impact of research.
Collapse
Affiliation(s)
- Himani Boury
- Toronto General Hospital Research Institute, University Health Network, Toronto, Canada
| | - Mathieu Albert
- The Institute for Education Research, University Health Network, Toronto, Canada
| | - Robert H C Chen
- Research Solutions and Services, University Health Network, Toronto, Canada
| | - James C L Chow
- Techna Institute, University Health Network, Toronto, Canada
| | - Ralph DaCosta
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Behrang Keshavarz
- KITE Research Institute, Toronto Rehabilitation Institute, University Health Network, Toronto, Canada
| | - Pia Kontos
- KITE Research Institute, Toronto Rehabilitation Institute, University Health Network, Toronto, Canada
| | | | - Stephanie Protze
- McEwen Stem Cell Institute, University Health Network, Toronto, Canada
| | - Anna R Gagliardi
- Toronto General Hospital Research Institute, University Health Network, Toronto, Canada.
| |
Collapse
|
8
|
Denisko D, Viner C, Hoffman MM. Motif elucidation in ChIP-seq datasets with a knockout control. Bioinformatics Advances 2023; 3:vbad031. [PMID: 37033469 PMCID: PMC10074035 DOI: 10.1093/bioadv/vbad031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 12/21/2022] [Accepted: 03/15/2023] [Indexed: 03/18/2023]
Abstract
Abstract
Chromatin immunoprecipitation-sequencing (ChIP-seq) is widely used to find transcription factor binding sites, but suffers from various sources of noise. Knocking out the target factor mitigates noise by acting as a negative control. Paired wild-type and knockout experiments can generate improved motifs but require optimal differential analysis. We introduce peaKO—a computational method to automatically optimize motif analyses with knockout controls, which we compare to two other methods. PeaKO often improves elucidation of the target factor and highlights the benefits of knockout controls, which far outperform input controls. It is freely available at https://peako.hoffmanlab.org.
Collapse
|
9
|
Wilson SL, Shen SY, Harmon L, Burgener JM, Triche T, Bratman SV, De Carvalho DD, Hoffman MM. Sensitive and reproducible cell-free methylome quantification with synthetic spike-in controls. Cell Rep Methods 2022; 2:100294. [PMID: 36160046 PMCID: PMC9499995 DOI: 10.1016/j.crmeth.2022.100294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 05/17/2022] [Accepted: 08/19/2022] [Indexed: 06/16/2023]
Abstract
Cell-free methylated DNA immunoprecipitation sequencing (cfMeDIP-seq) identifies genomic regions with DNA methylation, using a protocol adapted to work with low-input DNA samples and with cell-free DNA (cfDNA). We developed a set of synthetic spike-in DNA controls for cfMeDIP-seq to provide a simple and inexpensive reference for quantitative normalization. We designed 54 DNA fragments with combinations of methylation status (methylated and unmethylated), fragment length (80 bp, 160 bp, 320 bp), G + C content (35%, 50%, 65%), and fraction of CpG dinucleotides within the fragment (1/80 bp, 1/40 bp, 1/20 bp). Using 0.01 ng of spike-in controls enables training a generalized linear model that absolutely quantifies methylated cfDNA in MeDIP-seq experiments. It mitigates batch effects and corrects for biases in enrichment due to known biophysical properties of DNA fragments and other technical biases.
Collapse
Affiliation(s)
- Samantha L. Wilson
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Shu Yi Shen
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | | | - Justin M. Burgener
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Tim Triche
- Van Andel Institute, Grand Rapids, MI, USA
| | - Scott V. Bratman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Daniel D. De Carvalho
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Michael M. Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| |
Collapse
|
10
|
Karimzadeh M, Hoffman MM. Virtual ChIP-seq: predicting transcription factor binding by learning from the transcriptome. Genome Biol 2022; 23:126. [PMID: 35681170 PMCID: PMC9185870 DOI: 10.1186/s13059-022-02690-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 05/16/2022] [Indexed: 11/29/2022] Open
Abstract
Existing methods for computational prediction of transcription factor (TF) binding sites evaluate genomic regions with similarity to known TF sequence preferences. Most TF binding sites, however, do not resemble known TF sequence motifs, and many TFs are not sequence-specific. We developed Virtual ChIP-seq, which predicts binding of individual TFs in new cell types, integrating learned associations with gene expression and binding, TF binding sites from other cell types, and chromatin accessibility data in the new cell type. This approach outperforms methods that predict TF binding solely based on sequence preference, predicting binding for 36 TFs (MCC>0.3).
Collapse
Affiliation(s)
- Mehran Karimzadeh
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada.,Vector Institute, Toronto, ON, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada. .,Princess Margaret Cancer Centre, Toronto, ON, Canada. .,Vector Institute, Toronto, ON, Canada. .,Department of Computer Science, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
11
|
Niu YN, Roberts EG, Denisko D, Hoffman MM. Assessing and assuring interoperability of a genomics file format. Bioinformatics 2022; 38:3327-3336. [PMID: 35575355 PMCID: PMC9237710 DOI: 10.1093/bioinformatics/btac327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 03/30/2022] [Accepted: 05/11/2022] [Indexed: 12/01/2022] Open
Abstract
Motivation Bioinformatics software tools operate largely through the use of specialized genomics file formats. Often these formats lack formal specification, making it difficult or impossible for the creators of these tools to robustly test them for correct handling of input and output. This causes problems in interoperability between different tools that, at best, wastes time and frustrates users. At worst, interoperability issues could lead to undetected errors in scientific results. Results We developed a new verification system, Acidbio, which tests for correct behavior in bioinformatics software packages. We crafted tests to unify correct behavior when tools encounter various edge cases—potentially unexpected inputs that exemplify the limits of the format. To analyze the performance of existing software, we tested the input validation of 80 Bioconda packages that parsed the Browser Extensible Data (BED) format. We also used a fuzzing approach to automatically perform additional testing. Of 80 software packages examined, 75 achieved less than 70% correctness on our test suite. We categorized multiple root causes for the poor performance of different types of software. Fuzzing detected other errors that the manually designed test suite could not. We also created a badge system that developers can use to indicate more precisely which BED variants their software accepts and to advertise the software’s performance on the test suite. Availability and implementation Acidbio is available at https://github.com/hoffmangroup/acidbio. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yi Nian Niu
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada
| | - Eric G Roberts
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada
| | - Danielle Denisko
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada
| | - Michael M Hoffman
- Princess Margaret Cancer Centre University Health Network, Toronto, ON, M5G 2C1, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada.,Vector Institute, Toronto, ON, M5G 1M1, Canada
| |
Collapse
|
12
|
Rehm HL, Page AJ, Smith L, Adams JB, Alterovitz G, Babb LJ, Barkley MP, Baudis M, Beauvais MJ, Beck T, Beckmann JS, Beltran S, Bernick D, Bernier A, Bonfield JK, Boughtwood TF, Bourque G, Bowers SR, Brookes AJ, Brudno M, Brush MH, Bujold D, Burdett T, Buske OJ, Cabili MN, Cameron DL, Carroll RJ, Casas-Silva E, Chakravarty D, Chaudhari BP, Chen SH, Cherry JM, Chung J, Cline M, Clissold HL, Cook-Deegan RM, Courtot M, Cunningham F, Cupak M, Davies RM, Denisko D, Doerr MJ, Dolman LI, Dove ES, Dursi LJ, Dyke SO, Eddy JA, Eilbeck K, Ellrott KP, Fairley S, Fakhro KA, Firth HV, Fitzsimons MS, Fiume M, Flicek P, Fore IM, Freeberg MA, Freimuth RR, Fromont LA, Fuerth J, Gaff CL, Gan W, Ghanaim EM, Glazer D, Green RC, Griffith M, Griffith OL, Grossman RL, Groza T, Guidry Auvil JM, Guigó R, Gupta D, Haendel MA, Hamosh A, Hansen DP, Hart RK, Hartley DM, Haussler D, Hendricks-Sturrup RM, Ho CW, Hobb AE, Hoffman MM, Hofmann OM, Holub P, Hsu JS, Hubaux JP, Hunt SE, Husami A, Jacobsen JO, Jamuar SS, Janes EL, Jeanson F, Jené A, Johns AL, Joly Y, Jones SJ, Kanitz A, Kato K, Keane TM, Kekesi-Lafrance K, Kelleher J, Kerry G, Khor SS, Knoppers BM, Konopko MA, Kosaki K, Kuba M, Lawson J, Leinonen R, Li S, Lin MF, Linden M, Liu X, Liyanage IU, Lopez J, Lucassen AM, Lukowski M, Mann AL, Marshall J, Mattioni M, Metke-Jimenez A, Middleton A, Milne RJ, Molnár-Gábor F, Mulder N, Munoz-Torres MC, Nag R, Nakagawa H, Nasir J, Navarro A, Nelson TH, Niewielska A, Nisselle A, Niu J, Nyrönen TH, O’Connor BD, Oesterle S, Ogishima S, Ota Wang V, Paglione LA, Palumbo E, Parkinson HE, Philippakis AA, Pizarro AD, Prlic A, Rambla J, Rendon A, Rider RA, Robinson PN, Rodarmer KW, Rodriguez LL, Rubin AF, Rueda M, Rushton GA, Ryan RS, Saunders GI, Schuilenburg H, Schwede T, Scollen S, Senf A, Sheffield NC, Skantharajah N, Smith AV, Sofia HJ, Spalding D, Spurdle AB, Stark Z, Stein LD, Suematsu M, Tan P, Tedds JA, Thomson AA, Thorogood A, Tickle TL, Tokunaga K, Törnroos J, Torrents D, Upchurch S, Valencia A, Guimera RV, Vamathevan J, Varma S, Vears DF, Viner C, Voisin C, Wagner AH, Wallace SE, Walsh BP, Williams MS, Winkler EC, Wold BJ, Wood GM, Woolley JP, Yamasaki C, Yates AD, Yung CK, Zass LJ, Zaytseva K, Zhang J, Goodhand P, North K, Birney E. GA4GH: International policies and standards for data sharing across genomic research and healthcare. Cell Genom 2021; 1:100029. [PMID: 35072136 PMCID: PMC8774288 DOI: 10.1016/j.xgen.2021.100029] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits.
Collapse
Affiliation(s)
- Heidi L. Rehm
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Angela J.H. Page
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Global Alliance for Genomics and Health, Toronto, ON, Canada
| | - Lindsay Smith
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jeremy B. Adams
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Gil Alterovitz
- Brigham and Women’s Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | | | | | - Michael Baudis
- University of Zurich, Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Michael J.S. Beauvais
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- McGill University, Montreal, QC, Canada
| | - Tim Beck
- University of Leicester, Leicester, UK
| | | | - Sergi Beltran
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Universitat de Barcelona, Barcelona, Spain
| | - David Bernick
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | - Tiffany F. Boughtwood
- Australian Genomics, Parkville, VIC, Australia
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
| | - Guillaume Bourque
- McGill University, Montreal, QC, Canada
- Canadian Center for Computational Genomics, Montreal, QC, Canada
| | | | | | - Michael Brudno
- Canadian Center for Computational Genomics, Montreal, QC, Canada
- University of Toronto, Toronto, ON, Canada
- University Health Network, Toronto, ON, Canada
- Vector Institute, Toronto, ON, Canada
- Canadian Distributed Infrastructure for Genomics (CanDIG), Toronto, ON, Canada
| | | | - David Bujold
- McGill University, Montreal, QC, Canada
- Canadian Center for Computational Genomics, Montreal, QC, Canada
- Canadian Distributed Infrastructure for Genomics (CanDIG), Toronto, ON, Canada
| | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | | | - Daniel L. Cameron
- Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
| | | | | | | | - Bimal P. Chaudhari
- Nationwide Children’s Hospital, Columbus, OH, USA
- The Ohio State University, Columbus, OH, USA
| | - Shu Hui Chen
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Justina Chung
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Melissa Cline
- UC Santa Cruz Genomics Institute, Santa Cruz, CA, USA
| | | | | | - Mélanie Courtot
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | | | | | | | | | | | - L. Jonathan Dursi
- University Health Network, Toronto, ON, Canada
- Canadian Distributed Infrastructure for Genomics (CanDIG), Toronto, ON, Canada
| | | | | | | | | | - Susan Fairley
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Khalid A. Fakhro
- Sidra Medicine, Doha, Qatar
- Weill Cornell Medicine - Qatar, Doha, Qatar
| | - Helen V. Firth
- Wellcome Sanger Institute, Hinxton, UK
- Addenbrooke’s Hospital, Cambridge, UK
| | | | | | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Ian M. Fore
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mallory A. Freeberg
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | - Lauren A. Fromont
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | | | - Clara L. Gaff
- Australian Genomics, Parkville, VIC, Australia
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
- Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
| | - Weiniu Gan
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Elena M. Ghanaim
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - David Glazer
- Verily Life Sciences, South San Francisco, CA, USA
| | - Robert C. Green
- Brigham and Women’s Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Malachi Griffith
- Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Obi L. Griffith
- Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | | | | | | | - Roderic Guigó
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Dipayan Gupta
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | - Ada Hamosh
- Johns Hopkins University, Baltimore, MD, USA
| | - David P. Hansen
- Australian Genomics, Parkville, VIC, Australia
- The Australian e-Health Research Centre, CSIRO, Herston, QLD, Australia
| | - Reece K. Hart
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Invitae, San Francisco, CA, USA
- MyOme, Inc, San Bruno, CA, USA
| | | | - David Haussler
- UC Santa Cruz Genomics Institute, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, University of California, Santa Cruz, CA, USA
| | | | | | | | - Michael M. Hoffman
- University of Toronto, Toronto, ON, Canada
- University Health Network, Toronto, ON, Canada
- Vector Institute, Toronto, ON, Canada
| | - Oliver M. Hofmann
- University of Toronto, Toronto, ON, Canada
- University of Melbourne, Melbourne, VIC, Australia
| | - Petr Holub
- BBMRI-ERIC, Graz, Austria
- Masaryk University, Brno, Czech Republic
| | | | | | - Sarah E. Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Ammar Husami
- Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | | | - Saumya S. Jamuar
- SingHealth Duke-NUS Genomic Medicine Centre, Singapore, Republic of Singapore
- SingHealth Duke-NUS Institute of Precision Medicine, Singapore, Republic of Singapore
| | - Elizabeth L. Janes
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- University of Waterloo, Waterloo, ON, Canada
| | | | - Aina Jené
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Amber L. Johns
- Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Yann Joly
- McGill University, Montreal, QC, Canada
| | - Steven J.M. Jones
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, Canada
| | - Alexander Kanitz
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- University of Basel, Basel, Switzerland
| | | | - Thomas M. Keane
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
- University of Nottingham, Nottingham, UK
| | - Kristina Kekesi-Lafrance
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- McGill University, Montreal, QC, Canada
| | | | - Giselle Kerry
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Seik-Soon Khor
- National Center for Global Health and Medicine Hospital, Tokyo, Japan
- University of Tokyo, Tokyo, Japan
| | | | | | | | | | | | - Rasko Leinonen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Stephanie Li
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Global Alliance for Genomics and Health, Toronto, ON, Canada
| | | | - Mikael Linden
- CSC–IT Center for Science, Espoo, Finland
- ELIXIR Finland, Espoo, Finland
| | | | - Isuru Udara Liyanage
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | | | | | - Alice L. Mann
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Wellcome Sanger Institute, Hinxton, UK
| | | | | | | | - Anna Middleton
- Wellcome Connecting Science, Hinxton, UK
- University of Cambridge, Cambridge, UK
| | - Richard J. Milne
- Wellcome Connecting Science, Hinxton, UK
- University of Cambridge, Cambridge, UK
| | | | - Nicola Mulder
- H3ABioNet, Computational Biology Division, IDM, Faculty of Health Sciences, Cape Town, South Africa
| | | | - Rishi Nag
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Hidewaki Nakagawa
- Japan Agency for Medical Research & Development (AMED), Tokyo, Japan
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | - Arcadi Navarro
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
- Barcelonaβeta Brain Research Center (BBRC), Pasqual Maragall Foundation, Barcelona, Spain
| | | | - Ania Niewielska
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Amy Nisselle
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
- Human Genetics Society of Australasia Education, Ethics & Social Issues Committee, Alexandria, NSW, Australia
| | - Jeffrey Niu
- University Health Network, Toronto, ON, Canada
| | - Tommi H. Nyrönen
- CSC–IT Center for Science, Espoo, Finland
- ELIXIR Finland, Espoo, Finland
| | | | - Sabine Oesterle
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Vivian Ota Wang
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Emilio Palumbo
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Helen E. Parkinson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | | | | | - Jordi Rambla
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | | | - Renee A. Rider
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Peter N. Robinson
- The Jackson Laboratory, Farmington, CT, USA
- University of Connecticut, Farmington, CT, USA
| | - Kurt W. Rodarmer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | - Alan F. Rubin
- Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
| | - Manuel Rueda
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | | | | | | | - Helen Schuilenburg
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Torsten Schwede
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- University of Basel, Basel, Switzerland
| | | | | | | | - Neerjah Skantharajah
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Heidi J. Sofia
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Dylan Spalding
- CSC–IT Center for Science, Espoo, Finland
- ELIXIR Finland, Espoo, Finland
| | | | - Zornitza Stark
- Australian Genomics, Parkville, VIC, Australia
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
| | - Lincoln D. Stein
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- University of Toronto, Toronto, ON, Canada
| | | | - Patrick Tan
- SingHealth Duke-NUS Genomic Medicine Centre, Singapore, Republic of Singapore
- Precision Health Research Singapore, Singapore, Republic of Singapore
- Genome Institute of Singapore, Singapore, Republic of Singapore
| | | | - Alastair A. Thomson
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Adrian Thorogood
- McGill University, Montreal, QC, Canada
- University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | - Katsushi Tokunaga
- University of Tokyo, Tokyo, Japan
- National Center for Global Health and Medicine, Tokyo, Japan
| | - Juha Törnroos
- CSC–IT Center for Science, Espoo, Finland
- ELIXIR Finland, Espoo, Finland
| | - David Torrents
- Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
- Barcelona Supercomputing Center, Barcelona, Spain
| | - Sean Upchurch
- California Institute of Technology, Pasadena, CA, USA
| | - Alfonso Valencia
- Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
- Barcelona Supercomputing Center, Barcelona, Spain
| | | | - Jessica Vamathevan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Susheel Varma
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
- Health Data Research UK, London, UK
| | - Danya F. Vears
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
- University of Melbourne, Melbourne, VIC, Australia
- Human Genetics Society of Australasia Education, Ethics & Social Issues Committee, Alexandria, NSW, Australia
- Melbourne Law School, University of Melbourne, Parkville, VIC, Australia
| | - Coby Viner
- University of Toronto, Toronto, ON, Canada
- University Health Network, Toronto, ON, Canada
| | | | - Alex H. Wagner
- Nationwide Children’s Hospital, Columbus, OH, USA
- The Ohio State University, Columbus, OH, USA
| | | | | | | | - Eva C. Winkler
- Section of Translational Medical Ethics, University Hospital Heidelberg, Heidelberg, Germany
| | | | | | | | | | - Andrew D. Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Christina K. Yung
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Indoc Research, Toronto, ON, Canada
| | - Lyndon J. Zass
- H3ABioNet, Computational Biology Division, IDM, Faculty of Health Sciences, Cape Town, South Africa
| | - Ksenia Zaytseva
- McGill University, Montreal, QC, Canada
- Canadian Centre for Computational Genomics, Montreal, QC, Canada
| | - Junjun Zhang
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Peter Goodhand
- Global Alliance for Genomics and Health, Toronto, ON, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Kathryn North
- Murdoch Children’s Research Institute, Parkville, VIC, Australia
- University of Toronto, Toronto, ON, Canada
- University of Melbourne, Melbourne, VIC, Australia
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
- European Molecular Biology Laboratory, Heidelberg, Germany
| |
Collapse
|
13
|
Abstract
To make machine learning analyses in the life sciences more computationally reproducible, we propose standards based on data, model, and code publication, programming best practices, and workflow automation. By meeting these standards, the community of researchers applying machine learning methods in the life sciences can ensure that their analyses are worthy of trust. this article has been peer reviewed.
Collapse
Affiliation(s)
- Benjamin J Heil
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Florian Markowetz
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Su-In Lee
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Casey S Greene
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, USA.
- Center for Health AI, University of Colorado School of Medicine, Aurora, CO, USA.
| | - Stephanie C Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
14
|
Abstract
Segmentation and genome annotation (SAGA) algorithms are widely used to understand genome activity and gene regulation. These algorithms take as input epigenomic datasets, such as chromatin immunoprecipitation-sequencing (ChIP-seq) measurements of histone modifications or transcription factor binding. They partition the genome and assign a label to each segment such that positions with the same label exhibit similar patterns of input data. SAGA algorithms discover categories of activity such as promoters, enhancers, or parts of genes without prior knowledge of known genomic elements. In this sense, they generally act in an unsupervised fashion like clustering algorithms, but with the additional simultaneous function of segmenting the genome. Here, we review the common methodological framework that underlies these methods, review variants of and improvements upon this basic framework, and discuss the outlook for future work. This review is intended for those interested in applying SAGA methods and for computational researchers interested in improving upon them.
Collapse
Affiliation(s)
| | - Rachel C. W. Chan
- Department of Computer Science, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Michael M. Hoffman
- Department of Computer Science, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Vector Institute for Artificial Intelligence, Toronto, Canada
| |
Collapse
|
15
|
Burgener JM, Zou J, Zhao Z, Zheng Y, Shen SY, Huang SH, Keshavarzi S, Xu W, Liu FF, Liu G, Waldron JN, Weinreb I, Spreafico A, Siu LL, de Almeida JR, Goldstein DP, Hoffman MM, De Carvalho DD, Bratman SV. Tumor-Naïve Multimodal Profiling of Circulating Tumor DNA in Head and Neck Squamous Cell Carcinoma. Clin Cancer Res 2021; 27:4230-4244. [PMID: 34158359 PMCID: PMC9401560 DOI: 10.1158/1078-0432.ccr-21-0110] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Revised: 03/16/2021] [Accepted: 05/28/2021] [Indexed: 01/07/2023]
Abstract
PURPOSE Circulating tumor DNA (ctDNA) enables personalized treatment strategies in oncology by providing a noninvasive source of clinical biomarkers. In patients with low ctDNA abundance, tumor-naïve methods are needed to facilitate clinical implementation. Here, using locoregionally confined head and neck squamous cell carcinoma (HNSCC) as an example, we demonstrate tumor-naïve detection of ctDNA by simultaneous profiling of mutations and methylation. EXPERIMENTAL DESIGN We conducted CAncer Personalized Profiling by deep Sequencing (CAPP-seq) and cell-free Methylated DNA ImmunoPrecipitation and high-throughput sequencing (cfMeDIP-seq) for detection of ctDNA-derived somatic mutations and aberrant methylation, respectively. We analyzed 77 plasma samples from 30 patients with stage I-IVA human papillomavirus-negative HNSCC as well as plasma samples from 20 risk-matched healthy controls. In addition, we analyzed leukocytes from patients and controls. RESULTS CAPP-seq identified mutations in 20 of 30 patients at frequencies similar to that of The Tumor Genome Atlas (TCGA). Differential methylation analysis of cfMeDIP-seq profiles identified 941 ctDNA-derived hypermethylated regions enriched for CpG islands and HNSCC-specific methylation patterns. Both methods demonstrated an association between ctDNA abundance and shorter fragment lengths. In addition, mutation- and methylation-based ctDNA abundance was highly correlated (r > 0.85). Patients with detectable pretreatment ctDNA by both methods demonstrated significantly worse overall survival (HR = 7.5; P = 0.025) independent of clinical stage, with lack of ctDNA clearance post-treatment strongly correlating with recurrence. We further leveraged cfMeDIP-seq profiles to validate a prognostic signature identified from TCGA samples. CONCLUSIONS Tumor-naïve detection of ctDNA by multimodal profiling may facilitate biomarker discovery and clinical use in low ctDNA abundance applications.
Collapse
Affiliation(s)
- Justin M. Burgener
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Jinfeng Zou
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Zhen Zhao
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Yangqiao Zheng
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Shu Yi Shen
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Shao Hui Huang
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Deparment of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada.,Deparment of Otolaryngology – Head and Neck Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Sareh Keshavarzi
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Wei Xu
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Fei-Fei Liu
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Deparment of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Geoffrey Liu
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Division of Medical Oncology and Hematology, Department of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - John N. Waldron
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Deparment of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Ilan Weinreb
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
| | - Anna Spreafico
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Division of Medical Oncology and Hematology, Department of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Lillian L. Siu
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Division of Medical Oncology and Hematology, Department of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - John R. de Almeida
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Deparment of Otolaryngology – Head and Neck Surgery, University of Toronto, Toronto, Ontario, Canada
| | - David P. Goldstein
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Deparment of Otolaryngology – Head and Neck Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Michael M. Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,Vector Institute, Toronto, Ontario, Canada
| | - Daniel D. De Carvalho
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Corresponding Authors: Scott V. Bratman, Medical Biophysics, Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada. Phone: 416-946-2121; E-mail: ; and Daniel D. De Carvalho,
| | - Scott V. Bratman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Deparment of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada.,Corresponding Authors: Scott V. Bratman, Medical Biophysics, Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada. Phone: 416-946-2121; E-mail: ; and Daniel D. De Carvalho,
| |
Collapse
|
16
|
Affiliation(s)
- Samantha L Wilson
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Gregory P Way
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Wout Bittremieux
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.,Department of Computer Science, University of Antwerp, Antwerpen, Belgium
| | - Jean-Paul Armache
- Department of Biochemistry & Molecular Biology, The Huck Institutes of Life Sciences, Pennsylvania State University, University Park, PA, USA
| | | | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.,Department of Medical Biophysics, Department of Computer Science, University of Toronto, Toronto, ON, Canada.,Vector Institute, Toronto, ON, Canada
| |
Collapse
|
17
|
Haibe-Kains B, Adam GA, Hosny A, Khodakarami F, Waldron L, Wang B, McIntosh C, Goldenberg A, Kundaje A, Greene CS, Broderick T, Hoffman MM, Leek JT, Korthauer K, Huber W, Brazma A, Pineau J, Tibshirani R, Hastie T, Ioannidis JPA, Quackenbush J, Aerts HJWL. Transparency and reproducibility in artificial intelligence. Nature 2020; 586:E14-E16. [PMID: 33057217 PMCID: PMC8144864 DOI: 10.1038/s41586-020-2766-y] [Citation(s) in RCA: 140] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2020] [Accepted: 08/10/2020] [Indexed: 01/15/2023]
Abstract
Breakthroughs in artificial intelligence (AI) hold enormous potential as it can automate complex tasks and go even beyond human performance. In their study, McKinney et al. showed the high potential of AI for breast cancer screening. However, the lack of methods’ details and algorithm code undermines its scientific value. Here, we identify obstacles hindering transparent and reproducible AI research as faced by McKinney et al., and provide solutions to these obstacles with implications for the broader field.
Collapse
Affiliation(s)
- Benjamin Haibe-Kains
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada.
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada.
| | - George Alexandru Adam
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
| | - Ahmed Hosny
- Artificial Intelligence in Medicine (AIM) Program, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Radiation Oncology and Radiology, Dana-Farber Cancer Institute, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Farnoosh Khodakarami
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Levi Waldron
- Department of Epidemiology and Biostatistics and Institute for Implementation Science in Population Health, CUNY Graduate School of Public Health and Health Policy, New York, NY, USA
| | - Bo Wang
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Ontario, Canada
| | - Chris McIntosh
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada
| | - Anna Goldenberg
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- SickKids Research Institute, Toronto, Ontario, Canada
- Child and Brain Development Program, CIFAR, Toronto, Ontario, Canada
| | - Anshul Kundaje
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Casey S Greene
- Dept. of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Childhood Cancer Data Lab, Alex's Lemonade Stand Foundation, Philadelphia, PA, USA
| | - Tamara Broderick
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
| | - Jeffrey T Leek
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Keegan Korthauer
- Department of Statistics, University of British Columbia, Vancouver, British Columbia, Canada
- BC Children's Hospital Research Institute, Vancouver, British Columbia, Canada
| | - Wolfgang Huber
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Alvis Brazma
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Hinxton, UK
| | - Joelle Pineau
- McGill University, Montreal, Quebec, Canada
- Montreal Institute for Learning Algorithms, Quebec, Canada
| | - Robert Tibshirani
- Department of Statistics, Stanford University School of Humanities and Sciences, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Trevor Hastie
- Department of Statistics, Stanford University School of Humanities and Sciences, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - John P A Ioannidis
- Department of Statistics, Stanford University School of Humanities and Sciences, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Meta-Research Innovation Center at Stanford (METRICS), Stanford, CA, USA
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
| | - John Quackenbush
- Department of Biostatistics, Harvard T.H Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Hugo J W L Aerts
- Artificial Intelligence in Medicine (AIM) Program, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Radiation Oncology and Radiology, Dana-Farber Cancer Institute, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Radiology and Nuclear Medicine, Maastricht University, Maastricht, The Netherlands
- Cardiovascular Imaging Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
18
|
Ramilowski JA, Yip CW, Agrawal S, Chang JC, Ciani Y, Kulakovskiy IV, Mendez M, Ooi JLC, Ouyang JF, Parkinson N, Petri A, Roos L, Severin J, Yasuzawa K, Abugessaisa I, Akalin A, Antonov IV, Arner E, Bonetti A, Bono H, Borsari B, Brombacher F, Cameron CJ, Cannistraci CV, Cardenas R, Cardon M, Chang H, Dostie J, Ducoli L, Favorov A, Fort A, Garrido D, Gil N, Gimenez J, Guler R, Handoko L, Harshbarger J, Hasegawa A, Hasegawa Y, Hashimoto K, Hayatsu N, Heutink P, Hirose T, Imada EL, Itoh M, Kaczkowski B, Kanhere A, Kawabata E, Kawaji H, Kawashima T, Kelly ST, Kojima M, Kondo N, Koseki H, Kouno T, Kratz A, Kurowska-Stolarska M, Kwon ATJ, Leek J, Lennartsson A, Lizio M, López-Redondo F, Luginbühl J, Maeda S, Makeev VJ, Marchionni L, Medvedeva YA, Minoda A, Müller F, Muñoz-Aguirre M, Murata M, Nishiyori H, Nitta KR, Noguchi S, Noro Y, Nurtdinov R, Okazaki Y, Orlando V, Paquette D, Parr CJ, Rackham OJ, Rizzu P, Martinez DFS, Sandelin A, Sanjana P, Semple CA, Shibayama Y, Sivaraman DM, Suzuki T, Szumowski SC, Tagami M, Taylor MS, Terao C, Thodberg M, Thongjuea S, Tripathi V, Ulitsky I, Verardo R, Vorontsov IE, Yamamoto C, Young RS, Baillie JK, Forrest AR, Guigó R, Hoffman MM, Hon CC, Kasukawa T, Kauppinen S, Kere J, Lenhard B, Schneider C, Suzuki H, Yagi K, de Hoon MJ, Shin JW, Carninci P. Corrigendum: Functional annotation of human long noncoding RNAs via molecular phenotyping. Genome Res 2020. [DOI: 10.1101/gr.270330.120] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
19
|
Ramilowski JA, Yip CW, Agrawal S, Chang JC, Ciani Y, Kulakovskiy IV, Mendez M, Ooi JLC, Ouyang JF, Parkinson N, Petri A, Roos L, Severin J, Yasuzawa K, Abugessaisa I, Akalin A, Antonov IV, Arner E, Bonetti A, Bono H, Borsari B, Brombacher F, Cameron CJF, Cannistraci CV, Cardenas R, Cardon M, Chang H, Dostie J, Ducoli L, Favorov A, Fort A, Garrido D, Gil N, Gimenez J, Guler R, Handoko L, Harshbarger J, Hasegawa A, Hasegawa Y, Hashimoto K, Hayatsu N, Heutink P, Hirose T, Imada EL, Itoh M, Kaczkowski B, Kanhere A, Kawabata E, Kawaji H, Kawashima T, Kelly ST, Kojima M, Kondo N, Koseki H, Kouno T, Kratz A, Kurowska-Stolarska M, Kwon ATJ, Leek J, Lennartsson A, Lizio M, López-Redondo F, Luginbühl J, Maeda S, Makeev VJ, Marchionni L, Medvedeva YA, Minoda A, Müller F, Muñoz-Aguirre M, Murata M, Nishiyori H, Nitta KR, Noguchi S, Noro Y, Nurtdinov R, Okazaki Y, Orlando V, Paquette D, Parr CJC, Rackham OJL, Rizzu P, Sánchez Martinez DF, Sandelin A, Sanjana P, Semple CAM, Shibayama Y, Sivaraman DM, Suzuki T, Szumowski SC, Tagami M, Taylor MS, Terao C, Thodberg M, Thongjuea S, Tripathi V, Ulitsky I, Verardo R, Vorontsov IE, Yamamoto C, Young RS, Baillie JK, Forrest ARR, Guigó R, Hoffman MM, Hon CC, Kasukawa T, Kauppinen S, Kere J, Lenhard B, Schneider C, Suzuki H, Yagi K, de Hoon MJL, Shin JW, Carninci P. Functional annotation of human long noncoding RNAs via molecular phenotyping. Genome Res 2020; 30:1060-1072. [PMID: 32718982 PMCID: PMC7397864 DOI: 10.1101/gr.254219.119] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 06/24/2020] [Indexed: 12/12/2022]
Abstract
Long noncoding RNAs (lncRNAs) constitute the majority of transcripts in the mammalian genomes, and yet, their functions remain largely unknown. As part of the FANTOM6 project, we systematically knocked down the expression of 285 lncRNAs in human dermal fibroblasts and quantified cellular growth, morphological changes, and transcriptomic responses using Capped Analysis of Gene Expression (CAGE). Antisense oligonucleotides targeting the same lncRNAs exhibited global concordance, and the molecular phenotype, measured by CAGE, recapitulated the observed cellular phenotypes while providing additional insights on the affected genes and pathways. Here, we disseminate the largest-to-date lncRNA knockdown data set with molecular phenotyping (over 1000 CAGE deep-sequencing libraries) for further exploration and highlight functional roles for ZNF213-AS1 and lnc-KHDC3L-2.
Collapse
Affiliation(s)
- Jordan A Ramilowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Chi Wai Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Jen-Chien Chang
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Yari Ciani
- Laboratorio Nazionale Consorzio Interuniversitario Biotecnologie (CIB), Trieste 34127, Italy
| | - Ivan V Kulakovskiy
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia.,Institute of Protein Research, Russian Academy of Sciences, Pushchino 142290, Russia
| | - Mickaël Mendez
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 1A1, Canada
| | | | - John F Ouyang
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore Medical School, Singapore 169857, Singapore
| | - Nick Parkinson
- Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, United Kingdom
| | - Andreas Petri
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, Copenhagen 9220, Denmark
| | - Leonie Roos
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom
| | - Jessica Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Kayoko Yasuzawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Imad Abugessaisa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Altuna Akalin
- Berlin Institute for Medical Systems Biology, Max Delbrük Center for Molecular Medicine in the Helmholtz Association, Berlin 13125, Germany
| | - Ivan V Antonov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, Moscow 117312, Russia
| | - Erik Arner
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Alessandro Bonetti
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Hidemasa Bono
- Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima City 739-0046, Japan
| | - Beatrice Borsari
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia 08003, Spain
| | - Frank Brombacher
- International Centre for Genetic Engineering and Biotechnology (ICGEB), University of Cape Town, Cape Town 7925, South Africa.,Institute of Infectious Diseases and Molecular Medicine (IDM), Department of Pathology, Division of Immunology and South African Medical Research Council (SAMRC) Immunology of Infectious Diseases, Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa
| | - Christopher JF Cameron
- School of Computer Science, McGill University, Montréal, Québec H3G 1Y6, Canada.,Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, Québec H3G 1Y6, Canada.,Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06510, USA
| | - Carlo Vittorio Cannistraci
- Biomedical Cybernetics Group, Biotechnology Center (BIOTEC), Center for Molecular and Cellular Bioengineering (CMCB), Center for Systems Biology Dresden (CSBD), Cluster of Excellence Physics of Life (PoL), Department of Physics, Technische Universität Dresden, Dresden 01062, Germany.,Center for Complex Network Intelligence (CCNI) at the Tsinghua Laboratory of Brain and Intelligence (THBI), Department of Bioengineering, Tsinghua University, Beijing 100084, China
| | - Ryan Cardenas
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | - Melissa Cardon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Howard Chang
- Center for Personal Dynamic Regulome, Stanford University, Stanford, California 94305, USA
| | - Josée Dostie
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, Québec H3G 1Y6, Canada
| | - Luca Ducoli
- Institute of Pharmaceutical Sciences, Swiss Federal Institute of Technology, Zurich 8093, Switzerland
| | - Alexander Favorov
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Department of Oncology, Johns Hopkins University, Baltimore, Maryland 21287, USA
| | - Alexandre Fort
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Diego Garrido
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia 08003, Spain
| | - Noa Gil
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Juliette Gimenez
- Epigenetics and Genome Reprogramming Laboratory, IRCCS Fondazione Santa Lucia, Rome 00179, Italy
| | - Reto Guler
- International Centre for Genetic Engineering and Biotechnology (ICGEB), University of Cape Town, Cape Town 7925, South Africa.,Institute of Infectious Diseases and Molecular Medicine (IDM), Department of Pathology, Division of Immunology and South African Medical Research Council (SAMRC) Immunology of Infectious Diseases, Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa
| | - Lusy Handoko
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Jayson Harshbarger
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Akira Hasegawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Yuki Hasegawa
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Kosuke Hashimoto
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Norihito Hayatsu
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Peter Heutink
- Genome Biology of Neurodegenerative Diseases, German Center for Neurodegenerative Diseases (DZNE), Tübingen 72076, Germany
| | - Tetsuro Hirose
- Graduate School of Frontier Biosciences, Osaka University, Suita 565-0871, Japan
| | - Eddie L Imada
- Department of Oncology, Johns Hopkins University, Baltimore, Maryland 21287, USA
| | - Masayoshi Itoh
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Preventive Medicine and Diagnosis Innovation Program (PMI), Saitama 351-0198, Japan
| | - Bogumil Kaczkowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Aditi Kanhere
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | - Emily Kawabata
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Hideya Kawaji
- RIKEN Preventive Medicine and Diagnosis Innovation Program (PMI), Saitama 351-0198, Japan
| | - Tsugumi Kawashima
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - S Thomas Kelly
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Miki Kojima
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Naoto Kondo
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Haruhiko Koseki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Tsukasa Kouno
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Anton Kratz
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Mariola Kurowska-Stolarska
- Institute of Infection, Immunity, and Inflammation, University of Glasgow, Glasgow, Scotland G12 8QQ, United Kingdom
| | - Andrew Tae Jun Kwon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Jeffrey Leek
- Department of Oncology, Johns Hopkins University, Baltimore, Maryland 21287, USA
| | - Andreas Lennartsson
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge 14157, Sweden
| | - Marina Lizio
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Fernando López-Redondo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Joachim Luginbühl
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Shiori Maeda
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Vsevolod J Makeev
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Luigi Marchionni
- Department of Oncology, Johns Hopkins University, Baltimore, Maryland 21287, USA
| | - Yulia A Medvedeva
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, Moscow 117312, Russia.,Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Aki Minoda
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Ferenc Müller
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | - Manuel Muñoz-Aguirre
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia 08003, Spain
| | - Mitsuyoshi Murata
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Hiromi Nishiyori
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Kazuhiro R Nitta
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Shuhei Noguchi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Yukihiko Noro
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Ramil Nurtdinov
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia 08003, Spain
| | - Yasushi Okazaki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Valerio Orlando
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
| | - Denis Paquette
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, Québec H3G 1Y6, Canada
| | - Callum J C Parr
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Owen J L Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore Medical School, Singapore 169857, Singapore
| | - Patrizia Rizzu
- Genome Biology of Neurodegenerative Diseases, German Center for Neurodegenerative Diseases (DZNE), Tübingen 72076, Germany
| | | | - Albin Sandelin
- Department of Biology and BRIC, University of Copenhagen, Denmark, Copenhagen N DK2200, Denmark
| | - Pillay Sanjana
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | - Colin A M Semple
- MRC Human Genetics Unit, University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Youtaro Shibayama
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Divya M Sivaraman
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Takahiro Suzuki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | | | - Michihira Tagami
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Martin S Taylor
- MRC Human Genetics Unit, University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Chikashi Terao
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan
| | - Malte Thodberg
- Department of Biology and BRIC, University of Copenhagen, Denmark, Copenhagen N DK2200, Denmark
| | - Supat Thongjuea
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Vidisha Tripathi
- National Centre for Cell Science, Pune, Maharashtra 411007, India
| | - Igor Ulitsky
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Roberto Verardo
- Laboratorio Nazionale Consorzio Interuniversitario Biotecnologie (CIB), Trieste 34127, Italy
| | - Ilya E Vorontsov
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia
| | - Chinatsu Yamamoto
- RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Robert S Young
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh EH8 9AG, United Kingdom
| | - J Kenneth Baillie
- Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, United Kingdom
| | - Alistair R R Forrest
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan.,Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, The University of Western Australia, Nedlands, Perth, Western Australia 6009, Australia
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia 08003, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Catalonia 08002, Spain
| | | | - Chung Chau Hon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Takeya Kasukawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Sakari Kauppinen
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, Copenhagen 9220, Denmark
| | - Juha Kere
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge 14157, Sweden.,Stem Cells and Metabolism Research Program, University of Helsinki and Folkhälsan Research Center, 00290 Helsinki, Finland
| | - Boris Lenhard
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom.,Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen N-5008, Norway
| | - Claudio Schneider
- Laboratorio Nazionale Consorzio Interuniversitario Biotecnologie (CIB), Trieste 34127, Italy.,Department of Medicine and Consorzio Interuniversitario Biotecnologie p.zle Kolbe 1 University of Udine, Udine 33100, Italy
| | - Harukazu Suzuki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Ken Yagi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Michiel J L de Hoon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Jay W Shin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan.,RIKEN Center for Life Science Technologies, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
20
|
Zitnik M, Nguyen F, Wang B, Leskovec J, Goldenberg A, Hoffman MM. Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities. Inf Fusion 2019; 50:71-91. [PMID: 30467459 PMCID: PMC6242341 DOI: 10.1016/j.inffus.2018.09.012] [Citation(s) in RCA: 210] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
New technologies have enabled the investigation of biology and human health at an unprecedented scale and in multiple dimensions. These dimensions include myriad properties describing genome, epigenome, transcriptome, microbiome, phenotype, and lifestyle. No single data type, however, can capture the complexity of all the factors relevant to understanding a phenomenon such as a disease. Integrative methods that combine data from multiple technologies have thus emerged as critical statistical and computational approaches. The key challenge in developing such approaches is the identification of effective models to provide a comprehensive and relevant systems view. An ideal method can answer a biological or medical question, identifying important features and predicting outcomes, by harnessing heterogeneous data across several dimensions of biological variation. In this Review, we describe the principles of data integration and discuss current methods and available implementations. We provide examples of successful data integration in biology and medicine. Finally, we discuss current challenges in biomedical integrative methods and our perspective on the future development of the field.
Collapse
Affiliation(s)
- Marinka Zitnik
- Department of Computer Science, Stanford University,
Stanford, CA, USA
| | - Francis Nguyen
- Department of Medical Biophysics, University of Toronto,
Toronto, ON, Canada
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Bo Wang
- Hikvision Research Institute, Santa Clara, CA, USA
| | - Jure Leskovec
- Department of Computer Science, Stanford University,
Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Anna Goldenberg
- Genetics & Genome Biology, SickKids Research Institute,
Toronto, ON, Canada
- Department of Computer Science, University of Toronto,
Toronto, ON, Canada
- Vector Institute, Toronto, ON, Canada
| | - Michael M. Hoffman
- Department of Medical Biophysics, University of Toronto,
Toronto, ON, Canada
- Princess Margaret Cancer Centre, Toronto, ON, Canada
- Department of Computer Science, University of Toronto,
Toronto, ON, Canada
- Vector Institute, Toronto, ON, Canada
| |
Collapse
|
21
|
Libbrecht MW, Rodriguez OL, Weng Z, Bilmes JA, Hoffman MM, Noble WS. A unified encyclopedia of human functional DNA elements through fully automated annotation of 164 human cell types. Genome Biol 2019; 20:180. [PMID: 31462275 PMCID: PMC6714098 DOI: 10.1186/s13059-019-1784-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 08/05/2019] [Indexed: 12/31/2022] Open
Abstract
Semi-automated genome annotation methods such as Segway take as input a set of genome-wide measurements such as of histone modification or DNA accessibility and output an annotation of genomic activity in the target cell type. Here we present annotations of 164 human cell types using 1615 data sets. To produce these annotations, we automated the label interpretation step to produce a fully automated annotation strategy. Using these annotations, we developed a measure of the importance of each genomic position called the “conservation-associated activity score.” We further combined all annotations into a single, cell type-agnostic encyclopedia that catalogs all human regulatory elements.
Collapse
Affiliation(s)
| | - Oscar L Rodriguez
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Boston, USA
| | - Jeffrey A Bilmes
- Department of Electrical Engineering, University of Washington, Seattle, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, Toronto, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Canada.,Department of Computer Science, University of Toronto, Toronto, Canada
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington, Seattle, USA. .,Department of Computer Science, University of Washington, Seattle, USA.
| |
Collapse
|
22
|
Karimzadeh M, Ernst C, Kundaje A, Hoffman MM. Umap and Bismap: quantifying genome and methylome mappability. Nucleic Acids Res 2019; 46:e120. [PMID: 30169659 PMCID: PMC6237805 DOI: 10.1093/nar/gky677] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 07/22/2018] [Indexed: 11/14/2022] Open
Abstract
Short-read sequencing enables assessment of genetic and biochemical traits of individual genomic regions, such as the location of genetic variation, protein binding and chemical modifications. Every region in a genome assembly has a property called 'mappability', which measures the extent to which it can be uniquely mapped by sequence reads. In regions of lower mappability, estimates of genomic and epigenomic characteristics from sequencing assays are less reliable. These regions have increased susceptibility to spurious mapping from reads from other regions of the genome with sequencing errors or unexpected genetic variation. Bisulfite sequencing approaches used to identify DNA methylation exacerbate these problems by introducing large numbers of reads that map to multiple regions. Both to correct assumptions of uniformity in downstream analysis and to identify regions where the analysis is less reliable, it is necessary to know the mappability of both ordinary and bisulfite-converted genomes. We introduce the Umap software for identifying uniquely mappable regions of any genome. Its Bismap extension identifies mappability of the bisulfite-converted genome. A Umap and Bismap track hub for human genome assemblies GRCh37/hg19 and GRCh38/hg38, and mouse assemblies GRCm37/mm9 and GRCm38/mm10 is available at https://bismap.hoffmanlab.org for use with genome browsers.
Collapse
Affiliation(s)
- Mehran Karimzadeh
- Princess Margaret Cancer Centre, M5G 1L7, Toronto, ON, Canada.,Department of Medical Biophysics, M5G 1L7, University of Toronto, Toronto, ON, Canada.,Vector Institute, M5G 1M1, Toronto, ON, Canada
| | - Carl Ernst
- Department of Human Genetics, McGill University, H3A 0C7, Montreal, QC, Canada
| | - Anshul Kundaje
- Department of Genetics, Stanford University, 94305-9025, Stanford, CA, USA.,Department of Computer Science, Stanford University, 94305-5120, Stanford, CA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, M5G 1L7, Toronto, ON, Canada.,Department of Medical Biophysics, M5G 1L7, University of Toronto, Toronto, ON, Canada.,Vector Institute, M5G 1M1, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, M5S 2E4, Toronto, ON, Canada
| |
Collapse
|
23
|
Abstract
Covalent DNA modifications, such as 5-methylcytosine (5mC), are increasingly the focus of numerous research programs. In eukaryotes, both 5mC and 5-hydroxymethylcytosine (5hmC) are now recognized as stable epigenetic marks, with diverse functions. Bacteria, archaea, and viruses contain various other modified DNA nucleobases. Numerous databases describe RNA and histone modifications, but no database specifically catalogues DNA modifications, despite their broad importance in epigenetic regulation. To address this need, we have developed DNAmod: the DNA modification database. DNAmod is an open-source database ( https://dnamod.hoffmanlab.org ) that catalogues DNA modifications and provides a single source to learn about their properties. DNAmod provides a web interface to easily browse and search through these modifications. The database annotates the chemical properties and structures of all curated modified DNA bases, and a much larger list of candidate chemical entities. DNAmod includes manual annotations of available sequencing methods, descriptions of their occurrence in nature, and provides existing and suggested nomenclature. DNAmod enables researchers to rapidly review previous work, select mapping techniques, and track recent developments concerning modified bases of interest.
Collapse
Affiliation(s)
- Ankur Jai Sood
- Department of Medical Biophysics, University of Toronto, Princess Margaret Cancer Research Tower 15-701, 101 College Street, Toronto, ON, M5G 1L7, Canada.,Princess Margaret Cancer Centre, Princess Margaret Cancer Research Tower 11-311, 101 College Street, Toronto, ON, M5G 1L7, Canada
| | - Coby Viner
- Princess Margaret Cancer Centre, Princess Margaret Cancer Research Tower 11-311, 101 College Street, Toronto, ON, M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Sandford Fleming Building 3302, 10 King's College Road, Toronto, ON, M5S 3G4, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Princess Margaret Cancer Research Tower 15-701, 101 College Street, Toronto, ON, M5G 1L7, Canada. .,Princess Margaret Cancer Centre, Princess Margaret Cancer Research Tower 11-311, 101 College Street, Toronto, ON, M5G 1L7, Canada. .,Department of Computer Science, University of Toronto, Sandford Fleming Building 3302, 10 King's College Road, Toronto, ON, M5S 3G4, Canada. .,Vector Institute, MaRS Centre, West Tower, Suite 710, 661 University Avenue, Toronto, ON, M5G 1M1, Canada.
| |
Collapse
|
24
|
Abstract
Investing in documenting your bioinformatics software well can increase its impact and save your time. To maximize the effectiveness of your documentation, we suggest following a few guidelines we propose here. We recommend providing multiple avenues for users to use your research software, including a navigable HTML interface with a quick start, useful help messages with detailed explanation and thorough examples for each feature of your software. By following these guidelines, you can assure that your hard work maximally benefits yourself and others.
Collapse
Affiliation(s)
- Mehran Karimzadeh
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
| |
Collapse
|
25
|
Chan RCW, Libbrecht MW, Roberts EG, Bilmes JA, Noble WS, Hoffman MM. Segway 2.0: Gaussian mixture models and minibatch training. Bioinformatics 2018; 34:669-671. [PMID: 29028889 PMCID: PMC5860603 DOI: 10.1093/bioinformatics/btx603] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 09/19/2017] [Indexed: 01/08/2023] Open
Abstract
Summary Segway performs semi-automated genome annotation, discovering joint patterns across multiple genomic signal datasets. We discuss a major new version of Segway and highlight its ability to model data with substantially greater accuracy. Major enhancements in Segway 2.0 include the ability to model data with a mixture of Gaussians, enabling capture of arbitrarily complex signal distributions, and minibatch training, leading to better learned parameters. Availability and implementation Segway and its source code are freely available for download at http://segway.hoffmanlab.org. We have made available scripts (https://doi.org/10.5281/zenodo.802939) and datasets (https://doi.org/10.5281/zenodo.802906) for this paper's analysis. Contact michael.hoffman@utoronto.ca. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rachel C W Chan
- Princess Margaret Cancer Centre, Toronto, ON M5G 1L7, Canada.,Engineering Physics Program, University of British Columbia, Vancouver, BC V6T 1Z1, Canada
| | | | - Eric G Roberts
- Princess Margaret Cancer Centre, Toronto, ON M5G 1L7, Canada
| | - Jeffrey A Bilmes
- Department of Computer Science and Engineering.,Department of Electrical Engineering
| | - William Stafford Noble
- Department of Computer Science and Engineering.,Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, Toronto, ON M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5S 3G4, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada
| |
Collapse
|
26
|
Shen SY, Singhania R, Fehringer G, Chakravarthy A, Roehrl MHA, Chadwick D, Zuzarte PC, Borgida A, Wang TT, Li T, Kis O, Zhao Z, Spreafico A, Medina TDS, Wang Y, Roulois D, Ettayebi I, Chen Z, Chow S, Murphy T, Arruda A, O'Kane GM, Liu J, Mansour M, McPherson JD, O'Brien C, Leighl N, Bedard PL, Fleshner N, Liu G, Minden MD, Gallinger S, Goldenberg A, Pugh TJ, Hoffman MM, Bratman SV, Hung RJ, De Carvalho DD. Sensitive tumour detection and classification using plasma cell-free DNA methylomes. Nature 2018; 563:579-583. [PMID: 30429608 DOI: 10.1038/s41586-018-0703-0] [Citation(s) in RCA: 494] [Impact Index Per Article: 82.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 09/25/2018] [Indexed: 12/16/2022]
Abstract
The use of liquid biopsies for cancer detection and management is rapidly gaining prominence1. Current methods for the detection of circulating tumour DNA involve sequencing somatic mutations using cell-free DNA, but the sensitivity of these methods may be low among patients with early-stage cancer given the limited number of recurrent mutations2-5. By contrast, large-scale epigenetic alterations-which are tissue- and cancer-type specific-are not similarly constrained6 and therefore potentially have greater ability to detect and classify cancers in patients with early-stage disease. Here we develop a sensitive, immunoprecipitation-based protocol to analyse the methylome of small quantities of circulating cell-free DNA, and demonstrate the ability to detect large-scale DNA methylation changes that are enriched for tumour-specific patterns. We also demonstrate robust performance in cancer detection and classification across an extensive collection of plasma samples from several tumour types. This work sets the stage to establish biomarkers for the minimally invasive detection, interception and classification of early-stage cancers based on plasma cell-free DNA methylation patterns.
Collapse
Affiliation(s)
- Shu Yi Shen
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Rajat Singhania
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Gordon Fehringer
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Ontario, Canada
| | - Ankur Chakravarthy
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Michael H A Roehrl
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Memorial Sloan Kettering Cancer Center, New York, NY, USA.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Dianne Chadwick
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Philip C Zuzarte
- Genome Technologies, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Ayelet Borgida
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Ontario, Canada
| | - Ting Ting Wang
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Tiantian Li
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Olena Kis
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Zhen Zhao
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Anna Spreafico
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Tiago da Silva Medina
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Yadon Wang
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - David Roulois
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,UMR_S 1236, Univ Rennes 1, Inserm, Etablissement Français du sang Bretagne, Rennes, France
| | - Ilias Ettayebi
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Zhuo Chen
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Signy Chow
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Tracy Murphy
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Andrea Arruda
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Grainne M O'Kane
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Jessica Liu
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Mark Mansour
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - John D McPherson
- Department of Biochemistry and Molecular Medicine, UC Davis Comprehensive Cancer Center, Sacramento, CA, USA
| | - Catherine O'Brien
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Natasha Leighl
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Philippe L Bedard
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Neil Fleshner
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Geoffrey Liu
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Division of Epidemiology, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Mark D Minden
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Steven Gallinger
- Fred Litwin Centre for Cancer Genetics, Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada.,Department of Surgery, Toronto General Hospital, Toronto, Ontario, Canada
| | - Anna Goldenberg
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Trevor J Pugh
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Scott V Bratman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Rayjean J Hung
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Ontario, Canada. .,Division of Epidemiology, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada.
| | - Daniel D De Carvalho
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada. .,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
27
|
Unzue A, Cribiú R, Hoffman MM, Knehans T, Lafleur K, Caflisch A, Nevado C. Iriomoteolides: novel chemical tools to study actin dynamics. Chem Sci 2018; 9:3793-3802. [PMID: 29780512 PMCID: PMC5939837 DOI: 10.1039/c7sc04286h] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 03/10/2018] [Indexed: 01/25/2023] Open
Abstract
Despite its promising biological profile, the cellular targets of iriomoteolide-3a, a novel 15-membered macrolide isolated from Amphidinium sp., have remained unknown. A small library of non-natural iriomoteolide-3a analogues is presented here as a result of a novel, highly convergent, catalysis-based scaffold-diversification campaign, which revealed the suitable sites for chemical editing in the original core. We provide compelling experimental evidence for actin as one of iriomoteolides' primary cellular targets, establishing the ability of these secondary metabolites to inhibit cell migration, induce severe morphological changes in cells and cause a reversible cytoplasmic retraction and reduction of F-actin fibers in a time and dose dependent manner. These results are interpreted in light of the ability of iriomoteolides to stabilize F-actin filaments. Molecular dynamics simulations provide evidence for iriomoteolide-3a binding to the barbed end of G-actin. These results showcase iriomoteolides as novel and easily tunable chemical probes for the in vitro study of actin dynamics in the context of cell motility processes including cell invasion and division.
Collapse
Affiliation(s)
- A Unzue
- Department of Chemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - R Cribiú
- Department of Chemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - M M Hoffman
- Department of Chemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - T Knehans
- Department of Biochemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - K Lafleur
- Department of Chemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - A Caflisch
- Department of Biochemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| | - C Nevado
- Department of Chemistry , University of Zürich , Winterthurerstrasse 190 , CH-8057 , Zürich , Switzerland .
| |
Collapse
|
28
|
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM, Xie W, Rosen GL, Lengerich BJ, Israeli J, Lanchantin J, Woloszynek S, Carpenter AE, Shrikumar A, Xu J, Cofer EM, Lavender CA, Turaga SC, Alexandari AM, Lu Z, Harris DJ, DeCaprio D, Qi Y, Kundaje A, Peng Y, Wiley LK, Segler MHS, Boca SM, Swamidass SJ, Huang A, Gitter A, Greene CS. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 2018; 15:20170387. [PMID: 29618526 PMCID: PMC5938574 DOI: 10.1098/rsif.2017.0387] [Citation(s) in RCA: 764] [Impact Index Per Article: 127.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 03/07/2018] [Indexed: 11/12/2022] Open
Abstract
Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well suited to solve problems of these fields. We examine applications of deep learning to a variety of biomedical problems-patient classification, fundamental biological processes and treatment of patients-and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges. Following from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Even though improvements over previous baselines have been modest in general, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation. Though progress has been made linking a specific neural network's prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge. Furthermore, the limited amount of labelled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning enabling changes at both bench and bedside with the potential to transform several areas of biology and medicine.
Collapse
Affiliation(s)
- Travers Ching
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI, USA
| | - Daniel S Himmelstein
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Brett K Beaulieu-Jones
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Alexandr A Kalinin
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Gregory P Way
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Enrico Ferrero
- Computational Biology and Stats, Target Sciences, GlaxoSmithKline, Stevenage, UK
| | | | - Michael Zietz
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Wei Xie
- Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
| | - Gail L Rosen
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Benjamin J Lengerich
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Johnny Israeli
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Jack Lanchantin
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Stephen Woloszynek
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago, IL, USA
| | - Evan M Cofer
- Department of Computer Science, Trinity University, San Antonio, TX, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Christopher A Lavender
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| | - Srinivas C Turaga
- Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, USA
| | - Amr M Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - David J Harris
- Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, USA
| | | | - Yanjun Qi
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Yifan Peng
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Laura K Wiley
- Division of Biomedical Informatics and Personalized Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Marwin H S Segler
- Institute of Organic Chemistry, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Simina M Boca
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
| | - S Joshua Swamidass
- Department of Pathology and Immunology, Washington University in Saint Louis, St Louis, MO, USA
| | - Austin Huang
- Department of Medicine, Brown University, Providence, RI, USA
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
| | - Casey S Greene
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
29
|
Abstract
A report on the Genome Informatics conference, held at the Wellcome Genome Campus Conference Centre, Hinxton, United Kingdom, 19-22 September 2016.
Collapse
Affiliation(s)
| | - Michael M. Hoffman
- Princess Margaret Cancer Centre, Toronto, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
| |
Collapse
|
30
|
Franke B, Plante J, Roscher R, Lee EA, Smyth C, Hatefi A, Chen F, Gil E, Schwing A, Selvitella A, Hoffman MM, Grosse R, Hendricks D, Reid N. Statistical Inference, Learning and Models in Big Data. Int Stat Rev 2016. [DOI: 10.1111/insr.12176] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
| | | | | | | | | | | | - Fuqi Chen
- Western University London Ontario Canada
| | - Einat Gil
- University of Toronto Toronto Ontario Canada
| | | | | | | | | | | | - Nancy Reid
- University of Toronto Toronto Ontario Canada
| |
Collapse
|
31
|
Lundberg SM, Tu WB, Raught B, Penn LZ, Hoffman MM, Lee SI. ChromNet: Learning the human chromatin network from all ENCODE ChIP-seq data. Genome Biol 2016; 17:82. [PMID: 27139377 PMCID: PMC4852466 DOI: 10.1186/s13059-016-0925-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/15/2016] [Indexed: 01/12/2023] Open
Abstract
A cell's epigenome arises from interactions among regulatory factors-transcription factors and histone modifications-co-localized at particular genomic regions. We developed a novel statistical method, ChromNet, to infer a network of these interactions, the chromatin network, by inferring conditional-dependence relationships among a large number of ChIP-seq data sets. We applied ChromNet to all available 1451 ChIP-seq data sets from the ENCODE Project, and showed that ChromNet revealed previously known physical interactions better than alternative approaches. We experimentally validated one of the previously unreported interactions, MYC-HCFC1. An interactive visualization tool is available at http://chromnet.cs.washington.edu.
Collapse
Affiliation(s)
- Scott M Lundberg
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - William B Tu
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Brian Raught
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Linda Z Penn
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Su-In Lee
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA. .,Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| |
Collapse
|
32
|
Ginther OJ, Siddiqui MAR, Baldrighi JM, Hoffman MM. Stimulation of regressing subordinate follicles of wave 2 with a gonadotropin product in heifers. Domest Anim Endocrinol 2016; 55:46-50. [PMID: 26773367 DOI: 10.1016/j.domaniend.2015.11.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Revised: 11/19/2015] [Accepted: 11/21/2015] [Indexed: 10/22/2022]
Abstract
The recovery of regressing wave-2 subordinate follicles was studied by treating heifers with a gonadotropin product that had about 84% and 16% of follicle-stimulating hormone and luteinizing hormone activity, respectively. A treated group (n = 8) received a single dose of 50 mg (2.5 mL) of the gonadotropin product, and a control group (n = 8) received 2.5 mL of saline vehicle. The group assignment of heifers was not known to the ultrasonographer who tracked the follicles and measured follicle diameters. Follicle measurements began on the day of expected follicle deviation in wave 2 (largest follicle closest to 8.5 mm), and treatment (hour 0) was given on Day 13.4 ± 0.2 (Day 0 = ovulation) when the dominant follicles of waves 1 and 2 were 14.1 ± 0.3 mm and 10.7 ± 0.1 mm, respectively. Subordinate follicles of wave 2 that had regressed to a 3-mm category (3.0-3.9 mm) or 4-mm category by hour 0 decreased in diameter for at least 48 h before hour 0, whereas follicles that were in the 5-mm or 6-mm categories at hour 0 did not change significantly in diameter during the previous 48 h. About 55% of the follicles that had regressed to the 3-mm and 4-mm categories at hour 0% and 78% of the follicles in the 5-mm and 6-mm categories increased in diameter after gonadotropin treatment, whereas follicles in the control group continued to decrease (regress) in diameter. The follicles for each of the 4 diameter categories were greater (P < 0.05) in diameter 9 h after treatment in the treated group than in the control group. The dominant follicle of wave 1 and the largest subordinate follicle of wave 2 in the treated group also increased in diameter so that diameter was greater (P < 0.05) than in the controls at hour 9. The results demonstrated that subordinate follicles of wave 2 that had decreased in diameter (regressed) for at least 48 h retained the capability to recover as indicated by a diameter increase when exposed to a gonadotropin product.
Collapse
Affiliation(s)
- O J Ginther
- Eutheria Foundation, Cross Plains, WI 53528, USA; Department of Pathobiological Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, 1656 Linden Dr, Madison, WI 53706, USA.
| | - M A R Siddiqui
- Eutheria Foundation, Cross Plains, WI 53528, USA; Department of Pathobiological Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, 1656 Linden Dr, Madison, WI 53706, USA
| | - J M Baldrighi
- Eutheria Foundation, Cross Plains, WI 53528, USA; Department of Pathobiological Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, 1656 Linden Dr, Madison, WI 53706, USA
| | - M M Hoffman
- Eutheria Foundation, Cross Plains, WI 53528, USA; Department of Pathobiological Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, 1656 Linden Dr, Madison, WI 53706, USA
| |
Collapse
|
33
|
Viner C, Hoffman MM. Determining the epigenome using DNA alone. Nat Methods 2015; 12:191-2. [DOI: 10.1038/nmeth.3291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
34
|
Libbrecht MW, Ay F, Hoffman MM, Gilbert DM, Bilmes JA, Noble WS. Joint annotation of chromatin state and chromatin conformation reveals relationships among domain types and identifies domains of cell-type-specific expression. Genome Res 2015; 25:544-57. [PMID: 25677182 PMCID: PMC4381526 DOI: 10.1101/gr.184341.114] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2014] [Accepted: 02/06/2015] [Indexed: 11/24/2022]
Abstract
The genomic neighborhood of a gene influences its activity, a behavior that is attributable in part to domain-scale regulation. Previous genomic studies have identified many types of regulatory domains. However, due to the difficulty of integrating genomics data sets, the relationships among these domain types are poorly understood. Semi-automated genome annotation (SAGA) algorithms facilitate human interpretation of heterogeneous collections of genomics data by simultaneously partitioning the human genome and assigning labels to the resulting genomic segments. However, existing SAGA methods cannot integrate inherently pairwise chromatin conformation data. We developed a new computational method, called graph-based regularization (GBR), for expressing a pairwise prior that encourages certain pairs of genomic loci to receive the same label in a genome annotation. We used GBR to exploit chromatin conformation information during genome annotation by encouraging positions that are close in 3D to occupy the same type of domain. Using this approach, we produced a model of chromatin domains in eight human cell types, thereby revealing the relationships among known domain types. Through this model, we identified clusters of tightly regulated genes expressed in only a small number of cell types, which we term “specific expression domains.” We found that domain boundaries marked by promoters and CTCF motifs are consistent between cell types even when domain activity changes. Finally, we showed that GBR can be used to transfer information from well-studied cell types to less well-characterized cell types during genome annotation, making it possible to produce high-quality annotations of the hundreds of cell types with limited available data.
Collapse
Affiliation(s)
- Maxwell W Libbrecht
- Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA
| | - Ferhat Ay
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University of Toronto, ON M5G 1L7, Canada Department of Medical Biophysics, University of Toronto, ON M5G 1L7, Canada
| | - David M Gilbert
- Department of Biological Science, The Florida State University, Tallahassee, Florida 32304, USA
| | - Jeffrey A Bilmes
- Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA
| | - William Stafford Noble
- Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
35
|
Church DM, Schneider VA, Steinberg KM, Schatz MC, Quinlan AR, Chin CS, Kitts PA, Aken B, Marth GT, Hoffman MM, Herrero J, Mendoza MLZ, Durbin R, Flicek P. Extending reference assembly models. Genome Biol 2015; 16:13. [PMID: 25651527 PMCID: PMC4305238 DOI: 10.1186/s13059-015-0587-3] [Citation(s) in RCA: 94] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools and updated data reporting formats are also required.
Collapse
Affiliation(s)
| | - Valerie A Schneider
- />National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894 USA
| | - Karyn Meltz Steinberg
- />The Genome Institute, Washington University School of Medicine, St Louis, MO 63108 USA
| | - Michael C Schatz
- />Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
| | - Aaron R Quinlan
- />Department of Public Health Sciences and Center for Public Health Genomics, Charlottesville, VA 22908 USA
| | | | - Paul A Kitts
- />National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894 USA
| | - Bronwen Aken
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK
| | - Gabor T Marth
- />Department of Human Genetics & USTAR Center for Genetic Discovery, University of Utah School of Medicine, Salt Lake City, UT 84132 USA
| | - Michael M Hoffman
- />Princess Margaret Cancer Centre, Toronto, ON M5G 1L7 Canada
- />Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7 Canada
- />Department of Computer Science, University of Toronto, Toronto, ON M5S 3G4 Canada
| | - Javier Herrero
- />Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London, WC1E 6BT UK
| | | | - Richard Durbin
- />Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK
| | - Paul Flicek
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK
| |
Collapse
|
36
|
Ginther OJ, Bashir ST, Hoffman MM, Beg MA. Endocrinology of number of follicular waves per estrous cycle and contralateral or ipsilateral relationship between corpus luteum and preovulatory follicle in heifers. Domest Anim Endocrinol 2013; 45:64-71. [PMID: 23806855 DOI: 10.1016/j.domaniend.2013.05.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Revised: 05/13/2013] [Accepted: 05/15/2013] [Indexed: 11/28/2022]
Abstract
A 3-d extension of the luteal phase occurs in interovulatory intervals (IOIs) with a contralateral relationship between the corpus luteum (CL) and preovulatory follicle with 3 follicular waves (Contra-3W group). Concentrations of FSH, progesterone, LH, and estradiol-17β for the ipsilateral versus contralateral CL and/or follicle relationship and 2 versus 3 waves per IOI were studied in 14 heifers. Follicular waves and FSH surges were designated 1, 2, or 3, according to order of occurrence in the IOI. The day (day 0 = ovulation) of the FSH peak in surge 2 occurred earlier (P < 0.02) in 3-wave IOIs (day 6.3 ± 0.5) than in 2-wave IOIs (day 8.5 ± 0.5). Mean FSH was higher in 3-wave than in 2-wave IOI on 82% of the days in the IOI. Repeatability or individuality in FSH concentration was indicated by a correlation (r = 0.54, P < 0.04) in FSH concentrations between ovulations at the beginning and at the end of the IOI. Concentrations of LH and estradiol increased (P < 0.05) near the beginning of the luteolytic period in 2-wave IOI regardless of the CL and/or follicle relationship. In the Contra-3W group, LH and estradiol remained at basal concentrations concurrently with FSH surge 3 and extension of the luteal phase. The hypotheses were supported that FSH surge 2 occurs earlier in 3-wave IOIs than in 2-wave IOIs and that the development of 3-wave IOIs occurs in individuals with greater FSH concentrations. Extension of the luteal phase in the Contra-3W group was temporally associated with lower concentrations of LH and estradiol.
Collapse
|
37
|
Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P, Chen Y, DeSalvo G, Epstein C, Fisher-Aylor KI, Euskirchen G, Gerstein M, Gertz J, Hartemink AJ, Hoffman MM, Iyer VR, Jung YL, Karmakar S, Kellis M, Kharchenko PV, Li Q, Liu T, Liu XS, Ma L, Milosavljevic A, Myers RM, Park PJ, Pazin MJ, Perry MD, Raha D, Reddy TE, Rozowsky J, Shoresh N, Sidow A, Slattery M, Stamatoyannopoulos JA, Tolstorukov MY, White KP, Xi S, Farnham PJ, Lieb JD, Wold BJ, Snyder M. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res 2013; 22:1813-31. [PMID: 22955991 PMCID: PMC3431496 DOI: 10.1101/gr.136184.111] [Citation(s) in RCA: 1280] [Impact Index Per Article: 116.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals.
Collapse
Affiliation(s)
- Stephen G Landt
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Hoffman MM, Ernst J, Wilder SP, Kundaje A, Harris RS, Libbrecht M, Giardine B, Ellenbogen PM, Bilmes JA, Birney E, Hardison RC, Dunham I, Kellis M, Noble WS. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res 2012; 41:827-41. [PMID: 23221638 PMCID: PMC3553955 DOI: 10.1093/nar/gks1284] [Citation(s) in RCA: 357] [Impact Index Per Article: 29.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The ENCODE Project has generated a wealth of experimental information mapping diverse chromatin properties in several human cell lines. Although each such data track is independently informative toward the annotation of regulatory elements, their interrelations contain much richer information for the systematic annotation of regulatory elements. To uncover these interrelations and to generate an interpretable summary of the massive datasets of the ENCODE Project, we apply unsupervised learning methodologies, converting dozens of chromatin datasets into discrete annotation maps of regulatory regions and other chromatin elements across the human genome. These methods rediscover and summarize diverse aspects of chromatin architecture, elucidate the interplay between chromatin activity and RNA transcription, and reveal that a large proportion of the genome lies in a quiescent state, even across multiple cell types. The resulting annotation of non-coding regulatory elements correlate strongly with mammalian evolutionary constraint, and provide an unbiased approach for evaluating metrics of evolutionary constraint in human. Lastly, we use the regulatory annotations to revisit previously uncharacterized disease-associated loci, resulting in focused, testable hypotheses through the lens of the chromatin landscape.
Collapse
Affiliation(s)
- Michael M Hoffman
- Department of Genome Sciences, University of Washington, 3720 15th Ave NE, Seattle, WA 98195-5065, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Hoffman MM, Buske OJ, Wang J, Weng Z, Bilmes JA, Noble WS. Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat Methods 2012; 9:473-6. [PMID: 22426492 DOI: 10.1038/nmeth.1937] [Citation(s) in RCA: 383] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 02/14/2012] [Indexed: 01/24/2023]
Abstract
We trained Segway, a dynamic Bayesian network method, simultaneously on chromatin data from multiple experiments, including positions of histone modifications, transcription-factor binding and open chromatin, all derived from a human chronic myeloid leukemia cell line. In an unsupervised fashion, we identified patterns associated with transcription start sites, gene ends, enhancers, transcriptional regulator CTCF-binding regions and repressed regions. Software and genome browser tracks are at http://noble.gs.washington.edu/proj/segway/.
Collapse
Affiliation(s)
- Michael M Hoffman
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | | | | | |
Collapse
|
40
|
Buske OJ, Hoffman MM, Ponts N, Le Roch KG, Noble WS. Exploratory analysis of genomic segmentations with Segtools. BMC Bioinformatics 2011; 12:415. [PMID: 22029426 PMCID: PMC3224787 DOI: 10.1186/1471-2105-12-415] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2011] [Accepted: 10/26/2011] [Indexed: 11/23/2022] Open
Abstract
Background As genome-wide experiments and annotations become more prevalent, researchers increasingly require tools to help interpret data at this scale. Many functional genomics experiments involve partitioning the genome into labeled segments, such that segments sharing the same label exhibit one or more biochemical or functional traits. For example, a collection of ChlP-seq experiments yields a compendium of peaks, each labeled with one or more associated DNA-binding proteins. Similarly, manually or automatically generated annotations of functional genomic elements, including cis-regulatory modules and protein-coding or RNA genes, can also be summarized as genomic segmentations. Results We present a software toolkit called Segtools that simplifies and automates the exploration of genomic segmentations. The software operates as a series of interacting tools, each of which provides one mode of summarization. These various tools can be pipelined and summarized in a single HTML page. We describe the Segtools toolkit and demonstrate its use in interpreting a collection of human histone modification data sets and Plasmodium falciparum local chromatin structure data sets. Conclusions Segtools provides a convenient, powerful means of interpreting a genomic segmentation.
Collapse
Affiliation(s)
- Orion J Buske
- Department of Genome Sciences, University of Washington, PO Box 355065, Seattle, WA 98195-5065, USA
| | | | | | | | | |
Collapse
|
41
|
Chen X, Hoffman MM, Bilmes JA, Hesselberth JR, Noble WS. A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data. ACTA ACUST UNITED AC 2010; 26:i334-42. [PMID: 20529925 PMCID: PMC2881360 DOI: 10.1093/bioinformatics/btq175] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Motivation: A global map of transcription factor binding sites (TFBSs) is critical to understanding gene regulation and genome function. DNaseI digestion of chromatin coupled with massively parallel sequencing (digital genomic footprinting) enables the identification of protein-binding footprints with high resolution on a genome-wide scale. However, accurately inferring the locations of these footprints remains a challenging computational problem. Results: We present a dynamic Bayesian network-based approach for the identification and assignment of statistical confidence estimates to protein-binding footprints from digital genomic footprinting data. The method, DBFP, allows footprints to be identified in a probabilistic framework and outperforms our previously described algorithm in terms of precision at a fixed recall. Applied to a digital footprinting data set from Saccharomyces cerevisiae, DBFP identifies 4679 statistically significant footprints within intergenic regions. These footprints are mainly located near transcription start sites and are strongly enriched for known TFBSs. Footprints containing no known motif are preferentially located proximal to other footprints, consistent with cooperative binding of these footprints. DBFP also identifies a set of statistically significant footprints in the yeast coding regions. Many of these footprints coincide with the boundaries of antisense transcripts, and the most significant footprints are enriched for binding sites of the chromatin-associated factors Abf1 and Rap1. Contact:jay.hesselberth@ucdenver.edu; william-noble@u.washington.edu Supplementary information:Supplementary material is available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaoyu Chen
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | | | | | | | | |
Collapse
|
42
|
Abstract
Summary: We present a format for efficient storage of multiple tracks of numeric data anchored to a genome. The format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint. We have also developed utilities to load data into this format. We show that retrieving data from this format is more than 2900 times faster than a naive approach using wiggle files. Availability and Implementation: Reference implementation in Python and C components available at http://noble.gs.washington.edu/proj/genomedata/ under the GNU General Public License. Contact:william-noble@uw.edu
Collapse
Affiliation(s)
- Michael M Hoffman
- Department of Genome Sciences, University of Washington, PO Box 355065, Seattle, WA 98195-5065, USA
| | | | | |
Collapse
|
43
|
Abstract
We have produced an evolutionary model for promoters, analogous to the commonly used synonymous/nonsynonymous mutation models for protein-coding sequences. Although our model, called Sunflower, relies on some simple assumptions, it captures enough of the biology of transcription factor action to show clear correlation with other biological features. Sunflower predicts a binding profile of transcription factors to DNA sequences, in which different factors compete for the same potential binding sites. The parametrized model simultaneously estimates a continuous measurement of binding occupancy across the genomic sequence for each factor. We can then introduce a localized mutation, rerun the binding model, and record the difference in binding profiles. A single mutation can alter interactions both upstream and downstream of its position due to potential overlapping binding sites, and our statistic captures this domino effect. Over evolutionary time, we observe a clear excess of low-scoring mutations fixed in promoters, consistent with most changes being neutral. However, this is not consistent across all promoters, and some promoters show more rapid divergence. This divergence often occurs in the presence of relatively constant protein-coding divergence. Interestingly, different classes of promoters show different sensitivity to mutations, with phosphorylation-related genes having promoters inherently more sensitive to mutations than immune genes. Although there have previously been a number of models attempting to handle transcription factor binding, Sunflower provides a richer biological model, incorporating weak binding sites and the possibility of competition. The results show the first clear correlations between such a model and evolutionary processes.
Collapse
Affiliation(s)
- Michael M Hoffman
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom
| | | |
Collapse
|
44
|
Abstract
Evolutionary biologists frequently rely on estimates of the neutral rate of evolution when characterizing the selective pressure on protein-coding genes. We introduce a new method to estimate this value based on intron nucleotide substitutions. The new method uses a metascript model that considers alternative splicing forms and an algorithm to pair orthologous introns, which we call Introndeuce. We compare the intron method with a widely used method that uses observed substitutions in synonymous coding nucleotides, by using both methods to estimate the neutral rate for human-dog and mouse-rat comparisons. The estimates of the 2 methods correlate strongly (r(S) = 0.75), but cannot be considered directly equivalent. We also investigate the effect of alignment error and G + C content on the variance in the intron method: in both cases there is an effect, and it is species-pair specific. Although the intron method may be more useful for shorter evolutionary distances, it is less useful at longer distances due to the poor alignment of less-conserved positions.
Collapse
Affiliation(s)
- Michael M Hoffman
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, England
| | | |
Collapse
|
45
|
Abstract
Many mammalian cells regulate their volume by the osmotic movement of water directed by anion and cation flux. Ubiquitous volume-dependent anion currents permit cells to recover volume after swelling in response to a hypotonic environment. This study addressed competition between glutamate (Glu) and Cl(-) permeation in volume-activated anion currents in order to provide insight into the ionic requirements for volume regulation, volume-dependent anion channel activity and to the architecture of the channel pore. The effect of changing the intracellular molar fraction (MF) of Glu and Cl(-) on conductance and relative anion permeability was evaluated as a function of the extracellular permeant anion and/or the ionic strength. Relative permeability of Glu to Cl(-) was determined by measuring reversal potentials under defined ionic conditions. Under conditions with high (150 m M) or low (50 m M) ionic strength solutions on both sides of the membrane, Cl(-) was always more permeable than Glu. When a transmembrane ionic strength gradient (150 m M extracellular: 50 m M intracellular) was set to drive water into the cell, and in the presence of extracellular Cl(-), Glu became up to 16-fold more permeable than Cl(-). Replacement of extracellular Cl(-) with Glu abolished this effect. These results indicate that it is possible for Glu to move into the extracellular environment during volume-regulatory events and they support the emerging role of glutamate as a modulator of anion channel activity.
Collapse
Affiliation(s)
- M M Hoffman
- Dept. of Pharmacology and Physiology, College of Medicine, Drexel University, USA
| | | |
Collapse
|
46
|
Abstract
We have created an Amino Acid-Nucleotide Interaction Database (AANT; http://aant.icmb.utexas. edu/) that categorizes all amino acid-nucleotide interactions from experimentally determined protein-nucleic acid structures, and provides users with a graphic interface for visualizing these interactions in aggregate. AANT accomplishes this by extracting individual amino acid-nucleotide interactions from structures in the Protein Data Bank, combining and superimposing these interactions into multiple structure files (e.g. 20 amino acids x 5 nucleotides) and grouping structurally similar interactions into more readily identifiable clusters. Using the Chime web browser plug-in, users can view 3D representations of the superimpositions and clusters. The unique collection and representation of data on amino acid-nucleotide interactions facilitates understanding the specificity of protein-nucleic acid interactions at a more fundamental level, and allows comparison of otherwise extremely disparate sets of structures. Moreover, by modularly representing the fundamental interactions that govern binding specificity it may prove possible to better engineer nucleic acid binding proteins.
Collapse
Affiliation(s)
- Michael M Hoffman
- Institute for Cellular and Molecular Biology, Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, TX 78712-0159, USA
| | | | | | | | | | | |
Collapse
|
47
|
Ahmed S, Karim A, Hoffman MM, Bhuiya TA, Mattana J. Focal renal mass: an unusual manifestation of chronic lymphocytic leukemia. Clin Nephrol 2002; 58:324-7. [PMID: 12400852 DOI: 10.5414/cnp58324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
48
|
Abstract
I(Cln), a cytosolic protein associated with a nucleotide-sensitive chloride current, may be involved in the regulation of a volume-regulated anion current (VRAC) associated with hypotonic cell swelling. We have determined the nucleic acid sequences of I(Cln) from human tsA201a, colonic (T84) and myeloma (RPMI 8826) cell lines. The amino acid sequences are highly homologous (>/=99%) to each other but less homologous to I(Cln) protein from other species. Using the whole-cell patch clamp technique, we examined the effect of I(Cln) protein expression levels on VRAC properties during a hyposmotic challenge. Overexpression of T84 or RPMI 8226-derived I(Cln) protein in tsA201a cells results in a more than 9-fold increase in the rate of VRAC activation over control values, while having no effect on VRAC inactivation properties. Underexpression of endogenous I(Cln) protein in tsA201a cells using antisense oligonucleotides results in a more than 180-fold decrease in VRAC activation rate as compared to control values. These results indicate that I(Cln) protein expression modulates VRAC activation but not inactivation.
Collapse
Affiliation(s)
- M D Hubert
- Department of Physiology and Biophysics, Finch University of Health Sciences, The Chicago Medical School, North Chicago, IL 60064-3095, USA
| | | | | | | | | | | |
Collapse
|
49
|
Abstract
Recently [Hoffman, M. M., and Roepe, P. D. (1997) Biochemistry 36, 11153-11168] we presented evidence for a novel Na+- and Cl--dependent H+ transport process in LR73/hu MDR 1 CHO transfectants that likely explains pHi, volume, and membrane potential changes in eukaryotic cells overexpressing the hu MDR 1 protein. To further explore this process, we have overexpressed human MDR 1 protein in yeast strain 9.3 following a combination of approaches used previously [Kuchler, K., and Thorner, J. (1992) Proc. Natl. Acad. Sci. U.S.A. 89, 2302-2306; Ruetz, S., et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90, 11588-11592]. Thus, a truncated hu MDR 1 cDNA was cloned behind a tandem array of sterile 6 (Ste6) and alchohol dehydrogenase (Adh) promoters to create the yeast expression vector pFF1. Valinomycin resistance of intact cells and Western blot analysis with purified yeast plasma membranes confirmed the overexpression of full length, functional, and properly localized hu MDR 1 protein in independently isolated 9.3/pFF1 colonies. Interestingly, relative valinomycin resistance and growth of the 9.3/hu MDR 1 strains are found to strongly depend on the ionic composition of the growth medium. Atomic absorption reveals significant differences in intracellular K+ for 9.3/hu MDR 1 versus control yeast. Transport assays using [3H]tetraphenylphosphonium ([3H]TPP+) reveal perturbations in membrane potential for 9.3/hu MDR 1 yeast that are stimulated by KCl and alkaline pHex. ATPase activity of purified plasma membrane fractions from yeast strains and LR73/hu MDR 1 CHO transfectants constructed previously [Hoffman, M. M., et al. (1996) J. Gen. Physiol. 108, 295-313] was compared. MDR 1 ATPase activity exhibits a higher pH optimum and different salt dependencies, relative to yeast H+ ATPase. Inside-out plasma membrane vesicles (ISOV) fabricated from 9.3/hu MDR 1 and control strains were analyzed for formation of H+ gradients +/- verapamil. Similar pharmacologic profiles are found for verapamil stimulation of MDR 1 ATPase activity and H+ pumping in 9.3/hu MDR 1 ISOV. In sum, these experiments strongly support the notion that hu MDR 1 catalyzes H+ transport in some fashion and lowers membrane potential in yeast when K+ contributes strongly to that potential. In the accompanying paper [Santai, C. T., Fritz, F., and Roepe, P. D. (1999) Biochemistry 38, XXXX-XXXX] the effects of ion gradients on H+ transport by hu MDR 1 are examined.
Collapse
Affiliation(s)
- F Fritz
- Department of Chemistry, Lombardi Cancer Center Program in Tumor Biology, Georgetown University, Washington, D.C. 20057, USA
| | | | | | | |
Collapse
|
50
|
Abstract
In previous work [Luz et al. (1994) Biochemistry 33, 7239-7249; Roepe et al. (1994) Biochemistry 33, 11008-11015] we measured changes in Cl- and HCO3--dependent pHi regulation for LR73 Chinese hamster ovary fibroblasts overexpressing mu MDR 1 protein. However, only one clonal cell line overexpressing the protein but not previously exposed to chemotherapeutic drug (i.e., a "true" transfectant) was examined, since very few MDR cell lines of this nature have been constructed. Recently [Hoffman et al. (1996) J. Gen. Physiol. 108, 295-313] we derived a series of true LR73/hu MDR 1 transfectants that are valuable for defining the MDR phenotype mediated by MDR protein alone, without the additional complexities introduced by exposing cells to chemotherapeutic drugs. Several independently derived clones from these and additional transfection experiments exhibit expression of MDR protein that is higher than that found in other true transfectants, and that is similar to the highest level of overexpression yet recorded for drug selected MDR cells. We examined altered Cl--dependent pHi regulation for these clones using improved single-cell photometry (SCP) techniques. Short-term isotonic Cl- substitution experiments performed in the presence of CO2/HCO3- reveal that mild overexpression of hu MDR 1 protein alters anion exchange (Cl-/HCO3- exchange or AE) for LR73 cells, as expected on the basis of previous work [Luz et al. (1994) Biochemistry 33, 7239-7249]. Interestingly, we now find that several independently selected high-level MDR 1 overexpressing clones acidify quite extensively upon isotonic exchange of Cl- and then rapidly alkalinize upon restoring normal [Cl-]. These data suggest that MDR protein may effectively compete against AE. The MDR protein effect is not dependent on HCO3-/CO2 or K+, is partially inhibited by verapamil, is completely inhibited by substituting K+ or N-methylglucamine (NMG+) for Na+ in the SCP perfusate but is not affected by 100 microM levels of amiloride, bumetanide, chlorothiazide, or stilbene. ATP depletion inhibits the MDR 1 effect. We are unable to restore normal AE activity for the MDR clones via manipulation of Cl- or HCO3- gradients. We thus suggest that MDR protein overexpression provides a novel Na+- and Cl--dependent pathway for transmembrane H+ transport. From analysis of ion dependency and inhibitor sensitivities, we conclude the transport is not via altered regulation of any known K+/H+, Na+/H+, or Cl-/HCO3- antiporters, Na+:K+:2Cl-, Na+:K+:2HCO3-, K+:HCO3-, or Na+:HCO3- co-transporters, or any combination of these. Thus, it appears to represent a novel ATP and Na+-dependent Cl-/H+ antiport process that (1) may be directly mediated by the MDR protein, (2) may represent the modulation of one or more currently unidentified ion transport proteins by MDR protein, (3) may be due to some combination of direct ion transport and regulation of ion transport, or (4) may represent unusual passive H+ movement in response to a novel Cl--dependent electrical perturbation that occurs during our Cl- substitution protocol. The results have important implications for understanding drug resistance mediated by MDR 1 overexpression, as well as the physiologic function of endogenously expressed MDR protein.
Collapse
Affiliation(s)
- M M Hoffman
- Molecular Pharmacology and Therapeutics Program at the Raymond and Beverly Sackler Foundation Laboratory, Memorial Sloan-Kettering Cancer Center, Cornell University Medical College, New York, New York 10021, USA
| | | |
Collapse
|