Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Basu S, Kumbier K, Brown JB, Yu B. Iterative random forests to discover predictive and stable high-order interactions. Proc Natl Acad Sci U S A 2018;115:1943-8. [PMID: 29351989 DOI: 10.1073/pnas.1711236115] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

For:	Basu S, Kumbier K, Brown JB, Yu B. Iterative random forests to discover predictive and stable high-order interactions. Proc Natl Acad Sci U S A 2018;115:1943-8. [PMID: 29351989 DOI: 10.1073/pnas.1711236115] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Number

Cited by Other Article(s)

101

Ramchandran M, Patil P, Parmigiani G. Tree-Weighting for Multi-Study Ensemble Learners. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020;25:451-462. [PMID: 31797618 PMCID: PMC6980320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

102

Pfister N, Bauer S, Peters J. Learning stable and predictive structures in kinetic systems. Proc Natl Acad Sci U S A 2019;116:25405-25411. [PMID: 31776252 PMCID: PMC6925987 DOI: 10.1073/pnas.1905688116] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

103

A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks. Genes (Basel) 2019;10:genes10120996. [PMID: 31810264 PMCID: PMC6947651 DOI: 10.3390/genes10120996] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 11/23/2019] [Accepted: 11/26/2019] [Indexed: 12/12/2022] Open

104

Harfouche AL, Jacobson DA, Kainer D, Romero JC, Harfouche AH, Scarascia Mugnozza G, Moshelion M, Tuskan GA, Keurentjes JJ, Altman A. Accelerating Climate Resilient Plant Breeding by Applying Next-Generation Artificial Intelligence. Trends Biotechnol 2019;37:1217-1235. [DOI: 10.1016/j.tibtech.2019.05.007] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 05/18/2019] [Accepted: 05/23/2019] [Indexed: 12/20/2022]

105

Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, Yu B. Definitions, methods, and applications in interpretable machine learning. Proc Natl Acad Sci U S A 2019. [PMID: 31619572 DOI: 10.1073/pnas.1900654116/suppl_file/pnas.1900654116.sapp.pdf] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2023] Open

106

Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, Yu B. Definitions, methods, and applications in interpretable machine learning. Proc Natl Acad Sci U S A 2019;116:22071-22080. [PMID: 31619572 PMCID: PMC6825274 DOI: 10.1073/pnas.1900654116] [Citation(s) in RCA: 373] [Impact Index Per Article: 74.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

107

Vervier K, Michaelson JJ. TiSAn: estimating tissue-specific effects of coding and non-coding variants. Bioinformatics 2019;34:3061-3068. [PMID: 29912365 PMCID: PMC6137979 DOI: 10.1093/bioinformatics/bty301] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Accepted: 04/16/2018] [Indexed: 02/06/2023] Open

108

Long GS, Hussen M, Dench J, Aris-Brosou S. Identifying genetic determinants of complex phenotypes from whole genome sequence data. BMC Genomics 2019;20:470. [PMID: 31182025 PMCID: PMC6558885 DOI: 10.1186/s12864-019-5820-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 05/21/2019] [Indexed: 02/08/2023] Open

Abstract

BACKGROUND

A critical goal in biology is to relate the phenotype to the genotype, that is, to find the genetic determinants of various traits. However, while simple monofactorial determinants are relatively easy to identify, the underpinnings of complex phenotypes are harder to predict. While traditional approaches rely on genome-wide association studies based on Single Nucleotide Polymorphism data, the ability of machine learning algorithms to find these determinants in whole proteome data is still not well known.

RESULTS

To better understand the applicability of machine learning in this case, we implemented two such algorithms, adaptive boosting (AB) and repeated random forest (RRF), and developed a chunking layer that facilitates the analysis of whole proteome data. We first assessed the performance of these algorithms and tuned them on an influenza data set, for which the determinants of three complex phenotypes (infectivity, transmissibility, and pathogenicity) are known based on experimental evidence. This allowed us to show that chunking improves runtimes by an order of magnitude. Based on simulations, we showed that chunking also increases sensitivity of the predictions, reaching 100% with as few as 20 sequences in a small proteome as in the influenza case (5k sites), but may require at least 30 sequences to reach 90% on larger alignments (500k sites). While RRF has less specificity than random forest, it was never <50%, and RRF sensitivity was significantly higher at smaller chunk sizes. We then used these algorithms to predict the determinants of three types of drug resistance (to Ciprofloxacin, Ceftazidime, and Gentamicin) in a bacterium, Pseudomonas aeruginosa. While both algorithms performed well in the case of the influenza data, results were more nuanced in the bacterial case, with RRF making more sensible predictions, with smaller errors rates, than AB.

CONCLUSIONS

Altogether, we demonstrated that ML algorithms can be used to identify genetic determinants in small proteomes (viruses), even when trained on small numbers of individuals. We further showed that our RRF algorithm may deserve more scrutiny, which should be facilitated by the decreasing costs of both sequencing and phenotyping of large cohorts of individuals.

Collapse

109

A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources. WATER 2019. [DOI: 10.3390/w11050910] [Citation(s) in RCA: 93] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

110

Farbehi N, Patrick R, Dorison A, Xaymardan M, Janbandhu V, Wystub-Lis K, Ho JW, Nordon RE, Harvey RP. Single-cell expression profiling reveals dynamic flux of cardiac stromal, vascular and immune cells in health and injury. eLife 2019;8:43882. [PMID: 30912746 PMCID: PMC6459677 DOI: 10.7554/elife.43882] [Citation(s) in RCA: 318] [Impact Index Per Article: 63.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 03/25/2019] [Indexed: 12/11/2022] Open

Abstract

Besides cardiomyocytes (CM), the heart contains numerous interstitial cell types which play key roles in heart repair, regeneration and disease, including fibroblast, vascular and immune cells. However, a comprehensive understanding of this interactive cell community is lacking. We performed single-cell RNA-sequencing of the total non-CM fraction and enriched (Pdgfra-GFP⁺) fibroblast lineage cells from murine hearts at days 3 and 7 post-sham or myocardial infarction (MI) surgery. Clustering of >30,000 single cells identified >30 populations representing nine cell lineages, including a previously undescribed fibroblast lineage trajectory present in both sham and MI hearts leading to a uniquely activated cell state defined in part by a strong anti-WNT transcriptome signature. We also uncovered novel myofibroblast subtypes expressing either pro-fibrotic or anti-fibrotic signatures. Our data highlight non-linear dynamics in myeloid and fibroblast lineages after cardiac injury, and provide an entry point for deeper analysis of cardiac homeostasis, inflammation, fibrosis, repair and regeneration.

In our bodies, heart attacks lead to cell death and inflammation. This is then followed by a healing phase where the organ repairs itself. There are many types of heart cells, from muscle and pacemaker cells that help to create the beating motion, to so-called fibroblasts that act as a supporting network. Yet, it is still unclear how individual cells participate in the heart's response to injury.

All cells possess the same genetic information, but they turn on or off different genes depending on the specific tasks that they need to perform. Spotting which genes are activated in individual cells can therefore provide clues about their exact roles in the body. Until recently, technological limitations meant that this information was difficult to access, because it was only possible to capture the global response of a group of cells in a sample.

A new method called single-cell RNA sequencing is now allowing researchers to study the activities of many genes in thousands of individual cells at the same time. Here, Farbehi, Patrick et al. performed single-cell RNA sequencing on over 30,000 individual cells from healthy and injured mouse hearts. Computational approaches were then used to cluster cells into groups according to the activities of their genes.

The experiments identified over 30 distinct sub-types of cell, including several that were previously unknown. For example, a group of fibroblasts that express a gene called Wif1 was discovered. Previous genetic studies have shown that Wif1 is essential for the heart's response to injury. Further experiments by Farbehi, Patrick et al. indicated that this new sub-type of cells may control the timing of the different aspects of heart repair after damage.

Tens of millions of people around the world suffer from heart attacks and other heart diseases. Knowing how different types of heart cells participate in repair mechanisms may help to find new targets for drugs and other treatments.

Collapse

Affiliation(s)

Nona Farbehi Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Sydney, Australia.,Graduate School of Biomedical Engineering, UNSW Sydney, Kensington, Australia
Ralph Patrick Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,St. Vincent's Clinical School, UNSW Sydney, Kensington, Australia
Aude Dorison Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia
Munira Xaymardan Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,School of Dentistry, Faculty of Medicine and Health, University of Sydney, Westmead Hospital, Westmead, Australia
Vaibhao Janbandhu Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,St. Vincent's Clinical School, UNSW Sydney, Kensington, Australia
Katharina Wystub-Lis Victor Chang Cardiac Research Institute, Darlinghurst, Australia
Joshua Wk Ho Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,St. Vincent's Clinical School, UNSW Sydney, Kensington, Australia
Robert E Nordon Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,Graduate School of Biomedical Engineering, UNSW Sydney, Kensington, Australia
Richard P Harvey Victor Chang Cardiac Research Institute, Darlinghurst, Australia.,Stem Cells Australia, Melbourne Brain Centre, University of Melbourne, Victoria, Australia.,School of Biotechnology and Biomolecular Science, UNSW Sydney, Kensington, Australia

Collapse

111

Azuaje F. Artificial intelligence for precision oncology: beyond patient stratification. NPJ Precis Oncol 2019;3:6. [PMID: 30820462 PMCID: PMC6389974 DOI: 10.1038/s41698-019-0078-1] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 01/22/2019] [Indexed: 12/18/2022] Open

112

Deshpande S, Shuttleworth J, Yang J, Taramonli S, England M. PLIT: An alignment-free computational tool for identification of long non-coding RNAs in plant transcriptomic datasets. Comput Biol Med 2019;105:169-181. [DOI: 10.1016/j.compbiomed.2018.12.014] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Revised: 12/27/2018] [Accepted: 12/29/2018] [Indexed: 02/05/2023]

113

Classification and interaction in random forests. Proc Natl Acad Sci U S A 2018;115:1690-1692. [PMID: 29440440 DOI: 10.1073/pnas.1800256115] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open