1
|
Lin PI, Moni MA, Gau SSF, Eapen V. Identifying Subgroups of Patients With Autism by Gene Expression Profiles Using Machine Learning Algorithms. Front Psychiatry 2021; 12:637022. [PMID: 34054599 PMCID: PMC8149626 DOI: 10.3389/fpsyt.2021.637022] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Accepted: 04/13/2021] [Indexed: 12/22/2022] Open
Abstract
Objectives: The identification of subgroups of autism spectrum disorder (ASD) may partially remedy the problems of clinical heterogeneity to facilitate the improvement of clinical management. The current study aims to use machine learning algorithms to analyze microarray data to identify clusters with relatively homogeneous clinical features. Methods: The whole-genome gene expression microarray data were used to predict communication quotient (SCQ) scores against all probes to select differential expression regions (DERs). Gene set enrichment analysis was performed for DERs with a fold-change >2 to identify hub pathways that play a role in the severity of social communication deficits inherent to ASD. We then used two machine learning methods, random forest classification (RF) and support vector machine (SVM), to identify two clusters using DERs. Finally, we evaluated how accurately the clusters predicted language impairment. Results: A total of 191 DERs were initially identified, and 54 of them with a fold-change >2 were selected for the pathway analysis. Cholesterol biosynthesis and metabolisms pathways appear to act as hubs that connect other trait-associated pathways to influence the severity of social communication deficits inherent to ASD. Both RF and SVM algorithms can yield a classification accuracy level >90% when all 191 DERs were analyzed. The ASD subtypes defined by the presence of language impairment, a strong indicator for prognosis, can be predicted by transcriptomic profiles associated with social communication deficits and cholesterol biosynthesis and metabolism. Conclusion: The results suggest that both RF and SVM are acceptable options for machine learning algorithms to identify AD subgroups characterized by clinical homogeneity related to prognosis.
Collapse
Affiliation(s)
- Ping-I Lin
- School of Psychiatry, The University of New South Wales, Sydney, NSW, Australia.,South Western Sydney Local Health District, Liverpool, NSW, Australia
| | - Mohammad Ali Moni
- School of Psychiatry, The University of New South Wales, Sydney, NSW, Australia
| | - Susan Shur-Fen Gau
- Department of Psychiatry, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan
| | - Valsamma Eapen
- School of Psychiatry, The University of New South Wales, Sydney, NSW, Australia.,South Western Sydney Local Health District, Liverpool, NSW, Australia
| |
Collapse
|
2
|
Abraham JS, Sripoorna S, Maurya S, Makhija S, Gupta R, Toteja R. Techniques and tools for species identification in ciliates: a review. Int J Syst Evol Microbiol 2019; 69:877-894. [DOI: 10.1099/ijsem.0.003176] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Ciliates are highly divergent unicellular eukaryotic organisms with nuclear dualism and a highly specialized ciliary pattern. They inhabit all biotopes and play crucial roles in regulating microbial food webs as they prey on bacteria, protists and even on microscopic animals. Nevertheless, subtle morphological differences and tiny sizes hinder proper species identification for many ciliates. In the present review, an attempt has been made to elaborate the various approaches used by modern day ciliate taxonomists for species identification. The different approaches involved in taxonomic characterization of ciliates such as classical (using live-cell observations, staining techniques, etc.), molecular (involving various marker genes) and statistical (delimitation of cryptic species) methods have been reviewed. Ecological and behavioural aspects in species identification have also been discussed. In present-day taxonomy, it is important to use a ‘total evidence’ approach in identifying ciliates, relying on both classical and molecular information whenever possible. This integrative approach will help in the mergence of classical methods with modern-day tools for comprehensive species description in future.
Collapse
Affiliation(s)
- Jeeva Susan Abraham
- Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - S. Sripoorna
- Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Swati Maurya
- Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Seema Makhija
- Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Renu Gupta
- Maitreyi College, University of Delhi, Bapu dham, Chanakyapuri, New Delhi 110021, India
| | - Ravi Toteja
- Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| |
Collapse
|
3
|
Bruijning M, Visser MD, Hallmann CA, Jongejans E. trackdem
: Automated particle tracking to obtain population counts and size distributions from videos in
r. Methods Ecol Evol 2018. [DOI: 10.1111/2041-210x.12975] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Marjolein Bruijning
- Radboud UniversityDepartments of Animal Ecology and Physiology & Experimental Plant Ecology Nijmegen The Netherlands
| | - Marco D. Visser
- Radboud UniversityDepartments of Animal Ecology and Physiology & Experimental Plant Ecology Nijmegen The Netherlands
- Princeton UniversityDepartment of Ecology and Evolutionary Biology Princeton NJ USA
| | - Caspar A. Hallmann
- Radboud UniversityDepartments of Animal Ecology and Physiology & Experimental Plant Ecology Nijmegen The Netherlands
| | - Eelke Jongejans
- Radboud UniversityDepartments of Animal Ecology and Physiology & Experimental Plant Ecology Nijmegen The Netherlands
| |
Collapse
|
4
|
Pennekamp F, Griffiths JI, Fronhofer EA, Garnier A, Seymour M, Altermatt F, Petchey OL. Dynamic species classification of microorganisms across time, abiotic and biotic environments-A sliding window approach. PLoS One 2017; 12:e0176682. [PMID: 28472193 PMCID: PMC5417602 DOI: 10.1371/journal.pone.0176682] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Accepted: 04/14/2017] [Indexed: 11/18/2022] Open
Abstract
The development of video-based monitoring methods allows for rapid, dynamic and accurate monitoring of individuals or communities, compared to slower traditional methods, with far reaching ecological and evolutionary applications. Large amounts of data are generated using video-based methods, which can be effectively processed using machine learning (ML) algorithms into meaningful ecological information. ML uses user defined classes (e.g. species), derived from a subset (i.e. training data) of video-observed quantitative features (e.g. phenotypic variation), to infer classes in subsequent observations. However, phenotypic variation often changes due to environmental conditions, which may lead to poor classification, if environmentally induced variation in phenotypes is not accounted for. Here we describe a framework for classifying species under changing environmental conditions based on the random forest classification. A sliding window approach was developed that restricts temporal and environmentally conditions to improve the classification. We tested our approach by applying the classification framework to experimental data. The experiment used a set of six ciliate species to monitor changes in community structure and behavior over hundreds of generations, in dozens of species combinations and across a temperature gradient. Differences in biotic and abiotic conditions caused simplistic classification approaches to be unsuccessful. In contrast, the sliding window approach allowed classification to be highly successful, as phenotypic differences driven by environmental change, could be captured by the classifier. Importantly, classification using the random forest algorithm showed comparable success when validated against traditional, slower, manual identification. Our framework allows for reliable classification in dynamic environments, and may help to improve strategies for long-term monitoring of species in changing environments. Our classification pipeline can be applied in fields assessing species community dynamics, such as eco-toxicology, ecology and evolutionary ecology.
Collapse
Affiliation(s)
- Frank Pennekamp
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- * E-mail:
| | - Jason I. Griffiths
- Department of Animal and Plant Sciences, University of Sheffield, Western Bank, Sheffield, S10 2TN, United Kingdom
| | - Emanuel A. Fronhofer
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Department of Aquatic Ecology, Eawag: Swiss Federal Institute of Aquatic Science and Technology, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland
| | - Aurélie Garnier
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Mathew Seymour
- Department of Aquatic Ecology, Eawag: Swiss Federal Institute of Aquatic Science and Technology, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland
- Molecular Ecology and Fisheries Genetics Laboratory, School of Biological Sciences, Bangor University, Deiniol Road, Gwynedd LL57 2UW, United Kingdom
| | - Florian Altermatt
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Department of Aquatic Ecology, Eawag: Swiss Federal Institute of Aquatic Science and Technology, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland
| | - Owen L. Petchey
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| |
Collapse
|
5
|
Soleymani A, Pennekamp F, Dodge S, Weibel R. Characterizing change points and continuous transitions in movement behaviours using wavelet decomposition. Methods Ecol Evol 2017. [DOI: 10.1111/2041-210x.12755] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Ali Soleymani
- Department of Geography University of Zurich Zurich Switzerland
| | - Frank Pennekamp
- Institute of Evolutionary Biology and Environmental Studies University of Zurich Zurich Switzerland
| | - Somayeh Dodge
- Department of Geography, Environment, and Society University of Minnesota Twin Cities MN USA
| | - Robert Weibel
- Department of Geography University of Zurich Zurich Switzerland
| |
Collapse
|
6
|
Correction: Developing and Integrating Advanced Movement Features Improves Automated Classification of Ciliate Species. PLoS One 2016; 11:e0148592. [PMID: 26824617 PMCID: PMC4732673 DOI: 10.1371/journal.pone.0148592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|