1
|
Petschner P, Baksa D, Hullam G, Torok D, Millinghoffer A, Deakin JFW, Bagdy G, Juhasz G. A replication study separates polymorphisms behind migraine with and without depression. PLoS One 2021; 16:e0261477. [PMID: 34972135 PMCID: PMC8719675 DOI: 10.1371/journal.pone.0261477] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 12/03/2021] [Indexed: 11/29/2022] Open
Abstract
The largest migraine genome-wide association study identified 38 candidate loci. In this study we assessed whether these results replicate on a gene level in our European cohort and whether effects are altered by lifetime depression. We tested SNPs of the loci and their vicinity with or without interaction with depression in regression models. Advanced analysis methods such as Bayesian relevance analysis and a neural network based classifier were used to confirm findings. Main effects were found for rs2455107 of PRDM16 (OR = 1.304, p = 0.007) and five intergenic polymorphisms in 1p31.1 region: two of them showed risk effect (OR = 1.277, p = 0.003 for both rs11209657 and rs6686879), while the other three variants were protective factors (OR = 0.4956, p = 0.006 for both rs12090642 and rs72948266; OR = 0.4756, p = 0.005 for rs77864828). Additionally, 26 polymorphisms within ADGRL2, 2 in REST, 1 in HPSE2 and 33 mostly intergenic SNPs from 1p31.1 showed interaction effects. Among clumped results representing these significant regions, only rs11163394 of ADGRL2 showed a protective effect (OR = 0.607, p = 0.002), all other variants were risk factors (rs1043215 of REST with the strongest effect: OR = 6.596, p = 0.003). Bayesian relevance analysis confirmed the relevance of intergenic rs6660757 and rs12128399 (p31.1), rs1043215 (REST), rs1889974 (HPSE2) and rs11163394 (ADGRL2) from depression interaction results, and the moderate relevance of rs77864828 and rs2455107 of PRDM16 from main effect analysis. Both main and interaction effect SNPs could enhance predictive power with the neural network based classifier. In summary, we replicated p31.1, PRDM16, REST, HPSE2 and ADGRL2 genes with classic genetic and advanced analysis methods. While the p31.1 region and PRDM16 are worthy of further investigations in migraine in general, REST, HPSE2 and ADGRL2 may be prime candidates behind migraine pathophysiology in patients with comorbid depression.
Collapse
Affiliation(s)
- Peter Petschner
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
- Department of Pharmacodynamics, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- MTA-SE Neuropsychopharmacology and Neurochemistry Research Group, Hungarian Academy of Sciences, Semmelweis University, Budapest, Hungary
| | - Daniel Baksa
- Department of Pharmacodynamics, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- SE-NAP2 Genetic Brain Imaging Migraine Research Group, Hungarian Brain Research Program, Semmelweis University, Budapest, Hungary
| | - Gabor Hullam
- Department of Measurement and Information Systems, Budapest University of Technology and Economics, Budapest, Hungary
| | - Dora Torok
- Department of Pharmacodynamics, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
| | - Andras Millinghoffer
- MTA-SE Neuropsychopharmacology and Neurochemistry Research Group, Hungarian Academy of Sciences, Semmelweis University, Budapest, Hungary
- Department of Measurement and Information Systems, Budapest University of Technology and Economics, Budapest, Hungary
- NAP-2-SE New Antidepressant Target Research Group, Semmelweis University, Budapest, Hungary
| | - J. F. William Deakin
- Neuroscience and Psychiatry Unit, Division of Neuroscience and Experimental Psychology, The University of Manchester and Manchester Academic Health Sciences Centre, Manchester, United Kingdom
| | - Gyorgy Bagdy
- Department of Pharmacodynamics, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- MTA-SE Neuropsychopharmacology and Neurochemistry Research Group, Hungarian Academy of Sciences, Semmelweis University, Budapest, Hungary
- NAP-2-SE New Antidepressant Target Research Group, Semmelweis University, Budapest, Hungary
| | - Gabriella Juhasz
- Department of Pharmacodynamics, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- MTA-SE Neuropsychopharmacology and Neurochemistry Research Group, Hungarian Academy of Sciences, Semmelweis University, Budapest, Hungary
- SE-NAP2 Genetic Brain Imaging Migraine Research Group, Hungarian Brain Research Program, Semmelweis University, Budapest, Hungary
| |
Collapse
|
2
|
Abstract
Background MicroRNAs (miRNAs) are a family of short, non-coding RNAs that have been linked to critical cellular activities, most notably regulation of gene expression. The identification of miRNA is a cross-disciplinary approach that requires both computational identification methods and wet-lab validation experiments, making it a resource-intensive procedure. While numerous machine learning methods have been developed to increase classification accuracy and thus reduce validation costs, most methods use supervised learning and thus require large labeled training data sets, often not feasible for less-sequenced species. On the other hand, there is now an abundance of unlabeled RNA sequence data due to the emergence of high-throughput wet-lab experimental procedures, such as next-generation sequencing. Results This paper explores the application of semi-supervised machine learning for miRNA classification in order to maximize the utility of both labeled and unlabeled data. We here present the novel combination of two semi-supervised approaches: active learning and multi-view co-training. Results across six diverse species show that this multi-stage semi-supervised approach is able to improve classification performance using very small numbers of labeled instances, effectively leveraging the available unlabeled data. Conclusions The proposed semi-supervised miRNA classification pipeline holds the potential to identify novel miRNA with high recall and precision while requiring very small numbers of previously known miRNA. Such a method could be highly beneficial when studying miRNA in newly sequenced genomes of niche species with few known examples of miRNA.
Collapse
|
3
|
Multi-view Co-training for microRNA Prediction. Sci Rep 2019; 9:10931. [PMID: 31358877 PMCID: PMC6662744 DOI: 10.1038/s41598-019-47399-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 07/05/2019] [Indexed: 12/13/2022] Open
Abstract
MicroRNA (miRNA) are short, non-coding RNAs involved in cell regulation at post-transcriptional and translational levels. Numerous computational predictors of miRNA been developed that generally classify miRNA based on either sequence- or expression-based features. While these methods are highly effective, they require large labelled training data sets, which are often not available for many species. Simultaneously, emerging high-throughput wet-lab experimental procedures are producing large unlabelled data sets of genomic sequence and RNA expression profiles. Existing methods use supervised machine learning and are therefore unable to leverage these unlabelled data. In this paper, we design and develop a multi-view co-training approach for the classification of miRNA to maximize the utility of unlabelled training data by taking advantage of multiple views of the problem. Starting with only 10 labelled training data, co-training is shown to significantly (p < 0.01) increase classification accuracy of both sequence- and expression-based classifiers, without requiring any new labelled training data. After 11 iterations of co-training, the expression-based view of miRNA classification experiences an average increase in AUPRC of 15.81% over six species, compared to 11.90% for self-training and 4.84% for passive learning. Similar results are observed for sequence-based classifiers with increases of 46.47%, 39.53% and 29.43%, for co-training, self-training, and passive learning, respectively. The final co-trained sequence and expression-based classifiers are integrated into a final confidence-based classifier which shows improved performance compared to both the expression (1.5%, p = 0.021) and sequence (3.7%, p = 0.006) views. This study represents the first application of multi-view co-training to miRNA prediction and shows great promise, particularly for understudied species with few available training data.
Collapse
|
4
|
Du B, Xinyao T, Wang Z, Zhang L, Tao D. Robust Graph-Based Semisupervised Learning for Noisy Labeled Data via Maximum Correntropy Criterion. IEEE TRANSACTIONS ON CYBERNETICS 2019; 49:1440-1453. [PMID: 29994595 DOI: 10.1109/tcyb.2018.2804326] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Semisupervised learning (SSL) methods have been proved to be effective at solving the labeled samples shortage problem by using a large number of unlabeled samples together with a small number of labeled samples. However, many traditional SSL methods may not be robust with too much labeling noisy data. To address this issue, in this paper, we propose a robust graph-based SSL method based on maximum correntropy criterion to learn a robust and strong generalization model. In detail, the graph-based SSL framework is improved by imposing supervised information on the regularizer, which can strengthen the constraint on labels, thus ensuring that the predicted labels of each cluster are close to the true labels. Furthermore, the maximum correntropy criterion is introduced into the graph-based SSL framework to suppress labeling noise. Extensive image classification experiments prove the generalization and robustness of the proposed SSL method.
Collapse
|
5
|
Torrent-Fontbona F, Lopez B. Personalized Adaptive CBR Bolus Recommender System for Type 1 Diabetes. IEEE J Biomed Health Inform 2018; 23:387-394. [PMID: 29994082 DOI: 10.1109/jbhi.2018.2813424] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Type 1 diabetes mellitus (T1DM) is a chronic disease. Those who have it must administer themselves with insulin to control their blood glucose level. It is difficult to estimate the correct insulin dosage due to the complex glucose metabolism, which can lead to less than optimal blood glucose levels. This paper presents PepperRec, a case-based reasoning (CBR) bolus insulin recommender system capable of dealing with an unrestricted number of situations in which T1DM persons can find themselves. PepperRec considers several factors that affect glucose metabolism, such as data about the physical activity of the user, and can also cope with missing values for these factors. Based on CBR methodology, PepperRec uses new methods to adapt past recommendations to the current state of the user, and retains updated historical patient information to deal with slow and gradual changes in the patient over time (concept drift). The proposed approach is tested using the UVA/PADOVA simulator with 33 virtual subjects and compared with other methods in the literature, and with the default insulin therapy of the simulator. The achieved results demonstrate that PepperRec increases the amount of time the users are in their target glycaemic range, reduces the time spent below it, while maintaining, or even reducing, the time spent above it.
Collapse
|