Discovering a trans-omics biomarker signature that predisposes high risk diabetic patients to diabetic kidney disease.
NPJ Digit Med 2022;
5:166. [PMID:
36323795 PMCID:
PMC9630270 DOI:
10.1038/s41746-022-00713-7]
[Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 10/18/2022] [Indexed: 11/27/2022] Open
Abstract
Diabetic kidney disease is the leading cause of end-stage kidney disease worldwide; however, the integration of high-dimensional trans-omics data to predict this diabetic complication is rare. We develop artificial intelligence (AI)-assisted models using machine learning algorithms to identify a biomarker signature that predisposes high risk patients with diabetes mellitus (DM) to diabetic kidney disease based on clinical information, untargeted metabolomics, targeted lipidomics and genome-wide single nucleotide polymorphism (SNP) datasets. This involves 618 individuals who are split into training and testing cohorts of 557 and 61 subjects, respectively. Three models are developed. In model 1, the top 20 features selected by AI give an accuracy rate of 0.83 and an area under curve (AUC) of 0.89 when differentiating DM and non-DM individuals. In model 2, among DM patients, a biomarker signature of 10 AI-selected features gives an accuracy rate of 0.70 and an AUC of 0.76 when identifying subjects at high risk of renal impairment. In model 3, among non-DM patients, a biomarker signature of 25 AI-selected features gives an accuracy rate of 0.82 and an AUC of 0.76 when pinpointing subjects at high risk of chronic kidney disease. In addition, the performance of the three models is rigorously verified using an independent validation cohort. Intriguingly, analysis of the protein-protein interaction network of the genes containing the identified SNPs (RPTOR, CLPTM1L, ALDH1L1, LY6D, PCDH9, B3GNTL1, CDS1, ADCYAP and FAM53A) reveals that, at the molecular level, there seems to be interconnected factors that have an effect on the progression of renal impairment among DM patients. In conclusion, our findings reveal the potential of employing machine learning algorithms to augment traditional methods and our findings suggest what molecular mechanisms may underlie the complex interaction between DM and chronic kidney disease. Moreover, the development of our AI-assisted models will improve precision when diagnosing renal impairment in predisposed patients, both DM and non-DM. Finally, a large prospective cohort study is needed to validate the clinical utility and mechanistic implications of these biomarker signatures.
Collapse