Xiao YH, Hu YL, Lv XY, Huang LJ, Geng LH, Liao P, Ding YB, Niu CC. The construction of machine learning-based predictive models for high-quality embryo formation in poor ovarian response patients with progestin-primed ovarian stimulation.
Reprod Biol Endocrinol 2024;
22:78. [PMID:
38987797 PMCID:
PMC11234746 DOI:
10.1186/s12958-024-01251-5]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Accepted: 06/27/2024] [Indexed: 07/12/2024] Open
Abstract
OBJECTIVE
To explore the optimal models for predicting the formation of high-quality embryos in Poor Ovarian Response (POR) Patients with Progestin-Primed Ovarian Stimulation (PPOS) using machine learning algorithms.
METHODS
A retrospective analysis was conducted on the clinical data of 4,216 POR cycles who underwent in vitro fertilization (IVF) / intracytoplasmic sperm injection (ICSI) at Sichuan Jinxin Xinan Women and Children's Hospital from January 2015 to December 2021. Based on the presence of high-quality cleavage embryos 72 h post-fertilization, the samples were divided into the high-quality cleavage embryo group (N = 1950) and the non-high-quality cleavage embryo group (N = 2266). Additionally, based on whether high-quality blastocysts were observed following full blastocyst culture, the samples were categorized into the high-quality blastocyst group (N = 124) and the non-high-quality blastocyst group (N = 1800). The factors influencing the formation of high-quality embryos were analyzed using logistic regression. The predictive models based on machine learning methods were constructed and evaluated accordingly.
RESULTS
Differential analysis revealed that there are statistically significant differences in 14 factors between high-quality and non-high-quality cleavage embryos. Logistic regression analysis identified 14 factors as influential in forming high-quality cleavage embryos. In models excluding three variables (retrieved oocytes, MII oocytes, and 2PN fertilized oocytes), the XGBoost model performed slightly better (AUC = 0.672, 95% CI = 0.636-0.708). Conversely, in models including these three variables, the Random Forest model exhibited the best performance (AUC = 0.788, 95% CI = 0.759-0.818). In the analysis of high-quality blastocysts, significant differences were found in 17 factors. Logistic regression analysis indicated that 13 factors influence the formation of high-quality blastocysts. Including these variables in the predictive model, the XGBoost model showed the highest performance (AUC = 0.813, 95% CI = 0.741-0.884).
CONCLUSION
We developed a predictive model for the formation of high-quality embryos using machine learning methods for patients with POR undergoing treatment with the PPOS protocol. This model can help infertility patients better understand the likelihood of forming high-quality embryos following treatment and help clinicians better understand and predict treatment outcomes, thus facilitating more targeted and effective interventions.
Collapse