This methodological study aims to determine the agreement between expert nurses and an artificial intelligence (AI) system (ChatGPT-5) in providing scenario-based discharge education for patients who have undergone coronary artery bypass graft (CABG) surgery. The purpose of the study is to evaluate whether ChatGPT-5 can generate discharge education content that is comparable in accuracy, completeness, and clinical appropriateness to that prepared by experienced cardiovascular surgery nurses.
Thirty standardized patient scenarios will be developed to represent a wide range of CABG cases with diverse demographic, socioeconomic, psychosocial, and clinical characteristics. Each scenario will simulate realistic postoperative conditions, including potential complications (e.g., delirium, wound infection, bleeding, arrhythmia), comorbidities (e.g., diabetes, hypertension, COPD), and psychosocial variables such as anxiety level, family structure, and social support. All scenarios will be reviewed and validated by a multidisciplinary expert panel including cardiovascular surgeons and academic nurse specialists to ensure clinical realism and content validity.
Discharge education will be structured around six main domains and twenty-four subtopics derived from national and international guidelines and evidence-based literature. These domains include: (1) medical management and follow-up, (2) daily life and functional recovery, (3) psychosocial and social support, (4) risk factors and preventive health, (5) quality of life and specific conditions, and (6) religious practices. For each scenario, both expert nurses and ChatGPT-5 will independently prepare written discharge education materials using this standardized framework.
The educational materials will be anonymized and evaluated by two blinded reviewers in terms of scientific accuracy, content completeness, linguistic clarity, and alignment with clinical standards. In case of disagreement, a third independent reviewer will provide a final decision to ensure objectivity. Statistical analyses will include Cohen's Kappa coefficient to measure inter-rater agreement and Fisher's Exact Test for categorical comparisons. Diagnostic performance measures such as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score will also be computed.
Data will be analyzed using SPSS v25 (IBM Corp., Armonk, NY, USA). Descriptive statistics (frequencies, percentages, means, and standard deviations) will be reported to summarize the characteristics of the scenarios and evaluations. Agreement levels will be interpreted according to Landis and Koch's classification. A p-value of \<0.05 will be considered statistically significant.
The findings of this study are expected to provide evidence regarding the reliability, validity, and usability of ChatGPT-5 as an innovative and supportive tool for preparing individualized discharge education materials in cardiovascular surgery nursing. Results may contribute to developing new technology-assisted educational models that can reduce nurse workload, improve the standardization of discharge education, and enhance patient understanding and satisfaction in the postoperative period.