Comparative Analysis of ChatGPT and Gemini for Patient Education in Heart Failure: A Readability Study

Orhan Özsoy; Tansu Gençer

Sarcouncil Journal of Medical Series

An Open access peer reviewed international Journal
Publication Frequency- Monthly
Publisher Name-SARC Publisher

ISSN Online- 2945-3550
Country of origin- PHILIPPINES
Impact Factor- 3.7
Language- English

Keywords

Editors

Dr Hazim Abdul-Rahman
Associate Editor
Sarcouncil Journal of Applied Sciences

Entessar Al Jbawi
Associate Editor
Sarcouncil Journal of Multidisciplinary

Rishabh Rajesh Shanbhag
Associate Editor
Sarcouncil Journal of Engineering and Computer Sciences

Dr Md. Rezowan ur Rahman
Associate Editor
Sarcouncil Journal of Biomedical Sciences

Dr Ifeoma Christy
Associate Editor
Sarcouncil Journal of Entrepreneurship And Business Management

Comparative Analysis of ChatGPT and Gemini for Patient Education in Heart Failure: A Readability Study

Keywords: Heart Failure, Large Language Models, Readability Analysis.

Abstract: Background: Heart failure (HF) remains a significant global health burden, necessitating effective patient education and accessible medical information. Artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has emerged as a potential tool for providing on-demand medical explanations. However, the linguistic complexity and readability of these AI-generated responses in the context of HF require systematic evaluation. Objective: This study aimed to comparatively evaluate the readability levels of responses generated by ChatGPT and Gemini regarding heart failure–related queries. Methods: A comparative cross-sectional study was conducted using 40 standardized questions based on established HF guidelines, covering domains such as pathophysiology, symptoms, treatment, and self-care. Responses were collected over 20 repetitions. Readability was assessed using Flesch Reading Ease (FRE), Flesch-Kincaid Grade Level, Gunning Fog Index, and SMOG Index. Statistical analyses were performed using JASP 0.96, with a significance threshold of p < 0.05. Results: No statistically significant differences were found between Gemini and ChatGPT in FRE scores (p=0.542), Flesch-Kincaid Grade Level (p=0.101), or Gunning Fog Index (p=0.094). However, Gemini (Median: 11.00) demonstrated a statistically significantly higher SMOG Index score compared to GPT (Median: 6.00) (p=0.046). Overall, both models produced content at an academic or "medium difficulty" level. Conclusion: While both models exhibit similar readability in most metrics, Gemini produces more complex text according to the SMOG Index, suggesting a higher density of technical terminology. The academic level of these responses may pose barriers for patients with low health literacy. Therefore, AI-generated HF information should be supervised by healthcare professionals to ensure clarity and accessibility for patient education.

Sarcouncil Journal of Medical Series

Keywords

Editors

Home

Aims & Scope

Archive

Indexing

Submit Article

Comparative Analysis of ChatGPT and Gemini for Patient Education in Heart Failure: A Readability Study

Author

People

Policies

Submission

About Us

QUICK LINKS

JOIN US

USEFILL LINKS

SOCIAL MEDIA