Scientific Reports (Nov 2024)

Assessing the ability of GPT-4o to visually recognize medications and provide patient education

  • Amjad H. Bazzari,
  • Firas H. Bazzari

DOI
https://doi.org/10.1038/s41598-024-78577-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 8

Abstract

Various studies have investigated the ability of ChatGPT (OpenAI) to provide medication information; however, a promising new feature that allows visual input has since been added and is yet to be evaluated. Here, we aimed to qualitatively assess its ability to visually recognize medications from medication pictures and to provide patient education via written and visual output. The responses were evaluated for accuracy, precision and clarity using a 4-point Likert-like scale. With regard to handling visual input and providing written responses, GPT-4o was able to recognize all 20 tested medications from packaging pictures, even with blurring, retrieve their active ingredients, identify formulations and dosage forms, and provide detailed yet sufficiently concise patient education in an almost completely accurate, precise and clear manner, with a score of 3.55 ± 0.605 (85%). In contrast, the visual output, in the form of GPT-4o-generated images illustrating usage instructions, contained many errors that would either hinder the effectiveness of the medication or cause direct harm to the patient, with a poor score of 1.5 ± 0.577 (16.7%). In conclusion, GPT-4o is capable of identifying medications from pictures, but its patient education performance differs sharply between output types: very impressive for written output and poor for visual output.
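The abstract does not state how the percentages in parentheses were derived. Both reported values are consistent with rescaling the mean score from a 1-to-4 scale onto 0-100%, so the sketch below assumes that mapping; the function name and the scale bounds are illustrative assumptions, not taken from the paper.

```python
def likert_to_percent(mean_score, scale_min=1.0, scale_max=4.0):
    """Rescale a mean Likert score to a 0-100% range.

    Assumption: the scale runs from 1 (worst) to 4 (best), so a mean
    of 1 maps to 0% and a mean of 4 maps to 100%.
    """
    return (mean_score - scale_min) / (scale_max - scale_min) * 100.0


# Mean scores reported in the abstract.
written = likert_to_percent(3.55)  # written patient education
visual = likert_to_percent(1.5)    # generated instructional images

print(f"written: {written:.1f}%, visual: {visual:.1f}%")
```

Under this assumption the two means reproduce the reported 85% and 16.7%.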