Vol 31, No 3 (2024)
Original Article
Published online: 2023-10-12

open access

Page views 1160
Article views/downloads 813
Get Citation

Connect on Social Media

Connect on Social Media

Reshaping medical education: Performance of ChatGPT on a PES medical examination

Simona Wójcik1, Anna Rulkiewicz1, Piotr Pruszczyk2, Wojciech Lisik3, Marcin Poboży4, Justyna Domienik-Karłowicz21
Pubmed: 37830257
Cardiol J 2024;31(3):442-450.

Abstract

Background: We are currently experiencing a third digital revolution driven by artificial intelligence (AI), and the emergence of new chat generative pre-trained transformer (ChatGPT) represents a significant technological advancement with profound implications for global society, especially in the field of education.

Methods: The aim of this study was to see how well ChatGPT performed on medical school exams and to highlight how it might change medical education and practice. Recently, OpenAI’s ChatGPT (OpenAI, San Francisco; GPT-4 May 24 Version) was put to the test against a significant Polish medical specialization licensing exam (PES), and the results are in. The version of ChatGPT-4 used in this study was the most up-to-date model at the time of publication (GPT-4). ChatGPT answered questions from June 28, 2023, to June 30, 2023.

Results: ChatGPT demonstrates notable advancements in natural language processing models on the tasks of medical question answering. In June 2023, the performance of ChatGPT was assessed based on its ability to answer a set of 120 questions, where it achieved a correct response rate of 67.1%, accurately responding to 80 questions.

Conclusions: ChatGPT may be used as an assistance tool in medical education. While ChatGPT can serve as a valuable tool in medical education, it cannot fully replace human expertise and knowledge due to its inherent limitations.

Article available in PDF format

View PDF Download PDF file

References

  1. Adamopoulou E, Moussiades L. An overview of chatbot technology. In IFIP International Conference on Artificial Intelligence Applications and Innovations. Springer 2020: 373–383.
  2. Ritson M. Is ChatGPT the next significant threat to Google’s dominance in the AI market? Marketing Weekly. https://www.marketingweek.com/ritson-chatgpt-google-ai (2022, December 9).
  3. Brent AA. Why ChatGPT is such a big deal for education,” C2C Digital Magazine. C2C Digital Magazine. 2023; 1(18): 14.
  4. AI Will Transform Teaching and Learning. Let’s Get it Right. https://hai.stanford.edu/news/ai-will-transform-teaching-and-learning-lets-get-it-right (Accessed June 2023).
  5. Epstein RH, Dexter F. Variability in Large Language Models' Responses to Medical Licensing and Certification Examinations. Comment on "How Does ChatGPT Perform on the United States Medical Licensing Examination? The Implications of Large Language Models for Medical Education and Knowledge Assessment". JMIR Med Educ. 2023; 9: e48305.
  6. McGee R. Is chat GPT biased against conservatives? An empirical study. SSRN Electron J. 2023.
  7. Choi J, Hickman K, Monahan A, et al. ChatGPT Goes to Law School. SSRN Electron J. 2023.
  8. Carbone C. How ChatGPT could make it easy to cheat on written tests and homework: ‘You can NO LONGER give take-home exams or homework.’. https://www.dailymail.co.uk/sciencetech/article-11513127/ChatGPT-OpenAI-cheat-testshomework.htm (2022, December 7).
  9. Kelly SM. ChatGPT passes exams from law and business schools. https://edition.cnn.com/2023/01/26/tech/chatgpt-passes-exams/ (2023, January 26).
  10. Terwiesch C. Would Chat GPT3 Get a Wharton MBA? A Prediction Based on Its Performance in the Operations Management Course. https://mackinstitute.wharton.upenn.edu/2023/would-chat-gpt3-get-a-wharton-mba-newwhite-paper-by-christian-terwiesch (2023, January 17).
  11. Kung TH, Cheatham M, Medenilla A, et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLOS Digit Health. 2023; 2(2): e0000198.
  12. Antaki F, Touma S, Milad D, et al. Evaluating the performance of ChatGPT in ophthalmology: an analysis of its successes and shortcomings. Ophthalmol Sci. 2023; 3(4): 100324.
  13. Bhayana R, Krishna S, Bleakney RR. Performance of ChatGPT on a radiology board-style examination: insights into current strengths and limitations. Radiology. 2023; 307(5): e230582.
  14. Humar P, Asaad M, Bengur FB, et al. ChatGPT is equivalent to first year plastic surgery residents: evaluation of ChatGPT on the plastic surgery in-service exam. Aesthet Surg J. 2023 [Epub ahead of print].
  15. Kumar AHS. Analysis of ChatGPT tool to assess the potential of its utility for academic writing in biomedical domain. Biol Eng Med Sci Rep. 2023; 9(1): 24–30.
  16. Benoit J. ChatGPT for clinical vignette generation, revision, and evaluation. medRxiv. 2023.
  17. Murphy JFA. Assessment in medical education. Ir Med J. 2007; 100(2): 356.
  18. Lee H. The rise of ChatGPT: Exploring its potential in medical education. Anat Sci Educ. 2023 [Epub ahead of print].
  19. Tsang R. Practical applications of ChatGPT in undergraduate medical education. J Med Educ Curric Dev. 2023; 10: 23821205231178449.
  20. Mbakwe AB, Lourentzou I, Celi LA, et al. ChatGPT passing USMLE shines a spotlight on the flaws of medical education. PLOS Digit Health. 2023; 2(2): e0000205.
  21. https://cem.edu.pl/pytcem/wyswietl_pytania_pes. (Accessed June 2023).
  22. Kumar AHS. Analysis of ChatGPT tool to assess the potential of its utility for academic writing in biomedical domain. Biol Eng Med Sci Rep. 2023; 9(1): 24–30.
  23. Benoit J. ChatGPT for clinical vignette generation, revision, and evaluation. medRxiv. 2023.
  24. https://isap.sejm.gov.pl/isap.nsf/download.xsp/WDU19970280152/O/D19970152.pdf (Accessed June 2023).
  25. Nath S, Marie A, Ellershaw S, et al. New meaning for NLP: the trials and tribulations of natural language processing with GPT-3 in ophthalmology. Br J Ophthalmol. 2022; 106(7): 889–892.