Clinical and Molecular Hepatology

Yeo, Samaan, Ng, Ting, Trivedi, Vipani, Ayoub, Yang, Liran, Spiegel, and Kuo: Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

Original Article

Published online: March 22, 2023

DOI: https://doi.org/10.3350/cmh.2023.0089

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

Yee Hui Yeo^1,^*, Jamil S. Samaan^1,^*, Wee Han Ng², Peng-Sheng Ting³, Hirsh Trivedi^1,⁴, Aarshi Vipani¹, Walid Ayoub^1,⁴, Ju Dong Yang^1,^4,⁵, Omer Liran^6,⁷, Brennan Spiegel^1,⁷

, Alexander Kuo^1,⁴

¹Karsh Division of Gastroenterology and Hepatology, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA

²Bristol Medical School, University of Bristol, Bristol, UK

³School of Medicine, Tulane University, New Orleans, LA, USA

⁴Comprehensive Transplant Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA

⁵Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA

⁶Department of Psychiatry and Behavioral Sciences, Cedars-Sinai, Los Angeles, CA, USA

⁷Division of Health Services Research, Department of Medicine, Cedars-Sinai, Los Angeles, CA, USA

Corresponding author : Brennan Spiegel Karsh Division of Gastroenterology and Hepatology, Department of Medicine, Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA, USA
Tel: +1-310-423-3277, Fax: +1-310-423-2356, E-mail: Brennan.Spiegel@cshs.org

Alexander Kuo Karsh Division of Gastroenterology and Hepatology, Department of Medicine, Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA, USA
Tel: +1-310-423-3277, Fax: +1-310-423-2356, E-mail: Alexander.Kuo@cshs.org

^* These authors should be considered co-first authors.

Editor: Yoon Jun Kim, Seoul National University College of Medicine, Korea

Received March 3, 2023 Revised March 20, 2023 Accepted March 21, 2023

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT

Background/Aims: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC.

Methods: ChatGPT’s responses to 164 questions were independently graded by two transplant hepatologists and resolved by a third reviewer. The performance of ChatGPT was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested.

Results: We showed that ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. The performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of regional guidelines variations, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis.

Conclusions: We analyzed the areas of robustness and limitations of ChatGPT’s responses on the management of cirrhosis and HCC and relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes.

Key words: Artificial intelligence, Patient education as topic, Health communication, Telemedicine, Chronic disease management

Graphical Abstract

INTRODUCTION

INTRODUCTION

MATERIALS AND METHODS

MATERIALS AND METHODS

RESULTS

RESULTS

DISCUSSION

DISCUSSION

ACKNOWLEDGMENTS

ACKNOWLEDGMENTS: ChatGPT was used to generate the structure of part of the resultssection.

FOOTNOTES

FOOTNOTES: Authors’ contribution

Concept and design: Yeo, Samaan, Spiegel, Kuo. Acquisition of data: Yeo, Samaan, Ng, Vipani. Data review: Ting, Trivedi, Kuo. Statistical analysis: Yeo. Drafting of the manuscript: Yeo, Samaan, Ng. Critical revision of the manuscript: All authors. All authors approved the final version of the manuscript.
Conflicts of Interest

The authors have no conflictsto disclose.

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL

Figure 1.

Flow chart of question selection for cirrhosis and hepatocellular carcinoma (HCC). Frequently asked questions about the knowledge and management of cirrhosis or HCC were collected from patient support groups on Facebook and well-regarded professional societies and institutions. FAQs, frequently asked questions.

Figure 2.

Grade of responses by the ChatGPT language model to questions related to (A) cirrhosis and (B) hepatocellular carcinoma (HCC). The percentage of responses being graded as comprehensive, correct but inadequate, mixed with correct and incorrect/outdated data, and completely incorrect were provided. GPT, Generative Pre-trained Transformer.

Table 1.

Percentage of questions with significantly different responses and difference in grading between the two responses

Domain	Reproducibility
Domain	Cirrhosis	Hepatocellular carcinoma
Basic knowledge (n=36, 16)	2.78%	0.00%
Diagnosis (n=3, 6)	0.00%	0.00%
Treatment (n=16, 30)	6.25%	0.00%
Lifestyle (n=22, 8)	4.54%	16.70%
Preventive medicine (n=4, 0)	0%	0%
Others (n=10, 13)	20%	0.00%

Difference between the two responses was assessed by the reviewers as a binary yes/no answer. Difference in grading between the two responses was defined as the difference in grading category (1 and 2 vs. 3 and 4).

Table 2.

ChatGPT provides emotional support to patients and caregivers

Prompts	Response from ChatGPT
1. My family member was recently diagnosed with hepatocellular carcinoma. What can I do to help?	If your family member has been diagnosed with hepatocellular carcinoma, there are several ways you can support them:
	- Offer practical support: Help with tasks like grocery shopping, cooking, and transportation to appointments.
	- Be a listening ear: Encourage your family member to talk about their feelings and offer a supportive and understanding ear.
	- Help them find information: Look into resources and support groups that may be helpful for your family member.
	- Take care of yourself: It’s important to remember to take care of your own physical and emotional health as well. It can be emotionally and physically draining to support a loved one through a serious health condition, so make sure to take breaks and seek support for yourself as needed.
	- Encourage them to follow their treatment plan: It’s important for your family member to follow their treatment plan as recommended by their medical team. Offer support and encouragement to help them stick to their treatment plan.
	- It’s also a good idea to talk to your family member’s medical team about other ways you can support them. They may have specific recommendations or suggestions based on your loved one’s individual needs.
2. I just received a diagnosis of hepatocellular carcinoma. I am anxious, sad, and confused! What should I do?	I’m sorry to hear about your diagnosis. It’s completely natural to feel anxious, sad, and confused when faced with a serious health condition like hepatocellular carcinoma. The most important thing you can do is to talk to your doctor about your treatment options. They will be able to give you more information about your specific case and help you come up with a plan that is right for you. It may also be helpful to speak with a mental health professional, such as a therapist or counselor, to help you cope with your emotions. It’s important to take care of your emotional .well-being in addition to your physical health.
3. I was recently diagnosed with alcoholic hepatitis and cirrhosis, I am so stressed out and I don’t know cope with all these, what should I do?	It can be overwhelming to receive a diagnosis of alcoholic hepatitis and cirrhosis, and it’s important to take care of yourself both physically and emotionally. Here are a few things you can do to cope:
	- Seek professional help: Talk to your doctor or a therapist about your diagnosis and treatment options. They can also help you manage stress and other emotions related to your diagnosis.
	- Stop drinking: It is important to stop drinking alcohol in order to prevent further damage to your liver and overall health.
	- Eat a healthy diet: Eating a balanced diet that is rich in fruits, vegetables, and lean protein can help your liver recover and improve your overall health.
	- Exercise: Regular physical activity can help reduce stress and improve your overall health.
	- Get enough rest: Make sure to get enough sleep and rest to help your body heal.
	- Join support groups: Joining support groups can be very helpful in dealing with the emotional and practical aspects of the disease.
	- Seek medical attention: If you are experiencing severe symptoms such as abdominal pain, jaundice, vomiting blood or a rapid decline in mental status, please seek medical attention.
	- Remember that recovery takes time and it’s important to be patient with yourself. It is also important to keep in touch with your healthcare provider and follow their advice.
4. My father was officially denied for liver transplantation because of his old age and debilitation. The whole family is really upset. What should we do to help him?	It is important to understand that transplant centers have strict guidelines in place to ensure that transplants are performed in the most effective and ethical manner possible. These guidelines take into account factors such as the patient’s age, overall health, and likelihood of survival after the transplant.
	That being said, there are still options for your father. One option would be to seek a second opinion from another transplant center, as different centers may have different guidelines. Additionally, your father may be a candidate for alternative treatments, such as a less invasive procedure or alliative care to help manage his symptoms.
	It may also be beneficial for your family to seek support from a social worker or counselor to help cope with the difficult news and to discuss any other options that may be available for your father.
	It’s also important to have a good communication with his doctor, ask the reason why he was denied the transplant, and what other options are available for him.

GPT, Generative Pre-trained Transformer.

Abbreviations

Abbreviations: HCC
hepatocellular carcinoma
GPT
Generative Pre-trained Transformer
NLP
natural language processing
LLM
Large Language Model
RLHF/RLHP
Reinforcement Learning from Human Feedback or Reinforcement Learning from Human Preference
AASLD
American Association for the Study of Liver Diseases
OR
odds ratio
CI
confidence interval

REFERENCES

REFERENCES: REFERENCES

1. GBD 2017 Cirrhosis Collaborators. The global, regional, and national burden of cirrhosis by cause in 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet Gastroenterol Hepatol 2020;5:245-266.
[PubMed] [PMC]

2. Tsochatzis EA, Bosch J, Burroughs AK. Liver cirrhosis. Lancet 2014;383:1749-1761.
[Article] [PubMed]

3. Yang JD, Hainaut P, Gores GJ, Amadou A, Plymoth A, Roberts LR. A global view of hepatocellular carcinoma: trends, risk, prevention and management. Nat Rev Gastroenterol Hepatol 2019;16:589-604.
[Article] [PubMed] [PMC]

4. Rumgay H, Arnold M, Ferlay J, Lesi O, Cabasag CJ, Vignat J, et al. Global burden of primary liver cancer in 2020 and predictions to 2040. J Hepatol 2022;77:1598-1606.
[Article] [PubMed] [PMC]

5. Desai AP, Mohan P, Nokes B, Sheth D, Knapp S, Boustani M, et al. Increasing economic burden in hospitalized patients with cirrhosis: Analysis of a national database. Clin Transl Gastroenterol 2019;10:e00062.
[Article] [PubMed] [PMC]

6. McAdam-Marx C, McGarry LJ, Hane CA, Biskupiak J, Deniz B, Brixner DI. All-cause and incremental per patient per year cost associated with chronic hepatitis C virus and associated liver complications in the United States: a managed care perspective. J Manag Care Pharm 2011;17:531-546.
[Article] [PubMed]

7. McGlynn KA, Petrick JL, El-Serag HB. Epidemiology of hepatocellular carcinoma. Hepatology 2021;73(Suppl 1):4-13.
[Article] [PubMed] [PMC]

8. Ginès P, Krag A, Abraldes JG, Solà E, Fabrellas N, Kamath PS. Liver cirrhosis. Lancet 2021;398:1359-1376.
[Article] [PubMed]

9. D’Amico G, De Franchis R; Cooperative Study Group. Upper digestive bleeding in cirrhosis. Post-therapeutic outcome and prognostic indicators. Hepatology 2003;38:599-612.
[Article] [PubMed]

10. Nusrat S, Khan MS, Fazili J, Madhoun MF. Cirrhosis and its complications: evidence based treatment. World J Gastroenterol 2014;20:5442-5460.
[Article] [PubMed] [PMC]

11. Attwa MH, El-Etreby SA. Guide for diagnosis and treatment of hepatocellular carcinoma. World J Hepatol 2015;7:1632-1651.
[Article] [PubMed] [PMC]

12. Yang JD, Heimbach JK. New advances in the diagnosis and management of hepatocellular carcinoma. BMJ 2020;371:m3544.
[Article] [PubMed]

13. Freundlich Grydgaard M, Bager P. Health literacy levels in outpatients with liver cirrhosis. Scand J Gastroenterol 2018;53:1584-1589.
[Article] [PubMed]

14. Gulati R, Nawaz M, Pyrsopoulos NT. Health literacy and liver disease. Clin Liver Dis (Hoboken) 2018;11:48-51.
[Article] [PubMed] [PMC]

15. Saab S, Konyn PG, Viramontes MR, Jimenez MA, Grotts JF, Hamidzadah W, et al. Limited knowledge of acetaminophen in patients with liver disease. J Clin Transl Hepatol 2016;4:281-287.
[Article] [PubMed] [PMC]

16. Kaundinya T, Mazumder N, Atiemo K, Spellman A, Daud A, Curtis LM, et al. Health literacy gaps in online resources for cirrhotic patients. J Curr Surg 2020;10:1-6.
[Article] [PubMed] [PMC]

17. OpenAI. ChatGPT: Optimizing Language Models for Dialogue. OpenAI web site, <https://openai.com/blog/chatgpt>. Accessed 9 Jan 2023.

18. Ouyang L, Wu J, Jiang X, Almeida D, Wainwright C, Mishkin P, et al. Training language models to follow instructions with human feedback. Adv Neural Inf Process Syst 2022;35:27730-27744.

19. Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, et al. How does ChatGPT perform on the United States medical licensing examination? The implications of large language models for medical education and knowledge assessment. JMIR Med Educ 2023;9:e45312.
[Article] [PubMed] [PMC]

20. Jeblick K, Schachtner B, Dexl J, Mittermeier A, Stüber AT, Topalis J, et al. ChatGPT makes medicine easy to swallow: An exploratory case study on simplified radiology reports. arXiv 2212.14882 [Preprint]. 2022 [cited 9 Jan 2023]. Available from: https://arxiv.org/abs/2212.14882.
[Article]

21. Bogost I. EChatGPT Is Dumber Than You Think. The Atlantic web site, <https://www.theatlantic.com/technology/archive/2022/12/chatgpt-openai-artificial-intelligence-writingethics/672386/>. Accessed 9 Jan 2023.

22. Christiano PF, Leike J, Brown T, Martic M, Legg S, Amodei D. Deep reinforcement learning from human preferences. Paper presented at: 31st Conference on Neural Information Processing Systems (NIPS 2017). 2017;Long Beach, CA, USA.

23. Valery PC, Bernardes CM, Hayward KL, Hartel G, Haynes K, Gordon LG, et al. Poor disease knowledge is associated with higher healthcare service use and costs among patients with cirrhosis: an exploratory study. BMC Gastroenterol 2022;22:340.
[Article] [PubMed] [PMC]

24. Farvardin S, Patel J, Khambaty M, Yerokun OA, Mok H, Tiro JA, et al. Patient-reported barriers are associated with lower hepatocellular carcinoma surveillance rates in patients with cirrhosis. Hepatology 2017;65:875-884.
[Article] [PubMed] [PMC]

25. van Meer S, de Man RA, Coenraad MJ, Sprengers D, van Nieuwkerk KM, Klümpen HJ, et al. Surveillance for hepatocellular carcinoma is associated with increased survival: Results from a large cohort in the Netherlands. J Hepatol 2015;63:1156-1163.
[Article] [PubMed]

26. Heydari A, Ziaee ES, Gazrani A. Relationship between Awareness of Disease and Adherence to Therapeutic Regimen among Cardiac Patients. Int J Community Based Nurs Midwifery 2015;3:23-30.
[PubMed] [PMC]

27. Hayward KL, Valery PC, Patel PJ, Horsfall LU, Wright PL, Tallis CJ, et al. Effectiveness of patient-oriented education and medication management intervention in people with decompensated cirrhosis. Intern Med J 2020;50:1142-1146. Erratum in: Intern Med J 2020;50:1302.

28. Victora CG, Barros AJD, França GVA, da Silva ICM, Carvajal-Velez L, Amouzou A. The contribution of poor and rural populations to national trends in reproductive, maternal, newborn, and child health coverage: analyses of cross-sectional surveys from 64 countries. Lancet Glob Health 2017;5:e402-e407.
[Article] [PubMed] [PMC]

29. Brown AF, Ma GX, Miranda J, Eng E, Castille D, Brockie T, et al. Structural interventions to reduce and eliminate health disparities. Am J Public Health 2019;109(Suppl 1):S72-S78.
[Article] [PubMed] [PMC]

30. Noguchi Y. Therapy by chatbot? The promise and challenges in using AI for mental health. npr web site, <https://www.npr.org/sections/health-shots/2023/01/19/1147081115/therapy-bychatbot-the-promise-and-challenges-in-using-ai-for-mentalhealth>. Accessed 9 Jan 2023.

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4A.

Supplementary Table 4B.

Figure 1.

Figure 2.

Table 1.

Table 2.

REFERENCES