英文摘要 |
Objectives: The recent surge in enthusiasm for artificial intelligence has revolved around GPT (Generative Pretrained Transformer) technology, a technology that relies on large language models to generate diverse texts across topics and contexts. However, a GPT’s performance in answering clinical questions remains to be comprehensively evaluated. Thus, this study was conducted to evaluate the performance of chatbots in answering complex clinical questions. Methods: In this study, Bing Chat (developed by Microsoft) served as a representative of GPT technology. This chatbot can be used in the Edge browser or Bing search engine. We randomly extracted a clinical question from the website of the Cochrane Taiwan Research Center (regarding whether preoperative oral carbohydrate could reduce postoperative discomfort) and asked Bing Chat to answer it using the PICO (Population, Intervention, Comparison, Outcome) framework and to provide relevant references and evidence summaries. A relevant English prompt was entered into the conversation box of Bing Chat, and its answer was recorded. The completeness and accuracy of the chatbot-provided evidence were analyzed through a comparison with the literature (PubMed). Results: Bing Chat rapidly identified the elements of the PICO framework and provided a concise answer on the basis of the literature. However, it offered a limited number of references (seven articles), some of which had incorrect names of journals and authors (a phenomenon known as GPT hallucination). Furthermore, the chatbot provided insufficient evidence summaries that did not cover research methods or results. Nonetheless, after receiving multiple prompts, Bing Chat provided relatively specific information. Conclusions: Bing Chat can answer medical questions to some extent. However, the accuracy and completeness of its data collection and processing methods require further improvement. Therefore, when using this chatbot, health-care providers are recommended to carefully check the information that it provides and to remain aware of its limitations. |