Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Abstract. Background: The large language models (LLMs), most notably ChatGPT, released since November 30, 2022, have prompted shifting attention to their use in medicine, particularly for supporting clinical decision-making. However, there is little consensus in the medical community on how LLM perfor...

Bibliographic Details
Container / Database: BMC Medical Informatics and Decision Making
Main Authors: Cindy N. Ho, Tiffany Tian, Alessandra T. Ayers, Rachel E. Aaron, Vidith Phillips, Risa M. Wolf, Nestoras Mathioudakis, Tinglong Dai, David C. Klonoff
Format: Article
Language: English
Published: BMC 2024-11-01
Subjects:
Online Access: https://doi.org/10.1186/s12911-024-02757-z