An adapted large language model facilitates multiple medical tasks in diabetes care
Abstract
Diabetes is a chronic disease that poses a significant global health burden, and optimizing diabetes management requires multi-stakeholder collaboration. Large language models (LLMs) have shown promise in various healthcare scenarios, but their effectiveness across a diverse range of diabetes tasks remains unproven. In this study, we introduced a framework to train and validate diabetes-specific LLMs. We first developed a comprehensive data processing pipeline comprising data collection, filtering, augmentation and refinement, which enabled us to create a high-quality, diabetes-specific dataset and several evaluation benchmarks entirely from scratch. Using the collected training dataset, we fine-tuned a diabetes-specific LLM family that demonstrated state-of-the-art proficiency in understanding and processing various diabetes tasks compared with other LLMs. Furthermore, clinical studies showed the potential applications of our models in diabetes care, including providing personalized healthcare, assisting medical education, and streamlining clinical tasks. In conclusion, our study introduced a framework to develop and evaluate a diabetes-specific LLM family, and highlighted its potential to enhance clinical practice and to provide personalized, data-driven support for diabetes care across different end users. The code is available on GitHub at https://github.com/waltonfuture/Diabetica.
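The abstract describes a four-stage data processing pipeline (collection, filtering, augmentation, refinement) feeding supervised fine-tuning. The sketch below is a minimal, hypothetical illustration of such a pipeline; the stage names follow the abstract, but every function body, threshold, and field name is an assumption for illustration, not the authors' released code.

```python
# Hypothetical sketch of a collection -> filtering -> augmentation -> refinement
# pipeline of the kind described in the abstract. Heuristics are illustrative only.
from dataclasses import dataclass


@dataclass
class QAPair:
    question: str
    answer: str
    source: str


def collect(raw_records: list[dict]) -> list[QAPair]:
    """Gather raw diabetes-related records into a uniform QA format."""
    return [QAPair(r["q"], r["a"], r.get("source", "unknown")) for r in raw_records]


def filter_low_quality(pairs: list[QAPair], min_answer_chars: int = 40) -> list[QAPair]:
    """Drop very short or duplicated answers (an assumed, simple heuristic)."""
    seen, kept = set(), []
    for p in pairs:
        key = p.answer.strip().lower()
        if len(key) >= min_answer_chars and key not in seen:
            seen.add(key)
            kept.append(p)
    return kept


def augment(pairs: list[QAPair]) -> list[QAPair]:
    """Placeholder for augmentation, e.g. paraphrasing questions with an LLM."""
    return pairs  # in practice, an LLM call would rephrase or expand items here


def refine(pairs: list[QAPair]) -> list[dict]:
    """Convert to a chat-style instruction format suitable for supervised fine-tuning."""
    return [
        {"messages": [{"role": "user", "content": p.question},
                      {"role": "assistant", "content": p.answer}]}
        for p in pairs
    ]


if __name__ == "__main__":
    raw = [{"q": "What is a common HbA1c target for adults with type 2 diabetes?",
            "a": "For many non-pregnant adults a target below 7% is often used, "
                 "individualized to the patient.",
            "source": "demo"}]
    dataset = refine(augment(filter_low_quality(collect(raw))))
    print(dataset[0])
```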
Community
In this study, we introduced a reproducible paradigm, comprising data processing, model construction, benchmark assessment and clinical evaluation, to develop a specialized large language model called Diabetica that can handle a wide range of diabetes-related tasks. We hope that Diabetica can effectively support people with diabetes and their doctors in daily practice; a hedged usage sketch follows below.
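As a usage illustration, the following sketch shows how one might query a Diabetica-style chat model with the Hugging Face transformers library. The model identifier is an assumption made for illustration; consult the GitHub repository (https://github.com/waltonfuture/Diabetica) for the actual released checkpoints and the recommended chat template.

```python
# Hedged usage sketch: querying a Diabetica-style chat model with transformers.
# The model identifier below is an assumption; verify it against the repository.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "waltonfuture/Diabetica-7B"  # assumed identifier, not confirmed here

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "user",
     "content": "My fasting glucose this morning was 8.5 mmol/L. "
                "What should I discuss with my doctor?"}
]

# Build the prompt with the model's chat template and generate a reply.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```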