Most Searched
Originally published January 2, 2025
Last updated January 2, 2025
Reading Time: 3 minutes
Search more articles
News & Magazine
Topics
As artificial intelligence improves, gastroenterologists are asking whether AI can soon play a larger role in their practice. A new study by researchers from Keck Medicine of USC and the Keck School of Medicine of USC examined whether a large language model can accurately predict when patients should get their next colonoscopy.
Today, primary care physicians and gastroenterologists advise patients when to get screened for colorectal cancer. To make their recommendations, these experts consider several factors, including colonoscopy results, pathology results, risk factors for the disease as well as established, general-population screening guidelines.
But what if AI could make those recommendations instead?
In their study published in Clinical Gastroenterology and Hepatology, researchers from the Keck School of Medicine of USC fed large language model ChatGPT-4 the following data:
Study subjects were patients aged 18 or older who had undergone colorectal cancer screening with either Keck Medicine of USC or Los Angeles General Medical Center. The study excluded patients with serious gastrointestinal conditions such as cancer, inflammatory bowel disease and hereditary polyposis syndromes.
Researchers fed ChatGPT-4 the information and the system to generate recommendations for when the patients should get follow-up colonoscopies based on 2020 screening guidelines from the U.S. Multi-Society Task Force (USMSTF). They then compared the AI recommendations with recommendations generated by both human physicians and the USMSTF.
“This is the first investigation to study ChatGPT-4 for its accuracy and concordance of recommendations on rescreening and surveillance colonoscopy intervals compared to USMSTF guidelines,” researchers added.
The results were positive. When researchers compared ChatGPT-4 and USMSTF screening recommendations, they found the two aligned in 85.7% of cases. In fact, USMSTF screening recommendations were more often in line with recommendations from ChatGPT-4 than recommendations from human physicians. Real-life physician recommendations gathered from medical records matched USMSTF guidelines in only 75.4% of cases.
ChatGPT-4’s performance is especially notable given the large breadth of unorganized, verbatim patient-specific data it was fed. The fact that it could make accurate predictions using this data shows promise for clinical use of large language models.
“Initial real-world results suggest that ChatGPT-4 can accurately define routine colonoscopy screening intervals based on verbatim input of clinical data,” the researchers said.
Currently, AI’s main application in the gastroenterology field has been in assisting with polyp detection, says Ara Sahakian, MD, a gastroenterologist with the USC Digestive Health Institute, part of Keck Medicine, and the senior lead study author. “It helps to avoid missing things and improves the accuracy of diagnosis,” he says.
But one shouldn’t overlook AI’s potential to reduce the time and cost of what Sahakian calls the “workflow side” of clinical care. Such care includes ensuring patients get follow-up, which involves substantial interaction with the patient, including discussion of results, scheduling and authorizations.
“Our study shows where this type of large language model could potentially be built into our electronic health records,” Sahakian says. “It could scan a patient’s pathology, endoscopy and clinical chart data and automatically send a reminder to the patient and physician when it’s time for the next colonoscopy.”
Also consider the fact that the AI engine in this study was not specifically designed around health care. If it were, “it could be even more accurate and powerful for this purpose,” he says. “You can imagine that with this type of powerful AI engine, it doesn’t need to be a doctor doing all the data input and work. Our staff could input the data. It could create a lot of efficiency.”
As AI’s accuracy and data improves, it can reduce the time and cost of day-to-day clinical operations. “After further optimization, [large language models] have potential to extend the clinician’s abilities,” Sahakian and his coauthors state.
Physician oversight is still undoubtedly key. “We do need experts to review everything and make sure the recommendations made by AI are accurate,” Sahakian says. “AI is going to change the game in terms of what we do in detection, diagnosis and workflow. It’s going to make our jobs easier. It’s going to make us faster, better and more accurate at what we do.”
Share