We use cookies to understand how you use our site and to improve your experience. This includes personalizing content and advertising. To learn more, click here. By continuing to use our site, you accept our use of cookies. Cookie Policy.

Features Partner Sites Information LinkXpress hp
Sign In
Advertise with Us
Radcal IBA  Group

Download Mobile App




New Scoring Systems Increase Accuracy of AI-Generated Radiology Reports

By HospiMedica International staff writers
Posted on 07 Aug 2023

Artificial intelligence (AI) tools that efficiently produce detailed narrative reports of CT scans or X-rays can significantly lighten the workload of busy radiologists. More...

These AI reports go beyond simple identification of abnormalities and instead provide complex diagnostic information, detailed descriptions, nuanced findings, and appropriate degrees of uncertainty, similar to how human radiologists describe scan results. While several AI models capable of generating such detailed medical imaging reports have emerged, automated scoring systems meant to assess these tools are proving to be less effective at gauging their performance, according to a new study.

In the study, researchers at Harvard Medical School (Boston, MA, USA) tested various scoring metrics on AI-generated narrative reports and had six human radiologists read these reports. The analysis revealed that automated scoring systems performed poorly compared to human radiologists when it came to evaluating AI-generated reports. These systems misinterpreted and even missed significant clinical errors made by the AI tool. Ensuring the reliability of scoring systems is crucial for AI tools to continue improving and gaining clinicians' trust. However, the metrics tested in the study failed to reliably identify clinical errors in the AI reports, highlighting an urgent need for improvement and the development of high-fidelity scoring systems that accurately monitor tool performance.

In order to create better scoring metrics, the research team designed a new method called RadGraph F1 for evaluating the performance of AI tools generating radiology reports from medical images. Additionally, they created a composite evaluation tool called RadCliQ, which combines multiple metrics to produce a single score that more closely aligns with how a human radiologist would assess an AI model's performance. Using these new scoring tools, the researchers evaluated several state-of-the-art AI models and found a notable gap between their actual scores and the top possible scores.

Going forward, the researchers envision building generalist medical AI models capable of performing various complex tasks, including solving novel problems. Such AI systems could effectively communicate with radiologists and physicians about medical images, assisting in diagnosis and treatment decisions. The team also aims to develop AI assistants that can explain imaging findings directly to patients using everyday language, enhancing patient understanding and engagement. Ultimately, these advancements could revolutionize medical imaging practices, improving efficiency, accuracy, and patient care.

“Accurately evaluating AI systems is the critical first step toward generating radiology reports that are clinically useful and trustworthy,” said study senior author Pranav Rajpurkar, assistant professor of biomedical informatics in the Blavatnik Institute at HMS. “By aligning better with radiologists, our new metrics will accelerate development of AI that integrates seamlessly into the clinical workflow to improve patient care,”

Related Links:
Harvard Medical School 


Platinum Member
Real-Time Diagnostics Onscreen Viewer
GEMweb Live
Gold Member
Temperature Monitor
ThermoScan Temperature Monitoring Unit
Newborn Hearing Screener
ALGO 7i
Exam Table
PF400
Read the full article by registering today, it's FREE! It's Free!
Register now for FREE to HospiMedica.com and get access to news and events that shape the world of Hospital Medicine.
  • Free digital version edition of HospiMedica International sent by email on regular basis
  • Free print version of HospiMedica International magazine (available only outside USA and Canada).
  • Free and unlimited access to back issues of HospiMedica International in digital format
  • Free HospiMedica International Newsletter sent every week containing the latest news
  • Free breaking news sent via email
  • Free access to Events Calendar
  • Free access to LinkXpress new product services
  • REGISTRATION IS FREE AND EASY!
Click here to Register








Channels

Surgical Techniques

view channel
Image: Professor Bumsoo Han and postdoctoral researcher Sae Rome Choi of Illinois co-authored a study on using DNA origami to enhance imaging of dense pancreatic tissue (Photo courtesy of Fred Zwicky/University of Illinois Urbana-Champaign)

DNA Origami Improves Imaging of Dense Pancreatic Tissue for Cancer Detection and Treatment

One of the challenges of fighting pancreatic cancer is finding ways to penetrate the organ’s dense tissue to define the margins between malignant and normal tissue. Now, a new study uses DNA origami structures... Read more

Patient Care

view channel
Image: The portable biosensor platform uses printed electrochemical sensors for the rapid, selective detection of Staphylococcus aureus (Photo courtesy of AIMPLAS)

Portable Biosensor Platform to Reduce Hospital-Acquired Infections

Approximately 4 million patients in the European Union acquire healthcare-associated infections (HAIs) or nosocomial infections each year, with around 37,000 deaths directly resulting from these infections,... Read more
Copyright © 2000-2025 Globetech Media. All rights reserved.