A team of researchers from Carnegie Mellon University, in collaboration with Cleveland Clinic’s Cardiovascular Innovation Research Center, has developed an artificial intelligence (AI) system capable of interpreting some of the most complex heart scans in medicine, cardiac magnetic resonance imaging (MRI), without the need for manually labeled training data.
The novel system, called CMR-CLIP, is designed to interpret cardiac MRI scans by connecting moving images of the heart with corresponding clinical radiology reports. In testing, it significantly outperformed general-purpose AI models, in some cases by more than 35 per cent. The system also showed strong potential for improving cardiac imaging analysis, case retrieval and clinical decision support.
“This work demonstrates that domain-specific foundation models can significantly outperform general-purpose AI systems in specialized clinical applications,” said Ding Zhao, associate professor in Carnegie Mellon University’s Department of Mechanical Engineering and co-principal investigator on the study. “By designing models that reflect the structure and complexity of cardiac MRI data, rather than adapting generic image models, we can unlock new levels of performance and clinical utility.”
David Chen, Ph.D., of Cleveland Clinic, a co-principal investigator on the project, emphasized the clinical implications of the work. “Cardiac MRI interpretation is highly specialized and time intensive. Systems like CMR-CLIP have the potential to support clinicians through automated screening, and interpretation support, particularly in settings where expert readers are limited. Such reader assistant tools are critical to improving patient access to this powerful diagnostic technology.”
Cardiac MRI is widely regarded as the gold standard for evaluating heart structure, function and tissue health. A single scan can provide a comprehensive view of the heart, including pumping performance, muscle damage, blood flow and structural abnormalities. However, each study can contain hundreds to thousands of images across multiple views and time points. Even for trained specialists, interpreting a single exam can take 40 minutes or more. Because the technology is expensive and concentrated in major medical centers, there is a limited supply of experts available to meet growing clinical demand.
This combination of complexity and limited data has also made cardiac MRI one of the most challenging domains for AI. Most machine learning systems rely on large, carefully labeled datasets, but in cardiac imaging, expert annotations are scarce, time-consuming to produce and costly to scale.


