Solutions

Resources

Company

Solutions

Resources

Company

LLM EHR Mining: Optimizing Clinical Trial Screening and Mitigating Screen Failures

portrait of a doctor wearing a short sleeve white lab coat
portrait of a doctor wearing a short sleeve white lab coat

Because clinical trial operations are heavily reliant on time-intensive manual chart reviews by medically trained personnel, the industry is currently confronting a severe administrative bottleneck. Contemporary electronic health records (EHRs) are exceptionally complex, making manual pre-screening processes both costly and inefficient. This reliance on human review introduces intrinsic human error into patient identification, as fatigue and the sheer volume of data often lead to inaccuracies. Compounding this issue is the reality that approximately 80% of vital clinical data—including imaging reports, pathology findings, and physician progress notes—resides in unstructured free-text formats that standard structured data queries cannot properly evaluate. Consequently, the manual extraction of unstructured data remains a highly error-prone segment of clinical research, directly contributing to missed eligibility criteria, elevated screen failure rates, and systematic failures to meet enrollment targets.

Validated by Research: How LLMs Parse Unstructured Clinical Data

Historically, automated chart review efforts faced significant limitations, often relying on simulated patient data or failing to synthesize structured and unstructured data to form cohesive, auditable clinical conclusions. However, recent studies validate that Large Language Models (LLMs) can bridge this gap with high precision.

A landmark study published by Cleveland Clinic researchers in the Journal of Cardiac Failure demonstrated that an AI system deployed within a health system's firewall accurately parsed complex, real-world EHRs to determine trial eligibility. To find patients for a rare heart disease trial, the AI combined standard medical data—like billing codes and lab results—with the nuanced details buried inside doctors' written notes.

When tested against human doctors, the AI evaluated trial criteria with 96.2% accuracy. Even more impressively, the AI uncovered 29 ready-to-enroll patients that traditional manual screening had completely missed over the previous 90 days. Because the system provides 100% clear, auditable reasons for why a patient should be included or excluded, it successfully handles the complicated medical logic that human reviewers often overlook.

A recent JMIR Medical Informatics study breaks down exactly how this works by testing how well AI analyzes messy, unstructured oncology reports. The results proved that advanced models like GPT-4 can match or even beat specialized doctors when it comes to catching hard-to-find clinical details. In fact, GPT-4 correctly identified patient comorbidities from raw text with a 96.8% sensitivity rate, outperforming the 88.8% rate achieved by the physicians. The researchers pointed out that while human reviewers sometimes miss explicitly stated conditions due to normal fatigue or distraction, the AI easily decodes complicated medical acronyms and correctly infers unstated health issues based on the patient's medications and clinical context. By rapidly reading huge blocks of text to spot patterns, these models turn unformatted chart notes into neatly organized JSON datasets. This makes it possible to mine clinical data accurately and at scale—without forcing humans to manually abstract the charts. 

Industrializing the Framework: From Academic Proof to Trially Match

The empirical validations provided by the Cleveland Clinic and Journal of Cardiac Failure establish a definitive academic consensus: large language models can successfully execute deep, highly accurate content analysis on complex electronic health record (EHR) text. However, translating a controlled academic pilot into a scalable, enterprise-grade architecture requires a robust underlying infrastructure. The academic concept of "automated, continuous EHR parsing" is brought to life commercially via Trially’s Match platform. While the aforementioned academic studies utilized distinct proprietary or general-purpose algorithms, Trially’s software framework is engineered specifically to operationalize and industrialize the exact LLM-driven methodologies proven successful in that foundational research. By integrating directly into existing clinical tech stacks, Trially bridges the gap between academic proof-of-concept and a turnkey, multi-site recruitment pipeline.

Semantic Inclusion / Exclusion Mapping

Clinical research demonstrates that relying solely on shallow structured data—such as basic ICD-10 codes or simple lab values—is fundamentally inadequate for trial screening, given that approximately 80% of critical clinical context is buried within unstructured free-text narratives. Trially's Match platform systematically solves this data fragmentation by executing advanced semantic verification across the entirety of the patient record.

Through the Match platform, clinical operations teams can directly upload highly complex trial protocols, which the AI rapidly decomposes into granular, filter-friendly inclusion and exclusion criteria cards in minutes. Moving past surface-level queries, Trially’s engine comprehensively parses the actual natural language within physician progress notes, imaging reports, and pathology findings to map protocol requirements to patient histories. This semantic inclusion/exclusion mapping perfectly mirrors the synthesis of structured and unstructured data validated in the Cleveland Clinic research. By doing so, Trially ensures that highly nuanced eligibility criteria—such as unstated comorbidities, exact disease staging, or specific concomitant medications—are meticulously evaluated, mitigating the human errors that traditionally drive up screen failure rates.

Real-Time Cohort Discovery

Historically, clinical trial patient identification has been a highly reactive, analog process, heavily dependent on manual chart pulls that can consume over 250 hours of site staff time per month. Trially Match fundamentally reengineers this operational workflow by transitioning sites from reactive manual abstraction to a proactive, automated pipeline that instantly flags eligible phenotypic cohorts.

By leveraging pre-built API connectors that plug seamlessly into an institution's existing EHR or clinical trial management system (CTMS), Trially establishes a continuous, real-time data synchronization. This enables the platform to continuously evaluate incoming, real-world clinical data against active trial protocols, automatically surfacing trial-ready candidates who might otherwise slip through the cracks. This capability reflects the commercial realization of the Cleveland Clinic study's most striking exploratory finding: the AI's ability to instantly identify readily recruitable patients who had been entirely missed by routine manual screening processes. By automating real-time cohort discovery, Trially empowers research coordinators to focus their efforts on high-value patient outreach and consent, ultimately multiplying enrollment rates while slashing manual review hours by up to 91%.

The Operational Payoff: Slashing Screen Failures by 60%

For sponsors and clinical research organizations (CROs), the financial toll of outdated recruitment workflows is staggering. Delayed clinical trials can cost sponsors over $600,000 per day in lost time and opportunity. The primary driver of this operational waste is the traditional manual screening process, a frustrating reality where 81% of screened patients are ultimately deemed ineligible. As Phase III trials grow increasingly complex—averaging nearly 26 distinct endpoints per protocol—forcing site staff to manually sift through charts creates an unsustainable administrative bottleneck.

However, the real-world operational benchmarks for AI-assisted prescreening offer a dramatic shift in this financial reality. A landmark, randomized clinical trial published in the Journal of the American Medical Association (JAMA) directly pitted manual chart reviewers against an LLM-powered AI tool.  By deploying the AI-powered framework, research teams accelerated the eligibility verification process by 78% compared to standard human chart review. More importantly, the AI approach achieved a 20.4% eligibility rate compared to just 12.7% for the manual group—a 60% improvement in screening yield without altering any inclusion or exclusion criteria. This massive reduction in wasted effort translated directly to an 84% increase in actual patient enrollments (35 patients for the AI group versus 19 for the human reviewers).

When deployed at scale via enterprise platforms like Trially, these academic benchmarks translate to even greater commercial returns. By instantly processing unstructured data that human reviewers frequently miss, Trially's framework reduces screen failure rates by up to 73% (dropping failures from 54% down to just 14%). Furthermore, it slashes the time coordinators spend on manual chart review by 91%, turning a 36-hour monthly chore into a streamlined 2.5-hour automated process. For clinical sites and Phase III trials, cutting screen failures by these margins fundamentally alters their operational reality. It eliminates thousands of hours of wasted labor, drastically reduces site burnout, and routinely doubles the rate of successful patient enrollments.

Conclusion: Moving from Academic Proof to Live Execution

Clinical research has officially entered its AI era, and the industry debate over the viability of automated chart review is over. Moving forward, the true competitive edge belongs to the sponsors, CROs, and clinical sites that transition from manual spreadsheets and reactive chart pulls to live, automated execution the fastest.

Organizations that adopt these LLM-driven frameworks will not only meet their enrollment targets with unprecedented predictability but will also eliminate the costly operational waste that has burdened the industry for decades. To empower your clinical operations team, industrialize this validated technology, and permanently solve the pre-screening bottleneck, explore Trially’s intelligent matching framework Trially Match.

Get the latest from Trially AI delivered to your inbox.

Get the latest from Trially AI delivered to your inbox.

Get the latest from Trially AI delivered to your inbox.

©

All rights reserved.

All information presented is for illustrative purposes only and does not represent actual data. Trially's product is fully compliant with HIPAA, SOC 2, FDA 21 CFR Part 11 and ISO 27001 regulations, ensuring the highest level of data security, safety and privacy.

©

All rights reserved.

All information presented is for illustrative purposes only and does not represent actual data. Trially's product is fully compliant with HIPAA, SOC 2, FDA 21 CFR Part 11 and ISO 27001 regulations, ensuring the highest level of data security, safety and privacy.

©

All rights reserved.

All information presented is for illustrative purposes only and does not represent actual data. Trially's product is fully compliant with HIPAA, SOC 2, FDA 21 CFR Part 11 and ISO 27001 regulations, ensuring the highest level of data security, safety and privacy.