Modernizing Healthcare with AI: How RADPAIR and Fireworks Unlock Smarter Radiology Workflows

Executive Summary

RADPAIR is transforming radiology workflows with its intent to create an open-source SDK standard – anchored by the Report Document Schema (RDS) and Actions and Event Protocol (AEP) – which enables safe, intelligent AI interactions across reporting systems. RADPAIR’s SDK establishes a new AI orchestration standard for healthcare, designed for adoption by coalition partners to enable interoperable, safe, multi-agent workflows across institutions. Fireworks AI provides the enterprise-grade infrastructure and orchestration platform for RADPAIR’s fine-tuned models and multi-agent pipelines, ensuring real-time, scalable, and compliant performance.

Today, radiologists at institutions including Radiology Partners, which handles 40-50 million cases annually, benefit from AI-assisted workflows that integrate real-time dictation and generative AI structured reporting, reducing cognitive load, accelerating throughput, and improving diagnostic confidence. Key performance gains observed in production include:

•Report turnaround: Reduced from 15-20 seconds → 2-5 seconds, meeting clinical Service Level Agreement (SLA) requirements
•Workflow efficiency: Early data suggests ~25% reduction in time per case, thanks to auto-populated fields, smart suggestions, and reduced context switching
•Reporting accuracy: Pilot studies and independent research indicate 12% fewer reporting errors (Radiology Business).
•Scalability: Supports 1,000+ concurrent users, with sub-second Speech-to-Text (STT) latency and near real-time display of intermediate transcripts (<200ms).

By combining RADPAIR’s innovative AI orchestration with Fireworks’ scalable, low-latency infrastructure, the partnership unlocks clinically meaningful impact: radiologists focus on interpretation and patient care, patients receive faster, more accurate diagnoses, and hospitals gain auditable, compliant workflows without building their own AI infrastructure.

Key Outcomes at a Glance

Outcome	Before	After / Target
Report Turnaround	15-20 seconds	2-5 seconds (meeting SLA)
Workflow Efficiency	High cognitive load, multiple context switches	~25% faster per case, auto-populated structured reports
Reporting Accuracy	Frequent transcription errors, hallucinations	~12% fewer errors, reliable dictation capture
Scale & Concurrency	Limited by infrastructure	Supports 1,000+ simultaneous microphones, 100-200 reports/day per physician, sub-second STT latency, multi-model orchestration scalable to billions of tokens/month
Audit & Compliance	Fragmented reporting, siloed data	Structured, auditable workflows; RDS/AEP enable interoperability
AI Model Performance	Generic transcription, reasoning models	Supports RADPAIR’s fine-tuned STT model, hosted and orchestrated in real-time via Fireworks infrastructure. Trained on radiology audio, ensures accurate capture of specialized terminology; downstream reasoning and multi-step orchestration are handled by RADPAIR.

RADPAIR’s Mission: Smarter, Safer Radiology Workflows

Radiology departments are critical to modern healthcare, yet reporting workflows remain slow, fragmented, and cognitively taxing. Radiologists switch between multiple screens, dictate findings line by line, and manually retrieve prior measurements or reports. Studies indicate that radiologists spend approximately 36% - 54% of their time on image interpretation, with the rest on administrative tasks (JACR, 2023). This imbalance not only reduces productivity, but also contributes to burnout and can affect patient outcomes.

Founded by practicing radiologist Avez Rizvi, RADPAIR aims to redesign radiology workflows. Today, RADPAIR represents 13-14% of the U.S. radiology reporting market, including its largest client, Radiology Partners, supporting 4,000 physicians across 40-50 million cases annually. Their mission is to enable next-generation, agentic AI workflows that orchestrate multiple AI agents from dictation to reasoning to system control so radiologists can prioritize patient care and high-value interpretations.

The Challenge: Fragmented Workflows and Antiquated Standards

Traditional radiology workflows are fragmented and cumbersome:

Imaging → Switch screens → Report software → Dictate findings line by line → Manually retrieve prior reports → Compile measurements → Submit report

Key pain points:

•Cognitive overload: Multiple context switches slow work and increase fatigue
•Siloed data streams: Difficult to incorporate prior studies, measurements, or external information, increasing risk of errors
•Outdated standards: Legacy HL7 limits integration with modern AI
•AI adoption challenges: Existing tools cannot reliably orchestrate tasks across systems
•Administrative burden: Limits focus on patient care

STT-Specific Bottlenecks and Multi-Agent Requirements

Scaling AI-assisted transcription is particularly challenging, with constraints that impact physician adoption and workflow trust:

•Speed & Perceived Responsiveness: Early cloud-based STT systems were slower and less accurate than on-premise tools like legacy systems, frustrating radiologists.
•Domain-Specific Terminology: Radiologists often use shorthand, incomplete phrases, or specialized terms (“AAA” for abdominal aortic aneurysm, “tib-fib” for tibia/fibula). Off-the-shelf transcription models often struggle to capture these nuances accurately, which undermines both trust and clinical precision. At the same time, overly specialized speech engines tuned only for medical or radiology lexicons limit natural, conversational input—preventing users from fully unlocking agentic AI workflows.
•Multi-Agent Orchestration: Real workflows require simultaneous execution of multiple models - transcription, reasoning, report auto-population, viewer control, and voice feedback - all in a streaming, real-time context to be ready for upcoming agentic AI workflows.
•Clinical Impact & Scalability: Inaccuracies in critical terminology can lead to misreporting, slowing diagnosis, and increasing risk. RADPAIR needed a solution that supported 1,000+ concurrent microphones, sub-second latency, high accuracy for radiology vocabulary, and auditable, HIPAA-compliant workflows.
•Latency: Fine-tuned STT model must operate in a streaming audio context with <200ms display of intermediate transcripts
•Cost: Hosting on legacy infrastructure is expensive; a flexible, multi-model hosting platform is needed without compromising quality
•Accuracy: Use RADPAIR’s fine-tuned models for the specialized radiology lexicon, validated by practicing radiologists

“Even if word error rate is low, missing a commonly used medical term can have huge consequences. Cloud latency disrupted dictation and workflow, making real-time AI unusable.”

Dr. Vikram Krishnasetty, Associate CMO at Radiology Partners

“Building agentic AI solutions in radiology is uniquely challenging—especially when orchestrating multiple workflows that span both inside and outside the reporting environment. From controlling external systems to retrieving and synthesizing data across diverse platforms, true orchestration requires seamless collaboration between specialized models.”

Avez Rizvi, CEO and Co-Founder, RADPAIR

These workflow and technical bottlenecks slowed throughput, increased operational costs, and threatened adoption of AI-assisted reporting, impacting the onboarding of new customers. The broader challenge: enabling natural, voice-driven orchestration across multiple AI agents in radiology workflows. And that starts with solving real-time reporting issues that erode physician trust and slowed documentation, creating inefficiencies that were clearly unsustainable for clinical teams.

RADPAIR’s SDK: Enabling Intelligent, Safe AI Workflows

RADPAIR is developing a modern SDK standard with the intent to open-source it that allows AI to act intelligently and safely across workflows:

•Report Document Schema (RDS): Converts templates into structured, AI-readable reports
•Actions and Event Protocol (AEP): Defines AI tasks like editing reports, controlling viewports, retrieving prior measurements, and updating summaries
•Parallel orchestration: Multiple AI-driven tasks happen simultaneously with full audit trails
•Open-source release: RDS and AEP available for industry partners to adopt, encouraging interoperability

Here is the AI-enabled workflow (powered by Fireworks):

Step	Actor	Action
1	Radiologist	Dictates findings naturally
2	Fireworks AI	Hosts RADPAIR’s STT model, provides real-time transcription, streaming, and low-latency multi-model orchestration
3	RADPAIR	Populates structured report. Fetches prior reports/measurements. Updates viewports/images. Performs safety checks. Finalizes report.

The table below highlights the contributions of Fireworks and RADPAIR within the AI-enabled workflow.

Fireworks AI (Infrastructure & Transcription)	RADPAIR (Orchestration & Reporting)
Host and serve Radpair’s fine-tuned transcription and reasoning models with high reliability	Defines RDS/AEP orchestration, structured reporting, and audit rules
Provide scalable streaming infrastructure supporting 1,000+ concurrent users	Populate structured report fields from transcript
Enable sub-second display of intermediate transcripts (<200ms) by hosting RADPAIR’s STT model	Fetch prior reports and measurements automatically
Optimize hosting and inference for cost-efficient scaling	Update viewports and images dynamically
Provides infrastructure foundation to enable RADPAIR’s downstream AI workflows	Perform safety checks, enforce reporting guidelines, and manage multi-agent orchestration
Support hundreds of reports/day per physician; scalable to billions of tokens/month	Enables smart suggestions, auto-completion, and multi-modal reasoning
Ensure reliability, uptime, and performance of hosted models	Ensure auditability, regulatory compliance, and safe report finalization
Maintain secure, auditable environment	Integrate structured reports with EMR/PACS
Enable low-latency integration with AI-driven workflows	Orchestrate reasoning and downstream model actions in real time

Fireworks AI: Scalable, Low-Latency AI Infrastructure

Fireworks provides the enterprise-grade infrastructure to host and orchestrate multiple AI models concurrently, powering RADPAIR’s next-generation reporting workflows (and thereby addresses RADPAIR’s constraints around scalability, latency, costs, and accuracy with):

•Real-time voice transcription (STT): Captures radiologists’ observations via speech instantly using RADPAIR’s proprietary fine-tuned models. Fireworks supports hosting and inference of models, while RADPAIR manages any model fine-tuning on their datasets.
•Low-latency inference engine: Sub-second display of intermediate transcription in near real-time (<200ms), meeting or exceeding OpenAI/Gemini baselines. Reports return in 2-5 seconds, maintaining clinical workflow efficiency and SLA compliance.
•Scalable streaming infrastructure: Supports 1,000+ concurrent users and hundreds of reports per physician per day, without requiring RADPAIR to build in-house STT infrastructure.
•Dynamic load management: Ensures uninterrupted transcription by balancing tasks across models or fallback paths when needed.
•Analytics and feedback loop: Collects anonymized usage data to monitor transcription performance and improve models over time, without compromising privacy.
•Cost efficiency: Optimized hosting and inference reduce operational costs compared with building and maintaining STT infrastructure internally.

Downstream tasks including populated structured reports, retrieving prior studies, updating viewports and safety checks are orchestrated by RADPAIR, leveraging the transcription Fireworks provides.

Fireworks enables RADPAIR to deploy AI-assisted, multi-agent radiology workflows at scale, letting radiologists focus on interpreting images while patients benefit from faster, more accurate diagnoses.

Before & After: The Transformation to AI-Enabled Radiology Workflows

Before: Fragmented Workflows and Distrust in AI

Imaging → Switch screens → Report software → Dictate findings → Fetch prior reports → Compile measurements → Submit report

Before RADPAIR’s speech engine was optimized, radiologists experienced significant latency between speaking and seeing finalized text appear in the transcript. Cloud-based speech engines often introduced noticeable delays compared to locally installed systems, interrupting dictation flow and breaking concentration. In addition, editing and correcting transcript errors was cumbersome, and cloud-processing artifacts occasionally produced hallucinated or misplaced words—further reducing reliability and trust in real-time reporting.

After: Reliable, Real-time AI Reporting at Scale

Radiologist speaks → [RADPAIR’s hosted STT model on Fireworks] Real-time transcription → [RADPAIR] Populates structured report → [RADPAIR] Retrieves prior reports & measurements → [RADPAIR] Updates viewports → [RADPAIR] Performs safety checks → Finalize report

Radiologists dictate naturally; AI transcribes shorthand and specialized terms in real time, populating structured reports instantly. Physicians focus on interpretation, not troubleshooting technology.

Fireworks provides the infrastructure to host and scale RADPAIR’s fine-tuned STT and other AI models, enabling sub-second, near-real-time transcription and model orchestration (<200ms intermediate display), which supports RADPAIR’s delivery of accurate transcription while preventing missed words, hallucinations, and macro misfires. RADPAIR’s orchestration layer handles multi-tasking, error-checking, and audit logging so radiologists can focus on high-value interpretation rather than troubleshooting technology. The result: faster throughput, higher accuracy, and restored physician trust in AI-assisted reporting.

Impact on physicians and healthcare operations:

•Eliminates context switching and transcription errors
•Reduces cognitive fatigue as repetitive tasks are automated
•Ensures reliable capture of dictation, even for complex macro phrases
•Speeds reporting while maintaining clinical accuracy
•Improves diagnostic confidence with automatic incorporation of prior reports and measurements
•Provides auditable, safe workflows for regulatory compliance
•Supports scalable adoption across institutions without custom infrastructure

Human and Clinical Impact:

•Physicians can focus on interpreting images and patient care
•Patients receive faster, more accurate diagnoses
•Organizations gain efficiency, lower operational costs, and higher throughput

“We’re at a point in radiology facing a 35% workforce deficiency nationally. Patients wait longer for reports, and there aren’t enough radiologists. We want to do everything on the tech front to increase capacity, reduce burnout, and enable radiologists to perform at the top of their license. They shouldn’t be secretaries and transcriptionists. Working with RADPAIR— and through their partnership with Fireworks — has been key to pushing this forward. Now, radiologists can dictate naturally and have AI keep up in real time, which is a major leap forward,”

Dr. Vikram Krishnasetty, MD, Associate CMO, Radiology Partners

From Radiology to Healthcare at Scale: Smarter Workflows Powered by AI

RADPAIR’s open-source SDK standard and Fireworks AI infrastructure jointly address a broader industry challenge: fragmented, siloed healthcare workflows stuck on outdated standards. Their unified approach to AI integration delivers benefits at two levels:

Ecosystem-Level Benefits

These capabilities create a thriving ecosystem of interoperable partners, enabling innovation and collaboration across the healthcare industry:

•Interoperability: Any compliant AI model can plug into the workflow
•Faster innovation: Partners can train models specific to their workflows
•Shared ecosystem: Private practices, imaging vendors, and B2B partners adopt a single standard
•Scalable improvements in patient care: Faster, more accurate reporting across institutions

Institution-Level Benefits

At the hospital or radiology department level, the partnership drives tangible operational and clinical improvements:

•Full control over domain-specific models, including STT and reasoning models fine-tuned for clinical accuracy
•Guaranteed real-time performance: Transcription and report generation operate in <200ms latency, ensuring timely and reliable outputs
•Low-latency, high-accuracy inference: Meets or surpasses OpenAI/Gemini baselines, critical where errors can have serious consequences
•Reduced cognitive load and improved throughput: Radiologists can focus on interpreting images, boosting diagnostic confidence while workflows remain fully auditable

Without this partnership, radiology AI remains fragmented and error-prone. RADPAIR and Fireworks provides a reliable, scalable foundation for safe deployment of fine-tuned AI in healthcare.

Future enhancements extend beyond transcription to LLM-driven reasoning, speech-to-speech, or text-to-speech, and automated control of external systems, building on the multi-model orchestration foundation Fireworks provides. The extended impact of this includes the ability to suggest differential diagnoses, flag critical findings in real time, and continue reducing cognitive burden.

“Fireworks AI has been an exceptional partner in this mission—not only hosting RADPAIR’s advanced speech engine, but also powering our proprietary models with unmatched performance, scalability, and reliability. Their platform allows us to bring agentic intelligence in radiology to life at unprecedented speed and scale.”

Avez Rizvi, CEO and Co-Founder, RADPAIR

About RADPAIR

RADPAIR provides AI-assisted reporting tools for radiology departments. The company represents approximately 13 to 14 percent of the U.S. radiology reporting market. Its key client, Radiology Partners, processes 40 to 50 million cases annually with 4,000+ physicians.

About Fireworks AI

Fireworks AI delivers enterprise-grade infrastructure for AI hosting, orchestration, and domain-specific model deployment. Its platform enables safe, scalable, and compliant AI solutions for healthcare and other specialized industries.