Onshore, Offshore or Automate? When Decisions Cannot Fail

 

4 min read

All Posts

AI in the Exam Room? A Critical Look at DeepMind’s AMIE for Healthcare

 

2 min read

The paper titled "Towards Conversational AI for Disease Management" (full paper here: arXiv:2503.06074) introduces a novel large language model (LLM)-based system named AMIE, designed to assist in clinical disease management. AMIE extends the capabilities of the Articulate Medical Intelligence Explorer by integrating reasoning over disease progression, therapeutic responses, and medication prescriptions across multiple patient visits. It utilises Gemini's long-context capabilities, combining in-context retrieval with structured reasoning to align its outputs with authoritative clinical guidelines and drug formularies.​

In a randomised, blinded virtual Objective Structured Clinical Examination (OSCE), AMIE was compared to 21 primary care physicians (PCPs) across 100 multi-visit case scenarios based on UK NICE and BMJ Best Practice guidelines. Specialist physicians assessed AMIE's management reasoning as non-inferior to that of PCPs. Additionally, AMIE demonstrated superior performance in the precision of treatments and investigations, as well as in aligning and grounding management plans with clinical guidelines.​

To benchmark medication reasoning, the authors developed RxQA, a multiple-choice question benchmark derived from US and UK drug formularies and validated by board-certified pharmacists. AMIE outperformed PCPs on higher difficulty questions, particularly when both had access to external drug information.​

The paper concludes by mentioning that, while further research is needed before real-world implementation, AMIE's strong performance across evaluations marks a significant step towards conversational AI as a tool in disease management. The findings, however, do not suggest that AMIE is ready for clinical care.

GenAI (LLM)-based systems are not suitable for handling healthcare processes mainly for the following reasons:

  1. Lack of clinical accountability
    GenAI models do not hold legal or ethical responsibility for clinical decisions, making it risky to rely on them without physician oversight in life-impacting situations.

  2. Unpredictable errors and hallucinations
    Even advanced GenAI systems can produce plausible-sounding but incorrect or fabricated medical information, which could mislead clinicians or harm patients.

  3. Insufficient real-world validation
    Most GenAI models are tested in simulated or controlled environments. They lack robust, peer-reviewed evidence from diverse, real-world clinical settings needed to ensure safety and generalizability.

  4. Missing a strong auditing schema
    Many GenAI models function as "black boxes," offering limited transparency into their decision-making processes, which challenges trust and hinders error tracing in critical care contexts.

In conclusion, GenAI tools can engage with patients for low-value requests. But critical high-value processes must be carried out by a conversational process automation, which is auditable, fully configurable by experts and designed to deliver patients' requests end-to-end.

Recent Posts

Onshore, Offshore or Automate? When Decisions Cannot Fail

4 min read For decades, regulated businesses have relied on a familiar operating model to manage cost and scale: keep critical work onshore, move volu...

Read more

AI in 2026: Leadership, Risk and Real-World Returns

4 min read Artificial intelligence is no longer a technology experiment. In 2026, AI is becoming a strategic business imperative for organisations see...

Read more

IPMI Insurers and the Automation Imperative for 2026

2 min read International Private Medical Insurance (IPMI) is entering a transformative phase. Insurers manage complex risks, high claims volumes, and ...

Read more

Why Some Business Processes Cannot Tolerate AI Error

4 min read AI automation is delivering real operational gains across regulated industries. From faster case handling to lower costs, the benefits are ...

Read more
2026 Spixii white-paper - Automation with Certainty for Regulated Businesses

Download your FREE Spixii White Paper copy

Unlock the full potential of AI and automation in regulated insurance by downloading our white paper, which provides actionable insights for 2026:
  • Discover how to embed automation while ensuring 100% compliance and deterministic decision-making
  • Learn strategies to increase operational margin, speed of transaction, and customer trust
  • Understand how risk-aware innovation can drive sustainable growth and competitive advantage