Is a healthcare chatbot considered a medical device?

It depends on what it does. A chatbot that schedules appointments is not a medical device. A chatbot that provides clinical triage recommendations or diagnostic guidance may fall under FDA oversight. The distinction is whether the software is intended to diagnose, treat, or prevent disease. Get regulatory counsel involved early if your use case touches clinical decision-making.

Can we use a general-purpose LLM like GPT-4 for patient-facing conversations?

You can, but not without guardrails. A general-purpose model needs retrieval-augmented generation to ground its answers, output filtering to prevent hallucinated medical advice, a BAA-covered hosting arrangement, and human review for any clinical content. The model itself is a component, not a solution.

How long does implementation typically take?

A single-workflow pilot (scheduling or intake) with one channel and one system integration can reach production in 8 to 14 weeks, depending on integration complexity and compliance review cycles. Multi-workflow, multi-channel deployments take longer and benefit from phased rollouts.

What's the most common mistake organizations make?

Deploying a chatbot without connecting it to the systems where work actually happens. If the AI confirms an appointment but doesn't update the PMS, or drafts a message but doesn't route it to a review queue, you've added a new source of errors instead of reducing workload.

Conversational AI in Healthcare: Use Cases & How to Implement

Most healthcare organizations exploring conversational AI in healthcare are not looking for a general-purpose chatbot that answers medical questions. They're looking for something narrower and more useful: a controlled interface that handles scheduling, intake, billing questions, or care navigation without adding to the message backlog their staff already can't keep up with.

That distinction matters. The organizations getting real results treat conversational AI as a workflow tool with strict boundaries, not as a digital clinician. This article covers what that looks like in practice, which use cases pay off first, where the risks sit, and how to implement without creating more problems than you solve.

What conversational AI in healthcare actually means

The term covers a wide range of products, and lumping them together causes confusion. It helps to separate the categories:

FAQ chatbots that answer common questions from a fixed script (hours, locations, accepted insurance).
Workflow chatbots that handle structured tasks like scheduling, intake forms, or appointment reminders through a guided conversation.
Voice assistants and IVR replacements that route callers, confirm appointments, or collect pre-visit information by phone.
AI-generated draft replies that suggest responses to patient portal messages for a clinician to review and send.
EHR-integrated assistants that pull patient context (upcoming appointments, recent labs, care plan details) into the conversation.
Clinical decision support tools that surface protocols or triage guidance for staff, not patients.

The chat or voice interface is only one layer. Behind it, you need identity verification, consent management, routing logic, system integrations, escalation rules, monitoring, and content governance. Without those, you have a demo, not a product.

The WHO's January 2024 guidance on large multimodal models in health identifies five broad application areas: diagnosis and clinical care, patient-guided use, clerical and administrative tasks, medical education, and research. Most conversational AI in healthcare projects today sit squarely in the clerical and administrative category, and that's where they should start.

Use cases worth considering first

Not every conversational AI use case carries the same risk or the same return. The ones below are ordered roughly by implementation complexity and clinical risk, starting with the most straightforward.

Scheduling, confirmations, and reminders

The AI handles appointment booking, rescheduling, and confirmations through SMS, web chat, or voice. It checks provider availability, matches the patient to the right visit type, and sends reminders.

A human still owns exception handling: complex multi-provider visits, urgent add-ons, and schedule conflicts that require clinical judgment.

Integration needed: PMS or EHR scheduling module, patient identity verification.

Ochsner Health deployed conversational AI for SMS-based appointment scheduling and confirmation. Over seven months, they saved more than 500 call center hours and improved appointment confirmation rates from 43% to 52%.

Intake and pre-visit preparation

The AI collects demographics, insurance details, medication lists, and reason-for-visit information before the appointment. It can walk patients through consent forms and pre-visit instructions.

A human reviews flagged responses (new allergies, medication changes, high-risk symptoms) and handles anything the patient can't complete digitally.

Integration needed: EHR intake workflows, document management, insurance eligibility verification.

Benefits, billing, and administrative Q&A

The AI answers questions about copays, deductibles, prior authorization status, and payment plans using approved content and patient-specific billing data.

A human handles disputes, complex claims, and financial hardship conversations.

Integration needed: billing/RCM system, insurance eligibility APIs, patient engagement software or portal.

The AI helps patients find the right provider, department, or service line based on their symptoms or needs. It can use an approved triage protocol to suggest urgency levels (emergency, urgent care, primary care, self-care).

A human owns any clinical judgment. The AI should never diagnose, and its triage recommendations need clear disclaimers and escalation paths.

Integration needed: provider directory, triage protocol engine, escalation queue.

Patient portal message drafting

The AI generates draft responses to patient messages for clinicians to review, edit, and send. This is one of the more studied use cases. A Stanford Health Care study involving 162 clinicians found that AI-generated draft replies were used about 20% of the time. The drafts did not reduce reply time or write time, but clinicians reported significant reductions in task load and work exhaustion.

The takeaway: draft replies can reduce cognitive burden, but they are not an automatic time-saving tool. Set expectations accordingly.

A human reviews and sends every message. The AI drafts; the clinician decides.

Integration needed: EHR messaging module, clinician review queue.

Post-visit follow-up and care-plan reminders

The AI sends follow-up messages after visits: medication reminders, wound care instructions, physical therapy exercises, or prompts to schedule follow-up appointments.

A human handles patient-reported complications, adverse reactions, or questions that fall outside the approved follow-up content.

Integration needed: EHR care plan data, outbound messaging platform.

Staff-facing knowledge assistant

The AI helps clinical and administrative staff find internal policies, formulary information, prior authorization requirements, HR procedures, or clinical protocols.

A human owns content updates and handles ambiguous or conflicting policy questions.

Integration needed: internal knowledge base, document repository, identity and role-based access control.

Where healthcare conversational AI gets risky

The WHO guidance flags several risks that apply directly to conversational AI deployments: false or inaccurate statements, automation bias (clinicians over-trusting AI output), cybersecurity threats, and patient-information risks. Here's how those risks show up in practice.

Medical advice beyond approved content

If the AI can generate free-form medical guidance, it will eventually say something wrong. Grounding responses in approved, reviewed content is not optional.

Hallucinated answers

Large language models generate plausible-sounding text that may be factually wrong. In a billing context, that's a customer service problem. In a clinical context, it's a patient safety problem.

Poor escalation logic

If the AI doesn't know when to hand off to a human, patients get stuck in loops or receive inappropriate responses. Escalation rules need to be explicit, tested, and monitored.

AI models can reflect biases in training data, leading to different recommendations for different patient populations. Covered healthcare entities have nondiscrimination obligations that extend to AI-supported decision tools.

PHI exposure and vendor model training

If patient data flows through a third-party model, you need to know whether that data is used for model training, how it's stored, and who can access it. Your BAA with the vendor needs to cover this explicitly.

Workflow failure

An AI chatbot that generates messages without connecting to your scheduling, billing, or EHR systems creates more work, not less. Staff end up manually reconciling what the AI promised with what the systems actually show.

Architecture and compliance basics

For conversational AI in healthcare to work in production, the system needs several layers beyond the language model itself.

Channels: web chat, patient portal, SMS, voice/IVR, mobile app. Most organizations start with one or two and expand.
Identity and consent: verify who the patient is before sharing any PHI. Collect and record consent for AI-assisted communication.
RAG and approved knowledge base: use retrieval-augmented generation to ground the AI's responses in reviewed content rather than letting it generate answers from its general training data.
EHR/PMS/CRM integration: the AI needs to read from and write to your systems of record. Without this, it's a standalone tool that creates data silos. If your workflows span multiple systems, healthcare workflow automation principles apply.
Audit logs, review queues, and analytics: every AI-generated response should be logged. Clinical-facing outputs need human review queues. You need analytics on containment rate, escalation patterns, and error rates.
HIPAA and security controls: BAA with every vendor that touches PHI. Role-based access control. Encryption in transit and at rest. Audit logging. Minimum necessary PHI in each interaction. Data retention rules that match your policies.

How to implement without creating another inbox

The biggest failure mode is deploying a chatbot that generates patient interactions nobody is staffed to handle. Here's a practical sequence:

Pick one workflow with high volume and low clinical risk (scheduling or intake are common starting points).
Define approved intents: what the AI can do, what it should refuse, and what triggers escalation.
Map escalation paths: who gets the handoff, in what system, with what context.
Prepare content and data: build or curate the knowledge base, clean up the integration data, and test identity verification.
Integrate with your systems of record so the AI's actions are reflected in the EHR, PMS, or CRM.
Pilot with a limited patient group or a single clinic location.
Measure against specific metrics.
Expand to additional workflows or channels based on results.

Metrics that matter:

Containment rate (percentage of conversations resolved without human intervention)
Escalation accuracy (did the AI escalate when it should have, and only when it should have?)
Average handle time for escalated conversations
Time to appointment confirmation
No-show rate changes
Unresolved message backlog
Staff edit rate on AI-generated drafts
Safety escalation volume and outcomes
Patient satisfaction scores

Physician adoption is moving fast. The AMA reported that 66% of physicians used AI in practice in 2024, up from 38% in 2023. That doesn't mean every use case works, but it does mean clinical staff are increasingly willing to engage with well-designed AI tools.

Build, buy, or customize

Buy when the use case is standard and your EHR vendor or a mature SaaS product covers it well. Appointment reminders and basic FAQ bots often fall here.

Customize when your workflows span multiple systems, require domain-specific rules, or need to match your organization's clinical protocols and brand voice. This is where AI integration services and platform configuration work come in.

Build when conversational AI is central to your product, when you need custom orchestration across channels and systems, or when your compliance and analytics requirements go beyond what off-the-shelf tools support. AI chatbot development at this level involves workflow discovery, integration architecture, risk boundary design, and ongoing model governance.

At Attract Group, healthcare software projects that involve conversational AI typically start with workflow mapping and integration design before any model selection happens. The AI layer is the last thing you add, not the first.

The safest first project is a single, well-defined workflow with clear boundaries on what the AI can and cannot do, clean integrations with your systems of record, a tested escalation path to a human, and specific metrics you'll use to decide whether to expand. Start there, measure honestly, and build from results.

#Healthcare/Telemedicine#AI & Automation#Chatbot#HIPAA#Interoperability

Vladimir Terekhov

Co-founder and CEO at Attract Group

Conversational AI in Healthcare: Use Cases & How to Implement

What conversational AI in healthcare actually means