AI voice agents for restaurants that handle every reservation, every night

Your hostess is with a guest. The phone keeps ringing. An AI voice agent answers every call — books reservations, quotes waits, takes takeout orders, and handles questions about the menu in whatever language the caller speaks.

Vapi · LiveKit · Retell OpenTable · Resy · SevenRooms · Toast English · Spanish · Mandarin · French · Italian $0.09/call unit economics

One hostess, three phones, six people at the door.

Restaurants lose bookings to a simple operational problem: the hostess is physically busy seating a party, the phone is ringing, and the caller on the other end is deciding whether to call the next place on their list. Peak call times collide with peak seating times. After-hours calls don't get returned. Voicemail doesn't work for reservations — most callers don't leave one, and the ones who do get called back in the morning after the slot is gone.

The workarounds are worse. A traditional IVR tree that asks the caller to "press 1 for reservations" is a downgrade from a human. Staffing multi-language service is a staffing problem most single-location restaurants can't solve. Answering services miss the operational context — the chef's tasting on Tuesday, the private-event policy, the allergen matrix — and get every other detail wrong.

An AI phone agent answers every call, in every language, at $0.09 per handled call. It books directly into OpenTable, Resy, SevenRooms, or Toast Tables. It takes takeout orders into your POS. It answers the hours, parking, and dietary questions that would otherwise tie up the floor. And when it can't, it hands the call to a named human with full context.

Three AI deployments that fit how a restaurant actually runs.

Metrics below are from live restaurant deployments, not pilots.

Reservations

Reservation & booking agent

Inbound AI on Vapi/LiveKit that books reservations in OpenTable, Resy, SevenRooms, or Toast in under 90 seconds. Quotes real-time wait, offers alternatives when booked out, and holds the slot while checking availability.

$0.09per handled call (Milina NYC, LiveKit + Deepgram + GPT-4o-mini + Cartesia)
Takeout

Takeout & order-taking agent

Voice agent that takes takeout orders across phone and drive-thru, writes into your POS (Toast, Square, Revel), and confirms the order with the customer before submit. Handles modifiers, allergen flags, and upsells without being pushy.

50+calls handled a night without human intervention
FAQ & hours

FAQ, hours, and location agent

Answers the calls that don't need to tie up the hostess — hours, location, parking, dietary menu questions, gift card purchases, private-event inquiries routed to the right manager. Frees the floor to actually host.

~70%inbound call deflection at a typical single-location restaurant

Milina — AI voice reservation agent running on LiveKit.

Restaurant · Single-location · New York City · iconic Japanese concept

Milina — AI voice reservation agent running on LiveKit.

Milina, a reservation-driven NYC restaurant, needed a voice agent that could handle peak-hour reservation traffic without sounding robotic. We deployed LiveKit voice infra with Deepgram STT, GPT-4o-mini for dialog, and Cartesia for low-latency TTS. It handles 50+ calls a night, books directly into the reservation system, and the unit economics land at $0.09 per handled call. Customers can't tell it's AI.

$0.09per call
50+calls a night
LiveKit+ Deepgram + GPT-4o-mini + Cartesia
0humans needed after 9pm
Read the full Milina case →

The stack we deploy when the phone has to be answered.

Low per-call cost, low latency, and direct writeback into the reservation platform and POS the restaurant already runs.

Voice infra

LiveKit · Vapi · Retell

For a single location with heavy reservation volume we usually deploy LiveKit for the lowest per-call cost; Vapi for fastest time-to-MVP; Retell for hosted convenience.

Speech

Deepgram · Cartesia · ElevenLabs

Deepgram for STT tuned to noisy restaurant backgrounds; Cartesia for the fastest TTS when latency shows; ElevenLabs when voice quality matters most.

Models

GPT-4o-mini · Claude Haiku

Small models are the right answer for restaurant reservations. Faster, cheaper, and accurate enough for a bounded conversation.

Reservations

OpenTable · Resy · SevenRooms · Tock

Direct API integration where available; browser automation fallback where it isn't (yes, we'll build both).

POS

Toast · Square · Revel · Clover

For takeout and drive-thru deployments, direct POS writeback so the order lands in the kitchen already confirmed.

Observability

LangSmith · Helicone · CloudWatch

Every call fully traced. Call recordings retained per your policy; cost and latency tracked per-call.

The caller shouldn't ask if they're talking to AI.

Four questions every restaurateur asks before signing. Here are ours, answered honestly.

What languages?

English, Spanish, French, Italian, Mandarin, Cantonese, Japanese, Portuguese are standard. Mixed-language calls (customer starts Spanish, switches to English) are handled natively with no reset. We use language-specific voices — not one voice with an accent layer.

How robotic does it sound?

The bar is "the caller doesn't ask if they're talking to AI." We get there with Cartesia or ElevenLabs voices tuned to the restaurant's tone, sub-600ms response latency, and interruption handling. We play a sample before you sign.

What about dietary & allergen questions?

The agent knows your menu, your allergen matrix, and your "ask the chef" flags. When it doesn't know, it says so and offers a callback from a human — never guesses on a peanut allergy.

Can it upsell without being annoying?

Yes, within rules you set — the Cabernet on Tuesday, the new dessert for first-time guests, the chef's tasting on slow nights. Off-theme upsells are blocked at the prompt layer. You approve the playbook before it goes live.

Single location to restaurant group.

Fixed project pricing. Per-call unit economics modeled in the discovery workshop so you know the monthly run-rate before you sign.

DeploymentScopePrice
Single-location voice agent Inbound reservations + takeout, 1 POS, 1 reservation platform, shadow mode $6,000–$12,000
Multi-location rollout Shared templates across locations, per-location config, consolidated analytics $15,000–$28,000
Enterprise restaurant group Group HQ + franchisee config, approval flows, multi-language, custom voice $25,000–$45,000
Outbound reservation recovery agent Callback agent that converts no-shows and waitlist, review-request flow $7,000–$14,000
Monthly retainer Ops, menu updates, seasonal changes, new language, analytics $1,500–$6,000/mo

Per-call cost lands between $0.08 and $0.18 depending on language and call length. A single-location restaurant on 1,500–3,000 handled calls/month spends $150–$500 on AI voice runtime.

Restaurant questions we answer on every intro call.

How much does an AI voice agent for a restaurant cost?
Per-call cost is typically $0.08–$0.18. Fixed-build cost: single-location deployment $6,000–$12,000. A 1,500–3,000-calls-per-month restaurant spends $150–$500 on runtime.
Which reservation platforms do you integrate with?
OpenTable, Resy, SevenRooms, Tock, Yelp Reservations, Toast Tables, Google Reserve. Also direct integration with custom internal systems via REST or webhook.
Can the AI handle multiple languages?
Yes. English, Spanish, French, Italian, Mandarin, Cantonese, Japanese, Portuguese are standard. The agent handles mid-call language switches natively.
Will customers know they're talking to AI?
In blind tests with Milina, most callers didn't realize. We use Cartesia/ElevenLabs voices, sub-600ms latency, interruption handling, and natural prosody.
What happens if the AI can't answer a question?
It escalates to a human — routed to the manager's mobile during service hours, to voicemail with a structured transcript after hours.
How fast can we go live?
Single-location restaurant: 3–4 weeks including reservation integration, voice tuning, shadow mode, and staff handoff. Multi-location groups add 1–2 weeks per POS/reservation-platform variant.

Ready to answer every call, every night?

One 20-minute call for restaurateurs. We'll listen to a sample call, look at your reservation platform and POS, and tell you what's realistic — and what the unit economics will be. If it doesn't make sense for your volume, we'll say so.