• Assistants and Squads — Two Agent Primitives — Assistants are single-system-prompt agents with tools and structured output for standard call flows — customer support, lead qualification, booking, FAQ; Squads orchestrate multiple specialized assistants in a single call with context-preserving transfers — enabling medical triage → scheduling → billing, or e-commerce order → returns → VIP flows, all within one continuous call session where each specialist receives full structured conversation context from the previous agent.
• Workflows 2.0 — Visual Conversation Flow Builder — A major June 2025 upgrade replacing single-prompt design with a node-based visual flow builder; map conversation branches, conditional steps, variable extraction, global nodes, call transfer logic, and dynamic routing visually — providing the control of single-prompt design with the scalability of a full workflow system without sacrificing developer-level precision.
• Test Suite and Pre-Launch Call Simulation — Define success criteria per use case, simulate hundreds of conversation scenarios in a controlled environment before any live calls, and automatically identify hallucination risks, logic failures, and edge case breakdowns — with independent YouTube reviewers confirming systematic Test Suite use achieves 95%+ production reliability on live deployments.
• Bring Your Own Keys (BYOK) — Provider-Agnostic Architecture — Plug in your own API keys for any STT provider (Deepgram, Gladia, AssemblyAI), any LLM (OpenAI GPT-4.1, Anthropic Claude, Google Gemini, self-hosted models), and any TTS provider (ElevenLabs, Cartesia, LMNT, Deepgram Aura) — enabling teams to use existing provider relationships, negotiate volume pricing independently, and maintain full control over the AI stack Vapi orchestrates.
• Built-In Hallucination Guardrails — Conversation guardrails embedded in the Vapi orchestration layer prevent model hallucinations and ensure data integrity across all assistant types — operating at the infrastructure level rather than relying solely on LLM-level instruction compliance, providing a safety net that survives prompt engineering edge cases.
• 4,200+ API Configuration Points — Every parameter of the voice agent pipeline is exposed as an API endpoint — latency thresholds, interruption sensitivity, silence detection, turn-taking behavior, endpointing detection, backchannel audio, custom vocabulary, SSML injection, webhook triggers, and hundreds more — enabling teams to tune voice agent behavior with a precision no low-code platform provides.
• SOC 2, HIPAA, and PCI Compliance — SOC 2 on Enterprise, HIPAA for healthcare deployments, and a dedicated PCI Compliance mode that uses Squads to selectively disable recording, logging, and transcription during payment collection phases while maintaining call quality audit capability on non-sensitive call segments — confirmed in official Vapi documentation.
• Scalable Infrastructure — Sub-600ms Latency at Enterprise Volume — Custom real-time audio infrastructure scales from single-agent testing to millions of simultaneous calls in minutes; ultra-low latency confirmed at sub-400ms in independent reviewer tests; round-the-clock monitoring and multi-region infrastructure with dedicated forward-deployed engineer support on Enterprise plans for teams that need to go live in one week.