Deepgram
Deepgram runs a custom-built ASR architecture optimized for real-time streaming, with latency targets under 300 ms. Operations choose Deepgram when streaming transcription drives the workload: voice agents, call center transcription, live captioning, and conversational AI. As of 2026, the Nova-3 model reaches accuracy parity with the strongest competitors while keeping its latency advantage.
Pricing starts at $0.0043/min for batch transcription on Nova-3 and runs to $0.0145/min for premium tiers. The model is a straightforward per-minute rate, with no opaque add-ons or per-feature charges. At scale (1M+ minutes/month), Deepgram is consistently the cheapest production-grade option for streaming workloads, and self-hosted deployment is available for compliance-sensitive operations.
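
To make the per-minute figures concrete, here is a minimal sketch of the arithmetic at a few monthly volumes. It assumes a flat rate with no volume discounts, minimums, or committed-use pricing, which may not match an actual contract. The names ENTRY_RATE, PREMIUM_RATE, and monthly_cost are illustrative only and are not part of any Deepgram SDK.

```python
from decimal import Decimal

# Per-minute rates quoted above, in USD. Decimal avoids float rounding on money.
# Assumption: flat pricing, no volume discounts or minimum commitments.
ENTRY_RATE = Decimal("0.0043")
PREMIUM_RATE = Decimal("0.0145")


def monthly_cost(minutes: int, rate_per_min: Decimal) -> Decimal:
    """Return minutes times rate, rounded to whole cents."""
    return (Decimal(minutes) * rate_per_min).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Print entry-tier and premium-tier cost side by side for each volume.
    for minutes in (100_000, 1_000_000, 5_000_000):
        entry = monthly_cost(minutes, ENTRY_RATE)
        premium = monthly_cost(minutes, PREMIUM_RATE)
        print(f"{minutes:>9,} min/month: ${entry:,} entry, ${premium:,} premium")
```

Under these assumptions, 1M minutes per month comes to $4,300 at the entry rate and $14,500 at the premium rate, so the choice of tier matters more to the bill than small differences in volume.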