LLMRTC
LLMRTC is a TypeScript SDK for building real-time voice and vision AI applications. It integrates WebRTC for low-latency …
LLMRTC is a TypeScript SDK for building real-time voice and vision AI applications. It integrates WebRTC for low-latency audio/video streaming with LLMs, speech-to-text, and text-to-speech technologies through a unified, provider-agnostic API. Developers can focus on application logic while LLMRTC handles complex conversational AI infrastructure.
Models
Models by Hathora offers a curated catalog of low-latency ASR, TTS, and LLM models optimized for voice AI …
Models by Hathora offers a curated catalog of low-latency ASR, TTS, and LLM models optimized for voice AI and real-time applications. Developers can explore, test, and deploy production-ready models quickly, featuring interactive sandboxes and direct API access for seamless integration into voice agents and other applications.
Gabber
Gabber is a powerful platform for building real-time, multimodal AI applications that can see, hear, and speak. It …
Gabber is a powerful platform for building real-time, multimodal AI applications that can see, hear, and speak. It offers low-latency inference for Vision Language Models (VLM), Text-to-Speech (TTS), and Speech-to-Text (STT), coupled with a graph-based orchestration system for rapid development and deployment.
Release.ai
Release.ai is an enterprise-grade platform for developers to easily deploy, manage, and scale high-performance AI models. It offers …
Release.ai is an enterprise-grade platform for developers to easily deploy, manage, and scale high-performance AI models. It offers sub-100ms inference latency, seamless auto-scaling, robust security, and a vast library of pre-optimized models, enabling rapid integration into any development workflow with just a few lines of code.
Daily
Daily is a developer platform for real-time video, voice, and AI. It provides robust APIs and SDKs for …
Daily is a developer platform for real-time video, voice, and AI. It provides robust APIs and SDKs for building ultra-low latency, scalable, and high-quality conversational experiences, including human-to-human video calls and advanced voice AI agents through its open-source framework, Pipecat.
Prodia
Prodia is a high-speed, scalable generative AI API for developers. It enables seamless integration of image and video …
Prodia is a high-speed, scalable generative AI API for developers. It enables seamless integration of image and video generation into applications, offering ultra-low latency and eliminating the need for GPU infrastructure management. Built for production, it powers the next generation of creative tools.
Telnyx
Telnyx is a full-stack communications platform that enables developers and enterprises to build and deploy high-performance, real-time conversational …
Telnyx is a full-stack communications platform that enables developers and enterprises to build and deploy high-performance, real-time conversational AI. It integrates global telephony, dedicated AI infrastructure, and powerful APIs on a single platform, providing ultra-low latency and complete control for creating natural-sounding voice assistants and automating communication workflows.
Squawk Market
Squawk Market is an AI-powered, real-time audio feed for traders. It delivers critical market news, data, and alerts …
Squawk Market is an AI-powered, real-time audio feed for traders. It delivers critical market news, data, and alerts with ultra-low latency (<1s). The platform helps traders capitalize on volatility and intraday moves by providing instant updates on momentum stocks, breaking news, and economic events.
Moshi AI
Moshi AI is an advanced, low-latency conversational voice AI model developed by Kyutai. It enables natural, expressive, and …
Moshi AI is an advanced, low-latency conversational voice AI model developed by Kyutai. It enables natural, expressive, and interruptible dialogues, designed to run locally on various hardware for offline use. This makes it ideal for privacy-focused applications like smart home devices and in-car systems.
Groq
Groq is a revolutionary AI inference platform providing developers with unparalleled speed and cost-efficiency. Powered by its custom-built …
Groq is a revolutionary AI inference platform providing developers with unparalleled speed and cost-efficiency. Powered by its custom-built Language Processing Unit (LPU), Groq delivers real-time performance for large language models (LLMs), speech recognition, and text-to-speech applications. It offers a developer-friendly API, enabling seamless integration for building next-generation, low-latency AI solutions at scale.
Sindarin
Sindarin is an accelerated cloud platform for developers building low-latency, conversational voice AI. It provides an API and …
Sindarin is an accelerated cloud platform for developers building low-latency, conversational voice AI. It provides an API and a no-code platform to create highly responsive and natural-sounding AI personas. With industry-leading turn-taking and seamless interruption handling, Sindarin enables the creation of truly interactive voice experiences for applications in customer service, wellness, gaming, and more, offering enterprise-grade scale and reliability.
Cartesia
Cartesia is a high-performance voice AI platform for developers, offering the fastest, ultra-realistic Text-to-Speech (TTS), real-time Voice Cloning, …
Cartesia is a high-performance voice AI platform for developers, offering the fastest, ultra-realistic Text-to-Speech (TTS), real-time Voice Cloning, and low-latency Speech-to-Text (STT). Powered by proprietary State Space Model technology, it's designed for building interactive and immersive voice applications with seamless integration and enterprise-grade security.
Outspeed
An API and SDK for developers to build and deploy AI voice companions with real-time emotion and memory. …
An API and SDK for developers to build and deploy AI voice companions with real-time emotion and memory. Easily integrate natural, low-latency voice interactions into web and mobile applications.
Tencent RTC
A comprehensive developer platform providing powerful APIs and SDKs for real-time voice, video, chat, and live streaming. Tencent …
A comprehensive developer platform providing powerful APIs and SDKs for real-time voice, video, chat, and live streaming. Tencent RTC enables businesses to build scalable, low-latency, and interactive communication experiences directly into their applications across various industries.
Inception Labs
Inception Labs introduces a new generation of Diffusion Large Language Models (dLLMs) that are up to 10x faster …
Inception Labs introduces a new generation of Diffusion Large Language Models (dLLMs) that are up to 10x faster and cheaper than traditional models. Leveraging a parallel, diffusion-based approach, it offers unprecedented speed, quality, and control for text and code generation, ideal for enterprise-grade applications.
Millis AI
Millis AI is a platform for building next-generation voice agents with ultra-low 600ms latency. It enables both developers …
Millis AI is a platform for building next-generation voice agents with ultra-low 600ms latency. It enables both developers and non-technical users to create and deploy human-like, affordable voice agents for inbound and outbound calls in minutes, with easy integration capabilities.