Nexa SDK
Nexa SDK is a powerful toolkit enabling developers to deploy any AI model, including frontier and state-of-the-art models, …
Nexa SDK is a powerful toolkit enabling developers to deploy any AI model, including frontier and state-of-the-art models, to any device (mobile, PC, IoT, automotive) in minutes. It offers production-ready on-device inference with hardware acceleration across NPUs, GPUs, and CPUs, optimized for speed and energy efficiency.
Models
Models by Hathora offers a curated catalog of low-latency ASR, TTS, and LLM models optimized for voice AI …
Models by Hathora offers a curated catalog of low-latency ASR, TTS, and LLM models optimized for voice AI and real-time applications. Developers can explore, test, and deploy production-ready models quickly, featuring interactive sandboxes and direct API access for seamless integration into voice agents and other applications.
Speechmatics
Speechmatics is a leading AI-powered speech-to-text API, providing highly accurate and scalable transcription services for businesses. It supports …
Speechmatics is a leading AI-powered speech-to-text API, providing highly accurate and scalable transcription services for businesses. It supports over 50 languages in real-time and batch modes, offering flexible deployment options including cloud and on-premises solutions. Designed for developers, it enables the integration of advanced voice recognition into any application, from contact centers to media captioning.
voice_vector
voice_vector is a powerful AI voice platform offering high-fidelity voice cloning, expressive text-to-speech (TTS), and accurate speech recognition. …
voice_vector is a powerful AI voice platform offering high-fidelity voice cloning, expressive text-to-speech (TTS), and accurate speech recognition. With a unique pay-as-you-go and subscription hybrid model, it provides a flexible, cost-effective solution for content creators, developers, and businesses. Create unlimited private cloned voices and integrate advanced voice capabilities into your projects via a robust API.
voicetotextapp
An AI-powered transcription service that accurately converts voice and audio into text in real-time. Supports multiple languages, speaker …
An AI-powered transcription service that accurately converts voice and audio into text in real-time. Supports multiple languages, speaker identification, and various export formats. Ideal for transcribing meetings, interviews, podcasts, and lectures with high speed and precision.
speechtotextai
speechtotextai is a free, AI-powered web tool that quickly transcribes audio files and YouTube videos into text. Simply …
speechtotextai is a free, AI-powered web tool that quickly transcribes audio files and YouTube videos into text. Simply upload a file or paste a YouTube link to receive an accurate, machine-generated transcript. Ideal for content creators, students, and professionals who need to convert spoken content into written format efficiently.
AppTek.ai
AppTek.ai is a global leader in AI and machine learning for language technologies. It provides enterprise-grade solutions for …
AppTek.ai is a global leader in AI and machine learning for language technologies. It provides enterprise-grade solutions for Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), Natural Language Processing (NLP), and Text-to-Speech (TTS), serving industries like media, contact centers, and government.
neoformai
neoformai provides advanced AI models for African dialects, including Automatic Speech Recognition (ASR) and Text-to-Speech (TTS). It empowers …
neoformai provides advanced AI models for African dialects, including Automatic Speech Recognition (ASR) and Text-to-Speech (TTS). It empowers developers and businesses to create inclusive applications, bridging language barriers and making digital experiences accessible to millions across Africa.
Line 21 Live Captions
Line 21 is an intelligent captioning solution that combines professional human captioners with advanced AI technology. It offers …
Line 21 is an intelligent captioning solution that combines professional human captioners with advanced AI technology. It offers real-time captioning, live translation in over 120 languages, AI-powered proofreading, and automatic speech recognition (ASR). Designed for live events, broadcasts, and meetings, it ensures fast, accurate, and accessible content delivery to global audiences across platforms like YouTube, Zoom, and Teams.