Nexa SDK
Nexa SDK is a powerful toolkit enabling developers to deploy any AI model, including frontier and state-of-the-art models, …
Nexa SDK is a powerful toolkit enabling developers to deploy any AI model, including frontier and state-of-the-art models, to any device (mobile, PC, IoT, automotive) in minutes. It offers production-ready on-device inference with hardware acceleration across NPUs, GPUs, and CPUs, optimized for speed and energy efficiency.
OCR Arena
OCR Arena is a free online platform designed for testing and evaluating leading foundation Vision-Language Models (VLMs) and …
OCR Arena is a free online platform designed for testing and evaluating leading foundation Vision-Language Models (VLMs) and open-source Optical Character Recognition (OCR) models. It allows users to upload documents, measure accuracy, and compare model performance on a public leaderboard.
Hakko
Hakko is an advanced AI game companion leveraging Visual Language Models (VLMs) to provide real-time voice guidance, emotional …
Hakko is an advanced AI game companion leveraging Visual Language Models (VLMs) to provide real-time voice guidance, emotional companionship, and intelligent assistance across various games. It enhances your gaming experience with scene recognition, knowledge search, and personalized interactions, extending its support to daily life scenarios for a truly integrated AI partnership.
Gabber
Gabber is a powerful platform for building real-time, multimodal AI applications that can see, hear, and speak. It …
Gabber is a powerful platform for building real-time, multimodal AI applications that can see, hear, and speak. It offers low-latency inference for Vision Language Models (VLM), Text-to-Speech (TTS), and Speech-to-Text (STT), coupled with a graph-based orchestration system for rapid development and deployment.
Reducto
Reducto is an advanced Document Ingestion API for developers and enterprises. It uses Agentic OCR and Vision-Language Models …
Reducto is an advanced Document Ingestion API for developers and enterprises. It uses Agentic OCR and Vision-Language Models to accurately parse, split, extract, and even edit documents. It transforms unstructured data from various file formats into structured, LLM-ready inputs, automating complex document processing workflows with high precision and enterprise-grade security.
Moondream
Moondream is a powerful, open-source visual language model (VLM) that is incredibly lightweight and fast. With a tiny …
Moondream is a powerful, open-source visual language model (VLM) that is incredibly lightweight and fast. With a tiny 1GB footprint, it runs anywhere from edge devices to laptops. It allows developers to understand images through simple text prompts for tasks like captioning, object detection, OCR, and visual Q&A, without needing complex training or heavy infrastructure. It's designed for simplicity, versatility, and affordability.
Prism Replay
Prism Replay is an AI-native product analytics platform that automatically watches, summarizes, and analyzes user session replays. It …
Prism Replay is an AI-native product analytics platform that automatically watches, summarizes, and analyzes user session replays. It provides actionable insights to help product teams optimize conversions, understand user behavior, and identify friction points without manual effort.
Oda Studio
Oda Studio provides bespoke AI solutions to transform complex, unstructured data into actionable insights. Specializing in Vision-Language Models …
Oda Studio provides bespoke AI solutions to transform complex, unstructured data into actionable insights. Specializing in Vision-Language Models (VLMs) and custom data pipelines, they serve industries like construction, finance, and media. Their expert team delivers end-to-end services from data annotation to model deployment, enabling businesses to make smarter, faster decisions.
OpalAi
OpalAi is an advanced Spatial AI platform that transforms complex spatial, visual, textual, and audio data into actionable …
OpalAi is an advanced Spatial AI platform that transforms complex spatial, visual, textual, and audio data into actionable insights for enterprises. It leverages cutting-edge technologies like Vision Language Models (VLMs) and 3D reconstruction to offer specialized solutions for industries such as PropTech, InsurTech, transportation, and wildfire management, accelerating data-driven decision-making.
moondream2
moondream2 is a lightweight, open-source visual language model (VLM) designed for high efficiency on edge devices. It excels …
moondream2 is a lightweight, open-source visual language model (VLM) designed for high efficiency on edge devices. It excels at generating image descriptions, understanding complex documents, and performing visual Q&A, making it ideal for mobile applications and IoT scenarios with limited resources.