RoryPlans
RoryPlans is a specialized AI tool designed for teams to collaboratively generate, review, and manage synthetic datasets for function calling. It aims to accelerate the development of more reliable AI agents by providing high-quality, structured data.
Bolt Foundry
Bolt Foundry provides open-source tooling for developers to perform unit tests on Large Language Models (LLMs). It transforms prompt engineering into a scientific, data-driven process by using structured, testable prompts called 'graders'. These graders are intended to make AI outputs reliable, consistent, and measurable, which suits production-grade applications.
Basalt
Basalt is an end-to-end platform for developers and product teams to build, evaluate, and monitor reliable AI agents. It provides a suite of tools, including automated evaluations, A/B testing, prompt engineering with an AI co-pilot, and a developer-friendly SDK, to ensure that AI features are trustworthy and production-ready.
Superface
Superface is a tooling and reliability platform for AI agents, enabling them to connect to and interact with external APIs and enterprise systems with human-level accuracy. It provides a library of pre-built, reliable tools and automated builders to significantly improve task completion rates and reduce development time.
Hamming AI
Hamming AI is an advanced platform for automated testing, production monitoring, and analytics for AI voice agents. It enables developers to simulate thousands of calls, audit live conversations, and instantly catch regressions to ensure voice AI reliability and performance across multiple languages.