ManyPI
ManyPI is a modern data gathering platform that transforms any website into a type-safe API. It simplifies structured …
ManyPI is a modern data gathering platform that transforms any website into a type-safe API. It simplifies structured data extraction with built-in schema definition, data extraction, and record transformation, empowering developers and technical teams to reliably collect web data at scale.
FlowDyno
FlowDyno is an AI-powered tool that transforms natural language descriptions into dynamic, animated architecture diagrams. It simplifies complex …
FlowDyno is an AI-powered tool that transforms natural language descriptions into dynamic, animated architecture diagrams. It simplifies complex diagramming workflows, offering one-click export and a rich library of technical icons.
DAGForge
DAGForge is an AI-powered platform that combines conversational AI with a visual drag-and-drop interface to build production-ready Airflow …
DAGForge is an AI-powered platform that combines conversational AI with a visual drag-and-drop interface to build production-ready Airflow DAGs 10x faster. It enables data professionals to describe data pipelines in plain English and deploy them in minutes, not days, streamlining data orchestration and development.
Vectorize
Vectorize is a RAG-as-a-Service platform that simplifies building AI applications on unstructured data. It offers managed RAG pipelines, …
Vectorize is a RAG-as-a-Service platform that simplifies building AI applications on unstructured data. It offers managed RAG pipelines, extensive data source connectors, and the flexibility to use its managed vector database or connect your own, enabling developers to deploy production-ready AI solutions quickly.
Dagster
Dagster is a modern, open-source data orchestrator designed for building, scaling, and observing AI and data pipelines. It …
Dagster is a modern, open-source data orchestrator designed for building, scaling, and observing AI and data pipelines. It acts as a unified control plane, allowing teams to model data assets, track lineage, and ensure data quality with confidence. By integrating software engineering best practices like local testing and reusable components, Dagster helps data engineers and ML teams ship products faster and more reliably.
Observo AI
Observo AI is an intelligent data pipeline platform for Security and DevOps teams. It uses AI to optimize …
Observo AI is an intelligent data pipeline platform for Security and DevOps teams. It uses AI to optimize telemetry data, reducing log volumes by up to 80% and observability costs by over 50%. The platform accelerates threat detection, enriches data in real-time, and eliminates blind spots, making security and operations more efficient and cost-effective.
Pipekit
Pipekit is an enterprise-grade control plane and support service for Argo Workflows. It empowers platform and data teams …
Pipekit is an enterprise-grade control plane and support service for Argo Workflows. It empowers platform and data teams to run, monitor, and govern large-scale data, MLOps, and CI/CD pipelines on Kubernetes across multiple clusters and clouds.
Fivetran
Fivetran is an automated data movement platform that centralizes data from hundreds of sources into cloud data warehouses, …
Fivetran is an automated data movement platform that centralizes data from hundreds of sources into cloud data warehouses, lakes, and databases. It simplifies and accelerates data integration with pre-built, zero-maintenance pipelines, enabling teams to focus on analytics, AI, and business intelligence rather than on engineering.
Orchestra
Orchestra is a unified control plane for data orchestration and pipelining, designed for lean data teams. It offers …
Orchestra is a unified control plane for data orchestration and pipelining, designed for lean data teams. It offers an AI-native solution to build, monitor, and manage governed data pipelines with end-to-end observability, proactive alerting, and extensive integrations. It simplifies complex data workflows, reduces maintenance time, and ensures data is reliable and AI-ready.
Graphlit
Graphlit is a developer-focused Knowledge API platform for building AI applications and agents. It streamlines the ingestion, memory, …
Graphlit is a developer-focused Knowledge API platform for building AI applications and agents. It streamlines the ingestion, memory, and retrieval of unstructured data from any source, offering a powerful RAG-as-a-Service solution. With SDKs for major languages and tools for AI agent integration, it simplifies the creation of sophisticated AI systems.
Metaflow
A human-centric Python framework, originally from Netflix, for building and managing real-life data science, ML, and AI projects. …
A human-centric Python framework, originally from Netflix, for building and managing real-life data science, ML, and AI projects. It simplifies workflow orchestration, data management, and model deployment, enabling rapid prototyping and scalable production pipelines.
fleak
Fleak is an enterprise-ready, serverless platform for building self-healing AI data workflows. It simplifies data transformation and integration …
Fleak is an enterprise-ready, serverless platform for building self-healing AI data workflows. It simplifies data transformation and integration across systems using a low-code, drag-and-drop interface. Fleak unifies API services and streaming data processing, orchestrates LLMs, and ensures enterprise-grade governance, reducing engineering time by up to 90% without requiring infrastructure management.
Weld
Weld is an AI-powered data platform that automates data integration and transformation. It centralizes data from all your …
Weld is an AI-powered data platform that automates data integration and transformation. It centralizes data from all your SaaS tools and databases into a cloud warehouse like Snowflake or BigQuery. With its AI assistant, Ed, teams can easily clean, model, and prepare data for analytics, business intelligence, and AI applications, breaking down data silos and unlocking real-time insights.
Paradime
Paradime is an AI-powered ELT platform for analytics and AI, designed as a superior alternative to dbt Cloud. …
Paradime is an AI-powered ELT platform for analytics and AI, designed as a superior alternative to dbt Cloud. It integrates an AI-enhanced Code IDE, automated data pipelines (Bolt), and a FinOps cost-saving tool (Radar) into a single, unified platform. This empowers data teams to accelerate development, increase reliability, and significantly reduce data warehouse costs, streamlining the entire analytics engineering workflow.
Union.ai
Union.ai is an enterprise-grade, production-ready platform for orchestrating complex AI and machine learning workflows. Built on the open-source …
Union.ai is an enterprise-grade, production-ready platform for orchestrating complex AI and machine learning workflows. Built on the open-source Flyte, it empowers teams to build, serve, and scale compound AI systems with unparalleled performance and efficiency. It bridges the data-ML gap, optimizes cloud costs with features like scale-to-zero, and enhances developer velocity through a seamless, integrated experience.
aiflow.ai
aiflow.ai is a no-code platform for building and automating AI-powered workflows. Visually connect your favorite apps and AI …
aiflow.ai is a no-code platform for building and automating AI-powered workflows. Visually connect your favorite apps and AI models to streamline tasks, from content creation and data analysis to customer support, boosting productivity and innovation for your business.
Reworkd
Reworkd is an AI-powered, no-code platform that automates the entire web data extraction process. It uses AI agents …
Reworkd is an AI-powered, no-code platform that automates the entire web data extraction process. It uses AI agents to understand websites, generate scraping code, and deliver structured data at scale. Ideal for building datasets, market research, and enriching data pipelines without manual coding or maintenance.
Isomeric
Isomeric is an AI-powered API that transforms messy, unstructured text from any source into clean, structured JSON data. …
Isomeric is an AI-powered API that transforms messy, unstructured text from any source into clean, structured JSON data. By defining a simple JSON schema, you can automatically extract specific information from websites, legal documents, customer support transcripts, and more, streamlining data pipelines and automation.
Airbyte
Airbyte is an open-source data integration platform that simplifies building and managing data pipelines. It enables you to …
Airbyte is an open-source data integration platform that simplifies building and managing data pipelines. It enables you to move data from hundreds of sources to destinations like data warehouses, lakes, and vector databases in minutes, using a vast catalog of pre-built connectors or by creating your own with a low-code builder. It supports both cloud and self-hosted deployments, focusing on data security, governance, and scalability for modern data and AI applications.
nao
nao is an AI-powered code editor designed for data teams. It streamlines SQL and Python data pipeline creation, …
nao is an AI-powered code editor designed for data teams. It streamlines SQL and Python data pipeline creation, dbt workflows, and analytics by natively connecting to your data warehouse. Its intelligent agent provides data-aware code suggestions, quality checks, and instant diff previews to help you ship data faster and more safely.
dagworks
Dagworks provides a suite of open-source developer tools, Hamilton and Burr, designed to build, debug, and observe reliable …
Dagworks provides a suite of open-source developer tools, Hamilton and Burr, designed to build, debug, and observe reliable AI applications. Hamilton standardizes ML and data pipelines for faster iteration and clear lineage, while Burr simplifies the creation of complex, stateful RAG and agentic systems with built-in observability.
DataChain
DataChain is a developer-first platform for managing "Heavy Data"—large-scale, unstructured, multimodal datasets. It enables teams to curate, enrich, …
DataChain is a developer-first platform for managing "Heavy Data"—large-scale, unstructured, multimodal datasets. It enables teams to curate, enrich, and version data like videos, images, audio, and PDFs for AI applications, featuring Python-based ETL pipelines, full data lineage, and scalable processing from local IDE to cloud.
Nimbleway
Nimbleway is an enterprise-grade platform for AI-driven web data collection and scalable data pipelines. It empowers businesses to …
Nimbleway is an enterprise-grade platform for AI-driven web data collection and scalable data pipelines. It empowers businesses to interact with real-time web data, offering tools like agentic web search, an online knowledge cloud, and a robust SDK. Ideal for retail, finance, and AI, it provides hypergranular, structured data for competitive analysis, price monitoring, and feeding LLMs, ensuring ethical and compliant data gathering.
Kadoa
Kadoa is an AI-powered, no-code web scraping platform that automates data extraction from any website or document. It …
Kadoa is an AI-powered, no-code web scraping platform that automates data extraction from any website or document. It enables users to build scalable, self-healing data pipelines in minutes, eliminating engineering bottlenecks and providing real-time insights for finance, retail, and market intelligence.
Ask On Data
Ask On Data is an open-source, GenAI-powered data engineering tool that lets you build and manage data pipelines …
Ask On Data is an open-source, GenAI-powered data engineering tool that lets you build and manage data pipelines using a simple chat interface. By translating natural language commands into complex data operations, it eliminates the need for coding, making data engineering accessible to everyone. It supports various data sources, offers real-time previews, and provides both cloud-hosted and self-hosted options.
relayed.ai
relayed.ai is an AI-powered automation platform that intelligently connects your apps and workflows. It automates the process of …
relayed.ai is an AI-powered automation platform that intelligently connects your apps and workflows. It automates the process of relaying information, tasks, and data between services like Slack, email, CRMs, and project management tools, ensuring seamless communication and operational efficiency.
Flyte
Flyte is an open-source, cloud-native workflow orchestration platform designed for building, deploying, and managing production-grade data, machine learning, …
Flyte is an open-source, cloud-native workflow orchestration platform designed for building, deploying, and managing production-grade data, machine learning, and analytics pipelines. It emphasizes scalability, reproducibility, and ease of use, enabling teams to move from local development to large-scale production seamlessly. With a Python-first SDK and support for multiple languages, Flyte empowers data scientists and engineers to create complex, versioned, and maintainable workflows.
Lume AI
Lume AI is an AI-powered platform designed to automate and accelerate customer data implementation. It intelligently maps, analyzes, …
Lume AI is an AI-powered platform designed to automate and accelerate customer data implementation. It intelligently maps, analyzes, and ingests customer data, eliminating engineering bottlenecks and reducing onboarding time from weeks to days. By offering both a no-code interface and a flexible API, Lume AI helps businesses streamline data integration, normalize data from various sources, and manage complex data pipelines, allowing teams to focus on their core product value.
Mezmo
Mezmo is a comprehensive telemetry data pipeline platform designed for developers, DevOps, and SRE teams. It enables users …
Mezmo is a comprehensive telemetry data pipeline platform designed for developers, DevOps, and SRE teams. It enables users to ingest, process, and analyze logs, metrics, and traces from any source. With a focus on control and cost-efficiency, Mezmo allows you to filter, transform, and route your observability data to any destination, optimizing performance and reducing expenses.