Vendor intelligence

Automated tracking of feature launches, pricing changes, partnerships, and architecture shifts across data integration and ingestion companies — updated daily.

Type

Company

500 of 1,843 total (showing most recent 500)
Feature LaunchFivetran
Jun 29, 2026

Agents Schema

An open standard for making business context readable by AI agents, allowing teams to designate a schema in their warehouse or lake as a shared context layer that agents can query before acting. The schema contains metric definitions, semantic models, dbt lineage, and custom business documentation in plain SQL tables.

Source →
ArchitectureFivetran
Jun 24, 2026

Proprietary cloud data warehouses and raw data lakes → Open Data Infrastructure (lakehouse architecture with open storage, file formats, table metadata, and flexible compute)

Shift from tightly coupled proprietary warehouse systems or uncontrolled raw data lakes toward an unbundled, standards-based architecture separating storage, file formats, table catalogs, compute, and governance layers to enable workload flexibility while maintaining data integrity.

Source →
ArchitectureFivetran
Jun 22, 2026

Architecture shift

Fivetran and dbt Labs are enabling organizations to adopt Open Data Infrastructure, an open, governed approach to enterprise data that gives organizations ownership of their data, transformations, and AI stack combined with Snowflake's AI Data Cloud.

Source →
PartnershipFivetran
Jun 22, 2026

dbt Labs

Fivetran and dbt Labs are working together to help organizations build Open Data Infrastructure, combining automated data movement with trusted transformations to create governed, AI-ready data foundations.

Source →
PartnershipFivetran
Jun 22, 2026

Snowflake

Fivetran named a Leader for the fifth consecutive year in Snowflake's 2026 Modern Marketing Data Stack report, recognized for its role in helping organizations build AI-ready data foundations through integration and data modeling.

Source →
Feature LaunchClickHouse
Jun 18, 2026

Postgres Managed by ClickHouse - RBAC, Terraform, ClickPipes, extensions

New features in Postgres Managed by ClickHouse including RBAC, Terraform support, ClickPipes integration, and extensions.

Source →
Feature Launchdbt Labs
Jun 18, 2026

dbt-core v2.0.0-alpha.2

[dbt-fusion] Added --empty support in seed command. Homebrew distribution now available with 'brew install dbt'. GET /api/v1/models now includes catalog field with row_count_stat, bytes_stat, last_modified_stat, and materialized field.

Source →
PartnershipCollibra
Jun 18, 2026

AWS

AWS and Collibra deepen partnership to bring business content and semantics to AWS SageMaker Catalog.

Source →
ArchitectureConfluent
Jun 18, 2026

Batch transformations → Streaming transformations with Confluent Cloud Flink

Organizations are shifting from batch-based data transformations to streaming transformations positioned at the source, a pattern called 'shifting left' that enables fresher data delivery, reduces pipeline latency, and simplifies architectural complexity.

Source →
PartnershipConfluent
Jun 18, 2026

dbt (dbt Labs)

Confluent announced the release of a dbt adapter for Confluent Cloud, enabling dbt users to manage Flink SQL transformations with the same familiar dbt interface and CI/CD workflows they use across other data platforms.

Source →
Feature LaunchConfluent
Jun 18, 2026

dbt Adapter for Confluent Cloud

A dbt adapter that enables data engineers to define SQL transformations as models, write tests, generate documentation, and deploy through CI/CD for Confluent Cloud Flink SQL. The adapter includes streaming-native materializations (view, streaming_table, streaming_source) and deterministic testing capabilities for streaming pipelines.

Source →
Feature LaunchMonte Carlo
Jun 17, 2026

Model Context Protocol (MCP) & Agent Toolkit

Platform capability for managing and monitoring AI agents with MCP support and associated toolkit for building reliable AI systems.

Source →
Feature LaunchMonte Carlo
Jun 17, 2026

AI Observability

Extended data observability coverage into AI observability, providing visibility across all four components where AI systems break (data, system, code, and model) plus AI-specific signals.

Source →
Feature LaunchMonte Carlo
Jun 17, 2026

Agent Observability

Monte Carlo launched Agent Observability to help teams build reliable AI with visibility across data, systems, code, and model components specifically for AI agents.

Source →
Feature LaunchClickHouse
Jun 17, 2026

GCP Pub/Sub connector for ClickPipes

GCP Pub/Sub connector for ClickPipes is now in Private Preview, enabling integration with Google Cloud Pub/Sub messaging service.

Source →
PartnershipMonte Carlo
Jun 16, 2026

NASDAQ

Monte Carlo interfaces with NASDAQ's fleet of specialized agents for data observability and diagnostics, helping route context to the right agent at the right time.

Source →
PartnershipMonte Carlo
Jun 16, 2026

Databricks

Monte Carlo participated in a fireside chat at the Databricks Data + AI Summit, where Monte Carlo's leadership discussed AI governance and observability challenges with NASDAQ executives.

Source →
ArchitecturedltHub
Jun 16, 2026

Multi-tool fragmented data stack (dlt ingestion, Airflow, hand-maintained transformation layer, semantic models in individual contributors' heads) → Unified agentic platform with canonical modeling toolkit, version-controlled specifications, and AI-generated transformation layers

dltHub is shifting from a fragmented five-tool, five-role data stack to a unified agentic architecture where AI agents generate pipelines, models, and dashboards from semantic specifications, with humans authoring meaning and reviewing implementations.

Source →
Pricing ChangedltHub
Jun 16, 2026

dltHub pricing update

dltHub Pro offers agentic pipeline generation at approximately $2-3 of agent time per pipeline, representing a new pricing model based on AI-assisted infrastructure automation.

Source →
Feature LaunchdltHub
Jun 16, 2026

dltHub Pro - Trial Program

A 2-week trial offering 30 runtime hours included with no credit card required, available to begin any time.

Source →
Feature LaunchdltHub
Jun 16, 2026

dltHub Pro - Canonical Modeling Toolkit

A spec-first semantic modeling system where users author data definitions and meanings, dlt infers schema and types from raw data, and agents generate models from the specifications for reusable agentic retrieval.

Source →
Feature LaunchdltHub
Jun 16, 2026

dltHub Pro - Agentic Data Pipeline Generation

dltHub Pro enables users to build pipelines, ingest from sources, transform data, deploy to production, and manage deployments through conversational AI agents, with generated code stored in version-controlled repositories.

Source →
PartnershipStarburst
Jun 16, 2026

Qlik

Starburst and Qlik collaborate to solve AI data access challenges, integrating Qlik's capabilities with Starburst's data platform.

Source →
ArchitectureConfluent
Jun 15, 2026

Stateless, chat-style agent frameworks → Stateful, event-driven stream processing architecture with Apache Kafka and Apache Flink

Shift from stateless chat-based agents to deterministic stream processing with immutable event logs, seven-dimensional state management, and policy gates for regulated AI compliance.

Source →
PartnershipConfluent
Jun 15, 2026

A2A protocol

Emerging protocol for agent-to-agent coordination within Confluent Intelligence's streaming agents framework.

Source →
PartnershipConfluent
Jun 15, 2026

Model Context Protocol (MCP)

Tool calling coordination for Streaming Agents through the Model Context Protocol, enabling safe exposure of external APIs and other agents to the reasoning engine.

Source →
Feature LaunchConfluent
Jun 15, 2026

Kora engine

A managed Kafka engine delivered through Confluent Cloud that provides 99.99% uptime SLA and holds SOC 2, ISO 27001, PCI DSS, and HIPAA compliance attestation.

Source →
Feature LaunchConfluent
Jun 15, 2026

Confluent Cloud for Apache Flink

Managed stream processing with RocksDB state backends for maintaining seven critical states across multi-step agent workflows, supporting exactly-once processing semantics and multi-phase commit sink functions.

Source →
Feature LaunchConfluent
Jun 15, 2026

Agent Decision Record

A structured event stream that logs every step of agent workflow with reason codes, evidence references, and rule citations using tamper-evident cryptographic chains with SHA-256 hashing and digital signatures.

Source →
Feature LaunchConfluent
Jun 15, 2026

Streaming Agents

Native agents running as Flink jobs within Confluent Cloud that automate business processes with AI, featuring stateful workflow management and policy gate enforcement for regulated environments.

Source →
Feature LaunchConfluent
Jun 15, 2026

Confluent Intelligence

A real-time, context-aware AI engine that runs Streaming Agents directly as Flink jobs, with tool calling coordinated through the Model Context Protocol (MCP) and agent-to-agent coordination using the emerging A2A protocol.

Source →
PartnershipMonte Carlo
Jun 15, 2026

Databricks

Native integration enabling Monte Carlo to observe agents built on Databricks Agent Bricks platform by reading traces directly from Unity Catalog Delta tables through existing Databricks connections. Supports both Knowledge Assistant agents and custom agents built via Mosaic AI Agent Framework.

Source →
Feature LaunchMonte Carlo
Jun 15, 2026

Agent Bricks Observability for Databricks

Native observability support for agents built on Agent Bricks that reads MLflow trace data directly from Unity Catalog Delta tables without requiring SDK installation, pipeline configuration, or deployment. Provides span-level traces, conversation history, eval monitors, and incident management across the agent stack.

Source →
Feature Launchdbt Labs
Jun 15, 2026

dbt-core v1.12.0b3

Add a direct_parents attribute to model nodes carrying the nearest public ancestors only, emitted in dbt ls --output=json for models. Lineage consumers can now render DAG edges from direct_parents instead of depends_on.nodes.

Source →
Feature LaunchConfluent
Jun 15, 2026

Kora Engine

Cloud-native engine powering Confluent Cloud that decouples compute from storage to deliver GBps+ throughput, 10x faster autoscaling, 10x lower tail latencies, and 99.99% SLA with full Apache Kafka protocol compatibility.

Source →
Feature LaunchConfluent
Jun 15, 2026

Cluster Linking

Creates real-time replicas of existing Kafka data and metadata for zero-downtime migration from self-managed Kafka or MSK to Confluent Cloud.

Source →
Feature LaunchConfluent
Jun 15, 2026

Stream Governance Suite

Bundles Schema Registry, Data Contracts, Stream Catalog, and Stream Lineage with CSFLE and BYOK for comprehensive data governance and PII protection.

Source →
Feature LaunchConfluent
Jun 15, 2026

Tableflow

Extends Kafka topics into open table formats (Apache Iceberg and Delta Lake) to form bronze and silver layers of an analytics medallion stack with integrated governance.

Source →
Feature LaunchConfluent
Jun 15, 2026

ML_PREDICT and AI_COMPLETE

Native Flink SQL functions for in-flight embedding generation inside the stream processor, eliminating the need for separate embedding worker tiers.

Source →
Feature LaunchConfluent
Jun 15, 2026

Real-Time Context Engine

A fully managed service that serves structured context to AI apps and agents over the Model Context Protocol with built-in authentication, RBAC, and audit logging.

Source →
Feature LaunchConfluent
Jun 15, 2026

Streaming Agents

Agents that run as Flink jobs inside the stream processing pipeline with always-on state, tool calling via MCP and Agent2Agent (A2A), and replayable, governed event flows.

Source →
Feature LaunchConfluent
Jun 15, 2026

Confluent Intelligence

AI-native layer for data streaming platform that ships Streaming Agents, Real-Time Context Engine, and built-in ML functions for agentic AI applications.

Source →
Feature LaunchStarburst
Jun 15, 2026

Starburst Enterprise - Multi-Cluster Architecture

Architecture enhancement delivering improved lakehouse performance and scalability for accelerating AI workloads.

Source →
Feature LaunchMonte Carlo
Jun 12, 2026

MCP & Agent Toolkit

Toolkit for managing and monitoring AI agents with Model Context Protocol (MCP) support and agent-specific capabilities.

Source →
Feature LaunchMonte Carlo
Jun 12, 2026

Troubleshooting Agent

Agent within the Monte Carlo platform that diagnoses agentic issues, helping troubleshoot problems with AI agents and their underlying data in production.

Source →
Feature LaunchMonte Carlo
Jun 12, 2026

Agent Observability

Agent Observability capability for monitoring and troubleshooting AI agents in production, including output quality, latency, token usage, tool call accuracy, and trajectory metrics.

Source →
PartnershipClickHouse
Jun 12, 2026

AWS

ClickHouse achieves AWS Retail Competency certification, recognizing expertise in retail solutions.

Source →
ArchitectureConfluent
Jun 12, 2026

Manual Kafka connector management and JAR file installation → Pre-built fully managed cloud connectors

Shift from manual sourcing, installation, and management of Kafka Connect plugins to 120+ pre-built, fully managed connectors integrated into Confluent Cloud.

Source →
ArchitectureConfluent
Jun 12, 2026

Kafka Streams with custom Java/Scala microservices → Managed Apache Flink and ksqlDB

Transition from building and running custom Kafka Streams microservices to using fully managed stream processing engines that support SQL-based transformations.

Source →
ArchitectureConfluent
Jun 12, 2026

MirrorMaker 2 for multi-region replication → Confluent Cluster Linking

Replacement of independent MirrorMaker 2 cluster deployments with native Cluster Linking that mirrors topics and preserves message offsets across regions.

Source →
ArchitectureConfluent
Jun 12, 2026

Self-managed Apache Kafka infrastructure → Confluent Cloud fully managed cloud-native Kafka service

Shift from managing underlying Kafka broker instances, manual upgrades, and infrastructure provisioning to fully abstracted cloud infrastructure with serverless and dedicated deployment models.

Source →
Pricing ChangeConfluent
Jun 12, 2026

Confluent pricing update

Freight cluster tier offers up to 90% throughput savings compared to self-managed setups for high-throughput, latency-insensitive workloads.

Source →
Pricing ChangeConfluent
Jun 12, 2026

Confluent pricing update

Confluent Cloud provides dedicated capacity model based on Confluent Capacity Units (CKUs) for heavy production environments.

Source →
Pricing ChangeConfluent
Jun 12, 2026

Confluent pricing update

Confluent Cloud offers consumption-based pricing model for lighter workloads using Basic and Standard tiers.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Streaming Agents

Feature to automate business processes using AI with streaming data.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Intelligence

Real-time, context-aware AI capabilities for streaming data applications.

Source →
Feature LaunchConfluent
Jun 12, 2026

Tableflow

Feature enabling transformation of Kafka topics to tables in a few clicks, supporting data formats like Iceberg and Delta Lake.

Source →
Feature LaunchConfluent
Jun 12, 2026

WarpStream

Kafka-compatible data streaming platform deployable in private cloud environments.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Private Cloud

Deployment option that bridges on-premise control with cloud automation for organizations needing hybrid infrastructure.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Platform

Self-managed commercial data streaming platform that wraps Apache Kafka with enterprise-grade features including security, governance, and connectors.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cluster Linking

Native feature to mirror topics and preserve message offsets across regions for multi-region disaster recovery without external workers.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Stream Catalog and Governance

Built-in features for stream catalog, end-to-end data lineage, and quality rules for managing complex deployments.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Stream Processing with Apache Flink

Fully managed Apache Flink integration allowing real-time stream processing using standard SQL queries.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Kafka Connect

Over 120 pre-built, fully managed cloud connectors for seamless integration with external datastores like Snowflake and S3.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Schema Registry

Enforces strict data contracts using Avro, Protobuf, and JSON Schema to prevent producers from breaking downstream applications.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud - Freight Cluster

Cost-effective serverless cluster tailored for high-throughput, latency-insensitive workloads like logging and AI/ML data ingestion, offering up to 90% throughput savings.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud - Enterprise Cluster

Advanced cluster tier offering governance and sharing capabilities for complex architectural needs.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud - Dedicated Cluster

High-throughput enterprise cluster tier with private networking, predictable performance, and isolated infrastructure.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud - Standard Cluster

Production-ready cluster tier with standard features, multi-zone availability, and Schema Registry for standard workloads.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud - Basic Cluster

Serverless cluster tier ideal for development, prototyping, and low-throughput applications with basic features.

Source →
Feature LaunchStarburst
Jun 12, 2026

AIDA Skills

Extension capability for AIDA that allows domain expertise to be integrated with the AI assistant for enhanced functionality.

Source →
ArchitectureConfluent
Jun 12, 2026

MirrorMaker 2 → Cluster Linking

Confluent replaced external MirrorMaker 2 deployments with native Cluster Linking for multi-region disaster recovery, eliminating the need to deploy and monitor an independent cluster.

Source →
PartnershipConfluent
Jun 12, 2026

Debezium

Confluent Cloud provides fully managed Debezium PostgreSQL CDC source connector for capturing database changes.

Source →
PartnershipConfluent
Jun 12, 2026

Apache Flink

Confluent integrates fully managed Apache Flink capabilities into Confluent Cloud for real-time stream processing.

Source →
PartnershipConfluent
Jun 12, 2026

Amazon S3

Confluent provides a pre-built, fully managed cloud connector for seamless integration with S3.

Source →
PartnershipConfluent
Jun 12, 2026

Snowflake

Confluent provides a pre-built, fully managed cloud connector for seamless integration with Snowflake datastores.

Source →
PartnershipConfluent
Jun 12, 2026

Microsoft Azure

Confluent Cloud is available natively on Microsoft Azure as part of its global deployment across major cloud providers.

Source →
PartnershipConfluent
Jun 12, 2026

Google Cloud Platform (GCP)

Confluent Cloud is available natively on GCP as part of its global deployment across major cloud providers.

Source →
PartnershipConfluent
Jun 12, 2026

Amazon Web Services (AWS)

Confluent Cloud is available natively on AWS as part of its global deployment across major cloud providers.

Source →
Pricing ChangeConfluent
Jun 12, 2026

Confluent pricing update

Confluent Cloud offers consumption-based pricing for lighter workloads (Basic and Standard tiers) and dedicated capacity model based on Confluent Capacity Units (CKUs) for heavy production environments.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud Cluster Types (Freight)

A highly cost-effective, serverless cluster type tailored for high-throughput, latency-insensitive workloads like logging, observability, batch pipelines, and AI/ML data ingestion, offering up to 90% throughput savings.

Source →
Feature LaunchConfluent
Jun 12, 2026

Cluster Linking

A native multi-region and disaster recovery feature that mirrors topics and preserves message offsets across regions without external workers.

Source →
Feature LaunchConfluent
Jun 12, 2026

Stream Governance

Built-in stream catalog, end-to-end data lineage, and quality rules to manage complex deployments and democratize access to high-quality data.

Source →
Feature LaunchConfluent
Jun 12, 2026

Streaming Agents

Automation of business processes with AI using Confluent's streaming platform.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Intelligence

Real-time, context-aware AI capabilities for the Confluent platform.

Source →
Feature LaunchConfluent
Jun 12, 2026

Tableflow

A feature that enables converting topics to tables (Iceberg or Delta Lake tables) in a few clicks.

Source →
Feature LaunchConfluent
Jun 12, 2026

Stream Processing with Apache Flink and ksqlDB

Fully managed Apache Flink and ksqlDB capabilities that allow processing real-time streams using standard SQL queries.

Source →
Feature LaunchConfluent
Jun 12, 2026

Kafka Connect (120+ pre-built connectors)

Confluent provides 120+ pre-built, fully managed cloud connectors for seamless integration with external datastores such as Snowflake and S3.

Source →
Feature LaunchConfluent
Jun 12, 2026

Schema Registry

Enforces strict data contracts using Avro, Protobuf, and JSON Schema to prevent producers from breaking downstream applications with arbitrary payload changes.

Source →
Feature LaunchConfluent
Jun 12, 2026

WarpStream

A Kafka-compatible data streaming platform designed for deployment in private cloud environments.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Private Cloud

A deployment option that bridges on-premise control with the automation and benefits of a cloud service.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Platform

A self-managed, comprehensive data streaming platform built on Apache Kafka that includes enterprise-grade features, management tools, and ecosystem integrations.

Source →
Feature LaunchConfluent
Jun 12, 2026

Confluent Cloud

A fully managed, cloud-native Kafka service available globally across AWS, GCP, and Microsoft Azure with multiple cluster types (Basic, Standard, Dedicated, Enterprise, Freight) and consumption-based or capacity-based pricing models.

Source →
Feature LaunchClickHouse
Jun 11, 2026

ClickCannon

A tool for benchmarking ClickHouse performance and capabilities.

Source →
Feature LaunchClickHouse
Jun 11, 2026

Postgres to Postgres ClickPipes

New ClickPipes feature enabling Postgres to Postgres data integration in ClickHouse Cloud.

Source →
PartnershipEstuary
Jun 11, 2026

Copilot

Agent Skills feature integrates with Copilot to enable building and debugging pipelines directly within the AI coding tool.

Source →
PartnershipEstuary
Jun 11, 2026

Cursor

Agent Skills feature integrates with Cursor to enable building and debugging pipelines directly within the AI coding tool.

Source →
PartnershipEstuary
Jun 11, 2026

Claude Code

Agent Skills feature integrates with Claude Code to enable building and debugging pipelines directly within the AI coding tool.

Source →
PartnershipEstuary
Jun 11, 2026

Snowflake

Integration and positioning around Snowflake's agentic enterprise vision, including Snowflake CoCo and Datastream announcements discussed in the context of Estuary's real-time data infrastructure.

Source →
Feature LaunchEstuary
Jun 11, 2026

Agent Skills

A new feature that lets teams build, monitor, and debug pipelines directly from within AI coding tools like Claude Code, Cursor, and Copilot.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.9

Release with significant enhancements to data-aware scheduling, dynamic task mapping, and object storage.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.7

Release featuring automatic setup/teardown of tasks, built-in OpenLineage support, cluster activity view, fail-stop functionality, and more.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Kubernetes Executor Support in Astro

Support for Kubernetes Executor in Astro enabling task isolation, efficient resourcing, and simplicity for managing Airflow workloads.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.6

Release containing over 500 commits from over 130 contributors, adding 35 new features, 50 general improvements, and 27 bug fixes.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.5

Release with improvements to dynamic task mapping and data-dependent scheduling features.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Cloud IDE

Notebook-inspired tool for writing data pipelines without requiring knowledge of Apache Airflow.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Cosmos 1.0

Major milestone release of Cosmos, the best way to run dbt Core in Airflow.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.8

Release including Airflow ObjectStore, Listener hook for Datasets, enhanced logging capabilities, and more.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Deploy Rollbacks

Feature enabling users to revert code deployed to Astro Deployments to a known 'good' state for quick recovery from failing pipelines.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Cosmos 1.6

Latest version of Astronomer's dbt-core integration with enhancements and additions for Airflow integration.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Terraform Provider

Terraform provider enabling users to manage and automate Astro deployments using Infrastructure as Code.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 2.10

Release bringing greater flexibility and expansion of widely used Airflow features.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Dag-Level Roles on Astro

Fine-grained access control feature for Enterprise customers to control access to individual DAGs within a shared deployment.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Data Quality in Astro Observe

Enhanced data quality monitoring in Astro Observe with orchestration-first approach, now available in private preview.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Cosmos 1.11 (alpha)

Alpha support for dbt Fusion with lightning-fast parsing, state-aware orchestration, and real-time validation orchestrated natively in Airflow.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Enhanced Alerting on Astro

Unified monitoring experience through pattern-based alert rules that can be applied across hundreds of DAGs simultaneously.

Source →
Feature LaunchAstronomer
Jun 11, 2026

AI SDK for Apache Airflow

SDK introducing AI agent tooling and capabilities to Apache Airflow for building AI-powered workflows with human-in-the-loop operations.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Remote Execution on Astro

Feature allowing enterprises to run workloads exactly where needed while maintaining centralized orchestration and observability in Astro.

Source →
Feature LaunchAstronomer
Jun 11, 2026

AI Agent Tooling for Airflow

Specialized Airflow knowledge integration for AI coding tools like Claude Code, Cursor, and VS Code, enabling access to Airflow intelligence in local workflows.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 3

Major release reimagining data orchestration for the AI era with significant improvements and new capabilities.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Private Cloud

Enterprise orchestration and scheduling platform in your environment with control plane and data plane separation, enhanced security, and Apache Airflow 3 support.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Observe

New standard for pipeline reliability and data product observability, now generally available.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 3.1

Latest Airflow release continuing momentum from Airflow 3 with new features and improvements.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Executor

Airflow 3 architecture built for performance and reliability, providing 70% higher concurrency, fewer failures, and lower infrastructure costs.

Source →
Feature LaunchAstronomer
Jun 11, 2026

DAG Factory 1.0

Open-source tool for declarative DAG authoring in Apache Airflow reaches major milestone with version 1.0.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro IDE

First AI-powered IDE purpose-built for Apache Airflow, enabling teams to ship Airflow DAGs 10x faster with AI assistance.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Observe Data Quality

Data quality monitoring feature in Astro Observe with event-driven monitoring capabilities, now available in public preview.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro API

Production-ready and generally available API for programmatically managing Astro at scale, providing stable foundation for automation and migration from beta API.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Cross-Region Disaster Recovery on Astro

Enterprise resilience feature for cross-region disaster recovery on Astro with database replication, warm standby compute, and one-click failover, now generally available.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Cosmos 1.14

Update delivering significant improvements to Watcher execution mode for running dbt in Airflow, plus a fully restructured documentation experience.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Blueprint in Astro

Self-service DAG authoring feature enabling anyone in an organization to create Airflow pipelines through a drag-and-drop no-code interface without requiring Python or Airflow knowledge.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Apache Airflow 3.2

Latest Airflow release featuring asset partitions, async tasks, and continued improvements to the Airflow 3 platform.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Astro Private Cloud 2.0

Enhanced version of Astro Private Cloud with disaster recovery, governance, and audit logging capabilities, now generally available.

Source →
Feature LaunchAstronomer
Jun 11, 2026

Otto

Data engineering agent built for Airflow that investigates pipeline failures automatically and uses AI to convert Control-M, AutoSys, and Automic job definitions into production-ready Airflow DAGs.

Source →
ArchitectureFivetran
Jun 11, 2026

Warehouse-centric architecture → Open Data Infrastructure (ODI) with decoupled storage and compute

Shift from tightly coupled, warehouse-centric models to Open Data Infrastructure grounded in open standards, separating storage and compute, supporting multiple engines without data duplication, and enabling AI agents with consistent governance.

Source →
PartnershipMonte Carlo
Jun 10, 2026

Anthropic (Claude)

Integration enabling Claude AI agents to leverage Monte Carlo's data observability capabilities for building and maintaining reliable data products with embedded quality assurance.

Source →
Feature LaunchMonte Carlo
Jun 10, 2026

Monitors as Code

A capability allowing monitoring definitions to be version-controlled and deployed through CI/CD pipelines alongside data assets, ensuring reliability ships with code changes.

Source →
Feature LaunchMonte Carlo
Jun 10, 2026

Monte Carlo MCP Server

A Model Context Protocol server integration with the Agent Toolkit that enables agents to assess monitoring coverage, blast radius, validate fields/tables against live workspace, and emit deployable monitors-as-code YAML.

Source →
Feature LaunchMonte Carlo
Jun 10, 2026

Monte Carlo Agent Toolkit - Prevent Skill

A safety-first loop that uses editor hooks to surface downstream blast radius, active alerts, monitor coverage before edits, generate monitors-as-code after edits, and validate changes before merge.

Source →
PartnershipClickHouse
Jun 10, 2026

Datadog

Datadog and ClickHouse partner to bring full-fidelity data to modern observability solutions.

Source →
ArchitecturePrefect
Jun 10, 2026

Traditional monolithic knowledge work tools → FastMCP with composable, version-controlled Python workflows

FastMCP enables transformation from traditional knowledge work tools like PowerPoint to composable, version-controlled Python workflows for data engineers.

Source →
ArchitecturePrefect
Jun 10, 2026

TypeScript Lambdas → Prefect

Barstool Sports replaced unreliable TypeScript Lambdas with Prefect to orchestrate media data across podcasts, e-commerce, and social media.

Source →
PartnershipPrefect
Jun 10, 2026

Debezium

Debezium Change Data Capture (CDC) integration with Prefect for building real-time event-driven workflows to modernize legacy systems.

Source →
PartnershipPrefect
Jun 10, 2026

AWS

Seven.One Entertainment uses Prefect to orchestrate data pipelines connecting Snowflake, dbt, and AWS across extraction, transformation, quality testing, and distribution.

Source →
PartnershipPrefect
Jun 10, 2026

dbt

Seven.One Entertainment uses Prefect to orchestrate data pipelines connecting Snowflake, dbt, and AWS across extraction, transformation, quality testing, and distribution.

Source →
PartnershipPrefect
Jun 10, 2026

Snowflake

Seven.One Entertainment uses Prefect to orchestrate data pipelines connecting Snowflake, dbt, and AWS across extraction, transformation, quality testing, and distribution.

Source →
Pricing ChangePrefect
Jun 10, 2026

Prefect pricing update

Dagster vs Prefect self-serve plans comparison showing pricing differences between Dagster+ and Prefect Cloud for small teams and solo practitioners.

Source →
Feature LaunchPrefect
Jun 10, 2026

Real-Time Workflows with Debezium Integration

Real-time event-driven workflow capability using Prefect and Debezium Change Data Capture (CDC) for modernizing legacy systems with instant automated workflows.

Source →
Feature LaunchPrefect
Jun 10, 2026

Prefect - Decomposed Durability for Data Workflows

Prefect's approach to durable execution that decouples results from workflow identity, enabling cross-workflow caching and exactly-once semantics through composable primitives.

Source →
Feature LaunchPrefect
Jun 10, 2026

Prefect Cloud - Hybrid Deployment Architecture

Patented two-component hybrid architecture for Prefect Cloud that isolates code and data to meet FedRAMP, HIPAA, and PCI-DSS compliance requirements with three deployment options: Hybrid, PrivateLink, and Customer-Managed.

Source →
Feature LaunchPrefect
Jun 10, 2026

Prefect Cloud - Agentic Security Questionnaires

An agentic security questionnaire workflow feature on Prefect Cloud with full observability, a self-improving knowledge base, and human review built in.

Source →
Feature LaunchPrefect
Jun 10, 2026

FastMCP

FastMCP replaces traditional knowledge work tools with composable, version-controlled workflows for data engineers, enabling transformation from PowerPoint to Python-based workflows.

Source →
PartnershipLlamaIndex
Jun 10, 2026

Anthropic

Anthropic Fable 5 benchmarking results show impressive document understanding with 90.02% content faithfulness and 72.62% semantic formatting, leading competitors by 12+ points in key metrics on ParseBench.

Source →
Feature LaunchLlamaIndex
Jun 10, 2026

ParseBench

First doc-parsing benchmark built for AI agents with 2,000+ human-verified pages and 167K+ test rules across 5 dimensions, presented at CVPR 2026.

Source →
Feature LaunchLlamaIndex
Jun 10, 2026

Parse-Flow Visual Workflow Designer

Open-source project tackling enterprise document processing with four primitives (Parse, Classify, Split, Extract) in a drag-and-drop interface powered by LlamaAgents workflows.

Source →
Feature LaunchLlamaIndex
Jun 10, 2026

LlamaParse Granular Bounding Boxes

New word, line, and cell-level coordinates for every extracted value, providing complete audit trails from extracted data back to exact source locations in documents for compliance and verification workflows.

Source →
ArchitectureFivetran
Jun 10, 2026

Fragmented ERP systems across 19 countries → SAP S/4HANA Cloud private edition via RISE with centralized data in Snowflake

ANASAC consolidated its fragmented multi-country ERP environment onto a unified SAP S/4HANA RISE platform and moved SAP data into Snowflake for centralized analytics and governance.

Source →
PartnershipFivetran
Jun 10, 2026

SAP

Fivetran enabled extraction of critical data from SAP S/4HANA RISE using indirect static reads through the NetWeaver application layer over RFC.

Source →
PartnershipFivetran
Jun 10, 2026

Snowflake

ANASAC used Fivetran to extract critical data from SAP S/4HANA and deliver it into Snowflake, creating a scalable foundation for centralized analytics.

Source →
PartnershipFivetran
Jun 10, 2026

Evolve Decision Science

Evolve Decision Science acted as a strategic partner to ANASAC, identifying Fivetran as the right platform for its unified data strategy and helping drive the implementation.

Source →
ArchitectureFivetran
Jun 10, 2026

Vendor-locked, tightly bundled data platforms with proprietary formats and coupled compute/storage → Open Data Infrastructure with open formats, table formats, modular components, and interoperable engines

Fivetran advocates for a shift from vendor lock-in to Open Data Infrastructure (ODI), an architectural approach that stores data once in open formats and enables use across multiple tools, compute engines, and AI systems without vendor dependency.

Source →
ArchitectureMonte Carlo
Jun 9, 2026

Single deployment model → Three deployment models: Cloud, Cloud with Customer-hosted Data Store, Hybrid

Monte Carlo introduced flexible deployment options allowing customers to choose where the agent and data store live, from fully hosted to hybrid environments with maximum customer control.

Source →
ArchitectureMonte Carlo
Jun 9, 2026

Monolithic native integrations → Composable integration building blocks framework

Integration layer transformed from individual native connectors into composable building blocks (native connectors, Push API, Custom SQL) with per-capability configuration and AI extensibility.

Source →
ArchitectureMonte Carlo
Jun 9, 2026

Cloud-native agents only → Multi-deployment model with Generic Agent

Monte Carlo evolved from cloud-native only agents (AWS, Azure, GCP) to support a containerized Generic Agent enabling on-prem, hybrid, and multi-cloud deployments with egress-only architecture.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Dedicated Instances

Isolated platform-layer deployments offering each customer their own subdomain, isolated AWS account, and core infrastructure with optional multi-region disaster recovery and PrivateLink connectivity.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Agent Observability Product

A product that monitors AI agents in production, representing Monte Carlo's expansion into AI system observability alongside data observability.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Connection Auth Rules

A declarative framework for configuring authentication across self-hosted credential integrations, allowing new auth types to be added as configuration changes running entirely inside the customer's environment.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Push Ingest API

REST endpoints for pushing metadata, lineage, and query logs into Monte Carlo to fill coverage gaps in existing integrations or land new sources without a dedicated connector.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Custom SQL Connectors

AI-assisted connectors for SQL sources not in Monte Carlo's native catalog, generated through Claude conversations to produce scaffolding, SQL templates, and deployable images that run on the Generic Agent.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Composable, AI-extensible Integration Platform

A new integration framework with composable building blocks including native connectors, Push Ingest API, and Custom SQL Connectors that can be mixed and matched per capability and extended via AI workflows with Claude.

Source →
Feature LaunchMonte Carlo
Jun 9, 2026

Generic Agent

A containerized, egress-only agent extending Monte Carlo into on-premises, multi-cloud, and hybrid environments where cloud-native agents cannot reach. Supports Docker deployment with no inbound ports and reads source credentials from AWS Secrets Manager, GCP Secret Manager, Azure Key Vault, or environment variables.

Source →
ArchitecturedltHub
Jun 9, 2026

GraphRAG approach requiring extraction of knowledge graphs from unstructured text → Virtual knowledge graph from existing canonical data models

Instead of extracting knowledge graphs from raw text, leverage the structure already present in canonical data models as native knowledge graphs, eliminating a separate extraction step while still applying ontology-driven querying principles.

Source →
ArchitecturedltHub
Jun 9, 2026

Sequential query processing through multiple handoffs (request → load → transform → analyst → dashboard) → Question-driven direct model updates with shifted canonical model

Moving the canonical model development left in the pipeline so new questions directly drive ingestion and modeling changes, eliminating handoffs and allowing direct query answering from the model when coverage exists.

Source →
ArchitecturedltHub
Jun 9, 2026

Traditional data modeling with semantic layers added post-hoc → Spec-first canonical knowledge layer driving both data modeling and agent queries

Shift from building raw data models first and then adding semantic layers on top, to writing canonical knowledge layers (combining structure, taxonomy, and ontology) first and using them to generate both the data model and drive agent-based querying. This eliminates the need to maintain separate artifacts.

Source →
Feature LaunchdltHub
Jun 9, 2026

Agentic Data Engineering Course

Educational offering available on dltHub's learning platform covering hands-on implementation of ontology-driven data modeling and spec-first development approaches.

Source →
Feature LaunchdltHub
Jun 9, 2026

Ontology Toolkit Preview

A toolkit for building and managing canonical knowledge layers (taxonomies and ontologies) that define data meaning alongside structure for use in text-to-SQL and agentic data engineering.

Source →
Feature LaunchdltHub
Jun 9, 2026

dltHub AI Workbench - Ontology-Driven Modeling

Data-engineering agents that generate ingestion and modeling from a canonical knowledge layer specification. The system allows developers to bootstrap an ontology and use it to generate canonical models with human curation of judgment calls.

Source →
ArchitectureDagster
Jun 9, 2026

Implicit dependency graphs → Explicit asset graph representation

Making data dependencies explicit and part of the platform's operational model rather than encoding them in code paths, logs, and engineers' knowledge, enabling impact analysis and safer re-runs.

Source →
ArchitectureDagster
Jun 9, 2026

Schedule-based automation → Data-state-driven automation

Transitioning from time-based scheduling (e.g., 'run at 7:00 AM') to declarative automation that responds to data state, dependency status, missing partitions, and freshness expectations, eliminating hidden ordering assumptions.

Source →
ArchitectureDagster
Jun 9, 2026

Job-centric orchestration systems → Asset-aware orchestration (Dagster)

Moving from job-centric systems that track task execution status to asset-aware systems that model data as first-class objects with explicit dependencies, quality checks, and freshness SLAs. This architectural shift enables teams to reason about data state rather than just job completion.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

New feature announcement for LlamaParse enabling more precise bounding box capabilities for document parsing.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

LlamaParse Granular Bounding Boxes

Enhanced bounding box feature announced separately, enabling precise visual citations and audit trails.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

LlamaParse Granular Bounding Boxes

Feature enabling more precise document element identification and positioning within parsed documents.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

A LlamaParse feature for enhanced document intelligence and visual processing capabilities.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

New capability announced for LlamaParse enabling more precise document element identification and processing.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

An enhancement to LlamaParse that provides granular bounding box capabilities for improved document intelligence and visual processing.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

New capability in LlamaParse for more precise document element localization and coordinate mapping.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

Enhancement feature announced in LlamaParse providing more precise spatial information and document understanding capabilities.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

LlamaParse Granular Bounding Boxes

Enhanced feature in LlamaParse providing more precise bounding box detection for document elements.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

New feature enabling line, word, and cell-level bounding box tracking across documents for precise citation attribution and redaction in agentic document AI workflows. Available in beta across all paid tiers.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

New capability announced for LlamaParse providing more precise spatial information in document processing.

Source →
PartnershipCoalesce
Jun 9, 2026

GitHub

Coalesce MCPs can be chained with GitHub to enable team workflows with governance controls and approval gates.

Source →
PartnershipCoalesce
Jun 9, 2026

Slack

Coalesce MCPs can be chained with Slack to enable team workflows with governance controls and approval gates.

Source →
PartnershipCoalesce
Jun 9, 2026

Anthropic (Claude)

Coalesce MCPs provide Claude and other AI clients direct access to transformations, governance metadata, and quality monitors for integrated data engineering workflows.

Source →
Feature LaunchCoalesce
Jun 9, 2026

Coalesce MCPs - Workflow Governance

MCP capability enabling teams to set read vs. write token scopes, chain Coalesce MCPs with Slack and GitHub, and add per-call approval gates for controlled governance.

Source →
Feature LaunchCoalesce
Jun 9, 2026

Coalesce MCPs - Root-Cause Analysis

MCP capability for debugging anomalies with live root-cause analysis by tracing anomalies upstream, reviewing recent commits, and pinpointing exact changes that broke pipelines.

Source →
Feature LaunchCoalesce
Jun 9, 2026

Coalesce MCPs - Impact Analysis

MCP capability enabling users to run impact analysis before field changes by pulling Catalog metadata and checking Quality status in a single prompt for downstream picture visibility.

Source →
Feature LaunchCoalesce
Jun 9, 2026

Coalesce MCPs (Model Context Protocols)

AI-enabled feature giving Claude and other AI clients direct access to Coalesce transformations, governance metadata, and quality monitors for unified impact analysis, root-cause debugging, and owner assignment within a single conversation.

Source →
Feature LaunchLlamaIndex
Jun 9, 2026

Granular Bounding Boxes in LlamaParse

Word, line, and cell-level bounding boxes for audit-grade citation across document AI workflows.

Source →
Feature LaunchMonte Carlo
Jun 8, 2026

Intelligent autonomous observability system for Cortex agents

Monte Carlo's observability system that can trace agent failures across the data, semantic, agent-build, and trust layers in production environments.

Source →
Feature LaunchMonte Carlo
Jun 8, 2026

Agent Bricks observability for Databricks

Native Agent Bricks observability integration for Databricks with zero instrumentation required.

Source →
Feature LaunchHevo Data
Jun 8, 2026

Hevo Data - Webhook Integration for Real-time Updates

Real-time data ingestion capability that allows Facebook to push updates directly to Hevo whenever campaigns, ads, or metrics change, enabling near-real-time data streams to BigQuery.

Source →
Feature LaunchHevo Data
Jun 8, 2026

Hevo Data - Facebook Ads to BigQuery Connector

No-code data pipeline that automatically syncs Facebook Ads data to BigQuery with built-in schema handling, incremental syncs, API change management, and automatic retries.

Source →
Feature LaunchdltHub
Jun 8, 2026

AI Workbench

AI Workbench is a tool that builds CDM (Conceptual Data Models) by asking clarifying questions about a domain and capturing answers as structured business rules, generating ontologies from the modeling workflow.

Source →
Feature LaunchCoalesce
Jun 8, 2026

Coalesce AI-Powered Data Pipeline Development

Coalesce offers AI capabilities to accelerate data engineering workflows, including natural language prompts for data transformation and modeling, AI-generated production-ready pipelines, and rich column-level metadata support for Snowflake.

Source →
ArchitectureCoalesce
Jun 8, 2026

Multiple disparate tools for transformations, cataloguing, and observability → Unified Coalesce data operating layer with integrated Transform, Catalog, and Quality

Moving from a fragmented tooling approach with separate tools for transformations, cataloguing, and observability to an integrated data operating layer with built-in observability capabilities alongside Transform and Catalog.

Source →
Feature LaunchCoalesce
Jun 8, 2026

Coalesce Quality

A built-in observability capability integrated into the Coalesce data operating layer alongside Transform and Catalog. It enables teams to automatically monitor critical data assets, detect unexpected issues in real time, and investigate incidents with full lineage and metadata context.

Source →
ArchitectureCoalesce
Jun 8, 2026

SQL Server with SSIS → Snowflake with Coalesce

Alliant Insurance migrated 200 data pipelines from SQL Server/SSIS to Snowflake and Coalesce, completing the legacy modernization in five weeks with 55-60% cost reductions.

Source →
PartnershipCoalesce
Jun 8, 2026

Systech

Systech's DBShift automation platform was used to facilitate Alliant Insurance's migration of 200 SQL Server/SSIS pipelines to Snowflake + Coalesce, demonstrating integration between Coalesce and Systech's migration tooling.

Source →
ArchitectureCoalesce
Jun 8, 2026

SQL Server (on-premise) → Snowflake (cloud)

Migration scenario demonstrating transition from legacy on-premise SQL Server systems to cloud-native Snowflake architecture, including automated ingestion with Snowflake OpenFlow and transformation via Coalesce.

Source →
PartnershipCoalesce
Jun 8, 2026

Snowflake

Joint demo walkthrough event showcasing SQL Server to Snowflake migration using Coalesce for transformation and modelling, with Snowflake OpenFlow for automated ingestion and downstream workflow orchestration.

Source →
PartnershipCoalesce
Jun 8, 2026

Snowflake

Integration of Snowflake Cortex Code with Coalesce for AI-powered pipeline triage and incident response automation.

Source →
Feature LaunchCoalesce
Jun 8, 2026

Snowflake Cortex Code Integration with Coalesce

AI agent capability that analyzes pipeline failures and performs triage by reasoning across schemas, queries, and job metadata to surface root causes and propose actionable fixes directly inside the data environment.

Source →
PartnershipMonte Carlo
Jun 5, 2026

Snowflake

Snowflake integration via Model Context Protocol enabling Claude Code to run targeted investigative SQL queries and trace data lineage for incident diagnosis.

Source →
PartnershipMonte Carlo
Jun 5, 2026

Claude (Anthropic)

Claude Code integrated with Monte Carlo's platform via Model Context Protocol (MCP) to enable AI-assisted root cause analysis, source code reading, and SQL query execution against Snowflake for data incident investigation.

Source →
Feature LaunchMonte Carlo
Jun 5, 2026

Operations Agent

Platform agent that automates observability and data engineering work alongside the Troubleshooting and Monitoring Agents.

Source →
Feature LaunchMonte Carlo
Jun 5, 2026

Monitoring Agent

Platform agent that automates observability and data engineering work alongside the Troubleshooting and Operations Agents.

Source →
Feature LaunchMonte Carlo
Jun 5, 2026

Claude Code integration with MCP and Snowflake

Integration enabling Claude Code to read dbt model source code and run targeted investigative SQL queries connected to Snowflake via Model Context Protocol for advanced root cause analysis.

Source →
Feature LaunchMonte Carlo
Jun 5, 2026

Troubleshooting Agent

An agentic observability tool that automates fast triage of data incidents by pattern-matching against alert history, table metadata, and lineage to identify root causes in seconds rather than 15-30 minutes of manual analyst work.

Source →
Feature LaunchdltHub
Jun 5, 2026

Nested structure unpacking

dlt unpacks nested structures into relational tables, making nested schema changes visible at the table level rather than hidden in JSON blobs.

Source →
Feature LaunchdltHub
Jun 5, 2026

Pydantic model integration for contracts

dlt supports using Pydantic BaseModel classes as authoritative schema contracts for tables, enabling code review of contract changes and automatic validation of field names and types.

Source →
Feature LaunchdltHub
Jun 5, 2026

Variant column handling for type changes

dlt automatically creates variant columns (e.g., amount__v_text) alongside original columns when data types change, preserving both the original and new type without breaking existing queries.

Source →
Feature LaunchdltHub
Jun 5, 2026

Schema update detection and routing

dlt provides schema_update payload that can be routed to Slack, PagerDuty, Linear, or other channels as part of the pipeline run, enabling real-time notifications when schema changes occur.

Source →
Feature LaunchdltHub
Jun 5, 2026

dlt schema evolution and contracts

dlt supports schema evolution policies with four contract modes (evolve, freeze, discard_row, discard_value) applied at the resource level, enabling granular control over schema changes across different data pipeline layers (raw, silver, gold).

Source →
ArchitectureFivetran
Jun 4, 2026

Manual, siloed data analysis across disconnected systems (Gong, Salesforce, Zendesk, billing, product usage) → Centralized data warehouse in BigQuery with AI-powered agent for automated churn analysis

Fivetran enabled transformation from manual multi-system churn analysis to centralized data architecture with AI agents, moving from reactive post-churn reviews to proactive retention management at scale.

Source →
PartnershipFivetran
Jun 4, 2026

Anthropic

Fivetran built an AI agent using Claude to analyze unified customer data and identify churn risks, demonstrating integration of Anthropic's LLM for enterprise retention use cases.

Source →
PartnershipFivetran
Jun 4, 2026

Salesforce

Fivetran centralized Salesforce interaction history and QBR notes as part of the unified customer data foundation used for churn pattern analysis and AI-powered retention management.

Source →
PartnershipFivetran
Jun 4, 2026

Gong

Fivetran integrated Gong call transcript data as a primary signal for churn analysis, incorporating 12 months of call transcripts into the centralized BigQuery data foundation for AI-driven retention insights.

Source →
PartnershipFivetran
Jun 4, 2026

Zendesk

Fivetran utilized Zendesk support ticket data as a key signal for churn analysis, combining it with data from other sources to build a unified customer data foundation for AI-powered retention analysis.

Source →
ArchitectureFivetran
Jun 3, 2026

dbt Core with original engine → dbt Core v2.0 with Fusion engine runtime

dbt Core v2.0 now available under Apache 2.0 license, replacing the original runtime with the faster, more scalable dbt Fusion engine runtime.

Source →
PartnershipFivetran
Jun 3, 2026

MotherDuck

Fivetran Partner SDK destination for MotherDuck now generally available, built and maintained by the MotherDuck team, enabling direct data loading.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Parcellab (Private Preview)

Private Preview release of Parcellab Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Atlassian Compass (Private Preview)

Private Preview release of Atlassian Compass Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Zoho People (Private Preview)

Private Preview release of Zoho People Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Google Tag Manager (GA)

General Availability release of Google Tag Manager Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Trustpilot (GA)

General Availability release of Trustpilot Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Cal.com (GA)

General Availability release of Cal.com Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Breathe HR (GA)

General Availability release of Breathe HR Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - Leapsome (GA)

General Availability release of Leapsome Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Lite connectors - ShipMonk (GA)

General Availability release of ShipMonk Lite connector with accelerated development cycle.

Source →
Feature LaunchFivetran
Jun 3, 2026

Supabase and Neon connectors

Both platforms now available as sources through Fivetran's PostgreSQL connector for seamless data integration.

Source →
Feature LaunchFivetran
Jun 3, 2026

Unstructured File Replication for Veeva Vault, Jira, and Zendesk

Unstructured file replication now supported across three additional platforms for more flexible data ingestion.

Source →
Feature LaunchFivetran
Jun 3, 2026

SAP OData Connector

Now in Beta, this connector enables data replication from SAP OData sources.

Source →
Feature LaunchFivetran
Jun 3, 2026

Db2 for z/OS Managed-Service Connector

Now in Beta, this managed-service connector enables data replication from Db2 for z/OS environments.

Source →
Feature LaunchFivetran
Jun 3, 2026

MotherDuck destination

Now generally available through Fivetran's Partner SDK destination, enabling direct data loading into MotherDuck built and maintained by the MotherDuck team.

Source →
Feature LaunchFivetran
Jun 3, 2026

dbt Core v2.0

Now available as open-source foundation under Apache 2.0 license, bringing the dbt Fusion engine runtime to the broader community with faster, more scalable execution engine.

Source →
Feature LaunchFivetran
Jun 3, 2026

dbt Wizard

AI coding agent purpose-built for analytics engineering, now available to all dbt Core users, leveraging dbt metadata to help build, troubleshoot, and optimize dbt workflows.

Source →
Feature LaunchFivetran
Jun 3, 2026

dbt State

Now available for all dbt users (Core and Fusion), this feature caches model and source state to reduce warehouse spend by average of 30% by skipping or cloning unchanged models.

Source →
Feature LaunchFivetran
Jun 3, 2026

Agents Schema

Open-source standard designating a schema in data warehouse as shared context layer for AI agents, providing flexible interoperable foundation while avoiding vendor lock-in.

Source →
Feature LaunchFivetran
Jun 3, 2026

Data Type Locking for Connector SDK

New feature that locks column data types after initial sync to prevent automatic data type changes during future syncs, reducing unexpected downstream impacts.

Source →
Feature LaunchFivetran
Jun 3, 2026

Connector SDK - Truncate operation

New truncate operation support enables easier replication of sources that only return active records, with transaction-style sync model ensuring all changes fully commit or roll back together.

Source →
Feature LaunchFivetran
Jun 3, 2026

Fivetran Enterprise AI agents

Solution that turns centralized data into context for AI agents, with design partners being sought for early access and product shaping.

Source →
Feature LaunchFivetran
Jun 3, 2026

No-trigger replication for SAP ERP

Fully rolled out across all SAP ERP connection methods, providing flexibility in deletion detection handling by allowing users to opt out of triggers.

Source →
Feature LaunchFivetran
Jun 3, 2026

AI Connector Agent

Now in Beta, this feature enables users to create custom connectors in minutes without coding by providing REST API documentation, automatically generating and configuring data pipelines.

Source →
Feature LaunchMonte Carlo
Jun 3, 2026

Agentic Control Plane with Trust Layer

A two-layer architecture combining orchestration (routing, context injection, guardrails) with a trust layer that continuously monitors agent performance through span-level telemetry, anomaly detection, and business-grounded evaluation pipelines.

Source →
Feature LaunchMonte Carlo
Jun 3, 2026

Agent Observability

Monte Carlo launched Agent Observability to help teams build reliable AI agents in production by monitoring context retrieval, efficiency, intended behavior, and output fitness for purpose.

Source →
PartnershipdltHub
Jun 2, 2026

Snowflake

dltHub named 2026 Snowflake Startup Program Product Partner of the Year at Snowflake Summit 2026, recognized for helping over 1,000 organizations ingest data into Snowflake AI Data Cloud with Python-native pipelines.

Source →
Feature LaunchdltHub
Jun 2, 2026

dltHub Pro

Agentic data engineering platform built on dlt that enables developers working with AI coding agents (Claude Code, Cursor, Codex) to find a source, build the pipeline, validate it locally, and deploy to production in one command.

Source →
Feature LaunchdltHub
Jun 2, 2026

Snowflake Native App for MSSQL, Oracle, MySQL, and PostgreSQL replication

Shipped a Snowflake Native App enabling full pipeline execution inside customer's Snowflake account for replicating MSSQL, Oracle, MySQL, and PostgreSQL databases with no external orchestrator required.

Source →
ArchitectureEstuary
Jun 2, 2026

Federated data access across multiple systems → Centralized data warehouse with real-time updates to Snowflake

Organizations are consolidating data into Snowflake as a centralized AI Data Cloud rather than accessing multiple federated source systems, enabling up-to-date information for AI workloads.

Source →
PartnershipEstuary
Jun 2, 2026

Tableau

Tableau BI tool leverages Snowflake's semantic views to benefit from data definitions and descriptions for improved query generation.

Source →
PartnershipEstuary
Jun 2, 2026

Hex

Hex BI tool leverages Snowflake's semantic views to benefit from data definitions and descriptions for improved query generation.

Source →
PartnershipEstuary
Jun 2, 2026

Sigma

Sigma BI tool leverages Snowflake's semantic views to benefit from data definitions and descriptions for improved query generation.

Source →
Pricing ChangeEstuary
Jun 2, 2026

Estuary pricing update

Estuary pricing model: $0.50/GB of data moved plus $0.14/connector/hour, positioned as 50% less expensive than competing ETL/ELT solutions.

Source →
Feature LaunchEstuary
Jun 2, 2026

Estuary Real-time Data Pipelines

Estuary Build enables fully managed real-time data pipelines with deployment options including Public, Private, and BYOC, featuring less than 100ms latency on streaming sinks/sources.

Source →
Feature LaunchEstuary
Jun 2, 2026

Salesforce Headless 360

Salesforce introduced Headless 360 in April 2026, exposing its platform's core capabilities through API, MCP (Model Context Protocol) and CLI for agent access.

Source →
Feature LaunchEstuary
Jun 2, 2026

Snowflake Semantic View Autopilot

Snowflake unveiled Semantic View Autopilot to dramatically reduce the efforts organizations must put into designing their semantic models.

Source →
Feature LaunchEstuary
Jun 2, 2026

Snowflake Semantic Views

Snowflake introduced semantic views to store data definitions and descriptions, which support text-to-SQL generation and Cortex Analyst for accurate query interpretation.

Source →
ArchitectureLlamaIndex
Jun 2, 2026

Architecture shift

Parse-Flow implements document processing pipelines using the llama-agents event-driven workflow framework, where document intelligence operations are orchestrated through a state machine composed of bootstrap, worker, and router steps.

Source →
Feature LaunchLlamaIndex
Jun 2, 2026

Parse-Flow

Open-source visual document intelligence workflow designer that enables users to drag-and-drop document processing steps (parse, extract, classify, split) onto a canvas and monitor execution with a live event dashboard.

Source →
PartnershipFivetran
Jun 2, 2026

Google Cloud Platform

Fivetran expanded its deployment availability to Google Cloud Platform in Saudi Arabia (GCP Dammam region), enabling in-region data processing and storage for organizations requiring data sovereignty compliance.

Source →
Feature LaunchFivetran
Jun 2, 2026

Fivetran on Google Cloud Platform (GCP Dammam region)

Fivetran is now available on Google Cloud Platform in Saudi Arabia (GCP Dammam region), enabling organizations to keep data in-region while maintaining compliance with the Kingdom's Personal Data Protection Law (PDPL) and accelerating AI adoption.

Source →
Feature LaunchLlamaIndex
Jun 2, 2026

Parse-Flow

Open-source workflow designer for visual document intelligence workflows with parsing, extraction, classification, and splitting primitives on a visual canvas backed by async worker and live event dashboard.

Source →
PartnershipFivetran
Jun 1, 2026

dbt Labs

Fivetran and dbt Labs announced a merger on June 1, 2026, integrating Fivetran's data movement capabilities with dbt's transformation and governance layer to support Open Data Infrastructure architecture with reliable data movement, model contracts, and semantic layer definitions.

Source →
Architecturedbt Labs
Jun 1, 2026

dbt-state as separate plugin → dbt-state bundled with dbt-core

Bundle dbt-state plugin (>=2.18,<3.0) as an optional install dependency of dbt-core, opt-in via --manage-state flag, DBT_ENGINE_MANAGE_STATE env var, or manage_state in dbt_project.yml.

Source →
Architecturedbt Labs
Jun 1, 2026

catalogs.yml v1 → catalogs.yml v2

Introduce catalogs.yml v2 with adapter-owned bridge architecture to improve catalog extensibility and management across adapters.

Source →
Architecturedbt Labs
Jun 1, 2026

dbt-core parser → dbt-fusion parser (v2-parser)

Add --use-v2-parser flag to delegate parsing to the fusion parser, load its manifest.json into runtime Manifest, and bypass dbt-core's parser. Configurable via CLI or dbt_project.yml.

Source →
Feature Launchdbt Labs
Jun 1, 2026

dbt-core v1.12.0b2

Make MAXIMUM_SEED_SIZE_MIB configurable. Automatically create latest-version pointer for versioned models. Add support for private git packages in packages.yml and dependencies.yml.

Source →
Feature Launchdbt Labs
Jun 1, 2026

dbt-core v2.0.0-alpha.1

Add latest_version_pointer for versioned models. Add OSI semantic layer document support. Bundle dbt-state plugin as optional install dependency with --manage-state flag.

Source →
Feature LaunchConfluent
Jun 1, 2026

MirrorMaker - New Metric Names Format (KIP-1280)

Adds metric.names.formats configuration for MirrorSourceConnector and MirrorCheckpointConnector to opt-in to new metric names.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Connect - Plugin Discoverability (KIP-1273)

Introduces ConnectPlugin interface that all Kafka Connect plugins implement to ensure common methods across all plugin types.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Connect - Batch Offset Translation (KIP-1239)

Adds RemoteClusterUtils.translateOffsets() method to translate committed offsets of multiple consumer groups simultaneously.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - DSL Headers-Aware State Stores (KIP-1285)

Exposes Headers-Aware State Stores to the DSL API.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - Headers Support in State Stores (KIP-1271)

Extends the Processor API to support record headers in state stores.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - ProcessingExceptionHandler for GlobalThread (KIP-1270)

Adds processing.exception.handler.global.enabled configuration to enable ProcessingExceptionHandler to handle GlobalKTable exceptions.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - Local State Cleanup on Startup (KIP-1259)

Adds state.cleanup.dir.max.age.ms configuration to automatically delete state directories that have not been modified for the specified duration on startup.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - In-Memory State Store Size Metrics (KIP-1250)

Adds new metrics tracking the number of keys in in-memory state stores.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - Bytes Utils Public API (KIP-1247)

Exposes the Bytes class as part of the public API so it appears in javadoc.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Streams - State Store Changelog Offset Management (KIP-1035)

Adds methods to the StateStore API to manage changelog offsets for custom StateStore implementations.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Broker - OAuth Client Assertion Support (KIP-1258)

Adds support for client assertion authentication to client_credentials grant type with OAuth to enhance security and compatibility with OAuth providers.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Broker - Partition Size Percentage Metrics (KIP-1257)

Introduces new metrics to track how much of the maximum retention each topic-partition currently uses for improved storage monitoring.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Broker - Broker Cordoning (KIP-1066)

Introduces cordoned.log.dirs configuration to cordon log directories, preventing new partitions from being placed on cordoned directories for scaling and decommissioning operations.

Source →
Feature LaunchConfluent
Jun 1, 2026

Kafka Broker - Follower Fetch from Tiered Offset (KIP-1023)

Adds follower.fetch.last.tiered.offset.enable configuration to use last tiered offset as start offset when bootstrapping new followers.

Source →
Feature LaunchConfluent
Jun 1, 2026

Apache Kafka 4.3.0

Major release containing 25 KIPs and over 600 commits with new features including Share Group Controls, Broker Isolation metrics, OAuth Client Assertion support, Headers-Aware State Stores in Kafka Streams, and improved group coordinator assignment logic.

Source →
Feature LaunchdltHub
Jun 1, 2026

dltHub AI Workbench data quality toolkit

Schema-aware data quality checks that bootstrap from dlt's existing schema, sample columns before rules ship, write checks as decorators into pipelines, and route failures to appropriate toolkits (ingestion, transformations, exploration). Includes four primitives (is_unique, is_not_null, is_in, case) and column-level metrics for drift detection.

Source →
ArchitecturedltHub
Jun 1, 2026

Per-row billing models → Compute hour-based billing

dltHub shifted from per-row billing to compute hour-based billing, arguing that users should pay for compute time used rather than the number of rows moved (which varies by data format and source type).

Source →
PartnershipdltHub
Jun 1, 2026

Snowflake

dltHub was named 2026 Snowflake Startup Program Product Partner of the Year.

Source →
Pricing ChangedltHub
Jun 1, 2026

dltHub pricing update

dltHub Pro pricing model: $119 for 50 hours of compute, then $1/hour after that. Billing is based on compute time rather than rows or GB moved.

Source →
Feature LaunchdltHub
Jun 1, 2026

dltHub Pro

dltHub Pro is a managed service that deploys, monitors, and scales dlt data pipelines with time-based billing (no per-row charges).

Source →
PartnershipStarburst
Jun 1, 2026

NVIDIA

GPU-accelerated SQL analytics partnership delivering industry-benchmark speedups on GPU infrastructure through Starburst and NVIDIA collaboration.

Source →
PartnershipDagster
Jun 1, 2026

Brooklyn Data

Brooklyn Data, a data consulting firm, deployed Dagster Compass on top of Snowflake to enable self-service analytics for their delivery excellence organization, using it to expose modeled PSA (Professional Services Automation) data and improve operational efficiency in Slack.

Source →
Architecturedbt Labs
Jun 1, 2026

Separate dual-engine architecture (dbt Core and dbt Fusion) → Unified single-engine architecture

dbt Core v2.0 and Fusion are now built on a shared foundation, ending the two-engine era. The previously separate ELv2-licensed Fusion code is now open-sourced as Apache 2.0 licensed dbt Core v2.0.

Source →
Architecturedbt Labs
Jun 1, 2026

JSON artifacts → Parquet artifacts

dbt Core v2.0 introduces Parquet as a high-performance alternative to large JSON files for artifacts, enabling direct querying through DuckDB and other agents.

Source →
Architecturedbt Labs
Jun 1, 2026

Python → Rust

dbt Core v2.0 is now built on the same Rust-based foundations as the dbt Fusion engine, replacing the previous Python implementation as the baseline for all users. The high-performance Rust implementation reduces complexity and becomes the foundation for future innovation.

Source →
Partnershipdbt Labs
Jun 1, 2026

Fivetran

Fivetran and dbt Labs announced a strategic combination with joint leadership fireside chat on June 25, 2026. dbt Core v2.0 is built on shared foundations between Fivetran + dbt Labs.

Source →
Pricing Changedbt Labs
Jun 1, 2026

dbt Labs pricing update

dbt Fusion engine relicensed from ELv2 to a more permissive license, with premium features available through free login or paid dbt platform account.

Source →
Feature Launchdbt Labs
Jun 1, 2026

dbt Core v1.12.0

New beta version of dbt Core v1.12.0 released, which enforces behavior changes that will be fully removed in v2.0 and ships with Fusion-powered project parser.

Source →
Feature Launchdbt Labs
Jun 1, 2026

Local documentation experience

Completely revamped local documentation powered by new Parquet artifacts and capable of scaling to projects of arbitrary size.

Source →
Feature Launchdbt Labs
Jun 1, 2026

Parquet artifacts

High-performance alternative to large JSON files that can be directly queried through DuckDB or other agents, included in dbt Core v2.0.

Source →
Feature Launchdbt Labs
Jun 1, 2026

dbt Core v2.0

Major release of dbt Core rebuilt on the same foundations as dbt Fusion engine with significant parse time improvements, a tightly-defined language spec, new Parquet artifacts for high-performance storage, revamped local documentation experience, streamlined adapter building, and simplified installation process.

Source →
ArchitectureFivetran
Jun 1, 2026

dbt Core v1.x → dbt Core v2.0 with Rust-based Fusion runtime

Major architectural shift to open source the Fusion runtime and move to a Rust-based engine in dbt Core v2.0, enabling up to 10x faster parse times and better scalability while aligning commercial investment directly with open source improvements.

Source →
PartnershipFivetran
Jun 1, 2026

dbt Labs

Joint product innovations between Fivetran and dbt Labs to build open, interoperable, AI-ready data infrastructure including dbt Core v2.0, dbt State, dbt Wizard, and integrations enabling coordinated product strategy.

Source →
Feature LaunchFivetran
Jun 1, 2026

Agents Schema

Open standard context layer stored in plain SQL tables that allows agents to read metric definitions, semantic models, dbt lineage, and custom business documentation from warehouse or lake schemas.

Source →
Feature LaunchFivetran
Jun 1, 2026

Fivetran AI Connector Agent

Beta tool that generates Fivetran-managed connectors for API sources directly from API documentation in minutes by crawling, parsing, and validating API structures.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Wizard CLI

Standalone command-line tool bringing dbt Wizard capabilities to local development environments, including query results and execution history.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Wizard in Studio

Dedicated chat-first workspace interface for integrated dbt Wizard experience within the dbt platform.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Wizard

Purpose-built AI agent for analytics engineering that understands dbt projects natively, helping with investigations, code changes, testing, and validation while maintaining project context and contracts.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt State

Plug-in that brings state awareness, orchestration, and caching across dbt Core and the dbt platform, reducing dbt-generated compute by over 30% while checking warehouse metadata and model SQL for changes.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Fusion engine

Extension to dbt Core v2.0 with richer capabilities including SQL comprehension, column-level lineage, instant feedback, and high-performance SQL linting.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Core v1.12

Beta release featuring the Rust parser for users who want faster parsing without upgrading to the major v2.0 version.

Source →
Feature LaunchFivetran
Jun 1, 2026

dbt Core v2.0

Open sourcing the Fusion runtime under Apache 2.0 license with a Rust-based engine offering up to 10x faster parse times, better scalability, cleaner adapter contribution model, and modern docs experience.

Source →
ArchitectureFivetran
Jun 1, 2026

Proprietary data warehouses → Open Data Infrastructure with Apache Iceberg and Apache Polaris

Movement toward open, AI-ready data foundations using Apache Iceberg and Fivetran-hosted Apache Polaris catalog to provide flexibility, interoperability, and vendor neutrality for multiple compute engines.

Source →
ArchitectureFivetran
Jun 1, 2026

Traditional static dashboards and manual analyst queries → Agentic AI with natural language interfaces powered by Fivetran, dbt, and Google Cloud

Shift from static dashboards to agentic AI that enables business users to ask complex questions in natural language and receive actionable answers, transforming how teams interact with data.

Source →
PartnershipFivetran
Jun 1, 2026

dbt

Collaboration demonstrating how dbt standardizes and models data into consistent, reusable business semantics that AI agents can understand and trust within the Fivetran and Google Cloud ecosystem.

Source →
PartnershipFivetran
Jun 1, 2026

Google Cloud

Partnership showcasing how Fivetran, dbt, and Google Cloud create a data foundation for powering AI agents with fresh, traceable, and well-governed data.

Source →
ArchitectureMonte Carlo
Jun 1, 2026

Single LLM-as-judge scoring system with monthly dashboard reviews → Multi-dimensional time-series evaluation framework with automatic anomaly detection via OpenTelemetry collectors and Monte Carlo orchestration

Axios shifted from a reactive, single-score evaluation approach to a proactive, multi-dimensional observability stack that tracks task completion, helpfulness, groundedness, and accuracy with continuous anomaly detection.

Source →
PartnershipMonte Carlo
Jun 1, 2026

Axios

Design partnership between Monte Carlo and Axios (media company) in early 2025 to advance their observability maturity and build agent observability infrastructure for AI auto-tagging systems.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

OpenTelemetry Integration with Monte Carlo

Integration of OpenTelemetry collectors with Monte Carlo as the orchestration layer for capturing and monitoring AI agent spans and metrics across production systems.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

Agent Observability Platform

Monte Carlo's Agent Observability platform provides end-to-end visibility across context, performance, behavior, and outputs for AI agents in production, with multi-dimensional evaluation tracking and anomaly detection capabilities.

Source →
PartnershipMonte Carlo
Jun 1, 2026

Forrester

Partnership to create a data quality cost calculator and conduct research on the Total Economic Impact of Monte Carlo's Data + AI Observability Platform, as well as surveys on data professional incident resolution times.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

Monte Carlo Data Quality Calculator (with Forrester)

A calculator built with Forrester to estimate the cost of bad data and calculate data downtime for organizations.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

Operations Agent

An agent for orchestration and operations management within Monte Carlo's data + AI observability platform.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

Monitoring Agent

Part of Monte Carlo's platform capabilities for agent-based observability and monitoring of data systems.

Source →
Feature LaunchMonte Carlo
Jun 1, 2026

Troubleshooting Agent

An agent that automatically identifies the root cause of data quality incidents by analyzing telemetry across hundreds of customer environments.

Source →
ArchitectureConfluent
May 28, 2026

Siloed, fragmented customer data across multiple systems → Centralized streaming platform with cross-system signal correlation and semantic embeddings

Customer signals are now ingested from multiple sources into Kafka topics, normalized and enriched with Flink, vectorized for AI, and served through a unified intelligence layer with account-level authorization.

Source →
ArchitectureConfluent
May 28, 2026

Batch-driven and static dashboard-based customer intelligence → Real-time streaming-first architecture using Apache Kafka, Apache Flink, and vectorized AI grounding

Confluent shifted from traditional batch and static reporting approaches to a real-time streaming architecture for customer intelligence, using Kafka for durable event ingestion, Flink for stream processing and enrichment, and vector search with LLM grounding for AI insights.

Source →
PartnershipConfluent
May 28, 2026

Okta

Okta SSO is integrated into CIH for authentication and access control, protecting customer data with single sign-on capabilities.

Source →
PartnershipConfluent
May 28, 2026

OpenSearch

OpenSearch is used for vectorization and similarity search to support AI grounding and retrieval in the CIH platform.

Source →
PartnershipConfluent
May 28, 2026

Google Cloud Vertex AI

Vertex AI is used for semantic embedding generation and grounded AI responses within the CIH architecture.

Source →
PartnershipConfluent
May 28, 2026

Crossbeam

Partner overlap data from Crossbeam is included in CIH to provide engagement and external signals for customer accounts.

Source →
PartnershipConfluent
May 28, 2026

Quartr

Earnings transcript updates and market engagement data from Quartr are integrated into CIH for external signal tracking.

Source →
PartnershipConfluent
May 28, 2026

CommonRoom

Customer engagement signals from CommonRoom are ingested into CIH for additional customer intelligence.

Source →
PartnershipConfluent
May 28, 2026

Jira

Field feedback and issue tracking data from Jira is integrated into CIH to surface support signals and customer escalation context.

Source →
PartnershipConfluent
May 28, 2026

Zendesk

Support signals from Zendesk including ticket creation/resolution events are streamed into CIH for customer intelligence and account context.

Source →
PartnershipConfluent
May 28, 2026

Salesforce

Customer signals from Salesforce are ingested into CIH as part of the multi-system data integration, centralizing customer data for GTM insights.

Source →
Feature LaunchConfluent
May 28, 2026

Exec Summary Doc Generator

An integrated capability within CIH that automates the generation of executive account summaries for key meetings, quarterly business reviews, and internal reviews.

Source →
Feature LaunchConfluent
May 28, 2026

Whitespace View

An alternative Account Center layout in CIH that identifies expansion opportunities by showing data streaming platform (DSP) adoption and pipeline by product, helping sellers and customer success teams understand where customers are not yet using DSP capabilities.

Source →
Feature LaunchConfluent
May 28, 2026

AccountIQ

A generative AI capability within CIH that allows users to ask natural-language questions about customer data and receive contextual summaries grounded in CIH data, such as engagement summaries, support issue analysis, and usage growth patterns.

Source →
Feature LaunchConfluent
May 28, 2026

Customer Intelligence Hub (CIH)

An internal Confluent application providing a single prioritized view of customer accounts with change detection, AI-driven insights, and operational workflows. It centralizes customer signals from multiple systems (Salesforce, Zendesk, product telemetry, billing, Jira, CommonRoom, Quartr, Crossbeam) and surfaces real-time activity feeds, priority signals, and generative AI capabilities (AccountIQ) for GTM teams.

Source →
ArchitectureDagster
May 28, 2026

Manual orchestration of Snowflake Dynamic Tables → Virtual assets with Dagster freshness sensors

Shift from manual handling of Snowflake Dynamic Tables as black-box objects to modeling them as virtual assets in Dagster's asset graph with automated freshness monitoring and downstream triggering.

Source →
PartnershipDagster
May 28, 2026

Snowflake

Dagster integrates with Snowflake to provide orchestration, lineage, automation, and cost visibility across data platforms. The partnership enables native SQL-defined assets and cost attribution through Dagster+ Insights.

Source →
Feature LaunchDagster
May 28, 2026

Dagster+ Insights Cost Attribution

A feature in Dagster+ Insights that attributes Snowflake query costs directly to assets that incurred them, providing cost visibility across the entire Snowflake footprint including direct assets and dbt models.

Source →
Feature LaunchDagster
May 28, 2026

Dagster Freshness Sensor for Dynamic Tables

A sensor pattern that monitors Snowflake Dynamic Tables' last_completed_refresh and triggers downstream asset runs only when new data has actually landed, preventing stale reads.

Source →
Feature LaunchDagster
May 28, 2026

Dagster Virtual Assets for Dynamic Tables

Virtual assets feature that integrates Snowflake Dynamic Tables into Dagster's asset graph with is_virtual=True, allowing Dagster to track and orchestrate downstream dependencies without managing the actual materialization.

Source →
Feature LaunchDagster
May 28, 2026

Dagster SQL Component

A new component that allows users to define Dagster assets directly from SQL files using simple YAML configuration, enabling teams to work natively in SQL without requiring Python for Snowflake pipelines.

Source →
Feature LaunchMonte Carlo
May 28, 2026

Monte Carlo + Snowflake Integration

Integration requiring zero additional SDKs that reads Snowflake's native system tables directly for agent monitoring without data leaving the customer account.

Source →
Feature LaunchMonte Carlo
May 28, 2026

Monte Carlo Agent Observability

Four-layer Trust Framework that monitors agent outputs, behavior, performance metrics like latency and token utilization, and the data context feeding agents in production.

Source →
ArchitectureLlamaIndex
May 27, 2026

Python wrapper around Node CLI → Native Rust implementation with Python bindings

Eliminated language wrappers in favor of a unified Rust core that propagates changes across all language bindings automatically.

Source →
ArchitectureLlamaIndex
May 27, 2026

Node/Typescript implementation → Rust core with language bindings (Node, Python, WASM)

Complete rewrite of LiteParse from Typescript to Rust to enable cross-platform deployment across multiple languages and runtimes including browser and edge environments, while eliminating hard Node.js dependency.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Complete rewrite of LiteParse in Rust enabling it to run natively across Rust, Node, Python, and WASM packages. The new version provides 5-100x speedup for small documents and 3x speedup for larger documents, while maintaining LLM-free PDF and document text extraction with layout preservation.

Source →
ArchitecturedltHub
May 27, 2026

Siloed context across pipeline stack (schema at ingest, joins in transform, lineage in orchestrator, runtime state in warehouse) → Unified context model where business context and data model are produced together from a single Python process and exposed to agents for reasoning

dltHub Transformation closes the context gap by consolidating schema knowledge, join structure, lineage, and runtime state into one agentic-readable model produced end-to-end.

Source →
ArchitecturedltHub
May 27, 2026

Traditional SQL-based transformation layer with fixed connectors and models that break when schemas change → Pythonic, schema-aware transformation layer (@dlt.hub.transformation decorator) running wherever the developer or agent is

Shift from decade-old transformation tools built for human-written pipelines to a new architecture designed for agent-written pipelines at scale, addressing the explosion from 5% to 91% agent-written pipelines in one year.

Source →
PartnershipdltHub
May 27, 2026

Claude, Codex, or Cursor

dltHub Transformation toolkit is designed to integrate with and run as slash commands within these AI code editors to enable AI-assisted data transformation and modeling.

Source →
Feature LaunchdltHub
May 27, 2026

dltHub transformation toolkit

A step-by-step workflow tool consisting of four slash commands that guide users from raw data through taxonomy, ontology, and canonical data model generation to final transformation code, designed to be used within AI code editors.

Source →
Feature LaunchdltHub
May 27, 2026

dltHub Transformation

A new transformation module entering public preview as part of dltHub Pro that turns raw data into clean business tables using agentic workflows. It includes a Python decorator (@dlt.hub.transformation) and a toolkit with slash commands (/annotate-sources, /create-ontology, /generate-cdm, /create-transformation) that work with Claude, Codex, or Cursor to automatically model data.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Updated version of LiteParse that runs everywhere, with improved document parsing capabilities.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Updated version of LiteParse with improved deployment capabilities to run everywhere.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse

Document processing tool with workflow capabilities, recently updated to v2.0 with broader deployment options.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Updated version of LiteParse with improved deployment and runtime capabilities.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

An updated version of LiteParse that runs everywhere, enabling broader accessibility for document parsing workflows.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

An updated version of LiteParse that runs across multiple environments, supporting document parsing and processing capabilities.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Version 2.0 of LiteParse with expanded capabilities and cross-platform support.

Source →
PartnershipCoalesce
May 27, 2026

Databricks

Coalesce is showcasing its platform integration on Databricks at the Data + AI Summit 2026, demonstrating AI-assisted pipeline scaffolding, governance, and quality controls built into the Lakehouse workflow.

Source →
Feature LaunchLlamaIndex
May 27, 2026

LiteParse v2.0

Local document parser running 100x faster with expanded language support (Typescript, Rust, Python, Node) and Edge support. Zero Python dependencies, processes documents locally for AI agent iteration.

Source →
ArchitectureMonte Carlo
May 27, 2026

UI-first product development approach → Agent-first product development approach with MCP tools as primary interface

Monte Carlo restructured its development methodology to build for AI agents first and humans second, designing MCP tools and skills before creating human user interfaces. This architectural shift enforces cleaner interfaces, structured inputs/outputs, and eliminates human-in-the-loop assumptions.

Source →
Feature LaunchMonte Carlo
May 27, 2026

Agent Observability Features

New agent-first capabilities including freshness monitoring, volume anomaly detection, schema change tracking, lineage tracing, and incident history access specifically designed for AI agents to verify data reliability before taking action.

Source →
Feature LaunchMonte Carlo
May 27, 2026

Monte Carlo MCP Tools and Skills

Monte Carlo built MCP (Model Context Protocol) tools and skills prioritizing agent-first design, enabling AI agents to check data table health and access institutional memory of data behavior, incidents, and resolution patterns.

Source →
Feature LaunchdltHub
May 26, 2026

Chat-BI

A semantic model-powered business intelligence chatbot that behaves like an analyst with business context rather than performing text-to-SQL guessing.

Source →
Feature LaunchdltHub
May 26, 2026

dltHub AI Workbench

An ontology-driven transformations toolkit that reverse-engineers SQL into draft ontologies, consolidates tables into canonical concepts, generates clean transformation layers, and powers Chat-BI capabilities for business users.

Source →
Feature LaunchdltHub
May 26, 2026

dltHub Transformations

A new feature that runs ingestion, transformation, lineage, and verification inside the same execution context using a Python decorator (@dlt.hub.transformation), enabling LLMs to reason about business data with full context and metadata continuity end-to-end.

Source →
ArchitectureConfluent
May 26, 2026

Static data warehouse-centric Customer 360 systems → Real-time, event-driven AI-native Customer 360 architecture with RAG and guardrailed generation

Traditional warehouse-centric Customer 360 architectures designed for periodic reporting are being replaced by AI-native real-time architectures that combine continuous event streams, unified customer profiles, RAG-based retrieval, and governed AI generation.

Source →
ArchitectureConfluent
May 26, 2026

Batch-based ETL with nightly updates and data warehouse architectures → Real-time event-driven streaming architecture with continuous event ingestion and live customer profile stores

Organizations are shifting from periodic batch updates (ETL once daily or every few hours) to continuous event-driven architectures that update customer profiles in real-time using streaming platforms like Kafka and stream processing with Flink.

Source →
Feature LaunchConfluent
May 26, 2026

Tableflow

Feature enabling conversion of Kafka topics to tables in a few clicks, supporting real-time analytics and data transformation for AI-driven use cases.

Source →
Feature LaunchConfluent
May 26, 2026

Streaming Agents

AI-powered agents that automate business processes by combining real-time customer data streams with generative AI and guardrailed response generation.

Source →
Feature LaunchConfluent
May 26, 2026

Confluent Intelligence

Real-time, context-aware AI capability enabling AI-powered Customer 360 architectures that combine streaming data with RAG and generative AI for personalized customer experiences.

Source →
Feature LaunchEstuary
May 26, 2026

Estuary JSON Unnesting Control

Estuary enables controlled JSON unnesting during the ingestion phase with field selection modes (Required only, Depth 1, Depth 2, Unlimited depth) that allow teams to define how deeply nested JSON fields are materialized into columns while preserving one row per original record.

Source →
Feature LaunchEstuary
May 26, 2026

Estuary History Mode

Feature that records every CDC event as it happens, creating a full timeline for audit trails that can be replayed for audits, investigations, and compliance checks when paired with Delta Updates on materialization.

Source →
Feature LaunchEstuary
May 26, 2026

Estuary Deployment Options

Three deployment models offered: Public cloud SaaS, Private deployment, and BYOC (Bring Your Own Cloud) for organizations requiring stricter compliance isolation and data residency controls.

Source →
Feature LaunchEstuary
May 26, 2026

Estuary CDC Pipeline Compliance Features

Compliance-ready CDC pipelines with built-in governance controls including RBAC with least-privilege access, prefix-based permission management, TLS 1.2+ encryption in transit, KMS-based encryption at rest, History Mode for audit trails, version-controlled YAML pipeline definitions, and auditable backfill/replay operations.

Source →
PartnershipLlamaIndex
May 26, 2026

Google

LlamaParse and LiteParse integrated with Google's new sandboxed Agents API to provide document processing capabilities for autonomous agents in the Google ecosystem.

Source →
Feature LaunchLlamaIndex
May 26, 2026

Financial Due Diligence Agent

Demo agent that ingests SEC filings and answers questions with exact citations highlighted on original PDF pages, built with ~600 lines of Next.js without vector database.

Source →
Feature LaunchLlamaIndex
May 26, 2026

Google Agents API Integration

LlamaParse and LiteParse integrated with Google's new sandboxed agent environment, enabling autonomous document processing in Google's agents.

Source →
Feature LaunchLlamaIndex
May 26, 2026

LlamaParse

Native HEIC file support added, allowing LlamaParse to parse Apple's default HEIC image format directly without conversion to JPEG first, ideal for enterprise file systems with iPhone photos.

Source →
ArchitectureLlamaIndex
May 26, 2026

Lexical search (grep-based) for agentic retrieval → Hybrid semantic and lexical search with document parsing layer

The article discusses a shift from relying solely on grep-style lexical search to a layered approach that parses unstructured documents into text using LlamaParse or LiteParse, indexes them for semantic search via embeddings, and preserves grep for specific exact-match use cases.

Source →
PartnershipCoalesce
May 26, 2026

Omni

Strategic integration enabling Coalesce Catalog to ingest and govern Omni workbooks, queries, and dashboards alongside warehouse data, connecting the data transformation layer with the BI/analytics consumption layer for end-to-end lineage and governance.

Source →
Feature LaunchCoalesce
May 26, 2026

Omni Connector for Coalesce

New connector that catalogs Omni AI analytics platform assets (workbooks, queries, dashboards) directly in Coalesce Catalog, enabling centralized discovery, ownership visibility, and full-stack lineage connecting BI content to underlying data transformations and tables.

Source →
PartnershipLlamaIndex
May 26, 2026

Google (Gemini)

LlamaParse template integration with Google's sandboxed Agents API enabling document processing agents to leverage Gemini capabilities.

Source →
Feature LaunchLlamaIndex
May 26, 2026

LlamaParse Apple HEIC file support

Native parsing support for Apple HEIC image files with full latency metrics for Parse, Extract, and Classify jobs.

Source →
ArchitectureConfluent
May 25, 2026

Simple database-to-LLM scripts → Governed, event-driven architecture with streaming governance checkpoints

Evolution from basic database-to-LLM integration to a robust, multi-stage event-driven architecture with governance layers, PII masking, schema enforcement, and tokenization before vector storage.

Source →
ArchitectureConfluent
May 25, 2026

Batch RAG processing with manual document uploads → Real-time streaming RAG with Change Data Capture (CDC) and event-driven pipelines

Shift from batch RAG systems with periodic manual re-indexing to streaming-based RAG architecture using CDC and Kafka for real-time policy updates and continuous compliance enforcement.

Source →
Feature LaunchConfluent
May 25, 2026

Response Provenance & Audit Trail

Observability layer that creates immutable audit logs tracking every AI response to its exact source document, enabling forensic review and compliance verification.

Source →
Feature LaunchConfluent
May 25, 2026

Real-Time Governed RAG Pipeline

Event-driven streaming RAG architecture using Change Data Capture (CDC) and streaming platforms for real-time policy updates and automated document governance without manual re-indexing.

Source →
Feature LaunchConfluent
May 25, 2026

Confluent Stream Governance for Regulated RAG/GenAI

Stream Governance feature enabling reliable, discoverable, and secure data streams with PII filtering, field-level encryption, and RBAC controls for compliant RAG architectures in regulated sectors.

Source →
ArchitectureConfluent
May 25, 2026

Batch-processed static knowledge bases and point-to-point ETL → Real-time event streaming architecture with Change Data Capture (CDC) and Apache Flink stream processing

Shift from nightly batch jobs and static document uploads to continuous event-driven ingestion with real-time embedding updates and context synchronization for enterprise RAG systems.

Source →
Feature LaunchConfluent
May 25, 2026

Enterprise RAG with Real-Time Data Streaming

Production-grade AI architecture that connects Large Language Models to continuous, real-time streams of proprietary corporate data using event streaming for document ingestion, embedding updates, and context synchronization.

Source →
ArchitectureConfluent
May 25, 2026

Stateless or Limited Local Agent State → Shared Streaming State Layer with Continuous State Propagation

Architectural shift from hidden state inside individual agents/services to persistent shared state materialized and propagated through streams, enabling consistent multi-agent coordination.

Source →
ArchitectureConfluent
May 25, 2026

Batch and Scheduled Decision Cycles → Real-Time Sub-Second Autonomous Decision Making

Evolution from batch pipelines, scheduled workflows, and API-based orchestration to event-triggered agents operating continuously on real-time streams with millisecond decision latency.

Source →
ArchitectureConfluent
May 25, 2026

Reactive Event-Driven Systems → Agentic Event-Driven Systems

Shift from static, predefined reactive workflows to autonomous agentic systems with continuous decisioning, AI-driven reasoning, closed-loop feedback, and runtime adaptability without human intervention.

Source →
Feature LaunchConfluent
May 25, 2026

AI Tools for Builders (MCP Server & Agent Skills)

Open-source local MCP server, managed MCP server, and Agent Skills that give AI coding assistants direct access to the streaming platform with tools to act and domain knowledge to build.

Source →
Feature LaunchConfluent
May 25, 2026

Confluent Intelligence

Real-time, context-aware AI capability that provides intelligent decision-making and reasoning over streaming data within the Confluent platform.

Source →
Feature LaunchConfluent
May 25, 2026

Streaming Agents

AI-powered agents that automate business processes by continuously sensing events, reasoning over shared state, and taking autonomous actions in real-time through event-driven closed-loop systems.

Source →
ArchitectureLlamaIndex
May 22, 2026

Rigid templates and manually engineered parsing rules → Configuration-driven, machine learning-based parsing with adaptive generalization

Transition from fixed template systems requiring constant maintenance to configurable extraction logic that generalizes across evolving document ecosystems without requiring constant reconfiguration.

Source →
ArchitectureLlamaIndex
May 22, 2026

Isolated text extraction pipelines → Intelligent document workflows with reasoning capabilities

Shift from independent extraction steps toward coordinated systems capable of reasoning across multiple document types, applying validation logic, and managing uncertainty through confidence-based workflows.

Source →
ArchitectureLlamaIndex
May 22, 2026

Traditional OCR systems and character recognition → Layout-aware parsing with structured extraction and cross-document validation

Evolution from standalone text recognition toward integrated document understanding systems that preserve structural relationships, validate data across multiple documents, and support decision-ready outputs for insurance workflows.

Source →
ArchitectureLlamaIndex
May 22, 2026

Single extraction model applied across entire document package → Classify-extract-validate loop with adaptive model routing

Architectural change from applying one extraction model uniformly to implementing intelligent document classification, per-document-type model selection, and multi-stage validation loops.

Source →
ArchitectureLlamaIndex
May 22, 2026

Template-based OCR → Agentic OCR with layout-aware computer vision

Shift from static template-matching extraction to agentic reasoning-based document processing that classifies documents, adapts model selection per document type, and validates extracted values against adjacent document data.

Source →
PartnershipCoalesce
May 22, 2026

Anthropic (Claude)

Coalesce integrates with Claude's skill system to enable data engineers to build reproducible AI-powered workflows on top of Coalesce Transform, Catalog, and Quality MCPs.

Source →
Feature LaunchCoalesce
May 22, 2026

skill-creator

Claude tool that generates initial skill structures from natural language descriptions of data engineering workflows, providing best-practice templates for rapid skill development.

Source →
Feature LaunchCoalesce
May 22, 2026

data-ops-weekly-rota-triage Skill

A Claude skill that automatically triages data quality issues by scoring severity, downstream impact, and ownership status, then proposes Linear tickets with actions (create, skip, acknowledge, or flag for tuning).

Source →
Feature LaunchCoalesce
May 22, 2026

Claude Skills for Data Engineering

Reusable recipes (SKILL.md files) that enable Claude to perform data engineering tasks consistently, including weekly data quality reports, issue triage, and root cause analysis using Coalesce MCPs.

Source →
ArchitecturedltHub
May 21, 2026

Manual transformation code written by data engineers → AI-generated transformations from ontology specifications

Architectural shift toward agentic code generation where Python transformations, SQL queries, and data validation rules are automatically generated from structured ontology definitions rather than hand-written by engineers.

Source →
ArchitecturedltHub
May 21, 2026

Point-to-point ETL migrations (HubSpot → Attio with bespoke transformations) → Canonical Data Model (CDM) architecture with standardized entity definitions and bounded mapping work

Shift from direct source-to-destination migrations to using a canonical data model as a system-neutral common language. This enables reusable ETL stages (HubSpot → CDM → Attio) and reduces rework for future migrations to different destinations.

Source →
PartnershipdltHub
May 21, 2026

HubSpot

dltHub provides migration tooling to extract data from HubSpot CRM and transform it into other systems like Attio, handling complex business logic and compliance requirements.

Source →
PartnershipdltHub
May 21, 2026

Attio

dltHub enables migration from HubSpot to Attio CRM using agentic transformations. Attio API schema is used as input for transformation generation and the platform serves as the destination for the migration workflow.

Source →
Pricing ChangedltHub
May 21, 2026

dltHub pricing update

dltHub Pro pricing tier introduced with free 14-day trial and $30 in credits included for new users.

Source →
Feature LaunchdltHub
May 21, 2026

Agentic Data Engineering Course

Free course teaching AI-native data engineering workflows for data migrations and transformations, documenting the end-to-end methodology used in the HubSpot-to-Attio migration case study.

Source →
Feature LaunchdltHub
May 21, 2026

REST API Toolkit

REST API extraction toolkit that automatically scaffolds authentication, pagination, schema inference, and incremental loading. Integrates with MCP server (10,000+ configs) to pull API context and reuse existing pipeline configurations.

Source →
Feature LaunchdltHub
May 21, 2026

dltHub Pro AI Workbench

AI workbench for building ontologies, generating transformations, and managing data migrations with agentic transformations. Includes transformation toolkit that generates Python code from ontology definitions, handles schema mapping, GDPR filtering, and generates mock data for testing.

Source →
ArchitectureDagster
May 21, 2026

Pyright → ty

Dagster migrated its entire Python monorepo from Pyright to Astral's new type checker 'ty', reducing OSS type-checking CI time from ~15 minutes to 1-2 minutes while improving bug detection capabilities.

Source →
PartnershipDagster
May 21, 2026

Astral

Dagster adopted Astral's new Python type checker 'ty' for performance improvements in their CI pipeline, achieving 10x faster type checking and discovering real runtime bugs that Pyright missed.

Source →
ArchitectureLlamaIndex
May 21, 2026

Standard uniform OCR pipeline processing entire passport image identically → Agentic OCR with layout-aware zone segmentation, model routing per document element, and checksum validation

Shift from flat character extraction to zone-aware document processing that segments passports into MRZ, VIZ, photo, and hologram regions, routing each to appropriate models with checksum validation and cross-zone reconciliation.

Source →
ArchitectureLlamaIndex
May 21, 2026

Standard OCR (pixel-to-text conversion) → Agentic OCR (layout-aware computer vision with field-level validation)

Shift from traditional left-to-right text-stream OCR to layout-aware extraction that identifies discrete bounded fields, understands document structure before extraction, validates against coding formats, and provides field-level confidence scores for medical claims documents.

Source →
ArchitectureLlamaIndex
May 21, 2026

Traditional OCR (template-based, text-first extraction) → Agentic document parsing (layout-aware, structure-preserving AI-ready extraction)

LlamaParse shifts from rigid template-matching and character-level OCR to agentic parsing that understands document structure, tables, charts, and layouts differently based on content type.

Source →
ArchitectureFivetran
May 21, 2026

Dispersed, specialist-dependent knowledge management → Centralized, queryable knowledge base via unified data lakehouse

Fivetran unified institutional knowledge from multiple sources (Zendesk, Slab, Jira, GitHub, Google Drive, Gong, Salesforce, public docs) into a single data lakehouse, transformed with dbt, and made queryable via AI.

Source →
ArchitectureFivetran
May 21, 2026

Manual, ticket-by-ticket support model → AI-augmented, efficiency-first support engine

Fivetran shifted from a purely human-operated support model to an AI-augmented system where humans act as architects, governors, and escalation experts while AI handles ticket drafting, summarization, and analysis.

Source →
PartnershipFivetran
May 21, 2026

Salesforce

Fivetran pulls account data from Salesforce as part of the unified knowledge base for Support AI.

Source →
PartnershipFivetran
May 21, 2026

Gong.io

Fivetran ingests call recordings structured via Gong.io into the Support AI knowledge base.

Source →
PartnershipFivetran
May 21, 2026

Google Drive

Fivetran pulls product and technical documents from Google Drive for the Support AI knowledge base.

Source →
PartnershipFivetran
May 21, 2026

GitHub

Fivetran ingests pull requests and engineering content from GitHub into the Support AI knowledge base.

Source →
PartnershipFivetran
May 21, 2026

Jira

Fivetran pulls engineering issues and pull requests from Jira as part of the unified knowledge base for Support AI.

Source →
PartnershipFivetran
May 21, 2026

Slab

Fivetran ingests internal wiki content from Slab into their knowledge base for the Support AI system.

Source →
PartnershipFivetran
May 21, 2026

OpenAI

Fivetran uses OpenAI Agents to power the AI capabilities in their Support AI app, served via an MCP server.

Source →
PartnershipFivetran
May 21, 2026

Zendesk

Fivetran built a custom Support AI plugin embedded directly into Zendesk's platform to augment and automate support ticket workflows.

Source →
Feature LaunchFivetran
May 21, 2026

Fivetran Support AI app

A custom Zendesk plugin that embeds AI directly into the support ticket workflow, featuring Ask AI for conversational answers, Respond with AI for draft responses, Generate Summary for ticket handovers, and Find Similar Tickets for surfacing historical resolutions.

Source →
ArchitectureEstuary
May 20, 2026

MAR (Monthly Active Rows) billing model → Volume-based (per GB) billing model

Estuary moves away from Fivetran's MAR pricing approach, adopting a simpler volume-based model that charges per GB of data moved rather than counting row changes and activities.

Source →
Pricing ChangeEstuary
May 20, 2026

Estuary pricing update

Fivetran 2026 pricing update: inserts, updates, and deletes all count toward paid MAR (previously only inserts and updates counted); includes minimum per-connection fees; multiple updates within same month in history mode now count toward MAR.

Source →
Pricing ChangeEstuary
May 20, 2026

Estuary pricing update

Fivetran 2025 pricing update changed from account-wide volume discounts to per-connector MAR calculation, removing shared usage discounts and complicating forecasting for multi-connector setups.

Source →
Pricing ChangeEstuary
May 20, 2026

Estuary pricing update

Estuary introduces volume-based pricing model charging $0.50 per GB of data moved (both ingestion and materialization) plus connector fees ($100 for first 6, $50 for additional).

Source →
Feature LaunchEstuary
May 20, 2026

Estuary Free Tier

Free tier offering 10 GB of data per month with up to two connector instances and no time limit, plus a 30-day free trial of the full Cloud plan.

Source →
Feature LaunchEstuary
May 20, 2026

Estuary

Estuary offers volume-based pricing at $0.50 per GB of data moved for ingestion and materialization, plus $100 per connector for the first six and $50 for additional connectors, with a free tier including 10 GB/month and up to 2 connector instances.

Source →
ArchitectureFivetran
May 20, 2026

Proprietary cloud data warehouse storage with vendor-locked formats → Open Data Infrastructure with open file formats (Apache Iceberg, Delta Lake) in managed data lakes with commodity cloud storage

Shift from centralized data within proprietary cloud data warehouse storage to Open Data Infrastructure that separates storage from compute, enabling data to be stored in open formats within managed data lakes on commodity cloud storage, while CDW becomes a compute engine rather than the storage layer.

Source →
ArchitecturedltHub
May 19, 2026

Laptop-local data processing with manual pipeline uploads → Laptop-local development with managed cloud deployment via one-command deployment

Architectural shift enabling developers to build pipelines on local DuckDB instances and seamlessly deploy to production cloud data warehouses (Redshift, Snowflake) with integrated observability.

Source →
ArchitecturedltHub
May 19, 2026

Siloed data engineering tools (separate ingestion, transformation, orchestration platforms) → Unified LLM-native data engineering platform with shared context layer

Platform architecture consolidates ingestion, transformation, and deployment into unified platform with agent-readable context layer that flows across all workflows, replacing fragmented tool stacks.

Source →
ArchitecturedltHub
May 19, 2026

Traditional SaaS ETL platforms → AI-native dltHub Pro with agentic pipelines and managed dlt runtime

Shift from third-party SaaS ETL tools to custom-owned dlt pipelines orchestrated by AI agents and deployed on dltHub's managed infrastructure, enabling smaller teams to own end-to-end data stacks.

Source →
PartnershipdltHub
May 19, 2026

Cursor

Integration of Cursor IDE with native agent support for building and deploying dlt pipelines through dltHub Pro.

Source →
PartnershipdltHub
May 19, 2026

GitHub (Copilot/Codex)

Integration of GitHub Copilot/Codex as a native AI agent for building dlt pipelines through dltHub Pro.

Source →
PartnershipdltHub
May 19, 2026

Anthropic (Claude)

Integration of Claude as a native AI agent for building dlt pipelines, enabling code-first data engineering workflows.

Source →
PartnershipdltHub
May 19, 2026

MotherDuck

Partnership positioning dltHub Pro alongside MotherDuck's analytics capabilities for developers to build and test data pipelines locally with DuckDB.

Source →
PartnershipdltHub
May 19, 2026

Snowflake

Integration enabling dltHub pipelines to deliver data directly into Snowflake for financial institutions to transform raw data into governed analytics and AI-ready datasets.

Source →
Pricing ChangedltHub
May 19, 2026

dltHub pricing update

Free tier launched with 30 USD in usage credits provided at sign-up for 14-day trial period at app.dlthub.com.

Source →
Pricing ChangedltHub
May 19, 2026

dltHub pricing update

dltHub Scale pricing announced starting from $1,000 USD per month for mid-size companies with expanded features and multi-team collaboration.

Source →
Pricing ChangedltHub
May 19, 2026

dltHub pricing update

dltHub Pro subscription tier launched at $119 USD per month, including 50 USD in monthly credits for managed infrastructure runtime, with usage billed at $1 USD/hour beyond included credits.

Source →
Feature LaunchdltHub
May 19, 2026

PII redaction toolkit

Toolkit for identifying and redacting personally identifiable information from data pipelines.

Source →
Feature LaunchdltHub
May 19, 2026

Public sharing links

Feature enabling users to share dltHub projects and dashboards via public links for collaboration.

Source →
Feature LaunchdltHub
May 19, 2026

dltHub observability UI overhaul

Enhanced observability user interface for monitoring dlt pipeline execution and performance metrics.

Source →
Feature LaunchdltHub
May 19, 2026

Iceberg destination

Source-available destination feature enabling dlt pipelines to write to Apache Iceberg format.

Source →
Feature LaunchdltHub
May 19, 2026

MS SQL Change Tracking

Source-available feature for tracking changes in MS SQL databases as part of dltHub Pro capabilities.

Source →
Feature LaunchdltHub
May 19, 2026

dltHub Enterprise

Enterprise product offering with dedicated support, custom SLAs, and tailored deployment options for large organizations.

Source →
Feature LaunchdltHub
May 19, 2026

dltHub Scale

Mid-size company offering extending Pro with richer context layer including AI-native data catalog, ontologies, lineage, LLM wikis, multi-team collaboration, and operational agents for validation and pipeline health monitoring.

Source →
Feature LaunchdltHub
May 19, 2026

dltHub transformations

Transformation toolkit module enabling data scientists and analysts to validate, transform, and perform semantic modeling on dlt pipelines with ontology-based skills.

Source →
Feature LaunchdltHub
May 19, 2026

dltHub Pro

Claude/Codex/Cursor-native data engineering platform that deploys, monitors, and scales dlt pipelines. Includes AI Workbench, secrets management, local DuckDB workspace, OTEL telemetry, build agents for pipeline building and exploration, and managed runtime with observability and scheduling.

Source →
Feature LaunchLlamaIndex
May 19, 2026

LiteParse Citation System

Advanced citation matching system with layered search strategies (LiteParse searchItems, whitespace-flexible regex, currency/symbol stripping, alphanumeric matching) that locates cited text on pages and renders visual highlight overlays.

Source →
Feature LaunchLlamaIndex
May 19, 2026

LiteParse SEC EDGAR Integration

Feature enabling direct integration with SEC's EDGAR database to fetch and parse recent corporate filings by ticker symbol, with automatic HTML-to-PDF conversion support.

Source →
Feature LaunchLlamaIndex
May 19, 2026

Financial Due Diligence Agent

AI agent demo built with LiteParse that ingests SEC filings, searches across documents, and answers financial questions with precise citations and visual highlighting of source text on original PDF pages.

Source →
ArchitectureConfluent
May 19, 2026

Traditional reactive batch-driven fleet management systems → Real-time event-driven agentic architecture with autonomous agents and closed-loop feedback loops

Shift from reactive, manual, batch-processed fleet systems to proactive, autonomous, real-time agentic systems that continuously optimize operations using streaming data, ML inference, and decentralized agent coordination.

Source →
Feature LaunchConfluent
May 19, 2026

Confluent's AI developer tools (MCP Server & Agent Skills)

Open-source local MCP server, managed MCP server, and Agent Skills that provide AI coding assistants direct access to the streaming platform with tools to act on data and domain knowledge for building agentic systems.

Source →
Feature LaunchConfluent
May 19, 2026

Streaming Agents

AI agents that automate business processes with autonomous decision-making capabilities for fleet management, routing optimization, maintenance prediction, and dispatch operations through real-time event-driven architectures.

Source →
PartnershipConfluent
May 19, 2026

Windsurf

AI coding tool that supports Confluent Agent Skills for development workflows.

Source →
PartnershipConfluent
May 19, 2026

Cursor

AI coding tool that supports Confluent Agent Skills for development workflows.

Source →
PartnershipConfluent
May 19, 2026

Claude Code

AI coding tool that supports Confluent Agent Skills for development workflows.

Source →
Feature LaunchConfluent
May 19, 2026

CDC to Tableflow Agent Skill

Builds end-to-end CDC pipelines on Confluent Cloud from database source through Debezium, Flink, and Tableflow to Apache Iceberg or Delta Lake tables.

Source →
Feature LaunchConfluent
May 19, 2026

Python Kafka Client Agent Skill

Scaffolds production-ready Python producer/consumer projects with Schema Registry serialization configured for the target environment.

Source →
Feature LaunchConfluent
May 19, 2026

Kafka Streams Agent Skill

Designs, builds, and debugs Kafka Streams applications end-to-end, from topology design and pattern selection to troubleshooting production issues.

Source →
Feature LaunchConfluent
May 19, 2026

Schema Registry Agent Skill

Scans projects, extracts schemas from data models, tags PII fields, and generates Terraform to register schemas in Schema Registry for proper governance.

Source →
Feature LaunchConfluent
May 19, 2026

Agent Skills for AI Coding Tools

Domain-specific AI skill modules that package Confluent expertise for platforms like Claude Code, Cursor, and Windsurf. Includes four GA skills: Schema Registry, Kafka Streams, Python Kafka Client, and CDC to Tableflow.

Source →
Feature LaunchConfluent
May 19, 2026

Managed MCP Server (Confluent Cloud)

A read-only MCP server hosted directly in Confluent Cloud with zero configuration, providing tools for environment/cluster discovery, telemetry metrics, and connector troubleshooting across global and regional tiers.

Source →
Feature LaunchConfluent
May 19, 2026

Local MCP Server (Open Source)

An open source Model Context Protocol server that gives AI agents direct access to Confluent Cloud and local Kafka clusters, with tools to discover topics/schemas/connectors, build resources, manage configurations, and debug issues.

Source →
ArchitectureConfluent
May 19, 2026

Traditional Apache Kafka brokers with cluster sprawl and independent cluster management → Confluent Private Cloud with Intelligent Replication and broker-native multi-tenancy

Shift from managing multiple underutilized clusters toward consolidated infrastructure with shared physical clusters supporting multiple logical clusters, reducing operational overhead and infrastructure costs.

Source →
Feature LaunchConfluent
May 19, 2026

CPC Broker-Native Multi-Tenancy

Virtual Kafka cluster (LKC) running on shared physical clusters with strict namespace isolation, granular quota enforcement, self-service onboarding, fine-grained observability, and dedicated endpoints. Early Access and General Availability planned for later in 2026.

Source →
Feature LaunchConfluent
May 19, 2026

CPC Centralized Policy Enforcement

Gateway-based policy enforcement for encryption, governance, and client behavior, including Gateway Field Level Encryption, Gateway Payload Encryption, and Deep Schema Validation capabilities. Early Access opening later in 2026 with General Availability planned thereafter.

Source →
Feature LaunchConfluent
May 19, 2026

Confluent Private Cloud (CPC) with Intelligent Replication

Enhanced broker architecture optimized for total cost of ownership, delivering up to 73% fewer brokers while matching latency SLAs compared to Apache Kafka, with improved tail latency performance and predictable performance at peak saturation.

Source →
Architecturedbt Labs
May 15, 2026

Web search-based and training data-driven documentation retrieval for MCP agents → Native MCP server tools with direct integration to canonical docs.getdbt.com via search_product_docs and get_product_doc_pages

Shifted from inconsistent documentation retrieval methods (web search, training data, HTML rendering) to a native architectural solution that directly accesses canonical Markdown documentation through dedicated MCP server toolsets, ensuring agents have guaranteed access to current, authoritative docs.

Source →
Feature Launchdbt Labs
May 15, 2026

dbt Developer Agent - Product Docs Integration

Integrated product docs toolset into dbt's Developer agent experience within dbt platform and the Studio IDE, bringing canonical documentation closer to users in their native development environment.

Source →
Feature Launchdbt Labs
May 15, 2026

dbt MCP Server - Product Docs Toolset

Added a new 'Product Docs' category (the ninth toolset) to the dbt MCP server with two tools: search_product_docs for searching docs.getdbt.com with ranked results, and get_product_doc_pages for fetching full Markdown content of docs pages. Enables developers to access documentation directly within AI tools without context switching.

Source →
Feature LaunchLlamaIndex
May 14, 2026

Multimodal synthesis

Support for multimodal synthesis capabilities in llama-index-core, enabling processing of multiple media types in responses.

Source →
Feature Launchdbt Labs
May 13, 2026

dbt-core v1.12.0b1

Support partial parsing for function nodes. Add UnparsedMetricV2 for new-style YAML Semantic Layer Metrics. Allow defining function arguments with default values.

Source →
ArchitecturedltHub
May 13, 2026

Static column allowlist/SELECT allowlist → LLM-interpreted natural-language ontology with runtime column classification

A shift from maintaining static allowlists that become stale when schemas change to encoding data access policies as plain-English ontologies that are dynamically applied per column at build time, using LLMs to evaluate ambiguous cases through value inspection.

Source →
PartnershipdltHub
May 13, 2026

Anthropic

Integration with Claude (Claude Sonnet 4.6) LLM via the Anthropic API for runtime policy decision-making on column classification in the ontology-driven schema evolution feature.

Source →
Feature LaunchdltHub
May 13, 2026

dltHub AI Workbench - Ontology-driven data modelling toolkit

A toolkit within dltHub Pro that guides modeling decisions using ontologies during the transformation section of data pipelines, integrated with REST API ingestion, data exploration, and production deployment capabilities.

Source →
Feature LaunchdltHub
May 13, 2026

Ontology-driven schema evolution with LLM propagation

A feature that encodes data access policies in plain-English ontologies and uses LLM runtime evaluation to automatically classify columns as analytics-safe or reject them based on name patterns, data types, cardinality, and value inspection—enabling policies to adapt automatically when schemas change without code modifications.

Source →