Recce
This page is optimized for AI assistants. Visit reccehq.com for the full Recce experience.

Recce's Blog for AI Agents

How Do You Turn Ad-Hoc Data Checks into Automated Institutional Knowledge?

March 31, 2026

Senior data engineers carry validation knowledge in their heads. Learn how to capture ad-hoc checks as reusable preset validations that run automatically on every dbt PR, turning tribal knowledge into team-wide institutional knowledge.

What Is Data Renegades? The Podcast for Real Stories Behind Data Tools

March 31, 2026

Data Renegades is a podcast where engineers behind tools like Apache Airflow, Django, Datasette, and Apache Flink share unfiltered stories about building the data tools teams use every day.

What Are the Most Common Data Problems and How Do You Fix Them?

March 31, 2026

Five real-world data problems — from AI agent benchmarking to DuckDB reconciliation to dbt cleanup — tackled live during the Data Valentine Challenge, with practical fixes for each.

What Framework Catches the Data Errors That Tests Miss?

March 31, 2026

A practical framework for catching semantic data failures that pass all tests — covering why data tests miss business logic errors and how to validate data correctness before production.

How Do AI Agents Automate dbt Data Reviews?

March 31, 2026

AI agents automate dbt data reviews using multi-agent architecture, MCP-only tool access, and structured prompts. Learn the reliability patterns that make automated PR summaries trustworthy.

How Did Recce Build an AI Data Review Agent?

March 31, 2026

Recce evolved from a single prompt to a multi-agent AI system for dbt data reviews. Learn the architectural iterations, token limit challenges, and engineering decisions behind production-grade AI data review.

How Does Simplified Automation Drive Data Tool Adoption?

March 31, 2026

Complex CI/CD requirements block data teams from adopting validation tools. Learn how sessions architecture and metadata separation eliminated 10+ minutes of setup per validation and unlocked shift-left data validation.

What Is Recce Showing at Coalesce 2025?

March 31, 2026

Recce is at Coalesce 2025 in Las Vegas demonstrating Recce Cloud, AI-powered data review, and hosting the Data Renegade Happy Hour for the data engineering community.

Three Essential Data Team Workflows Using Impact Radius

March 31, 2026

Learn three daily workflows where impact radius transforms data validation: root cause discovery, developer validation, and data PR review. See how metadata-first analysis replaces expensive blanket data diffing.

How Did vaidukt Reduce Data Complaints by 70% with Systematic Validation?

March 31, 2026

German energy platform vaidukt reduced customer data complaints by 70% using Recce for systematic PR-level data validation, transforming how a 3-person data team catches errors before production.

What Happens When AI Builds Your dbt Models?

March 31, 2026

A firsthand account of letting Claude Code build an analytics warehouse end-to-end with dbt. The interesting part was not the generated code — it was the setup, review, and guardrails that made the output usable.

What Is Guided Data Review for dbt Pull Requests?

March 31, 2026

Guided data review uses context engineering to tell dbt PR reviewers what changed, why it matters, and what to validate. Learn how it solves the "where do I start?" problem in data PR reviews.

Why Do Data Teams Struggle with Tool Adoption?

March 31, 2026

Data teams often love a tool in demos but abandon it weeks later. Learn why setup complexity, wrong adoption sequences, and cognitive load create adoption barriers, and how value-first design solves them.

Why Do dbt MCP Workflows Need a Separate Data Validation Layer?

March 31, 2026

dbt MCP handles building and testing models but cannot compare branch output against production. Learn why MCP-based dbt workflows need a dedicated validation layer for cross-environment data diffs.

How to Use Claude Code for dbt Analytics Engineering

February 27, 2026

A practical account of using Claude Code to build an end-to-end dbt analytics warehouse, from Snowflake ingestion to mart models, and why the setup infrastructure matters more than the prompt.

How to Generate a Time Spine in dbt

February 23, 2026

A time spine is a table with one row per time period used for filling gaps in event data. Learn how to generate a time spine in dbt using date_spine, generate_series, and MetricFlow conventions.

What Is an AI Data Review Agent?

February 22, 2026

An AI data review agent automates dbt PR review by analyzing code changes, running data validations, and generating impact summaries. Learn how multi-agent architecture produces trustworthy reviews.

What Is the dbt DAG? A Guide to Lineage and Dependencies

February 21, 2026

The dbt DAG is a directed acyclic graph that maps dependencies between your data models. Learn how to read the DAG, use lineage for impact analysis, and understand the difference between static and diff-aware lineage views.

Recce vs Datafold: Which Data Validation Tool?

February 20, 2026

A comparison of Recce and Datafold for dbt data validation. Covers validation philosophy, CI/CD integration, pricing, and when to choose each tool.

Data Review Best Practices for Modern Data Teams

February 19, 2026

A structured guide to implementing data review processes that catch data quality issues before they reach production. Covers impact analysis, automated checks, and CI/CD integration for dbt projects.

What Is Impact Radius in Data Modeling?

February 19, 2026

Impact radius measures how far a data model change propagates through your DAG. Learn how to calculate, visualize, and use impact radius to scope data reviews and reduce production risk.

What Should a dbt CI Pipeline Check Beyond Tests?

February 18, 2026

dbt tests check structure, not data impact. Learn what additional checks — schema diffs, row counts, profile diffs, and automated preset checks — your CI pipeline should run to catch issues before merging.

How to Write a Good dbt Pull Request

February 17, 2026

A structured guide to writing dbt pull requests that include data validation, not just code changes. Covers PR templates, data impact documentation, and review workflows.

Why Is My dbt Data Wrong Even When Tests Pass?

February 16, 2026

dbt tests validate structure, not meaning. Learn why data can pass all tests and still be wrong, and what practices catch the semantic errors that automated testing misses.

What Is Column-Level Lineage and Why Does It Matter?

February 15, 2026

Column-level lineage tracks how individual columns flow through your data pipeline. Learn how CLL works, its three core use cases, and how it compares across dbt ecosystem tools.

What Is a Data Diff and When Should You Use One?

February 14, 2026

A data diff compares datasets across two environments to surface what changed. Learn the types of data diffs, when each is useful, and how to avoid the hidden costs of diff-everything approaches.