SATURDAY, MAY 23, 2026 · MINED FROM 131 PODCASTS

May 23 · Declarative APIs cut 1,000 lines to 50

Good morning.Teams are aggressively constraining agent action spaces to guarantee deterministic outcomes.

Constraining coding agents to high-level declarative APIs reduces error-prone generation targets from 1,000 lines of raw PyTorch down to 50 lines of simple calls. Jure Leskovec reports this approach entirely eliminates subtle data science bugs like temporal information leakage during complex enterprise workflows.

Ship This Week

Exposing a custom Marimo linter to Claude 4 via a `uv` tool call allowed the agent to auto-heal, solving 60% of environment-specific syntax errors.

agents

Environment-specific agent linters

Build custom linters for your specific environment to let agents auto-heal instead of relying on complex prompt engineering.

▾ Show more ▴ Show less

Problem: Agents lack awareness of environment-specific constraints (like Marimo's cell structure), leading to broken code that prompts alone can't reliably fix.

On: custom linters for agent auto-healing

“what if we make a linter? And the whole point of the linter is it's going to be super Marimo-specific... that solved about 60% of all the problems.”

Recipe Create a highly specific linter for your environment. Expose it to the agent via a standard tool call (runnable via `uv` without installing a full virtual environment). When the agent makes an environment-specific error, the linter provides exact feedback, allowing the LLM to auto-heal its own mistakes.

Measured Evidence 60% of agent errors solved

Counterpoint Prompt engineering is brittle for environment-specific syntax; deterministic linters provide reliable auto-healing loops.

Agent-Harness.ipynb* · May 20, 2026

▶ Apple show ▶ Spotify

Claude 4Marimouv

May 23 · Declarative APIs cut 1,000 lines to 50

Environment-specific agent linters

RL over SFT for hallucination reduction

Environment-specific agent linters

RL over SFT for hallucination reduction

Declarative APIs for coding agents

Two-pronged admin and user agent architecture

State-aware meta-agent for workflow deduplication

Classification-only fine-tuning constraint

Multi-language formal verification via Strata

Training on unpublished negative data

Game manual in-context evals

Template-constrained generative UI

Emergent agent skill synthesis

E-values for anytime eval inference

Restricting LLMs to workflow orchestration

Semantic evals for design agents

GRPO for long-rollout agentic RL

High-density mid-training before post-training

Domain-specific RL for coding models

Fast models for exploratory flow

In-context learning for relational subgraphs

Accelerator-routed decode phase

Massive multi-agent security harness

Agents shift from monolithic models to specialized multi-agent harnesses

Continuous evaluation pipelines adopt anytime inference

High-density mid-training phases emerge before RL

Fine-tuning is for massive-scale classification only

LLMs cannot perform direct scientific reasoning

Fast models preserve exploratory developer flow

Strata

Canvas

Composer 2.5

OpenAI's Yann Dubois: Why AI Progress Suddenly Feels Real

Agent-Harness.ipynb*

Relational Foundation Models for Enterprise Data with Jure Leskovec - #768