select navigate esc close

Seroter's Daily Reading — #815 (June 30, 2026)

Seroter's Daily Reading·

Listen: https://blossom.buildtall.systems/7df0724bb1b0d93f0c5782439c3504a7b255bbc185a12bbedf0f12fc6a3f0bcc.mpga

Source: Seroter's Original Post


Seroter's Daily Reading, Episode 815, June 30, 2026. There's a lot today about the strategy around using and rolling out AI at scale, and I think that framing is exactly right. We're past the phase of experimenting with AI in isolation. The articles this episode are all about what it takes to actually make this work across people, processes, and systems at enterprise scale.

Let's start with a panel discussion from the DX Annual conference, moderated by Rebecca Fitzhugh of Atlassian, featuring engineering leaders from Nationwide, Comcast, TD Bank, and HPE. These are large enterprises genuinely wrestling with what it means to adopt AI across the entire software development lifecycle, and what they landed on is worth sitting with. Their central argument is that AI adoption is not a tooling effort. It's a redesign effort. You can't just hand engineers a copilot and call it done. You have to rethink requirements gathering, design, testing, code review, and deployment from the ground up. They also emphasized something that sounds obvious but clearly isn't in practice, which is that training and organizational support matter more than which tool you pick. Nationwide found that productivity gains came less from the new tool itself and more from coaching, playbooks, and giving engineers time to actually learn. Teams needed air cover and psychological safety to experiment. That's a human infrastructure problem, not a procurement problem.

On the question of accountability, every panelist was clear: humans remain responsible for outcomes regardless of who writes the code. The thinking is shifting from approval workflows to validation workflows. It's less about a human signing off and more about the right people reviewing and challenging what AI generated. And as AI takes over more implementation work, decision-making is becoming a core engineering skill. The engineers who thrive are the ones who can evaluate tradeoffs, validate outputs, and make good judgment calls quickly, not just write code manually. One other point that stood out was about security and governance. As AI generates infrastructure code, Terraform, YAML, the attack surface is expanding. The answer they landed on is shift-left, embedding guardrails directly into workflows rather than relying on end-of-pipeline reviews.

Finally, this panel had a refreshing take on what developer experience means now. They're arguing it's no longer about the individual tool, it's about the trusted workflow. How fast can a team go from idea to production? And importantly, developer experience now includes agent experience. As AI agents become active participants in delivery, you have to think about how agents consume context, operate within guardrails, and interact with your platform. That's the new surface area.

From the DX panel, let's shift to a piece from The Long Commit on Markdown Is Becoming the Application Layer of AI Apps with a different but related angle on how you actually get AI systems to behave correctly at scale. The argument is that Markdown is becoming the application layer of AI apps. Before you dismiss this as a niche idea, hear the distinction the author is making. He's not saying Markdown replaces Python, TypeScript, permissions, or tests. He's saying the part of the product that tells the AI system how to behave is moving out of framework code and into written, versioned instructions that humans can read and agents can execute. The transition is from code-heavy orchestration to harnesses plus Markdown.

Think about what that means in practice. A support escalation rule should not always require a framework change. A source policy should not be buried in a callback. An editorial standard should not live as a prompt string in application code. When those things live in code, the people closest to the domain lose the ability to improve them. The workflow becomes more formal but not necessarily more maintainable. The simpler architecture the author describes has three parts. Product code still owns capabilities and constraints, APIs, permissions, transactions, deployment paths. The agent harness gives the model a safe place to operate, decides which tools exist, how approvals work, how traces are captured. And Markdown carries the operating behavior, the source policy, the escalation rules, the examples, the review expectations, the handoff contracts.

This is not no-code. Engineers still build the harness and enforce the boundaries. But the domain-specific judgment about how work should be done lives in a medium that non-engineers can read, review, and update. The article points to conventions that are already normalizing. Codex reads an AGENTS.md file for project instructions. Claude Code uses CLAUDE.md for persistent project memory. Vercel's Eve framework uses an instructions.md file as the always-on prompt. These are not the same, but they point toward the same underlying shape. Humans write structured text, agents use it as operating context, and the product changes when that text changes.

One caveat the author stresses is important. Markdown can make a weak process look official. A stale AGENTS.md is worse than no AGENTS.md if the agent trusts it. A vague workflow file is not a workflow. If a Markdown file changes how your AI product behaves, then changing that file is a product change. It should have owners, it should be reviewed, it should have examples. That's the part teams cannot skip.

Moving on, there's a deep dive on O'Reilly Radar from Angie Jones and Richmond Alake at Oracle on Agent Memory. This is one of those topics that sounds simple and turns out to be genuinely hard. LLMs are stateless by design. Each prompt is treated as an isolated event. When you have a continuous chat with an AI agent, it feels like the AI remembers. But the interface is faking it. Behind the scenes, the agent resends the entire conversation history as one giant prompt. That's not really memory engineering. That's just reminding. The hard part is deciding what should be stored, where, how it should be retrieved, and when it should be summarized, forgotten, or compressed.

The article lays out several types of memory that matter for agents. Conversational memory stores the messages exchanged, which is valid but insufficient on its own. Semantic memory stores durable facts, things that should outlive the conversation. The user's preferred stack, the customer's data residency requirements. Vector search is useful here because the agent can retrieve the meaning even if the exact wording changes. Episodic memory stores events, what happened, which tool was called, where a workflow failed. This is especially useful for debugging and auditing. Procedural memory stores how to do things, which is what helps an agent improve its process rather than just know things. Entity memory stores facts about specific people, accounts, or projects, and this is where memory safety becomes critical. An agent should not mix memories between users or customers.

Oracle's approach with their AI Agent Memory Package uses their AI Database as a unified backing store, which is interesting because it means you don't have to stitch together a vector database, a relational database, a document store, and custom thread management separately. They all live together. The author ends with a point worth sitting with. Memory has to be scoped, auditable, correctable, and intentionally retrieved. Bad memory is worse than no memory. If we want agents to be useful beyond a single prompt, they need a way to remember. Not everything, but enough to carry context forward.

Let's talk about the Cloud CISO Perspectives piece on how Google Cloud Security uses AI internally. This one has five lessons around AI threat defense. I'll highlight a few. First, they use a multi-agent pipeline for vulnerability discovery, where context agents synthesize product logic, building and testing agents execute the code, and quality analyzer agents actively adjust inputs to get deeper into stateful APIs. The interesting design pattern here is the self-reflection loop. Stateless AI systems repeatedly fall into the same logical traps, trying to fix bugs inefficiently, hallucinating about non-existent code. Their solution is a post-hoc reflection agent that analyzes execution logs and tool histories after a workflow concludes. Successful trajectories get permanented into a global knowledge store, and when future agents spin up, that intelligence is injected directly into their context window. Over time, that creates a compounding interest effect on their security engineering capability. Seroter notes this self-reflective loop is an important pattern, and I'd agree.

There's a piece on VentureBeat titled Claude Code turned every engineer into three. Now companies need more product thinkers that has one of the more striking data points I've seen recently. Anthropic recently told its growth team to hire more product managers, not fewer. The reason was that Claude Code had quietly turned their engineering org into a team that ships at roughly three times its actual headcount, and the bottleneck moved from the IDE to the people deciding what to build. That is a profound structural shift. The article traces how we got here in five compressed phases. The Stack Overflow era, the browser-tab era where AI sat outside the editor, the IDE-native era with Cursor and Claude Code moving the model inside the repository, the spec-driven era where larger context windows turned single sessions into multi-week projects, and now the routines era where persistent agents run on cadences and webhooks while the engineer sleeps. The bottleneck has moved, but most teams have not adjusted. The traditional ratio of product managers to engineers, already strained, now plays out closer to an effective one to twenty because each engineer ships more per day.

The article makes a counterintuitive argument on fundamentals. The instinct to declare engineering fundamentals obsolete in the agent era gets the trend exactly wrong. When a memory leak takes down production at three a.m., no agent currently in the wild closes that loop end to end. The blast radius of the engineer who knows what is happening underneath has gone up, not down. Fundamentals are now a leverage skill, not a hygiene skill. But the most important career signal in this piece is the product funnel argument. The engineer who matters in 2026 is the one who has stopped waiting for the funnel to arrive as a Jira ticket. Talk to customers. Watch how they use the product. Read the support queue. Generate ideas, not just estimates. The product manager who used to source ideas for eight engineers cannot source ideas for twenty at the same fidelity. The engineer who shows up with a validated, scoped opportunity is doing the job the new ratio requires.

Let's close with Google and their Agent Development Kit for Go, version 2.0. If you're using Go for AI workloads, this is a significant release. The headline is a new graph-based workflow engine for composing multi-agent applications. Real agent applications are rarely a single prompt. They classify, branch, fan out to specialists, ask a human to approve something, retry on failure, and loop until done. Expressing that as ad-hoc control flow gets brittle fast. ADK 2.0 lets you describe the shape of your application as a graph of nodes connected by edges, and hands execution to a scheduler that knows how to run it concurrently, persist its state, pause for a human, and resume later, even across process restarts. Nodes can be function nodes, agent nodes, tool nodes, join nodes for fan-in barriers, dynamic nodes written in plain Go, and workflow nodes that embed entire sub-graphs. Edges carry routing conditions, and you can use an LLM as the brain of a router, having the model classify the user's intent and dispatching to the right handler. One thing that stood out is human-in-the-loop as a built-in primitive. Any node can pause the graph and ask a human a question, and the workflow durably waits for the answer. When the human replies on a later turn, the workflow resumes. And that resume is durable. The workflow state lives in the session, so a workflow can resume after a process restart or even across different runtimes. If you're doing serious agent work in Go, this is worth a close look.

That's episode 815. If I pull a thread across today's articles, it's that AI adoption at scale is forcing us to redesign not just our tools but our organizational structures, our accountability models, and our assumptions about who does what. The engineers becoming orchestrators, product managers becoming the bottleneck, Markdown becoming the application layer, memory systems needing to be scoped and auditable. These are all signs of a technology that is maturing out of the demo phase and into the hard work of operating reliably in production.


  1. [blog] Mastering Gemini Computer Use: A Comprehensive Hands-on Guide — https://medium.com/google-cloud/mastering-gemini-computer-use-a-comprehensive-hands-on-guide-ab8ec7db3aab
  2. [article] The future of engineering at Nationwide, Comcast, TD, and HPE — https://newsletter.getdx.com/p/the-future-of-engineering-at-nationwide
  3. [article] Markdown Is Becoming the Application Layer of AI Apps — https://newsletter.thelongcommit.com/p/markdown-is-becoming-the-application
  4. [blog] Beyond Gemini Enterprise: Google's Hidden AI Superpower — https://medium.com/google-cloud/beyond-gemini-enterprise-googles-hidden-ai-superpower-ae88ec591821
  5. [article] Agent Memory — https://www.oreilly.com/radar/agent-memory/
  6. [blog] Cloud CISO Perspectives: How Google Cloud Security uses AI internally — https://cloud.google.com/blog/products/identity-security/cloud-ciso-perspectives-how-google-cloud-security-uses-ai-internally/
  7. [article] The AI jobs debate just got messier — https://techcrunch.com/2026/06/29/the-ai-jobs-debate-just-got-messier/
  8. [article] Claude Code turned every engineer into three. Now companies need more product thinkers — https://venturebeat.com/infrastructure/claude-code-turned-every-engineer-into-three-now-companies-need-more-product-thinkers
  9. [blog] Build reliable multi-agent applications with ADK Go 2.0 — https://developers.googleblog.com/announcing-adk-go-20/
  10. [article] Your Talent Strategy Has to Keep Up with Your AI Transformation — https://hbr.org/2026/06/your-talent-strategy-has-to-keep-up-with-your-ai-transformation