AI Coding Agents Won't Replace You. But They Will Replace How You Think About Code.
Last month, I watched a junior developer on my team scaffold an entire CRUD service in about 20 minutes using an AI coding agent. Models, routes, validation, tests. The whole thing. Six months ago, that would have been most of a morning. My first reaction was genuine excitement. My second reaction, about 30 seconds later, was dread. Not for his job. For the three hours we spent the next day debugging a subtle race condition the agent had introduced in the database layer.

That moment captured something I've been chewing on for a while: AI coding agents are real, they're useful, and the conversation around them is almost entirely wrong.
The Productivity Illusion
The headline numbers are seductive. GitHub reports that developers using Copilot complete tasks up to 55% faster. Every AI company has a stat like this. And I believe the numbers are directionally correct. If you're measuring time-to-first-commit or lines-of-code-per-hour, yes, agents make you faster.

But here's the thing nobody's saying about these stats: they measure output, not outcome.
I've shipped enough features to know that writing the code is rarely the bottleneck. Understanding the problem is. Designing the right abstraction. Navigating the existing codebase. Handling edge cases that nobody mentioned in the ticket. Reviewing for security implications. That's where real engineering happens. AI agents are, at best, mediocre at all of it.
A study by METR (an AI safety research organization) actually found that experienced open-source developers were slower when using AI coding tools on real-world tasks in familiar codebases. Not faster. Slower. The overhead of reviewing, correcting, and re-prompting the AI ate up more time than it saved.
This doesn't mean AI coding tools are useless. It means we're measuring the wrong things.
Where Agents Actually Help (and Where They Don't)
I'm not a doomer on this. I use AI coding tools every single day. But I've gotten specific about when they help and when they actively hurt.

Boilerplate and scaffolding? Let the agent rip. Need a new service following your team's existing patterns? A migration file? Unit tests for a pure function? This is grunt work that doesn't need deep contextual understanding, and the agent cranks it out faster than you can type.
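To make the "unit tests for a pure function" case concrete, here's a minimal sketch of the kind of thing an agent cranks out in seconds. slugify is a hypothetical function I made up for illustration, not anything from my codebase:

```typescript
// A hypothetical pure function -- exactly the kind of code an agent
// can generate and test with zero broader context.
function slugify(title: string): string {
  return title
    .trim()
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // collapse runs of non-alphanumerics into "-"
    .replace(/^-+|-+$/g, "");    // strip leading/trailing dashes
}

// Agent-style scaffolded checks: pure input to output, no mocks, no setup.
console.assert(slugify("Hello, World!") === "hello-world");
console.assert(slugify("  AI Coding Agents ") === "ai-coding-agents");
```

Nothing here requires knowing your traffic patterns or your org chart, which is precisely why the agent is good at it.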
Exploration is a strong use case too. When I'm evaluating a new library or trying to grok an unfamiliar API, I'll have an agent generate example usage. It's like a very fast, somewhat unreliable Stack Overflow that doesn't make you scroll past three paragraphs of someone's life story.
Mechanical refactors — renaming, extracting functions, converting callback-style code to async/await. Tedious transformations where the logic is straightforward but the surface area is huge. Agents handle these well.
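Here's the callback-to-async/await case in miniature, using Node's util.promisify. fetchUser is a made-up callback-style API standing in for whatever legacy code you're actually migrating:

```typescript
import { promisify } from "node:util";

// Hypothetical legacy callback-style API (illustrative, not a real library).
function fetchUser(
  id: number,
  cb: (err: Error | null, user?: { id: number; name: string }) => void
): void {
  if (id <= 0) return cb(new Error("invalid id"));
  cb(null, { id, name: `user-${id}` });
}

// The mechanical refactor: promisify once, then every call site goes from
// nested callbacks to linear async/await with ordinary try/catch.
const fetchUserAsync = promisify(fetchUser) as (
  id: number
) => Promise<{ id: number; name: string }>;

async function main(): Promise<void> {
  const user = await fetchUserAsync(42); // was: fetchUser(42, (err, user) => { ... })
  console.log(user.name);
}
main();
```

The transformation is the same at every call site, the logic never changes, and the surface area can be hundreds of files. That's the profile agents handle well.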
Now the bad parts.
Architecture? Terrible. Every time I've asked an agent to help design a system, it produces something that looks plausible on the surface and crumbles under real constraints. It doesn't know your traffic patterns. It doesn't know your team's operational capabilities, your compliance requirements, or the political reality of your org. Architecture is context, and context is exactly what these models don't have.
Security-sensitive code is where things get genuinely dangerous. I've seen agents generate auth flows with subtle vulnerabilities, database queries with injection risks that wouldn't show up in basic tests, API endpoints that expose data they shouldn't. If you're vibe-coding your auth layer, you deserve what's coming.
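Here's the injection risk in miniature. The values are hypothetical, and the db.query call in the comment is a generic driver sketch rather than any specific API. The point: interpolating input into SQL lets the input rewrite the query, while a parameterized query keeps data and structure separate.

```typescript
// Hypothetical user input -- imagine it arrived in a request body.
const name = "alice' OR '1'='1";

// UNSAFE: the input escapes the string literal and rewrites the WHERE clause,
// so this query now matches every row in the table. An agent will happily
// generate this, and it passes any test that uses well-behaved input.
const unsafeQuery = `SELECT * FROM users WHERE name = '${name}'`;

// SAFE: a parameterized query sends the SQL template and the values to the
// driver separately, so input can never alter the query's structure.
// Exact API varies by driver; this is the general shape:
//   db.query("SELECT * FROM users WHERE name = ?", [name]);

console.log(unsafeQuery); // note the injected OR clause
```

The nasty part is that the unsafe version works perfectly in a demo. It only fails when someone hostile shows up.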
The "Vibe Coding" Trap
Andrej Karpathy coined "vibe coding" earlier this year, and it stuck because it describes something real: developers accepting AI-generated code based on vibes rather than understanding. Does it look right? Does it run? Ship it.
For prototypes and weekend projects, vibe coding is genuinely fun. I've built small tools this way that I never would have bothered building before. The activation energy is so low that it unlocks a whole category of "not worth the effort" projects.
But for production systems? Vibe coding is technical debt on a payment plan with a 400% interest rate.
The most dangerous code is code that works but that nobody understands. AI agents are extraordinarily good at producing exactly this.
I've started noticing a pattern in codebases that lean heavily on AI generation: the code works, but there's no coherent design philosophy holding it together. Each file feels like it was written by a different person (because, in a sense, it was). No consistent error handling strategy. No unified approach to state management. No shared understanding of where module boundaries are. Functional but fragile.
The engineers who thrive with AI tools are the ones who read generated code critically, understand what it's doing at every level, and reshape it to fit the system's larger architecture. You need to be a strong engineer first to get real value from these tools. There's no shortcut around that.
What Actually Changes for Engineering Teams
Stripping away the hype and the backlash, here's what I think is genuinely shifting.
The cost of code is dropping. The cost of decisions isn't. Writing code is getting cheaper and faster. Deciding what to build, how to build it, where the boundaries should be? Still expensive. Still hard. Teams that confuse "we can produce code faster" with "we can ship good software faster" are going to learn some painful lessons over the next couple of years.
Code review matters more than ever. When a human writes code, the act of writing creates understanding. The author knows their intent, their tradeoffs, where the tricky parts are. AI-generated code has none of that implicit knowledge baked in. Review has to compensate. I've started asking my team to add comments explaining why they accepted certain AI suggestions, not just what the code does. It's an extra step. It's worth it.
Junior engineer development needs a serious rethink. This is the one that keeps me up at night. The traditional path — writing lots of code, making mistakes, building intuition through repetition — is being short-circuited. If a junior can generate working code without understanding it, they skip the struggle that builds the deep mental models senior engineers rely on. Pair programming matters more now, not less. We need to be way more intentional about how we grow people.
The highest-leverage skill is shifting from writing code to evaluating it. Can you look at a piece of generated code and quickly spot what's wrong? What's subtly wrong? What's going to break at 10x traffic? That's the skill that compounds.
What I'd Actually Tell You to Do
If you're a senior engineer, your value isn't diminished by AI agents. It's amplified. You're the person who looks at what the agent produced and says "this won't survive a traffic spike" or "this violates our data residency requirements" or "this is solving the wrong problem entirely." That judgment comes from years of building, shipping, and debugging real systems. No model has it.
If you're earlier in your career, my honest advice: don't lean on AI as a crutch. Use it as a sparring partner. Generate code, then rewrite it from scratch without the agent. Compare the two versions. Figure out why the agent made certain choices and whether they were actually good. The engineers who come out of this era strong will be the ones who resisted the temptation to let the machine do their thinking.
And if you're leading an engineering team, stop evaluating your team on velocity metrics that AI tools inflate. Lines of code, PRs merged, tickets closed. These are about to become meaningless as productivity signals. Measure what you should have been measuring all along: reliability, time-to-recovery, customer-facing quality, and whether your team actually understands the systems they own.
The teams that win over the next five years won't be the ones writing code the fastest. They'll be the ones that understand their systems the deepest. AI makes writing code trivially easy. It makes understanding systems no easier at all. Bet on understanding.
Photo by Rahul Viswanath on Unsplash.