

AI Coding Agents Are Already Commoditized (seangoedecke.com)
Software engineer Sean Goedecke argues that AI coding agents have already been commoditized because they require no special technical advantages, just better base models. He writes: All of a sudden, it's the year of AI coding agents. Anthropic released Claude Code, OpenAI released their Codex agent, GitHub released its own autonomous coding agent, and so on. I've done my fair share of writing about whether AI coding agents will replace developers, and in the meantime how best to use them in your work. Instead, I want to make what I think is now a pretty firm observation: AI coding agents have no secret sauce.
[...] The reason everyone's doing agents now is the same reason everyone's doing reinforcement learning now -- from one day to the next, the models got good enough. Claude 3.7 Sonnet is the clear frontrunner here. It's not the smartest model (in my opinion), but it is the most agentic: it can stick with a task and make good decisions over time better than other models with more raw brainpower. But other AI labs have more agentic models now as well. There is no moat.
There's also no moat to the actual agent code. It turns out that "put the model in a loop with a 'read file' and 'write file' tool" is good enough to do basically anything you want. I don't know for sure that the closed-source options operate like this, but it's an educated guess. In other words, the agent hackers in 2023 were correct, and the only reason they couldn't build Claude Code then was that they were too early to get to use the really good models.
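The "model in a loop with a 'read file' and 'write file' tool" pattern the quote describes can be sketched in a few lines. This is an illustrative toy, not how Claude Code or any closed-source agent actually works; the tool names, message format, and `model` callable are all assumptions standing in for a real LLM API:

```python
def read_file(path: str) -> str:
    with open(path) as f:
        return f.read()

def write_file(path: str, content: str) -> str:
    with open(path, "w") as f:
        f.write(content)
    return "ok"

# The agent's entire toolbox: two file operations.
TOOLS = {"read_file": read_file, "write_file": write_file}

def agent_loop(model, task: str, max_steps: int = 10) -> str:
    # Conversation history the model sees on every step.
    messages = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        # `model` returns either a tool call ({"tool": ..., "args": ...})
        # or a final answer ({"content": ...}).
        reply = model(messages)
        if reply.get("tool"):
            result = TOOLS[reply["tool"]](**reply["args"])
            messages.append({"role": "tool", "content": result})
        else:
            return reply["content"]  # model decided it is done
    return "step limit reached"
```

Production agents add more tools (shell, search, edit), error handling, and context management, but the control flow is essentially this loop, which is the point of the "no moat" argument.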
Re: (Score:2)
GitHub already has plenty of "terrible code"...written by humans.
If this makes any sense to someone (Score:2)
Re: (Score:1)
From what I've read, it means that the agents are interchangeable.
I've been using Cline.bot's VS Code extension with various agents and have found that Claude Sonnet has been most reliable. It is getting somewhat expensive but works reasonably well.
Re: (Score:2)
You can use any AI coding tool. It will work as a hyper-autocompletion agent. It will do all the basic stuff which is often useful. None of them have solved the problem that sometimes they will put out awful code and you will have to think. They are all basically the same.
That means that there's no real benefit from going for a proprietary one and you should target the one which is most clear and open about how it works and where it gets its training data from.
"Agentic" means "work as a software component t
Re: (Score:2)
Reminds me of sed(1).
Although I don't recommend the special case of reading one file on one side and outputting the same file on the other:)
Re:If this makes any sense to someone (Score:5, Insightful)
It means "Must keep AI hype going! Must pretend it is the only true thing! Must make more money!".
It can safely be ignored as total bullshit of the marketing variant.
Fuck "good enough" (Score:1)
If you write "good enough" using agents, don't expect me to fix it for you when it goes horribly wrong.
Re: (Score:3)
The literal meaning of "good enough" is "enough to get paid and forget it".
No bearing at all on fitness for a purpose or need of fixing.
Re: (Score:2)
Thanks for saying that. "Good enough" is management-level bullshit that allows them to stretch "in progress" into "done".
Re: Fuck "good enough" (Score:1)
Re: (Score:3)
Hahah, yes. And remember that "good enough" looks massively different when you take security, usability and maintainability into account. I guess the next few years will get interesting for some enterprises that depend on software and quite a few will drown in a mountain of AI generated technological debt and die. Stupid people doing stupid things.
Re: (Score:2)
Perhaps then I'll come back to them and say "Sure, I can fix it, but to work with this mess my rate is 2k a day".
Re: (Score:3)
The point they all miss is that writing code which works was never the problem. Any junior dev can do it.
Software engineering always was about balancing tradeoffs, figuring integration points, ensuring long term maintainability, structuring for release and deployment, aligning design with roadmap, communication and collaboration, etc.
Maybe an AI can eventually get there, but your prompt will be way bigger than the code. I'd rather write the code.
For the rest, we already had cookiecutters and snippets.
Re: (Score:2)
You're absolutely right. In order to get anywhere near functional with AI, you need to be so specific in describing what you want it to produce that it undermines the whole undertaking. If you allow any degree of ambiguity, AI will do all it can to satisfy your reqs at the cost of anything else you've left unspoken - things that human beings take for granted but whose importance a computer cannot grasp.
I still get terrible results from "coding" agents (Score:5, Interesting)
I can only imagine what terrible code must be the norm in the places where the "coding agents" available today are considered "good enough".
Re: (Score:3)
You are not the only one. I am beginning to think that _everybody_ reporting great successes in this space is lying, deep in delusion, or only doing very simplistic code (and struggling to do even that by themselves).
Re: I still get terrible results from "coding" age (Score:2)
The thing that AI lets them get around is bad management. They suddenly don't have to hold a meeting about every little aspect of every little feature. They can just get AI to generate them some slop. For these types of management traps, it's going to lead to seriously accelerated production. But it's also going to lead to a lot of production failures, and then a return to slow processes.
Re: (Score:2)
On one hand, I agree with you that the quality is very poor. On the other, I do find AI tools (specifically, GitHub Copilot) useful and time-saving. I almost always have to fix what it generates, but it still saves me time.
Some specific areas of success include:
- SQL commands that manipulate XML or JSON database blobs
- XPATH or JSONPATH generation
- Converting jQuery ajax to async/await with fetch
- Generating unit test skeletons
In each of these cases, you do have to know what you're doing, and you have to be
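The XPath case above is easy to illustrate. The XML blob and the query here are made up for the example, but they are the kind of thing a model typically produces, and the kind you still have to check yourself (Python's stdlib `ElementTree` supports a limited XPath subset):

```python
import xml.etree.ElementTree as ET

# Hypothetical order blob of the sort stored in a database column.
xml_blob = """
<order id="1001">
  <items>
    <item sku="A1" qty="2"><price>9.99</price></item>
    <item sku="B2" qty="1"><price>24.50</price></item>
  </items>
</order>
"""

root = ET.fromstring(xml_blob)

# AI-suggested paths like these still need review: do they match
# nested items? what happens when an <item> has no <price>?
skus = [item.get("sku") for item in root.findall(".//item")]
prices = [float(p.text) for p in root.findall(".//item/price")]

print(skus)                    # ['A1', 'B2']
print(round(sum(prices), 2))   # 34.49
```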
Hahahaha, no (Score:2)
This is just more assholes portraying things they profit from as "inevitable", trying to keep the hype going and delaying the point where enough people realize that LLMs are not the revolution they are advertised as. You know, like in _all_ other AI hypes before.
And "just" better base models? Good luck with that. It is like saying space travel "just" needs a cheap and reliable FTL engine and we are all set. True, but meaningless.
On A Long Enough Timeline (Score:2)
This all fails. Everyone gets replaced with AI, no one has a job, no one has money to support companies.
It's just they get to screw us first.
We need an AI ban. This is not going to be a good thing for society. It already isn't. People are going to die because of bullshit decisions made by AI... likely already have. When the black box is making the decision and no one lets you look in the black box... is there really a black box?
We're all fucked. Congrats. It's only going to get worse before it gets better.
Re: (Score:2)
Your prediction is like predictions from the 20th century, that calculators would destroy the field of mathematics, or that chess computers would destroy the game of chess. Or more recently, that Google Maps would destroy people's ability to navigate, or that Waymo would obliterate Uber and taxi services. All these technologies impacted the various fields in significant ways, but did not destroy them.
As a daily user of AI, I know that we are a LONG way from being displaced by AI. And even to the extent that
I'll be honest (Score:2)
AI marketplace (Score:2)
The IDE I use offers 25 backend AI agents to pick from in a dropdown menu. All of them are either free or very cheap except for Claude Sonnet 4.0, which is reportedly the best but it burns through credits. All of them are probably operating at a loss.
Re: (Score:2)
The IDE I use offers 25 backend AI agents to pick from in a dropdown menu. All of them are either free or very cheap except for Claude Sonnet 4.0, which is reportedly the best but it burns through credits. All of them are probably operating at a loss.
The future will be locally run models.
The only problem is that, to promote cloud models, hardware capable of running local models is being slow-walked.
There is just so much consolidation and so many conflicts of interest that AI development is being done in an investment-friendly way rather than the most efficient way.
So it won't last (Score:2)
AI is a technology that is going to very quickly belong to a handful of super wealthy companies. The AI's dependency on training data guarantees market consolidat