You're most certainly wrong on this one. Superior models give superior products and security over time. Until every 3-6 months stops bringing a large improvement in coding capability and scaffolding, there's no reason to assume we are nearing a hard limit.
You also have to factor in that bespoke software is... bespoke. ie. much more suited to your org's use-cases than the primary solution is. Way less bloat. Way less vulnerability when you don't need an enterprise SaaS solution and instead can host on your private networks.
And as far as security considerations: Imagine you had a separate Opus 4.6 agent tasked with managing and monitoring and updating devoted to a specific slice of vulnerabilities. Of course this is highly inefficient, but it would take care of the vast majority of vulnerabilities that even enterprise SaaS have. This is simply a scaffolding issue at this point, not model ability. Scaffolding issues like this will continue to dominoe.
>Until every 3-6 months stops bringing a large improvement in coding capability and scaffolding, there's no reason to assume we are nearing a hard limit.
How much of that is better models, and how much is it AI companies throwing more resources at each one? E.g. larger context windows and higher token/s correlate with the better models.
I also suspect AI is going to make software more secure rather than less.
Even today it can probably find a lot of issues automatically. With basic knowledge of what to look for, it certainly helps in understanding data flow too.
Agreed. This sentiment you are replying to is a common one and is just people self-aggrandizing. No, almost nobody is working on code novel enough to be difficult for an LLM. All code projects build on things LLM's understand very well.
Even if your architectural idea is completely unique... a never before seen magnum opus, the building blocks are still legos.
Another heavily overengineered AND underengineered abomination. I'm convinced anyone who advocates for these types of tools would find just as much success just prompting claude code normally and taking a little bit to plan first. Such a waste of time to bother with these tools that solve a problem that never existed in the first place.
No to be fair I do see what he's saying. I see a major difference between the more expensive models and the cheaper ones. The cheaper (usually default) ones make mistakes all the damn time. You can be as clear as day with them and they simply don't have the context window or specs to make accurate, well reasoned desicions and it is a bit like having a terrible junior work alongside you, fresh out of university.
The cheaper models can't be taught or improved due to their inherit limitations, which makes it a huge pain to even try with even the simplest of tasks. Perpetually, no matter your instruction file(s).
I agree. The more expensive models I must admit have impressed me, but sometimes they take so long and are so expensive you might as well do it yourself. That being said if you're feeling particularly lazy there is now a "do it for me" button built into code editors, but until perhaps 2035 this technology is still somewhat pedestrian compared to what it could be in the future.
> The only people who use LLMs "as a tool" are those who are incapable of doing it without using it at all.
Do you mean that? It's clearly false, but I don't want to waste time gathering famous-person counterexamples if you already know it's a huge exaggeration at best.
It sure look like the US is transitioning from the former to the latter"
I honestly can't understand how anyone sees things this way. The US isn't transitioning at all. It is and has been a complete oligarchy for at least 70 years.
I've taken to discounting any political talking-head that wont consistently acknowledge and push that axiom as THE fundamental assumption of all political discourse.
Cancelling my account may be a small action but it is not pointless. Expressing my views and voting with my wallet is my right. Even your seemingly pointless question is a good reminder of the impact we can have - thanks!
You also have to factor in that bespoke software is... bespoke. ie. much more suited to your org's use-cases than the primary solution is. Way less bloat. Way less vulnerability when you don't need an enterprise SaaS solution and instead can host on your private networks.
And as far as security considerations: Imagine you had a separate Opus 4.6 agent tasked with managing and monitoring and updating devoted to a specific slice of vulnerabilities. Of course this is highly inefficient, but it would take care of the vast majority of vulnerabilities that even enterprise SaaS have. This is simply a scaffolding issue at this point, not model ability. Scaffolding issues like this will continue to dominoe.
reply