The problem is that the well you are drinking from has in fact been poisoned. Maybe you think you can tolerate it, but some projects are making a policy decision that any exposure is too dangerous, and that is IMO perfectly reasonable.
The fact that the AI agent will just go and attempt to do whatever insane shit I can dream up is both the most fun thing about playing with it, and also terrifying enough to make me review its output carefully before it goes anywhere near production.
(Hot take: If you're not using --dangerously-skip-permissions, you don't have enough confidence in your sandbox and you probably shouldn't be using a coding agent in that environment)
That Terraform blast radius is exactly the problem I'm building Daedalab around: agents need hard approvals, scoped permissions, and an audit trail before prod is even reachable. If you're curious: www.daedalab.app
Hot take indeed. Unfortunately it's too blunt an instrument. I can't express a policy like "you may search for XYZ about my codebase, but not W, because W is IP-sensitive". So, to retain Web Search / Web Fetch for when they're useful, all such tool uses must be reviewed to ensure nothing sensitive goes outside the trust boundary.
Yes, I'm aware this implies differing levels of trust for data passing through Claude versus through public search. It's okay for everyone to have different policies on this depending on specific context, use-case and trust policies.
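For concreteness, the kind of control that is expressible today (sketched here from Claude Code's settings format; the exact entries are illustrative, not a recommendation) is per-tool or per-domain allow/deny, which is exactly why it feels blunt. A `.claude/settings.json` like this can turn the tools off entirely or pin fetches to named domains, but it has no notion of "IP-sensitive topic":

```json
{
  "permissions": {
    "allow": ["WebFetch(domain:docs.python.org)"],
    "deny": ["WebSearch"]
  }
}
```

Anything finer-grained than this tool/domain granularity currently has to go through human review of each tool call, as described above.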
From the article, it sounds like that engineer did a lot of other reckless things even before handing the tasks over to the AI agent to continue the recklessness with even more abandon.
This is a case study in "if you don't know what you're doing, the answer is not just to hand it over to some AI bot to do it for you."
The answer is to hire a professional. That is, if you care about your data, or even just your reputation.
> before handing the tasks over to the AI agent to continue the recklessness with even more abandon
Which is the funny part of this: apparently the AI agent (Claude) tried to talk him out of some of the crazy stuff he wanted to do! Not only did he make bad decisions before invoking the AI, he ignored and overruled the agent when it flagged problems with the approach.
> Unless your strategy is to create a photo-lab-like screen in pure black and red, or wear deep-red-tinted glasses, it’s unlikely that a pure colorshift strategy will cut out that big of a chunk of the spectrum.
The writer dismisses this out of hand, but to me it sounds like a great idea.
> And I'm guessing that the reason macOS doesn't give more details is because macOS is likely not involved in the step that fails
And I guess because of the wide variety of third-party hardware macOS has to support, it's not practical to write a pre-flight check into the update process either.
I've never tried it myself, but it's oft-repeated folk wisdom in Apple circles that enabling filesystem case-sensitivity breaks all manner of third-party software that has only ever been tested on the case-insensitive default.
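The folk wisdom is plausible precisely because case-insensitivity is a property of the volume, not the OS, so software can't assume it either way. A hypothetical probe (my sketch, not anything Apple or any installer actually ships) would be to create a mixed-case file and see whether the lowercased name resolves to it:

```python
import os
import tempfile

def fs_is_case_insensitive(path=None):
    """Probe whether the filesystem under `path` resolves names case-insensitively.

    Creates a file with a mixed-case name in a temporary directory, then
    checks whether the all-lowercase spelling refers to the same entry.
    True on default (case-insensitive, case-preserving) APFS/HFS+ volumes;
    False on typical Linux filesystems or a case-sensitive APFS volume.
    """
    with tempfile.TemporaryDirectory(dir=path) as d:
        open(os.path.join(d, "CaseProbe"), "w").close()
        return os.path.exists(os.path.join(d, "caseprobe"))

print(fs_is_case_insensitive())
```

Software that only ever ran with this returning True is exactly the software the folk wisdom says will break.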
Are you hosted on cloud platforms that are SOC2 compliant? Or have you achieved and been audited for SOC2 compliance yourself? I'm going to have to assume it's the former, because if it were the latter you would say so directly. To me that kind of sleight of hand inspires distrust, which is fatal to any prospect of me evaluating the product.
Beyond that, a key risk that has come into sharper focus lately is data portability and vendor lock-in. At this point I do not deploy a new vendor without documenting the exit strategy.
The best exit strategy you can offer is an open-source, self-hostable version of the product with a simple migration plan. Some of the existing competitors in the enterprise chat space already offer this. Even if no one uses it, by offering it you keep your priorities aligned with your customers'.
I have not been as aggressive as GP in trying new AI tools. But the last few months I have been trying more and more and I'm just not seeing it.
On one project I tried recently, I took a test-driven approach: I built out the test suite while asking the AI to do the actual implementation. This was one of my more successful attempts, and it may have saved me 20-30% of the time overall - but I still had to throw out 80% of what it built, because the agent just refused to implement the architecture I was describing.
It's at its most useful if I'm trying to bootstrap something new on a stack I barely know, OR if I decide I just don't care about the quality of the output.
I have tried various CLI and IDE tools. Overall I've had the best success with Claude Code, but I'm open to trying new things.
Do you have any good resources you would recommend for getting LLMs to perform better, or for staying up to date on the field in general?