More

NinjaTrance · 2026-04-07T19:09:45 1775588985

Interesting reading.

They are still focusing on "catastrophic risks" related to chemical and biological weapons production; or misaligned models wreaking havoc.

But they are not addressing the elephant in the room:

* Political risks, such as dictators using AI to implement opressive bureaucracy. * Socio-economic risks, such as mass unemployement.

jph00 · 2026-04-07T19:32:52 1775590372

Yeah this has always been the glaring blind spot for most of the "AI Safety" community; and most of the proposals for "improving" AI safety actually make these risks far worse and far more likely.

stratos123 · 2026-04-08T09:16:55 1775639815

It makes quite a lot of sense to focus on reducing the risks of every human everywhere dying, rather than the risks of already existing oppression getting worse.

jph00 · 2026-04-09T21:05:24 1775768724

No, you are deeply misunderstanding the issue. Creating a rivalrous good that powers fight over for control, then use violence to maintain control of, creating a global feudalism, is not "existing oppression getting worse". It actually makes the risks of every human everywhere dying far higher, and even if that doesn't happen, decreases global utility by a similar percentage (99%, instead of 100%). It could actually be worse, if average human utility becomes negative.

andrewstuart2 · 2026-04-07T20:10:32 1775592632

I'm getting flashbacks to the 2018 hit:

    This is extremely dangerous to our democracy

We evolved to share information through text and media, and with the advent of printing and now the internet, we often derive our feelings of consensus and sureness from the preponderance of information that used to take more effort to produce. Now we're now at a point where a disproportionately small input can produce a massively proliferated, coherent-enough output, that can give the appearance of consensus, and I'm not sure how we are going to deal with that.

lovecg · 2026-04-09T10:35:22 1775730922

This could have been written almost verbatim after the printing press came out and printed pamphlets became ubiquitous.

unglaublich · 2026-04-07T20:06:55 1775592415

> * Political risks, such as dictators using AI to implement opressive bureaucracy. * Socio-economic risks, such as mass unemployement.

Even Haiku would score 90% on that.

ronsor · 2026-04-07T20:33:28 1775594008

> Political risks, such as dictators using AI to implement opressive bureaucracy.

I think we're pretty good at that without AI.

dgellow · 2026-04-07T21:27:32 1775597252

It’s because that would be fairly speculative and cannot be measured. I don’t think that’s something that would make much sense in a system card. But Anthropic leadership does seem to communicate on that topic: https://www.darioamodei.com/essay/the-adolescence-of-technol...

astrange · 2026-04-07T21:19:00 1775596740

The unemployment rate in the US is whatever the Fed wants it to be, and isn't a function of available technology.

girvo · 2026-04-07T20:30:11 1775593811

They don’t care about those risks, because they’re unsolvable and would mean they wouldn’t make money/gain power.

dgellow · 2026-04-07T21:25:22 1775597122

Dario Amodei, CEO of Anthropic discusses all those risks in this essay: https://www.darioamodei.com/essay/the-adolescence-of-technol...

He seems to care quite a lot?

girvo · 2026-04-07T22:11:05 1775599865

Not enough to not do it, though. Actions, not words, and the actions are simple: they're building this while promising to wipe out entire industries.

NinjaTrance · 2026-04-06T17:35:40 1775496940

Considering the advances in software and hardware, I would expect that in 2 or 3 years.

And I hope we will eventually reach a point where models become "good enough" for certain tasks, and we won't have to replace them every 6 months.

(That would be similar to the evolution of other technologies like personal computers and smartphones.)

NinjaTrance · 2026-03-15T21:37:37 1773610657

> I use Claude Opus (4.5, 4.6) all the time and catch it making making subtle mistakes, all the time.

Didn't we make subtle mistakes without AI?

Why did we spend so much time debugging and doing code reviews?

> Are you really being more productive (let’s say 3x times more)

At least 2x more productive, and that's huge.

csto12 · 2026-03-15T21:57:21 1773611841

I think you’ve forgotten about the context of OP’s post. He said he uninstalled vscode and uses a dashboard for managing his agents. How are you going to be able to do code review well when you don’t even know what’s going on in your own project? I catch subtle bugs Claude emits because I know exactly what’s happening because I’m actively working with Claude, not letting Claude do everything.

turlockmike · 2026-03-15T22:01:54 1773612114

The code is still visible if i want to review it.

But since I have a strong rule about always writing unit tests before code, my confidence is a lot higher.

https://simonwillison.net/2025/Dec/18/code-proven-to-work/

csto12 · 2026-03-15T22:56:39 1773615399

>The code is still visible if i want to review it.

I agree that the test harness is the most important part, which is only possible to create successfully if you are very familiar with exactly how your code works and how it should work. How would you reach this point using a dashboard and just reviewing PRs?

jplusequalt · 2026-03-15T21:45:17 1773611117

Are you getting paid 2x more?

NinjaTrance · 2026-03-07T12:43:15 1772887395

Even as a principal engineer, there is an infinite number of things you don't know.

Suppose you get out of your comfort zone to do something entirely new; AI will be much more helpful for you than it is for people who spent years developing their skills.

AI is the great equalizer.

NinjaTrance · 2026-03-05T20:18:50 1772741930

The scary thing is that Amodei only opposes to domestic mass-surveilance.

He doesn't seem to care if the DoW uses his AI for international spying.

That's one more reason why Europe needs sovereign tech.

NinjaTrance · 2026-03-05T20:08:01 1772741281

"When our time traveler peered into the windows of these shops, the first thing he'd notice was how large all the watches were."

My only question about this entire essay is... where did this time traveler came from???

"Our" time traveler was never mentioned until this line.

badc0ffee · 2026-03-05T20:18:52 1772741932

Not true:

> The best way to answer that might be to imagine what someone from the golden age would notice if we brought him here in a time machine. [...] The first thing he'd notice, if he walked through a fancy shopping district, is that all the prominent watchmakers of the golden age seem to be doing better than ever.

zahlman · 2026-03-06T00:08:42 1772755722

Clearly, the time traveler went back in time to get inserted into a paragraph that GP overlooked the first time.

randallsquared · 2026-03-05T20:19:01 1772741941

It was several paragraphs before that, where pg said "[...] what someone from the golden age would notice if we brought him here in a time machine."

rixrax · 2026-03-06T09:08:26 1772788106

Although PG isn't wrong here, people also have larger wrists nowadays that are able to comfortably support larger watches compared to ~50 years a go, never mind at the 1940s or 1950s.

NinjaTrance · 2026-02-22T09:47:37 1771753657

To run Llama 3.1 8B locally, you would need a GPU with a minimum of 16 GB of VRAM, such as an NVIDIA RTX 3090.

Talas promises a 10x higher throughtput, being 10x cheaper and using 10x less electricity.

Looks like a good value proposition.

ac29 · 2026-02-22T16:34:19 1771778059

> To run Llama 3.1 8B locally, you would need a GPU with a minimum of 16 GB of VRAM, such as an NVIDIA RTX 3090

In full precision, yes. But this talaas chip uses a heavily quantized version (the article calls it "3/6 bit quant", probably similar to Q4_K_M). You dont even need a GPU to run that with reasonable performance, a CPU is fine.

lm28469 · 2026-02-22T10:30:03 1771756203

What do you do with 8b models ? They can't even reliably create a .txt file or do any kind of tool calling

joquarky · 2026-02-22T23:48:08 1771804088

Exploration, summarization, classification, translation

NinjaTrance · 2026-02-21T19:55:29 1771703729

The possibility that anyone can easily replicate any startup scares A16Z.

toomuchtodo · 2026-02-21T19:58:24 1771703904

This is what always confused me about VC AI enthusiasm. Their moat is the capital. As AI improves, it destroys their moat. And yet, they are stoked to invest in it, the architects of their own demise.

ironhaven · 2026-02-21T21:03:30 1771707810

Don't you have that backwards? If AI gets so good that it can replace all human labor, will capital like money and data centers be the only moat left?

georgemcbay · 2026-02-21T21:16:54 1771708614

> If AI gets so good that it can replace all human labor, will capital like money and data centers be the only moat left?

If AI gets good enough to replace all human labor then actual physical moats to keep the hungry, rioting replaced humans away will be the most important moats.

alfiedotwtf · 2026-02-22T07:14:31 1771744471

Did you see those Chinese robots from last week? I’m pretty sure they’ve got their moats covered

satvikpendem · 2026-02-21T22:44:34 1771713874

Which is bought by money in the first place, see billionaire doomsday bunkers. The poor will not have such a bunker.

acuozzo · 2026-02-22T06:44:36 1771742676

Unless they intend on generating their own oxygen to breathe, I don't see how these bunkers stand a chance.

satvikpendem · 2026-02-22T06:47:55 1771742875

Fortunately they do.

tartoran · 2026-02-22T21:04:23 1771794263

For how many weeks? Or months? Or years? Then what?

crazylogger · 2026-02-22T01:46:26 1771724786

Money is useful mostly for hiring human labor to outcompete others, e.g. Satya Nadella has 100K employees under his command, you don't, so you can't realistically compete with MS today - this is their main moat.

If AI renders human labor a cheap commodity (say you can orchestrate a bunch of agents to develop + market a Windows competitor for $1000 of compute), what used to be "Satya + his army vs. you" now becomes mostly a 1:1 fair fight, which favors the startup.

seg_lol · 2026-02-22T04:32:16 1771734736

Frankly, you have a pretty good chance of displacing windows right now. You should go for it.

toomuchtodo · 2026-02-21T21:23:22 1771709002

How powerful is the device you wrote this comment from? On prem or self hosted affordable inference is inevitable.

fullshark · 2026-02-21T22:01:04 1771711264

There’s no alternative, they can’t collectively freeze out all AI investment and force it to die.

SilverElfin · 2026-02-28T18:51:39 1772304699

I don’t know about that. I’ve looked at things like the rise of AI protects those who currently have capital. They won’t need labor or as much of it. So maybe it is what they want - to retain power permanently. Isn’t that the tech oligarch Curtis Yarvin fantasy - to replace democracy with themselves as a permanent ruling class?

themafia · 2026-02-21T20:03:41 1771704221

The incompetent have always pantomimed the competent. It never works. Although the incompetent will always pay a huge amount to try to achieve this fantasy.

TeMPOraL · 2026-02-22T11:34:37 1771760077

You're joking. Most startups are the incompetent. Throwing enough money at sales and marketing can make anything work.

NinjaTrance · 2026-02-21T19:45:59 1771703159

The irony is that the outage was caused by a change from the "Code Orange: Fail Small initiative".

They definitely failed big this time.

NinjaTrance · 2026-02-21T19:41:08 1771702868

Engineers have been vibe coding a lot recently...

jsheard · 2026-02-21T19:49:56 1771703396

The featured blog post where one of their senior engineering PMs presented an allegedly "production grade" Matrix implementation, in which authentication was stubbed out as a TODO, says it all really. I'm glad a quarter of the internet is in such responsible hands.

gtowey · 2026-02-21T20:55:22 1771707322

It's spreading and only going to get worse.

Management thinks AI tools should make everyone 10x as productive, so they're all trying to run lean teams and load up the remaining engineers with all the work. This will end about as well as the great offshoring of the early 2000s.

blibble · 2026-02-21T20:18:30 1771705110

there was also a post here where an engineer was parading around a vibe-coded oauth library he'd made as a demonstration of how great LLMs were

at which point the CVEs started to fly in

ranger_danger · 2026-02-21T22:39:20 1771713560

Matrix doesn't actually define how one should do authentication though... every homeserver software is free to implement it however they want.

Arathorn · 2026-02-22T00:09:31 1771718971

the main bit of auth which was left unimplemented on matrix-workers was the critical logic which authorizes traffic over federation: https://spec.matrix.org/latest/server-server-api/#authorizat...

Auth for clients is also specified in the spec - there is some scope for homeservers to freestyle, but nowadays they have to implement OIDC: https://spec.matrix.org/latest/client-server-api/#client-aut...

dana321 · 2026-02-21T19:53:48 1771703628

Thats a classic claude move, even the new sonnet 4.6 still does this.

bonesss · 2026-02-21T20:01:05 1771704065

It’s almost as classic as just short circuiting tests in lightly obfuscated ways.

I could be quite the kernel developer if making the test green was the only criteria.

allovertheworld · 2026-02-22T06:23:15 1771741395

Wait till you get AI to write unit tests and tell it the test must pass. After a few rounds it will make the test “assert(true)” when the code cant get the test to pass

dakiol · 2026-02-21T20:06:03 1771704363

No joke. In my company we "sabotaged" the AI initiative led by the CTO. We used LLMs to deliver features as requested by the CTO, but we introduced a couple of bugs here and there (intentionally). As a result, the quarter ended up with more time allocated to fix bugs and tons of customer claims. The CTO is now undoing his initiative. We all have now some time more to keep our jobs.

samrus · 2026-02-21T20:42:22 1771706542

Thats actively malicious. I understand not going out of your way to catch the LLMs' bugs so as to show the folly of the initiative, but actively sabotaging it is legitimately dangerous behavior. Its acting in bad faith. And i say this as someone who would mostly oppose such an initiative myself

I would go so far as to say that you shouldnt be employed in the industry. Malicious actors like you will contribute to an erosion of trust thatll make everything worse

sp00chy · 2026-02-21T20:53:45 1771707225

Might be but sometimes you don’t have another choice when employers are enforcing AIs which have no „feeling“ for context of all business processes involved created by human workers in the years before. Those who spent a lot of love and energy for them mostly. And who are now forced to work against an inferior but overpowered workforce.

Don’t stop sabotaging AI efforts.

samrus · 2026-02-22T00:56:43 1771721803

Honestly i kinda like the aesthetic of cyberanarchism, but its not for me. It erodes trust

tovej · 2026-02-21T22:14:59 1771712099

Forcing developers to use unsafe LLM tools is also malicious. This is completely ethical to me. Not commenting on legality.

samrus · 2026-02-22T00:54:48 1771721688

I dont like it either but its not malicious. The LLM isnt accessing your homeserver, its accessing corporate information. Your employer can order you to be reckless with their information, thats not malicious, its not your information. You should CYA and not do anything illegal even if your asked. But using LLMs isnt illegal. This is bad faith argument

tovej · 2026-02-22T11:49:52 1771760992

You're talking about legality again. I'm talking about ethics.

Using LLMs for software development is a safety hazard. It also has a societal risk, because it centralizes more data, more power, more money to tech oligarchs.

It's ethical to fight this. Still not commenting on legality.

hypeatei · 2026-02-22T13:23:36 1771766616

You're not forced to work there and use those tools. If you don't like it, then leave the job. Intentionally breaking things is unethical especially when you're receiving a paycheck to do the opposite.

tovej · 2026-02-22T17:00:41 1771779641

It may be illegal, but it's not unethical.

Doing unethical things because someone pays you would still be unethical. Opposing those while someone pays you is still ethical.

hypeatei · 2026-02-22T17:50:14 1771782614

Again, no one is forcing him to be there. He's breaking something on purpose. I think you should read up on ethics because this take "I don't like it therefore whatever I do is ethical" is juvenile.

tovej · 2026-02-23T01:34:47 1771810487

That's quite the strawman. The reason it's ethical is not that LLM's are unpopular or someone dislikes them. It's ethical because LLMs introduce safety hazards, i.e. they cause harm.

rixed · 2026-02-22T07:18:14 1771744694

Sounds like what an LLM would post if it were tasked to advertise LLM coding abilities. Nice manipulation of human emotions, well played.

renegade-otter · 2026-02-21T20:57:04 1771707424

I see someone is not familiar with the joys of the current job market.

hypeatei · 2026-02-21T20:48:15 1771706895

That's extremely unethical. You're being paid to do something and you deliberately broke it which not only cost your employer additional time and money, but it also cost your customers time and money. If I were you, I'd probably just quit and find another profession.

logicchains · 2026-02-21T20:22:42 1771705362

That's not "sabotaged", that's sabotaged, if you intentionally introduced the bugs. Be very careful admitting something like that publicly unless you're absolutely completely sure nobody could map your HN username to your real identity.