This seems needlessly cynical. I don't think they said they never planned to release it.
They seemed to make it clear that they expect other labs to reach that level sooner or later, and they're just holding it off until they've helped patch enough vulnerabilities.
For a second there I read that as 'GTA 6', and that got me thinking maybe the reason GTA 6 hasn't come out all of these years is because of how dangerous and powerful it's going to be.
Yes, exactly. I talk about this in the article: I found that when Claude and Codex both review the same PR and both flag the same issue, our team fixes it 100% of the time.
They don't change the prices; they just reduce the amount of compute allocated: slower speeds and fewer tokens. They can tune everything in the background to optimize costs and returns, and the user never realizes anything has changed.
Sometimes they'll announce the changes and even try to spin them as improved service or increased value.
Local AI capabilities are improving at a rapid pace. At some point soon we'll have an RWKV or a 4B LLM that performs at GPT-5 level, with reasoning and all the bells and whistles, and hopefully that will shake out most of the deceptive and shady tactics the big platforms are using.
> They don't change the prices; they just reduce the amount of compute allocated: slower speeds and fewer tokens. They can tune everything in the background to optimize costs and returns, and the user never realizes anything has changed.
I can't imagine that this is the way it will go... Tokens haven't been getting cheaper for flagship models, have they? You already see something closer to their real cost if you compare e.g. the Claude subscriptions to their actual token pricing.
> Local AI capabilities are improving at a rapid pace. At some point soon we'll have an RWKV or a 4B LLM that performs at GPT-5 level, with reasoning and all the bells and whistles, and hopefully that will shake out most of the deceptive and shady tactics the big platforms are using.
Maybe, but LLMs are a scale game, and a data center will always be more capable than your local device. So you will always be getting a worse version locally. Or do you think LLMs in data centers will stop getting better and local LLMs will somehow catch up?
I have a very different experience. The Claude Code TUI is the worst TUI I have ever used. How is it possible that an idle TUI regularly eats 8 GB of RAM and has freezing and rendering issues?
If I wasn’t forced to use it I wouldn’t as there are better options available.
Thanks for sharing; this is the first time I've seen this. I wish they had expanded on exactly what mid-level might be missing rather than just saying "fundamentals" and "practical intuition".
I understand the impulse in this direction, but I'm not sure it would serve as much of a disincentive, as there would likely just be a highly paid scapegoat. Why not something more lasting and less easy to ignore, like compulsory disclosure of the model's source code (in addition to compensation for the victim(s))? Compulsory disclosure of the source would be a massive disadvantage.
The source code isn't where the money is; what you want is the training data. Force them to serve and make freely available all the data they stole to sell back to us, so that everyone and anyone can use it when training their own models. That might just be punitive enough.
The C-suite is only responsible when the company does good or stonks go up. When they do something bad, it's either: external market forces, the laws of physics, an uncertain macroeconomic environment, unfair competition, or lone wolf individual employees way down the totem pole.
Things have changed quite a bit in the past 30 years!
I encourage you to peek at their changelog (https://www.sudo.ws/releases/changelog/) for more insight into why this project is still under active development.
At the company I work for, they locked down installing extensions through the marketplace. Some are available, but most are not, and there is a process to get them reviewed and approved. You might still be able to sideload them, but I haven't cared enough to try.
This story sounds a lot like GPT2.