Hacker News | gxs's comments

This is gross

It feels like we’ve been in the golden age and the window is coming to a close

Let the enshittification begin, I guess


How do you expect the spend & COGS for free LLM inference to be funded? For users who don't want to pay, or maybe can't pay?

Perhaps it’s a glib and easy thing to say, but after a teaser period, I would simply not offer free LLM inference. Agreeing to serve ads just completely re-aligns your interests away from providing the best possible user experience to something else entirely.

From things like defense/private contracts

e.g. colleges pay for institutional subscriptions


The average person doesn't benefit from defense contracts ... Like ever.

The average person is slightly more female than male and has 2.1 children, but they do benefit from defense contracts since it makes up a small percentage of their salary.

You are a fun person. We should be friends

It's been happening ever since they nerfed GPT-4 before releasing 4o

In the past month local models have been ramping up in a major way, while the namesake providers have raised prices, gone offline randomly, and started doing slimier and slimier things.

I really think the future is local compute. Or at least self hosted models.


The hosted ones still have the advantage of being able to search the internet for live info rather than being limited to a knowledge cutoff date.

I’m not sure why a model needs to be hosted in order to make network calls?

Is there a library of good tools for LLMs to call? I have to imagine the bot-detection avoidance mechanisms are a major engineering effort and not likely to work out of the box with a simple harness and random local LLM.

Even the hosted ones are blocked from searching certain sites, for example Claude is banned from searching Reddit:

`Error: "The following domains are not accessible to our user agent: ['reddit.com']."`


Tavily, Exa, Firecrawl, Perplexity, and Linkup are all tools for agents to search the web.

I’ve been building a harness for the past few months that supports them all out of the box with an API key.
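For a sense of what a harness like that does, here's a minimal sketch of a provider-agnostic search dispatcher. The endpoint URLs, request shape, and response shape below are hypothetical stand-ins (each provider's real API differs; check their docs), and the HTTP layer is injected as a callable so the dispatch logic can be exercised without network access:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class SearchResult:
    title: str
    url: str
    snippet: str

# Hypothetical endpoint map -- real paths, auth, and params differ per provider.
ENDPOINTS = {
    "tavily": "https://api.tavily.com/search",
    "exa": "https://api.exa.ai/search",
}

def search(provider: str, query: str, api_key: str,
           post: Callable[[str, dict], dict]) -> list[SearchResult]:
    """Dispatch a query to one provider; `post` performs the HTTP call."""
    if provider not in ENDPOINTS:
        raise ValueError(f"unknown provider: {provider}")
    raw = post(ENDPOINTS[provider], {"query": query, "api_key": api_key})
    # Normalize whatever shape the provider returns into one common record.
    return [SearchResult(r["title"], r["url"], r.get("snippet", ""))
            for r in raw.get("results", [])]

# Example with a stub standing in for a real HTTP POST:
fake_post = lambda url, body: {"results": [
    {"title": "Example", "url": "https://example.com", "snippet": "hi"}]}
hits = search("tavily", "local llm tools", "sk-fake", post=fake_post)
```

The point of the normalization step is that the rest of the agent only ever sees `SearchResult`, so swapping providers is a one-line change.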


Kagi also has an API. People who hate ads are probably the same folk that should be paying for Kagi. That's the sane alternative world where companies respect their users.

Oh, you got me so excited. I've had a Kagi sub for 3 years, but their API is still in closed beta. I guess I could (and should) reach out and ask for access.

be warned though:

firecrawl: "if you post content or intellectual property within the Services or give us Feedback about the Services, you hereby grant to us a worldwide, irrevocable, non-exclusive, royalty-free license to use, reproduce, modify, publish, translate and distribute any content that you submit in any form [...] You also grant to us the right to sub-license these rights"

exa: "Query Data is used to improve our products and technology, including by training and fine-tuning models that power our Services"

perplexity: "Perplexity may retain, copy, distribute and otherwise use Search Data for its lawful business purposes, including the improvement and development of products and services."

linkup: "Client grants Linkup a worldwide right to use, reproduce and modify the Client Data, including prompts, for the purposes of providing, maintaining, developing, training"

tavily: "we may use certain portions of your query data to improve our responses to future queries"..."We may share your query data with third-party search index providers (e.g., Google)"


If your volume is low enough, it should be pretty fine. It can just piggyback on your personal browser's Cloudflare cookies.

That's not how it works. Whether local or hosted, every modern model has a cutoff date for its training data, and can be leveraged by agents / harnesses / tools to fetch context from the internet or wherever.

Local ones that support tool use can do the same

You can do that locally too!
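To make the "local models can do this too" point concrete: local servers (llama.cpp, Ollama, vLLM) commonly emit OpenAI-style tool-call messages, and the harness's job is just to execute the requested tool and feed the result back. The sketch below uses a stub `web_search` tool and a hand-written model reply so it runs without a model or network; a real harness would loop this until the model stops requesting tools:

```python
import json

def web_search(query: str) -> str:
    """Placeholder tool; a real harness would hit a search API here."""
    return json.dumps({"results": [f"stub result for {query!r}"]})

TOOLS = {"web_search": web_search}

def run_turn(model_reply: dict) -> str:
    """Execute any tool call the model requested and return its output.
    `model_reply` mimics an OpenAI-style chat message."""
    calls = model_reply.get("tool_calls", [])
    if not calls:
        return model_reply.get("content", "")
    fn = calls[0]["function"]
    args = json.loads(fn["arguments"])
    return TOOLS[fn["name"]](**args)

# Simulated model output asking for a live search:
reply = {"tool_calls": [{"function": {
    "name": "web_search",
    "arguments": json.dumps({"query": "news today"})}}]}
out = run_turn(reply)
```

Nothing here depends on where the model runs; "hosted" search is a property of the harness, not the weights.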

What's the rough equivalent of a local model? Are we talking GPT-4?

Qwen 3.6, released this month, is large but still on the smaller end. Supposedly it's at about Sonnet level when configured correctly, and it can be run on commodity hardware without purchasing a data center. https://www.reddit.com/r/LocalLLaMA/comments/1so1533/qwen36_...

Then there are mid-size ones that require multiple GPUs and are comparable to GPT's latest flagships.

Then there is Kimi 2.6, a monster that is beating Opus in some benchmarks. https://www.reddit.com/r/LocalLLaMA/comments/1sr8p49/kimi_k2...

It's basically whatever you can afford. Any trash-heap laptop can run code-autocomplete models locally no problem. The rest require some level of investment: an idle gaming PC, or a serious outlay.


Depends on your VRAM or "unified" memory for how smart it is, and CPU/GPU for how quick it is.

128GB of RAM? Sure, roughly the early-to-mid GPT-4-era releases, except maybe 4o. And on an M5 Max, about the same speed.

I wouldn't really bother under 64GB (meaning 32GB or less) except for entertainment value (chats, summaries, tasky read-only agent things).


GLM 5.1 and DeepSeek 4 are acceptable, but the hardware and energy costs are such that, depending on your use case, you may as well just buy tokens. They get useless and stupid rapidly if you quantize them enough to run on a single 16-24GB GPU.

The arc of the technological universe is short, but it bends toward enshittification.

You could just always buy a cheap one on Amazon and then make a real investment if you like

If you look at the past 3-4 decades, China has just played their cards so well

If/when they overtake the US, all things aside, they deserve it. There is no world where the US overtakes China but there’s a world where China overtakes the US. Best outcome for the US atm is parity.

Just remarkable the things they’ve accomplished in the time they’ve accomplished them.


Always makes me wonder how people use their machine when I read comments like this

I’ve worked in big tech and fast growing startups, side by side at one point or another next to hundreds of nerds that love talking about hardware and software

The touchpad is almost universally loved - I have never once heard anyone complain about the click - most people didn't even notice the switch

It has 3D Touch and all that, and I've never gotten a false click - ever, not exaggerating - in however long they've been out

The only complaint I’ve ever heard more than once is that sometimes it takes a second to respond

So I ask you: how do you use your laptop? If no one else complains about this, it’s at least worth asking the question: what do you think you’re doing differently than everybody else?


Sure, I can tell you one thing that's different right now: I use third-party software to get a three-finger middle click. If Apple's operating system weren't missing basic features like the ability to middle click via the trackpad, I wouldn't have to do that and maybe wouldn't have this problem.

The difference is you can make full use of Google without logging in

Even with a throwaway, no chance I use OpenAI now - if/when Anthropic does this I'll be in a tough spot


you can use chatgpt without an account, just not all of it

and you can't make full use of Google without an account. for example, you need an account to upload to YouTube, manage your website in search, place ads, opt out of data usage. the list goes on


None of those examples are "run an internet search".

I don't understand. you can talk to chatgpt without an account, what's the difference?

both are a limited subset of what the companies offer, available for free


And you can also search on Google with an account, and your queries are stored for you to see, right? I'm pretty sure I can see a history of my searches.

This was a great comment, you challenged them but in a reasonable way and with really good questions

I wish public discourse were more this way - if someone is arguing in good faith, actually answering what they asked moves the conversation forward; it's just on the other person to give you a serious answer


Wow that was your takeaway?

> “2025 was the year when AI really started being useful for many different tasks,” said Terence Tao

I think I’ll go out on a limb and agree with Terence Tao, I think the dude is well known in the math community, or something


> go out on a limb and agree with Terrence Tao

Is AI his specialty?

> I think the dude is well known in the math community, or something

I believe this is called "appeal to authority." Which is why, instead of disagreeing with him, I suggested a more cogent endpoint that could be used to establish the facts the article's title suggests.


I think he means useful for mathematicians who are getting paid to shill for AI models


If anything, his simping for AI models makes me more suspicious of him than I ever was, because my own eyes show me their limits.


Any chance your eyes are wrong? Or only people who disagree with you are.


If I had to wager a lazy, armchair guess, I think it forces it to think harder/longer

The answer is probably more straightforward than we think, e.g. “the user thinks I can do this so I better make sure I didn’t miss anything”


I had a similar thought, though not as extreme, the second they started nerfing and filtering models

Their intentions were good, they always are, but the minute you decide to nerf something powerful for someone, it means someone out there has access to the full-blown, unnerfed version

Which means there are powerful people out there using AI in ways or for activities in which you will never be allowed to anyway

So yeah, this is just more of the same


Ha, so well known it has a name: Cunningham’s law

