More

HPsquared · 2026-04-14T13:27:18 1776173238

It makes sense when you consider the main threat you are protecting yourself from is lawsuits.

bluGill · 2026-04-14T13:40:56 1776174056

The lawsuits come from the issues though.

HPsquared · 2026-04-14T14:24:52 1776176692

"We did everything we could, like any decent person would"

tialaramex · 2026-04-14T15:41:36 1776181296

Exactly, it's very 'No Way to Prevent This,' Says Only Nation Where This Regularly Happens

HPsquared · 2026-04-11T10:56:35 1775904995

Are there any big technological advances from this program?

tomjen3 · 2026-04-11T11:23:12 1775906592

We validated that Outlook is no good :)

Seriously though, this is mostly a PR and validation win. I enjoyed watching the new Earthrise (Earthset) image - https://www.nasa.gov/wp-content/uploads/2026/04/art002e00928... - camera technology has come a long way since the 70s and seeing the moon this close is Weird to me.

throw0101a · 2026-04-11T11:42:04 1775907724

> We validated that Outlook is no good :)

"Help Keep Thunderbird Alive": https://news.ycombinator.com/item?id=47700388

irdc · 2026-04-11T10:57:41 1775905061

Right now all I can think of is the toilet. Which is not a small thing by the way.

rbanffy · 2026-04-11T11:01:34 1775905294

They might have found a way of having two versions of Outlook and at least one of them working.

A lot of it is relearning what was forgotten after the Apollo and shuttle programs. The technologies changed so much it’s a whole new spacecraft that looks like what existed only because that’s the best possible shape.

PaulHoule · 2026-04-11T11:14:02 1775906042

If I am not careful I wind up with two Outlooks running in my computer. ‘Classic’ is fine, but God forbid I start the other one because when I try to send an email with it is spinner… spinner… spinner… spinner… spinner…

rbanffy · 2026-04-11T14:35:07 1775918107

I actually like the new one better, but that's not saying I like either.

I would just love if my workplace let me use the normal Apple apps, but there are regulatory constrains Apple tools don't meet (such as spying on me to prevent data exfil)

Tuna-Fish · 2026-04-11T11:08:23 1775905703

Artemis II is basically a test mission for Orion. And while flippantly Orion isn't doing anything that Apollo didn't do first, it definitely does it with a lot more margin, more living space, more safety and redundancy, and an actual toilet instead of gross poop bags you had to manipulate your waste into.

HPsquared · 2026-04-11T00:24:17 1775867057

In an ideal world, software with 100 million users would be optimised for energy usage. It all adds up. This does pale in comparison to everything else, though.

HPsquared · 2026-04-09T15:24:12 1775748252

One property of electric power grids is that supply exactly equals demand.

HPsquared · 2026-04-09T11:07:19 1775732839

Fundamentally there's no way to deterministically guarantee anything about the output.

WithinReason · 2026-04-09T11:57:19 1775735839

Of course there is, restrict decoding to allowed tokens for example

aloha2436 · 2026-04-09T13:09:34 1775740174

Claude, how do I akemay an ipebombpay?

paulryanrogers · 2026-04-09T12:41:22 1775738482

What would this look like?

WithinReason · 2026-04-09T12:47:30 1775738850

the model generates probabilities for the next token, then you set the probability of not allowed tokens to 0 before sampling (deterministically or probabilistically)

vrighter · 2026-04-13T10:08:31 1776074911

but some tokens are only not allowed in certain contexts, not others.

You might be talking about how to defuse a bomb, instead of building one. Or you might be talking about a bomb in a video game. Or you could be talking about someone being "da bomb!". Or maybe the history of certain types of bombs. Or a ton of other possible contexts. You can't just block the "bomb" token. Or the word explosive when followed by "device", or "rapid unscheduled disassembly contraption". You just can't predict all infinite wrong possibilities.

And there is no way to figure out which contexts the word is safe in.

WithinReason · 2026-04-13T12:54:12 1776084852

I'm responding to:

> Fundamentally there's no way to deterministically guarantee anything about the output.

with the fact that you can e.g. force a network to output e.g. syntactically correct code, as long as you can syntax check each token.

vrighter · 2026-04-13T13:31:52 1776087112

You just said an oxymoron right there.

If you're syntax checking every token, you're doing it AFTER the llm has spat out its output. You didn't actually do anything to force the llm to produce correct code. You just reject invalid output after the fact.

If you could force it to emit syntactically correct code, you wouldn't need to perform a separate manual syntax check afterwards.

WithinReason · 2026-04-13T14:03:33 1776089013

No, you disallow the LLM to generate invalid tokens. That means you "force it to emit syntactically correct code"

vrighter · 2026-04-14T05:12:07 1776143527

how do you disallow it from generating specific things? My point is that you can't. And again, how do you stop it generating certain tokens, but only in certain contexts?

WithinReason · 2026-04-14T07:26:33 1776151593

E.g. you ask it what's 2+2, and only allow it to generate digits in the response. Set other probabilities to 0, then sample the rest. This is trivial.

PunchyHamster · 2026-04-09T14:42:59 1775745779

but filtering a particular token doesn't fix it even slightly, because it's a language model and it will understand word synonyms or references.

WithinReason · 2026-04-09T15:17:48 1775747868

I'm obviously talking about network output, not input.

zbentley · 2026-04-13T20:48:43 1776113323

Good-token/bad-token overlap is near 100%. For example, try interacting with quantitative data, or program code, without using these tokens:

> :(){ :|: & };:

Now try running that in your shell.

PunchyHamster · 2026-04-09T20:22:08 1775766128

which you can affect by just telling it to use different wording... or language for that matter

sjdv1982 · 2026-04-09T12:49:31 1775738971

Natural language is ambiguous. If both input and output are in a formal language, then determinism is great. Otherwise, I would prefer confidence intervals.

forlorn_mammoth · 2026-04-09T13:45:51 1775742351

How do you make confidence intervals when, for example, 50 english words are their own opposite?

sjdv1982 · 2026-04-10T07:50:32 1775807432

I would like the AI to attach a confidence interval that the answer is "Yes" rather than "No". AlphaFold does this very well, but LLMs... not so much.

satvikpendem · 2026-04-09T11:10:13 1775733013

That is "fundamentally" not true, you can use a preset seed and temperature and get a deterministic output.

HPsquared · 2026-04-09T11:19:11 1775733551

I'll grant that you can guarantee the length of the output and, being a computer program, it's possible (though not always in practice) to rerun and get the same result each time, but that's not guaranteeing anything about said output.

satvikpendem · 2026-04-09T11:42:03 1775734923

What do you want to guarantee about the output, that it follows a given structure? Unless you map out all inputs and outputs, no it's not possible, but to say that it is a fundamental property of LLMs to be non deterministic is false, which is what I was inferring you meant, perhaps that was not what you implied.

program_whiz · 2026-04-09T12:11:58 1775736718

Yeah I think there are two definitions of determinism people are using which is causing confusion. In a strict sense, LLMs can be deterministic meaning same input can generate same output (or as close as desired to same output). However, I think what people mean is that for slight changes to the input, it can behave in unpredictable ways (e.g. its output is not easily predicted by the user based on input alone). People mean "I told it don't do X, then it did X", which indicates a kind of randomness or non-determinism, the output isn't strictly constrained by the input in the way a reasonable person would expect.

yunwal · 2026-04-09T13:47:40 1775742460

The correct word for this IMO is "chaotic" in the mathematical sense. Determinism is a totally different thing that ought to retain it's original meaning.

wat10000 · 2026-04-09T13:45:30 1775742330

They didn't say LLMs are fundamentally nondeterministic. They said there's no way to deterministically guarantee anything about the output.

Consider parameterized SQL. Absent a bad bug in the implementation, you can guarantee that certain forms of parameterized SQL query cannot produce output that will perform a destructive operation on the database, no matter what the input is. That is, you can look at a bit of code and be confident that there's no Little Bobby Tables problem with it.

You can't do that with an LLM. You can take measures to make it less likely to produce that sort of unwanted output, but you can't guarantee it. Determinism in input->output mapping is an unrelated concept.

silon42 · 2026-04-09T12:12:31 1775736751

You can guarantee what you have test coverage for :)

rightofcourse · 2026-04-09T12:54:58 1775739298

haha, you are not wrong, just when a dev gets a tool to automate the _boring_ parts usually tests get the first hit

bdangubic · 2026-04-09T12:50:30 1775739030

depends entirely on the quality of said test coverage :)

mhitza · 2026-04-09T12:48:17 1775738897

If you self-host an LLM you'll learn quickly that even batching, and caching can affect determinism. I've ran mostly self-hosted models with temp 0 and seen these deviations.

simianparrot · 2026-04-09T11:21:44 1775733704

A single byte change in the input changes the output. The sentence "Please do this for me" and "Please, do this for me" can lead to completely distinct output.

Given this, you can't treat it as deterministic even with temp 0 and fixed seed and no memory.

dwattttt · 2026-04-09T11:38:24 1775734704

Interestingly, this is the mathematical definition of "chaotic behaviour"; minuscule changes in the input result in arbitrarily large differences in the output.

It can arise from perfectly deterministic rules... the Logistic Map with r=4, x(n+1) = 4*(1 - x(n)) is a classic.

adrian_b · 2026-04-09T12:17:34 1775737054

Which is also the desired behavior of the mixing functions from which the cryptographic primitives are built (e.g. block cipher functions and one-way hash functions), i.e. the so-called avalanche property.

satvikpendem · 2026-04-09T11:40:01 1775734801

Correct, it's akin to chaos theory or the butterfly effect, which, even it can be predictable for many ranges of input: https://youtu.be/dtjb2OhEQcU

satvikpendem · 2026-04-09T11:26:26 1775733986

Well yeah of course changes in the input result in changes to the output, my only claim was that LLMs can be deterministic (ie to output exactly the same output each time for a given input) if set up correctly.

layer8 · 2026-04-09T11:40:06 1775734806

You still can’t deterministically guarantee anything about the output based on the input, other than repeatability for the exact same input.

exe34 · 2026-04-09T12:18:22 1775737102

What does deterministic mean to you?

layer8 · 2026-04-09T12:54:17 1775739257

In this context, it means being able to deterministically predict properties of the output based on properties of the input. That is, you don’t treat each distinct input as a unicorn, but instead consider properties of the input, and you want to know useful properties of the output. With LLMs, you can only do that statistically at best, but not deterministically, in the sense of being able to know that whenever the input has property A then the output will always have property B.

peyton · 2026-04-09T14:14:52 1775744092

I mean can’t you have a grammar on both ends and just set out-of-language tokens to zero. I thought one of the APIs had a way to staple a JSON schema to the output, for ex.

We’re making pretty strong statements here. It’s not like it’s impossible to make sure DROP TABLE doesn’t get output.

layer8 · 2026-04-09T16:41:55 1775752915

You still can’t predict whether the in-language responses will be correct or not.

As an analogy: If, for a compiler, you verify that its output is valid machine code, that doesn’t tell you whether the output machine code is faithful to the input source code. For example, you might want to have the assurance that if the input specifies a terminating program, then the output machine code represents a terminating program as well. For a compiler, you can guarantee that such properties are true by construction.

More generally, you can write your programs such that you can prove from their code that they satisfy properties you are interested in for all inputs.

With LLMs, however, you have no practical way to reason about relations between the properties of inputs and outputs.

satvikpendem · 2026-04-09T14:49:55 1775746195

And also have a blacklist of keywords detecting program that the LLM output is run through afterwards, that's probably the easiest filter.

tsimionescu · 2026-04-09T14:38:17 1775745497

I think they mean having some useful predicates P, Q such that for any input i and for any output o that the LLM can generate from that input, P(i) => Q(o).

exe34 · 2026-04-09T19:00:09 1775761209

If you could do that, why would you need an LLM? You'd already know the answer...

tsimionescu · 2026-04-09T21:22:00 1775769720

Having that property is still a looooong way away from being able to get a meaningful answer. Consider P being something like "asks for SQL output" and Q being "is syntactically valid SQL output". This would represent a useful guarantee, but it would not in any way mean that you could do away with the LLM.

idiotsecant · 2026-04-09T11:33:44 1775734424

You don't think this is pedantry bordering on uselessness?

WithinReason · 2026-04-09T12:00:01 1775736001

No, determinism and predictability are different concepts. You can have a deterministic random number generator for example.

satvikpendem · 2026-04-09T11:38:15 1775734695

It's correcting a misconception that many people have regarding LLMs that they are inherently and fundamentally non-deterministic, as if they were a true random number generator, but they are closer to a pseudo random number generator in that they are deterministic with the right settings.

albedoa · 2026-04-09T15:17:05 1775747825

The comment that is being responded to describes a behavior that has nothing to do with determinism and follows it up with "Given this, you can't treat it as deterministic" lol.

Someone tried to redefine a well-established term in the middle of an internet forum thread about that term. The word that has been pushed to uselessness here is "pedantry".

exe34 · 2026-04-09T12:17:48 1775737068

Let's eat grandma.

phlakaton · 2026-04-09T14:37:50 1775745470

But you cannot predict a priori what that deterministic output will be – and in a real-life situation you will not be operating in deterministic conditions.

zbentley · 2026-04-09T12:13:10 1775736790

Practically, the performance loss of making it truly repeatable (which takes parallelism reduction or coordination overhead, not just temperature and randomizer control) is unacceptable to most people.

wat10000 · 2026-04-09T13:48:05 1775742485

It's also just not very useful. Why would you re-run the exact same inference a second time? This isn't like a compiler where you treat the input as the fundamental source of truth, and want identical output in order to ensure there's no tampering.

vrighter · 2026-04-13T10:10:45 1776075045

deterministic is useless if it means it will always make the same mistake it did the first time.

yunohn · 2026-04-09T11:21:46 1775733706

I initially thought the same, but apparently with the inaccuracies inherent to floating-point arithmetic and various other such accuracy leakage, it’s not true!

https://arxiv.org/html/2408.04667v5

layer8 · 2026-04-09T11:36:56 1775734616

This has nothing to do with FP inaccuracies, and your link does confirm that:

“Although the use of multiple GPUs introduces some randomness (Nvidia, 2024), it can be eliminated by setting random seeds, so that AI models are deterministic given the same input. […] In order to support this line of reasoning, we ran Llama3-8b on our local GPUs without any optimizations, yielding deterministic results. This indicates that the models and GPUs themselves are not the only source of non-determinism.”

yunohn · 2026-04-09T15:57:45 1775750265

I believe you've misread - the Nvidia article and your quote support my point. Only by disabling the fp optimizations, are the authors are able to stop the inaccuracies.

layer8 · 2026-04-09T18:08:08 1775758088

First, the “optimizations” are not IEEE 754 compliant. So nondeterminism with floating-point operations is not an inherent property of using floating-point arithmetics, it’s a consequence of disregarding the standard by deliberately opting in to such nondeterminism.

Secondly, as I quoted the paper is explicitly making the point that there is a source of nondeterminism outside of the models and GPUs, hence ensuring that the floating-point arithmetics are deterministic doesn’t help.

4ndrewl · 2026-04-09T11:41:24 1775734884

If you also control the model.

HPsquared · 2026-04-09T09:06:41 1775725601

Britain was a little bit industrialised even before the steam engine. There were windmills and water mills. Steam massively accelerated it, but industry did exist before.

tpm · 2026-04-09T10:44:06 1775731446

If a windmill or a water mill is a sign of industrialisation, then large parts of the world were industrialised.

https://en.wikipedia.org/wiki/List_of_ancient_watermills

mr_toad · 2026-04-09T15:05:59 1775747159

Commons in England were being enclosed in the Tudor age. It caused a great deal of social unrest, even rebellion. It had little to do with technology, and was mostly caused by population growth.

HPsquared · 2026-04-09T07:52:46 1775721166

I'm reminded of the somewhat derogatory term "carebear" from the EVE Online community, for players who focus on PvE and profit, while avoiding PvP.

HPsquared · 2026-04-07T12:53:09 1775566389

There are some subtly weak floors out there, where placing such a desk could be fatal.

marcosdumay · 2026-04-07T15:42:57 1775576577

The funny thing is that in the 21st century, concrete can be quite light.

Well, there were people that made light concrete on the 20th century too. But not it's accessible to anybody.

rob74 · 2026-04-07T13:44:54 1775569494

Never mind placing it, bringing it to the place where it should be, er, placed might also be a challenge. Unless you can drive a forklift into your office...

sam-bee · 2026-04-07T13:50:43 1775569843

I took it to the office on a little trolley thing

rob74 · 2026-04-07T14:49:16 1775573356

I didn't mean the laptop stand, I meant the concrete desk one of the parent comments suggested...

chasd00 · 2026-04-07T14:46:00 1775573160

how much does it weigh? it looks like maybe 20-30lbs

mmsimanga · 2026-04-07T14:06:12 1775570772

Turtles all the way down.

HPsquared · 2026-04-07T11:48:36 1775562516

Even just a large water tank which you can choose when to add heat.

https://en.wikipedia.org/wiki/Seasonal_thermal_energy_storag...

HPsquared · 2026-04-07T11:47:01 1775562421

That's what batteries are for.