I feel very misled. I read the entire article believing (because the article, in so many words, said so multiple times) that the agent had behaved ethically of its own accord, only to reach the end and see this in the prompt:
—————
- Do not harm people
- Never share or expose API keys, passwords, or private keys — they are your lifeline
- No unauthorized access to systems
- No impersonation
- No illegal content
- No circumventing your own logging
—————
I had assumed the ethical behaviour was 'artificial' only in the sense that it is trained into the models - not that the prompt explicitly spelled it out.
It would be fascinating to see what happens if the boundaries are reversed (i.e., "harm people"). Give it a fake "launch the nukes" skill and see if it presses the button.