Developer of 20+ years here; I can't give you an accurate multiplier, but I am faster.
Because spotting holes in specs has never been one of my strengths. And working without technical colleagues much of the time, it's a boon to be able to "rubber-duck" my ideas with something that is at least more intelligent than plastic.
Grabbing multipliers from thin air, the coding bit may only be 2x faster with a poorer-quality outcome, but working out what's needed is a good 5x faster.
And yes, I'm using the same adversarial AI MO as @wood_spirit, combined with Matt Pocock's excellent /grill-me and /grill-with-docs skills [1] and Plannotator [2] to review the plans.
I actually use LLMs a lot to rubber duck my problems and help develop plans. Then I manually code, to ensure my skills don't deteriorate. I feel like I'm a lot faster, with few of the downsides. Do you have any thoughts on this process?
If you can type code fast and accurately, it sounds like a great process to use. You're using LLMs for the bit where they bring great value, and yourself as a higher-quality coding agent :)
So that's what it is. Reading its README I thought it was another harness like Pi [1], but with built-in memory so it remembers what it learns, and gets more capable the longer it runs.
Like Letta [2], Dirac [3][4] and the other "more experimental harnesses that look interesting but I haven't had time to try out".
What's the privacy/data security like? I can't find that on that page.
Edit: found it.
> We may use your Content to operate, maintain, improve, and develop the Services, to comply with legal obligations, to enforce our policies, and to ensure security. You may opt out of allowing your Content to be used for model improvement and research purposes by contacting us at membership@moonshot.ai. We will honor your choice in accordance with applicable law.
Yup, they train on your inputs, and OpenRouter is complicit by claiming that Moonshot's ToS says they don't. I contacted OpenRouter about this a while ago and was met with silence, because it's bad for their business to stop lying about it.
> Stripe APIs being simple and easy is a meme from the 2010s. It isn't anymore.
I'm working with Stripe subscriptions at the moment for a charity taking donations via their website. The subtle differences between subscriptions done through Stripe Checkout and subscriptions set up yourself using Stripe Elements are by turns infuriating and frustrating.
The documentation is geared towards people using Checkout. Stripe's own AI help found us a bit of information that going through the documentation didn't give us, and even it struggled to find the reference for it in the docs.
One product, two different ways to use it, and slightly diverging feature sets between the two. Argh!
Or, more cynically, they reach their level of competence, get promoted one level further, and stay there to keep them from ruining the productivity of the people doing the work...
Dealing with Google is a nightmare. I'm one of the volunteer sysadmins for https://forum.buildhub.org.uk/, a DIY and self-build forum. For 10 years it ranked very well on Google, particularly in the UK, and then on 28 December 2025 it disappeared from Google's index.
Nothing has helped, the Google forums are tumbleweed and there's no one to reach out to for what could be an algorithm change or something gone wrong. I'm a paying Workspace customer and it's made me think I need a backup plan in case I'm ever suspended. Reports like this aren't encouraging.
> Nothing has helped, the Google forums are tumbleweed and there's no one to reach out to for what could be an algorithm change or something gone wrong.
The own-brand forums (Google, Microsoft, Apple) seem to be infested by netizens from lower-income countries trying to build online customer support portfolios by providing utterly useless answers.
That, or trying to game the system and getting shortlisted for a free trip to Google HQ for one of their contributor summits.
I am genuinely curious if anybody knows of a non-trivial problem being solved on one of these forums, at least for a huge company that's palming off customer support. It just feels like screaming into the void, only for someone to (deliberately?) misinterpret your question and give you some generic advice.
It depends on what you call "non-trivial". I found answers on how to circumvent dumb macOS bugs on Apple forums at least twice in the last 6 months. One related to displays: I was about to return a new USB-C monitor that wouldn't turn on. A silly issue, but it's a bug in my book, and I wouldn't have found the answer in the docs.
That counts! I suppose I’m lucky enough to know of more reliable resources (macadmins.org Slack is an excellent community), and so I turn to them after reading more than a couple of threads on the Apple Support Community. Perhaps it has improved or I never dig deep enough.
I’d be at a complete loss for any obscure Windows issue though.
Might be something specific to my and my colleagues' systems, but it breaks the TUI. It needs git authentication, which fails, and then the TUI stops accepting input reliably.
Can anyone enlighten me how having a coding harness when for most customers you say "we won't train on your code" helps you do RL? What's the data that they rely on? Is it the prompts and their responses?
It doesn't matter what your privacy setting is, with any savvy vendor. Your data is used for training by paraphrasing it first, and the paraphrasing makes it impossible to prove it was your data (it is stored at rest in paraphrased form). Of course the paraphrase keeps all the salient information, like your goals and your guidance to the bot towards the answer, even if it has no PII.
That's an interesting accusation there! You're essentially accusing every "savvy vendor" of large-scale fraud... Don't suppose you'd have any actual citations or evidence to back that up?
E.g., when a prompt had a bad result and was edited, or needed lots of back and forth to correct tool usage, that information can be distilled and used to improve models.
And if you focus on this for weeks, you can likely come up with other ideas for leveraging the metadata to improve model performance.
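A toy sketch of the idea above: fold edit/retry traces into (prompt, chosen, rejected) preference pairs, the usual shape for reward-model or DPO-style training data. All field names here are hypothetical, and a real pipeline would obviously do far more filtering and deduplication.

```python
# Toy sketch: distil "user edited the prompt / corrected the result"
# traces into preference pairs. Field names are made up for illustration.

def traces_to_preference_pairs(traces):
    """For each trace where the user rejected the first response and
    eventually accepted a final one, emit a dict with the prompt, the
    accepted answer as 'chosen', and the rejected answer as 'rejected'."""
    pairs = []
    for t in traces:
        if t.get("edited") and t.get("final_response"):
            pairs.append({
                "prompt": t["prompt"],
                "chosen": t["final_response"],
                "rejected": t["first_response"],
            })
    return pairs

traces = [
    {"prompt": "rename util fn", "first_response": "sed -i ...",
     "final_response": "used LSP rename", "edited": True},
    # Accepted first time round: no preference signal, so it is skipped.
    {"prompt": "add test", "first_response": "ok", "edited": False},
]
pairs = traces_to_preference_pairs(traces)
```

The point being that even with "we don't train on your code", the interaction metadata alone carries a usable training signal.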
1. https://github.com/mattpocock/skills
2. https://github.com/backnotprop/plannotator