Hacker Newsnew | past | comments | ask | show | jobs | submit | Powdering7082's commentslogin

So by default it pulls out recent PRs, grabs only the tests that were commited and then checks to see how well an agent with different claude.mds can finish the test suite?

This excerpt is well worth reading IMO

Back when I first started doing these clean-up projects, I started by just picking up litter that was in my own neighborhood. (Because that was where I lived, and because I had never been to a lot of the other neighborhoods in my area.) But I found that the more that I did this kind of work, the more that I wanted to do it, and I eventually found myself going beyond my own neighborhood and into neighborhoods that I had never been to before. (Including the ones that I had always heard were "bad neighborhoods".)

Then to make things more interesting, I started using the city bus system for the first time, and I started making it a point to go someplace new that I had never been to before whenever I picked up litter. And after going through a big stack of monthly bus passes, and walking down just about every street in the city (and doing it alone and without a phone) I want to say that not only has nothing bad ever happened to me, but I've encountered a lot of strangers who were almost "too nice" to me...

Because these clean-up projects involve a lot of walking and lugging around heavy stuff, it seems that no matter where I go, strangers will keep pulling over to offer me a ride. And because I do these projects even during extreme weather, the more intense the weather gets, the nicer people will become. (During the summer on really hot days, strangers will keep pulling over just to ask if I'm going to be OK working outside in the heat and if they can go and buy some cold water for me, and sometimes people will even try to give me an umbrella or an extra coat on days when it's raining or snowing.)

And there were times when I would pick up a penny that was in the middle of road or stuck in a crack in the sidewalk, and I guess that it would give strangers walking by the impression that I must need money, and sometimes people would actually pull out their wallet and start trying to give me money!

Strangers will also come up and thank me for what I'm doing, and sometimes they will end up talking to me for a long time, and I've ended up meeting a lot of friendly people this way.

I have been shown such a good side of people, that it simply wouldn't make sense for me to go back to being fearful of strangers and automatically imagining the worst-case scenarios about them. (Like I tended to do back when I didn't get out much and my view of the outside world was being shaped by watching the News.)

I don't doubt that there is crime in my area. (After all, "littering" itself is a crime, and there are MILLIONS of examples of this crime in plain sight where I live.)

But because I have been doing these clean-up projects, I've spent more time outside and less time looking at a screen in the past few years than I have at any other time in my life. And I know that what I am about to say will probably sound crazy to anyone who did the exact opposite of that and who spent the past few years locked in their homes and being bombarded all day long by the media with stories about crime, riots, racism, sickness, and war, but I honestly have never felt safer going outside than I do today.

I started picking up litter in my neighborhood because I wanted to help make the world a better place, and because it got me to get out more and start to base my view of the outside world on my actual experience in the outside world, the world is a much better place to me now, and that is the priceless treasure that I found while picking up a zillion pieces of litter.


> started using the city bus system for the first time, and I started making it a point to go someplace new

That's a fun thing to do when you move cities, or countries.

I spent several weekends riding every single tram line in Helsinki with my son. We'd pick a number we'd not yet done and ride each both ways to the terminus.

Get out at the end of the line and see what was nearby, have a cake, then come back home.

We had a map from the local transport company and we'd put stickers on the lines we'd done, and the last stops.

A good way to see different neighbourhoods in the same city.


I mean they are being sued for insider trading, not exactly a bad headline maybe it could say alleged?

It seems extremely unlikely to me, a casual observer of the shit show that was Luna/Terra, that the suit would be successful


Yeah I loved the guests he had, but eventually had to stop listening to him


No erasure coding seems like a pretty big loss in terms of how much resources do you need to get good resiliency & efficiency


I was looking at using this on an LTO tape library, it seems the only resiliency is through replication, but this was my main concern with this project, what happens with HW goes bad


If you have replication, you can lose one of the replica, that's the point. This is what Garage was designed for, and it works.

Erasure coding is another debate, for now we have chosen not to implement it, but I would personally be open to have it supported by Garage if someone codes it up.


Erasure coding is an interesting topic for me. I've run some calculations on the theoretical longevity of digital storage. If you assume that today's technology is close to what we'll be using for a long time, then cross-device erasure coding wins, statistically. However, if you factor in the current exponential rate of technological development, simply making lots of copies and hoping for price reductions over the next few years turns out to be a winning strategy, as long as you don't have vendor lock-in. In other words, I think you're making great choices.


I question that math. Erasure coding needs less than half as much space as replication, and imposes pretty small costs itself. Maybe we can say the difference is irrelevant if storage prices will drop 4x over the next five years? But looking at pricing trends right now... that's not likely. Hard drives and SSDs are about the same price they were 5 years ago. The 5 years before that SSDs were seeing good advancements, but hard drive prices only advanced 2x.


Except that you don't own the things you buy on steam


That is true for all media purchases since the invention of copyright in 1662.

You think you own the Silmarillion because you have a paper copy? Hah! No, you have a transferrable license to read it.

Every hard copy movie you have starts with a big green FBI warning reminding you that having that disc does not means you own the movie, it means you have a transferrable license to play it for yourself and small groups on small screens.

Digital media with DRM allow content distributors to remove the "transferrable" part of the license if they want, which often allows them to sell for cheaper since they know that each sale represents only one person recieving the experience. The license comes with less rights (no transferrance), so it can be priced lower.


This is true. But it doesn't matter to me.

Most media for me is a one and done. A book, a movie, a computer game. Granted a computer games version of "done" might mean "played on and off for a year".

There are exceptions to this - books I read again, shows I'd watch again, but games seem to age poorly by comparison. Original Syndicate or Deus Ex - while playable - is not what I remember it to be and I'd rather keep the nostalgic memories than shatter them with a replay.

This rarity of exceptions means that I wouldn't lose much if my Steam account disappeared - mainly just "whatever I'm playing now". Create a new account and go again, or buy off GOG or something.

However in return for using Steam I get a lot of convenience - updates, propogated save files, easy chat and "Right click -> Join Game" with friends. That "Right click -> Join Game" is almost worth it on it's own for ease of social gaming.


Most people consume like this, but some like the warm fuzzies that hoarding gives them.


I would like to see change there for sure. That said, DRM is optional for publishers on Steam. Once you've downloaded a game without DRM (steam's or otherwise) you can back it up and play it without Steam.


This is true for all digital purchases, video games or otherwise.

There is no such thing as "owning" a game unless you're the company that developed the game (or bought the company that did).


With respect to winning a game of chess?


With respect to playing a game of chess.


Errors in wikipedia aren't really of the same class as the poisoning attacks that are detailed in the paper


Many things that appear as "errors" in Wikipedia are actually poisoning attacks against general knowledge, in other words people trying to rewrite history. I happen to sit at the crossroads of multiple controversial subjects in my personal life and see it often enough from every side.


Fnord


yeah, I'm still hoping that Wikipedia remains valuable and vigilant against attacks by the radical right but its obvious that Trump and congress could easily shut down wikipedia if they set their mind to it.


you're ignoring that both sides are doing poisoning attacks on wikipedia, trying to control the narrative. it's not just the "radical right"


Not to mention that there is subset of people that are on neither side, and just want to watch the world burn for the sake of enjoying flames.


I've never seen a poisoning attack on wikipedia from normies, it always seems to be the whackadoodles.


> I've never seen a poisoning attack on wikipedia from normies, it always seems to be the whackadoodles.

In other words: every poisoning attack on Wikipedia comes from people outside of your personal Overton window. [1] :-)

[1] https://en.wikipedia.org/wiki/Overton_window


very true. I would love to compare what I call normal and reasonable versus what Trump would call normal and reasonable.


> As of 2024, SpaceX's internal costs for a Falcon 9 launch are estimated between $15 million[186] and $28 million,[185] factoring in workforce expenses, refurbishment, assembly, operations, and facility depreciation.[187] These efficiencies are primarily due to the reuse of first-stage boosters and payload fairings.[188] The second stage, which is not reused, is believed to be the largest expense per launch, with the company's COO stating that each costs $12 million to produce.[189]

From wikipedia: https://en.wikipedia.org/wiki/Falcon_9#Pricing


Do yourself a favor and skip what Gary Marcus has to say on this and read the METR study it's self


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: