Hacker Newsnew | past | comments | ask | show | jobs | submit | ECCME's commentslogin

"Here is a challenge, designed to be unsolvable or so. We'll give you a bazillion dollars if you complete the challenge, and, in the meantime, we will use your attempts to train an as AI that will be worth the cost!!"


In the most charitable interpretation of this comment - I can understand the feeling, when so much of social media interactions are in the form 'It's post a picture of you as a baby, 10 year old, and current age!'. Those and many other instances can bring out excessive skepticism

But the people involved in this haven't signaled that they are in that path, either in the message about the challenge (precisely the opposite) or seemingly in their careers so far

So I guess I don't share the concern but a better way to phrase your comment could be -

"how can we be sure the human-provided solutions won't turn out to be just fodder for training a RL model or something that will later be monetized, closed and proprietary? Do the challenge organizers provide any guarantees on that?"


No, you missed the point. The striking thing about ARC is the puzzles are super easy, for humans. The average person solves 85% of the tasks, but the worlds best LLMs are only solving 5%. The challenge is to simply make an AI score as well as the average human.


[flagged]


Amazon Mechanical Turk workers, who might not be 100 average IQ but wouldn't be far off that.


Did you even try the puzzles? They’re not particularly “unsolvable”.


ARC-AGI: "here are some pretty simple puzzles, we'll give you a million dollars to solve them!"

Human: "They're quite challenging, this might be a trick to engage activity for the purpose of training models."

skrebbel: "You're stupid".


Did you try the puzzles?


No. What is the purpose of this competition? Unlikely that the reason for it is to pay out an enormous reward, right? Easy or not easy, the fortune is only rewarded to the system that solves the puzzles. The reward is too valuable to be given away easily. Ipso facto, solving the puzzles is deemed challenging by those who present the competition.


Are you writing this under every challenge with a monetary reward? The point of the challenge is that it is hard to do for an AI and easy for a human. Of course it is not easy to solve, that’s the point of the challenge. But the puzzle itself is not very hard.


Well you've got to get it. Google is running on a 2.x Linux kernel, with thousands of patches. The hypervisor runs on bare-metal, and it's not "debian". You need to start thinking about debian as just another virtual system and get with the program. If you're installing an operating system on bare metal, and it's not a bsd with jails, you may be doing it wrong. Or, maybe you have an offline developer's environment bare-metal install, debian, arch, whatever, a linux efistub is enough, and then host a passed thru network vm, in a container, with strong guest to host protections. You do you?


Hey guys, After openbsd and a pc engine apu, and always keeping in mind cosmic rays and that quote from the Google engineer when asked what he'd do differently, I've been seeking an ideal home Internet firewall/routing Device. The ISP where i am provides managed devices for customers, or they can also disable whatever kind of nat traversal is on, and provide a static address. With such an address, security is paramount, and, for example, after hours of troubleshooting why port forwarding wasn't working on a cheap and old router the isp technician warned me that i was letting the wolves in (::rollseyes::) and then turned off the NAT traversal complexity (+$5/mo), and the cheap Walmart router was immediately pwned. Wow. The new router is a Mac pro a1481, which must be like a honey pot for state actors, but is a great host for an openbsd VM guest, and an openwrt access point VM. Gotta have that 128GB of ECC on the 12-core system. Those ad/spam/ai filtering LLMs need some elbow room! :) Have at the IP, and (please!) let me if I'm being dense. Peace and love my digital friends.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: