From the README at https://github.com/unslothai/unsloth: "Unsloth uses a dual-licensing model of Apache 2.0 and AGPL-3.0. The core Unsloth package remains licensed under Apache 2.0, while certain optional components, such as the Unsloth Studio UI are licensed under AGPL-3.0."
What do you mean by custom LMStudio license? Your employer requires reviews of proprietary EULAs or do you try to get a custom licensing deal from LMStudio?
Letting a few cold feet throw away your relationship with the US is absolutely just as stupid as Trump throwing away the US's relationship with Europe/whoever.
I think you can justify this logic only in the case you sincerely believe that the current admin is a fluke and things will return to roughly the previous status quo on the order of a few years. And that isn't unreasonable to think, but you might also want to have a backup plan.
I think it is very clear from the way all US allies have reacted to various provocations that we are taking a long term view. That is the reason we are still spying on our domestic populations for the US despite our reservations about the current executive and their actions.
No the US clearly believes they would be better off not part of the rest of the world, the best thing we can do is not to drown in that tantrum, and provide the economic embargo they clearly think will bring them prosperity.
Less so if the US is going to try to request current (prior?) allies to assist in a war against Iran which has already been declared 'won' and was recommended against by pretty much everyone outside of current participants.
I love how HN is loving this idea when it's the exact same thing Anthropic and OpenAi (and every other llm maker) did.
It's God's gift to them when it lets them bypass ads and dl copyrighted material. But it's Satan's curse on humanity when the Zuck does it to train his llm and dl copyrighted material.
So you’re that Hal Jordan then? Why would a Green Lantern feel the need to defend either? I feel like the Guardians would not accept your arguments as soon as you got to Oa, poozer. I guess what I am saying is don’t have a famous name. Seems obvious.
You conflate web crawling for inference with web crawling for training.
Web crawling for training is when you ingest content on a mass scale, usually indiscriminately, usually with a dumb crawler for scale's sake, for the purposes of training an LLM. You don't really care whether one particular website is in the dataset (unless it's the size of Reddit), you just want a large, diverse, high-quality data mix.
Web crawling for inference is when a user asks a targeted question, you do a web search, and fetch exactly those resources that are likely to be relevant to that search. Nothing ends up in the training data, it's just context enrichment.
People have a much larger issue with crawling for training than for inference (though I personally think both are equally ok).
You would be much better served blacklisting those websites and read the NYTs article. It's time to take a moment and reflect that if you cant read the news you might be the problem
It's not a botnet. The fact that you're trying to use that term because you prefer the emotions it gives you instead of a different term based on reality is all anyone should need to reject your suggestion
They don't "just have taps" in whatever isp you come across. And they certainly, and i cannot be clear enough on this, they don't just spy on Americans. It's literally the one thing they expressly forbidden from doing
Even in the days of telegrams, FDR was opening and reading millions of American’s telegrams to use the information therein to target his political enemies.
You can’t build centralized systems that enable spying and not expect people to do the “forbidden” thing. We have to build systems that make this impractical.
I’m sure you thought GSM was secure too. They absolutely have taps, just search for YouTube videos then cross reference the exact places and situations they talk about.
reply