bearseascape's submissions

1.		Slople – Can you tell real ML papers from AI-generated ones? (ml5885.github.io)
		3 points by bearseascape 22 days ago | 1 comment
2.		Benchmarking Culture (argmin.net)
		1 point by bearseascape 49 days ago
3.		Why one small American town won't stop stoning its residents to death (archiveofourown.org)
		2 points by bearseascape 3 months ago | 1 comment
4.		The most complex model we understand [video] (youtube.com)
		2 points by bearseascape 4 months ago
5.		Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs (arxiv.org)
		1 point by bearseascape 4 months ago
6.		MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation (arxiv.org)
		13 points by bearseascape on April 14, 2025
7.		Automated Researchers Can Subtly Sandbag (anthropic.com)
		2 points by bearseascape on March 27, 2025
8.		Auditing Language Models for Hidden Objectives (anthropic.com)
		1 point by bearseascape on March 27, 2025
9.		Policy for LLM Writing on LessWrong (lesswrong.com)
		2 points by bearseascape on March 27, 2025
10.		Towards Understanding Distilled Reasoning Models: A Representational Approach (arxiv.org)
		3 points by bearseascape on March 6, 2025
11.		Transformers Learn to Implement Multistep Gradient Descent with Chain of Thought (arxiv.org)
		1 point by bearseascape on March 3, 2025
12.		(Mis)Fitting: A Survey of Scaling Laws (arxiv.org)
		2 points by bearseascape on Feb 27, 2025
13.		Resurrecting saturated LLM benchmarks with adversarial encoding (arxiv.org)
		1 point by bearseascape on Feb 11, 2025
14.		Deep Double Descent: Where Bigger Models and More Data Hurt (openai.com)
		2 points by bearseascape on Feb 8, 2025
15.		Value-Based Deep RL Scales Predictably (arxiv.org)
		68 points by bearseascape on Feb 8, 2025 | 3 comments