| 1. | | Slople – Can you tell real ML papers from AI-generated ones? (ml5885.github.io) |
| 3 points by bearseascape 22 days ago | past | 1 comment |
|
| 2. | | Benchmarking Culture (argmin.net) |
| 1 point by bearseascape 49 days ago | past |
|
| 3. | | Why one small American town won't stop stoning its residents to death (archiveofourown.org) |
| 2 points by bearseascape 3 months ago | past | 1 comment |
|
| 4. | | The most complex model we understand [video] (youtube.com) |
| 2 points by bearseascape 4 months ago | past |
|
| 5. | | Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs (arxiv.org) |
| 1 point by bearseascape 4 months ago | past |
|
| 6. | | MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation (arxiv.org) |
| 13 points by bearseascape on April 14, 2025 | past |
|
| 7. | | Automated Researchers Can Subtly Sandbag (anthropic.com) |
| 2 points by bearseascape on March 27, 2025 | past |
|
| 8. | | Auditing Language Models for Hidden Objectives (anthropic.com) |
| 1 point by bearseascape on March 27, 2025 | past |
|
| 9. | | Policy for LLM Writing on LessWrong (lesswrong.com) |
| 2 points by bearseascape on March 27, 2025 | past |
|
| 10. | | Towards Understanding Distilled Reasoning Models: A Representational Approach (arxiv.org) |
| 3 points by bearseascape on March 6, 2025 | past |
|
| 11. | | Transformers Learn to Implement Multistep Gradient Descent with Chain of Thought (arxiv.org) |
| 1 point by bearseascape on March 3, 2025 | past |
|
| 12. | | (Mis)Fitting: A Survey of Scaling Laws (arxiv.org) |
| 2 points by bearseascape on Feb 27, 2025 | past |
|
| 13. | | Resurrecting saturated LLM benchmarks with adversarial encoding (arxiv.org) |
| 1 point by bearseascape on Feb 11, 2025 | past |
|
| 14. | | Deep Double Descent: Where Bigger Models and More Data Hurt (openai.com) |
| 2 points by bearseascape on Feb 8, 2025 | past |
|
| 15. | | Value-Based Deep RL Scales Predictably (arxiv.org) |
| 68 points by bearseascape on Feb 8, 2025 | past | 3 comments |
|