Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Computing Diffusion Geometry (arxiv.org)
3 points by aanet 5 days ago | past | 1 comment
Constructing Unlearnable Data with Solely Linear Classifiers (arxiv.org)
2 points by PaulHoule 5 days ago | past | discuss
Experiential Reinforcement Learning (arxiv.org)
3 points by geophile 5 days ago | past | discuss
Identity, Cooperation and Framing Within Groups of Real and Simulated Humans (arxiv.org)
2 points by PaulHoule 5 days ago | past | discuss
Investigating the Downstream Effect of AI Assistants on Software Maintainability (arxiv.org)
2 points by KallDrexx 5 days ago | past | 2 comments
A New Perspective on Drawing Venn Diagrams for Data Visualization (arxiv.org)
21 points by IdealeZahlen 5 days ago | past | 5 comments
Language Models Entangle Language and Culture (arxiv.org)
1 point by paraschopra 5 days ago | past | discuss
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook (arxiv.org)
1 point by simonpure 5 days ago | past | 1 comment
Biases in the Blind Spot: Detecting What LLMs Fail to Mention (arxiv.org)
2 points by azalemeth 6 days ago | past | discuss
Prompt Repetition Improves Non Reasoning LLM (arxiv.org)
2 points by jdthedisciple 6 days ago | past | discuss
GLM-5 Technical Report (arxiv.org)
12 points by meetpateltech 6 days ago | past | discuss
Training-Free Group Relative Policy Optimization (arxiv.org)
1 point by readitalready 6 days ago | past | discuss
Composition-RL: Compose Verifiable Prompts for Reinforcement Learning of LLMs (arxiv.org)
3 points by gmays 6 days ago | past | discuss
Reducing the cost of breaking RSA-2048 to 100000 physical qubits (arxiv.org)
3 points by fuglede_ 6 days ago | past | discuss
Intelligent AI Delegation (arxiv.org)
2 points by gmays 6 days ago | past | discuss
Randomness in Agentic Evals (arxiv.org)
1 point by andre15silva 6 days ago | past | discuss
Hunt Globally (arxiv.org)
1 point by salkahfi 6 days ago | past | discuss
Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises (arxiv.org)
1 point by salkahfi 6 days ago | past | discuss
Learning State-Tracking from Code Using Linear RNNs (arxiv.org)
2 points by jul8234 7 days ago | past | 1 comment
A Survey of In-Context Reinforcement Learning (arxiv.org)
2 points by handfuloflight 7 days ago | past | discuss
Soft Contamination Means Benchmarks Test Shallow Generalization (arxiv.org)
2 points by cjbarber 7 days ago | past | 1 comment
SkillsBench: Benchmarking how well agent skills work across diverse tasks (arxiv.org)
362 points by mustaphah 7 days ago | past | 163 comments
Virtual Width Networks (VWN) (arxiv.org)
9 points by tesserato 7 days ago | past | discuss
CodeLogician: Neuro-symbolic reasoning for precise software analysis (arxiv.org)
2 points by NTCTech 7 days ago | past | 1 comment
Intelligent AI Delegation (2026) (arxiv.org)
1 point by Nydhal 7 days ago | past | discuss
Delegated Agent Authorization Constrained to Semantic Task-to-Scope Matching (arxiv.org)
1 point by mooreds 7 days ago | past | discuss
Evaluating AGENTS.md: are they helpful for coding agents? (arxiv.org)
218 points by mustaphah 7 days ago | past | 156 comments
Multi-Agent Teams Hold Experts Back (arxiv.org)
1 point by fauigerzigerk 8 days ago | past | discuss
Large Language Model Reasoning Failures (arxiv.org)
1 point by kawera 8 days ago | past | discuss
Towards Autonomous Mathematics Research (arxiv.org)
106 points by gmays 8 days ago | past | 56 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: