Hacker News
The Fundamental Limits of LLMs at Scale (arxiv.org)
4 points by o4c 2 days ago | past | discuss
GenAI for Systems: Recurring Challenges & Design Principles from SW to Silicon (arxiv.org)
2 points by matt_d 2 days ago | past | discuss
Fork, Explore, Commit: OS Primitives for Agentic Exploration (arxiv.org)
3 points by wang_cong 2 days ago | past | discuss
Multi-Turn Intent Detection for LLM and Agent Security (arxiv.org)
1 point by sharathr 3 days ago | past | 1 comment
Prompt Repetition Improves Non-Reasoning LLMs (arxiv.org)
1 point by elorant 3 days ago | past | discuss
Multi-agent cooperation through in-context co-player inference (arxiv.org)
2 points by gmays 3 days ago | past | discuss
Performance of Deep Material Networks for Multiscale Material Modeling (arxiv.org)
1 point by PaulHoule 3 days ago | past | discuss
Wisdom of the Crowd: How Network Topology Distorts Collective Perception (arxiv.org)
2 points by Anon84 3 days ago | past | discuss
Large-scale online deanonymization with LLMs (arxiv.org)
3 points by DalasNoin 3 days ago | past | discuss
Monolith – The research paper behind TikTok's algorithm (2022) (arxiv.org)
2 points by Alifatisk 3 days ago | past | discuss
Soft Contamination Means Benchmarks Test Shallow Generalization (arxiv.org)
2 points by gmays 3 days ago | past | discuss
The Existence and Behavior of Secondary Attention Sinks (arxiv.org)
1 point by thw20 3 days ago | past | discuss
Improving Interactive In-Context Learning from Natural Language Feedback – DeepMind (arxiv.org)
2 points by zerop 3 days ago | past | discuss
Fast KV Compaction via Attention Matching (arxiv.org)
73 points by cbracketdash 3 days ago | past | 15 comments
Prompt Repetition Improves Non-Reasoning LLMs (arxiv.org)
1 point by beatthatflight 3 days ago | past | discuss
When Models Manipulate Manifolds: The Geometry of a Counting Task [pdf] (arxiv.org)
1 point by vinhnx 3 days ago | past | discuss
Computer Science as Infrastructure: The Spine of the Lean CSLib (arxiv.org)
2 points by matt_d 3 days ago | past | discuss
Towards Industrial-Scale Verification: LLM-Driven Theorem Proving on seL4 (arxiv.org)
1 point by lr0 3 days ago | past | discuss
Surprising Effectiveness of Masking Updates in Adaptive Optimizers (arxiv.org)
4 points by gmays 4 days ago | past | discuss
Realization of a Synthetic Hall Torus with a Spinor Bose-Einstein Condensate (arxiv.org)
2 points by bryanrasmussen 4 days ago | past | discuss
GLM-5: From Vibe Coding to Agentic Engineering (arxiv.org)
1 point by gmays 4 days ago | past | discuss
Prompt Repetition Improves Non-Reasoning LLMs [pdf] (arxiv.org)
1 point by 8ig8 4 days ago | past | discuss
LongCLI-Bench: Benchmark and Study for Long-Horizon Agentic Programming in CLIs (arxiv.org)
2 points by simonpure 4 days ago | past | discuss
A Primer of Mathematical Writing (arxiv.org)
1 point by paulpauper 4 days ago | past | discuss
Can We Trust LLM Detectors? (arxiv.org)
3 points by PaulHoule 4 days ago | past | discuss
Persona: Controlling LLM Personality with Vector Algebra (arxiv.org)
3 points by mldev_exe 4 days ago | past | discuss
GR3EN: Generative Relighting for 3D Environments (arxiv.org)
2 points by PaulHoule 4 days ago | past | discuss
WebWorld: A Large-Scale World Model for Web Agent Training (arxiv.org)
2 points by gmays 5 days ago | past | discuss
CMind: An AI Agent for Localizing C Memory Bugs (arxiv.org)
2 points by PaulHoule 5 days ago | past | discuss
A benchmark for vericoding: formally verified program synthesis (arxiv.org)
3 points by luskira 5 days ago | past | discuss
