Hacker News
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? (arxiv.org)
2 points by yorwba 5 days ago | past | discuss
Towards a Science of AI Agent Reliability (arxiv.org)
2 points by smartmic 5 days ago | past | discuss
Improving Chain-of-Thought Monitorability Through Information Theory (arxiv.org)
1 point by simonpure 5 days ago | past | discuss
The Landscape of Non-Equilibrium Memories with Neural Cellular Automata (arxiv.org)
1 point by PaulHoule 5 days ago | past | discuss
Package Managers à la Carte: a formal model of dependency resolution (arxiv.org)
54 points by avsm 5 days ago | past | 16 comments
Fast and Optimal Mapping for Accelerator Modeling and Evaluation (arxiv.org)
2 points by PaulHoule 5 days ago | past | discuss
Agents of Chaos: Breaches of trust in autonomous LLM agents (arxiv.org)
4 points by cool-RR 5 days ago | past | 1 comment
Computer-Using World Model (arxiv.org)
1 point by steamboatwillie 5 days ago | past | discuss
Are functions just syntactic sugar for inheritance? (arxiv.org)
2 points by yangbo 5 days ago | past | 5 comments
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction (arxiv.org)
1 point by PaulHoule 6 days ago | past | discuss
The Statistical Signature of LLMs (arxiv.org)
1 point by bikenaga 6 days ago | past | 1 comment
Towards Compressive and Scalable Recurrent Memory (arxiv.org)
2 points by PaulHoule 6 days ago | past | discuss
A Visual Document Benchmark for Scientific Retrieval and Question Answering (arxiv.org)
2 points by bobvanluijt 6 days ago | past | discuss
Bioinspired Kirigami Capsule Robot for Minimally Invasive GI Biopsy (arxiv.org)
1 point by PaulHoule 6 days ago | past | discuss
Impact of Programming Languages on Code Quality (2019) (arxiv.org)
1 point by tosh 6 days ago | past | discuss
Specieslike clusters based on identical ancestor points (arxiv.org)
1 point by xamuel 6 days ago | past | 1 comment
Unified Latents (UL): How to train your latents (arxiv.org)
2 points by pama 6 days ago | past | discuss
Stop Saying "AI" (arxiv.org)
5 points by Hard_Space 6 days ago | past | discuss
Who's in Charge? Disempowerment Patterns in Real-World LLM Usage (arxiv.org)
2 points by Gillesray 6 days ago | past | 1 comment
The Principles of Deep Learning Theory (2021) (arxiv.org)
2 points by vinhnx 7 days ago | past | discuss
Duan et al. 2026 algorithm beats Duan et al. 2025 for the SSSP Problem (arxiv.org)
3 points by samyadn12 7 days ago | past | discuss
HLE-Verified: A Verification and Revision of Humanity's Last Exam (arxiv.org)
6 points by ravenical 7 days ago | past | discuss
CSLib: The Lean Computer Science Library (arxiv.org)
2 points by matt_d 7 days ago | past | discuss
Realistic Adversarial Testing of Computer-Use Agents in Web-OS Environments (arxiv.org)
2 points by yakkomajuri 7 days ago | past | discuss
Large-scale online deanonymization with LLMs (including HN users) (arxiv.org)
3 points by salkahfi 7 days ago | past | 2 comments
Stress Tests Reveal Fragile Grounding in Video-Language Models (arxiv.org)
2 points by PaulHoule 7 days ago | past | discuss
Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective (arxiv.org)
1 point by rramadass 7 days ago | past | 1 comment
Volumetric Non-Invasive Cardiac Mapping for Global Arrhythmia Characterization (arxiv.org)
1 point by PaulHoule 7 days ago | past | discuss
A 26-Gram Butterfly-Inspired Robot Achieving Autonomous Tailless Flight (arxiv.org)
71 points by Terretta 7 days ago | past | 17 comments
Surprising Effectiveness of Masking Updates in Adaptive Optimizers (arxiv.org)
2 points by energy123 7 days ago | past | discuss
