Submissions from arxiv.org

		Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? (arxiv.org)
		2 points by yorwba 5 days ago \| past \| discuss
		Towards a Science of AI Agent Reliability (arxiv.org)
		2 points by smartmic 5 days ago \| past \| discuss
		Improving Chain-of-Thought Monitorability Through Information Theory (arxiv.org)
		1 point by simonpure 5 days ago \| past \| discuss
		The Landscape of Non-Equilibrium Memories with Neural Cellular Automata (arxiv.org)
		1 point by PaulHoule 5 days ago \| past \| discuss
		Package Managers à la Carte: a formal model of dependency resolution (arxiv.org)
		54 points by avsm 5 days ago \| past \| 16 comments
		Fast and Optimal Mapping for Accelerator Modeling and Evaluation (arxiv.org)
		2 points by PaulHoule 5 days ago \| past \| discuss
		Agents of Chaos: Breaches of trust in autonomous LLM agents (arxiv.org)
		4 points by cool-RR 5 days ago \| past \| 1 comment
		Computer-Using World Model (arxiv.org)
		1 point by steamboatwillie 5 days ago \| past \| discuss
		Are functions just syntactic sugar for inheritance? (arxiv.org)
		2 points by yangbo 5 days ago \| past \| 5 comments
		VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction (arxiv.org)
		1 point by PaulHoule 6 days ago \| past \| discuss
		The Statistical Signature of LLMs (arxiv.org)
		1 point by bikenaga 6 days ago \| past \| 1 comment
		Towards Compressive and Scalable Recurrent Memory (arxiv.org)
		2 points by PaulHoule 6 days ago \| past \| discuss
		A Visual Document Benchmark for Scientific Retrieval and Question Answering (arxiv.org)
		2 points by bobvanluijt 6 days ago \| past \| discuss
		Bioinspired Kirigami Capsule Robot for Minimally Invasive GI Biopsy (arxiv.org)
		1 point by PaulHoule 6 days ago \| past \| discuss
		Impact of Programming Languages on Code Quality (2019) (arxiv.org)
		1 point by tosh 6 days ago \| past \| discuss
		Specieslike clusters based on identical ancestor points (arxiv.org)
		1 point by xamuel 6 days ago \| past \| 1 comment
		Unified Latents (UL): How to train your latents (arxiv.org)
		2 points by pama 6 days ago \| past \| discuss
		Stop Saying "AI" (arxiv.org)
		5 points by Hard_Space 6 days ago \| past \| discuss
		Who's in Charge? Disempowerment Patterns in Real-World LLM Usage (arxiv.org)
		2 points by Gillesray 6 days ago \| past \| 1 comment
		The Principles of Deep Learning Theory (2021) (arxiv.org)
		2 points by vinhnx 7 days ago \| past \| discuss
		Duan et al. 2026 algorithm beats Duan et al. 2025 for the SSSP Problem (arxiv.org)
		3 points by samyadn12 7 days ago \| past \| discuss
		HLE-Verified: A Verification and Revision of Humanity's Last Exam (arxiv.org)
		6 points by ravenical 7 days ago \| past \| discuss
		CSLib: The Lean Computer Science Library (arxiv.org)
		2 points by matt_d 7 days ago \| past \| discuss
		Realistic Adversarial Testing of Computer-Use Agents in Web-OS Environments (arxiv.org)
		2 points by yakkomajuri 7 days ago \| past \| discuss
		Large-scale online deanonymization with LLMs (including HN users) (arxiv.org)
		3 points by salkahfi 7 days ago \| past \| 2 comments
		Stress Tests Reveal Fragile Grounding in Video-Language Models (arxiv.org)
		2 points by PaulHoule 7 days ago \| past \| discuss
		Deception Analysis with Artificial Intelligence an Interdisciplinary Perspective (arxiv.org)
		1 point by rramadass 7 days ago \| past \| 1 comment
		Volumetric Non-Invasive Cardiac Mapping for Global Arrhythmia Characterization (arxiv.org)
		1 point by PaulHoule 7 days ago \| past \| discuss
		A 26-Gram Butterfly-Inspired Robot Achieving Autonomous Tailless Flight (arxiv.org)
		71 points by Terretta 7 days ago \| past \| 17 comments
		Surprising Effectiveness of Masking Updates in Adaptive Optimizers (arxiv.org)
		2 points by energy123 7 days ago \| past \| discuss
		More