Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
GLM-5: From Vibe Coding to Agentic Engineering (arxiv.org)
1 point by gmays 10 days ago | past | discuss
Prompt Repetition Improves Non-Reasoning LLMs [pdf] (arxiv.org)
1 point by 8ig8 11 days ago | past | discuss
LongCLI-Bench: Benchmark and Study for Long-Horizon Agentic Programming in CLIs (arxiv.org)
2 points by simonpure 11 days ago | past | discuss
A Primer of Mathematical Writing (arxiv.org)
1 point by paulpauper 11 days ago | past | discuss
Can We Trust LLM Detectors? (arxiv.org)
3 points by PaulHoule 11 days ago | past | discuss
Persona: Controlling LLM Personality with Vector Algebra (arxiv.org)
3 points by mldev_exe 11 days ago | past | discuss
GR3EN: Generative Relighting for 3D Environments (arxiv.org)
2 points by PaulHoule 11 days ago | past | discuss
WebWorld: A Large-Scale World Model for Web Agent Training (arxiv.org)
2 points by gmays 11 days ago | past | discuss
CMind: An AI Agent for Localizing C Memory Bugs (arxiv.org)
2 points by PaulHoule 11 days ago | past | discuss
A benchmark for vericoding: formally verified program synthesis (arxiv.org)
3 points by luskira 11 days ago | past | discuss
Computing Diffusion Geometry (arxiv.org)
3 points by aanet 11 days ago | past | 1 comment
Constructing Unlearnable Data with Solely Linear Classifiers (arxiv.org)
2 points by PaulHoule 11 days ago | past | discuss
Experiential Reinforcement Learning (arxiv.org)
3 points by geophile 12 days ago | past | discuss
Identity, Cooperation and Framing Within Groups of Real and Simulated Humans (arxiv.org)
2 points by PaulHoule 12 days ago | past | discuss
Investigating the Downstream Effect of AI Assistants on Software Maintainability (arxiv.org)
2 points by KallDrexx 12 days ago | past | 2 comments
A New Perspective on Drawing Venn Diagrams for Data Visualization (arxiv.org)
21 points by IdealeZahlen 12 days ago | past | 6 comments
Language Models Entangle Language and Culture (arxiv.org)
1 point by paraschopra 12 days ago | past | discuss
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook (arxiv.org)
1 point by simonpure 12 days ago | past | 1 comment
Biases in the Blind Spot: Detecting What LLMs Fail to Mention (arxiv.org)
2 points by azalemeth 12 days ago | past | discuss
Prompt Repetition Improves Non Reasoning LLM (arxiv.org)
2 points by jdthedisciple 12 days ago | past | discuss
GLM-5 Technical Report (arxiv.org)
12 points by meetpateltech 12 days ago | past | discuss
Training-Free Group Relative Policy Optimization (arxiv.org)
1 point by readitalready 12 days ago | past | discuss
Composition-RL: Compose Verifiable Prompts for Reinforcement Learning of LLMs (arxiv.org)
3 points by gmays 12 days ago | past | discuss
Reducing the cost of breaking RSA-2048 to 100000 physical qubits (arxiv.org)
3 points by fuglede_ 12 days ago | past | discuss
Intelligent AI Delegation (arxiv.org)
2 points by gmays 12 days ago | past | discuss
Randomness in Agentic Evals (arxiv.org)
1 point by andre15silva 13 days ago | past | discuss
Hunt Globally (arxiv.org)
1 point by salkahfi 13 days ago | past | discuss
Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises (arxiv.org)
1 point by salkahfi 13 days ago | past | discuss
Learning State-Tracking from Code Using Linear RNNs (arxiv.org)
2 points by jul8234 13 days ago | past | 1 comment
A Survey of In-Context Reinforcement Learning (arxiv.org)
2 points by handfuloflight 13 days ago | past | discuss

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: