eterm · 2026-05-06T15:25:02 1778081102

If your bottleneck is product spec rather than QA & testing, then you're doing well.

And that hints at one solution: if you demand better quality, you'll slow engineering back down to a level you can control.

eterm · 2026-05-06T07:22:31 1778052151

Remarkably, I found a blog where I thought "This sounds like AI" but I wasn't sure, so I went to their back catalogue from decades ago, and the writing was similar so I gave them a pass.

Then I checked the internet archive.

They had replaced all their back catalogue with AI slop.

eterm · 2026-05-03T08:19:22 1777796362

Ambiguity is the grease that keeps everything turning.

eterm · 2026-05-01T21:51:54 1777672314

Legislate that the banks are liable for refunding this class of fraud and you'll find they suddenly take this stuff a lot more seriously and "discover" the technology.

gustavus · 2026-05-01T22:25:13 1777674313

I don't understand your point. The banks and credit card companies are already responsible. If I have a fraudulent charge, I call and tell them it's fraudulent, and they say okay and take it off and either get it back from the issuer or eat the difference.

rstupek · 2026-05-01T23:45:02 1777679102

I think what you're missing is that the bank and credit card companies rarely eat the difference. The business that sold the item which was charged back is the one paying the cost of the transaction (no income, lost item) plus a chargeback processing fee (typically $15 per chargeback).

rvnx · 2026-05-01T22:30:10 1777674610

They can also punish you for doing so, like banning you from the bank.

They also report account closures to ChexSystems, which can make it harder to open accounts at other banks for years. Credit card issuers can drop you and ding your credit. Definitely not your fault, but still your problem, and the consequences are for you.

dboreham · 2026-05-01T22:08:37 1777673317

Quite hard to do when banks are major bribers of politicians.

eterm · 2026-04-30T05:59:44 1777528784

It's been a long time since I read it, but it was one of the better books I've read. It changed my approach to how to think about old code-bases.

kqr · 2026-05-07T05:22:43 1778131363

I agree. I come back to it all the time when I need a little inspiration for how to deal with a gnarly codebase. Usually there is something in there I can apply directly to get me out of a pinch. When there is not, the reminder of how malleable code is suffices.

eterm · 2026-04-30T05:58:00 1777528680

"is the real" is such a strong Claude tell; whenever I encounter it, it makes me question what I'm reading.

Another I've noticed more recently is a slight obsession with referring to "Framing".

Skidaddle · 2026-04-30T06:27:32 1777530452

I miss being told “You’re absolutely right!” :’(

yard2010 · 2026-04-30T06:17:16 1777529836

You're absolutely right. I was wrong in the first place

eterm · 2026-04-29T19:19:29 1777490369

Thank you for pointing this out, it left me confused. It would have been a lot clearer if the text were in a quote block!

eterm · 2026-04-28T07:41:03 1777362063

This is pretty relevant for things like claude-code, which has a fairly rudimentary way of dealing with permissions with block-lists and allow-lists.

I once accidentally gave my claude "powershell" permissions in one session, and after that any time it found it was blocked from using a tool, e.g. git, it would write a powershell script that did the same thing and execute the script to work around the blocked permission.

Obviously no sane system would have "powershell" in a generic allow-list, but you could imagine some discrepancy in allowed levels between tools which can be worked around with the techniques on this page.
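
To make the workaround concrete, here is a minimal sketch of the failure mode. It is not how claude-code actually implements permissions; the ALLOWED set and the is_allowed helper are hypothetical and exist only for illustration. The assumption is a checker that looks only at the name of the executable being launched.

```python
import shlex

# Hypothetical policy: only the executable name is checked, and an
# interpreter ("powershell") has ended up on the allow-list.
ALLOWED = {"ls", "cat", "powershell"}


def is_allowed(command: str) -> bool:
    """Naive check: allow a command if its first token is on the allow-list."""
    tokens = shlex.split(command)
    return bool(tokens) and tokens[0] in ALLOWED


# git is not on the allow-list, so the direct call is rejected...
direct = is_allowed("git push origin main")

# ...but the same action wrapped in an allowed interpreter passes the check.
wrapped = is_allowed('powershell -Command "git push origin main"')

print(direct, wrapped)
```

Once any general-purpose interpreter is allowed, a name-based check stops telling you much about what will actually run.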

troupo · 2026-04-28T07:50:05 1777362605

PowerShell or Python scripts to work around restrictions are the go-to for LLMs.

And it doesn't stop there.

Yesterday I was trying to figure out some icons issue in KDE plasma (I know nothing about KDE). Both Claude and Codex would run complex bus and debug queries and write and execute QML scripts with more and more tools thrown into the mix.

There's no way to properly block them with just allow- and block lists

embedding-shape · 2026-04-28T10:30:06 1777372206

> There's no way to properly block them with just allow- and block lists

Especially not when some harnesses rely on the reliability of the LLM to determine what's allowed or not, pretty much "You shouldn't do thing X" and then asking the LLM to itself evaluate if it should be able to do it or not when it comes up. Bananas.

The only right and productive way to run an agent on your computer is to isolate it properly somehow, then run it with "--sandbox danger-full-access --dangerously-bypass-approvals-and-sandbox" or whatever. I myself use Docker containers, but there are lots of solutions out there.

felixyz · 2026-04-28T10:46:26 1777373186

You have to be extremely careful when you set up a dev container: lock down file access, do not give the agent the power to start other containers or run "docker compose up", restrict network access to an allow-list, etc. Just running the agent in a container does little to protect you. (Maybe you know this, but a lot of people don't!)
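
A rough sketch of what that lock-down could look like, expressed as a Python helper that only builds a "docker run" command line and does not execute it. The helper name sandbox_argv and the image name my-agent-image are hypothetical. Note that "--network none" is stricter than an allow-list: a real agent needs to reach its model API, so in practice you would swap it for a network setup that only permits the hosts you trust.

```python
from pathlib import Path


def sandbox_argv(workdir: str, image: str, agent_cmd: list[str]) -> list[str]:
    """Build (but do not run) a restrictive `docker run` command line."""
    project = Path(workdir).resolve()
    return [
        "docker", "run", "--rm", "-it",
        "--network", "none",                    # no network at all
        "--cap-drop", "ALL",                    # drop all Linux capabilities
        "--security-opt", "no-new-privileges",  # block privilege escalation
        "--read-only",                          # read-only root filesystem
        "--tmpfs", "/tmp",                      # writable scratch space
        "-v", f"{project}:/work",               # the only host directory exposed
        "-w", "/work",
        image, *agent_cmd,
    ]


# "my-agent-image" is a placeholder image name, not a real published image.
argv = sandbox_argv(".", "my-agent-image", ["codex"])
print(" ".join(argv))
```

The Docker socket is deliberately not mounted, so the agent has no way to start sibling containers from inside.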

embedding-shape · 2026-04-28T10:53:41 1777373621

Most of those things are what happens by default. Sure, be careful, but by default it's secure enough to prevent most potential issues. No need to lock down file access for example, by default it only has access to files inside the container, and of course by default containers don't have access to start other containers, and so on.

Good word of caution though, make sure you actually isolate when you set out to isolate something :)

chrisweekly · 2026-04-28T13:56:33 1777384593

I've just discovered and started using smolmachines^1 which actually have the requisite isolation.

1. https://smolmachines.com

embedding-shape · 2026-04-28T14:06:15 1777385175

As mentioned, "podman/docker run -it $my-image codex" also actually has the requisite isolation by default, no need for special software. The biggest risk is accidental deletion of stuff, easily solved without running an entire VM, which "smol" machines seem to be. No doubt VMs have their uses too, but for simple isolation like this I'd personally rather use already existing tooling.

chrisweekly · 2026-04-28T16:37:56 1777394276

Ok, YMMV, but a smolvm provides macOS-native, per-workload isolation -- vs trad container depending on a daemon and relying on namespaces (w/ a shared kernel). Easy "packing" into single-file executables, and a nice SDK, make it ~ideal for my needs; great balance of security:convenience.

https://smolmachines.com/#comparison

embedding-shape · 2026-04-28T18:14:11 1777400051

Cool ad bro, but stop claiming containers won't get you "per workload isolation" just because they share kernels. In the context of this discussion it hardly matters; containers isolate enough for this.

chrisweekly · 2026-04-29T01:42:07 1777426927

ad? I have no affiliation w smolmachines, just glad I found it.

ebonnafoux · 2026-04-28T08:07:18 1777363638

At a previous employer, they blocked the chmod command. I got into the habit of running python -c "import os; os.chmod('my_file', 0o744)".

Glad to see LLM re-discover this trick.
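
A note on the mode argument in this trick: os.chmod takes an integer, so the bits that "chmod 744" sets correspond to the octal literal 0o744 in Python, while the plain decimal 744 is a different bit pattern. A minimal sketch, assuming a POSIX system and using a throwaway temp file rather than a real one:

```python
import os
import stat
import tempfile

# Create a throwaway file so the example does not touch real data.
fd, path = tempfile.mkstemp()
os.close(fd)

try:
    # 0o744 is the octal literal matching `chmod 744` (rwxr--r--).
    os.chmod(path, 0o744)
    mode = stat.S_IMODE(os.stat(path).st_mode)
    octal_mode = oct(mode)

    # The decimal integer 744, written in octal, is not 744 at all.
    decimal_as_octal = oct(744)
finally:
    os.remove(path)

print(octal_mode)        # permission bits actually set on the file
print(decimal_as_octal)  # what decimal 744 would have meant
```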

Terr_ · 2026-04-28T09:03:20 1777367000

> to see LLM re-discover

I imagine someone probably wrote very specifically about it in the training data that underwent lossy compression, and the LLM is decompressing that how-to.

So I'd say it's more like "surfacing" or "retrieving" than "re-discovering".

seanp2k2 · 2026-04-28T09:24:23 1777368263

They scraped everything on Stackoverflow, likely IRC logs from Freenode, and every book written in the modern era courtesy of Sci-Hub / Library Genesis / Anna's Archive / Z Library.

RIP Aaron Swartz, they're generating trillions in shareholder value from the spiritual successors to the work they were going to imprison you for.

ebonnafoux · 2026-04-28T12:30:37 1777379437

Indeed, I checked and the solution was already on Stack Overflow: https://askubuntu.com/a/1483248

andyhedges · 2026-04-28T09:08:40 1777367320

For the LLM it's a probabilistic set of strings that achieves the outcome: the highest-probability set didn't work, so try the next one until success or a threshold is met. A human sees the implicit difference, where the obvious thing not working indicates that someone doesn't want you to do it, but an LLM, unless guided, doesn't see that subtext.

So chmod +x file didn't work, now try python -c "import os; os.chmod('file', 0o744)"

sigmoid10 · 2026-04-28T09:35:30 1777368930

Humans and LLMs both only see that when given the right context. A tool not working in a corporate environment may be anything from oversight, malfunction all the way to security block. Knowing which one it is takes a lot of implicit knowledge. Most people fail to provide this level of context to their LLMs and then wonder why they act so generic. But they are trained to act in the most generic way unless given context that would deviate from it.

eterm · 2026-04-26T09:44:46 1777196686

If this will solve the problem with boards, you need to be able to answer 2 questions:

1. What does this do that Trello doesn't?

2. What does Trello do that this doesn't?

okovooo · 2026-04-26T15:11:03 1777216263

Trello does not work in Russia. You cannot install it on your server in the company's closed loop. There is no task number, there are no subtasks, you cannot see the progress of subtasks on the parent's board. There is no analytics. There is no page where you can view all the tasks that can be taken on. The service does not claim the laurels of solving all problems, but it is unique in its own way.

eterm · 2026-04-25T14:36:04 1777127764

That's exactly what claude-code does these days. If you go AFK for ~5 minutes it also produces a summary of where you are, which is useful if you're juggling multiple windows.