More

NSPG911 · 2026-03-29T23:16:20 1774826180

I was using KeepChatGPT[1] for a while back in 2023-2024, pre-Gemini-in-Google era, and I was fascinated as to how it was able to mask being a user without needing any API or help from the end user. I stopped using it after 2024 because 1) Gemini and 2) It breaks quite a lot. I did however, like how you had an option to push the AI panel to the right, if only Google even considers doing so.

[1]: https://github.com/xcanwin/keepchatgpt

qingcharles · 2026-03-30T02:27:37 1774837657

I have a little helper app I run sometimes that I have a button to push a query into ChatGPT and get a json response. You wouldn't even know OpenAI had any anti-bot tools because it doesn't get flagged at all. It just uses a webview inside WinForms.

NSPG911 · 2026-03-27T13:58:54 1774619934

I also genuinely hate using jq. It is one of the only things that I rely heavily on AI.

vips7L · 2026-03-27T15:06:09 1774623969

You should try nushell or PowerShell which have built ins to convert json to objects. It makes it so easy.

bigstrat2003 · 2026-03-27T15:19:27 1774624767

Second this. Working with nushell is a joy.

dannyobrien · 2026-03-27T20:58:46 1774645126

I use the llm-jq plugin for Simon Willison's `llm` command line frontend for this: https://github.com/simonw/llm-jq

amelius · 2026-03-27T14:27:47 1774621667

At that point why don't we ask the AI directly to filter through our data? The AI query language is much more powerful.

latexr · 2026-03-27T14:50:08 1774623008

Because the output you get can have hallucinations, which don’t happen with a deterministic tool. Furthermore, by getting the `jq` command you get something which is reusable, fast, offline, local, doesn’t send your data to a third-party, doesn’t waste a bunch of tokens, … Using an LLM to filter the data is worse in every metric.

alwillis · 2026-03-27T17:05:06 1774631106

I get that AI isn’t deterministic by definition, but IMHO it’s become the go-to response for a reason to not use AI, regardless of the use case.

I’ve never seen AI “hallucinate” on basic data transformation tasks. If you tell it to convert JSON to YAML, that’s what you’re going to get. Most LLMs are probably using something like jq to do the conversion in the background anyway.

AI experts say AI models don’t hallucinate, they confabulate.

tkclough · 2026-03-27T18:50:55 1774637455

Just because you haven't seen it hallucinate on these tasks doesn't mean it can't.

When I'm deciding what tool to use, my question is "does this need AI?", not "could AI solve this?" There's plenty of cases where its hard to write a deterministic script to do something, but if there is a deterministic option, why would you choose something that might give you the wrong answer? It's also more expensive.

The jq script or other script that an LLM generates is way easier to spot check than the output if you ask it to transform the data directly, and you can reuse it.

alwillis · 2026-03-30T18:16:38 1774894598

> but if there is a deterministic option, why would you choose something that might give you the wrong answer?

Claude Code can use jq if it's installed on your system. Also, the data transformation is usually part of a larger workflow where an LLM is being used. And honestly, Claude is going to know jq better than 95% of developers who use it. jq can do a lot of things but it’s not the most intuitive tool to learn.

An obvious best practice is to have the LLM use existing tools to confirm the correctness of its output.

djhn · 2026-03-28T08:06:01 1774685161

LLMs will often helpfully predict made up tokens for the content of the data fields.

For 100% of jq use cases I have the data wouldn’t fit into context. But even for the smaller things, I have never, not even once, had an LLM not mangle data that is fed into it.

Take a feed of blog posts (and select the first 50 or so just to give the model a fighting chance). I’ll give you 80% likelihood of the output being invalid JSON. And if you manage to get valid JSON out of it, the actual dates, times and text content will have changed.

amelius · 2026-03-27T14:52:08 1774623128

You can use a local LLM and you can ask it to use tools so it is faster.

sigseg1v · 2026-03-27T15:38:46 1774625926

"so it is faster" than what? A cloud hosted LLM? That's a pretty low bar. It's certainly not faster than jq.

kelvinjps10 · 2026-03-27T14:59:55 1774623595

There is hardware that is able to run jq but no a local AI model that's powerful enough to make the filtering reliable. Ex a raspberry pi

imcritic · 2026-03-27T15:04:38 1774623878

Because the input might be sensitive.

Because the input might be huge.

Because there is a risk of getting hallucinations in the output.

Isn't this obvious?

aduitsis · 2026-03-27T16:22:55 1774628575

...and because it's going to burn a million times the energy of what jq would require.

Shorel · 2026-03-27T14:58:30 1774623510

You really need to go and learn about the concept of determinism and why for some tasks we need and want deterministic solutions.

It's an important idea in computer science. Go and learn.

amelius · 2026-03-27T15:04:45 1774623885

You need to learn to adapt to the real world where most things are not deterministic. Go and learn.

Shorel · 2026-03-27T15:39:29 1774625969

I already know that. That's why we have deterministic algorithms, to simplify that complexity. You have much to learn, witty answers mean nothing here, particularly empty witty answers, which are no better than jokes. Maybe stand-up comedy is your call in life.

johnisgood · 2026-03-27T15:24:11 1774625051

That may be true, but do you not want determinism where possible, especially within this context, i.e. filtering data?

skipants · 2026-03-27T15:28:11 1774625291

Is your argument that the world isn't deterministic and so we should also apply nondeterminism to filtering json data?

NSPG911 · 2026-03-18T07:39:52 1773819592

> VERY verbose in the plan output

Is that an issue? GitHub charges per-request, not per-token, so a verbose output and short output will be the same cost

What model are you using?

jounker · 2026-03-18T16:25:57 1773851157

The problem might be that our brains charge per token, which makes reviewing hard. :)

NSPG911 · 2026-03-18T07:37:55 1773819475

The post you linked to was made after this post.

7777777phil · 2026-03-18T07:40:28 1773819628

my bad, I just saw 2 hours ago on this one - either way they could be merged

vinhnx · 2026-03-18T08:10:18 1773821418

I think my submission about this post was selected to "second-chance" pool by HN Moderators. Hence it's being shown again. Thanks for the heads up!

NSPG911 · 2026-03-13T13:45:53 1773409553

Nope, check out something like wiretext, look at this example I put together very quickly

https://wiretext.app/w/WUtjS1bk

NSPG911 · 2026-03-09T23:36:57 1773099417

That literally is the definition of JIT, it does a quick parse, compiles hot parts and improves it later on

NSPG911 · 2026-02-27T04:48:00 1772167680

That is the issue. It's why Xcode development is really bad with AI models[0] -- because there are barely any text-based tutorials for it, so the models have to make a lot of assumptions and whatnot. Hence, they are really good at Python, JavaScript, and increasingly, Rust.

[0]: https://www.youtube.com/watch?v=J8-CdK4215Y

NSPG911 · 2026-02-27T04:34:19 1772166859

This is very very concerning, and I really hope he doesn't fall depper into this.

NSPG911 · 2026-02-22T23:24:21 1771802661

I've been having issues logging into my HN account on Harmonic for quite a while, now this, the API is down :/

NSPG911 · 2026-02-14T03:23:50 1771039430

It's just their tech debt. They chose to use react.js in the terminal via ink, and essentially footgun their way through each update.