stefanka · 2026-04-02T15:36:10 1775144170

Can you build a version of Chromium where this will just always return false?

stefanka · 2026-03-30T15:55:12 1774886112

Are all of OpenAI’s ip ranges known?

stefanka · 2026-03-30T08:33:24 1774859604

The article is from 2024. Is this still happening?

ImPostingOnHN · 2026-03-30T12:08:52 1774872532

Do we have any evidence they started complying?

If not, we can conclude they did not, until such evidence shows up.

stefanka · 2026-03-30T15:24:01 1774884241

I’m genuinely curious whether there was a change in behavior, especially after OpenAI explained how to prevent scraping (robots.txt, etc.).

ImPostingOnHN · 2026-03-30T15:29:45 1774884585

I am as well. Like, is there any evidence of a change, or can we assume nothing changed?

stefanka · 2026-03-30T08:08:27 1774858107

They cannot create original content.

wolvoleo · 2026-03-30T08:27:59 1774859279

Well, they can make some up, like hallucination. That's an additional problem: when the original site that provided the training data is gone, how can they verify the AI output to make sure it's correct?

stefanka · 2026-03-18T17:46:44 1773856004

Godot on the Quest allows you to develop on the device, which is at least cool even if it makes little sense. You’d see the virtual world around you adapt to the changes in the editor. That was one of the reasons I bought it, even if I never used it in the end.

stefanka · 2026-03-04T05:57:37 1772603857

Digital sovereignty. Europe is a big market and Motorola could gain traction this way

stefanka · 2026-02-13T17:10:23 1771002623

Cool, I hadn't seen `graphics` before when I was looking for a simple UI/3D visualization option after rend3 was abandoned. I have been considering bevy/egui too, but they seem like more effort to learn.

stefanka · 2026-02-11T17:52:04 1770832324

Where does one find a good robots.txt? Are there any well-maintained ones out there?

dmit · 2026-02-11T22:48:21 1770850101

https://github.com/ai-robots-txt/ai.robots.txt

skrtskrt · 2026-02-11T22:04:48 1770847488

Cloudflare actually has this as a free tier feature, so even if you don't want to use it for your site you can just set up a throwaway domain on Cloudflare and periodically copy the robots.txt they generate from your scraper allow/block preferences, since they'll be keeping up to date with all the latest.
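
Whichever source the robots.txt comes from, it can be sanity-checked locally before deploying. Below is a minimal sketch using Python's standard `urllib.robotparser`; the `ROBOTS_TXT` contents, the `is_allowed` helper, and the example.com URL are made up for illustration and are not taken from the ai.robots.txt list or from Cloudflare's generated file. Note that this only checks what the file asks for; it says nothing about whether a crawler actually obeys it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only: one AI crawler blocked,
# everything else allowed. A real list would name many more agents.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "SomeBrowser"):
        allowed = is_allowed(ROBOTS_TXT, agent, "https://example.com/post")
        print(agent, "allowed" if allowed else "blocked")
```

The same helper could be run against a copied or generated file after each update to confirm that the agents you meant to block are still covered.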

stefanka · 2026-02-09T21:25:59 1770672359

Thank you! Zulip is a great project.

stefanka · 2026-01-30T19:18:03 1769800683

Another excellent source is https://quaternius.com/