Hacker News | stefanka's comments

Can you build a version of chromium where this will just return false always?

Are all of OpenAI’s ip ranges known?

The article is from 2024. Is this still happening?

Do we have any evidence they started complying?

If not, we can conclude they did not, until such evidence shows up.


I’m genuinely curious whether there was a change in behavior, especially after OpenAI published guidance on how to prevent scraping (robots.txt, etc.).

I am as well. Like, is there any evidence of a change, or can we assume nothing changed?

They cannot create original content.

Well, they can make some up, like hallucinations. That's an additional problem: when the original site that provided the training data is gone, how can they verify the AI output to make sure it's correct?

Godot on the Quest allows you to develop on the device, which is at least cool even if it makes little sense. You’d see the virtual world around you adapt to the changes in the editor. That was one of the reasons I bought it, even if I never used it in the end.


Digital sovereignty. Europe is a big market and Motorola could gain traction this way


Cool, I hadn't seen `graphics` when I was looking for a simple UI/3D visualization option after rend3 was abandoned. I have been considering bevy/egui too, but they seem like more effort to learn.


Where does one find a good robots.txt? Are there any well-maintained ones out there?



Cloudflare actually offers this as a free-tier feature, so even if you don't want to use it for your site, you can set up a throwaway domain on Cloudflare and periodically copy the robots.txt it generates from your scraper allow/block preferences, since they keep it up to date.
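Whichever robots.txt you end up copying, it may be worth checking that it blocks what you expect before deploying it. Here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `is_allowed` helper, and the example URL are made up for illustration, not taken from Cloudflare's generated file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the rules here are illustrative assumptions.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "SomeBrowser"):
        verdict = "allowed" if is_allowed(ROBOTS_TXT, agent, "https://example.com/page") else "blocked"
        print(f"{agent}: {verdict}")
```

Keep in mind that robots.txt is advisory: this only tells you what a well-behaved crawler would do, not whether a given scraper actually complies.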


Thank you! Zulip is a great project.


Another excellent source is https://quaternius.com/

