Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I certainly don't run 6 at a time, but even with just 1 - if it's doing anything visual - how are folks hooking up screenshots to self verify? And how do you keep an eye on it?

The only solution I've seen on a Mac is doing it on a separate monitor.

I couldn't find a solution here and have built similar things in the past so I took a crack at it using CGVirtualDisplay.

Ended up adding a lot of productivity features and polished until it felt good.

Curious if there are similar solutions out there I just haven't seen.

https://github.com/jasonjmcghee/orcv

 help



For macOS, generically, you can run `screencapture -o -l $WINDOW_ID output.png` to screenshot any window. You can list window IDs belonging to a PID with a few lines of Swift (that any agent will generate). Hook this up together and give it as a tool to your agents.

And the compositor renders it unoccluded for the screenshot?

for anything web related, simply with the chrome claude plugin. then claude code can control the browser (and 'see' what's showing),



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: