Hacker Newsnew | past | comments | ask | show | jobs | submit | EugeneOZ's commentslogin

Location: Spain

Remote: Yes, only remote

Willing to relocate: No

Résumé/CV: https://www.linkedin.com/in/newmanoz/ , https://jamm.dev/resume.pdf

Email: normandiggs@gmail.com , oz@jamm.dev

I am a web developer with 21 years of experience (you can read my blog here: https://medium.com/@eugeniyoz).

By combining my technical background with the strategic use of AI agents, I adapt quickly to any tech stack. I review every line of generated code to ensure strict alignment with current best practices.


There are open-source alternatives:

https://mochi1ai.com/

https://wan.video/

and others. There are free to use tools also.


This market will not be abandoned, and other tools already exist:

https://klingai.com/global/

https://aistudio.google.com/models/veo-3

https://runwayml.com


I don't know - it works okay (yet to be tested whether it is actually smarter than Opus 4.6), but it is not bad at all. So far, it works quite fine (I'm not testing the "fast" version).


Not in my experience. Quoting my tweet:

Gave the same prompt to GPT 5.4 (high) and Opus 4.6 (high).

GPT 5.4 implemented the feature, refactored the code (was not asked to), removed comments that were not added in that session, made the code less readable, and introduced a bug. "Undo All".

Opus 4.6 correctly recognized that the feature is already implemented in the current code (yeah, lol) and proposed implementing tests and updating the docs.

Opus 4.6 is still the best coding agent.

So yeah, GPT 5.4 (high) didn't even check if the feature was already implemented.

Tried other tasks, tried "medium" reasoning - disappointment.


I make ChatGPT and Claude code review each other's outputs. ChatGPT thinks its solutions are better than what Claude produces. What was more surprising to me is that Claude, more often than not, prefers ChatGPT's responses too.

I am to sure one can really extrapolate much out of that, but I do find it interesting nonetheless.

I think language is also an important factor. I have a hard time deciding which of the two LLMs is worse at Swift, for example. They both seem equally great and awful in different ways.


I do the same (I have both review a piece of code), and Codex tend to produce more nitpicky feedback. Opus usually agrees with it on around half the feedback, but says that the other half is too nitpicky to implement. I generally agree with Opus' assessment, and do agree that Codex nitpicks a lot.

I can't even use Codex for planning because it goes down deep design rabbit holes, whereas Opus is great at staying at the proper, high level.


Is this sample size of one task, or a consistent finding across many tasks?


I do, 100%, every line.


Location: Spain

Remote: Yes, only remote

Willing to relocate: No

Résumé/CV: https://www.linkedin.com/in/newmanoz/ , https://jamm.dev/resume.pdf

Email: normandiggs@gmail.com , oz@jamm.dev

I have more than 20 years of experience in webdev, here is my blog: https://medium.com/@eugeniyoz

I actively use coding agents and carefully review every generated line.

Main technologies: Angular, Rust, TypeScript, MySQL, PostgreSQL.

Types of projects I've worked on:

* CRM - Tickets, chat, dozens of dynamically modifiable reports, charts, editable tables, Rust REST API.

* Warehouse Management System - Barcode scanner, mobile app (Ionic), dashboard app.

* Vodafone Site Management - App for managing over 100 types of devices (with dynamically generated forms for each type), geolocation tools, 3D room editor (three.js), charts, and real-time data synchronization across tabs.

* SAP/Spartacus plugin for the Sony e-shops.

* correkt.com - e-commerce site with modern UI, SSR, zoneless architecture, Tailwind CSS 4, hydration, and various performance, SEO, and Core Web Vitals optimizations;

* surex.com (frontend) - insurance survey app with multiple complicated forms, rewritten from AngularJS to modern Angular.

* and 30+ other projects, from small startups and individuals to large enterprise companies.


It depends on how much value their talents can bring to humankind, I guess.


Very good guess, right on the money

Too bad humankind is almost never paying attention.


Just checked - not blocked, works just fine (Adamo and Vodafone).


Adamo never blocks, at least for me. Vodafone does.


Still not usable in production, not even near. But I'm happy to see any progress in this area.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: