Hacker News | aliljet's comments

This is so cool. I would love to revitalize a generation of great, but perhaps boring older cars with FSD. Just so much work...

Why did Spirit die? Did any of this have to do with their abysmal customer service?

Airlines are not a great business. Margins are thin, and fuel is a significant part of operating costs; if it rises too much in too short a time, the whole model breaks. The less margin you have, the more you are impacted. So if you are operating at the edge by default, a fast move in costs will destroy you.

IAG in 2025 had a record operating margin of 15.1%.

Ryanair's gross profit margin for fiscal years ending March 2021 to 2025 averaged 19.1%.

Some are (were?) doing just fine - in Europe at least.

Sure, it's no Big Tech or banking, but it's not like the single low digit percentage of eg retail.

Perhaps some USA airlines need some advice from across the pond?


The business model works fundamentally differently in the US and Europe due to geography. The US is big, meaning that flights are often longer, meaning that fuel is a bigger portion of the operating cost. And fuel is essentially something airlines can’t reduce the cost of, unlike other operating costs where it might be possible to optimize for greater efficiency.

Europe has passenger trains that work. What would be a short flight in the US, e.g. London to Paris, is done more by train through the Chunnel than by flying, unless you have a connecting flight.

> meaning that flights are often longer

Got any sources?

I found:

Europe average flight length (2024): 1,157km [0]

USA average flight length (I could only find old data, 2005): 1,110km [1] (even if we index this up based on upward trends, maybe another 150km, that doesn't seem a huge difference to me?)

> The US is big

And Europe is big too. It's actually a bit bigger than the USA by land area.

Btw, IAG is a global airline group; only ~32% of IAG's revenue is intra-European and domestic. Another data point: Turkish Airlines (a very long-haul-focused airline) had a net income margin of 12.1% in 2025.

I'm not sure your explanation is sufficient; I don't see what makes the USA the exception. I am certainly willing to accept there are other differences and challenges in the USA, but I don't think they've been presented yet in this discussion.

And remember the original claim was "Airlines are not great business. Margins are not great"

--

EDIT: I found https://www.airportroutes.com/airlines/NKS/ which does highlight that Spirit flew longer routes than Europe's average, at 1,577 km. But using the same source for Ryanair https://www.airportroutes.com/airlines/RYR/ it's 1,456 km, so again, not a huge difference. So comparing two seemingly very similar airlines, the European one has managed both to be profitable and to avoid bankruptcy...

--

[0] https://www.eurocontrol.int/publication/eurocontrol-data-sna...

[1] https://www.hsdl.org/c/view?docid=25985


How are you counting average distances? Simply as the distance between two points in the carrier’s network, or are you looking at the lengths of each individual flight?

The source for the point I made is a Wendover video - Why Budget Airlines are Suddenly Failing


You need to look at things like average distance and median distance, do some filtering for the most common routes (e.g. NYC to LA, San Francisco to Miami, Denver to DC), and consider fuel costs, but also other operating costs. Salaries and everything else cost much more in the US than they do in Europe.

Cost per seat mile is $0.07 for Ryanair and $0.12 for Spirit, not counting fuel. Spirit hovers around 80% capacity while Ryanair is around 94%.

Ryanair's niche is secondary airports, while Spirit was competing with larger airlines at places like LAX, where gate costs are higher.

In 2024-2025 there was an engine problem that required Spirit to ground 40% of its fleet. Meanwhile, they still had to pay for those aircraft with no revenue. This caused a major hit to the financials of a carrier that already runs on thin margins.

I'm sure there's more to it, but these are the larger things I've found.
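A quick back-of-the-envelope check of those numbers (the CASM and load-factor figures are the ones quoted above; treat this as an illustration, not an audited comparison):

```python
# Cost per *filled* seat mile = CASM / load factor: an empty seat still
# costs money to fly, so a lower load factor inflates the real unit cost.
def cost_per_filled_seat_mile(casm_usd: float, load_factor: float) -> float:
    return casm_usd / load_factor

ryanair = cost_per_filled_seat_mile(0.07, 0.94)
spirit = cost_per_filled_seat_mile(0.12, 0.80)
print(f"Ryanair: ${ryanair:.3f} per filled seat mile")   # ~$0.074
print(f"Spirit:  ${spirit:.3f} per filled seat mile")    # $0.150
print(f"Spirit/Ryanair ratio: {spirit / ryanair:.1f}x")  # ~2.0x
```

On these figures the ex-fuel cost per seat actually sold is roughly 2x higher for Spirit, an even bigger gap than the raw CASM numbers suggest.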


Definitely a good point about CASM. Thanks for highlighting it. I do see it's one of the lowest in the industry, though.

Fuel obviously plays a big part. I guess they also got unlucky with the engines (though could they have made better choices? Perhaps a Franco-American engine supplier, like Ryanair? ;)

> Salaries and everything cost much more in the US than they do in Europe.

Doesn't that mean they can charge more? We're regularly told the USA is rich and Europe is poor, so the customers must be able to pay more.

> Spirit hovers around 80% capacity while RyanAir is around 94%.

Spirit could have gotten better at filling seats, perhaps learning from Ryanair. Or is there something in the USA that prevents exceeding 80% capacity? US customers not liking planes beyond 80% full?

It makes me think their business's survival was highly dependent on low fuel prices? So the collapse was a shock to nobody in the industry?


> It makes me think their business surviving was highly dependant on low fuel prices?

Basically.

The entire modern economy depends on the price of gas/diesel/jet fuel staying between $X and $Y. If it goes outside those parameters for too long, everything shuts down: oil gets too expensive or too cheap to extract, refine, and transport, and the money goes other places.


The immediate cause was rising fuel prices. The other issue is that it sounds like it was poorly run.

More generally, it is also a low-cost carrier at a time when, after years of competing on price, airlines are seeing people willing to pay more for a better experience. All other carriers are expanding their premium options, catering to the affluent part of the K economy (for the first time ever, the majority of Delta revenue came from premium cabins rather than main). Meanwhile, Spirit was dealing with the other side of the K, which is also most impacted by increasing inflation, etc., giving Spirit zero ability to raise prices.


> Meanwhile, Spirit was dealing with the other side of the K, which is also most impacted by increasing inflation, etc., giving Spirit zero ability to raise prices.

Ryanair (Europe's biggest and most profitable airline) is managing it OK [0]

What's different about that side of the K in the USA vs Europe?

[0] https://www.bbc.co.uk/news/articles/c620506dvmjo


I can't speak for the EU, but this article was interesting. It sounds like a big part of it is that Ryanair's costs were simply less to start.

https://onemileatatime.com/insights/why-spirit-fail-ryanair-...


Thanks for sharing. Interesting how fuel isn't mentioned once (other comments here have suggested it's mostly to do with fuel); only possibly indirectly via cost per available seat mile (CASM), but AIUI airlines frequently exclude fuel from that.

IMO Spirit's bad business decisions should be acknowledged.


[flagged]


Some of us don't consume the mainstream news and don't fly.

If you didn't know about the war in Iran and the effects it has had on oil and thus jet fuel prices, I'm not sure what you're doing on HN.

This story is about a particular airline failing (out of all the others that aren't). Do you think Spirit airline's situation is something serious I should have been keeping up with? I do drive a car and get gas, and the price increase has been modest but not alarming, in the context of the last decade.

The war in Iran was the final nail in the coffin. But they were running out of cash for the past few years. If the Iran situation was so bad by itself, we would surely see other airlines failing now.

Spirit could simply be the first of several; the effects may also be delayed. WGA isn't looking good either, for a number of reasons.

Just because it's the first doesn't mean it will be the only one. It goes without saying, but apparently you need to be told: there has to be a first, after all. The war is only 2 months in; full clarity won't come for 2-3 years. It's likely several airlines will take hits on their balance sheets that they won't be able to recover from, but they'll fight, go into hardcore refinance mode, or get bailed out before actually going bankrupt; this will remain the ultimate cause.


I wonder whether Spirit failing could push more customers to other airlines and serve to help them stay afloat.

Small regional airline failing isn’t a big news story in my typical parts of the internet.

No, Spirit is/was not a 'small regional'.

You asked if this was caused by or related to bad customer service. This was 100% caused by the increase in jet fuel prices due to the war in Iran. Obviously huge swings in jet fuel prices affect budget carriers more than, say, United or American or Lufthansa or Singapore Airlines, which have many (many) more options when jet fuel prices rise.

Many countries, including many third world countries, have regional airlines. It has nothing to do with America in particular, and the usage of that term is not an American-ism. A good non-American example is Qantas and QantasLink, the latter being a regional airline, and the Aussies refer to it as such.


That really sounds like the US is the only country in the world. Considering the world is bigger, I would call Spirit maybe regional, but not small. Ask some Europeans: basically no one will know Spirit, just as US people may not know e.g. Wizz.

What systems are you actively using? And what systems have you tried? It seems like law, generally, may be hitting a tipping point on LLM use...

This is a tough moment. Claude is simultaneously becoming substantially more expensive, substantially less reliable (single 9 of reliability), and substantially less performant. It's really hard to justify the cost of a subscription over there right now.

There was another thread where some people pointed out that Amazon will give you access to Claude with better uptime for the same price (per million tokens up/down). The downside is that it does not have the native ability to browse the web, but maybe that's a hidden blessing, since it's less likely to read some random website with prompt injection embedded in it.

For coding it's fine. I haven't experimented too much with Amazon Bedrock myself, but I just might soon, to check for any limitations.


Maybe the best play is to set up a routing system locally so that when claude.ai is down it automatically switches to Amazon billing and switches back when it comes back up
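A minimal sketch of that failover idea, assuming you wrap each provider behind the same call signature (the provider stubs here are placeholders, not real client code):

```python
from typing import Callable

# Try the primary provider first; on any error, fall back to the secondary.
# Each new call tries the primary again, so recovery is automatic.
def make_router(primary: Callable[[str], str],
                fallback: Callable[[str], str]) -> Callable[[str], str]:
    def route(prompt: str) -> str:
        try:
            return primary(prompt)
        except Exception:
            return fallback(prompt)
    return route

# Stand-ins for real clients (e.g. Anthropic's API vs. Bedrock).
def claude(prompt: str) -> str:
    raise RuntimeError("claude.ai is down")  # simulate an outage

def bedrock(prompt: str) -> str:
    return f"bedrock: {prompt}"

ask = make_router(claude, bedrock)
print(ask("hello"))  # -> bedrock: hello
```

In practice you would swap the stubs for real API clients and probably add retry/backoff before failing over, but the routing logic stays this small.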

I’m pretty sure it has the ability to browse the web.

It can use playwright, web fetch, etc…

I use bedrock at work and Claude subscription at home. They are pretty much exactly the same in my experience

Or do you mean the Claude in chrome plugin? Bedrock doesn’t have that, but in my experience it doesn’t work that well.

Neither does the Claude managed agents or ultra plan.


They likely refer to "WebSearch", not "WebFetch" (and the original statement is not correct).

But that's just paying per use, right? Not with the subscription, which is way better value.

Correct, but in the case they brought up, their employer was on an enterprise license, which is still pay-per-token. The subscription will eventually go away in some form, or cost way more than it does now.

From an economics perspective, it makes sense to make it more expensive if you're having trouble keeping up with demand for a service. It'll be tough getting used to because it was so nice and cheap

On the other hand, it was somewhat expected that we would have a correction for the prices. Hopefully after this correction things will be more stable and we won't have to worry too much about future price increases

the prices will slowly increase until enough people actually stop paying for it.

YMMV. I would still be very happy with Claude if it hard failed on 20% of tasks. You can always come back to it.

I say this as someone working for a tech company who does not have to foot the bill (in the >$1k per month bracket)

I also experienced, and accept, 1990s levels of unreliability; that's my "internet generation". My first internet access was lifting a handset and placing it on a speaker/mic cradle.

Programmers these days are fucking spoiled. If it’s $220 worth of value for $200 - I get it. But I’m getting $100k of value for $10k and so I’ll put up with some shit.


> If it’s $220 worth of value for $200 - I get it.

Wrong comparison. If a competitor gives you $230 of value for $200, of course you shouldn't pick the $220 one


Well, you can get a much bigger portion for much cheaper next door, but taste is hard to quantify.

or just use codex...

We used to describe our startup as having 5 8’s of uptime

Not to mention substantially less open. I've been using an OpenAI subscription in Pi Agent for a couple weeks now and it's great. And from what I can tell, 5.5 is a heck of a model.

I'm either extremely lucky or Dario ran the direct fiber to my house because I have never had it go down in any meaningful way..

Is this just the API, and I'm too much of a luddite to actually use the API?


Dude dario definitely ran the fiber straight to your place personally. Everything is fine and this is such a good thing.

Interestingly, yeah, I can see that this would really cut into your subscription usage with the 5 hour rate limit windows...

I am an API user, and while it being down is super annoying, it isn't really as big of a hit to my overall usage as I can just prepare a bunch of stuff to run in parallel when it does come back up.


Don't say single nine, it sounds ugly and bad.

Say five eights of reliability. Maybe six.


We're talking about Claude, not GitHub...

that would be eight fives...

Plus, they've dumbed down their models to the point where the value just isn't there like it was. If I have to go in and clean up after it, or constantly wrestle with it through prompts, what's the point? Just spending $200 a month to be frustrated at a machine.

It's lazy, does not take ownership and responsibility, wants to defer work, and I have to force it to check reality. It likes to guess and assume it's correct and I am wrong. Agents.md is not helping at all. It's in full enshittification phase, yay!

Single nine has good vibes bro. It means when the service is up the results are better. I read about it in a blog. The model hallucinates way less. Even less than grok

I wonder how this kind of response from Anthropic is actually being read by the community at large. If you consider the rough sentiment of the r/ClaudeCode subreddit against the r/Codex subreddit, you can see that there is a definite loudness among the folks departing ClaudeCode for Codex. Something big is shifting on the ground, I think.

I'm not really sure what to do here. I refuse to give Altman money, but Anthropic keeps disappointing me over and over with crap like this. Gemini seems behind? Not touching Grok.

Meanwhile I've integrated CC into my workflow enough that I'd feel frustrated cutting out all LLM agent use.

I don't have the hardware to run models locally, and I'm not excited about the idea of spending that money. I could use a different harness with one of the services that runs open-weight models for me, but I feel like the cost would be prohibitive. I'm paying $100/mo right now and that's all I'm willing to spend.


GLM5.1, Kimi K2.6, MiniMax M2.7

Personally, I tried the GLM subscription. Bought it during the New Year discount: $36 for a YEAR.

I cannot burn through the tokens with personal-project use. From what I can see in the stats, they allow 25-100M tokens per 5h period (on the cheapest plan), depending on the model. GLM5.1 can be a bit slower and likes to (over)think, but I don't see practical differences from Sonnet 4.6 or Opus 4.6.

> I refuse to give Altman money, but Anthropic keeps disappointing me over and over with crap like this. Gemini seems behind? Not touching Grok.

My thought process is exactly the same. And even though there's a slight concern about the ethics of using GLM, at least in my conscience, OpenAI is worse and Grok is the worst of them all by far, no competition.


I'm not sure if the context limit on the $25/m plan, and the model-size limit on the $100/m plan, would make it not work well enough for OpenCode, but Featherless AI seems a bit unique in how they handle their inference plans.

Why is this being made public?

It’s an agreement between a public company and a highly scrutinized private company. Several of the provisions will change what happens in the marketplace, which everyone will see.

I imagine the thinking was that it’s better to just post it clearly than to have rumors and leaks and speculations that could hurt both companies (“should I risk using GCP for OpenAI models when it’s obviously against the MS / OpenAI agreement?”).


Also it's about OpenAI going public.

Might have something to do with the MSFT quarterly report tomorrow

How can you reasonably try to get near the frontier (at any tps at all) on hardware you own? Maybe under $5k in cost?


Look at GB/s.

Strix halo has 256 GB/s bandwidth for $2500. The Flash model has 13 GB activations.

256 / 13 = 19.6 tokens per second

Except you cannot fit it into the 128 GB maximum RAM that Strix Halo supports. So move on.

Another option is Threadripper. That's 8 memory channels. Using older DDR4-3200 you get roughly 200 GB/s. For $2000.

200 / 13 = 15.4 tokens per second

But a chunk of the per-token weights (the dense, non-MoE part) is always the same, so you would offload that to a GPU and get a decent speedup. Say 25 tokens per second total.

Then likely some expensive Mac. No idea.

Eventually you arrive at a mining rig chassis with a beefy board and multiple GPUs. That has the benefit of pipelining: you run part of the model on one GPU and move on, so another batch can start on the first one. Low (say 30-100) tps individually, but a lot more in parallel. Best to get it with other people.
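The arithmetic above generalizes: for a memory-bandwidth-bound MoE model, every generated token has to stream the active weights from RAM once, so bandwidth divided by active-weight bytes gives a rough ceiling on single-stream decode speed. A sketch using the figures from this thread:

```python
# Rough upper bound on decode speed for a bandwidth-bound MoE model:
# tokens/s <= memory bandwidth / bytes of active weights.
# Ignores KV-cache reads, attention compute, and batching, so real
# numbers will come in somewhat lower.
def max_tokens_per_sec(bandwidth_gb_s: float, active_weights_gb: float) -> float:
    return bandwidth_gb_s / active_weights_gb

print(f"Strix Halo:   {max_tokens_per_sec(256, 13):.1f} tok/s")  # ~19.7
print(f"Threadripper: {max_tokens_per_sec(200, 13):.1f} tok/s")  # ~15.4
```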


For flash? 4 bit quant, 2x 96GB gpu (fast and expensive) or 1x 96GB gpu + 128GB ram (still expensive but probably usable, if you’re patient).

A mac with 256 GB memory would run it but be very slow, and so would be a 256GB ram + cheapo GPU desktop, unless you leave it running overnight.

The big model? Forget it, not this decade. You can theoretically load from SSD but waiting for the reply will be a religious experience.

Realistically the biggest models you can run on local-as-in-worth-buying-as-a-person hardware are between 120B and 200B, depending on how far you’re willing to go on quantization. Even this is fairly expensive, and that’s before RAM went to the moon.


Flash is less than 160 GB. No need to quantize to fit in 2x 96 GB. Not sure how much context fits in 30 GB, but it should be a good amount.


It seems to be 160GB at mixed FP4+FP8 precision, FYI. Full FP8 is 250GB+. (B)F16 at around double I would assume.


There is no BF16. There is no FP8 for the instruct model. The instruct model at full precision is 160 GB (mixed FP4 and FP8). The base model at full precision is 284 GB (FP8). Almost everyone is going to use instruct. But I do love to see base models released.


The same way you fit a bucket wheel excavator in your garage


Very carefully


Run on an old HEDT platform with a lot of parallel attached storage (probably PCIe 4) and fetch weights from SSD. You'd ultimately be limited by the latency of these per-layer fetches, since MoE weights are small. You could reduce the latencies further by buying cheap Optane memory on the second-hand market.


A loaded macbook pro can get you to the frontier from 24 months ago at ~10-40tok/s, which is plenty fast enough for regular chatting.


The low end could be something like an eBay-sourced server with a truckload of DDR3 RAM doing all-CPU inference; secondhand servers with a terabyte of RAM can be had for about $1.5K. The TPS will be absolute garbage and it will sound like a jet engine, but it will nominally run.

The flash version here is 284B A13B, so it might perform OK with a fairly small amount of VRAM for the active params and all regular ram for the other params, but I’d have to see benchmarks. If it turns out that works alright, an eBay server plus a 3090 might be the bang-for-buck champ for about $2.5K (assuming you’re starting from zero).


More like 500k


Mythos is only real when it's actually available. If you're using Opus 4.7 right now, you know how incredibly nerfed the Opus autonomy is in service of perceived safety. I'm not so confident this will be as great as Anthropic wants us to believe..


I've found myself so deeply embedded in the Claude Max subscription that I'm worried about potentially making a switch. How are people making sure they stay nimble enough not to get trapped by one company's ecosystem over another? For what it's worth, Opus 4.7 has not been a step up, and it's come with enormously higher usage of the Anthropic subscription, making the entire offering twice as bad.


Start building your own lightweight "harness" that does the things you need. Ignore all functionality of clients like CC or Codex and just implement whatever you start missing in your harness.

You can replace pretty much everything - skills system, subagents, etc with just tmux and a simple cli tool that the official clients can call.

Oh and definitely disable any form of "memory" system.

Essentially, treat all tooling that wraps the models as dumb gateways to inference. Then provider switch is basically a one line config change.


lol this is literally the same advice us ancient devops nerds were telling others back when ci/cd was new

write scripts that work anywhere and have your ci/cd pipeline be a "dumb" executor of those scripts. unless you want to be stuck on jenkins forever.

what's old is new again!


> You can replace pretty much everything - skills system, subagents, etc with just tmux and a simple cli tool that the official clients can call.

I'm very interested in this. Can you go a bit more into detail?

ATM for example I'm running Claude Code CLI in a VM on a server and I use SSH to access it. I don't depend on anything specific to Anthropic. But it's still a bit of a pain to "switch" to, say, Codex.

How would that simple CLI tool work? And would CC / Codex call it?


Check out github.com/ralabarge/beigebox -- an OSS AI harness. It started as a way to save all of my data, but it has agentic features and an MCP server; point it at any endpoint (or use any front end with it, as transparent middleware).

So far what I am finding is that you just get the basics working and then use the tool and inference to improve the tool.


Not the OP but here is a good example: https://mariozechner.at/posts/2025-11-30-pi-coding-agent/

Initially I read it just because it was interesting, but it has ended up being the harness I have stuck with: pi is well designed, nicely extensible, and supports many model-provider APIs. Though sadly Gemini's and Claude's subscriptions can't really be used with it anymore, thanks to openclaw.


I wish I had lower standards towards sharing absolute AI slop; then I could just drop a link to my implementation. But since I don't, let me just describe it. I essentially had Claude build the initial version in a single session, which I've been extending as I noticed gaps in my process.

First, you need an entrypoint that kicks things off. You never run `claude` or `codex`; you always start by running `mycli-entrypoint`, which:

1. Creates a tmux session

2. Creates a pane

3. Spawns claude/codex/gemini, whichever your default configured backend is

4. Automatically delivers a prompt (essentially a 'system message') to that process via tmux paste, telling it what `mycli` is, how to use it, what commands are available, and that it should never use the built-in tools that this cli provides alternatives for

After that, you build commands in `mycli` that CC/Codex are prompted to call when appropriate.

For example, if you want a "subagent", you have a `mycli spawn` command that takes a role (just a preconfigured markdown file living in the same project), a backend (claude/codex/...), and a model. Then whenever CC wants to spawn a subagent, it calls that command instead, which creates a pane, spawns a process, and returns an agent ID to CC. The agent ID is auto-generated by your cli, and the tmux pane is renamed to it so you can easily match them later.

Then you also need a way for these agents to talk to each other. So your cli also has a `send` command that takes agent ID and a message and delivers it to the appropriate pane using automatically tracked mapping of pane_id<>agent_id.

Claude and Codex automatically store everything that happens in a session as jsonl files in their config dirs. Your cli should have adapters for each backend that parse them into a common format.

At this point, your possibilities are pretty much endless. You can have a sidecar process per agent that, say, detects when the model is reaching its context-window limit (it's in the jsonl) and automatically sends it a message asking it to wrap up and report to a supervisor agent that will spawn a replacement.

I also don't use "skills", because skills are a loaded term that each harness interprets and loads differently. So I call them "crafts", which are again just markdown files in my project, each with an ID, plus a supporting command `read-craft <craft-id>`. The list of available "crafts" is delivered in the same initialization message that each agent gets. If I like a third-party skill, I just copy it into my "crafts" dir manually.

My implementation is absolute junk, just Python + markdown files, and I have never looked at the actual code, but it works and I can adapt it to my process very easily without being dependent on any third-party tool.
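For a concrete flavor of the tmux side of this, here is a rough sketch of what such a `spawn` command could look like. The names (`mycli`-style helpers, role strings) and the exact tmux wiring are illustrative assumptions, not the commenter's actual implementation, and it assumes tmux is installed:

```python
import subprocess
import uuid

def new_agent_id(role: str) -> str:
    # Agent ids double as tmux pane titles, so keep them short and unique.
    return f"{role}-{uuid.uuid4().hex[:8]}"

def spawn_agent(session: str, backend: str = "claude", role: str = "worker") -> str:
    agent_id = new_agent_id(role)
    # Split a new pane in the target session; -P -F prints the new pane's id.
    pane_id = subprocess.check_output(
        ["tmux", "split-window", "-t", session, "-P", "-F", "#{pane_id}"],
        text=True).strip()
    # Title the pane with the agent id so panes map back to agents.
    subprocess.run(["tmux", "select-pane", "-t", pane_id, "-T", agent_id],
                   check=True)
    # Start the backend CLI in the new pane.
    subprocess.run(["tmux", "send-keys", "-t", pane_id, backend, "Enter"],
                   check=True)
    return agent_id

def send(pane_id: str, message: str) -> None:
    # Deliver a message to an agent by typing it into its pane.
    subprocess.run(["tmux", "send-keys", "-t", pane_id, message, "Enter"],
                   check=True)
```

A real version would also persist the pane_id<>agent_id mapping to disk so `send` can look panes up by agent ID.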


I have a directory of skills that I symlink into Codex/Claude/pi. I write scripts that correspond with them to do any heavy lifting, and I avoid platform-specific features like Claude's hooks. I also symlink/share a user-level AGENTS.md/CLAUDE.md.

MCPs aren't as smooth, but I just set them up in each environment.


Anecdotally, I get the same wall time with my Max x5 ($100) and my ChatGPT Teams ($30) subscriptions.


It's surprisingly simple to switch. I mean both products offer basically identical coding CLI experiences. Personally I've been paying for Claude max $100, and ChatGPT $20, and then just using ChatGPT to fill in the gaps. Specifically I like it for code review and when Claude is down.


Try GPT-5.5 as your daily driver for a bit. It felt a lot smarter, reliable, and I was much more productive with it.


I bumped from $20 -> $100 today, but the Codex CLI lacking code rewind and Claude Code's "you can change files but ask me every time" mode is quite annoying. Sometimes I want to code, not vibe code lol.


I use Open Code as my harness. It's open source, bring your own API Key or OAuth token or self-hosted model. I've jumped from Opus 4.6 to Opus 4.7 to GPT 5.5 in the last 7 days. No big deal, intelligence is just a commodity in 2026.

The actual harness is great, very hackable, very extendable.


Does Anthropic not actively ban people using oauth tokens in non-claude-code harnesses?

Yeah, for direct to Claude you need an API key. You can use other subscriptions like GitHub Copilot that expose Claude, but that path has been blocked.

I use pi.dev.

I get openai team plan at work.

Claude enterprise too.

I have openrouter for myself.

I use Minimax 2.7, Kimi 2.6, GPT 5.5, and Opus 4.7. I can toggle between them in an open-source interface; that's how I avoid being trapped.

Minimax is so cheap, and for personal stuff it works fine. So I'm always toggling between the new releases.


what about just personal stuff in a syncing interface, what do you use for that?


What's a syncing interface?

As a rule I've been symlinking or referencing generic "agents" versions of claude workflow files instead of placing those files directly in claude's purview

AGENTS.md / skills / etc


What is the switching cost besides launching a different program? Don’t you just need to type what you want into the box?


Small tip, at least for now you can switch back to Opus 4.6, both in the ui and in Claude Code.


This might be the opposite of staying nimble as my workflows are quite tied to Claude Code specifically, however I've been experimenting with using OpenAI models in CC and it works surprisingly well.


I use Conductor which lets me flip trivially between OpenAI/Anthropic models


It’s good to just keep trying different ones from time to time.


Except for history, I don’t find much that stops you from switching back and forth on the CLI. They both use tools, each has a different voice, but they both work. Have it summarize your existing history into a markdown file, and read it in with any engine.

The APIs are pretty interchangeable too. Just ask to convert from one to the other if you need to.


use copilot and have access to all models


Coding models are effectively free. They are capable of making money and supporting themselves given access to the right set of things. That is what I do


I switched a couple of weeks ago just to see how it went. Codex is no better or worse. They’re both noticeably better at different things. I burn through my tokens much much faster on Codex though. For what it’s worth I’m sticking with Codex for now. It seems to be significantly better at UI work although has some really frustrating bad habits (like loading your UI with annoying copywriting no sane person would ever do).


How many levels of agents are there? Agents reviewing code by agents, in a system driven by agents, vibed by one lonely engineer in Redmond?


Introducing Microsoft Teams Turducken 2026 (Enterprise AI Agent Edition) now with 17 layers

https://www.npr.org/sections/thesalt/2014/11/21/365509503/th...

