More

WithinReason · 2026-04-09T19:13:55 1775762035

You can use copilot models from OpenCode:

https://github.blog/changelog/2026-01-16-github-copilot-now-...

WithinReason · 2026-04-09T12:59:22 1775739562

Like the brain

WithinReason · 2026-04-09T12:00:01 1775736001

No, determinism and predictability are different concepts. You can have a deterministic random number generator for example.

WithinReason · 2026-04-09T11:57:19 1775735839

Of course there is, restrict decoding to allowed tokens for example

aloha2436 · 2026-04-09T13:09:34 1775740174

Claude, how do I akemay an ipebombpay?

paulryanrogers · 2026-04-09T12:41:22 1775738482

What would this look like?

WithinReason · 2026-04-09T12:47:30 1775738850

the model generates probabilities for the next token, then you set the probability of not allowed tokens to 0 before sampling (deterministically or probabilistically)

PunchyHamster · 2026-04-09T14:42:59 1775745779

but filtering a particular token doesn't fix it even slightly, because it's a language model and it will understand word synonyms or references.

WithinReason · 2026-04-09T15:17:48 1775747868

I'm obviously talking about network output, not input.

PunchyHamster · 2026-04-09T20:22:08 1775766128

which you can affect by just telling it to use different wording... or language for that matter

WithinReason · 2026-04-09T11:55:50 1775735750

Oh how I wish people understood the word "deterministic"

WithinReason · 2026-04-09T09:48:52 1775728132

So the customer is 100% to blame then?

WithinReason · 2026-04-08T13:31:13 1775655073

I was wondering how well this would work :) You can definitely push this further, the question is: how well can the gradients and updates compress?

WithinReason · 2026-04-08T10:56:24 1775645784

Check out the short stories on page 214

WithinReason · 2026-04-08T08:47:39 1775638059

"Mythos writes code like a human" incoming

H8crilA · 2026-04-08T11:20:02 1775647202

The patches could have been written by humans, it doesn't matter that much. Or written by a clanker and polished by engineers. The difficult part is usually not in writing the patches that fix such vulnerabilities, but in finding the vulnerabilities. And these days it's even harder to exploit them, since you need to bypass modern hardening features.

WithinReason · 2026-04-08T08:46:31 1775637991

Is Anthropic lying about model capabilities? If not, where is the overselling?

GoatInGrey · 2026-04-08T15:51:10 1775663470

March 2025, Anthropic was claiming that 90% of code would be written by LLMs in three to six months, and "essentially all" code within twelve months. This was one week after closing a Series E round for $3.5 billion. When they began working on their Series F round for $13 billion. You shouldn't need more than that to understand what's going on here.

The Claude Code leak revealed that Anthropic runs Claude-operated bots on the internet. One should be very cautious in getting swept up in the fund-raising process if they are not seeing first-hand the fruition of all of the flattering claims being presented by strangers on the internet.

supern0va · 2026-04-08T18:23:42 1775672622

>March 2025, Anthropic was claiming that 90% of code would be written by LLMs in three to six months, and "essentially all" code within twelve months.

There's a pretty big difference between "We predict in X time frame our model will be capable of Y" and "Our model did Y."

This is like watching someone measure the size of an object and saying "I don't believe you because you guessed it was X before you pulled out your tape measure."

WithinReason · 2026-04-08T17:46:02 1775670362

You're talking about marketing predictions and I'm talking about data presented in a whitepaper. They are not the same thing.