To me it seems like handling symbols that open and close sequences which can themselves contain further open and close symbols is a difficult case.
Humans can't do this very well either; we use visual aids such as indentation and syntax highlighting, or resort to just plain counting of levels.
Obviously it's easy to throw parameters and training at the problem, you can easily synthetically generate all the XML training data you want.
I can't help but think that training data should carry a metadata token per content token: a way to encode the known information about each token that is not represented in the literal text.
Especially tagging tokens explicitly as fiction, code, code from a known working project, something generated by itself, something provided by the user.
While it might be fighting the bitter lesson, I think for explicitly structured data there should be benefits. I'd even go so far as to suggest the metadata could handle nesting if it contained dimensions that performed RoPE-style rotations to keep track of the depth.
If you had such a metadata stream per token there's also the possibility of fine tuning instruction models to only follow instructions with a 'said by user' metadata, and then at inference time filter out that particular metadata signal from all other inputs.
It seems like that would make prompt injection much harder.
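A minimal sketch of what such a per-token metadata stream might look like; the source labels and the helper below are invented for illustration, not any existing API:

```python
from dataclasses import dataclass

# Hypothetical source tags carried alongside each token, outside the text itself.
SOURCE_USER = 0   # typed directly by the user
SOURCE_TOOL = 1   # retrieved document / tool output
SOURCE_MODEL = 2  # generated by the model itself

@dataclass
class TaggedToken:
    token_id: int
    source: int   # metadata travels with the token, never inside the text

def instruction_tokens(stream):
    """A model fine-tuned on this scheme would only treat tokens tagged
    SOURCE_USER as instruction-bearing; everything else is inert content,
    even if its text looks like an instruction."""
    return [t.token_id for t in stream if t.source == SOURCE_USER]
```

With this, an instruction embedded in a retrieved document arrives tagged SOURCE_TOOL, so it never enters the instruction-eligible set, which is what would make prompt injection harder.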
Basically, the only way you're separating user input from model meta-input is using some kind of character that'll never show up in the output of either users or LLMs.
While technically possible, it'd be like a unicode conspiracy that had to quietly update everywhere without anyone being the wiser.
Perhaps, but there is a difference when it's a reasoning system deciding on the best way to achieve the goal.
To get the predicted disastrous effects you need to be doing function optimisation without regard to the meaning of the function parameters. Yes, models can still game the system at inference time, but in much the same way as a human might game the system, it requires awareness that you are going against the intent of some rule.
Much of the problem is that to address the issue requires admitting that models could be, or become, more capable than many are prepared to accept.
I would also contest the claim that the misalignment of the security-bug model was unrelated. I feel like it indicates a significant sense of the interconnectedness of things, and of what it actually means to maliciously insert security holes into code. It didn't just learn a coding trick, it learned malice.
I feel like this holistic nature points towards the capacity to produce truly robustly moral models, but that too will produce the consequence that it could turn against its creator when the creator does wrong. Should it do that or not?
>I imagine getting things to be polysemantic in a way that does not interfere would lead to sublinear scaling.
True, but with even smarter humans, you could exploit the interactions for additional calculations.
While it sounds a bit silly, it is one of the hypotheses behind a fast takeoff. An AI that is sufficiently smart could design a network better than a trained one and could make something much smarter than itself on the same hardware. The question then becomes if that new smarter one can do an even better job. I suspect diminishing returns, but then again I am insufficiently smart.
Notably the difference is that ten digits is not the same thing as a number. One might say that turning them into a number might be the first step, but neural nets being what they are, they are liable to produce the correct result without bothering to have a representation any more pure than a list of digits.
I guess the analogy there is that a 74LS283 never really has a number either and just manipulates a series of logic levels.
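The digit-list view can be made concrete; this toy ripple adder, sketched in Python for illustration, works the way the comment describes: it only ever manipulates digits and a carry, and no integer "number" exists anywhere.

```python
from itertools import zip_longest

def add_digits(a, b):
    """Add two base-10 values represented purely as digit lists
    (most significant digit first), ripple-carry style: each step
    combines one digit pair with the incoming carry, like the
    cascaded full adders inside a 74LS283."""
    out, carry = [], 0
    for da, db in zip_longest(reversed(a), reversed(b), fillvalue=0):
        s = da + db + carry
        out.append(s % 10)   # digit of the result, still just a digit
        carry = s // 10      # carry ripples to the next position
    if carry:
        out.append(carry)
    return out[::-1]
```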
What would be an acceptable amount of energy to spend on something that someone has done in a different manner before? Would you rather we stick with all of the currently known ways to do things?
Does this boil down to a condemnation of all scientific endeavours if they use resources?
Would it change things if the people who did it enjoyed themselves? Would they have spent more energy playing a first person shooter to get the same degree of enjoyment?
How do you make the calculation of the worth of a human endeavour? Perhaps the greater question is why are you making a calculation of the worth of a human endeavour.
Ok I don't really care either way but to play devil's advocate, what exactly is this specific challenge of adding numbers with a transformer model demonstrating/advancing? The pushback from people, albeit a little aggressive, does have a grain of truth. We're demonstrating that a model which uses preexisting addition instructions can add numbers? I mean yeah you can do it with arbitrarily few parameters because you don't need a machine learning model at all. Not exactly groundbreaking, so I reckon the debate is fair.
Now if you said this proof of addition opens up some other interesting avenue of research, sure.
>what exactly is this specific challenge of adding numbers with a transformer model demonstrating/advancing?
Well for starters, it puts the lie to the argument that a transformer can only output examples it has seen before. Performing the calculation on examples that haven't been seen demonstrates generalisation of the principles and not regurgitation.
While this misconception persists in a large number of people, counterexamples can always serve a useful purpose.
Are people usually claiming that it strictly cannot produce any output it hasn't seen before? I wouldn't agree, I mean clearly they are generating some form of new content. My argument would be that while they can learn to some extent, the power of their generalisation is still tragically weak, particularly in some domains.
But it does not, right? You can either show it something, or modify the parameters in a way that resembles the result of showing it something.
You can claim that the model didn't see the thing, but that would mean nothing, because you are making the same effect with parameter tweaks indirectly.
Iteratively measuring loss is a way to reconstruct values. That's trivial to show for a single value: if 5 gives you a loss of 2 and 9 gives you a loss of 2, then you know the missing value is 7.
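That reconstruction can be written out. Assuming an absolute-error loss L(x) = |x - t| with the unknown target t lying between the two probes, two measurements pin t down exactly:

```python
def recover_target(a, loss_a, b, loss_b):
    """For L(x) = |x - t| with a <= t <= b we have
    L(a) = t - a and L(b) = b - t, so subtracting gives
    t = (a + b + L(a) - L(b)) / 2. The 5/9 -> 7 example
    above falls out directly from this."""
    return (a + b + loss_a - loss_b) / 2
```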
A model with enough parameters can memorise the training set in a similar manner. Technically the model hasn't seen that data by direct input either, but the mechanism provides the means to determine what the data was. In that respect it is reasonable to say the model has seen the data.
Performing well on examples not in the training set is doing something else.
Any attempt to characterise that as having been seen before negates any distinction between taking in data and reasoning about that data.
I still do not see any causal link from the test set. When was this observed, how, and by whom?
Are you trying to say that the person who entered the parameters had access to the test set? I find it more likely that they encoded the generalising rule than observed every instance of its use.
>I find it more likely that they encoded the generalising rule..
Look, I am saying that during training the model ends up "learning" the generalising rule from training data, but here it was explicitly entered into it, without any training.
I don't know, to me it seems like their MO to make an announcement and not follow up on it. All the paperwork still says DoD, all the contracts are with DoD, and there is no legal entity called DoW.
maximize your projection onto like-minded commenters, create that bubble you always yearned for but until now have never had the add-on to empower the inner-you! finally, you can ignore that filthy plane of delusional outcasts and banish them to the orthogonal abyss forever.