It has been that way for a while now. I see Veritasium video titles and thumbnails change quite often; it can be quite annoying, as it sometimes gives the appearance of a whole new video.
A/B testing a title feels wrong to me; it's almost as bad as A/B testing a UUID.
Just pick a title and stick to it unless you need to fix a factual error.
Right, but then there's this thing called "shared reality" and once you break it, all kinds of bad consequences happen.
This is even worse, as it also breaks temporal continuity for individual reality. E.g. I expect that if I saw a video titled X today, I'll be able to find it under title X tomorrow, and if I can't, it's one of the rare/marginal cases when it got banned/deleted/retitled, or I just misremembered. Titles becoming unstable in the general case is a bad situation.
I watched it a few days ago and this descriptive title was part of the reason I clicked. I generally trust 3B1B anyway but normally a title like "This picture broke my brain" would put me off.
In case you're curious, when I ran that title/thumbnail AB test, the option "This picture broke my brain" did end up winning. I was a bit disappointed, because I didn't really _want_ it to win, but I did include it out of curiosity. Ultimately, I changed it to the other title, mostly because I like it better, and the margin was small.
I was genuinely torn about how to title this, because one of my aims is that it stands to be enjoyed by people outside the usual online-math-viewing circles, especially the first 12 minutes, and leaning into the idea of a complex log risks alienating some of those.
That level of granularity would be interesting. For what it's worth, the metric they go by is not click-through rate; it's expected total watch time. For example, if you have two thumbnails, A and B, and for every 100 impressions of A, there are 51 total minutes of watch time, and for every 100 impressions of B, there are 49 total, then what you'd see in the dashboard is "51% A, 49% B". More total clicks with less engagement will not necessarily win out.
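A minimal sketch of the arithmetic being described, in Python (the per-impression normalization is my guess at how the dashboard number falls out of the stated example, not YouTube's actual computation):

```python
# Hypothetical sketch of a watch-time-based A/B metric: each variant has
# impressions and total minutes watched; the dashboard shows each variant's
# share of the combined per-impression watch rate.
variants = {
    "A": {"impressions": 1000, "watch_minutes": 510},  # 51 min per 100 impressions
    "B": {"impressions": 1000, "watch_minutes": 490},  # 49 min per 100 impressions
}

# Normalize to watch time per impression.
rates = {k: v["watch_minutes"] / v["impressions"] for k, v in variants.items()}

# Express each variant's share of the combined rate as a percentage.
total = sum(rates.values())
shares = {k: round(100 * r / total) for k, r in rates.items()}
print(shares)  # {'A': 51, 'B': 49}
```

Note that under this metric a variant with more clicks but shorter average sessions can still lose, which matches the "more total clicks with less engagement will not necessarily win out" point.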
I generally agree that it's a pretty wild choice to just let creators put up multiple titles. That said, it's hard not to play with the shiny toy when it's sitting right there, especially if you know it may mean the lesson reaches more people. In this case, I genuinely don't know what the "right" title is, even setting engagement aside. Is it fundamentally about analyzing an Escher piece? Is it fundamentally a lesson on complex analysis, and complex logs in particular? It's both, but you don't always want to cram two stories into one title. This becomes all the more challenging when titles are, inescapably, marketing.
perhaps a bit inappropriate of me to say so here as it is off-topic, but i am going to take the opportunity anyways:
big thanks for all of your work making math both enjoyable and accessible. my kids (and i) love your videos. your positive impact extends far and wide.
As annoying as those titles are, the work that you (and a few others, like Veritasium) do makes it well worth the tradeoff. Just keep reminding everyone that the annoying title gets the video into the brains of thousands of other people who aren't subscribed yet. It's a tiny price to pay for astounding value.
Everyone who watches your videos loves them and wants everyone else to watch them.
This is a really fun project and the family interview transcripts + LLM workflow feels like a genuinely good use of the technology.
I would probably have ended well before "I exported my Google Maps location history, Uber trips, bank transactions, and Shazam history."
Aside: I've started seeing a lot of AI projects in this category say some variation of:
> it runs on your machine, your data stays with you, and any model can read it
I don’t think people fully appreciate the tension in those claims, especially when the model most are reaching for is Claude or GPT or Gemini. I think these things need more precise language about where data actually goes and what tradeoffs users are implicitly accepting.
48 GB is not consumer hardware. But fundamentally, there are economies of scale due to batching, power distribution, better utilization, etc., that mean data-center tokens will be cheaper. Also, as the cost of training (frontier) models increases, it's not clear the Chinese companies will continue open-sourcing them. Notice, for example, that Qwen-Max is not open source.
Nothing obviously prevents using this approach, e.g. for 3B-active or 10B-active models, which do run on consumer hardware. I'd love to see how the 3B performs with this on the MacBook Neo, for example. More relevantly, data-center scale tokens are only cheaper for the specific type of tokens data centers sell. If you're willing to wait long enough for your inferences (and your overall volume is low enough that you can afford this) you can use approaches like OP's (offloading read-only data to storage) to handle inference on low-performing, slow "edge" devices.
It is consumer hardware in the sense that MacBook Pros come with this RAM size as a base configuration and that you can buy them as a consumer, without having to sign a special B2B contract, show that your company is big and reputable enough, and order a minimum of 10 or 100.
Technically that's correct (which as we all know is the best kind of correct), but really, how many consumers are buying a high-end MacBook Pro with 48GB or more of RAM? That's a very small percentage of the population. In these kinds of discussions, "consumer" is being used as a proxy for "something your average home laptop buyer might have". And a 48GB MBP is not that.
I know it's annoying, because a 48GB MBP is indeed technically "consumer hardware", but please understand the context and don't be pedantic. You know what the GP meant. (And if not, that's... kinda on you.)
Assuming 'moat' – they'll push the frontier forward; they don't really have to worry until progress levels off.
At that point, I suppose there's still paid harnesses (people have always paid for IDEs despite FOSS options) partly for mindshare, and they could use expertise & compute capacity to provide application-specific training for enterprises that need it.
It can, sure. However, I will not pay to be lectured on topics I have no interest in being lectured on. I'll keep my money, they can keep the sermon. Let's see who has more to gain from listening to the other. If they want my money, what I want to hear/see matters a whole lot more than what they want to preach to me.
They simply forgot the golden rule: he who has the gold makes the rules. Let them rediscover it.
Some of it is reasonable, and then some is obviously just what rich people want you to think. Like how America paid Hollywood a lot to always show the US military as macho and always on the right side of wars.
When I was studying Computer Science in college, I once remarked how lucky we, English speakers, are that programming languages use English nouns and verbs. A ton of my classmates were here on a student visa, and English was not their first language. I always thought that programming in English put me at an advantage on the learning curve. I also always thought it was silly when someone would quip that programming should count for “foreign language” credit. Anyway, always cool to see non-English programming languages.
At the risk of going against the hivemind, I disagree.
I taught myself programming quite early in my life, way before I had a good command of the English language. I read books in my native language and talked on programming forums in my native language. In the end, the "English" in programming languages is just a handful of keywords, and it didn't hinder me one bit that I had no idea "int" stood for "integer".
Of course, I started by writing code like "bool es_primo(int numero)" (in my language), but there's nothing in C that says identifiers must be English; that's just convention. The standard library and packages would be a problem nowadays, but back then standard libraries were thin, and a name like "strcpy" is obscure anyway. The real hard part was always learning how to program and design properly.
And for more advanced topics, documentation and learning materials available only in English are a HUGE problem for ESL speakers, because one has to actually read and understand them. But this is not something a programming language can help with.
That's coming from a Spanish speaker used to the Latin alphabet, QWERTY, etc. I imagine you'd find it much more difficult if C were written in Chinese or Arabic, for instance.
I have a similar experience. I learned English much later than my first programming languages, and picking up some keywords and basic APIs was never an issue (it was BASIC and C/C++ at the time). Maybe I would occasionally look up in a dictionary what "needle" and "haystack" meant in a code snippet, and I was puzzled by the ubiquitous "foo, bar, baz", which to my relief turned out to be equally cryptic to native speakers. I still don't think of code as a kind of English prose; it occupies a separate part of my brain from the natural languages.
For people that use similar keyboards, I don't imagine it's that different. Though, like you said, occasionally knowing that bool means Boolean or int means integer may make it slightly easier for English speakers. I think a big disadvantage would likely be for people from, say, China, who use incredibly different keyboards. If I had to add a wildly different second language and switch to it every time I wanted to create a var, import something, or write an if statement, I'm not sure I would've continued learning to code. It may have been one step too many.
True. English is a major reason why India is the IT back-office for most of the western world. I too have personally observed how my fellow classmates, who had done their schooling in their regional language, struggled with the coursework in college because it was solely in English. And some of them were state rankers - it felt bad to realise that they had to put in twice the effort needed to keep up their grades. I think there's a lot of potential wasted in India because of this kind of hardship / struggle - a lot of intelligent people are held back just because they lack an aptitude for multilingualism.
Naah, my non-english-speaking friends say that the keywords are less than 1% complexity of a programmer's job, so it really doesn't matter.
Also, in most languages you can already name variables/classes/members with any Unicode letters. So only "if/for/while" keywords and stdlib classes remain English. It makes little sense to translate those.
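For instance, Python has allowed Unicode identifiers since Python 3 (PEP 3131), so only the keywords and stdlib names stay English. A quick sketch, reusing the "es_primo" example from upthread:

```python
# Python 3 accepts Unicode identifiers (PEP 3131); only keywords like
# "def", "if", "return" and stdlib names remain English.
def es_primo(número: int) -> bool:
    """Return True if the (Spanish-named) argument is prime."""
    if número < 2:
        return False
    for divisor in range(2, int(número ** 0.5) + 1):
        if número % divisor == 0:
            return False
    return True

print([n for n in range(20) if es_primo(n)])  # [2, 3, 5, 7, 11, 13, 17, 19]
```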
However, in the vast majority of cases, non-ASCII characters are rarely used for variable or function names during programming. This is because they can cause conflicts when using different encoding systems, and some automation tools fail to recognize them. Consequently, programmers in non-English speaking regions must invest more effort into naming variables than English speakers, as they have to translate all localized expressions into English.
When Toss, a Korean unicorn startup, announced that they would start using Korean for variable names within financial contexts, it sparked significant debate and a wide range of reactions among Korean programmers.
Nah. If anything, treating keywords as special sigils actually helps.
Also, not all natural languages are suitable for programming languages. In highly inflected languages you often end up with grammatically incorrect forms. Or with stilted language.
Thank you for your empathy. English has been one of the most widespread languages across the globe, though, so it is reasonable to use English in many coding projects.
It may also be reasonable to make localized translations of a programming language. This is rarely done in practice, for obvious reasons. An exception is Excel's function names. People who don't know English, or hardly know it, appreciate it.
That’s the least of their problems. The best computer science textbooks are published first and foremost in English and only translated belatedly. The research papers are in English and not often translated. Even the manuals of both commercial and FOSS programming tools tend not to be translated. A few keywords is what, half an hour of rote memorization?
It wasn’t “revoked under Biden.” That implies the Biden administration (or any administration) gets to define this. They don’t. Recessions in the United States are generally demarcated by NBER.¹
>It does imply that because the Trump admin killed the group involved with preventing pandemics[1]
No it doesn't, not without massively reading in between the lines. This is getting to absurd levels of nitpicking over wording, like "autistic people" vs "people with autism".
>I assume you are being disingenuous by using that claim while also trying to smear the Biden admin.
Two can play at this game. I assume you're being disingenuous by trying to put words in my mouth over tiny disagreements in wording.
Interesting article you’ve linked. I’m not sure I agree, but it was a good read and food for thought in any case.
Work is still being done on how to bulletproof input “sanitization”. Research like [1] is what I love to discover, because it’s genuinely promising. If you can formally separate out the “decider” from the “parser” unit (in this case, by running two models), together with a small allowlisted set of tool calls, it might just be possible to get around the injection risks.
Sanitization isn’t enough. We need a way to separate code and data (not just to sanitize out instructions from data) that is deterministic. If there’s a “decide whether this input is code or data” model in the mix, you’ve already lost: that model can make a bad call, be influenced or tricked, and then you’re hosed.
At a fundamental level, having two contexts as suggested by some of the research in this area isn’t enough; errors or bad LLM judgement can still leak things back and forth between them. We need something like an SQL driver’s injection prevention: when you use it correctly, code/data confusion cannot occur since the two types of information are processed separately at the protocol level.
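To make the SQL analogy concrete, here is the driver-level pattern being referenced, sketched with Python's built-in sqlite3 module: the query structure (code) and the user input (data) travel through separate channels, so no input string can alter the query's shape.

```python
import sqlite3

# The query structure (code) is fixed; user input (data) is bound via a
# "?" placeholder, so the driver never parses it as SQL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
conn.execute("INSERT INTO users VALUES ('alice')")

malicious = "alice' OR '1'='1"  # classic injection payload

# Parameterized: the payload is treated purely as data, so no rows match.
safe = conn.execute("SELECT * FROM users WHERE name = ?", (malicious,)).fetchall()
print(safe)  # []

# A legitimate value still works through the same channel.
ok = conn.execute("SELECT * FROM users WHERE name = ?", ("alice",)).fetchall()
print(ok)  # [('alice',)]
```

The guarantee here is structural, not judgment-based: no model or heuristic decides whether the input "looks like" SQL, which is exactly the property current LLM pipelines lack.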
The linked article isn't describing a form of input sanitization, it's a complete separation between trusted and untrusted contexts. The trusted model has no access to untrusted input, and the untrusted model has no access to tools.
That’s still only as good as the ability of the trusted model to delineate instructions from data. The untrusted model will inevitably be compromised so as to pass bad data to the trusted model.
I have significant doubt that a P-LLM (as in the camel paper) operating a programming-language-like instruction set with “really good checks” is sufficient to avoid this issue. If it were, the P-LLM could be replaced with a deterministic tool call.