More

kreelman · 2026-04-15T11:32:32 1776252752

This is great!! .....

But I'd love it if it could run in SQLite3. The stored functions and procedures here are neat, but I wonder if they could be turned into views and combinations of built in functions...

I think triggers can do nearly everything that a stored procedure does, perhaps with a little bit more fiddling. I sometimes make "parameter tables" where I insert all the necessary data and a trigger does what effectively a stored proc does.

Maybe there'd need to be a few imported functions? I wonder if an AI could be persuaded...

kreelman · 2026-04-10T02:56:26 1775789786

This is very much worth watching. It is a tour de force.

Laurie does an amazing job of reimagining Google's strange job optimisation technique (for jobs running on hard disk storage) that uses 2 CPUs to do the same job. The technique simply takes the result of the machine that finishes it first, discarding the slower job's results... It seems expensive in resources, but it works and allows high priority tasks to run optimally.

Laurie re-imagines this process but for RAM!! In doing this she needs to deal with Cores, RAM channels and other relatively undocumented CPU memory management features.

She was even able to work out various undocumented CPU/RAM settings by using her tool to find where timing differences exposed various CPU settings.

She's turned "Tailslayer" into a lib now, available on Github, https://github.com/LaurieWired/tailslayer

You can see her having so much fun, doing cool victory dances as she works out ways of getting around each of the issues that she finds.

The experimentation, explanation and graphing of results is fantastic. Amazing stuff. Perhaps someone will use this somewhere?

As mentioned in the YT comments, the work done here is probably a Master's degrees worth of work, experimentation and documentation.

Go Laurie!

throwaway81523 · 2026-04-10T07:12:50 1775805170

This is a 54 minute video. I watched about 3 minutes and it seemed like some potentially interesting info wrapped in useless visuals. I thought about downloading and reading the transcript (that's faster than watching videos), but it seems to me that it's another video that would be much better as a blog post. Could someone summarize in a sentence or two? Yes we know about the refresh interval. What is the bypass?

Update: found the bypass via the youtube blurb: https://github.com/LaurieWired/tailslayer

"Tailslayer is a C++ library that reduces tail latency in RAM reads caused by DRAM refresh stalls.

"It replicates data across multiple, independent DRAM channels with uncorrelated refresh schedules, using (undocumented!) channel scrambling offsets that works on AMD, Intel, and Graviton. Once the request comes in, Tailslayer issues hedged reads across all replicas, allowing the work to be performed on whichever result responds first."

scrollop · 2026-04-10T13:13:56 1775826836

FYI if you have a video you can't be bothered watching but would like to know the details you have 2 options that I use (and others, of course):

1. Throw the video into notebooklm - it gives transcripts of all youtube videos (AFAIK) - go to sources on teh left and press the arrow key. Ask notbookelm to give you a summary, discuss anything etc.

2. Noticed that youtube now has a little Diamond icon and "Ask" next to it between the Share icon and Save icon. This brings up gemini and you can ask questions about the video (it has no internet access). This may be premium only. I still prefer Claude for general queries over Gemini.

throwaway81523 · 2026-04-11T21:05:45 1775941545

I don't want an AI summary, I just want the author to write concisely, and hopefully make a text post instead of a video.

kelsolaar · 2026-04-10T09:05:28 1775811928

The video could be a shorter, some of the goofiness might not please the most pressed people but that is also what makes it fresh and stand out.

JuniperMesos · 2026-04-10T12:07:51 1775822871

There was nothing goofy about the NERV-logo coffee mug, that was extremely serious business.

fc417fc802 · 2026-04-10T08:20:58 1775809258

> using (undocumented!) channel scrambling offsets that works on AMD, Intel, and Graviton

Seems odd to me that all three architectures implement this yet all three leave it undocumented. Is it intended as some sort of debug functionality or what?

alex_duf · 2026-04-10T09:01:45 1775811705

it's explained in the video, and there's no way I'll be explaining it better than her

em-bee · 2026-04-10T10:13:33 1775816013

you could however link to the timestamp where that particular explanation starts. i am afraid i don't have time to watch a one hour video just to satisfy my curiosity.

vitus · 2026-04-10T10:36:02 1775817362

This is approximately the section in the video titled "Memory controllers hate you" (https://www.youtube.com/watch?v=KKbgulTp3FE&t=1399s), combined with the following section.

The actual explanation starts a couple minutes later, around https://youtu.be/KKbgulTp3FE?t=1553. The short explanation is performance (essentially load balancing against multiple RAM banks for large sequential RAM accesses), combined with a security-via-obscurity layer of defense against rowhammer.

_flux · 2026-04-10T12:38:40 1775824720

I've found Gemini useful in extracting timestamps for particular spots in videos. Presumably it works with transcriptions, given how fast it is.

The three answers it found were:

- Avoiding lock-in to them: http://www.youtube.com/watch?v=KKbgulTp3FE&t=1914

- Competitive advantage: http://www.youtube.com/watch?v=KKbgulTp3FE&t=1852

- Perceived Lack of Use Case: http://www.youtube.com/watch?v=KKbgulTp3FE&t=1971

Those points do actually exist in the video, I checked. If there are more, I don't know about them, as I haven't yet watched the rest of the video.

satvikpendem · 2026-04-10T07:18:58 1775805538

Just use the Ask button on YouTube videos to summarize, that's what it's for.

jasode · 2026-04-10T10:19:43 1775816383

>Just use the Ask button on YouTube videos to summarize,

For anyone confused because they don't see the "Ask" button between the Share and Bookmark buttons...

It looks like you have to be signed-in to Youtube to see it. I always browse Youtube in incognito mode so I never saw the Ask button.

Another source of confusion is that some channels may not have it or some other unexplained reason: https://old.reddit.com/r/youtube/comments/1qaudqd/youtube_as...

dspillett · 2026-04-10T09:49:46 1775814586

Not complaining about the particular presenter here, this is an interesting video with some decent content, I don't find the presentation style overly irritating, and it is documenting a lot of work that has obviously been done experimenting in order to get the end result (rather than just summarising someone else's work). Such a goofy elongated style, that is infuriating if you are looking for quick hard information, is practically required in order to drive wider interest in the channel.

But the “ask the LLM” thing is a sign of how off kilter information passing has become in the current world. A lot of stuff is packaged deliberately inefficiently because that is the way to monetise it, or sometimes just to game the searching & recommendation systems so it gets out to potentially interested people at all, then we are encouraged to use a computationally expensive process to summarise that to distil the information back out.

MS's documentation the large chunks of Azure is that way, but with even less excuse (they aren't a content creator needing to drive interest by being a quirky presenter as well as a potential information source). Instead of telling me to ask copilot to guess what I need to know, why not write some good documentation that you can reference directly (or that I can search through)? Heck, use copilot to draft that documentation if you want to (but please have humans review the result for hallucinations, missed parts, and other inaccuracies, before publishing).

gosub100 · 2026-04-10T12:09:26 1775822966

The video definitely wouldn't be over 50m if she was targeting views. 11m -15m is where you catch a lot of people repeating and bloviating 3m of content to hit that sweet spot of the algorithm. It's sad you can't appreciate when someone puts passion into a project.

This is the damage AI does to society. It robs talented people of appreciation. A phenomenal singer? Nah she just uses auto tune obviously. Great speech? Nah obviously LLM helped. Besides I don't have time to read it anyway. All I want is the summary.

dspillett · 2026-04-10T15:24:29 1775834669

> It's sad you can't appreciate when someone puts passion into a project.

It is sad that read comprehension is dropping such that you interpreted my comment that way.

CamperBob2 · 2026-04-10T18:17:46 1775845066

I don't consider AI to threaten "damage to society" the way you seem to, but I did find it interesting to think about how ridiculously well-produced the video was, and what that might signify in the future.

I kept squinting and scrutinizing it, looking for signs that it was rendered by a video model. Loss of coherence in long shots with continuity flaws between them, unrealistic renderings of obscure objects and hardware, inconsistent textures for skin and clothing, that sort of thing... nope, it was all real, just the result of a lot of hard work and attention to detail.

Trouble is, this degree of perfection is itself unrealistic and distracting in a Goodhart's Law sense. Musicians complain when a drum track is too-perfectly quantized, or when vocals and instruments always stay in tune to within a fraction of a hertz, and I do have to wonder if that's a hazard here. I guess that's where you're coming from? If you wanted to train an AI model to create this type of content, this is exactly what you would want to use as source material. And at that point, success means all that effort is duplicated (or rather simulated) effortlessly.

So will that discourage the next-generation of LaurieWireds from even trying? Or are we going to see content creators deliberately back away from perfect production values, in order to appear more authentic?

satvikpendem · 2026-04-10T14:15:49 1775830549

Yes, I do want the summary because my time is (also) valuable. There is a reason why book covers have synopses, to figure out whether it's worth reading the book in the first place.

throwaway81523 · 2026-04-12T05:55:46 1775973346

In this case the useful info in the book could be distilled down to the cover blurb.

This video really should have been two videos anyway. One to describe how DRAM works (old hat to some of us nerds, but interesting and new to lots of others), and the second one to explain how she got around the refresh interval. Then nerds could skip the first one completely. In reality the two videos could be about 5 minutes each.

scrollop · 2026-04-10T13:15:31 1775826931

Or give the video to notebooklm - you can also get the trasncript (unformatted) using this technique

satvikpendem · 2026-04-10T14:00:08 1775829608

If you just want the transcript, there is a Show Transcript button in the video description.

ufocia · 2026-04-11T14:53:36 1775919216

I think Laurie is still trying to develop her style. She's been at it for just a few years and her delivery greatly improved over that time span. Not a fan (yet?), but I've seen a few of her videos from different time periods.

Perhaps she or someone on her team (the camera work suggests at least a +1) thinks that this geeky/ditsy persona gets more clicks. Other successful YTers behave similarly. I don't find it useful or entertaining, but others might.

Having said this, I, myself, would've liked the video to be a bit more succinct

rationalist · 2026-04-10T18:35:36 1775846136

As requested:

https://news.ycombinator.com/item?id=47713090

I agree, not everyone has 54 minutes to watch a video full of fluff (I tried, but only got so far, even on 1.5x speed).

svrtknst · 2026-04-10T08:24:48 1775809488

Unnecessarily negative imo.

I like the video because I cant read a blog post in the background while doing other stuff, and I like Gadget Hackwrench narrating semi-obscure CS topics lol

fc417fc802 · 2026-04-10T08:32:18 1775809938

> I cant read a blog post in the background

You can consume technical content in the background?

saidnooneever · 2026-04-10T09:40:00 1775814000

this is a thing people do. convince themselves they can consume technical content subconsciously. its now how the brain works though. it will just give you the idea you are following something.

em-bee · 2026-04-10T10:23:03 1775816583

not all technical content is the same, or has the same level of importance. this video does not introduce anything that i need to be able to replicate in my work, so i don't need to catch every detail of it, just grasp the basic concepts and reasons for doing something.

vel0city · 2026-04-10T14:34:57 1775831697

Lots of people will have a show on or something while they're cooking or cleaning or doing other things. Is it worse for it to be interesting technical content with fun other stuff thrown in than if was an episode of Friends or Fraiser or Iron Chef or 9-1-1: Lone Star or The Price is Right?

I guess I'm only allowed to have The Masked Singer on while I make dinner.

em-bee · 2026-04-10T10:15:15 1775816115

if your foreground work doesn't occupy your brain, why not?

vintermann · 2026-04-10T10:51:21 1775818281

Because I prefer not to think about the hair I'm removing from my shower drain?

derbOac · 2026-04-10T10:53:27 1775818407

FWIW, I like her videos but I usually prefer essays or blog posts in general as they're easier to scan and process at my own rate. It's not about this particular video, it's about videos in general.

xpct · 2026-04-10T15:08:57 1775833737

I get a similar feeling for when friends send me 2minute+ Instagram reels, it's as if my brain can't engage with the content. I'd much rather read a few paragraphs about the topic, and It'd probably take less time too.

Cthulhu_ · 2026-04-10T12:03:56 1775822636

Same; thanks to modern technology, videos can be transcribed and translated into blog posts automatically though. I wish that was a default and / or easier to find though.

For years I've been thinking "I should watch the WWDC videos because there's a lot of Really Important Information" in there, but... they're videos. In general I find that I can't pay attention to spoken word (videos, presentations, meetings) that contain important information, probably because processing it costs a lot more energy than reading.

But then I tune out / fall asleep when trying to read long content too, lmao. Glad I never did university or do uni level work.

throwaway81523 · 2026-04-12T20:50:04 1776027004

You're saying that the audio channel of that video has the useful information all by itself. The video channel, which consumes most of the bandwidth, is useless. You could go a little further and say about 80% of the 54 minute audio is also useless, and it could be cut to maybe 10 minutes. Keep going and say to post it as text instead of audio, so you can read it in 2 minutes. Now you don't have to put it in the background.

gosub100 · 2026-04-10T12:03:58 1775822638

Your comment was several paragraphs, and I am busy so I can't read it all. Can you summarize what you are asking for, I might be able to help later.

gopalv · 2026-04-10T04:37:11 1775795831

>> It replicates data across multiple, independent DRAM channels with uncorrelated refresh schedules

This is the sort of thing which was done before in a world where there was NUMA, but that is easy. Just task-set and mbind your way around it to keep your copies in both places.

The crazy part of what she's done is how to determine that the two copies don't get get hit by refresh cycles at the same time.

Particularly by experimenting on something proprietary like Graviton.

rockskon · 2026-04-10T04:59:27 1775797167

She determines that by having three copies. Or four. Or eight.

Tis just probabilities and unlikelihood of hitting a refresh cycle across that many memory channels all at once.

GeneralMayhem · 2026-04-10T07:16:12 1775805372

Right, but the impressive part is finding addresses that are actually on different memory channels.

kzrdude · 2026-04-10T10:00:03 1775815203

Surprising to me that two memory channels are separated by as little as 256 bytes. The short distance makes it easier to find, surely?

PinkSheep · 2026-04-10T17:18:51 1775841531

Access optimization or interleaving at a lower level than linearly mapping DIMMs and channels. x86 cache lane size is 64 bytes, so it must be a multiple. Probably 64*2^n bytes.

weinzierl · 2026-04-10T10:12:49 1775815969

"This is the sort of thing which was done before in a world where there was NUMA"

You sound like NUMA was dead, is this a bit of hyperbole or would really say there is no NUMA anymore. Honest question because I am out if touch.

cestith · 2026-04-10T13:51:13 1775829073

EPYC chips have multiple levels of NUMA - one across CCDs on the one chip, and another between chips in different motherboard sockets. As a user under Linux you can treat it as if it was simple SMP, but you’ll get quite a bit less performance.

Home PCs don’t do NUMA as much anymore because of the number of cores and threads you can get on one core complex. The technology certainly still exists and is still relevant.

josephg · 2026-04-10T09:22:21 1775812941

I hope this approach gets some visibility in the CPU field. It could be obviously improved with a special cpu instruction which simply races two reads and returns the first one which succeeds. She’s doing an insane amount of work, making multiple threads and so on (and burning lots of performance) all to work around the lack of dedicated support for this in silicon.

robinsonb5 · 2026-04-10T14:45:29 1775832329

I actually hope it doesn't!

The results are impressive, but for the vast, vast majority of applications the actual speedup achieved is basically meaningless since it only applies to a tiny fraction of memory accesses.

For the use case Laurie mentioned - i.e. high-frequency trading - then yes, absolutely, it's valuable (if you accept that a technology which doesn't actually achieve anything beyond transmuting energy into money is truly valuable).

For the rest of us, the last thing the world needs is a new way to waste memory, especially given its current availability!

100ms · 2026-04-10T07:21:24 1775805684

> Google's strange job optimisation technique (for jobs running on hard disk storage)

Can you give more context on this? Opus couldn't figure out a reference for it

why_only_15 · 2026-04-10T07:40:42 1775806842

This is a quite old technique. The idea, as I understood it, was that lots of data at Google was stored in triplicate for reliability purposes. Instead of fetching one, you fetched all three and then took the one that arrived first. Then you sent UDP packets cancelling the other two. For something like search where you're issuing hundreds of requests that have to resolve in a few hundred milliseconds, this substantially cut down on tail latency.

yvdriess · 2026-04-10T07:47:26 1775807246

Tournament parallelism is the technical term IIRC.

100ms · 2026-04-10T08:36:11 1775810171

Aha that makes more sense, I thought it was specifically to do with job scheduling from the description. You can do something similar at home as a poor man's CDN by racing requests to regionally replicated S3 buckets. Also magic eyeballs (ipv4/v6 race done in browsers and I think also for Quic/HTTP selection) works pretty much the same way

vitus · 2026-04-10T10:44:56 1775817896

> magic eyeballs

https://en.wikipedia.org/wiki/Happy_Eyeballs is the usual name. It's not quite identical, since you often want to give your preferred transport a nominal headstart so it usually succeeds. But yes, there are some similarities -- you race during connection setup so that you don't have to wait for a connection timeout (on the order of seconds) if the preferred mechanism doesn't work for some reason.

The main term I've seen for this particular approach is "request hedging" (https://grpc.io/docs/guides/request-hedging/, which links to the paper by Dean and Barroso).

spockz · 2026-04-10T18:33:35 1775846015

Request hedging or backup requests are indeed the terms I know for requests where you give the first request a bit of a headstart. I didn’t know about the term happy eyeball to signify that all requests fire at the same time.

vitus · 2026-04-10T21:23:31 1775856211

> I didn’t know about the term happy eyeball to signify that all requests fire at the same time.

It's not quite the same. Usually with Happy Eyeballs, you want to try multiple protocols (e.g. QUIC vs TCP, or IPv6 vs IPv4), and you have a preference for one over the other. As such, you try to establish your connection via IPv6, wait something like 30ms, then try to establish via IPv4. Whichever mechanism completes channel setup first wins, and you can cancel the other one.

It's a mechanism used to drive adoption of newer protocols while limiting the impact on end users.

100ms · 2026-04-10T11:06:54 1775819214

Happy eyeballs, that makes a lot more sense thanks. Someone's "magic eyeballs" here apparently isn't reading his own writing :)

tastroder · 2026-04-10T07:54:37 1775807677

https://cacm.acm.org/research/the-tail-at-scale/ (hedged / tied requests)

ufocia · 2026-04-10T03:43:23 1775792603

I like the video, but this is hardly groundbreaking. You send out two or more messengers hoping at least one of them will get there on time.

rcbdev · 2026-04-10T04:58:34 1775797114

Yeah. These are literally just mainframe techniques from yesteryear.

actionfromafar · 2026-04-10T08:09:26 1775808566

Almost everything "new" was invented by IBM it seems like. And it goes by a completely different name there. It's still nice to rediscover what they knew.

ufocia · 2026-04-11T14:07:01 1775916421

As another poster said, the impressive part here is the work on lining up all the technical features to implement this simple concept. Though I'm the end Laurie realized that essentially brute force worked well.

One thing that I'd think about improving is the boundary search. It seems to me at mere first glance that binary search would usually be much faster. Also, knowing what architecture is used, e.g. channel width, could further optimize the search.

UltraSane · 2026-04-10T04:32:11 1775795531

The clever part is figuring out what RAM is controlled by which controllers.

saidnooneever · 2026-04-10T08:23:15 1775809395

everyone says this but no one says why it was clever. i find her videos have cool results but i cant have patience for them usually because its recycled old stuff (can be cool but its not ground breaking).

there is a ton of info you can pull from: smbios, acpi, msrs, cpuid etc. etc. about cpu/ram topology and connecticity, latencies etc etc.

isnt the info on what controllers/ram relationships exists somewhere in there provided by firmware or platform?

i can hardly imagine it is not just plainly in there with the plethtora info in there...

theres srat/slit/hmat etc. in acpi, then theres MSRs with info (amd expose more than intel ofc, as always) and then there is registers on memory controller itself as well as socket to socket interconnects from upi links..

its just a lot of reading and finding bits here n there. LLms are actually really good at pulling all sorts of stuff from various 6-10k page documents if u are too lazy to dig yourself -_-

UltraSane · 2026-04-10T11:00:09 1775818809

The exact mapping between RAM addresses and memory controllers is intentionally abstracted by the memory subsystem with many abstraction layers between you and the physical RAM locations. Because documentation is sometimes incomplete or proprietary, security researchers often have to write software that probes memory and times the access speeds to reverse-engineer the exact interleaving functions of a specific CPU. in the video she says that ARM CPUs have the least data about this and she had to rely entirely on statistical methods.

sumeno · 2026-04-10T13:27:26 1775827646

It's very funny that you're giving a RTFM response to a video you admit you didn't watch.

WTFV

kzrdude · 2026-04-10T10:01:14 1775815274

I have to say that using drawbridges and differently colored rail pieces to explain it was very clever.

npunt · 2026-04-10T04:24:33 1775795073

and dropbox was just rsync

kreelman · 2026-03-25T09:50:59 1774432259

I wonder if there are a few things here....

It would be great if Linux was able to do simple chroot jails and run tests inside of them before releasing software. In this case, it looks like the whole build process would need to be done in the jail. Tools like lxroot might do enough of what chroot on BSD does.

It seems like software tests need to have a class of test that checks whether any of the components of an application have been compromised in some way. This in itself may be somewhat complex...

We are in a world where we can't assume secure operation of components anymore. This is kinda sad, but here we are....

driftnode · 2026-03-25T14:32:37 1774449157

The sad part is you're right that we can't assume secure operation of components anymore, but the tooling hasn't caught up to that reality. Chroot jails help with runtime isolation but the attack here happened at build time, the malicious code was already in the package before any test could run. And the supply chain is deep. Trivy gets compromised, which gives CI access, which gives PyPI access. Even if you jail your own builds you're trusting that every tool in your pipeline wasn't the entry point. 97 million monthly downloads means a lot of people's "secure" pipelines just ran attacker code with full access.

kreelman · 2026-02-16T01:39:11 1771205951

Wondering....

Has anyone tried doing this on ReactOS? I know this is a touch DIY, but it would be interesting to know if Win sofware could be built on ReactOS...

kreelman · 2026-02-12T23:54:40 1770940480

Wooshka.

I hope they've got good heat sinks... and I hope they've plugged into renewable energy feeds...

thrance · 2026-02-13T01:05:38 1770944738

Fresh water and gas turbines, I'm afraid...

King-Aaron · 2026-02-13T00:28:43 1770942523

Nope! It's gas turbines

atonse · 2026-02-13T15:29:56 1770996596

For now. And also largely because it's easier to get that up and running than the alternative.

Eventually, as we ramp up on domestic solar production, (and even if we get rid of solar tariffs for a short period of time maybe?), the numbers will make them switch to renewable energy.

kreelman · 2026-02-09T10:30:34 1770633034

This is so neat. Thanks for putting it together. A strange number system, but interesting. It would be great if it could be done in Unicode. I'm intrigued to know what it was used for.

RobotToaster · 2026-02-09T11:08:20 1770635300

Until relatively recently the western world didn't really agree on a numeral system.

(As an aside, the western Arabic numerals we use today are quite different to the eastern Arabic numerals used in Arabic writing)

kreelman · 2026-02-08T05:17:42 1770527862

There seems to be a good amount of interest for a boot sector compiler!!

If you're running on Linux, adjust the qemu call to use alsa rather than coreaudio.

I generated a pull request for this on Github. If the author is happy enough with my verbose shell scripting style :-) it might get included.

kreelman · 2026-02-06T00:53:02 1770339182

..A small thing, but it won't compile the RISCV version of hello.c if the source isn't installed on the machine it's running on.

It is standing on the shoulders of giants (all of the compilers of the past, built into it's training data... and the recent learnings about getting these agents to break up tasks) to get itself going. Still fairly impressive.

On a side-quest, I wonder where Anthropic is getting there power from. The whole energy debacle in the US at the moment probably means it made some CO2 in the process. Would be hard to avoid?

kreelman · 2026-02-03T11:06:22 1770116782

Interestingly, if we could do floating orbital platforms above Venus, it could perhaps look a little like the Jetsons.

kreelman · 2026-01-31T04:09:14 1769832554

He was once a big pin in Y Combinator (I think kind of ran it?)... Paul Graham thought he was great for YC.

Interesting that he's got as far as he has with this issue. I don't think you can run a company effectively if you don't deal in truth.

Some of his videos have seemed quite bizarre as well, quite sarcastic about concerns people have about AI in general.

owebmaster · 2026-01-31T08:39:39 1769848779

> He was once a big pin in Y Combinator (I think kind of ran it?)... Paul Graham thought he was great for YC.

And today it seems everyone will at YC hate him but pretend not