More

arkensaw · 2026-05-10T19:13:09 1778440389

Chrome takes up a few gigs on windows for no good reason anyway, mostly caching of websites you went to one time

arkensaw · 2026-05-08T21:42:57 1778276577

> You have been on this page for 92 seconds. You scrolled 0% of the way down. You never left this tab.

Uhm... how did I get to the bottom if I scrolled 0%?

arkensaw · 2026-05-02T23:09:57 1777763397

maybe not great naming. Sounds very similar to the rapper D4vd, who was just arrested for murdering a 14 year old girl

jofzar · 2026-05-02T23:12:11 1777763531

Actually it's closer to https://youtube.com/@dave2d who is a popular tech YouTuber

hackerlytest · 2026-05-03T15:17:00 1777821420

Huh, I also thought the same

bl4ckneon · 2026-05-03T01:45:21 1777772721

That is what I thought of too. Almost live David is a popular name or something... /s

bloggie · 2026-05-03T12:42:08 1777812128

Maybe he should change his name so it's not confused with a popular video codec.

arkensaw · 2026-04-27T20:44:33 1777322673

maybe I'm being dense, but why do people keep reinventing markdown to make it more like HTML when HTML exists?

iamgioh · 2026-04-27T22:21:04 1777328464

Easier to read, easier to write. On top of that, Quarkdown runs compile-time logic and scripting.

arkensaw · 2026-04-27T20:39:48 1777322388

> No Vibe Coding. Classic development style.

This is fast becoming a feature people want.

elpocko · 2026-04-27T21:00:44 1777323644

[flagged]

xantronix · 2026-04-28T04:29:10 1777350550

You know full well that LLMs don't simply spawn from nothingness. These things don't exist in a vacuum, technically nor politically.

arkensaw · 2026-04-22T10:20:03 1776853203

no AI could ever be so poetic. nice

arkensaw · 2026-04-09T13:58:55 1775743135

> In Germany, people complain when the train is late. Everywhere else, the train is just late.

You think people don't complain when the train is late in other countries? That's hardly a uniquely German thing

saalweachter · 2026-04-09T14:45:08 1775745908

To be honest, I complain more often when the train is on time.

arkensaw · 2026-04-09T13:57:15 1775743035

I am German, not autistic.

This confuses me as I have never been to Germany and do not speak German.

But rules are rules.

addandsubtract · 2026-04-09T16:04:49 1775750689

There's a German word for that: Deutschgeistlichveranlagt.

RunningDroid · 2026-04-09T18:41:26 1775760086

Do you have German heritage or live in an area that had a large number of German immigrants at one point?

arkensaw · 2026-04-09T22:13:44 1775772824

hah, no, not even a bit.

nickvec · 2026-04-09T16:00:41 1775750441

So you’re not German?

Markoff · 2026-04-09T16:37:17 1775752637

not necessarily, my kids have citizenship of my home country while daughter never lived there and son only for a year, and for none of them it's their mother tongue

arkensaw · 2026-04-09T22:12:38 1775772758

I am not. But I guess I'd fit right in. If I could speak German

pwdisswordfishy · 2026-04-09T14:39:38 1775745578

So ist der Geist!

ahofmann · 2026-04-09T16:40:41 1775752841

It's always funny to see people try speaking/writing german and screw it up in four words/14 characters :-)

I got 38% german, 58% autistic btw.

arkensaw · 2026-04-09T11:28:28 1775734108

> This class of bug seems to be in the harness, not in the model itself. It’s somehow labelling internal reasoning messages as coming from the user, which is why the model is so confident that “No, you said that.”

from the article.

I don't think the evidence supports this. It's not mislabelling things, it's fabricating things the user said. That's not part of reasoning.

arkensaw · 2026-04-06T23:22:50 1775517770

This is great, and I'm not knocking it, but every time I see these apps it reminds me of my phone.

My 2021 Google Pixel 6, when offline, can transcribe speech to text, and also corrects things contextually. it can make a mistake, and as I continue to speak, it will go back and correct something earlier in the sentence. What tech does Google have shoved in there that predates Whisper and Qwen by five years? And why do we now need a 1Gb of transformers to do it on a more powerful platform?

pushedx · 2026-04-07T07:24:52 1775546692

It's the same model used for the WebSpeech API, which can operate entirely offline.

Google mostly funded the training of this model around 10 years ago, and it's quite good.

There are many websites that are simple frontends for this model which is built into Webkit and Blink based browsers. However to my knowledge the model is a blob packed into the apps which is not open source, hence the no Firefox support.

https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_...

https://www.google.com/intl/en/chrome/demos/speech.html

com2kid · 2026-04-06T23:38:54 1775518734

Microsoft OneNote had this back in 2007 or so, granted the speech to text model wasn't nearly as advanced as they are now.

I was actually on the OneNote team when they were transitioning to an online only transcription model because there was no one left to maintain the on device legacy system.

It wasn't any sort of planned technical direction, just a lack of anyone wanting to maintain the old system.

rudhdb773b · 2026-04-07T05:37:32 1775540252

I remember trying out some voice-to-text around 2002 that I believe was included with Windows XP.. or maybe Office?

You had to go through some training exercises to tune it to your voice, but then it worked fairly well for transcription or even interacting with applications.

silon42 · 2026-04-07T06:18:57 1775542737

OS/2 had it built in in 1996.

adamsmark · 2026-04-06T23:43:59 1775519039

The accuracy is much lower though.

I've switched away from Gboard to Futo on Android and exclusively use MacWhisper on MacOS instead of the default Apple transcription model.

dotancohen · 2026-04-07T03:23:50 1775532230

Any particular reason why you switched? I've been using Gboard for years, especially the text to speech in four languages. In the past few weeks, there was an update where the TTS feature is now in a separate "panel" of the keyboard, and it hardly works at all.

In English and Hebrew it stops after half a dozen words, and those words must be spoken slowly and mechanically for it to work at all. Russian and Arabic are right out - I can't coax any coherent sentence out of it.

I've gone through all permutations of relevant settings, such as "Faster Voice Dictation" (translated from Hebrew,I don't know what the original English option is called). I think there used to be an option for Online or Offline transcription, but that option is gone now.

This is ridiculous - I tried to copy the version information and there is no way to copy it in-app. Let's try the S24 OCR feature...

17.0.10.880768217 release-arm64-v8a 175712590 ראשית (en_GB) 2025090100 = גרסה עדכני Primary on-device: No packs Fallback on-device: Packs: ru-RU: 200

I'll try to install the English, Hebrew, and Arabic packs, though I'm certain that I've installed them already.

cootsnuck · 2026-04-06T23:45:37 1775519137

Interesting. My Pixel 7 transcription is barely usable for me. Makes way too many mistakes and defeats the purpose of me not having to type, but maybe that's just my experience.

The latest open source local STT models people are running on devices are significantly more robust (e.g. whisper models, parakeet models, etc.). So background noise, mumbling, and/or just not having a perfect audio environment doesn't trip up the SoTA models as much (all of them still do get tripped up).

I work in voice AI and am using these models (both proprietary and local open source) every day. Night and day different for me.

taffydavid · 2026-04-07T08:04:28 1775549068

I've built my own tts apps testing whisper and while it's good it does hallucinate quite a bit if there's noise, or just sometimes when the audio is perfectly clear.

It often gives the illusion of being very good but I could record a half hour of me speaking and discover some very random stuff in the middle that I did not say

cootsnuck · 2026-04-07T16:29:41 1775579381

Yup, you're absolutely right. The open source models do have their rough edges. I use NVIDIA's Parakeet v3 model a lot locally, and it will occasionally do this thing where it just repeats a word like a dozen times.

artdigital · 2026-04-07T06:20:57 1775542857

macOS and iOS can do that to with the baked in dictation. Globe key + D on Mac

dust42 · 2026-04-07T07:13:06 1775545986

When you activate it you agree that your voice input is sent to Apple. As far as I understand this project runs fully locally. Up to you to decide for whatever suits your needs best.

stingraycharles · 2026-04-07T09:27:47 1775554067

Where did you get from that the voice input is sent to Apple / the cloud?

As far as I understand Apple’s voice model runs locally for most languages.

Siri commands can be used for training, but is also executed locally and sent to Apple separately (and this can be disabled).

angristan · 2026-04-07T13:55:21 1775570121

I couldn't believe it either but when you enable it the settings of macOS you get this popup:

> When you dictate text, information like your voice input and contact names are sent to Apple to help your Mac recognize what you’re saying.

wat10000 · 2026-04-07T14:46:42 1775573202

Elsewhere it says:

"When you use Dictation, your device will indicate in Keyboard Settings if your audio and transcripts are processed on your device and not sent to Apple servers. Otherwise, the things you dictate are sent to and processed on the server, but will not be stored unless you opt in to Improve Siri and Dictation."

And:

"Dictation processes many voice inputs on your Mac. Information will be sent to Apple in some cases."

In conclusion... I think they're trying to cover all their bases, but it sounds like things are processed locally as long as the hardware can handle it.

victorbjorklund · 2026-04-07T11:38:28 1775561908

No, that is not correct. It is running one hundred percent local. You can try it by turning off internet on your phone and try running it then. However, the built in model isn't as good, so this is probably better.

dwayne_dibley · 2026-04-07T06:37:22 1775543842

yup, this is how I 'type'

nidnogg · 2026-04-07T12:01:08 1775563268

Nothing comes close to LLM transcription though. I just tried this. I said "globe key dictation, does this work?". Here's the transcription, verbatim:

"Fucking dictation, does this work"

arkensaw · 2026-04-09T13:44:37 1775742277

fun fact: voice typing also worked excellently on Windows Phone, although only in the SMS app

vharish · 2026-04-07T06:49:24 1775544564

IMO.. one of the best. It was surprisingly good. Yet they can't even replicate in on their own systems