Hacker Newsnew | past | comments | ask | show | jobs | submit | qaadika's commentslogin

I'm kinda one of those who believes they 'completely' understand LLMs. But I've also developed my understanding of them such that the internal mechanisms of the transformer, or really any future development in the space based on neural networks and machine learning is irrelevant.

1. A string of unicode characters is converted into an array of integers values (tokens) and input to a black box of choice.

2. The black box takes in the input, does its magic, and returns an output as an array of integer values.

3. The returned output is converted into a string of unicode characters and given to the user, or inserted in a code file, or whatever. At no point does the black box "read" the input in any way analogous to how a human reads.

Where people get "The AIs have emotions!!!" from returning an array of integers values is beyond me. It's definitely more complicated than "next token predictor", but it really is as simple as "Make words look like numbers, numbers go in, numbers come out, we make the numbers look like words."


Yeah nothing personal but my claim here is you’re not smart. The next token predictor aspect is something anyone can understand… the transformer is not quantum physics.

Like look at what you wrote. You called it black box magic and in the same post you claim you understand LLMs. How the heck can you understand and call it a black box at the same time?

The level of mental gymnastics and stupidity is through the roof. Clearly the majority of the utilitarian nature of the LLM is within the whole section you just waved away as “black box”.

> Where people get "The AIs have emotions!!!" from returning an array of integers values is beyond me

Let me spell it out for you. Those integers can be translated to the exact same language humans use when they feel identical emotions. So those people claim that the “black box” feels the emotions because what they observe is identical to what they observe in a human.

The LLM can claim it feels emotions just like a human can claim the same thing. We assume humans feel emotions based off of this evidence but we don’t apply that logic to LLMs? The truth of the matter is we don’t actually know and it’s equally dumb to claim that you know LLMs feel emotions to claiming that they dont feel emotions.

You have to be pretty stupid to not realize this is where they are coming from so there’s an aspect of you lying to yourself here because I don’t think you’re that stupid.


> You never (or rarely) write the wiki yourself — the LLM writes and maintains all of it. You're in charge of sourcing, exploration, and asking the right questions. The LLM does all the grunt work — the summarizing, cross-referencing, filing, and bookkeeping that makes a knowledge base actually useful over time.

I'm not sure how you can get any closer to "turning your thinking over to machines." These tasks may be "grunt work," but it's while doing these things that new ideas pop in, or you decide on a particular or novel way to organize or frame information. Many of my insights in my (analog? vanilla? my human-written) Obsidian vault (that I consider my "personal wiki") have been made or expanded on because I happened to see one note after another in doing the "grunt work", or just by opening one note and seeing its title right beside a previously forgotten one.

There's nothing "personal" about a knowledge base you filled by asking AI questions. It's the AI's database, you just ask it to write stuff. Learn how to learn and answer your own damn questions.

Soon pedagogy will be a piece of paper that says "Ask AI."

I hate this idea that a result is all that matters, and the quicker you can get the result the better, at any cost (mental or financial, short-term or long-term).

If we optimized showers to be 20 seconds, we'd stop having shower thoughts. I like my shower thoughts. And so too my grunt-work thoughts.

---

As an aside, I'm not totally against AI writing in a personal knowledgebase. I include it at times in my own. But since I started my current obsidian vault in 2023 (now 4100 self-written notes, including maybe up to 5% Web Clipper notes), I've had a Templater (Obsidian plugin) template I wrap around anything AI-written to 'quarantine' it from my own words:

==BEGIN AI-GENERATED CONTENT==

<% tp.file.cursor(1) %>

==END AI-GENERATED CONTENT==

I've used this consistently and it's helped me keep (and develop) my own writing voice apart from any of my AI usage. It actually motivates me to write more, because I know I could always take the easy route and chunk whatever I'm thinking into the AI, but I'm choosing not to by writing it myself, with my own vocabulary, in my own voice, with my own framing. I trick myself into writing because my pride tells me I can express my knowledge better than the AI can.

I also manually copy and paste from wherever I'm using AI into my notes. Nothing automated. The friction keeps me from sliding into the happy path of turning my brain off.


Since you're a fellow Obsidian user, you likely remember the early days of back-linking note-taking software like Roam and such. I remember just seeing pictures of the graph being the primary visual symbol representing the depth of learning. I thought "ok well people just want to accumulate stuff." AI tools certainly help with creating a mass of notes.

There's a comment above how this is reminiscent of Licklider's work, but it reminds of the early print culture era, where books were a consumer item, and people just purchased a lot of them to put on shelves built to display them.


I actually never got into note-taking before I found Obsidian. I used Google Drive all throughout college and up to 2023, so any knowledge I had written down was sequestered by an ad-hoc folder structure that was mostly chronological by year, or in my physical notebooks by subject. It was also limited by what I felt was worth writing down enough that it merited a Doc, or what I could write in one session before getting distracted and never touching it again. And mentally I limited myself by always wanting to write something down "right", all spell-checked and grammatically correct and sensically organized, which led to often not writing anything down at all. Now I dump the words down and come back to it later when I want to "garden."

My brother showed me stuff like Trilium circa 2017, but I hadn't the thinking process I do now to even know where to start racking my brain for stuff to write down.

When I read Chernow's biography of Alexander Hamilton, I was in awe of his ability to write so damn much. I never thought I'd be able to do that. Turns out the secret is just three things: have stuff you're passionate about, be able to recall it, and write so damn much about it. When your thinking process is based around "how would I phrase/word/frame this to write it down," if you have the right process and organization to be able to access it later, it's even easier than talking. For some they can keep everything they know in their head. I'm not one of them. but I can write everything down, and in writing it down I end up remembering it better. My professors were right all along.

And if one looks at the actual written letters people like Hamilton [1] and co. wrote, or even back to Isaaci Newtoni [2][3], They're riddled with spelling errors and strange latinesque grammar and informal formalities. Yet they're revered for their ability to write. Because it really is the thought that countts [4], not the words.

(Very little of these comments is new thoughts I'm having now. Most of it is thoughts I had and documented when I was super into PKM in 2023 and since, and now comes back up as those neurons fire again and I consider the new idea of "should AI be my PKM?" after reading the post.)

---

Yeah, the graphs are cool for a little bit. But only post-facto, once one has an amount of data points where it might become useful. If the AI is doing the organization then any personal significance is lost. Or rather it was never there to begin with.

Wikilinks is the feature I use most often, outside of my folder organization (PARA). Now when I have a thought, it goes down a chain of "do I have a note already this can go into," to "no, it should be a new note. are there any notes I should wikilink in this one, or link this one in?"

I think I made a good decision early on when I was inspired by the Emacs documentation to add a basic "Related: " line before the first header (and after the YAML). There I dump any wikilink I think might possibly be something I want to reference, or find this note via a backlink, without having to think about where to put it in the body.

E.g.

{YAML header}

Related: [[Artemis]], [[Artemis II]], [[NASA Engineering]], [[Space MOC]], [[NASA CAPCOM]],

# Artemis II Mission Timeline and Notes

{body and rest of note, my own record of things that happened as I watch the stream}

---

> AI tools certainly help with creating a mass of notes.

Agreed. Presuming the implication is it creates a mass of notes, but of generic information stated generically. I'm really proud of my 4100 notes, because I know (aside from a few catagories like web clipping) even if they're a mess, they're my mess. I definitely could have gone the last three years without having found Obsidian, but I wouldn't have as clear a record of them as I do now. Or the rest of my life, as I slowly add stuff about my past, or migrate old writing into it. I also definitely repeat myself by saying the same information in different places, but in different ways. It's not 'efficient' information density-wise, but it is designed for a human to read and see the human behind the writing.

I also believe I think clearer, as often when I'm recalling information I'm actually recalling my note in my head on that subject. I write so much that in conversation "I was thinking the other day" is analogous to "I wrote down in my notes".

I might be crazy but I would put my vault in my will as something to be passed on, because there's so much me in them. My yearly journals in /02 Areas/Journals/ are the most obvious ones, but I have a /02 Areas/Writing/ folder that's just notes I consider "writing", whic is distincy from the contents of /03 Resources/ folder that's the "general knowledge" knowledgebase.

---

Anyway, I guess my tl;dr is that AI can never write about thinking as well as a human can, and in my opinion it's the thinking that important, not the writing. the writing or the words is merely a tool in thinking. Karpathy mistakes the words to be the goal, rather than the thinking that caused the words.

---

One last thing: I just re-read the HN guidelines out of curiosity, and I noticed they recently added "Don't post generated comments or AI-edited comments."

I could copy and paste almost anything from my vault into an HN comment without violating this rule. Anybody creating a PKM with this sytem could not. They would have to rewrite it in their own words. So one might as well just right it themself in the first place if they ever think they might want to reuse it in a place like HN.

---

[1] https://outhistory.org/exhibits/show/rev/hamilton-laurens-le...

[2] https://www.newtonproject.ox.ac.uk/view/texts/diplomatic/MIN...

[3] A while back, while in a Newton phase, I decided arbitrarily to refer to him as "Isaaci Newtoni," as that's how he called himself. I reinforced that by using that name for him in my notes. Now I call him that instinctually, not consciously.

[4] Intentional.


> Nowadays my writing [] has totally devolved into "prompt-ese."

I've noticed this myself. Even in my Obsidian vault, which only I read and write in. I think it's a development into writing more imperatively, instinctually. Thinking more in instructions and commands than the speaking and writing habits I've developed organically over my life. Or just "talking to the computer" in plain English, after having to convert my thoughts to code anytime I want to make it do something.

I've been thinking about the role of "director" in media as an analogy to writing with LLMs. I'm working right now on an "essay," that I'm not sure I'll share with anyone, even family (who is my first audience). Right now, under the Authorship section, I wrote "Conceived, directed, and edited by Qaadika. Drafted by Claude", with a few sentences noting that I take responsibility for the content, and that the arguments, structure, audience, and editorial judgments are mine.

I had a unique idea and started with a single sentence prompt, and kept going from there until I realized it should be an essay. So the ideas in it are mine. The thesis is mine. I'm going back and forth with the LLM section by section. Some prompts are a sentence. Some are eight paragraphs. I can read the output and see exactly what was mine and what the LLM added. But my readers won't. They'll just see "Author: Qaadika" and presume every single word was mine. Or they'll sniff out the LLM-ness and stop reading.

I can make a film and call myself director without ever being seen in it. Is is the same if I direct the composition of words without ever writing any of the prose myself? Presuming I've written enough in prompts that it's identifiably unique from cheaper prompts and "LLM, fill in the blank".

We credit Steven Spielberg with E.T. But he didn't write the screenplay. He probably had comments on it, though. He didn't operate the camera. But he probably told the operators where to put it. He didn't act in it. But he probably told the actors where to stand and where to move and how to be. He didn't write the music. But he probably had a sense of when and where to place it in the audio. And he didn't spend every moment in the cutting room, placing every frame just so.

But his name is at the top. He must have done something, even if I can't point to anything specific. The "Vibe" of the film is Spielberg, but it's also the result of hundreds of minds, most of whole aren't named until the end of the film, and probably never read by most viewers.

His contribution to the film was instructions. Do this, don't do that. Let's move this scene to here. This shot would be better from this angle. The musical swell should be on this shot; cut it longer to fit.

So where, exactly, is "Spielberg" in E.T.? What can we objective credit him with, aside from the finished product: E.T. the Extra-Terrestrial: Coming June 1982?


Uh, Steven Spielberg is all over E.T. For one thing, he storyboarded the big special effects sequences. He collaborated closely on the screenplay because it was drawn from his own childhood experiences. He was the final say in casting. His relationship with editor Michael Kahn is famously collaborative.

I think comparing your telling an LLM what to do and Steven Spielberg directing a movie just shows a total lack of understanding of how movies are made, and also inflates your own sense of your self.


> Uh, Steven Spielberg is all over E.T. For one thing, he storyboarded the big special effects sequences. He collaborated closely on the screenplay because it was drawn from his own childhood experiences. He was the final say in casting. His relationship with editor Michael Kahn is famously collaborative.

That's all meta. Trivia. Decisions he made or feedback he gave, that while influencing the final product cannot be observed in the final product (e.g. show me the actual Spielberg-drawn storyboard in the film; It doesn't exist, because the storyboard turned into a sequence of shots made by the cinematographer, instructing the camerawoman to point the camera at the actors lit by the gaffers, or into a work breakdown strucutre then followed by the SFX team painstakingly drawing it frame by frame). No one but Spielberg could say "That part was me, this part was Kahn's." I can't find any of that out just by watching the movie. When I engage with a piece of media, I presume the author is dead. What is in the media is canon, and what's not in it isn't. The behind the scenes, or the director's biography, or the interviews aren't part of the art. Art shouldn't rely on "Oh it's good, or even better than you thought it was once you know this cool fact or that wild story from production."

Star Wars isn't good only because George Lucas was a genius, or because they spent a lot of time on the models and tried a cool new text intro sequence, or because of any of the other novel effects. Lots of movies spend a lot of time in production, with a lot of experts and a lot of novel ideas, and still fail. Star Wars is good because the finished movie is good. We credit Star Wars generally as being George Lucas' brainchild, but if you know the backstory, it's only good because he had good editors to reign him in. But that's meta. Nobody knew that in 1977. They just knew they enjoyed the movie and it said "written and directed by George Lucas."

When I watch the movie I don't see the storyboard, or the redlines in the screenplay, or the casting notes, or the conversations and discussions with Kahn. All I know from the movie is the credits, and the credits don't say "Written by Melissa Mathison (with close collaboration by Spielberg based on his childhood experience)". Those are, from a lay viewer's POV, 'facts not in evidence.'

E.T. was a single example. I'm comfortable claiming my argument applies to all directors of all films, and all forms of art that are created by more than one person. Another example: "Over The Edge" and "Off The Wall", two books about deaths in US national parks. They each have two authors. Only one author co-wrote both of them. To whom do I credit my love for those books? Only to Ghiglieri, since I can see the consistent tone between them? That would be unfair to Myers and Farabee. Only to Myers and Farabee, because they're the park rangers that witnessed a number of the emergencies and deaths? That would be unfair to Ghiglieri. What about the editors, who surely worked hard to make books that are basically a list of stories about death interesting as a cohesive narrative. My only option is to credit all the authors, and everyone else involved, equally, and not try to break down paragraphs between "this author wrote this one, and that author wrote that one." They didn't distinguish, so I can't either. [1]

I'm all over my essay. I drafted and organized the original outline. I've made substantial changes to the order of paragraphs and what and how the arguments are built and developed based on my personal experiences. I am the final say for whose quotes are included and which ones are cut. My relationship with myself is famously collaborative (famous among my family and friends).

None of that matters to the reader. Whether I wrote it myself or with a friend, or used a ghostwriter, or used an LLM, the audience is going to credit or blame it on the name at the top. My papers in college weren't graded based on whether I spent 300 hours on them and revised them 20 times, or whether it was I or my classmate who coined that pithy line I then used throughout, or because I used niche knowledge about the subject I knew before taking the class. That's trivia. They were graded on the final single copy I submitted. I got once chance.

The only difference between an essay of mine being written by a ghostwriter I hired and an LLM is that the LLM output is always going to sound like an LLM. They are identical in that neither of them are "me". The ghostwriter will sound either like the ghostwriter or like the ghostwriter trying to write like me. But whether I hired a ghostwriter and published their work under my name, or if I used an LLM and the audience didn't notice, at the end of the day they'll credit or blame me entirely, because my name is at the top, no different as if I'd written the entire thing from scratch. I have no excuses except for the final product.

For this essay specifically, If I ever did release it or publish it, it would be under my real name. Firstly because I've never liked being "anonymous" online (I feel I never act or write like myself unless I'm speaking under my own name; opposite of most in my experience), and second because I would want the reader to know that there's a human they can credit or blame for it. I guess for me that's the tradeoff. When anonymous I won't use LLMs, because my ethos comes from being (and sounding) like a human being who merely doesn't want to share their name. Under my real name, however, I feel more comfortable saying "directed and edited by [real name], drafted by [llm]," because then the reader can decide if the ethos associated with my real name and affilations is strong enough to justify reading a logos and pathos that the human freely admits is not entirely from their own fleshy brain.

[1] They do, actually, at times. When one of the authors was directly involved in one of the stories and is recounting their personal experience, they will write "I (Myers)..." or "I (Farabee).." Aside from that they do not say who wrote what, or who influenced who.


> the big AI companies do have opt out mechanisms for scraping and search.

PRESS RELEASE: UNITED BURGLARS SOCIETY

The United Burglars Society understands that being burgled may be inconvenient for some. In response, UBS has introduced the Opt-Out system for those who wish not to be burgled.

Please understand that each burglar is an independent contractor, so those wishing not to burgled should go to the website for each burglar in their area and opt-out there. UBS is not responsible for unwanted burglaries due to failing to opt-out.


It's been interesting the past year or so watching myself turn more and more into one of the tin-foil wearing linux users. I'm not sure how it happened, but self-hosting became more and more alluring and hyperfocusing on taking as much data as I can offline became worth spending entire weekends on.

I didn't become paranoid, everybody else didn't!


It should take 0 seconds, because I shouldn't have to do it.

That's my bar. My time is my time, and anything that takes time from me better have a damn good excuse. Github is not bringing any good reasons to the table to justify making me take my time to protect privacy I've had by default up to now.


> https://github.blog/news-insights/company-news/updates-to-gi...

> Should you decide to participate in this program, the interaction data we may collect and leverage includes:

> - Outputs accepted or modified by you

> - Inputs sent to GitHub Copilot, including code snippets shown to the model

> - Code context surrounding your cursor position

> - Comments and documentation you write

> - File names, repository structure, and navigation patterns

> - Interactions with Copilot features (chat, inline suggestions, etc.)

> - Your feedback on suggestions (thumbs up/down ratings)

"should you decide to participate.."??? You didn't ask if I wanted to participate. You asked if I didn't.

I didn't get to decide to participate. I had to decide not to. You made me do work to prevent my privacy from being violated.


Do you use copilot?

First response: It doesn't matter if I use copilot right now. It matters if I will ever use copilot in the future. Opting-out is future-focused. What if I said "no, I don't use copilot, so I don't need to opt out", then a year from now start using copilot, completely forgetting about this whole debacle? That's the evil of opt-out. My inaction only benefits them, never me.

Second response: Maybe? I press the little button to auto-generate commit titles and messages that showed up in my Github Desktop. Does that count?

I'm asking sincerely. I don't "use Copilot" as in using it in VS Code or while writing code, so I'm honestly not sure if I am.


Do we get a choice? I did not ever explicitly enable it yet GitHub's web UI by default uses copilot to autofill my web-based edit commit messages. It also shows up on the home screen by default now.

I'm pretty sure if you use the site you're using GitHub Copilot in some way, so your question becomes irrelevant.


Do you think a single person works on a repo?

I felt myself starting to sweat as I read. I can't imagine doing this at my apartment complex, let alone at someone else's. Messing with building controls (old or unused as they may be) sounds like a great way to get your lease nixed and your ass out the door quicker than a lawyer can say "Yeah, I can't help you here, they're well within their rights to evict you for that."

I was hoping they'd mention something about the legality (or lack thereof), but I guess that's an exercise left to the reader who wants to try this out at their own apartment.


> sounds like a great way to get your lease nixed and your ass out the door quicker than a lawyer can say "Yeah, I can't help you here, they're well within their rights to evict you for that."

For repairing a broken thing? After provably trying in vain to get the landlord to fix it?


Well he didn't "fix" it, he hacked it to work for one tenant. And to allow said tenant's non-tenant's friends free access into the building. "fixing it" would be restoring the voice call ability to its original function. Not modding it for one random tenant's Apple Home setup.

And it's definitely possible to get in trouble for "fixing" something if you're not authorized to fix it.

I would call this "bypassing building controls to allow unauthorized access to the building." Frank has access to the building through the allowed means per his lease, not through any means. If his lease is like mine there's a whole page to initial about being granted access through the gates or pool or whatever with only the complex-assigned keys and RFID tags.

(I presume Frank lives in the US, and his state's tenancy laws similar to mine apply.)


> For repairing a broken thing? After provably trying in vain to get the landlord to fix it?

Down the hallway from my office used to be the management of a small hotel chain. We often had lunch together and I got to hear a bunch of interesting anecdotes over the years.

Way back when they started up and didn't yet have enough cash to actually own the buildings they operated in, they rented. One of the buildings turned out to have numerous issues (holes in the roof, gaps near exterior walls, etc...). To the point that they eventually didn't pass a fire inspection. They repeatedly asked the owner to have it fixed. Pressed for time, they themselves eventually payed someone, out of their own pocket, so it would at least be up to code for the fire inspection.

From what I was told, the owner threw a tantrum over them modifying the building, terminated the contract and sued them. Successfully.

If you are a tenant in a rental apartment, you'd probably have more leniency on the legal side (compared to a company renting a business property). But still, I'd be very careful making any assumptions about the legal situation rather than risking some sort of Kafkaesque legal mess.

Over here at least, it is very common in apartment complexes that the apartment owner is a different person/entity than the building owner and only the later has the rights to mess with stuff installed in the walls (e.g. plumbing) and especially stuff elsewhere in the building (e.g. an external intercom system). If you ask the landlord to fix it, the best they could do is forward that request to the building owner. If you pulled a stunt like the OP did, there's a good chance that the building owner will sue your landlord.


In the US states that I know well, a residential tenant may perform necessary repairs to bring the space up to health and safety codes, and may deduct the cost from their rent. They have an obligation to notify the property manager, in advance in the case of non-emergency repairs, or after the fact otherwise. There are additional details to consider as well.

I don't know if this would apply to a commercial tenant.

But it would definitely not apply to non-violating conditions like the OP's case.


> the owner threw a tantrum over them modifying the building, terminated the contract and sued them. Successfully.

Was the unauthorized modification permanent or undoable? If the latter, I think some people should really get their judge card (or landlord card) revoked.

Did the judge at least suggest what alternative action the tenant should have taken to comply with the law and code?


Most likely the (legally) correct thing to do in the US is to first report the landlord to the relevant agency, possibly named something like Licensing and Inspections or Fair Housing or somesuch. Each local jurisdiction will have it's own agencies for this, so do research. Failure to respond to that would next involve a landlord-tenant lawyer.

Whether or not it's worth all the trouble and time is a different matter. For most people, I'd say reporting to relevant authorities to make the landlord's life harder without needing much continuing effort is probably worth doing, but the lawsuit side is likely to be a huge time and money sink and it's almost always easier to just move. Let the city sue them for continuing to accrue complaints of unsafe living conditions.

In the same way, a landlord cannot evict you themself if you just fail to pay rent, but there are multiple legal mechanisms to eventually get the sheriff to do it for them. Basically, if landlord-tenant negotiation fails, I think the only legal recourse is to involve governmental third parties unless you technically open yourself up to legal reprisal.


> an attack on human work

A "software engineering team" is human, so that does change the argument. It becomes a human-human problem, not a human-machine problem. Or in OSHA terms, a problem to be solved with administrative controls rather than engineering controls.


Yeah, I wonder if the author has been in a situation where a brief explanation was taken by a higher up (or a cc'd higher-up x2, or x3) as "It was entirely my fault and I'm withholding details that would further implicate me and giving only the facts that don't."

I've had to work to balance emails like this between "they don't want the nitty gritty, they just want to be satisfied the issue is solved" and "They will definitely want the nitty gritty and think something is up if the details seems suspiciously sparse". Especially if the recipients are technical, and they know that you know that they're technical. what are you hiding, Qaadika? you're usually more verbose than this.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: