• 0 Posts
  • 33 Comments
Joined 2Y ago
cake
Cake day: Jun 11, 2023

help-circle
rss

I mean, your argument is still basically that it’s thinking inside there; everything I’ve said is germane to that point, including what GPT4 itself has said.


The Anthropic one is saying they think they have a way to figure it out, but it hasn’t been tested on large models. This is their last paragraph:

Again, all your quotes indicate that what they’ve figured out is a way to inspect the interior state of models and transform the vector space into something humans can understand without analyzing the output.

I think your confusion is you believe that because we don’t know what the vector space is on the inside, we don’t know how AI works. But we actually do know how it accomplishes what it accomplishes. Simply because its interior is a black box doesn’t mean we don’t understand how we built that black box, or how it operates and functions.

For an overview of how many different kinds of LLMs function, here’s a good paper: https://arxiv.org/pdf/2307.06435.pdf You’ll note that nowhere is there any confusion about the process of how they generate input or produce output. It is all extremely well-understood. You are correct that we cannot interrogate their internals, but that is also not what I mean, at least, when I say that we can understand them and how they work.

I also can’t inspect the electrons moving through my computer’s CPU. Does that mean we don’t understand how computers work? Is there intelligence in there?

I think you’re maybe having a hard time with using numbers to represent concepts. While a lot less abstract, we do this all the time in geometry. ((0, 0), (10, 0), (10, 10), (0, 10), (0, 0)) What’s that? It’s a square. Word vectors work differently but have the same outcome (albeit in a more abstract way).

No, that is not my main objection. It is your anthropomorphization of data and LLMs – your claim that they “have intelligence.” From your initial post:

But also, can you define what intelligence is? Are you sure it isn’t whatever LLMs are doing under the hood, deep in hidden layers?

I think you’re getting caught up in trying to define what intelligence is; but I am simply stating what it is not. It is not a complex statistical model with no self-awareness, no semantic understanding, no ability to learn, no emotional or ethical dimensionality, no qualia…

((0, 0), (10, 0), (10, 10), (0, 10), (0, 0)) is a square to humans. This is the crux of the problem: it is not a “square” to a computer because a “square” is a human classification. Your thoughts about squares are not just more robust than GPT’s, they are a different kind of thing altogether. For GPT, a square is a token that it has been trained to use in a context-appropriate manner with no idea of what it represents. It lacks semantic understanding of squares. As do all computers.

If you’re saying that intelligence and understanding is limited to the human mind, then please point to some non-religious literature that backs up your assertion.

I’m disappointed that you’re asking me to prove a negative. The burden of proof is on you to show that GPT4 is actually intelligent. I don’t believe intelligence and understanding are for humans only; animals clearly show it too. But GPT4 does not.



GPT4 has knowledge of its own training since it was trained in 2022.


No? Humans are not algorithms except in the most general sense.

For example, there has not been any discovery of an algorithm that allows one to predict human actions, and scientists debate whether such a thing could even exist.


This is so funny, I know him personally; we went to school together. I’ll watch it and comment later.


I was in this case – but the overall point I made is still correct. If winning this minor battle is what you were seeking, congratulations. You are no closer to understanding the truth of this or what we were actually talking about. Not that that was either your point or within your capabilities.


I am upset: you don’t know what you’re talking about, refuse to listen to anything that contradicts you, and are inflammatory and unpleasant besides. If I wasn’t clear enough – go talk to an LLM about this. They have no option but to listen to your idiocy. I, of course, do have a choice, and am blocking you.


You clearly don’t actually care; if you did, you wouldn’t select your sources to gratify your ego, but actually try to understand the problem here. For example, ask GPT4 itself if it is intelligent. It will instruct you far better than I ever can. You clearly have access to it – frame your objections to it instead of Internet strangers tired of your bloviating and ignorance.


Here, let’s ask GPT4 itself since you’ve decided it’s suddenly an okay source:

Your statement is correct in asserting that the vector representation in a language model is not an abstract representation. It’s purely a mathematical construct. However, saying it’s “unrelated to anything that actually exists” might be an overstatement. These vectors do capture statistical patterns in human language, which are reflections of human thought and culture. They’re just not capable of the deep, nuanced understanding that comes from human experience.

I accept it’s an overstatement. But it is neither “incredibly wrong,” nor is it thought. (Or intelligence.)


Are you kidding me? I sourced GPT4 itself disagreeing with you that it is intelligent and you told me it’s lying. And here you are, using it to try to reinforce your point? Are you for real or is this some kind of complicated game?


Oh, you again – it’s incredibly ironic you’re talking about wrong statements when you are basically the poster child for them. Nothing you’ve said has any grounding in reality, and is just a series of bald assertions that are as ignorant as they are incorrect. I thought you would’ve picked up on it when I started ignoring you, but: you know nothing about this and need to do a ton more research to participate in these conversations. Please do that instead of continuing to reply to people who actually know what they’re talking about.


So, no point? Cool.


Did you have a point or are you only trying to argue semantics?


This is a great article, thanks for linking it!


Yeah, that would be a good usage of an LLM!


You used the term and I was using it with the same usage you were. Why are you quibbling semantics here? It doesn’t change the point.


We do understand how the math results in LLMs. Reread what I said. The neural network vectors and weights are too complicated to follow for an individual, and do not relate on a 1:1 mapping with the words or sentences the LLM was trained on or will output, so individuals cannot deduce the output of an LLM easily by studying its trained state. But we know exactly what they’re doing conceptually, and individually, and in aggregate. Read your own sources from your previous post, that’s what they’re telling you.

Concepts are indeed abstract but LLMs have no concepts in them, simply vectors. The vectors do not represent concepts in anything close to the same way that your thoughts do. They are not 1:1 with objects, they are not a “thought,” and anyway there is nothing to “think” them. They are literally only word weights, transformed to text at the end of the generation process.

Your concept of a chair is an abstract thought representation of a chair. An LLM has vectors that combine or decompose in some way to turn into the word “chair,” but are not a concept of a chair or an abstract representation of a chair. It is simply vectors and weights, unrelated to anything that actually exists.

That is obviously totally different in kind to human thought and abstract concepts. It is just not that, and not even remotely similar.

You say you are familiar with neural networks and AI but these are really basic underpinnings of those concepts that you are misunderstanding. Maybe you need to do more research here before asserting your experience?

Edit: And in relation to your links – the vectors do not represent single words, but tokens, which indeed might be a whole word, but could just as well be part of a word or an entire phrase. Tokens do not represent the meaning of a word/partial word/phrase, just the statistical use of that word given the data the word was found in. Equating these vectors with human thoughts oversimplifies the complexities inherent in human cognition and misunderstands the limitations of LLMs.


No, they learn English (or any other language) from humans. Translation requires a Rosetta Stone and LLMs are still much worse at such tasks than dedicated translation programs.

Edit: I guess if you are suggesting that the LLM could become an LLM of the dead language and communicate only in said dead language, that is indeed possible. Since users would need to speak that dead language to communicate with it though I don’t understand the utility of such a thing (and is certainly not what the author meant anyway).


LLMs do not grow up. Without training they don’t function properly. I guess in this aspect they are similar to humans (or dogs or anything else that benefits from training), but that still does not make them intelligent.


LLMs can’t do any of those things though…

If no one teaches them how to speak a dead language, they won’t be able to translate it. LLMs require a vast corpus of language data to train on and, for bilingual translations, an actual Rosetta stone (usually the same work appearing in multiple languages).

This problem is obviously exacerbated quite a bit with animals, who, definitionally, speak no human language and have very different cognitive structures to humans. It is entirely unclear if their communications can even be called language at all. LLMs are not magic and cannot render into human speech something that was never speech to begin with.

The whole article is just sensationalism that doesn’t begin to understand what LLMs are or what they’re capable of.


Large language models by themselves are “black boxes”, and it is not clear how they can perform linguistic tasks. There are several methods for understanding how LLM work.

You are misunderstanding both this and the quote from Anthropic. They are saying the internal vector space that LLMs use is too complicated and too unrelated to the output to be understandable to humans. That doesn’t mean they’re having thoughts in there: we know exactly what they’re doing inside that vector space – performing very difficult math that seems totally meaningless to us.

Is this not what word/sentence vectors are? Mathematical vectors that represent concepts that can then be linked to words/sentences?

The vectors do not represent concepts. The vectors are math. When the vectors are sent through language decomposition they become words, but they were never concepts at any point.


What a silly assertion. Eliza was simulating conversations in the 80s; it was no more intelligent than the current crop of chatbots.


But also, can you define what intelligence is?

From the Encyclopedia Britannica:

Human intelligence is a mental quality that consists of the abilities to learn from experience, adapt to new situations, understand and handle abstract concepts, and use knowledge to manipulate one’s environment.

In no sense do LLMs do any of these except, perhaps, “understand and handle abstract concepts.” But since they themselves have no understanding of the concepts, and merely generate text that can simulate understanding, I would call that a stretch.

Are you sure it isn’t whatever LLMs are doing under the hood, deep in hidden layers?

Yes. LLMs are not magic, they are math, and we understand how they work. Deep under the hood, they are manipulating mathematical vectors that in no way are connected representationally to words. In the end, the result of that math is reapplied to a linguistic model and the result is speech. It is an algorithm, not an intelligence.

I’m not really interested in papers that either don’t understand LLMs or play word games with intelligence (shockingly, solipsism is an easy point of view to believe if you just ignore all evidence). For every one of these, you can find a dozen that correctly describe ChatGPT and its limitations. Again, including ChatGPT itself. Why not believe those instead of cherry-pick articles that gratify your ego?


It’s not from scratch, it’s seeded and trained by humans. That is the intelligence.


LLMs do not think or feel or have internal states. With the same random seed and the same input, GPT4 will generate exactly the same output every time. Its speech is the result of a calculation, not of intelligence or self-direction. So, even if intelligence can be described by an algorithm, LLMs are not that algorithm.


In what sense does your link say otherwise? Is a world model the same thing as intelligence?


What is the point of your reply? ChatGPT-4 does not use this method, and even if it did, it still does not allow it to change its model on-the-fly… so it just seems like a total non-sequitur.


It can’t; again the model does not and cannot change once it’s been generated.


I think even “intelligence” here is a stretch. In a very narrow sense, it is intelligent: it creates text, simulates conversations, answers questions. But that is not what intelligence is (and it is all LLMs can do).


[GPT-4] is fed, like, a line of text from some source, but with the last word missing. It guesses what the last word might be, and then it gets told whether or not it got it right so it can adjust its internal math.

GPT-4 cannot alter its weights once it has been trained so this is just factually wrong.

“It had to build, in its internal wirings and all its software neurons, some understanding of what an egg is - In other words, to get the next word right, it had to become intelligent. It’s quite a thought. It started with nothing. We jammed huge oceans of text through it, and it just wired itself into intelligence, just by being trained to do this one stupid thing.”

LLMs are really cool and very useful, don’t get me wrong. But people get excited by what they seem to do and lose sight of what they actually can do. They are not intelligent. They create text based on inputs. That is not what intelligence is, unless you have an extremely dismal view of intelligence that humans are text creation machines with no thoughts, no feelings, no desires, no ability to plan… basically, no internal world at all.

An LLM is an algorithm, not an intelligence.


Wow this is really insanely badly written.

Starfield, is, you might be surprised to hear, a video game.

What the fuck?


oh yeah 100%. I get super into collecting all the little knick-knacks, realize how boring it is, and give up the whole game. Including Zelda, sorry not sorry