I hear people saying things like “chatgpt is basically just a fancy predictive text”. I’m certainly not in the “it’s sentient!” camp, but it seems pretty obvious that a lot more is going on than just predicting the most likely next word.

Even if it’s predicting word by word within a bunch of constraints & structures inferred from the question / prompt, then that’s pretty interesting. Tbh, I’m more impressed by chatgpt’s ability to appearing to “understand” my prompts than I am by the quality of the output. Even though it’s writing is generally a mix of bland, obvious and inaccurate, it mostly does provide a plausible response to whatever I’ve asked / said.

Anyone feel like providing an ELI5 explanation of how it works? Or any good links to articles / videos?

  • bionicjoey@lemmy.ca
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    1 year ago

    It is literally the same exact kind of algorithm that predicts the next word you will type on your phone based on what’s already been typed. The differences are that it has a much larger training dataset, which means more accurate predictions, it processes based on the entire body of text that has already been given (including the hidden prompt and previous messages), and that it doesn’t always predict whole words, but instead clusters of characters.

    If you want a more general overview of how machine learning works in general, this is a good video series to watch: https://www.youtube.com/watch?v=aircAruvnKk

    If you want to see some evidence that it doesn’t truly understand what it says, try having it generate and explain some jokes or riddles that rely on wordplay. It will completely shatter the illusion.

  • SorteKanin@feddit.dk
    link
    fedilink
    arrow-up
    0
    ·
    1 year ago

    it seems pretty obvious that a lot more is going on than just predicting the most likely next word.

    But that is all that’s going on. It has just been trained on so much text that the predictions “learn” the grammatical structure of language. Once you can form coherent sentences, you’re not that far from ChatGPT.

    The remarkable thing is that prediction of the next word seems to be “sufficient” for ChatGPT’s level of “intelligence”. But it is not thinking or conscious, it is just data and statistics on steroids.

    • datavoid@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Try to use it to solve a difficult problem and it will become extremely obvious that it has no idea what it is talking about.

    • Dran@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      1 year ago

      The magic sauce is context length within reasonable compute restraints. Phone predictive text has a context length of like 2-3 words, ChatGPT (and other LLMs) have figured out how to do predictions on thousands or tens of thousands of words of context at a time.

  • huginn@feddit.it
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    1 year ago

    I think there’s a second, unstated issue at play here: you’re experiencing a very deep cognitive bias. An exploit in the human brain.

    The human brain is a fantastically complex piece of meat but one of its many issues is the anthropomorphic bias: the tendency to ascribe human traits, especially agency and cognition, to things or animals that do not have those traits.

    We tend to believe if it walks like a duck and talks like a duck it must be a duck. ChatGPT is a very complex and highly specialized algorithm that outputs data just like another online human… But 100% of it is just a model processing your input and returning it back out. It talks like a human but is more akin to notepad than it is to us.

    To be clear: that bias exists in everyone. We all do this. Anytime I talk about my dog scheming to get my attention I’m hitting that bias. Anytime my robot vacuum interrupts me doing the dishes I talk at it and tell it to go away. I interact with the world around me as though most things are human.

    • Randomgal@lemmy.ca
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      Yep, this is a very good explanation. Seeing ChatGPT “talk” is immediately associated with sentience, because for your entire life, and millions of years of evolution, apeech was in 99.9% of cases, a sign of aentience. So your brain doesn’t even consider it a question, until you consciously stop to think about it.

      An interesting way to antromorphizise GPT that’s still technically correct is to think of it as having essentially perfect memory. So it doesn’t know how to talk, but it has seen so many conversations (literal trillions) that it can recognize the patterns that make up speech and simply “remember” what the most likely combination of words is, given the context, with zero actual “understanding” of language. (Human trainers then fine-tune these guesses to give you the ChatGPT experience)

  • DarkGamer@kbin.social
    link
    fedilink
    arrow-up
    0
    arrow-down
    1
    ·
    1 year ago

    I found this example useful, this dude builds a simple LLM that writes infinite Shakespeare and walks you through each step.

    Eli5: large language models like chat GPT are really good at guessing what combinations of words are most likely to be a good response to questions. They’re so good at it that many people think they are intelligent even though they’re not.