One prominent author responds to the revelation that his writing is being used to coach artificial intelligence.

By Stephen King

Non-paywalled link: https://archive.li/8QMmu

  • FaceDeer@kbin.social
    link
    fedilink
    arrow-up
    3
    ·
    1 年前

    We do know that they can be coaxed into spitting out the original work, though, which sure implies it is in there.

    Only very rarely, under extreme cases of overfitting. Overfitting is a failure state that LLM trainers want to avoid anyway, for reasons unrelated to copyright.

    There simply isn’t enough space in a LLM’s neural network to be storing actual copies of the training data. It’s impossible, from a data compression perspective, to fit it in there.