Millions of articles from The New York Times were used to train chatbots that now compete with it, the lawsuit said.

  • @kromem@lemmy.world
    link
    fedilink
    English
    66 months ago

    Spaceballs extracts almost all of its value from Star Wars without paying for it.

    You absolutely can extract value from things when the way in which you do it is fair use.

    Which is typically considered to be use that is transformative enough so as to not simply be derivative, or in the public interest.

    And I think you’d have a very difficult time showing LLMs general use to be derivative of any specific part of the training data.

    We’ll see soon, as these court cases resolve.

    And if the cases find in favor of the plaintiffs, “not charging” isn’t going to work out. You can’t copy material and not charge for it and get away with it. If there’s prior law that training is infringement, it’s unlikely the decision will be worded so narrowly that similar cases against companies that don’t charge will be found not to be infringement.

    Keep in mind one of the pending cases is against Meta, whose model is completely free to access and use.