Millions of articles from The New York Times were used to train chatbots that now compete with it, the lawsuit said.

  • @Blue_Morpho@lemmy.world
    link
    fedilink
    46 months ago

    On the one hand it should be a copyright violation but if it is then Google search, and all search engines are too.

    The only reason you can search for an article and get a hit is Google already read the page and copied it all to it’s internal servers where everything is indexed. So when you search, Google can look up the keywords and provide you a link.

    If there was a bug in Google’s search engine like OpenAI’s, you could craft a query that would leak Google’s indexed data.

    So all search engines are the same copyright violators as OpenAI. They take data from everyone and profit from it.(even if it is indirect or paying salaries)

    • @yamanii@lemmy.world
      link
      fedilink
      26 months ago

      The google search doesn’t summarize the article for me so I have no reason to ever visit the site though.

    • @GarlicToast@programming.dev
      link
      fedilink
      26 months ago

      Google is directing me to NYT, which make revenue for both parties. OpenAI does not direct me to the NYT, they try to replace them, this is a parasitic relation. If you hacked Google to pull the article from their cache, you will go to jail.

      • @Blue_Morpho@lemmy.world
        link
        fedilink
        16 months ago

        If you hacked Google to pull the article from their cache, you will go to jail.

        Google has a “preview” button which shows the article without clicking the link.

        Is crafting a query to show an article “hacking”? Does that make the OpenAI researcher who got chatgpt to show an article a hacker?