SDF Chatter
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Sunshine (she/her)@piefed.ca to Technology@piefed.socialEnglish · 1 day ago

Google Built Its Empire Scraping The Web. Now It’s Suing To Stop Others From Scraping Google

www.techdirt.com

external-link
message-square
6
link
fedilink
  • cross-posted to:
  • legalnews@lemmy.zip
  • libreculture@lemmy.ca
116
external-link

Google Built Its Empire Scraping The Web. Now It’s Suing To Stop Others From Scraping Google

www.techdirt.com

Sunshine (she/her)@piefed.ca to Technology@piefed.socialEnglish · 1 day ago
message-square
6
link
fedilink
  • cross-posted to:
  • legalnews@lemmy.zip
  • libreculture@lemmy.ca
Last week, Google filed suit against SerpApi, a scraping company that helps businesses pull data from Google search results. The lawsuit claims SerpApi violated DMCA Section 1201 by circumventing G…
alert-triangle
You must log in or register to comment.
  • 0_o7@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    3
    ·
    13 hours ago

    Archive: https://web.archive.org/web/20251224194440/https://www.techdirt.com/2025/12/24/google-built-its-empire-scraping-the-web-now-its-suing-to-stop-others-from-scraping-google/

    Couldn’t archive on archive.today, they put up a captcha, and google one at that. That doesn’t let me through at all.

  • scytale@piefed.zip
    link
    fedilink
    English
    arrow-up
    8
    ·
    19 hours ago

  • mesa@piefed.social
    link
    fedilink
    English
    arrow-up
    18
    ·
    edit-2
    1 day ago

    Google and OpenAI sucks:

    Google’s legal theory has another significant problem: the requirement that a TPM must “effectively control” access. Just last week, a court rejected Ziff Davis’s attempt to turn robots.txt into a 1201 violation when OpenAI allegedly ignored its crawling restrictions. The court’s reasoning is directly applicable here:

    OpenAI slamed my small server into the ground, until I put fail2ban on top. It was really bad, like thousands of requests per second bad.

    • apftwb@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      11 hours ago

      How does fail2ban prevent scrapping? My understanding was that fail2ban works on failed login attempts.

      • mesa@piefed.social
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 hours ago

        There’s some premade scripts out there that make it do more. I have it hooked up to nginx and other such logs. Its common enough in login attempts for login portals online, not just ssh. It can work with any grep-able log file.

        I just took two scripts other people have made, verified they soon my mini PC and set it loose. Within about 10 min it caught most scrappers and banned the IPs.

  • watson@sopuli.xyz
    link
    fedilink
    arrow-up
    7
    ·
    1 day ago

    Fuck Google

Technology@piefed.social

technology@piefed.social

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@piefed.social

Tech related news and discussion. Link to anything, it doesn’t need to be a news article.

Let’s keep the politics and business side of things to a minimum.

Rules

No memes

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 175 users / day
  • 706 users / week
  • 1.72K users / month
  • 1.73K users / 6 months
  • 1 local subscriber
  • 1.36K subscribers
  • 187 Posts
  • 393 Comments
  • Modlog
  • mods:
  • Rimu@piefed.social
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org