damn i really hope they stay. this right after their spotify crawl and domain suspension doesn’t inspire hope.

  • nullptr
    English
    arrow-up
    11
    ·
    14 hours ago

    simply a data grab for their ai training sets.

  • abbiistabbii@lemmy.blahaj.zone
    English
    arrow-up
    43
    arrow-down
    2
    ·
    19 hours ago

    I don’t know why, but I find it absolutely hilarious that Nvidia, a company currently in the AI business and notorious for not giving a shit about copyright, is just straight up going to Anna’s Archive.

  • Almacca@aussie.zone
    English
    arrow-up
    21
    arrow-down
    1
    ·
    edit-2
    18 hours ago

    Have you seen the quality of some of those OCR scans? I’m reading the Stainless Steel Rat books from Anna’s Archive right now, and the number of errors is ridiculous, and it’s not an isolated case. Pretty much every one I’ve read had at least a few. Good luck getting decent training data from them.

  • BlueSquid0741
    English
    arrow-up
    27
    ·
    20 hours ago

    Anna’s Archive is the perfect place to find specific translations of ebooks. That’s something I hadn’t realized I needed until recently.