Damn, I really hope they stay. This coming right after their Spotify crawl and domain suspension doesn’t inspire hope.
Isn’t that what Anna’s Archive is looking for? They even have a separate page for exactly that use case: https://annas-archive.li/llm
I just hope that Nvidia seeds their torrents!!
Simply a data grab for their AI training sets.
I don’t know why I find it absolutely hilarious that Nvidia, a company currently in the AI business and notorious for not giving a shit about copyright, is just straight up going to Anna’s Archive.
Nah, it’s pretty simple actually. If the archive doesn’t exist at all, they can’t even steal from it.
Fucking Schroedinger’s copyright
Have you seen the quality of some of those OCR scans? I’m reading the Stainless Steel Rat books from Anna’s Archive right now, and the number of errors is ridiculous, and it’s not an isolated case. Pretty much every one I’ve read had at least a few. Good luck getting decent training data from them.
Anna’s Archive is the perfect place to find specific translations of ebooks. That’s something I hadn’t realized I needed until recently.
Anyway I didn’t find the confidential book there…





