New User Monitoring

Spyro@programming.dev · edit-2 16 days ago

New User Monitoring

Otter@lemmy.ca · edit-2 16 days ago

For what it’s worth, some other fediverse platforms have built in functionality like this to catch spam accounts. It’s a big problem on Pixelfed, and so there’s an admin feature that runs locally to try and catch that. We check for false positives about once a day, and it’s supposed to get better over time at catching harmful content.

I see the value in keeping the exact implementation a secret so that hostile people can’t game the system. Maybe you can automate a way to release a regular report on which accounts were flagged? That might resolve the transparency concerns people have.

Zink@programming.dev · 16 days ago

Thanks for continuing to put in the volunteer hours to keep this little corner of the internet running and keeping it from being used for harm where possible.

I get that in the world we live in, many people are going to be primed to react harshly to “monitoring” wrapped in language about how it will help us. But considering the source, the public context, and the motivations involved, nah.

Mikina@programming.dev · 16 days ago

I’m mostly ok with it, paradox of tolerance and all that, plus that’s just a common moderation and since I’m not using DMs, I don’t care about that either, but seeing the reaction of others - have you cosidere not scanning DMs in the first iteration at least? Even if DMs are public thanks to how I presume activitypub works, it’s not something you can easily get to or understand as an average user, and reading the sentece “we will scan your DMs” will upset a lot of people, regardless of context.

From the PR standpoint, I don’t think it’s worth it, and it’d be better to just leave it on reports.

You will get headlines “Programmers.dev scans your DMs”, and people will not care or know how do they actually work on Fedi.

UlrikHD@programming.dev · 16 days ago

From the PR standpoint, I don’t think it’s worth it, and it’d be better to just leave it on reports.

The problem is that we can’t rely on reports of DMs since lemmy doesn’t federate them to us.

E.g.
-> troll@programming.dev makes a new account and sends harassment to victim@lemmy.instance
-> victim@lemmy.instance reports the DMs from troll@programming.dev
-> We, the admins of programming.dev, do not see this report because lemmy does not federate the DM report and troll@programming.dev can continue harassing others because we never find out about it.

This isn’t just a theoretical, it happened just last month that one of our users (1 day old account) sent rape and death threats (which were reported), and we found out about it by pure chance when talking to admins from the other instance.

And just to clarify, the tool only automatically monitors new accounts, i.e. accounts that are being registered today. If you account is more than a few weeks old, the tool doesn’t monitor any of your activity.

Mikina@programming.dev · edit-2 16 days ago

I see, that sucks. Is it being discussed/adressed on the protocol level? This sounds like something that should be adressed in general, federation of reports, because it is a serious issue.

I can imagine a solution in upstream Lemmy repo, where report button also sends the report to the lemmy instance of the account instance outside of ActivityPub.

Mniot@programming.dev · 16 days ago

Yeah, not federating reports is a bug-level missing feature.

But having some kind of probation period also seems like an obvious feature to me (that should be optional, not mandatory). It was not uncommon in the past (on BBSes and old forums) to have new members’ posts require approval before anyone could see them. I suspect it partially went away because big platforms needed to “growth hack” and anything that slows down the influx of new users is bad.

UlrikHD@programming.dev · 16 days ago

It’s an open issue on github from 2024, it doesn’t seem to be a priority. This tool allows us to react faster than reports though, and hopefully remove some problematic content before it’s seen by others.

Mikina@programming.dev · 15 days ago

I’ve bumped the issue, since it already had an PR ready for review, and there seem to be some activity going on, so hopefully it will be merged soon.

It sounds like a really important thing to me.

Gonzako@lemmy.world · 16 days ago

Best of luck! I always see your work when a mass commenter gets sent loose, those get shot real quick. Is there a possibility of having victim-side measures? Like a DM whitelist, for example.

Ategon@programming.dev · 15 days ago

I do think having options like letting people disable dms, letting people choose to need to approve dms to get notifications about them and then having a whitelist makes sense. I can make tickets for those in the lemmy repo later if they don’t exist already

interloper@programming.dev · 14 days ago

It would be nice to have a link to some list of expected etiquette. For instance, is it rude to DM people who don’t know you? Is there an acceptable way to go about this? Are there guidelines for what deserves a downvote? Some “Welcome to our server” post to get the lay of the land on consensus proper behavior…

CameronDev@programming.dev · 14 days ago

There is a code of conduct here:

https://legal.programming.dev/docs/code-of-conduct/

If you’re within those guidelines, you’re probably fine.

Dr. Wesker · edit-2 16 days ago

@weskersucks@lemmy.world did you see this?

Nate Cox@programming.dev · 14 days ago

I think I’m mostly ignorant here. I of course know that abuse on these platforms is always an issue but I didn’t know we were having any particularly bad instances of it today. Can you point me to a reference so I can get up to date?

Spyro@programming.dev · 13 days ago

Fortunately, most of the visible stuff has been caught and purged quickly, so you may have just missed it.

The DM abuse is basically invisible except to the targets.

Mystic Mushroom [Ze/Zir]@lemmy.dbzer0.com · 15 days ago

Thank you for putting in the effort to help curb this problem. I know it’s unpopular but automated moderation is unfortunately needed on Lemmy and in the fediverse or people will abuse the system.

username_1@programming.dev · edit-2 16 days ago

deleted by creator

tyler@programming.dev · 16 days ago

Do explain…

username_1@programming.dev · edit-2 16 days ago

deleted by creator

UlrikHD@programming.dev · 16 days ago

We’ll take the fact that you haven’t noticed much spam as a compliment.

The most recent spam that we have seen is harassment/doxxing campaigns that target specific users. It’s not something the average user would notice as the harassment is often DMs or pings on random posts. That, and a ban evader with a recognisable writing style.

As you pointed out, there is no monetary interest on our end. We are just volunteers looking to fix the blindpots in the moderation tools. We have no interest in needlessly censoring speech, and if you ever feel we moderate the instance too harshly, we are open to taking feedback. You can always check the modlog to see our activity. The guidelines for our moderation can be seen here.

adr1an@programming.dev · 16 days ago

You just joined in February so maybe you are taking it personally? In either case, let me share that I have been on this instance for few years and I trust the admins motives as they expressed.

Your claim is very much and accusation without evidence. There’s no such track record for the humans involved. On the contrary, they are trying to build a community.

I rather not argue what is good or bad censorship. I kind of agree with you.

How about giving the admins the benefit of the doubt?

Time will tell if they are censoring a certain voice, and which one that is. Only then it make sense to judge their actions…

username_1@programming.dev · edit-2 16 days ago

deleted by creator

tyler@programming.dev · 16 days ago

what loaded gun? admins were already dealing with spammers, this just allows them to do it faster. lemmy already has a built in profanity filter and yet you’re using it just fine.

onlinepersona@programming.dev · edit-2 16 days ago

We got him, admins! This one right here!

Seriously though, if you want total freedom, no verification and no monitoring, setup your own instance. Tell is how it works out for you.

Or better yet, go back to Truth Social or 4chan

username_1@programming.dev · edit-2 16 days ago

deleted by creator

kingofras@lemmy.world · edit-2 16 days ago

but some of you have been impacted by the systems false positives.

Found the Palentir programmers!

If you’re open to feedback: this is truly horrifying to see entering lemmy on a day that isn’t April 1st.

ETA: this comment was directed to the OP, and the current version is a huge improvement over what was written then, hence the flood of downvotes on this now. The post is also not having an Edit: at the bottom to make clear such a significant edit was made.

Spyro@programming.dev · 16 days ago

We don’t want to do this kind of thing, we are being forced to, because bad actors keep using our server to send harassment.

If you know a better way to handle that, we are all ears

kingofras@lemmy.world · 16 days ago

If your OP and your response to me could literally be word for word what Stephen Miller says about the need for current US immigration policy, you bet there has to be a better way.

That’s probably more than a quick back and forth but would include some questions like:

is this just your instance or does it happen wider?
have other instances had to deal with similar and how did they resolve it?
how are is this being evaluated? OP seems to not really suggest transparency?
is there anything on the instance or its communities that is causing a disproportionate amount of account abuse to come to your instance?
is there merit to simply suspending new membership applications for a month before doing anything privacy invading.

Also: you do what you want on your instance but as a new user I hope this is very clearly advertised to new users signing up, as this is very yuck and privacy invading, not to mention tone deaf for the world we live in.

Then again I’m probably not in your target audience.

UlrikHD@programming.dev · 16 days ago

We only monitor new users on our instance, the tool is written by the admin team and no personal data is sent out of the server. The tool operates on the server communicating directly with the lemmy database on the server.
Other instances have their methods I’m sure, we haven’t cooperated with any other instances with this tool. I’m sure there’s a spectrum of 100% pure manual moderation to a highly automated process.
The tool scans for typical patterns we see in the trolls. Some of the patterns I’m sure you can guess, such as using slurs or uploading images. The tool temporarily removes the content and pings the admin team so that we can manually verify the content and restore it.
No it’s a problem across the fediverse, it seems to be people being banned from various instances that creates new accounts to evade bans. We have seen harassment campaigns where the user creates new accounts on multiple instances and attacks the victim via comments and DMs before moving on to the next instance to repeat the harassment.
That would only fix the issue for the duration of new applications being closed.

There isn’t anything new in regards to privacy with this move. We are just adding a tool to help flag content quicker, and we want to be transparent about out moderation. That’s why we made our admin guidelines public.

kingofras@lemmy.world · 16 days ago

From what I’m gathering is that OP possibly should not be your spokesperson, as your wording sounds far less dystopian and controlling.

I still don’t like it, but if you are disclosing it properly and you’re not offshoring the data that is monitored, that will earn you some goodwill.

It really sounds like Lemmy software is trailing behind on keeping up with the bots.

UlrikHD@programming.dev · 16 days ago

We will have a review of how we write posts and follow up in the comments later today to make sure we are clearer in our communication, and that we don’t end up with 3 different admins chiming in on separate comments.

I still don’t like it, but if you are disclosing it properly and you’re not offshoring the data that is monitored, that will earn you some goodwill.

We try to always make pinned announcements when making notable changes to our operation to keep the community informed. Any changes, minor or major, that we believe our users would want to know, we will always post about here.

It really sounds like Lemmy software is trailing behind on keeping up with the bots.

Lemmy is trailing behind when it comes to moderation tools just in general, which is what that is forcing us to make our own tools to help with moderation.

Spyro@programming.dev · 16 days ago

deleted by creator

Ategon@programming.dev · edit-2 16 days ago

for a couple of those

The things that are checked are public (posts, comments, and dms that are also sent to other servers we dont control) and can be moderated on regardless of this. There isnt anything that breaches privacy

The way its being moderated isnt disclosed so that people cant just bypass it. The processing is local though like said in the original message (so for example were not sending this to some LLM/Ai to then use)

Spyro@programming.dev · 16 days ago

deleted by creator

Ategon@programming.dev · edit-2 16 days ago

Everything is sent for human review so false positives can be reversed quickly and the worst action that happens is removal of content temporarily (before thats reversed)

This just means someone cant spin up a new account and spam things on the site or on federated sites and have that linger until its able to be moderated

tyler@programming.dev · 16 days ago

even before the edit it was fine… you just need to read more carefully.