#2384 - Anti-spam heuristics
0 guests and 0 members have recently viewed this.
The top 3 point earners from 7th Dec 2025 to 14th Dec 2025.
| PDStig |
|
|
|---|---|---|
| Gabri |
|
|
| Master Rat |
|
|
There are no events at this time
But agreed. Technically, virtually any form of content can be submitted by guests... if permissions allow for it. Therefore, there needs to be a pipe for all content.
I don't really agree with much of the discussion, it's tangential to the issue, more related to #2057 and #2374 and #375 which will be considered separately.
The main issue discussed seems to be how can we do posting-frequency detection for guests, as all combined guest postings go under a single ID. However I think there's no real issue because guests get the CAPTCHA, or we'd generally limit guest posting access (who'd want guests submitting news for example). So we can implement posting-frequency for non-guests only, and still have a whole diverse set of other techniques that do work on guests (CAPTCHA, but also all the other heuristics). We couldn't really track guests anyway, people could use TOR (so have rotating IPs and session IDs).
Duplicate content submission can work on the guest ID with no issue - because different guests are not legitimately going to be posting the same content.
We do need to make sure heuristics do work effectively for contact forms though.
That isn't so necessary really. I've implemented a system where it can query via meta-data provided in the CMA hooks, over a time range for a particular submitter ID. That's simpler and better than trying to do it through reporting, because it works without any reporting needing to happen.
https://www.w3.org/TR/turingtest/
The TLDR is that we now do everything we can that isn't awful in some way, but it's still a good reference.