These won't be simple algorithms - most likely NNs.<nods>
A hybrid model is best. Reddit uses a downvote to oblivion model but also requires human moderators. Simply having an algorithm quarantine/remove a post from prominence is easy to disregard. But in my years volunteering as a moderator here, I have learned a lot about how people respond to a human being approaching them about their behavior or posting style. Hard methods (silently banning) are rarely effective long-term. Explaining why certain behaviors are disruptive to the flow of dialogue and the culture at TMC is often, but not always, much more effective.
I think using algorithms to reduce the prominence of certain types of content is a good idea that will lessen the requirement of a moderation team. There are some types of content that might be "liked" heavily, though, and that might encourage violence or doxxing of individuals, etc. That kind of thing is best reviewed by a group of human moderators, at least for now. Once FSD is complete, we can talk about the job of having AI moderate without human intervention.
This seems to be the kind of thing that can be easily gamed. Not disliked? Everything is disliked online, you'd never see a single tweet! What about if they choose to say "not disliked by anyone you follow"? That seems on the surface to help, but in my opinion it only strengthens and further isolates our silos. Reddit suffers from this as well, as was pointed out upthread. If you're in /r/RealTesla on reddit, you post something positive about Elon and get downvoted into oblivion. If you're on /r/Tesla on reddit, you post something negative about Elon and get downvoted into oblivion. People just subscribe to what confirms their biases and makes them feel stronger in their opinions until they become calcified. I think it's better to have more open discussions, but that will require human intervention.
Process could be something like:
- Does NN assume this tweet is safe based upon the author and content? - If so, go step 6)
- Allow followers of the individual only to read then
- If number of up votes exceed downvotes by x then
- Allow Y% of no holds barred users to read then
- If number of up votes exceed downvotes by z then (NN can look at whether the votes come from users with a good track record on previous voting versus moderation)
- Release to all users based upon it's popularity versus other tweets available at the time and in accordance to the user's preferences regarding similar tweets (all NN based decision making)
Last edited: