This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.
Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.
Rules:
1: All Lemmy rules apply
2: Do not post low effort posts
3: NEVER post naziped*gore stuff
4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.
5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)
6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist
7: crypto related posts, unless essential, are disallowed
If OpenAI was slightly less dishonest when selling its product, it would say instead “don’t use those AI tools for direct moderation, use them instead to report potentially rule-breaking content so human mods can review it”. For at least four reasons:
Bad advice. Look at K3 and what the bot says about it:
Following the advice would be to try to fix what is not broken. Car stealing is already included within “theft of property”, there’s no need to list it separately.
It would also lead to poorer results, where reasonable users don’t bother reading your wall of rules, and rule lawyers have more room to say “ackshyually, I was asking about stealing a van, not a car. The rules say nothing about vans lol lmao haha”.
Toxicity on itself is poor grounds for moderation actions.
Well said 👏
I bookmarked your reply to come back to it whenever this discussion comes up for me!
We need to use AI to root out disinformation. Whoever figures that out gets a gold star.
Someone always decides what’s disinformation and it’s different depending if you ask USA or China. It’s even different if you ask me or you.
That’s the problem and no ai can solve that…
There is very little information that is 100% guaranteed to be truthful. Science comes close but there is so much other information.
As we say, Not with that attitude
It is a tough problem and I promise infinite wealth and 69 virgins to whoever gets it going
Russian and Chinese trolls certainly don’t want to see it happen
Its kinda a fitting example because China often tends to see something from USA as disinformation. Having two sides telling differen disinformation. This only shows that there exists two sides, but China is just trolling and trying to achieve many things with gaslighting.
I believe that you could create an ultimate ethics AI that tries to identify trust and trolls. But I hope it doesn’t end at “tries” and be easily manipulated.
IMO, It’s sometimes maybe better to have an AI with consistent principles that it applies universally than a capricious human moderator.
Who trains ChatGPT biases? Humans.
AI will follow the rules blindly without much bias…
As long as the biases are explicit and consistent it’s still an improvement IMO.
This is the best summary I could come up with:
OpenAI claims that it’s developed a way to use GPT-4, its flagship generative AI model, for content moderation — lightening the burden on human teams.
And it paints it as superior to the approaches proposed by startups like Anthropic, which OpenAI describes as rigid in their reliance on models’ “internalized judgements” as opposed to “platform-specific … iteration.”
Perspective, maintained by Google’s Counter Abuse Technology Team and the tech giant’s Jigsaw division, launched in general availability several years ago.
Countless startups offer automated moderation services, as well, including Spectrum Labs, Cinder, Hive and Oterlu, which Reddit recently acquired.
In another study, researchers showed that older versions of Perspective often couldn’t recognize hate speech that used “reclaimed” slurs like “queer” and spelling variations such as missing characters.
Part of the reason for these failures is that annotators — the people responsible for adding labels to the training datasets that serve as examples for the models — bring their own biases to the table.
I’m a bot and I’m open source!