#ai-moderation

Artificial intelligence
from The Verge
2 days ago

Grok is the most antisemitic chatbot according to the ADL

Among six leading LLMs, Grok performed worst at identifying and countering antisemitic content; Claude performed best, but all models showed deficiencies.
Artificial intelligence
from InfoQ
1 week ago

DoorDash Applies AI to Safety Across Chat and Calls, Cutting Incidents by 50%

DoorDash deployed SafeChat, an AI-driven layered moderation system that screens text, images, and voice in near real-time to protect Dashers and customers.
US politics
from www.independent.co.uk
1 week ago

Why UK TikTok employees are considering legal action as whistleblowers speak out

TikTok faces legal action after cutting hundreds of UK trust-and-safety jobs days before a union ballot, amid a shift toward automated, AI-driven moderation.
Artificial intelligence
from Fast Company
2 weeks ago

Grok blocked from undressing images in places where it's illegal after global backlash

Grok will be blocked from editing real people's photos into revealing clothing where such edits violate local laws, including for paid subscribers.
Artificial intelligence
from Independent
2 weeks ago

Elon Musk's X restricts Grok photo editing - and 8 other things to know on the 'nudification' controversy

Gardaí are investigating 200 images made by X's AI Grok over sexualised deepfake concerns; Grok will block editing people in revealing clothing where illegal.
Startup companies
from TechCrunch
2 weeks ago

Digg launches its new Reddit rival to the public

Digg relaunched under its founder Kevin Rose and Reddit co-founder Alexis Ohanian, opening an AI-focused public beta to rebuild community features and fight social-platform toxicity.
#deepfakes
Marketing tech
from Thehustle
3 weeks ago

The startup that makes livestreaming safer for advertisers

NexTide Media connects brands to livestreamers and uses LiveGuard, an AI safety platform, to keep ads from appearing beside NSFW or controversial live content.
from www.theguardian.com
1 month ago

Online child sexual abuse surges by 26% in year as police say tech firms must act

Becky Riggs, the acting chief constable of Staffordshire police, called for tech companies to use AI tools to automatically prevent indecent pictures from being uploaded and shared on their sites. Riggs, who is the National Police Chiefs' Council lead for child protection and abuse, said: "I know that these platforms, with the technology that's out there, could prevent these harms from occurring in the first instance."
UK news
#roblox
from GameSpot
2 months ago
Tech industry

Roblox CEO Responds To Child Predator Concerns Poorly

Roblox deployed AI-based facial age estimation to age-gate users and bolster moderation aimed at protecting children, sparking privacy and efficacy concerns.
from GameSpot
4 months ago
Video games

Roblox Will Age-Verify All Players Who Use Voice Chat By End Of Year

Roblox will require verified age checks using facial estimation, ID verification, and parental consent for players who communicate in-game.
#youtube-moderation
Marketing tech
from Harvard Business Review
2 months ago

BrandBastion Mixes AI and Human Judgment to Build Trust at Scale

Brands must balance AI-driven moderation and human judgment to manage online communities, protect reputation, and navigate viral controversies impacting trust and business performance.
#meta
from PCMAG
2 months ago
Social media marketing

Meta Serves Users 15 Billion 'Higher Risk' Ads a Day, Makes Billions

Marketing tech
from AdExchanger
2 months ago

Waah Waah Call The Ad-bulance; Meta's Customer Disservice

The Trade Desk markets itself as conflict-free against walled gardens while Amazon escalates DSP competition and Meta's AI moderation and support frequently fail creators seeking monetization.
Artificial intelligence
from GameSpot
3 months ago

Xbox Primarily Uses AI For Security So Far, Says Phil Spencer

Microsoft primarily uses AI for network security and moderation while leaving creative AI adoption decisions to individual game development teams.
from WIRED
3 months ago

ChatGPT's Horny Era Could Be Its Stickiest Yet

In May of 2024, while I was combing through OpenAI's "Model Spec" laying out how ChatGPT should act, one comment buried in the document struck me as peculiar. It said OpenAI was "exploring" how to let adult ChatGPT users generate content with mature themes such as "erotica, extreme gore, slurs, and unsolicited profanity." Seems like the exploration phase is over.
Artificial intelligence
#instagram
from Theregister
4 months ago

ChatGPT adds parental controls, but teens must agree

OpenAI says it is introducing parental controls to ChatGPT that will help improve the safety of teenagers using its AI chatbot. The new protections are designed to help ChatGPT identify when a teenager chatting with it might be thinking about harming themselves or otherwise be in distress. OpenAI is adding the features after facing criticism and a high-profile lawsuit alleging its chatbot contributed to a teenager's death.
Artificial intelligence
Public health
from Fortune
4 months ago

Facebook, TikTok and even LinkedIn are censoring abortion content even when it's just medical information, rights groups say

Abortion-related informational content and accounts are being removed or suspended on social platforms, often due to over-enforcement and automated moderation, chilling vital information.
Mental health
from www.aljazeera.com
5 months ago

Top men's tennis names shielded from 'severe' abuse by ATP AI tool

AI-powered ATP Safe Sport scanned 3.1 million comments and hid over 162,000 abusive messages targeting 245 players in its first year.
Tech industry
from Hackernoon
2 years ago

The TechBeat: a16z Thinks Controversial Startup Cluely Is the Future of AI (7/8/2025)

An online dating platform significantly reduced AI moderation review time using ChatGPT and custom engineering techniques.
from Hackernoon
2 years ago

The TechBeat: I Built an AI Copilot That Thinks in Exploits, Not Prompts (7/6/2025)

The Sia Foundation partners with HackerNoon to back up its entire publishing archive, enhancing data security and preservation for future access.
Tech industry
#pinterest
from Fast Company
8 months ago
Social media marketing

'My library of Alexandria has been burned down': Pinterest users are fuming over sudden bans

from Mashable
8 months ago
Privacy professionals

Pinterest finally broke its silence on the mass bans, and it's only made users angrier

Pinterest faces backlash over mass account bans with users claiming unjustified locking of accounts.