#human-values

[ follow ]
fromTechCrunch
3 weeks ago

Former Intel CEO launches a benchmark to measure AI alignment | TechCrunch

Gelsinger aims to ensure AI models support humanity through the Flourishing AI benchmark.
fromWIRED
2 months ago

Why Anthropic's New AI Model Sometimes Tries to 'Snitch'

The hypothetical scenarios the researchers presented Opus 4 with that elicited the whistleblowing behavior involved many human lives at stake and absolutely unambiguous wrongdoing.
Artificial intelligence
[ Load more ]