#lm-arena
#lm-arena

[ follow ]

New study accuses LM Arena of gaming its popular AI benchmark

LM Arena's ranking may favor large companies due to unfair testing practices, raising concerns about its reliability in assessing AI chatbots.

The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick.

Artificial intelligence

[ Load more ]