From arstechnica.com
New study accuses LM Arena of gaming its popular AI benchmark
2 2
The popular AI vibe test may not be as fair as it seems.
#ai #meta #google #openai #lmarena #artificialintelligence
7h ago