Study accuses LM Arena of helping top AI labs game its benchmark

A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some industry-leading AI companies like Meta, OpenAI, […]
