Study accuses LM Arena of helping top AI labs game its benchmark

A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some industry-leading AI companies like Meta, OpenAI, […]

  • Related Posts

    One of Africa’s most successful founders is back with a new AI startup and already raised $9M

    In 2023, co-founders Karim Jouini and Jihed Othmani sold their expense management startup Expensya to Swedish procurement software firm Medius in what is widely considered to be one of the…

    Continue reading
    Windsurf says Anthropic is limiting its direct access to Claude AI models

    Windsurf, the popular vibe coding startup that’s reportedly being acquired by OpenAI, said Anthropic significantly reduced its first-party access to its Claude 3.7 Sonnet and Claude 3.5 Sonnet AI models.…

    Continue reading

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    One of Africa’s most successful founders is back with a new AI startup and already raised $9M

    • By staff
    • June 4, 2025
    • 1 views

    Windsurf says Anthropic is limiting its direct access to Claude AI models

    • By staff
    • June 4, 2025
    • 1 views

    Anthropic’s AI is writing its own blog — with human oversight

    • By staff
    • June 3, 2025
    • 2 views

    The OpenAI board drama is reportedly turning into a movie

    • By staff
    • June 3, 2025
    • 2 views