OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

  • staff
  • AI
  • April 20, 2025

A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled o3 in December, the company claimed the model could answer just over a fourth of questions on FrontierMath, a challenging set of math problems. That score blew the […]


