People are benchmarking AI by having it make balls bounce in rotating shapes

  • staffstaff
  • AI
  • January 24, 2025
  • 0 Comments

The list of informal, weird AI benchmarks keeps growing. Over the past few days, some in the AI community on X have become obsessed with a test of how different AI models, particularly so-called reasoning models, handle prompts like this: “Write a Python script for a bouncing yellow ball within a shape. Make the shape […]

© 2024 TechCrunch. All rights reserved. For personal use only.

  • Related Posts

    Anthropic, Google score win by nabbing OpenAI-backed Harvey as a user

    Popular legal AI tool Harvey will now be using leading foundation models from Anthropic and Google, moving beyond strictly using OpenAI’s, Harvey announced in a blog post on Tuesday. This…

    Continue reading
    AWS enters into ‘strategic partnership’ with Saudi Arabia-backed Humain

    Amazon says it will work with Humain, the AI company recently launched by Saudi Arabia’s ruler, Mohammed bin Salman, to invest “$5 billion-plus” in a strategic partnership to build an…

    Continue reading

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Anthropic, Google score win by nabbing OpenAI-backed Harvey as a user

    • By staff
    • May 13, 2025
    • 0 views

    AWS enters into ‘strategic partnership’ with Saudi Arabia-backed Humain

    • By staff
    • May 13, 2025
    • 1 views

    TikTok launches TikTok AI Alive, a new image-to-video tool

    • By staff
    • May 13, 2025
    • 0 views

    Tencent hires WizardLM team, a Microsoft AI group with an odd history

    • By staff
    • May 13, 2025
    • 1 views