Pro@programming.dev to AI - Artificial intelligence@programming.dev · English · 4 months ago

ClockBench: Even the best AI models can't reliably read the clock

clockbench.ai

ClockBench AI Benchmark
ClockBench evaluates whether models can read analog clocks, a task that is trivial for humans but that current frontier models struggle with.

cross-posted from: https://programming.dev/post/37407786

