Lifecoach5000@lemmy.world to Technology@lemmy.worldEnglish · 1 month agoChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logicwww.tomshardware.comexternal-linkmessage-square206fedilinkarrow-up1905arrow-down180
arrow-up1825arrow-down1external-linkChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logicwww.tomshardware.comLifecoach5000@lemmy.world to Technology@lemmy.worldEnglish · 1 month agomessage-square206fedilink
minus-squareIsaamoonKHGDT_6143@lemmy.ziplinkfedilinkEnglisharrow-up7arrow-down7·1 month agoThey used ChatGPT 4o, instead of using o1 or o3. Obviously it was going to fail.
minus-squarewizardbeard@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1arrow-down4·edit-21 month agoOther studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models. Edit: When comparing reasoning models to existing algorithmic solutions.
They used ChatGPT 4o, instead of using o1 or o3.
Obviously it was going to fail.
Other studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models.
Edit: When comparing reasoning models to existing algorithmic solutions.