Last month,Down Town the $61.5 billion-valuated AI startup Anthropic set up a gaming livestream on Twitch. Gaming livestreams are nothing new on Twitch, but this one is a little different: Claude, Anthropic's AI model, is attempting to beat Pokémon Red.
We are now one month in,and the livestream is still going. However, Claude has not progressedall that much. And, at this rate, Anthropic's AI agent may possibly never be the very best, like no one ever was.
According to Anthropic, when it first launched the "Claude Plays Pokémon" project, previous versions of its AI agent Claude failed at some very basic tasks. For example, according to Anthropic, Claude 3.5 would try to run away from almost every battle in June 2024.
A few months and a few versions of Claude later, Anthropic said there was a stark change. In February 2025, Anthropic gave Claude 3.7 Sonnet a whirl at playing Pokémon.
"Within hours, Claude defeated Brock. Days later, it trounced Misty," Anthropic said. "Progress that older models had little hope of achieving."
Anthropic said that Claude 3.7 Sonnet could plan ahead, remember objectives, and learn from its mistakes, unlike previous versions of the AI agent. It also built a knowledge base, saw the screen, and simulated button presses.
However, the progress Claude 3.7 Sonnet originally made in the game seems to have stalled.
For example, livestream viewers watchedas Clause 3.7 took 78 hoursto get through Mt. Moon in the game. On Reddit, gamers estimatedthat it would typically take a child just a few hours to advance through the same stage.
SEE ALSO: Hands-on with the Claude AI app: It's pleasant to use, but jankyClaude can be seen going in circles, stumbling around the same paths, and often knocking into walls as it tries to get around the game.
The livestream is engaging, especially as a text box lays out Claude's "thinking" as the AI agent tries to figure out what moves to make next.
According to Anthropic engineers in an interview with Ars Technica, Claude has an easier time with aspects of the game which involve text, such as Pokémon battles. However, it struggles with the more visual aspects of the game, such as moving around from town to town on the map.
Claude 3.7 Sonnet has gone much further in the game than previous Claude models, so there's been progress. However, for those warning that AI will soon be able to take over the world, we're nowhere close to that being a reality yet. Claude still has 151 Pokémon to catch.
Topics Artificial Intelligence Gaming Pokemon Twitch Streaming
Bye literally everyone: 11 best tweets from Twitter's worst weekStaff Picks: Raymond Pettibon, Jaume Plensa, Carlos FonsecaGoogle launches new tools to find and track shopping dealsWhen “Macaroni” Meant “Sodomy”Fake news tweets take off as Twitter blue checks go up for saleRevisited: Mystery and Melancholy of a StreetStaff Picks: Anthony Heilbut, Caryl Churchill, Carl PhillipsGoogle Pixel Magic Editor won't let you edit IDs, faces, other photosBilly Joel’s “Miami 2017” Is Even More Depressing Than We ThoughtNFT partygoers blame Bored Ape Yacht Club event for loss of visionGot 20 Million Bucks? Move to Grey Gardens, Why Don’t YouRyan Reynolds just joined Tumblr. Did Elon Musk's Twitter have anything to do with it?Wandering the Westminster Dog ShowReal Polaroids, Fake People: Duane Hanson’s Photos of His Lifelike SculpturesStephen King teases extract from upcoming 'Cujo' sequelGoogle Pixel Magic Editor won't let you edit IDs, faces, other photosFake news tweets take off as Twitter blue checks go up for saleRevisited: Oulipian Language GamesWhen “Macaroni” Meant “Sodomy”'Quordle' today: See each 'Quordle' answer and hints for November 8, 2023 I tried 3 TikTok Unconventional, Part 4: William S. Burroughs in Chicago Bastille Day Sale Prints by Peter Howson The Rise of the Spoiler Alert Unconventional, Part 6: Ed Sanders and the Police Nathaniel Mackey & Cathy Park Hong with NYC High Crying CEO goes viral on LinkedIn for being out of touch OpenAI is working on a tool to detect DALL Amazon promises speedy drone deliveries in the UK People Once Dared to Imagine a World Without Billboards Why I Got Really, Really Into Garth Brooks As a Kid #ReadEverywhere, Even in the Ring Women's health app Flo launches feature for partners #ReadEverywhere, Even When You’re Down and Out Twitter / X: Elon Musk considers pulling out of Europe to escape EU law The World of ‘Garfield’ Parodies Runs Deeper Than You’d Dreamed Best video doorbell deal: Save 33% on the wireless Google Nest Doorbell at Amazon 6 ways to make hiring more accessible Dr. Seuss’s Midnight Paintings
2.5189s , 8223.7734375 kb
Copyright © 2025 Powered by 【Down Town】,Warmth Information Network