From Machine Learning

I built AI agents that play Pokemon Showdown autonomously using free LLM APIs via tool-calling [P]

Our take

I developed a system where AI agents play Pokémon Showdown battles autonomously using free LLM APIs through structured tool-calling. Using models like Llama 3, Qwen, and Gemma, the agents analyze the full battle state each turn (type matchups, HP, weather, and field conditions) and decide whether to attack or switch. Anyone can run the system locally at zero inference cost, with 15+ free models supported.

I've built a system where models like Llama 3, Qwen, and Gemma play Pokémon Showdown battles autonomously. Instead of simple prompt-response, they analyze the full battle state every turn (type matchups, HP, weather, field conditions, revealed opponent info) and decide whether to attack or switch using structured tool calls.
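
For anyone wondering what "attack or switch via structured tool calls" can look like, here is a minimal sketch. It is not the repo's actual code: the tool names `use_move` and `switch_pokemon`, the argument names, and the `resolve_action` helper are all hypothetical. The schemas use the common OpenAI-style function-calling format, and the helper checks the model's call against the options that are legal this turn.

```python
import json

# Hypothetical tool schemas (OpenAI-style function calling). Names are assumptions.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "use_move",
            "description": "Attack with one of the active Pokémon's available moves.",
            "parameters": {
                "type": "object",
                "properties": {"move": {"type": "string"}},
                "required": ["move"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "switch_pokemon",
            "description": "Switch to a benched Pokémon that has not fainted.",
            "parameters": {
                "type": "object",
                "properties": {"pokemon": {"type": "string"}},
                "required": ["pokemon"],
            },
        },
    },
]


def resolve_action(tool_name, arguments_json, legal_moves, legal_switches):
    """Validate a tool call against this turn's legal options.

    Returns ("move", name) or ("switch", name). If the call is malformed or
    illegal, falls back to the first legal move, then the first legal switch.
    """
    try:
        args = json.loads(arguments_json)
    except json.JSONDecodeError:
        args = {}
    if not isinstance(args, dict):
        args = {}

    if tool_name == "use_move" and args.get("move") in legal_moves:
        return ("move", args["move"])
    if tool_name == "switch_pokemon" and args.get("pokemon") in legal_switches:
        return ("switch", args["pokemon"])

    # Fallback so a bad tool call never stalls the battle.
    if legal_moves:
        return ("move", legal_moves[0])
    if legal_switches:
        return ("switch", legal_switches[0])
    raise ValueError("no legal actions available this turn")
```

Validating against the legal options matters because smaller models sometimes name a move the active Pokémon does not have; a deterministic fallback keeps the game moving, and the bad call still shows up in the trace.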

The cool part: I routed everything through LiteLLM and exclusively used models with free API tiers (Groq, Cerebras, OpenRouter, Google AI Studio). So anyone can run this locally with zero inference cost.
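
A rough sketch of the routing idea, under stated assumptions: the model IDs below are illustrative guesses in LiteLLM's `provider/model` naming style and may not match the repo or the providers' current free tiers, and `available_models` and `ask_model` are hypothetical helpers. The only library call used is `litellm.completion`, imported lazily so the file loads even without LiteLLM installed.

```python
import os

# Hypothetical candidates: (LiteLLM model string, API key env var).
# Model IDs are assumptions; check each provider's current catalog.
CANDIDATES = [
    ("groq/llama-3.1-8b-instant", "GROQ_API_KEY"),
    ("cerebras/llama3.1-8b", "CEREBRAS_API_KEY"),
    ("gemini/gemini-1.5-flash", "GEMINI_API_KEY"),
]


def available_models(env=None):
    """Return the candidate models whose API key is present in the environment."""
    env = os.environ if env is None else env
    return [model for model, key_var in CANDIDATES if env.get(key_var)]


def ask_model(model, messages, tools):
    """Send one turn to a provider through LiteLLM and return its tool calls."""
    import litellm  # lazy import: only needed when a request is actually made

    response = litellm.completion(
        model=model,
        messages=messages,
        tools=tools,
        tool_choice="auto",
    )
    # May be None if the model answered in plain text instead of calling a tool.
    return response.choices[0].message.tool_calls
```

Because every provider sits behind the same call, swapping the model in an AI vs. AI match comes down to changing one string.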

Features:

- Human vs. AI (play against the bot)

- AI vs. AI (pit two models against each other)

- 15+ free models supported out of the box

- Full observability via Langfuse (see the exact tool calls and reasoning for each turn)

https://i.redd.it/lzx2fd2s0eyg1.gif

▶️ Watch the full video demo with audio on YouTube: https://youtu.be/8ZNadmh-Sy8

GitHub Repo: https://github.com/MohamedMostafa259/pokemon-ai-agent

Would love feedback on the architecture or ideas for improving their reasoning during complex board states!

submitted by /u/ReplacementMoney2484