May 27, 2026•1 min read•from Machine Learning

[R] What 1000+ Harness Experiments Taught Me About Self-Improving Agents [R]

Our take

In my recent exploration of self-improving AI agents, I investigated whether an AI could enhance a harness for terminal bench tasks. While it can propose meaningful one-time changes, continuous self-improvement reveals itself as more of a systems challenge. An effective approach requires a mechanism to determine which enhancements can safely compound over time. This article details my journey, including both successes and setbacks in establishing a self-improvement loop. For further insights, check out "Sarang Kulkarni on Lessons from Building Deep Research Agents in Production."

In a recent exploration of AI capabilities, a detailed analysis titled "[R] What 1000+ Harness Experiments Taught Me About Self-Improving Agents" sheds light on the intricacies of building AI systems that can self-improve. The author reflects on their journey to determine whether an AI agent could autonomously enhance a harness to efficiently tackle terminal bench tasks. While the experiment yielded insights into one-time changes, it revealed that continuous self-improvement presents a more complex challenge, primarily due to the need for robust systems to evaluate which improvements can safely compound. This parallels issues faced in coding-agent customization, as highlighted in other discussions such as "I Built a Deck With AI, Then Made a Second AI Attack It." and "Sarang Kulkarni on Lessons from Building Deep Research Agents in Production."

This inquiry into self-improvement mechanisms is significant for several reasons. Firstly, the insights derived from the author's attempts to create a self-improving system underscore the inherent complexities of AI development. Continuous improvement requires not only a clear definition of what constitutes an improvement but also a mechanism for assessing the safety and efficacy of these enhancements. This is a crucial consideration in the realm of machine learning and AI, where the stakes are high, and missteps can lead to cascading failures. The distinction between one-time adjustments and ongoing enhancements also highlights a fundamental aspect of AI development: the need for careful experimentation and iteration.

Moreover, the author’s findings resonate with broader themes in AI and machine learning. As organizations increasingly adopt AI technologies, understanding how to build systems that learn and adapt over time becomes vital. The challenges faced in creating self-improving agents can inform best practices across various sectors, from software engineering to operational automation. This highlights the importance of fostering environments where innovation is accompanied by rigorous testing and thoughtful evaluation, much like the approaches discussed in articles about the efficiency of Vision Transformers and their computational waste.

The implications of these findings extend beyond technical considerations; they also touch on ethical and practical dimensions of AI. As we encourage machines to adapt and improve autonomously, we must remain vigilant about the potential consequences. Ensuring that improvements are not just beneficial but also safe requires a balance of ambition and caution. This is especially pertinent in contexts where AI systems interact with human users or make decisions that impact lives and livelihoods.

Looking ahead, the exploration of self-improvement in AI raises essential questions: What frameworks can we develop to ensure that self-improving systems operate within safe and beneficial parameters? How can we design systems that not only learn from their experiences but do so in a way that aligns with human values and goals? As researchers and developers continue to push the boundaries of what AI can achieve, the importance of establishing robust, ethical guidelines cannot be overstated. The ongoing experiments and findings in this field promise to shape the future of AI, making it a critical area for continued observation and exploration.

I recently wanted to see whether an AI agent could self-improve a harness to solve terminal bench tasks. It’s possible for an AI agent to propose a meaningful one-time change to the harness, but after experimenting with this for a couple of weeks, I think the continuous self-improvement is mostly an experiment-systems problem. The system needs a way to decide what kind of improvements can safely compound.

Turns out there's a lot of parallels to coding-agent customization (e.g. SKILLS.md etc..) too.

I wrote my experience of building such system here, including the successful and failure attempts during the process, and how I approached the self-improvement loop. It's not intended as a benchmark claim but more of a systems/research writeup.

https://www.henrypan.com/blog/2026-05-25-self-improvement-harness/

submitted by /u/Megadragon9
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article →

Continual Harness: Online Adaptation for Self-Improving Foundation Agents [R]https://preview.redd.it/p9cd2zmfy01h1.png?width=2000&format=png&auto=webp&s=a8e99bac438c2505d97ed3716983aa731da855f8 Sharing a new paper from the GPP and PokeAgent teams. Gemini Plays Pokémon (GPP) was the first AI system to complete Pokémon Blue, Yellow Legacy on hard mode, and Crystal without losing a battle. How? Early signs of iterative harness development. In the Blue era a human watched the stream and edited the harness. By Yellow Legacy and Crystal, the model itself was performing most of the editing through general meta-tools (define_agent, run_code, notepad edits). Our new paper, Continual Harness: Online Adaptation for Self-Improving Foundation Agents, formalizes the loop and automates the refining role end to end. We then carry the same loop into training, enabling model-harness co-learning. The takeaways: 1. Iterative harness refinement closes most of the gap to a hand-engineered version. 2. Long-horizon agency requires self-refinement, and self-refinement requires a useful model. 3. The future of agents is model-harness co-learning. Paper (arXiv). https://arxiv.org/abs/2605.09998 Article (Substack). https://sethkarten.substack.com/p/gemini-plays-pokemon-discovered-something Project page (video demos). https://sethkarten.ai/continual-harness submitted by /u/PokeAgentChallenge [link] [comments]

[R] What 1000+ Harness Experiments Taught Me About Self-Improving Agents [R]

Related Articles