4 min readfrom VentureBeat

Anthropic wants to own your agent's memory, evals, and orchestration — and that should make enterprises nervous

Our take

Anthropic's recent updates to Claude Managed Agents, including 'Dreaming,' 'Outcomes,' and 'Multi-Agent Orchestration,' signal a significant shift in AI agent management. By integrating memory, evaluation, and orchestration into a single platform, Anthropic poses a challenge to the fragmented systems many enterprises currently rely on. This comprehensive approach could simplify deployment but raises concerns about vendor lock-in and data compliance. As organizations navigate their AI strategies, the decision to adopt such an all-in-one solution may redefine their operational flexibility and future capabilities.
Anthropic wants to own your agent's memory, evals, and orchestration — and that should make enterprises nervous

Anthropic's recent updates to its Claude Managed Agents platform, particularly the introduction of capabilities like Dreaming, Outcomes, and Multi-Agent Orchestration, signal a significant shift in the landscape of AI agent deployment. By consolidating memory, evaluation, and orchestration into a single runtime, Anthropic is not only streamlining processes but also challenging the fragmented systems many enterprises currently rely on. As discussed in our previous piece, Anthropic’s Claude Managed Agents gives enterprises a new one-stop shop but raises vendor 'lock-in' risk, this consolidation may bring about a host of challenges, particularly concerning vendor lock-in and compliance issues.

For enterprises, the implications of adopting a more integrated platform like Claude Managed Agents are profound. On one hand, the promise of a unified system that manages state, execution graphs, and routing can simplify operations and reduce overhead. The allure of a system that learns from its own mistakes—through Dreaming—promises to enhance the capabilities of agents, enabling them to handle complex tasks with less manual intervention. However, this innovation may also force organizations to reconsider their existing modular architectures, which often provide the flexibility to adapt to specific needs and compliance demands. The risk of becoming overly dependent on a single vendor's ecosystem could lead to a lack of agility that many businesses can ill afford in today's fast-paced technological landscape.

Moreover, the introduction of Outcomes as a built-in evaluation mechanism shifts the focus from external quality assessments to integrating evaluations within the orchestration layer. This could streamline the feedback loop for agents, allowing for quicker iterations and improvements. However, it raises questions about the robustness and reliability of self-contained evaluations, especially when compared to the multi-faceted assessments that human oversight provides. As noted in our article, Anthropic Introduces Managed Agents to Simplify AI Agent Deployment, enterprises need to carefully weigh the benefits of this integration against the potential drawbacks of diminished oversight.

As organizations grapple with these developments, the decision of whether to adopt Claude Managed Agents becomes a pivotal one. For those in the early stages of AI adoption, the streamlined approach may offer a more accessible entry point into agent deployment. Yet for enterprises already entrenched in AI transformations, the transition could be fraught with complications, necessitating a thorough evaluation of their current processes and the implications of shifting to a more centralized system. The pressure to innovate and adapt is palpable, especially as competitors may begin to align their strategies with Anthropic’s model, potentially leading to a broader industry trend towards integrated platforms.

Looking ahead, the question remains: how will enterprises navigate this evolving landscape of AI agent management? As more companies consider the ramifications of adopting a comprehensive platform like Claude Managed Agents, they will need to balance the desire for simplicity and efficiency with the critical need for flexibility and control. Ultimately, the future of AI deployment hinges on an organization’s ability to adapt its strategies in response to these innovations, ensuring that they remain competitive without sacrificing the unique needs of their operations.

Just a few weeks after announcing Claude Managed Agents, Anthropic has updated the platform with three new capabilities that collapse infrastructure layers like memory, evaluation, and multi-agent orchestration, into a single runtime.

This move could threaten the standalone tools that many enterprises cobble together.

The new capabilities — 'Dreaming,' 'Outcomes,' and 'Multi-Agent Orchestration' — aim to make agents inside Claude Managed Agents “more capable at handling complex tasks with minimal steering,” Anthropic said in a press release.  

Dreaming deals with memory, where agents “reflect” on their many sessions and curate memories so they learns and surface unknown patterns. Outcomes allows teams to define and set specific rubrics to measure an agent's success, while Multi-Agent Orchestration breaks jobs down so a lead agent can delegate to other agents.

Claude Managed Agents ideally provides enterprises with a simpler path to deploy agents and embeds orchestration logic in the model layer. It’s an end-to-end platform to manage state, execution graphs, and routing. With the addition of Dreaming, Outcomes and Multi-agent Orchestration, Claude Managed Agents expands capabilities even further and directly competes with tools like LangGraph or CrewAI, as well as external evaluation frameworks, RAG memory architectures, and QA loops.

An integration threat

Enterprises must now ask: Should we ditch our flexible, modular system in favor of an agent platform that brings almost everything in-house?

Anthropic designed Claude Managed Agents to share context, state, and traceability in one place. This means the platform sees every decision agents make, rather than enterprises having to wire separate systems together. It sounds practical to have one platform that does everything. But not all enterprises want a full-service system. 

Claude Managed Agents already faces criticism that it encourages vendor lock-in because it owns most of the architecture and tools that govern agents. In the current paradigm, an organization may run Managed Agents but keep multi-agent orchestration, memory, or evaluations in a separate space ensures flexibility. 

The platform offers a fully-hosted runtime, which means memory and orchestration run on infrastructure the enterprise does not own. This can become a compliance nightmare for some organizations that have to prove data residency. 

Another problem to consider is that enterprises already in the middle of large-scale AI transformations must cobble together workarounds to deal with the constraints of their tech stack. Not every workflow is easily replaceable by switching to Claude Managed Agents. 

Dreaming and outcomes against current tools

Most enterprises have a fragmented approach to AI deployment.

For example, they may use LangGraph or Crew AI for agent routing and workflow management, Pinecone as a vector database for long-term memory, DeepEval for external evaluation, and a human-in-the-loop quality assurance to review some tasks. Anthropic hopes to do away with all of that. 

With Dreaming, Anthropic approaches memory by allowing users to actively rewrite it between sessions, so the agent essentially learns from its mistakes. Anthropic says this capability is useful for long-running states and orchestration. Current systems often handle memory persistence by storing embeddings, retrieving relevant context, and adding more state over time. 

Outcomes addresses the evaluation portion by detailing expectations for agents. Instead of external quality checks, which are often done by a team of humans, Anthropic is bringing evaluation into the orchestration layer rather than above it. 

But it’s the Multi-Agent Orchestration capability that pits Claude Managed Agents against orchestration frameworks from Microsoft, LangChain, CrewAI, and others. Model providers like Anthropic and OpenAI have already begun pushing aggressively into this space, arguing that bringing this to the model layer gives teams better control.

Big decisions to make

Enterprises face a big decision, and this one could depend on where they are in agent maturity. 

If an organization is still experimenting with agents and has not deployed many in production, they may find moving to Claude Managed Agents and configuring Dreaming and Outcomes to their needs much easier. This is the stage of development where, even if enterprises are using a third-party orchestrator like LangChain, they’re still customizing it. 

But for those who are already further along in the process, the calculation becomes trickier. It’s now a matter of parallel evaluation and better understanding of their processes. 

Businesses, though, will face the same decision even if they don’t intend to use Claude Managed Agents. Anthropic has signaled that other model and platform providers will likely shift their product roadmaps to a similar model that keeps everything locked in the same system — because models may become interchangeable, but the tooling and orchestration infrastructure will not. 

Read on the original site

Open the publisher's page for the full experience

View original article

Related Articles

Tagged with

#financial modeling with spreadsheets#generative AI for data analysis#Excel alternatives for data analysis#natural language processing for spreadsheets#self-service analytics tools#data visualization tools#data analysis tools#big data management in spreadsheets#business intelligence tools#collaborative spreadsheet tools#AutoML capabilities#enterprise data management#big data performance#real-time data collaboration#workflow automation#enterprise-level spreadsheet solutions#conversational data analysis#large dataset processing#rows.com#intelligent data visualization