2 min readfrom Machine Learning

Signals: finding the most informative agent traces without LLM judges [R]

Our take

Introducing "Signals," a groundbreaking research initiative from Katanemo Labs, part of DigitalOcean. Designed for those building agentic systems, Signals offers a lightweight method to compute structured signals from live agent interactions. This approach allows users to identify the most informative trajectories without overwhelming manual review or costly LLM calls. Our study reveals that signal-based sampling achieves an impressive 82% informativeness rate, significantly enhancing efficiency.
Signals: finding the most informative agent traces without LLM judges [R]
Signals: finding the most informative agent traces without LLM judges [R]

Hello Peeps Salman, Shuguang and Adil here from Katanemo Labs (a DigitalOcean company).

Wanted to introduce our latest research on agentic systems called Signals. If you've been building agents, you've probably noticed that there are far too many agent traces/trajectories to review one by one, and using humans or extra LLM calls to inspect all of them gets expensive really fast. The paper proposes a lightweight way to compute structured “signals” from live agent interactions so you can surface the trajectories most worth looking at, without changing the agent’s online behavior. Computing Signals doesn't require a GPU.

Signals are grouped into a simple taxonomy across interaction, execution, and environment patterns, including things like misalignment, stagnation, disengagement, failure, looping, and exhaustion. In an annotation study on τ-bench, signal-based sampling reached an 82% informativeness rate versus 54% for random sampling, which translated to a 1.52x efficiency gain per informative trajectory.

Paper: arXiv 2604.00356. https://arxiv.org/abs/2604.00356
Project where Signals are already implemented: https://github.com/katanemo/plano

Happy to answer questions on the taxonomy, implementation details, or where this breaks down.

submitted by /u/AdditionalWeb107
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article

Tagged with

#rows.com#natural language processing for spreadsheets#generative AI for data analysis#cloud-based spreadsheet applications#Excel alternatives for data analysis#signals#agentic systems#informative trajectories#agent traces#trajectories#lightweight computation#taxonomy#signal-based sampling#interaction patterns#execution patterns#environment patterns#efficiency gain#misalignment#stagnation#disengagement