Challenges of agent execution and human approval

From: aidotengineer

When deploying AI agents into a production setting, several significant challenges arise, particularly concerning human interaction and the reliability of long-running processes [00:00:06]. A new mechanism called agent continuations has been developed by Snaplogic to address these issues [00:00:24] [00:00:44].

Challenges in Agent Execution

Human in the Loop

A primary concern in agent processing is determining the human’s role [00:01:18]. For users to be comfortable with agent automation, human oversight is required for key aspects of agent execution [00:01:23]. This often involves the agent reaching a point where a designated tool or task (e.g., transferring a large sum of money or deleting an account) is high-value or high-risk [00:01:46]. At such junctures, a mechanism is needed to ensure the human can provide the final determination or decision [00:01:58].

Long-Running Agents and Failure Tolerance

Many agents are designed to be long-running, involving numerous steps in their processing [00:02:12]. The longer a process runs, the greater the chance of a failure, such as a network or hardware issue [00:02:17] [00:05:28]. To prevent the loss of significant work, a method is needed to checkpoint the agent’s state, enabling resumption from a point other than the beginning [00:02:24] [00:05:42].

Dynamic LLM APIs and Rate Limiting

The landscape of Large Language Models (LLMs) and their APIs is rapidly changing, leading to dynamic usage patterns [00:04:47]. New models are frequently released and heavily used, which can lead to issues like rate limiting [00:04:55]. Mechanisms are required to handle these situations [00:05:09].

Multi-Level and Nested Agents

Agents are becoming increasingly sophisticated, often involving a main orchestrator agent with several sub-agents, which can themselves have sub-agents (nested configurations) [00:06:00] [00:06:17] [00:06:21]. Addressing human approval and state saving in such complex, multi-level agent architectures presents additional challenges [00:06:30].

Agent Loop Persistence

The code driving an agent’s loop must run on a physical machine, whether in the cloud or on a desktop [00:06:56]. Most current agent frameworks require this agent loop to run continuously, even when waiting for external responses like human input from Slack [00:07:27]. This continuous persistence is a challenge, as it ties up resources [00:07:42].

Agent Continuations: A Solution

Agent continuations are a new mechanism inspired by the programming language theory concept of continuations [00:08:11]. In programming, a continuation allows stopping program execution at any point, bundling its state, and resuming it later [00:08:23]. This is akin to taking a snapshot of the program’s execution [00:08:46].

Agent continuations apply this concept to AI agents, allowing the pausing and saving of an agent’s state at any point during its execution, including across multiple tool calls, LLM calls, and even calls to sub-agents [00:09:18]. This saved state can then be returned to the application layer for purposes such as processing human approval or persisting the agent’s state for later resumption [00:09:38].

How Agent Continuations Work

The feasibility of agent continuations builds on how LLMs in agents already work: by maintaining a “messages array” [00:10:04]. This array acts as a log of all interactions and is replayed to the LLM for its next inference, effectively saving much of the agent’s state [00:10:09].

When an agent needs to suspend (e.g., for human approval or another predefined condition), a “continuation object” is created [00:15:12] [00:15:19]. This object embeds the standard messages array along with additional metadata [00:15:24]. This metadata provides enough information to resume the agent’s execution [00:15:28]. The continuation object is then sent back to the top-level application layer, which can inspect its metadata to understand the reason for suspension and provide updates, such as human approval [00:15:39].

Once the application layer updates the continuation object, it can be sent back to the agent [00:16:26]. The agent framework recognizes it as a continuation and contains the logic to reconstruct the agent’s state, allowing it to continue execution from the suspension point [00:16:31].

Key Advantage: Decoupled Execution

A crucial aspect of agent continuations is that after the agent is suspended and the continuation object is created, there is no need to keep any of the agent loops running in the system [00:16:48]. The loops can be shut down because all necessary information has been captured to restart everything exactly where it left off [00:17:02]. This capability addresses the challenge of agent loop persistence and resource consumption.

Support for Nested Agents

The continuation object format is recursive and can handle arbitrary layers of agent nesting [00:18:24] [00:18:29]. This means complex, multi-level agent structures can be suspended and resumed, with sub-agent states correctly captured within the overarching continuation [00:18:38] [00:21:03].

Agent continuations are considered novel because they combine both a human approval mechanism and the ability to handle arbitrary nesting depths for complex agents [00:25:36]. They represent a significant step in improving the reliability and manageability of agents in production environments [00:26:34].

Tubegraph

Explorer

Table of Contents