AIシステムが誤りから回復し最適解を見つける新手法を開発（Helping AI Systems Recover from Mistakes and Find Optimal Solutions）

2026-01-13 カリフォルニア工科大学（Caltech）

カリフォルニア工科大学（Caltech）とMIT、Asari AIの研究チームは、AIエージェントが誤りを犯しやすい「複雑な推論ステップ」を改善するための新フレームワーク EnCompass を開発した。大型言語モデル（LLM）などを使うAIエージェントは、多段階タスク中に小さな誤りが蓄積し最終的に失敗することが多いが、従来の手法ではこの問題に対処するために複雑なコードを繰り返し書き直す必要があった。EnCompassでは、コード内の分岐点を「branchpoints」としてマークし、各経路の評価「scores」を付与することで、異なる探索戦略を柔軟に試行・比較できる。これにより、コードの可読性を損なうことなく、エラー発生時に素早くバックトラックや最適解探索が可能になり、実装行数を大幅に削減しつつ精度を向上させることができる。研究成果はNeurIPS 2025で発表された。

The EnCompass framework compiles an AI agent program and its workflow into a branching search space object to identify the best pathway.Credit: Li et al. (2025) 39th Conference on Neural Information Processing Systems

＜関連情報＞

EnCompass: プログラム実行パスの検索によるエージェントプログラミングの強化 EnCompass: Enhancing Agent Programming with Search Over Program Execution Paths

Zhening Li, Armando Solar-Lezama, Yisong Yue, Stephan Zheng
Open Review Published: 19 Sept 2025

Abstract

We introduce a new approach to agent programming, the development of LLM-based agents. Current approaches to agent programming often entangle two aspects of agent design: the core workflow logic and the inference-time strategy (e.g., tree search). We introduce probabilistic angelic nondeterminism (PAN), a programming model that disentangles these two concerns, allowing the programmer to describe the agent workflow and independently experiment with different inference-time strategies by simply changing a few inputs. We provide an implementation of PAN in Python as the EnCompass framework, which uses a Python decorator to compile agent workflow programs into a search space. We present three case studies that demonstrate how the framework lets the programmer quickly improve the reliability of an agent and easily switch between different inference-time strategies, all with little additional coding.

月	火	水	木	金	土	日
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31