人間の思考様式を模したAIモデルを開発（EPFL Researchers Create an AI Model That Thinks Like We Do）

2026-06-26 スイス連邦工科大学ローザンヌ校（EPFL）

スイスのスイス連邦工科大学ローザンヌ校（EPFL）の研究グループは、人間の脳の機能分化を模倣した大規模言語モデル（LLM）MiCRo（Mixture of Cognitive Reasoners）を開発した。従来のLLMは内部処理が「ブラックボックス」で理解しにくいという課題があったが、MiCRoは「言語」「論理」「社会的推論」「世界知識」の4つの専門モジュール（エキスパート）で構成され、それぞれが脳の異なる領域に対応するよう設計されている。質問内容に応じて適切なモジュールを組み合わせて推論するため、どのような知識や推論が用いられたかを可視化しやすく、利用者がモデルの振る舞いを制御しやすい点が特徴である。研究チームは、人間の認知機能を取り入れたアーキテクチャにより、性能向上だけでなくAIの説明可能性や信頼性の向上につながると期待している。本成果は、脳科学と自然言語処理を融合した新しいAI設計の方向性を示すものであり、国際会議ICLRで発表された。

＜関連情報＞

認知推論者の混合脳のような特化機能を備えたモジュール型推論 Mixture of Cognitive Reasoners Modular Reasoning with Brain-Like Specialization

Badr AlKhamissi, C. Nicolò De Sabbata, Greta Tuckute^,, Zeming Chen,Martin Schrimpf^*,, Antoine Bosselut
ICLR 2026

Abstract

Human cognitive behavior arises from the interaction of specialized brain networks dedicated to distinct functions, such as language, logic, and social reasoning. Inspired by this organization, we propose Mixture of Cognitive Reasoners (MiCRo): a modular, transformer-based architecture post-trained with a curriculum that induces functional specialization across experts. Concretely, we partition the layers of a pretrained language model into four expert modules aligned with well-studied cognitive networks in the human brain. MiCRo offers three key advantages over standard language models. (1) The specialized experts are interpretable and causally meaningful—ablating a module causes substantial drops on benchmarks requiring its specialized domain. (2) MiCRo’s behavior can be dynamically steered at inference time by routing tokens to particular experts (e.g., favoring social over logical reasoning), enabling fine-grained control over outputs. (3) MiCRo outperforms or matches comparable baselines on both machine-learning reasoning benchmarks (e.g., GSM8K, BBH) and alignment to human behavior (CogBench), while maintaining interpretability. Taken together, cognitively grounded functional specialization yields models that are both more human-like and more human-interpretable.