AIチャットボットがミスをする理由を理解するために、言語と思考を切り離す(Separating Language from Thought to Understand why AI Chat Bots Make Mistakes)

2023-06-062023-08-27

2023-06-05 テキサス大学オースチン校(UT Austin)

◆テキサス大学オースティン校、マサチューセッツ工科大学、カリフォルニア大学ロサンゼルス校の科学者たちは、ChatGPTのようなAIプログラムが人間の作家に比べて誤りを犯しやすい理由を説明しました。
◆大規模言語モデル(LLM)は形式的な言語能力は持っているものの、機能的な言語能力には欠けているとされ、そのため奇妙なミスや誤りが生じることがあると述べています。
◆人間の思考と言語に関する研究にも影響を与える可能性があります。今後は、非言語的な認知能力も考慮した方法を取り入れて、機能的な言語能力を持つプログラムを作成する必要があると主張しています。

<関連情報>

大規模言語モデルにおける言語と思考の解離:認知的視点
Dissociating language and thought in large language models: a cognitive perspective

Kyle Mahowald, Anna A. Ivanova, Idan A. Blank, Nancy Kanwisher, Joshua B. Tenenbaum, Evelina Fedorenko
arXiv Submitted on: 16 Jan 2023
DOI:https://doi.org/10.48550/arXiv.2301.06627

Today’s large language models (LLMs) routinely generate coherent, grammatical and seemingly meaningful paragraphs of text. This achievement has led to speculation that these networks are — or will soon become — “thinking machines”, capable of performing tasks that require abstract knowledge and reasoning. Here, we review the capabilities of LLMs by considering their performance on two different aspects of language use: ‘formal linguistic competence’, which includes knowledge of rules and patterns of a given language, and ‘functional linguistic competence’, a host of cognitive abilities required for language understanding and use in the real world. Drawing on evidence from cognitive neuroscience, we show that formal competence in humans relies on specialized language processing mechanisms, whereas functional competence recruits multiple extralinguistic capacities that comprise human thought, such as formal reasoning, world knowledge, situation modeling, and social cognition. In line with this distinction, LLMs show impressive (although imperfect) performance on tasks requiring formal linguistic competence, but fail on many tests requiring functional competence. Based on this evidence, we argue that (1) contemporary LLMs should be taken seriously as models of formal linguistic skills; (2) models that master real-life language use would need to incorporate or develop not only a core language module, but also multiple non-language-specific cognitive capacities required for modeling thought. Overall, a distinction between formal and functional linguistic competence helps clarify the discourse surrounding LLMs’ potential and provides a path toward building models that understand and use language in human-like ways.