事前学習済みの大規模言語モデルに基づいて原子力発電所の運転パラメータを予測する新手法（New Method Predicts Nuclear Power Plant Operating Parameters Based on Pre-trained Large Language Model）

2026-03-12 中国科学院（CAS）

中国科学院合肥物質科学研究院の研究チームは、原子力発電所の運転パラメータを高精度で予測するAIフレームワーク「NPP-GPT」を開発した。複雑で高次元の運転データに対応するため、数値時系列データを事前学習済み言語モデルの表現空間へ整合させるクロスモーダル転移学習を採用。入力埋め込み再構成と自己教師学習（ランダムマスキング）により学習し、さらにLoRAによる効率的ファインチューニングで原子力分野の知識を組み込んだ。6種類の運転条件データで評価した結果、既存の時系列予測手法より高い精度と長期予測性能を示し、ノイズや欠損データにも高いロバスト性を確認。原発の安全監視や運転意思決定支援への応用が期待される。

The framework of the proposed NPP-GPT in nuclear energy systems (Image by CHANG Ling)

＜関連情報＞

NPP-GPT: Forecasting nuclear power plants operating parameters using pre-trained large language model

Ling Chang, Haibo Yu, Minghan Yang, Ziheng Zhang, Shuai Chen, Jianye Wang
Applied Energy Available online: 29 January 2026
DOI:https://doi.org/10.1016/j.apenergy.2026.127438

Highlights

Proposes NPP-GPT, a novel large language model-based approach with two-stage cross-modal transfer learning for forecasting nuclear power plant operating parameters.
Bridges modality gaps and integrates domain knowledge through redesigned input embedding, masked self-supervised learning, and fine-tuning.
Validates method efficacy through comparative, generalization, and robustness experiments across six typical operational scenarios.

Abstract

Accurate long-term forecasting of operating parameters in nuclear power plants (NPPs) is crucial for safety and cost-effective maintenance. However, the complexity and uncertainty of reactors, along with the high-dimensional and large-scale operating data, present challenges in capturing intricate dynamic behaviors and long-term dependencies. This paper presents NPP-GPT, which for the first time investigates the potential of using pre-trained Large Language Model (LLM) to forecast long-term parameters from historical NPP data without explicit prompt engineering. Considering the modal disparity between textual pre-training data and numerical energy data, NPP-GPT employs a two-stage cross-modal transfer learning strategy that preserves the native next-token forecasting capability of LLMs while unlocking their potential for precise energy forecasting. First, the modal gap is bridged through input embedding reconstruction and Self-Supervised Learning (SSL). Second, domain-specific energy knowledge is integrated via LoRA fine-tuning. The framework was rigorously validated using data from an established advanced nuclear energy research platform, focusing on a Chinese Pressurized Water Reactor (CPR-1000). Comprehensive experiments covering diverse operational scenarios, including normal and multiple fault conditions, demonstrated that NPP-GPT outperforms both classical and advanced time-series forecasting models in accuracy and generalization, especially in long-term forecasting and under conditions with noise and missing data. This study offers a novel and generalizable solution for forecasting tasks in energy sectors.

月	火	水	木	金	土	日
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31