2025-03-04 中国科学院(CAS)
Fig. Framework of the proposed robust feature selection method (Image by NIMTE)
<関連情報>
- https://english.cas.cn/newsroom/research_news/infotech/202503/t20250310_903409.shtml
- https://ieeexplore.ieee.org/document/10887394
限られたサンプルの工業データに対する相互情報内のノイズエントロピーの除去による頑健な特徴選択 Robust Feature Selection by Removing Noise Entropy Within Mutual Information for Limited-Sample Industrial Data
Chan Xu; Silu Chen; Xiangjie Kong; Chi Zhang; Guilin Yang; Zaojun Fang
IEEE Transactions on Industrial Informatics Published:14 February 2025
DOI:https://doi.org/10.1109/TII.2025.3534417
Abstract
Feature selection is challenging in high-dimensional and small-sample data, particularly in industrial informatics with diverse noise sources. The information entropy of feature noise is included in mutual information of a label and noise-corrupted features, which can be removed to increase classification accuracy. In this article, we propose a robust feature selection method by eliminating feature noise in the relevance measure. Feature noise is modeled as a zero-mean censored normal distribution, so its entropy is determined by solving the variance equation based on the maximum entropy principle. Then, a noisy channel for feature transmission is proposed to extract class-relevant noise component. Furthermore, a noise-free mutual information metric is developed by removing noise entropy within mutual information. Eventually, a novel criterion is proposed by maximizing relevance based on noise-free mutual information while minimizing redundancy. Experimental results confirm the effectiveness of our approach on datasets from various industrial sectors.