2026-04-07 中国科学院(CAS)
<関連情報>
- https://english.cas.cn/newsroom/cas-in-media/202604/t20260407_1155346.shtml
- https://essd.copernicus.org/articles/18/1995/2026/
ChinaAI-FSC:中国向け包括的なAI対応MODIS積雪率データセット(2000年~2022年) ChinaAI-FSC: a comprehensive AI-ready MODIS fractional snow cover dataset for China (2000–2022)
Jinliang Hou, Mingkai Zhang, Xiaohua Hao, Jifu Guo, Peng Dou, Ying Zhang, and Chunlin Huang
Earth System Science Data Published:17 Mar 2026
DOI:https://doi.org/10.5194/essd-18-1995-2026

Abstract
We present ChinaAI-FSC, the first large-scale, standardized, AI-ready fractional snow cover (FSC) sample collection for China, spanning 22 snow seasons from 2000 to 2022 and addressing a critical gap in long-term snow monitoring. The dataset consists of 47 728 samples (each 128 × 128 MODIS-pixel tiles), where high-resolution Landsat-5/7/8/9 and Sentinel-2 imagery provide consistent FSC reference labels. A total of 20 feature variables, including MODIS surface reflectance (bands 1–7), topographic attributes, forest and land cover information, and geolocation factors, were extracted to enable both point-scale and tile-scale spatially contextualized AI modelling. A structured and transparent workflow, encompassing systematic sample preparation, rigorous quality control, spatiotemporal sample partitioning, and standardized metadata, ensures reproducibility, physical consistency, and interoperability across machine learning and deep learning applications. Dataset reliability and AI-readiness were systematically evaluated using a novel “Four Layers-Four Domains-Fifteen Attributes (4L-4D-15A)” assessment protocol, covering data, information, system, and application dimensions. The quality, reliability, and usability of ChinaAI-FSC were demonstrated through three representative use cases: (1) benchmarking of six ML/DL models (ANN, SVR, RF, CNN, UNet, and ResNet), (2) validation of the standard MODIS FSC product, and (3) nationwide seamless FSC mapping. By providing harmonized, validated, and well-documented samples, ChinaAI-FSC establishes a unified foundation for AI-driven snow cover mapping, long-term monitoring, and cryosphere–hydrological modelling, promoting reproducible, interoperable, and next-generation research in cryospheric science. The dataset is publicly available from the National Tibetan Plateau Data Center (TPDC) (Hou et al., 2025a) at https://doi.org/10.11888/Cryos.tpdc.303034 (also accessible via https://cstr.cn/18406.11.Cryos.tpdc.303034, last access: 24 February 2026) and from Zenodo (Hou et al., 2025b) at https://doi.org/10.5281/zenodo.17707386.


