本文已被:浏览 6次 下载 78次
投稿时间:2024-08-01 修订日期:2025-07-08
投稿时间:2024-08-01 修订日期:2025-07-08
中文摘要: 使用2022年河南省23个辐射观测站总辐照度数据和CMA-WSP2.0模式产品,通过LASSO回归选取特征变量,建立训练数据集和测试数据集,采用机器学习方法(随机森林、XGBoost、LightGBM)对训练数据集进行模型训练,订正河南省CMA-WSP2.0模式预报的总辐照度,并对订正结果分站点、分区域、分季节、总辐照度分级检验,结论如下:随机森林、XGBoost、LightGBM三种机器学习方法订正效果良好,相较于CMA-WSP2.0预报结果,平均绝对误差和均方根误差显著降低,24 h的准确率和合格率显著提升。其中LightGBM订正效果最优,平均绝对误差相较于CMA-WSP2.0预报减小了18.32~32.91 W·m-2,减小比例在38%~56%,均方根误差减小比例在36%~52%;24 h的平均准确率和平均合格率较CMA-WSP2.0分别提升了7.3%、5.7%。区域统计与站点统计结果较为一致,对于5个区域而言,豫西区域订正效果最好。三种机器学习方法订正后的偏差范围相比CMA-WSP2.0预报集中范围更窄,偏差分布在零值附近的概率更大。在各季节检验结果中三种方法对于冬季订正效果更为显著。对于不同的总辐照度等级,三种机器学习方法均有效改善了CMA-WSP2.0预报,随着总辐照度等级的增加,订正效果总体呈逐渐减弱的趋势。研究结果可为提高河南省总辐照度预报能力提供有益参考。
中文关键词: CMA-WSP2.0,机器学习,总辐照度,分区建模
Abstract:Based on the total irradiance data from 23 radiation observation stations in Henan Province in 2022 and CMA-WSP2.0 model products, in this paper, characteristic variables are selected by LASSO regression, training data sets and test data sets are established, and models are trained by machine learning methods (random forest, XGBoost and LightGBM) with the training data sets. Besides, the total irradiance forecasts by CMA-WSP2.0 model in Henan Province are corrected, and the corrected results are tested by site and region, by season and by total irradiance classification. The results are as follows. The three machine learning methods (random forest, XGBoost and LightGBM) have good correction effects. Compared with the CMA-WSP2.0 prediction results, the mean absolute error (MAE) and root mean square error (RMSE) are significantly reduced, and the 24 h accuracy rate and 24 h qualification rate are significantly improved. Among them, LightGBM has the best correction effect. The MAE decreases by 18.32-32.91 W·m-2, the reduction proportion of MAE decreases by 38%-56%, and the reduction proportion of RMSE decreases by 36%-52%. Moreover, the 24 h average accuracy and 24 h average qualification rate are raised by 7.3% and 5.7%, respectively. The results of regional statistics are consistent with those of the stations. For the five regions of Henan Province, the correction effect for western Henan is the best. The corrected deviation range of the three machine machine learning methods is narrower than that of the CMA-WSP2.0 test set, and the probability of the deviation distribution near zero is greater. Among the seasonal test results, the three machine learning methods have more significant correction effect for the winter prediction. For different total irradiance levels, the three machine learning methods can effectively improve the CMA-WSP2.0 prediction, but the correction effect tends to gradually weaken with the increase of total irradiance levels. These findings could provide a useful reference for improving the ability of total irradiance forecast in Henan Province.
文章编号: 中图分类号: 文献标志码:
基金项目:中国气象局·河南省农业气象保障与应用技术重点实验室应用技术研究基金(KQ202320)、河南省科技研发计划联合基金 (222103810093、232103810092)共同资助
引用文本:
程凯琪,魏璐,李伊吟,孙睿藻,张凡,2025.基于机器学习的河南省短波辐射数值预报订正方法[J].气象,51(8):941-953.
CHENG Kaiqi,WEI Lu,LI Yiyin,SUN Ruizao,ZHANG Fan,2025.Correction Method of Shortwave Radiation Numerical Forecast in Henan Province Based on Machine Learning[J].Meteor Mon,51(8):941-953.
程凯琪,魏璐,李伊吟,孙睿藻,张凡,2025.基于机器学习的河南省短波辐射数值预报订正方法[J].气象,51(8):941-953.
CHENG Kaiqi,WEI Lu,LI Yiyin,SUN Ruizao,ZHANG Fan,2025.Correction Method of Shortwave Radiation Numerical Forecast in Henan Province Based on Machine Learning[J].Meteor Mon,51(8):941-953.
