TY - GEN
T1 - A Multi-Modal Transformer for Short-Term Wind Power Forecasting with Spatio-Temporal Meteorological Data
AU - Chen, Yu
AU - Huang, Wenqi
AU - Wang, Ying
AU - Zhang, Qing
AU - Li, Kai
AU - Wang, Xiaohua
N1 - Publisher Copyright:
© 2026 IEEE.
PY - 2026
Y1 - 2026
N2 - Wind power forecasting (WPF) is a significant and effective technique to cope with the negative impact of its intermittency on grid stability. Although several deep learning (DL) methods have been applied to WPF as time series analysis approaches, there are still fewer studies considering multi-modal features for short-term WPF. In this paper, a novel transformer-based DL architecture is proposed to extract temporal and spatial dependencies from regional meteorology and time series using different encoders customized for each modality. Finally, a hierarchical multi-head cross-attention module is designed based on the logical dependencies between different modalities. The experimental results show that, due to the effectiveness of spatiotemporal context and cross-attention, the proposed model reduces root mean square error by 0.495-0.950 MW over other baselines, and the forecasting accuracy is improved by 0.56%-1.15%.
AB - Wind power forecasting (WPF) is a significant and effective technique to cope with the negative impact of its intermittency on grid stability. Although several deep learning (DL) methods have been applied to WPF as time series analysis approaches, there are still fewer studies considering multi-modal features for short-term WPF. In this paper, a novel transformer-based DL architecture is proposed to extract temporal and spatial dependencies from regional meteorology and time series using different encoders customized for each modality. Finally, a hierarchical multi-head cross-attention module is designed based on the logical dependencies between different modalities. The experimental results show that, due to the effectiveness of spatiotemporal context and cross-attention, the proposed model reduces root mean square error by 0.495-0.950 MW over other baselines, and the forecasting accuracy is improved by 0.56%-1.15%.
KW - multi-modal fusion
KW - short-term WPF
KW - spatiotemporal context
KW - transformer
UR - https://www.scopus.com/pages/publications/105037450632
U2 - 10.1109/PESIM67009.2026.11438649
DO - 10.1109/PESIM67009.2026.11438649
M3 - 会议稿件
AN - SCOPUS:105037450632
T3 - 2026 IEEE PES International Meeting, PES IM 2026
BT - 2026 IEEE PES International Meeting, PES IM 2026
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2026 IEEE PES International Meeting, PES IM 2026
Y2 - 18 January 2026 through 21 January 2026
ER -