TY - JOUR
T1 - Attention mechanism-based transfer learning model for day-ahead energy demand forecasting of shopping mall buildings
AU - Yuan, Yue
AU - Chen, Zhihua
AU - Wang, Zhe
AU - Sun, Yifu
AU - Chen, Yixing
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/5/1
Y1 - 2023/5/1
N2 - The forecasting performance of data-driven models decreases rapidly with a limited training dataset. Herein, we sought to solve this problem by developing an attention mechanism-based transfer learning model and comparing its predictive ability in day-ahead energy consumption with those of three direct learning models: artificial neural networks with auto-regression (AR-ANN), random forest with auto-regression (AR-RF), and long short-term memory neural network (LSTM). Our target building was a large-scale shopping mall in Harbin, with 2 years of monitored data. The 2-months to 1-year data selected from the first year and all data from the second year were used as the training and testing sets, respectively. These models predicted the target building's peak electricity demand (PED) and total energy consumption (TEC). The results showed that the proposed transfer learning model outperformed the three direct learning models when data were insufficient in the training set. Specifically, the direct prediction models' lowest PED and TEC prediction errors were 34.34% and 26.32%, respectively, with 2-month training data available. In comparison, the corresponding prediction errors of the proposed model were only 12.48% and 10.78%, respectively. This study demonstrated the excellent performance of the proposed model with limited data.
AB - The forecasting performance of data-driven models decreases rapidly with a limited training dataset. Herein, we sought to solve this problem by developing an attention mechanism-based transfer learning model and comparing its predictive ability in day-ahead energy consumption with those of three direct learning models: artificial neural networks with auto-regression (AR-ANN), random forest with auto-regression (AR-RF), and long short-term memory neural network (LSTM). Our target building was a large-scale shopping mall in Harbin, with 2 years of monitored data. The 2-months to 1-year data selected from the first year and all data from the second year were used as the training and testing sets, respectively. These models predicted the target building's peak electricity demand (PED) and total energy consumption (TEC). The results showed that the proposed transfer learning model outperformed the three direct learning models when data were insufficient in the training set. Specifically, the direct prediction models' lowest PED and TEC prediction errors were 34.34% and 26.32%, respectively, with 2-month training data available. In comparison, the corresponding prediction errors of the proposed model were only 12.48% and 10.78%, respectively. This study demonstrated the excellent performance of the proposed model with limited data.
KW - Attention mechanisms
KW - Energy prediction
KW - Shopping mall building
KW - Transfer learning
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000945241100001
UR - https://openalex.org/W4319341382
UR - https://www.scopus.com/pages/publications/85147896152
U2 - 10.1016/j.energy.2023.126878
DO - 10.1016/j.energy.2023.126878
M3 - Journal Article
SN - 0360-5442
VL - 270
JO - Energy
JF - Energy
M1 - 126878
ER -