GRU and XGBoost Performance with Hyperparameter Tuning Using GridSearchCV and Bayesian Optimization on an IoT-Based Weather Prediction System

Hendri Darmawan; Mike Yuliana; Moch. Zen Samsono Hadi

doi:10.18517/ijaseit.13.3.18377

GRU and XGBoost Performance with Hyperparameter Tuning Using GridSearchCV and Bayesian Optimization on an IoT-Based Weather Prediction System

Hendri Darmawan, Mike Yuliana, Moch. Zen Samsono Hadi

Abstract

Weather is essential to human life, but it is difficult to forecast due to its diverse nature. We evaluated and compared the accuracy of two machine learning algorithms, GRU and XGBoost, in predicting weather patterns. We used GridSearchCV to tune the hyperparameters for the GRU algorithm and Bayesian optimization for the XGBoost algorithm. We used regression to predict weather sensor data and classification to predict rainfall in the following four days. We then deployed the best-performing model to the cloud server and connected it to the local IoT device with weather sensors in Sedati, Sidoarjo Regency, Indonesia. We conducted tests using data from the BMKG Juanda Sidoarjo and data from the local IoT device. The findings indicated that the XGBoost regression model outperformed the GRU model in the first stage, with an average RMSE of 1.2728125. In comparison, the average RMSE for GRU regression was 1.551666667. In the second stage, however, GRU regression performed better, with an average RMSE of 2.23, while the XGBoost regression had 2.28. In the classification tests, the GRU model had a higher F1 score of 0.88 in the first stage, while the XGBoost classification was 0.86. Both models had the same accuracy of 0.75 when tested with IoT data. However, the GRU classification model was better since it considered the context of the prediction, resulting in a lower likelihood of rain when it was not raining.

Keywords

Gated recurrent unit; XGBoost; multivariate weather prediction; internet of things

Full Text:

PDF

References

M. G. Schultz et al., â€œCan deep learning beat numerical weather prediction?,â€ Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol. 379, no. 2194. Royal Society Publishing, Apr. 05, 2021. doi: 10.1098/rsta.2020.0097.

S. F. Tekin, O. Karaahmetoglu, F. Ilhan, I. Balaban, and S. S. Kozat, â€œSpatio-temporal weather forecasting and attention mechanism on convolutional LSTMs,â€ ArXiv, Feb. 2021, doi: 10.48550/ARXIV.2102.00696.

D. Munandar, â€œMultilayer perceptron (MLP) and autoregressive integrated moving average (ARIMA) models in multivariate input time series data: solar irradiance forecasting,â€ International Journal on Advanced Science Engineering Information Technology, vol. 9, no. 1, 2019, doi: 10.18517/ijaseit.9.1.6426.

G. Chen, S. Liu, and F. Jiang, â€œDaily weather forecasting based on deep learning model: A case study of Shenzhen city, China,â€ Atmosphere (Basel), vol. 13, no. 8, Aug. 2022, doi: 10.3390/atmos13081208.

X. Chen, Y. Liu, Y. Shen, K. Zhang, and H. Wei, â€œA data interpolation method for missing irradiance data of photovoltaic power station,â€ in 2020 Chinese Automation Congress (CAC), Nov. 2020, pp. 4735â€“4740. doi: 10.1109/CAC51589.2020.9326730.

M. Chhetri, S. Kumar, P. P. Roy, and B. G. Kim, â€œDeep BLSTM-GRU model for monthly rainfall prediction: A case study of Simtokha, Bhutan,â€ Remote Sens (Basel), vol. 12, no. 19, pp. 1â€“13, Oct. 2020, doi: 10.3390/rs12193174.

T. E. Putra, Husaini, D. Asrina, and M. Dirhamsyah, â€œThe ability of the fast fourier transform to de-noise a strain signal,â€ in IOP Conference Series: Materials Science and Engineering, Oct. 2020, vol. 931, no. 1. doi: 10.1088/1757-899X/931/1/012011.

A. GonzÃ¡lez-DÃez, J. A. Barreda-ArgÃ¼eso, L. RodrÃguez-RodrÃguez, and J. FernÃ¡ndez-Lozano, â€œThe use of filters based on the Fast Fourier Transform applied to DEMs for the objective mapping of karstic features,â€ Geomorphology, vol. 385, Jul. 2021, doi: 10.1016/j.geomorph.2021.107724.

S. U. Khan, M. H. Siddiqi, and Y. Alhwaiti, â€œSignal-to-noise ratio comparison of several filters against Phantom image,â€ J Healthc Eng, vol. 2022, p. 4724342, 2022, doi: 10.1155/2022/4724342.

P. Bellavista, A. Corradi, and C. Giannelli, â€œEvaluating filtering strategies for decentralized handover prediction in the wireless internet,â€ in 11th IEEE Symposium on Computers and Communications (ISCCâ€™06), 2006, pp. 167â€“174. doi: 10.1109/ISCC.2006.70.

H. Darmawan, M. Yuliana, and Moch. Z. S. Hadi, â€œReal-time weather prediction system using GRU with daily surface observation data from IoT,â€ in 2022 International Electronics Symposium (IES), 2022, pp. 221â€“226. doi: 10.1109/IES55876.2022.9888468.

M. Steininger, K. Kobs, P. Davidson, A. Krause, and A. Hotho, â€œDensity-based weighting for imbalanced regression,â€ Mach Learn, vol. 110, no. 8, pp. 2187â€“2211, Aug. 2021, doi: 10.1007/s10994-021-06023-5.

J. M. Johnson and T. M. Khoshgoftaar, â€œSurvey on deep learning with class imbalance,â€ J Big Data, vol. 6, no. 1, Dec. 2019, doi: 10.1186/s40537-019-0192-5.

H. Patel, D. Singh Rajput, G. Thippa Reddy, C. Iwendi, A. Kashif Bashir, and O. Jo, â€œA review on classification of imbalanced data for wireless sensor networks,â€ International Journal of Distributed Sensor Networks, vol. 16, no. 4. SAGE Publications Ltd, Apr. 01, 2020. doi: 10.1177/1550147720916404.

P. Zhang, Y. Jia, and Y. Shang, â€œResearch and application of XGBoost in imbalanced data,â€ Int J Distrib Sens Netw, vol. 18, no. 6, Jun. 2022, doi: 10.1177/15501329221106935.

L. Huang, J. Qin, Y. Zhou, F. Zhu, L. Liu, and L. Shao, â€œNormalization techniques in training DNNs: Methodology, analysis and application,â€ Sep. 2020, doi: 10.48550/arXiv.2009.12836.

G. Aksu, C. O. GÃ¼zeller, and M. T. Eser, â€œThe effect of the normalization method used in different sample sizes on the success of artificial neural network model,â€ International Journal of Assessment Tools in Education, pp. 170â€“192, Apr. 2019, doi: 10.21449/ijate.479404.

X. Zhou, J. Xu, P. Zeng, and X. Meng, â€œAir pollutant concentration prediction based on GRU method,â€ in Journal of Physics: Conference Series, Mar. 2019, vol. 1168, no. 3. doi: 10.1088/1742-6596/1168/3/032058.

R. G. S. K., A. Kumar Verma, and S. Radhika, â€œK-nearest neighbors and grid search cv based real time fault monitoring system for industries,â€ in 2019 5th International Conference for Convergence in Technology (I2CT), 2019, pp. 1â€“5.

I. S. Isa, M. S. A. Rosli, U. K. Yusof, M. I. F. Maruzuki, and S. N. Sulaiman, â€œOptimizing the hyperparameter tuning of YOLOv5 for underwater detection,â€ IEEE Access, vol. 10, pp. 52818â€“52831, 2022, doi: 10.1109/ACCESS.2022.3174583.

K. Nakamura, B. Derbel, K. J. Won, and B. W. Hong, â€œLearning-rate annealing methods for deep neural networks,â€ Electronics (Switzerland), vol. 10, no. 16, Aug. 2021, doi: 10.3390/electronics10162029.

K. Mukherjee, A. Khare, and A. Verma, â€œA simple dynamic learning rate tuning algorithm for automated training of DNNs,â€ ArXiv, Oct. 2019, doi: 10.48550/ARXIV.1910.11605.

P. Cu Thi, J. E. Ball, and N. H. Dao, â€œEarly stopping technique using a genetic algorithm for calibration of an urban runoff model,â€ International Journal of River Basin Management, 2021, doi: 10.1080/15715124.2021.1910517.

A. Ibrahem Ahmed Osman, A. Najah Ahmed, M. F. Chow, Y. Feng Huang, and A. El-Shafie, â€œExtreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia,â€ Ain Shams Engineering Journal, vol. 12, no. 2, pp. 1545â€“1556, Jun. 2021, doi: 10.1016/j.asej.2020.11.011.

M. Miranda, K. Valeriano, and J. Sulla-Torres, â€œA detailed study on the choice of hyperparameters for transfer learning in covid-19 image datasets using Bayesian optimization,â€ International Journal of Advanced Computer Science and Applications, vol. 12, no. 4, pp. 327â€“335, 2021, doi: 10.14569/IJACSA.2021.0120441.

Q. Liang et al., â€œBenchmarking the performance of Bayesian optimization across multiple experimental materials science domains,â€ NPJ Comput Mater, vol. 7, no. 1, Dec. 2021, doi: 10.1038/s41524-021-00656-9.

M. Alizamir et al., â€œAdvanced machine learning model for better prediction accuracy of soil temperature at different depths,â€ PLoS One, vol. 15, no. 4, Apr. 2020, doi: 10.1371/journal.pone.0231055.

C. Esposito, G. A. Landrum, N. Schneider, N. Stiefl, and S. Riniker, â€œGHOST: Adjusting the decision threshold to handle imbalanced data in machine learning,â€ J Chem Inf Model, vol. 61, no. 6, pp. 2623â€“2640, Jun. 2021, doi: 10.1021/acs.jcim.1c00160.

I. M. de Diego, A. R. Redondo, R. R. FernÃ¡ndez, J. Navarro, and J. M. Moguerza, â€œGeneral performance score for classification problems,â€ Applied Intelligence, vol. 52, no. 10, pp. 12049â€“12063, Aug. 2022, doi: 10.1007/s10489-021-03041-7.

P. FoltÃ½nek, M. Babiuch, and P. Å urÃ¡nek, â€œMeasurement and data processing from Internet of Things modules by dual-core application using ESP32 board,â€ Measurement and Control (United Kingdom), vol. 52, no. 7â€“8, pp. 970â€“984, Sep. 2019, doi: 10.1177/0020294019857748.

Y. S. Mandza and A. Raji, â€œIoTivity cloud-enabled platform for energy management applications,â€ IoT, vol. 3, no. 1, pp. 73â€“90, Dec. 2021, doi: 10.3390/iot3010004.

D. S. Anindya, M. Yuliana, and Moch. Z. S. Hadi, â€œIoT based climate prediction system using long short-term memory (LSTM) algorithm as part of smart farming 4.0,â€ in 2022 International Electronics Symposium (IES), 2022, pp. 255â€“260. doi: 10.1109/IES55876.2022.9888486.

M. Ohyver, J. v. Moniaga, I. Sungkawa, B. E. Subagyo, and I. A. Chandra, â€œThe comparison firebase real-time database and MySQL database performance using wilcoxon signed-rank test,â€ in Procedia Computer Science, 2019, vol. 157, pp. 396â€“405. doi: 10.1016/j.procs.2019.08.231.

T. O. Hodson, â€œRoot mean square error (RMSE) or mean absolute error (MAE): when to use them or not,â€ Geosci Model Dev, vol. 15, no. 14, pp. 5481â€“5487, 2022, doi: 10.5194/gmd-2022-64.

F. Zhang et al., â€œWhat is the predictability limit of midlatitude weather?,â€ J Atmos Sci, vol. 76, no. 4, pp. 1077â€“1091, 2019, doi: 10.1175/JAS-D-18-0269.1.

H. Zhu, M. C. Wheeler, A. H. Sobel, and D. Hudson, â€œSeamless precipitation prediction skill in the tropics and extratropics from a global model,â€ Mon Weather Rev, vol. 142, no. 4, pp. 1556â€“1569, 2014, doi: 10.1175/MWR-D-13-00222.1.

DOI: http://dx.doi.org/10.18517/ijaseit.13.3.18377

Refbacks

There are currently no refbacks.

Published by INSIGHT - Indonesian Society for Knowledge and Human Development

International Journal on Advanced Science, Engineering and Information Technology

GRU and XGBoost Performance with Hyperparameter Tuning Using GridSearchCV and Bayesian Optimization on an IoT-Based Weather Prediction System

Abstract

Keywords

Full Text:

References

Refbacks