Potential of kernel and tree-based machine-learning models for estimating missing data of rainfall

Yükleniyor...
Küçük Resim

Tarih

2020

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Hong Kong Polytechnic Univ, Dept Civil & Structural Eng

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

In this study, two kernel-based models were used which include Support Vector Regression (SVR) and Gaussian Process Regression (GPR) and were compared with two tree-based models that are M5 and Random Forest (RF) for estimating missing monthly precipitation data in Antakya, Dortyol, Iskenderun and Samandag stations, which are the important precipitation stations in the Eastern Mediterranean region, Turkey. For this purpose, firstly 10% random precipitation data were assumed as missing data for the period 1980-2019. Secondly, the missing data in each station was estimated with the data of other stations within the framework of four data combinations scenarios. In Kernel-based SVR and GPR methods, the RBF kernel gave suitable results for the selected study area. While SVR and RF methods gave very close estimation results, the SVR method gave relatively better results than the other methods especially in error minimizing aspects. Gaussian function based GPR model generally tries to estimate missing data closer to means. This is the main disadvantage of the GPR model and therefore it is unsuccessful in the estimation process. Finally, the results showed that the algorithms based on machine learning are successful in estimating the missing precipitation data.

Açıklama

Anahtar Kelimeler

Missing data, rainfall, machine learning, random Forest, Eastern Mediterranean, Turkey

Kaynak

Engineering Applications of Computational Fluid Mechanics

WoS Q Değeri

Q1

Scopus Q Değeri

Q1

Cilt

14

Sayı

1

Künye