Estimation of rainfall and stream flow missing data for Terengganu, Malaysia by using interpolation technique methods





Missing Data, Interpolation Method, Rainfall and Stream Flow Data


Missing data is a serious problem in many climatological time series. Daily rainfall and stream flow datasets with no missing values are required for efficient estimation for application purposes. In order to estimate any missing observations in data, interpolation techniques are often used. This study focuses on comparing a few selected methods in the estimation of missing rainfall and stream flow data. The interpolation techniques studied were the Arithmetic Average (AA) method, Normal Ratio (NR) method, Inverse Distance (ID) method and Coefficient of Correlation (CC) method. However, in the case when there is no information from neighboring stations, the mean on the same day and month but at different years is taken as estimation of the missing value on that particular date. Twenty years of daily rainfall and stream flow data at 12 stations located at Terengganu were used for this study. In testing to verify which method is the best in evaluating missing values at the target station using information from the nearby stations (in the radius range of 10 km to 50 km), several percentages of missing values were considered. The validation of the best estimation methods is done based on the estimation error; with tests such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Correlation Coefficient (R) tests.


Burrough, P. A., McDonnell, R. A. 1998. Principles of Geographical Information Systems. Oxford: Oxford University Press.

Chow, V.T., Maidment, D. R. and Mays, L. W. 1988. Applied Hydrology. Singapore: Mc Graw Hill Book Company.

Dastorani, M. T, Moghadamnia, A., Piri, J. and Rico-Ramirez, M. A. 2010. Application of ANN and ANFIS models for reconstructing missing flow data. Environmental Monitoring and Assessment, 166, 421-434.

De Silva, R. P., Dayawansa, N. D. K., Ratnasari, M. D. 2007. A comparison of methods used in estimating missing rainfall data. The Journal of Agricultural Science, 3(2), 101-108.

Hasan, M. M., Croke, B. F. W. 2013. Filling gaps in daily rainfall data: a statistical approach. 20th International Congress on Modelling and Simulation. 1–6 December 2013. Adelaide, Australia, 380-386.

Ibrahim, N., Wibowo, A. 2014. Support vector regression with missing data treatment based variables selection for water level prediction of Galas River in Kelantan Malaysia. WSEAS Transactions on Mathematics, 13, 69-78.

Paulhus, J. L. H., Kohler, M. A. 1952. Interpolation of Missing precipitation records. Monthly Weather Review, 80, 8, 129-133.

Suhaila, J., Deni, S. M., Jemain, A. A. 2008. Revised spatial weighting methods for estimation of missing rainfall data. Asia-Pacific Journal of Atmospheric Sciences, 44, 2, 93-104.

Teegavarapu, R. S. V., Chandramouli, V. 2005. Improved weighting methods, deterministic and stochastic data-driven models for estimation of missing precipitation records. Journal of Hydrology, 312, 1, 191-206.