A comparison of method for treating missing daily rainfall data in Peninsular Malaysia





Daily rainfall, imputation, inverse distance, homogeneity


This study modified a method for treating missing values in daily rainfall data from 104 selected rainfall stations. The daily rainfall data were obtained from the Department of Irrigation and Drainage Malaysia (DID) for the periods of 1965 to 2015. The missing values throughout the 51 years period were estimated using the various types of weighting methods. In determining the best imputation method, three test for evaluating model performance has been used. The findings of this study indicate that the proposed method is more efficient than the traditional method. The homogeneity of the data series was checked using the homogeneity tests recommended by the existing literatures. The results indicated that more than 40% of the rainfall stations were homogenous based on the proposed method.


Bennett, N. D., Newham, L. T. H., Croke, B. F. W. & Jakeman, A. J. 2007. Patching and Disaccumulation of Rainfall Data for Hydrological Modelling. International Congress on Modelling and Simulation (MODSIM 2007). December 2007. University of Canterbury, Christchurch, New Zealand. 2520–2526.

Elshorbagy, A. A., Panu, U. S. & Simonovic, S. P. 2000. Group-Based Estimation of Missing Hydrological Data: I. Approach and General Methodology. Hydrological Sciences. 45(6), 849–866.

Kajornrit, J., Wong, K. W. & Fung, C. C. 2012. A Comparative Analysis of Soft Computing Techniques Used To Estimate Missing Precipitation Records. 19th ITS Biennial Conference 2012. 18-21 November 2012. Bangkok, Thailand.

Peterson, T. C., Easterling, D. R., Karl, T. R., Groisman, P., Nicholls, N., Plummer, N., Torok, S. et al. 1998. Homogeneity Adjustments of In Situ Atmospheric Climate Data: A Review. International Journal of Climatology. 18(13), 1493–1517.

Zhang, S. 2012. Nearest Neighbor Selection For Iteratively kNN Imputation. Journal of Systems and Software. 85(11), 2541–2552.

Peugh, J. L. & Enders, C. K. 2004. Missing Data in Educational Research : A Review of Reporting Practices and Suggestions for Improvement. Review of Educational Research. 74(4), 525–556.

Di Piazza, A., Lo Conti, F., Noto, L. V., Viola, F. & La Loggia, G. 2011. Comparative Analysis of Different Techniques for Spatial Interpolation of Rainfall Data To Create A Serially Complete Monthly Time Series of Precipitation for Sicily, Italy. International Journal of Applied Earth Observation and Geoinformation. 13(3), 396–408.

Kim, J. W. & Pachepsky, Y. A. 2010. Reconstructing Missing Daily Precipitation Data using Regression Trees and Artificial Neural Networks For SWAT Streamflow Simulation. Journal of Hydrology. 394(3–4), 305–314.

Lee, H. & Kang, K. 2015. Interpolation of Missing Precipitation Data Using Kernel Estimations for Hydrologic Modeling. Advances in Meteorology. 2015, 1–12.

Simolo, C., Brunetti, M., Maugeri, M. & Nanni, T. 2010. Improving Estimation of Missing Values In Daily Precipitation Series by a Probability Density Function-Preserving Approach. International Journal of Climatology. 30(10), 1564–1576.

Jemain, A. A., Mohd Deni, S., Syed Jamaludin, S. S. & Wan Zin, W. Z. 2015. Penyurihan Ikhtisar Data Hujan. Kuala Lumpur: Dewan Bahasa dan Pustaka.

Eischeid, J. K., Pasteris, P. A., Diaz, H. F., Plantico, M. S. & Lott, N. J. 2000. Creating a Serially Complete, National Daily Time Series of Temperature and Precipitation for The Western United States. Journal of Applied Meteorology. 39(9), 1580–1591.

Paulhus, J. L. H. & Kohler, M. A. 1952. Interpolation of Missing Precipitation Records. Monthly Weather Review. 80(8), 129–133.

Hasana, M. M. & Crokea, B. F. W. 2013. Filling Gaps in Daily Rainfall Data: A Statistical Approach. 20th International Congress on Modelling and Simulation. 1-6 December 2013. Adelaide, South Australia380–386.

Ramos-Calzado, P., Gomez-Camacho, J., Perez-Bernal, F. & Pita-Lopez, M. F. 2008. A Novel Approach to Precipitation Series Completion In Climatological Datasets: Application to Andalusia. International Journal of Climatology. 1525–1534.

Teegavarapu, R. S. V. & Chandramouli, V. 2005. Improved Weighting Methods, Deterministic and Stochastic Data-Driven Models for Estimation Of Missing Precipitation Records. Journal of Hydrology. 312(1–4), 191–206.

Zhang, S. 2008. Parimputation : From Imputation and Null-Imputation to Partially Imputation. IEEE Intelligent Informatics Bulletin, 9(1), 32–38.

Xia, Y., Fabian, P., Stohl, A. & Winterhalter, M. 1999. Forest Climatology: Estimation of Missing Values for Bavaria, Germany. Agricultural and Forest Meteorology. 96(1–3), 131–144.

Willmott, C. J., Robeson, S. M. & Feddema, J. J. 1994. Estimating Continental and Terrestrial Precipitation Averages From Rain-Gauge Networks. International Journal of Climatology. 14(4), 403–414.

Suhaila, J., Deni, S. M. & Jemain, A. A. 2008. Detecting inhomogeneity of rainfall series in Peninsular Malaysia. Asia-Pacific Journal of Atmospheric Sciences. 44(4), 369–380.

Young, K. C. 1992. A Three-Way Model for Interpolating for Monthly Precipitation Values. Monthly Weather Review. 120(11), 2561–2569.

Filippini, F., Galliani, G. & Pomi, L. 1994. The Estimation of Missing Meteorological Data in a Network of Automatic Stations. Transactions on Ecology and the Environment. 4(1), 14328–14336.

Ahrens, B. 2005. Distance in Spatial Interpolation of Daily Rain Gauge Data. Hydrology and Earth System Sciences Discussions. 2(5), 1893–1922.

Little, R. J. A. & Rubin, D. B. 2002. Statistical Analysis with Missing Data. New Jersey: John Wiley and Sons, Inc.

Schafer, J. L. 1997. Analysis of Incomplete Multivariate Data. New York: Chapman and Hall.

Little, R. J. A. & Rubin, D. B. 1987. Statistical Analysis With Missing Data. New York: John Wiley and Sons, Inc. 1987.

Presti, R. Lo, Barca, E. & Passarella, G. 2010. A Methodology for Treating Missing Data Applied to Daily Rainfall Data in the Candelaro River Basin (Italy). Environmental Monitoring and Assessment. 160(1–4), 1–22.

Moritz, S., Sardá, A., Bartz-Beielstein, T., Zaefferer, M. & Stork, J. 2015. Comparison of Different Methods for Univariate Time Series Imputation in R. arXiv preprint arXiv:1510.03924, 1–20.

Porth, L. S., Boes, D. C., Davis, R. A., Troendle, C. A. & King, R. M. 2001. Development of a Technique to Determine Adequate Sample Size Using Subsampling and Return Interval Estimation. Journal of Hydrology. 251(1–2), 110–116.

Kang, K. & Merwade, V. 2014. The Effect of Spatially Uniform and Non-Uniform Precipitation Bias Correction Methods on Improving NEXRAD Rainfall Accuracy for Distributed Hydrologic Modeling. Hydrology Research. 45(1), 23–42.

Evans, J. D. 1996. Straightforward Statistics for the Behavioral Sciences. University of California: Brooks/Cole Pub. Co.

Wijngaard, J. B., Klein Tank, A. M. G. & Können, G. P. 2003. Homogeneity of 20th Century European Daily Temperature and Precipitation Series. International Journal of Climatology. 23(6), 679–692.