K-means clustering of the spatial distribution of rainfall over the Korean peninsula
Journal of the Korean Data & Information Science Society 2022;33:359-88
Published online May 31, 2022;  https://doi.org/10.7465/jkdi.2022.33.3.359
© 2022 Korean Data and Information Science Society.

Jae-Heon Lee1 · Song-Lak Kang2

1,2 Department of Atmospheric & Environmental Sciences, Gangneung-Wonju National University
This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (2021R1A6A1A03044326 and 2021R1I1A3044379).
1 Researcher, Multi-scale ABL Laboratory, Department of Atmospheric & Environmental Sciences, Gangneung-Wonju National University, Gangneung 25457, Korea.
2 Professor, Multi-scale ABL Laboratory, Department of Atmospheric & Environmental Sciences, Gangneung-Wonju National University, Gangneung 25457, Korea. E-mail: slkang@gwnu.ac.kr
Received March 16, 2022; Revised April 1, 2022; Accepted April 22, 2022.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Using k-means clustering, we analyze the spatial distribution of annual and seasonal mean precipitation, averaged over the 48-year period from 1973 to 2020. We use precipitation data collected at 60 stations across the Korean peninsula. We construct several data sets, each excluding data that meet one of the following conditions: data measured 1) at high-elevation stations, 2) at island stations, or 3) during the Typhoon Rusa period. We employ the Calinski-Harabasz (CH) and Davies-Bouldin (DB) indices to find the optimal number of k-means clusters. The optimal number varies with the data set. We present the k-means clustering results along with the results of a sensitivity test across the different precipitation data sets.
Keywords: Annual mean precipitations, k-means clustering, seasonal mean precipitations.
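
The selection of the number of clusters described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the feature vectors, the software, or the k-means initialisation, so everything here is an assumption made for illustration. The `stations` list is synthetic (three tight groups of 2-D points standing in for station features), the initialisation is a simple deterministic farthest-point rule, and the function names are hypothetical. The CH index is computed as the ratio of between-cluster to within-cluster dispersion, each divided by its degrees of freedom (larger is better), and the DB index as the average, over clusters, of the worst ratio of summed scatters to centroid distance (smaller is better).

```python
import math

def mean(pts):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    return tuple(sum(c) / len(pts) for c in zip(*pts))

def kmeans(points, k, n_iter=100):
    """Lloyd's algorithm with a deterministic farthest-point initialisation
    (an assumption of this sketch, not the paper's stated method)."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next seed: the point farthest from its nearest existing seed.
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    labels = None
    for _ in range(n_iter):
        new_labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # keep the old centroid if a cluster empties out
                centroids[j] = mean(members)
    return labels

def clusters_of(points, labels):
    """Group points by label; only non-empty clusters are returned."""
    groups = {}
    for p, l in zip(points, labels):
        groups.setdefault(l, []).append(p)
    return list(groups.values())

def calinski_harabasz(points, labels):
    """CH = [B / (k - 1)] / [W / (n - k)]; needs 2 <= k < n and W > 0."""
    groups = clusters_of(points, labels)
    n, k = len(points), len(groups)
    overall = mean(points)
    cents = [mean(g) for g in groups]
    between = sum(len(g) * math.dist(c, overall) ** 2
                  for g, c in zip(groups, cents))
    within = sum(math.dist(p, c) ** 2 for g, c in zip(groups, cents) for p in g)
    return (between / (k - 1)) / (within / (n - k))

def davies_bouldin(points, labels):
    """DB = mean over clusters i of max_{j != i} (s_i + s_j) / d(c_i, c_j)."""
    groups = clusters_of(points, labels)
    cents = [mean(g) for g in groups]
    scatter = [sum(math.dist(p, c) for p in g) / len(g)
               for g, c in zip(groups, cents)]
    k = len(groups)
    worst = [max((scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
                 for j in range(k) if j != i)
             for i in range(k)]
    return sum(worst) / k

# Synthetic stand-in data: three compact groups of five points each.
centres = [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)]
offsets = [(-0.5, -0.5), (0.5, -0.5), (-0.5, 0.5), (0.5, 0.5), (0.0, 0.0)]
stations = [(cx + dx, cy + dy) for cx, cy in centres for dx, dy in offsets]

scores = {}
for k in range(2, 7):
    labels = kmeans(stations, k)
    scores[k] = (calinski_harabasz(stations, labels),
                 davies_bouldin(stations, labels))

best_ch = max(scores, key=lambda k: scores[k][0])  # CH: larger is better
best_db = min(scores, key=lambda k: scores[k][1])  # DB: smaller is better
print("best k by CH:", best_ch, "| best k by DB:", best_db)
```

On real station data the two indices need not agree, and rerunning the same loop on each filtered data set (for example with island stations removed) is one way to reproduce the kind of sensitivity comparison the abstract describes.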