Comparative Analysis of K-Means and Hierarchical Clustering for Regional Welfare Disparity Identification in West Java Province
DOI:
https://doi.org/10.58776/ijitcsa.v3i3.213Keywords:
K-Means Clustering, Hierarchical Clustering, Welfare indicatorsAbstract
This study aims to cluster regencies/cities in West Java Province based on public welfare indicators using the K-Means Clustering and Hierarchical Clustering methods. The data used includes health, economic, population density, and average length of schooling indicators in 2023. Cluster quality evaluation was performed using the silhouette score. The results show that K-Means Clustering with five clusters yields the highest silhouette score of 0.219. For comparison, Hierarchical Clustering with the Ward Linkage method and eight clusters was chosen, having a silhouette score of 0.202, which is the largest among other Hierarchical Clustering methods. The identification of each cluster's characteristics in K-Means reveals areas with multidimensional challenges (Cluster 1), industrial areas with unemployment issues (Cluster 2), areas with high stunting prevalence despite good access to basic facilities (Cluster 3), densely populated urban areas with good welfare but high unemployment (Cluster 4), and areas with very high health complaints and low welfare (Cluster 5). K-Means clusters (except Cluster 4) tend to have a low average length of schooling, below 12 years. Consistency in cluster patterns was found between K-Means and Ward Linkage, especially in advanced urban areas and areas with multidimensional welfare challenges in southern West Java. These findings are expected to serve as a reference for the government and policymakers in formulating more targeted and effective development strategies.
References
M. Musrafiyan, "POTENSI PEMBANGUNAN KAWASAN EKONOMI KHUSUS (KEK) HALAL BARSELA SEBAGAI DESTINASI PARIWISATA PRIORITAS DI ERA SOCIETY 5.0," PROCEEDINGS ICIS 2021, vol. 1, no. 1, 2021.
Badan Pusat Statistik Indonesia, "Indikator Kesejahteraan Rakyat 2024," 2024. Accessed: Mar. 12, 2025. [Online]. Available: https://www.bps.go.id/id/publication/2024/11/06/3ef10d3d82ed93f616ba9113/indikator-kesejahteraan-rakyat-2024.html
Badan Pusat Statistik Provinsi Jawa Barat, "Provinsi Jawa Barat Dalam Angka 2024," Feb. 2024. [Online]. Available: https://jabar.bps.go.id/id/publication/2024/02/28/35ffe2d35104b39feb577e8f/provinsi-jawa-barat-dalam-angka-2024.html
I. Wahyuni and S. P. Wulandari, "Pemetaan Kabupaten/Kota di Jawa Timur Berdasarkan Indikator Kesejahteraan Rakyat Menggunakan Analisis Cluster Hierarki," JURNAL SAINS DAN SENI ITS, vol. 11, no. 1, 2022.
M. H. Asnawi and P. Rahmah, "Analisis Klaster Hirarki untuk Mengelompokan Provinsi di Indonesia berdasarkan Indikator Kesejahteraan Rakyat," SEMINAR NASIONAL STATISTIKA X, 2021, doi: 10.1234/pns.v10i.84.
D. Andiani, S. Dwi, R. Septiani, and A. Riana, "Analisis Teknik non-Hierarki untuk Pengelompokan Kabupaten/Kota di Provinsi Jawa Barat Berdasarkan Indikator Kesejahteraan Rakyat 2020," Jurnal Riset Matematika dan Sains Terapan, vol. 21, no. 1, pp. 21-28, 2022.
N. Oktaviani, A. Fauzan, and G. Widyastuti, "Pengelompokan Kabupaten/Kota di Jawa Barat Berdasarkan Tingkat Kesejahteraan Masyarakat Menggunakan K-Means Cluster," Emerging Statistics and Data Science Journal, vol. 2, no. 2, 2024.
A. Eka Putra Haryanto, M. Ulfa Yanuar, D. Statistika Bisnis, and F. Vokasi, "Metode K-Means Clustering untuk Pengelompokan Kabupaten/Kota dalam Upaya Pengendalian Tingkat Inflasi di Pulau Jawa dan Sumatera K-Means Clustering Method for District/City Grouping in Effort to Control Inflation Rates in Java and Sumatera," pp. 29-42, 2022, doi: 10.21787/govstat.1.1.2022.29-42.
J. Han, J. Pei, and T. Hanghang, Data Mining: Concepts and Techniques, 4th ed. Morgan Kaufmann, 2022.
U. Syafiyah, D. Puspitasari, I. Asrafi, B. Wicaksono, and F. M. Sirait, "Analisis Perbandingan Hierarchical dan Non-Hierarchical Clustering Pada Data Indikator Ketenagakerjaan di Jawa Barat Tahun 2020," Seminar Nasional Official Statistics, vol. 2022, no. 1, pp. 803-812, Nov. 2022, doi: 10.34123/semnasoffstat.v2022i1.1221.
Daniel Wicaksono Nugroho, Farhan Bramhatchi, Sri Pingit Wulandari, and Albertus Eka, "Pengelompokan Indikator Kesejahteraan Masyarakat Berdasarkan Kabupaten/Kota di Jawa Tengah Tahun 2023 Menggunakan Analisis Cluster," Switch: Jurnal Sains dan Teknologi Informasi, vol. 2, no. 5, pp. 87-101, Nov. 2024, doi: 10.62951/switch.v2i5.285.
A. N. Alifah, H. N. Fadhilah, and T. M. Sianipar, "Klasterisasi Kabupaten/Kota di Jawa Barat Berdasarkan Tingkat Kenyamanan dengan Metode K-Means Clustering," Seminar Nasional Sains Data, vol. 2022.
M. Cui and others, "Introduction to the k-means clustering algorithm based on the elbow method," Accounting, Auditing and Finance, vol. 1, no. 1, pp. 5-8, 2020.
P. J. Rousseeuw, "Silhouettes: a graphical aid to the interpretation and validation of cluster analysis," J Comput Appl Math, vol. 20, pp. 53-65, 1987.
L. Hakim and A. Saefuddin, Introduction to machine learning using R: konsep, teori, dan praktik. IPB Press, 2022.
A. R. Damayanti and A. W. Wijayanto, "Comparison of hierarchical and non-hierarchical methods in clustering cities in Java Island using the human development index indicators year 2018," Eigen Mathematics Journal, pp. 8-17, 2021.
N. Thamrin and A. W. Wijayanto, "Comparison of Soft and Hard Clustering: A Case Study on Welfare Level in Cities on Java Island," Indonesian Journal of Statistics and Its Applications, vol. 5, no. 1, pp. 141-160, Mar. 2021, doi: 10.29244/ijsa.v5i1p141-160.
E. S. Barry, J. Merkebu, and L. Varpio, "State-of-the-art literature review methodology: A six-step approach for knowledge synthesis," Perspect Med Educ, vol. 11, no. 5, pp. 281-288, Oct. 2022, doi: 10.1007/s40037-022-00725-9.
T. A. Munandar, Data Mining Menggunakan R Teori dan Praktik, 1st ed. Serang: PT Bale Damar Publishing, 2023.
E. Suprihadi, Machine Learning: Dasar dan Praktis. Yogyakarta: Deepublish, 2022.
P. Palinggik Allorerung, A. Erna, M. Bagussahrir, and S. Alam, "Analisis Performa Normalisasi Data untuk Klasifikasi K-Nearest Neighbor pada Dataset Penyakit," 2024.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Muhamad Dani Yusuf, Tb Ai Munandar, Khairunnisa Fadhilla Ramdhania

This work is licensed under a Creative Commons Attribution 4.0 International License.
Attribution 4.0 International
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.


