Optimalisasi K-Means Cluster dengan Principal Component Analysis pada Pengelompokan Kabupaten/Kota di Pulau Kalimantan Berdasarkan Indikator Tingkat Pengangguran Terbuka

Rais, Muhammad; Goejantoro, Rito; Prangga, Surya

Please use this identifier to cite or link to this item: http://repository.unmul.ac.id/handle/123456789/15905

Title:	Optimalisasi K-Means Cluster dengan Principal Component Analysis pada Pengelompokan Kabupaten/Kota di Pulau Kalimantan Berdasarkan Indikator Tingkat Pengangguran Terbuka
Other Titles:	Optimization of K-Means Cluster with Principal Component Analysis on the Grouping of Districts/Cities on the Island of Kalimantan Based on Unemployment Rate Indicator
Authors:	Rais, Muhammad Goejantoro, Rito Prangga, Surya
Keywords:	DBI, Unemployment Rate Indicator, K-Means, PCA
Issue Date:	Nov-2021
Publisher:	Jurusan Matematika FMIPA Universitas Mulawarman
Series/Report no.:	Vol 12 No 2 (2021): Jurnal Eksponensial; Vol 12 No 2 (2021): Jurnal Eksponensial;
Abstract:	Data mining or often also called knowledge discovery in databases is an activity that includes collecting, using historical data to find regularity, patterns, or relationships in large data sets resulting in useful new information. Cluster analysis is an analysis that aims to group data based on its likeness. This research uses the K-Means method combined with PCA. The K-Means method groups data in the form of one or more clusters that share the same characteristics. While the PCA method was used to reduce research variables. This grouping method was applied to the data indicator of the unemployment rate of districts/cities in Kalimantan Island in 2018. The cluster validation used in this study was the DaviesBouldin Index (DBI). Based on the results of the analysis, it was concluded that the number of principal components formed was as many as 2 principal components. The most optimal grouping of districts/cities in Kalimantan island in 2018 was to use 2 clusters with a DBI value of 0,507. The grouping of districts/cities in Kalimantan Island in 2018 produced 2 clusters, cluster 1 consisting of 51 districts/cities and clusters of 2 consisting of 5 districts/cities. Cluster 1 was a cluster that has the highest percentage of the poor population and the highest labor force participation rate when compared to cluster 2. While cluster 2 was a cluster that has an index value of human development, population, number of the labor force, number of unemployed, population density, and the minimum wage of district/city was high compared to cluster 1.
URI:	http://repository.unmul.ac.id/handle/123456789/15905
ISSN:	2798-3455
Appears in Collections:	A - Mathematics and Natural Sciences

Files in This Item:

File	Description	Size	Format
Optimalisasi K-Means Cluster dengan Principal Component Analysis.pdf		540.26 kB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets