Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia

Air quality is one of the most popular environmental problems in this globalization era. Air pollution is the poisonous air that comes from car emissions, smog, open burning, chemicals from factories and other particles and gases. This harmful air can give adverse effects to human health and the env...

Full description

Bibliographic Details
Main Authors: Sitti Sufiah Atirah Rosly, Balkiah Moktar, Muhamad Hasbullah Mohd Razali
Format: Article
Language:English
Published: Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Perlis 2018-01-01
Series:Journal of Computing Research and Innovation
Subjects:
Online Access:https://crinn.conferencehunter.com/index.php/jcrinn/article/view/28
id doaj-e64f597528e9450bb025aa0ca8063f29
record_format Article
spelling doaj-e64f597528e9450bb025aa0ca8063f292021-02-01T02:32:12ZengFaculty of Computer and Mathematical Sciences, Universiti Teknologi MARA PerlisJournal of Computing Research and Innovation2600-87932018-01-0121364419Comparison of Clustering Algorithms on Air Quality Substances in Peninsular MalaysiaSitti Sufiah Atirah RoslyBalkiah MoktarMuhamad Hasbullah Mohd RazaliAir quality is one of the most popular environmental problems in this globalization era. Air pollution is the poisonous air that comes from car emissions, smog, open burning, chemicals from factories and other particles and gases. This harmful air can give adverse effects to human health and the environment. In order to provide information which areas are better for the residents in Malaysia, cluster analysis is used to determine the areas that can be clustering together based on their air quality through several air quality substances. Monthly data from 37 monitoring stations in Peninsular Malaysia from the year 2013 to 2015 were used in this study. K-Means (KM) clustering algorithm, Expectation Maximization (EM) clustering algorithm and Density Based (DB) clustering algorithm have been chosen as the techniques to analyze the cluster analysis by utilizing the Waikato Environment for Knowledge Analysis (WEKA) tools. Results show that K-means clustering algorithm is the best method among other algorithms due to its simplicity and time taken to build the model. The output of K-means clustering algorithm shows that it can cluster the area into two clusters, namely as cluster 0 and cluster 1. Clusters 0 consist of 16 monitoring stations and cluster 1 consists of 36 monitoring stations in Peninsular Malaysia.https://crinn.conferencehunter.com/index.php/jcrinn/article/view/28air qualityclustering algorithmwekak-meandensity basedexpectation maximization
collection DOAJ
language English
format Article
sources DOAJ
author Sitti Sufiah Atirah Rosly
Balkiah Moktar
Muhamad Hasbullah Mohd Razali
spellingShingle Sitti Sufiah Atirah Rosly
Balkiah Moktar
Muhamad Hasbullah Mohd Razali
Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
Journal of Computing Research and Innovation
air quality
clustering algorithm
weka
k-mean
density based
expectation maximization
author_facet Sitti Sufiah Atirah Rosly
Balkiah Moktar
Muhamad Hasbullah Mohd Razali
author_sort Sitti Sufiah Atirah Rosly
title Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
title_short Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
title_full Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
title_fullStr Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
title_full_unstemmed Comparison of Clustering Algorithms on Air Quality Substances in Peninsular Malaysia
title_sort comparison of clustering algorithms on air quality substances in peninsular malaysia
publisher Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Perlis
series Journal of Computing Research and Innovation
issn 2600-8793
publishDate 2018-01-01
description Air quality is one of the most popular environmental problems in this globalization era. Air pollution is the poisonous air that comes from car emissions, smog, open burning, chemicals from factories and other particles and gases. This harmful air can give adverse effects to human health and the environment. In order to provide information which areas are better for the residents in Malaysia, cluster analysis is used to determine the areas that can be clustering together based on their air quality through several air quality substances. Monthly data from 37 monitoring stations in Peninsular Malaysia from the year 2013 to 2015 were used in this study. K-Means (KM) clustering algorithm, Expectation Maximization (EM) clustering algorithm and Density Based (DB) clustering algorithm have been chosen as the techniques to analyze the cluster analysis by utilizing the Waikato Environment for Knowledge Analysis (WEKA) tools. Results show that K-means clustering algorithm is the best method among other algorithms due to its simplicity and time taken to build the model. The output of K-means clustering algorithm shows that it can cluster the area into two clusters, namely as cluster 0 and cluster 1. Clusters 0 consist of 16 monitoring stations and cluster 1 consists of 36 monitoring stations in Peninsular Malaysia.
topic air quality
clustering algorithm
weka
k-mean
density based
expectation maximization
url https://crinn.conferencehunter.com/index.php/jcrinn/article/view/28
work_keys_str_mv AT sittisufiahatirahrosly comparisonofclusteringalgorithmsonairqualitysubstancesinpeninsularmalaysia
AT balkiahmoktar comparisonofclusteringalgorithmsonairqualitysubstancesinpeninsularmalaysia
AT muhamadhasbullahmohdrazali comparisonofclusteringalgorithmsonairqualitysubstancesinpeninsularmalaysia
_version_ 1724315918343864320