Methods for imputation of missing values in air quality data sets

碩士 === 國立中興大學 === 環境工程學系所 === 96 === In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 di...

Full description

Bibliographic Details
Main Authors: Chi-Lan Hung, 洪啟嵐
Other Authors: 望熙榮
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/11587976008880936667
id ndltd-TW-096NCHU5087041
record_format oai_dc
spelling ndltd-TW-096NCHU50870412016-05-09T04:13:47Z http://ndltd.ncl.edu.tw/handle/11587976008880936667 Methods for imputation of missing values in air quality data sets 探討空氣品質資料庫中遺失值補值的方法 Chi-Lan Hung 洪啟嵐 碩士 國立中興大學 環境工程學系所 96 In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 divided Taiwan into 7 air quality districts based on their different pollution features, the terrain, and the climate in each district. They have already set up over 70 air quality monitoring stations in Taiwan. Because of the air quality monitoring stations which monitored air quality condition of each place, we are able to have the historical information of air quality of everywhere for everybody to research and find out the useful information from this huge database. But around 10% of data are missing only during the transmission process. However, the missing values of the database will affect the data analysis, hence, it is very important to resolve the missing value problem. My research uses Inverse Square Distance Weighting method and Kriging method to impute the missing values, and discusses, analyzes, and compares the result of using the 2 different methods. Monte Carlo method is then used to test and verify which one is the better method to yield the accurate values to replace the missing values. After my research, the absolute error of Inverse Square Distance Weighting is smaller than it of Monte Carlo method for imputation of air quality data. After verifying, the absolute error of Inverse Square Distance Weighting and Kriging method is respectively 25% and 19%. It shows that Kriging method is better imputation method than Inverse Square Distance Weighting. 望熙榮 2008 學位論文 ; thesis 95 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中興大學 === 環境工程學系所 === 96 === In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 divided Taiwan into 7 air quality districts based on their different pollution features, the terrain, and the climate in each district. They have already set up over 70 air quality monitoring stations in Taiwan. Because of the air quality monitoring stations which monitored air quality condition of each place, we are able to have the historical information of air quality of everywhere for everybody to research and find out the useful information from this huge database. But around 10% of data are missing only during the transmission process. However, the missing values of the database will affect the data analysis, hence, it is very important to resolve the missing value problem. My research uses Inverse Square Distance Weighting method and Kriging method to impute the missing values, and discusses, analyzes, and compares the result of using the 2 different methods. Monte Carlo method is then used to test and verify which one is the better method to yield the accurate values to replace the missing values. After my research, the absolute error of Inverse Square Distance Weighting is smaller than it of Monte Carlo method for imputation of air quality data. After verifying, the absolute error of Inverse Square Distance Weighting and Kriging method is respectively 25% and 19%. It shows that Kriging method is better imputation method than Inverse Square Distance Weighting.
author2 望熙榮
author_facet 望熙榮
Chi-Lan Hung
洪啟嵐
author Chi-Lan Hung
洪啟嵐
spellingShingle Chi-Lan Hung
洪啟嵐
Methods for imputation of missing values in air quality data sets
author_sort Chi-Lan Hung
title Methods for imputation of missing values in air quality data sets
title_short Methods for imputation of missing values in air quality data sets
title_full Methods for imputation of missing values in air quality data sets
title_fullStr Methods for imputation of missing values in air quality data sets
title_full_unstemmed Methods for imputation of missing values in air quality data sets
title_sort methods for imputation of missing values in air quality data sets
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/11587976008880936667
work_keys_str_mv AT chilanhung methodsforimputationofmissingvaluesinairqualitydatasets
AT hóngqǐlán methodsforimputationofmissingvaluesinairqualitydatasets
AT chilanhung tàntǎokōngqìpǐnzhìzīliàokùzhōngyíshīzhíbǔzhídefāngfǎ
AT hóngqǐlán tàntǎokōngqìpǐnzhìzīliàokùzhōngyíshīzhíbǔzhídefāngfǎ
_version_ 1718262644922646528