Methods for imputation of missing values in air quality data sets
碩士 === 國立中興大學 === 環境工程學系所 === 96 === In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 di...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/11587976008880936667 |
id |
ndltd-TW-096NCHU5087041 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096NCHU50870412016-05-09T04:13:47Z http://ndltd.ncl.edu.tw/handle/11587976008880936667 Methods for imputation of missing values in air quality data sets 探討空氣品質資料庫中遺失值補值的方法 Chi-Lan Hung 洪啟嵐 碩士 國立中興大學 環境工程學系所 96 In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 divided Taiwan into 7 air quality districts based on their different pollution features, the terrain, and the climate in each district. They have already set up over 70 air quality monitoring stations in Taiwan. Because of the air quality monitoring stations which monitored air quality condition of each place, we are able to have the historical information of air quality of everywhere for everybody to research and find out the useful information from this huge database. But around 10% of data are missing only during the transmission process. However, the missing values of the database will affect the data analysis, hence, it is very important to resolve the missing value problem. My research uses Inverse Square Distance Weighting method and Kriging method to impute the missing values, and discusses, analyzes, and compares the result of using the 2 different methods. Monte Carlo method is then used to test and verify which one is the better method to yield the accurate values to replace the missing values. After my research, the absolute error of Inverse Square Distance Weighting is smaller than it of Monte Carlo method for imputation of air quality data. After verifying, the absolute error of Inverse Square Distance Weighting and Kriging method is respectively 25% and 19%. It shows that Kriging method is better imputation method than Inverse Square Distance Weighting. 望熙榮 2008 學位論文 ; thesis 95 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中興大學 === 環境工程學系所 === 96 === In order to discuss the problem of the air quality, to understand the pollution and the different pollution transmission in each place, and to design strategies to control and improve the contamination, Environmental Protection Administration in Taiwan in 1994 divided Taiwan into 7 air quality districts based on their different pollution features, the terrain, and the climate in each district. They have already set up over 70 air quality monitoring stations in Taiwan. Because of the air quality monitoring stations which monitored air quality condition of each place, we are able to have the historical information of air quality of everywhere for everybody to research and find out the useful information from this huge database. But around 10% of data are missing only during the transmission process. However, the missing values of the database will affect the data analysis, hence, it is very important to resolve the missing value problem. My research uses Inverse Square Distance Weighting method and Kriging method to impute the missing values, and discusses, analyzes, and compares the result of using the 2 different methods. Monte Carlo method is then used to test and verify which one is the better method to yield the accurate values to replace the missing values.
After my research, the absolute error of Inverse Square Distance Weighting is smaller than it of Monte Carlo method for imputation of air quality data. After verifying, the absolute error of Inverse Square Distance Weighting and Kriging method is respectively 25% and 19%. It shows that Kriging method is better imputation method than Inverse Square Distance Weighting.
|
author2 |
望熙榮 |
author_facet |
望熙榮 Chi-Lan Hung 洪啟嵐 |
author |
Chi-Lan Hung 洪啟嵐 |
spellingShingle |
Chi-Lan Hung 洪啟嵐 Methods for imputation of missing values in air quality data sets |
author_sort |
Chi-Lan Hung |
title |
Methods for imputation of missing values in air quality data sets |
title_short |
Methods for imputation of missing values in air quality data sets |
title_full |
Methods for imputation of missing values in air quality data sets |
title_fullStr |
Methods for imputation of missing values in air quality data sets |
title_full_unstemmed |
Methods for imputation of missing values in air quality data sets |
title_sort |
methods for imputation of missing values in air quality data sets |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/11587976008880936667 |
work_keys_str_mv |
AT chilanhung methodsforimputationofmissingvaluesinairqualitydatasets AT hóngqǐlán methodsforimputationofmissingvaluesinairqualitydatasets AT chilanhung tàntǎokōngqìpǐnzhìzīliàokùzhōngyíshīzhíbǔzhídefāngfǎ AT hóngqǐlán tàntǎokōngqìpǐnzhìzīliàokùzhōngyíshīzhíbǔzhídefāngfǎ |
_version_ |
1718262644922646528 |