Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map

碩士 === 淡江大學 === 水資源及環境工程學系碩士班 === 106 === Today, Artificial Intelligence is one of popular issues with many research topics and practical applications. The relative AI issues on the study of water resource management or flood forecast have become one of important topics. The purpose of this study is...

Full description

Bibliographic Details
Main Authors: Wu-Han Wang, 王梧翰
Other Authors: Li-Chiu Chang
Format: Others
Language:zh-TW
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/sdu4s7
id ndltd-TW-106TKU05087003
record_format oai_dc
spelling ndltd-TW-106TKU050870032019-09-12T03:37:44Z http://ndltd.ncl.edu.tw/handle/sdu4s7 Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map 探討自組特徵映射網路有效之評估指標-以區域淹水分類為例 Wu-Han Wang 王梧翰 碩士 淡江大學 水資源及環境工程學系碩士班 106 Today, Artificial Intelligence is one of popular issues with many research topics and practical applications. The relative AI issues on the study of water resource management or flood forecast have become one of important topics. The purpose of this study is to propose the methodology to automatically build the Self-organizing maps (SOM) on clustering the flood spatial distribution. There are three major problems on building the SOM model; first one is the topological error, that is, any two neurons flip each other weights that makes the order of the topological map; second one is to the selection of the number of epochs. The training algorithm of SOM has two phases, ordering phase and convergent phase. Hence, these two phases have the different number of epochs and the number of epochs can influence the convergence; third one is to decide the optimal size. This study proposes two training strategies of the SOM models and takes Luermen Creek and Yenshui Creek located in Tainan, and Kemaman River located in Terengganu of Malaysia to investigate the convergence of the SOM models. The first strategy, called plan1, is to train the network in the ordering phase until the weights of the neurons have no obvious change, then transfer to the convergent phase and continue training the neurons until the weights have no obvious change. The second strategy, called plan2, is to rain the network in the ordering phase until the coverage rate of weights reaches 50%, then transfer to the convergent phase and continue training the same as the convergent phase of plan1. We use the flood simulation data of these three areas as the training data to build their own models. Through the different training strategy of plan1 and plan2, we can explore the influences of the ordering and convergent phases on building the SOM models. Through coverage rate, flip detector and five indices to compare the clustering results of the SOM clustering results. The coverage rate is defined as the difference of the cumulative distribution rates between maximum and minimum weights (neurons). The flip detector can check whether any two or more neurons flip each other weights or not and determine topological order correct or not. The clustering results of these three cases show that the number of epochs can influence the coverage rate and effectively improve the clustering quality. The larger number of epochs can get the larger coverage rate. The results show that plan2 can get convergent clustering results while plan1 occurs flip in Luermen Creek and Kemaman River. Hence plan2 is more suitable than plan1 for applying the SOM model on clustering the flood spatial distribution. Moreover, for comparison of the different size of the SOM models, the results demonstrate that the coverage rates of 3×3 model are smaller than those of 4×4 and 5×5 models, about 5%-10% less. That means 3×3 model cannot describe the characteristics of data as well as 4×4 and 5×5 models. The coverage rates of 4×4 and 5×5 models are almost the same, so the small models should be enough neurons to describe the data, that is, 4×4 is an appropriate size than other models. Hence, for choosing the size of topology map, the coverage rate is the great index to decide the optimal size. Li-Chiu Chang 張麗秋 2018 學位論文 ; thesis 71 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 淡江大學 === 水資源及環境工程學系碩士班 === 106 === Today, Artificial Intelligence is one of popular issues with many research topics and practical applications. The relative AI issues on the study of water resource management or flood forecast have become one of important topics. The purpose of this study is to propose the methodology to automatically build the Self-organizing maps (SOM) on clustering the flood spatial distribution. There are three major problems on building the SOM model; first one is the topological error, that is, any two neurons flip each other weights that makes the order of the topological map; second one is to the selection of the number of epochs. The training algorithm of SOM has two phases, ordering phase and convergent phase. Hence, these two phases have the different number of epochs and the number of epochs can influence the convergence; third one is to decide the optimal size. This study proposes two training strategies of the SOM models and takes Luermen Creek and Yenshui Creek located in Tainan, and Kemaman River located in Terengganu of Malaysia to investigate the convergence of the SOM models. The first strategy, called plan1, is to train the network in the ordering phase until the weights of the neurons have no obvious change, then transfer to the convergent phase and continue training the neurons until the weights have no obvious change. The second strategy, called plan2, is to rain the network in the ordering phase until the coverage rate of weights reaches 50%, then transfer to the convergent phase and continue training the same as the convergent phase of plan1. We use the flood simulation data of these three areas as the training data to build their own models. Through the different training strategy of plan1 and plan2, we can explore the influences of the ordering and convergent phases on building the SOM models. Through coverage rate, flip detector and five indices to compare the clustering results of the SOM clustering results. The coverage rate is defined as the difference of the cumulative distribution rates between maximum and minimum weights (neurons). The flip detector can check whether any two or more neurons flip each other weights or not and determine topological order correct or not. The clustering results of these three cases show that the number of epochs can influence the coverage rate and effectively improve the clustering quality. The larger number of epochs can get the larger coverage rate. The results show that plan2 can get convergent clustering results while plan1 occurs flip in Luermen Creek and Kemaman River. Hence plan2 is more suitable than plan1 for applying the SOM model on clustering the flood spatial distribution. Moreover, for comparison of the different size of the SOM models, the results demonstrate that the coverage rates of 3×3 model are smaller than those of 4×4 and 5×5 models, about 5%-10% less. That means 3×3 model cannot describe the characteristics of data as well as 4×4 and 5×5 models. The coverage rates of 4×4 and 5×5 models are almost the same, so the small models should be enough neurons to describe the data, that is, 4×4 is an appropriate size than other models. Hence, for choosing the size of topology map, the coverage rate is the great index to decide the optimal size.
author2 Li-Chiu Chang
author_facet Li-Chiu Chang
Wu-Han Wang
王梧翰
author Wu-Han Wang
王梧翰
spellingShingle Wu-Han Wang
王梧翰
Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
author_sort Wu-Han Wang
title Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
title_short Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
title_full Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
title_fullStr Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
title_full_unstemmed Exploring the Effective Evaluation Indices of Self-Organizing Map for Clustering Regional Flood Inundation Map
title_sort exploring the effective evaluation indices of self-organizing map for clustering regional flood inundation map
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/sdu4s7
work_keys_str_mv AT wuhanwang exploringtheeffectiveevaluationindicesofselforganizingmapforclusteringregionalfloodinundationmap
AT wángwúhàn exploringtheeffectiveevaluationindicesofselforganizingmapforclusteringregionalfloodinundationmap
AT wuhanwang tàntǎozìzǔtèzhēngyìngshèwǎnglùyǒuxiàozhīpínggūzhǐbiāoyǐqūyùyānshuǐfēnlèiwèilì
AT wángwúhàn tàntǎozìzǔtèzhēngyìngshèwǎnglùyǒuxiàozhīpínggūzhǐbiāoyǐqūyùyānshuǐfēnlèiwèilì
_version_ 1719250025653993472