Design and Implementation of Data Preprocessing for Deep Learning with Image Recognition

Master's === National Kaohsiung University of Science and Technology === Department of Electrical Engineering === 107 === In the past, whether traditional machine learning succeeded at identifying images depended heavily on how feature values were extracted. The tremendous increase in hardware and computing power has allowed deep learning to emerge, a...

Bibliographic Details
Main Authors: LIN,CHUN-CHENG, 林俊丞
Other Authors: HUANG,KO-WEI
Format: Others
Language: zh-TW
Published: 2019
Online Access: http://ndltd.ncl.edu.tw/handle/592q48
id ndltd-TW-107NKUS0442006
record_format oai_dc
spelling ndltd-TW-107NKUS0442006 2019-10-31T05:22:34Z http://ndltd.ncl.edu.tw/handle/592q48 Design and Implementation of Data Preprocessing for Deep Learning with Image Recognition 應用前處理技術於深度學習之圖片辨識上 LIN,CHUN-CHENG 林俊丞 Master's National Kaohsiung University of Science and Technology Department of Electrical Engineering 107 HUANG,KO-WEI 黃科瑋 2019 學位論文 ; thesis 91 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Kaohsiung University of Science and Technology === Department of Electrical Engineering === 107 === In the past, whether traditional machine learning succeeded at identifying images depended heavily on how feature values were extracted. The tremendous increase in hardware and computing power has allowed deep learning to emerge, and its ability to extract features automatically can find good features that improve the performance of a recognition model. This study applies a variety of binarization pre-processing algorithms and implements an image recognition system with GoogLeNet, a deep-learning-based convolutional neural network. We use ImageNet, the world's largest image recognition database, and select five of its categories (cars, airplanes, horses, boats, and trucks) as training input data. The system converts each image to grayscale and binarizes it using a variety of algorithms, then trains GoogLeNet's Inception V1, Inception V2, and Inception V3 classifiers in an attempt to improve on the recognition accuracy of the original model. The work is implemented in two parts. First, in the image pre-processing part, images are pre-processed using six different thresholding methods drawn from the threshold functions provided by OpenCV and the Otsu algorithm. Second, the three model versions are trained and tested on the pre-processed data, and the parameters are adjusted and updated to analyze and compare the experimental results. Finally, TensorBoard is used to observe the training process. The resulting visualizations help us understand and adjust the model to find the optimal network model, and show that image pre-processing can effectively improve the recognition accuracy of the model.
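The grayscale and Otsu binarization step named in the abstract can be sketched as follows. This is a minimal illustration, not the thesis code: the record says the author used OpenCV but does not list the exact calls, so the sketch is written in pure Python, and the function names `to_gray`, `otsu_threshold`, and `binarize` are hypothetical. It assumes 8-bit pixel values and uses the common BT.601 luma weights for the grayscale conversion.

```python
# Illustrative sketch only: grayscale conversion and Otsu binarization in pure Python.
# Function names are hypothetical; the thesis itself relies on OpenCV.

def to_gray(rgb_pixels):
    """Convert (R, G, B) tuples to 8-bit gray values using BT.601 luma weights."""
    return [min(255, int(round(0.299 * r + 0.587 * g + 0.114 * b)))
            for r, g, b in rgb_pixels]


def otsu_threshold(gray_pixels):
    """Return the threshold t that maximizes the between-class variance.

    Pixels with value <= t form one class; pixels with value > t form the other.
    """
    hist = [0] * 256
    for v in gray_pixels:
        hist[v] += 1

    total = len(gray_pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_low = 0    # number of pixels with value <= t
    sum_low = 0  # sum of intensities of those pixels
    for t in range(256):
        w_low += hist[t]
        sum_low += t * hist[t]
        if w_low == 0:
            continue  # no pixels at or below t yet
        w_high = total - w_low
        if w_high == 0:
            break     # every pixel is at or below t; no split remains
        mean_low = sum_low / w_low
        mean_high = (sum_all - sum_low) / w_high
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_low * w_high * (mean_low - mean_high) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(gray_pixels, t):
    """Map each pixel to 255 if it exceeds the threshold t, otherwise to 0."""
    return [255 if v > t else 0 for v in gray_pixels]
```

For a flat list of gray values, `binarize(gray, otsu_threshold(gray))` yields the black-and-white pixel list that would then be fed to the classifier. Working on flat lists keeps the sketch free of dependencies; a real pipeline would operate on image arrays.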
author2 HUANG,KO-WEI
author LIN,CHUN-CHENG
林俊丞
title Design and Implementation of Data Preprocessing for Deep Learning with Image Recognition
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/592q48