Research and Development of Natural Language Query System in Insurance Domain

Master's === National Taipei University === Graduate Institute of Information Management === 107 === In recent years, Taiwan's insurance market has been growing rapidly, and in 2018 hundreds of thousands of people worked in the insurance industry. When people become aware of their needs, some use search engines to search documents, while others start...

Bibliographic Details
Main Authors: ZHOU,ZI-CE, 周子策
Other Authors: FANG-TSOU,CHAO-TSONG
Format: Others
Language: zh-TW
Published: 2019
Online Access: http://ndltd.ncl.edu.tw/handle/y24y7d
id ndltd-TW-107NTPU0396009
record_format oai_dc
spelling ndltd-TW-107NTPU0396009 2019-07-18T03:56:21Z http://ndltd.ncl.edu.tw/handle/y24y7d Research and Development of Natural Language Query System in Insurance Domain 保險領域之自然語言查詢系統應用研發 ZHOU,ZI-CE 周子策 Master's National Taipei University Graduate Institute of Information Management 107 In recent years, Taiwan's insurance market has been growing rapidly, and in 2018 hundreds of thousands of people worked in the insurance industry. When people become aware of their needs, some use search engines to search documents, while others post questions on Community Question Answering (CQA) sites and wait for other users to answer. Another common approach is to look for similar questions directly in the CQA archive of past questions and answers, but finding similar questions in a large data set is a major challenge. This study builds a question-and-answer corpus from CQA and integrates several information retrieval strategies into a question answering system for the insurance domain. The strategies include query expansion, word embeddings, text similarity, and the traditional BM25 retrieval method. Finally, this study addresses a shortcoming of BM25, namely that its IDF does not fit the actual application scenario, and proposes an insurance-specific important-word weighting method to improve the IDF. FANG-TSOU,CHAO-TSONG 方鄒昭聰 2019 學位論文 ; thesis 72 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Taipei University === Graduate Institute of Information Management === 107 === In recent years, Taiwan's insurance market has been growing rapidly, and in 2018 hundreds of thousands of people worked in the insurance industry. When people become aware of their needs, some use search engines to search documents, while others post questions on Community Question Answering (CQA) sites and wait for other users to answer. Another common approach is to look for similar questions directly in the CQA archive of past questions and answers, but finding similar questions in a large data set is a major challenge. This study builds a question-and-answer corpus from CQA and integrates several information retrieval strategies into a question answering system for the insurance domain. The strategies include query expansion, word embeddings, text similarity, and the traditional BM25 retrieval method. Finally, this study addresses a shortcoming of BM25, namely that its IDF does not fit the actual application scenario, and proposes an insurance-specific important-word weighting method to improve the IDF.
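The abstract above names BM25 and an adjusted IDF but does not spell out the thesis's weighting method. As a rough illustration only, the following sketch shows standard BM25 scoring with an optional per-term multiplier on the IDF. The `term_weight` parameter, the function name `bm25_scores`, and the toy documents are assumptions made for this example and are not taken from the thesis.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75, term_weight=None):
    """Score each tokenized document in `docs` against the tokenized `query`.

    `term_weight` is a hypothetical dict of per-term multipliers applied to
    the IDF; it stands in for a domain "important word" weight and is not
    the method proposed in the thesis.
    """
    term_weight = term_weight or {}
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for t in query:
            if tf[t] == 0:
                continue
            # A common non-negative IDF variant for BM25.
            idf = math.log(1 + (n_docs - df[t] + 0.5) / (df[t] + 0.5))
            # Assumed domain boost; a weight of 1.0 leaves plain BM25 unchanged.
            idf *= term_weight.get(t, 1.0)
            norm = tf[t] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[t] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Toy corpus (invented for illustration).
docs = [
    ["car", "insurance", "claim"],
    ["travel", "insurance", "refund"],
    ["weather", "today"],
]
query = ["insurance", "claim"]

plain = bm25_scores(query, docs)
boosted = bm25_scores(query, docs, term_weight={"claim": 2.0})
```

With these inputs, boosting "claim" raises only the score of the first document, since it is the only one containing that term. How the actual important-word weights are derived is described in the thesis itself, not here.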
author2 FANG-TSOU,CHAO-TSONG
author_facet FANG-TSOU,CHAO-TSONG
ZHOU,ZI-CE
周子策
author ZHOU,ZI-CE
周子策
spellingShingle ZHOU,ZI-CE
周子策
Research and Development of Natural Language Query System in Insurance Domain
author_sort ZHOU,ZI-CE
title Research and Development of Natural Language Query System in Insurance Domain
title_short Research and Development of Natural Language Query System in Insurance Domain
title_full Research and Development of Natural Language Query System in Insurance Domain
title_fullStr Research and Development of Natural Language Query System in Insurance Domain
title_full_unstemmed Research and Development of Natural Language Query System in Insurance Domain
title_sort research and development of natural language query system in insurance domain
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/y24y7d
work_keys_str_mv AT zhouzice researchanddevelopmentofnaturallanguagequerysystemininsurancedomain
AT zhōuzicè researchanddevelopmentofnaturallanguagequerysystemininsurancedomain
AT zhouzice bǎoxiǎnlǐngyùzhīzìrányǔyáncháxúnxìtǒngyīngyòngyánfā
AT zhōuzicè bǎoxiǎnlǐngyùzhīzìrányǔyáncháxúnxìtǒngyīngyòngyánfā
_version_ 1719228361437347840