A Machine Learning Based Approach to WebExtraction from Template Pages

碩士 === 國立中央大學 === 資訊工程學系碩士在職專班 === 98 === A huge amount of information on the World Wide Web has a structured HTML form as they are generated dynamically from databases and have the same template. This paper proposes a page-level web data extraction system FiVaTech2 that extracts schema and template...

Full description

Bibliographic Details
Main Authors: Chih-Hao Chang, 張志豪
Other Authors: Chia-Hui Chang
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/35548787181124476380