Extracting XML data from HTML repositories

There is a vast amount of valuable information in HTML documents, widely distributed across the World Wide Web and across corporate intranets. Unfortunately, HTML is mainly presentation-oriented and hard to query. While XML is becoming a standard for online data representation and exchange, there is...

Bibliographic Details
Main Author: Zhang, Ruth Yuee
Format: Others
Language: English
Published: 2009
Online Access: http://hdl.handle.net/2429/15823