A Novel Hybrid Focused Crawling Algorithm to Build Domain-Specific Collections

The Web, containing a large amount of useful information and resources, is expanding rapidly. Collecting domain-specific documents/information from the Web is one of the most important methods to build digital libraries for the scientific community. Focused Crawlers can selectively retrieve Web docu...

Bibliographic Details
Main Author: Chen, Yuxin
Other Authors: Computer Science
Format: Others
Published: Virginia Tech 2014
Online Access: http://hdl.handle.net/10919/26220
http://scholar.lib.vt.edu/theses/available/etd-02162007-005107/