A Distributed Approach to Crawl Domain Specific Hidden Web
A large amount of on-line information resides on the invisible web - web pages generated dynamically from databases and other data sources hidden from current crawlers which retrieve content only from the publicly indexable Web. Specially, they ignore the tremendous amount of high quality content &q...
Main Author: | |
---|---|
Format: | Others |
Published: |
Digital Archive @ GSU
2007
|
Subjects: | |
Online Access: | http://digitalarchive.gsu.edu/cs_theses/47 http://digitalarchive.gsu.edu/cgi/viewcontent.cgi?article=1046&context=cs_theses |