SIMHAR - Smart Distributed Web Crawler for the Hidden Web Using SIM+Hash and Redis Server

Developing a distributed web crawler poses major engineering challenges, all of which are ultimately related to scale. To maintain a search engine's corpus at a reasonable state of freshness, the crawler must be distributed over multiple computers. In distributed crawling, crawling agents are giv...

Bibliographic Details
Published in: IEEE Access
Main Authors: Sawroop Kaur, G. Geetha
Format: Article
Language: English
Published: IEEE 2020-01-01
Subjects:
Online Access: https://ieeexplore.ieee.org/document/9123854/