Information Retrieval System: a Domain Specific Parallel Crawler - Nidhi Tyagi - 图书 - VDM Verlag Dr. Müller - 9783639377798 - 2011年8月24日
如封面与标题不符,以标题为准

Information Retrieval System: a Domain Specific Parallel Crawler


商品到货时接收邮件提醒
Do you have a profile? 登录
添加至iMusic心愿单

Not rated yet

The World Wide Web is an interlinked collection of billions of documents formatted using HTML. Due to the growing and dynamic nature of the web, it has become a challenge to traverse all URLs in the web documents and handle these URLs, so it has become imperative to parallelize a crawling process. The crawler process is further being parallelized in the form ecology of crawler workers that parallely download information from the web. This paper proposes a novel architecture of parallel crawler, which is based on domain specific crawling, makes crawling task more effective, scalable and load-sharing among the different crawlers which parallel download web pages related to different domains specific URLs.

介质类型 图书     Paperback Book   (平装胶订图书)
已发行 2011年8月24日
ISBN13 9783639377798
出版商 VDM Verlag Dr. Müller
页数 92
商品尺寸 150 × 6 × 226 mm   ·   145 g
语言 英语  

Mere med samme udgiver