This paper describes Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain the information that forum crawlers target. Although forums come in many styles and are powered by different forum software packages, they always have similar implicit navigation paths, connected by specific URL types, that lead users from entry pages to thread pages. Based on this observation, we reduce the web forum crawling problem to a URL-type recognition problem, using the techniques and methods presented in the paper.
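To illustrate what treating crawling as URL-type recognition could look like, the following is a minimal sketch and not the FoCUS implementation. The type names (`index`, `thread`, `page_flipping`), the hand-written regular expressions, and the example URLs are all assumptions made for illustration; a supervised crawler would learn such patterns from labeled pages rather than hard-code them.

```python
import re
from urllib.parse import urlparse

# Hypothetical, hand-written patterns for illustration only.
# A supervised crawler would learn patterns like these from training pages.
# Order matters: the first matching pattern wins.
URL_TYPE_PATTERNS = [
    ("page_flipping", re.compile(r"[?&](page|start)=\d+")),
    ("thread", re.compile(r"(showthread|viewtopic)\.php\?")),
    ("index", re.compile(r"(forumdisplay|viewforum)\.php\?")),
]


def classify_url(url):
    """Return the assumed URL type for a forum link, or "other"."""
    parsed = urlparse(url)
    # Match against path and query only, so the host name cannot interfere.
    target = parsed.path + ("?" + parsed.query if parsed.query else "")
    for url_type, pattern in URL_TYPE_PATTERNS:
        if pattern.search(target):
            return url_type
    return "other"


def links_to_follow(urls):
    """Keep only links whose type lies on an entry-to-thread navigation path."""
    return [u for u in urls if classify_url(u) != "other"]
```

With this sketch, a crawler would follow only links that `links_to_follow` retains and skip everything else (for example, member profile pages), which is one way the overhead of visiting irrelevant pages could be reduced.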
Article Details
Unique Paper ID: 143665
Publication Volume & Issue: Volume 2, Issue 12
Page(s): 448 - 452