FOCUS: LEARNING TO CRAWL WEB FORUMS
Author(s):
Bagam Rakesh, S Ravi Kiran, Asst. Prof., N. Swapna Suhasini
Keywords:
FOCUS, Forums, Crawler, Page Classification, URL Pattern Learning
Abstract
In this paper, we describe Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain the information content that forum crawlers target. Although forums come in several styles and are powered by different forum software packages, they always have similar implicit navigation paths, connected by specific URL types, that lead users from entry pages to thread pages. Based on this observation, we reduce the web forum crawling problem to a URL-type recognition problem, which the techniques and methods in this paper address.
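To make the idea of URL-type recognition concrete, the following is a minimal sketch and not the method of the paper. It assumes two URL types, "index" and "thread", and two hand-written regular expressions for an imaginary forum; the type names, the patterns, and the helper functions `classify_url` and `select_links` are all hypothetical. In a supervised crawler such patterns would be learned from labeled pages rather than written by hand.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical URL patterns for one imaginary forum (assumption, not from the paper).
# A supervised crawler would learn patterns like these from labeled example pages.
URL_PATTERNS = {
    "index": re.compile(r"^/forum/\d+/?$"),    # board pages that list threads
    "thread": re.compile(r"^/thread/\d+/?$"),  # pages that hold the actual posts
}


def classify_url(path: str) -> Optional[str]:
    """Return the URL type whose pattern matches the path, or None if none match."""
    for url_type, pattern in URL_PATTERNS.items():
        if pattern.match(path):
            return url_type
    return None


def select_links(paths: List[str]) -> List[Tuple[str, str]]:
    """Keep only links whose type lies on the entry -> index -> thread path."""
    selected = []
    for path in paths:
        url_type = classify_url(path)
        if url_type is not None:
            selected.append((path, url_type))
    return selected
```

Under these assumptions, a crawler would follow only the links returned by `select_links` and skip everything else (login pages, user profiles, and so on), which is how recognizing URL types can keep crawling overhead low.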
Article Details
Unique Paper ID: 143665

Publication Volume & Issue: Volume 2, Issue 12

Page(s): 448 - 452