Accession Number : ADA494589

Title :   Dynamic Scheduling for Web Monitoring Crawler

Descriptive Note : Final rept. 7 Mar 2008-7 Mar 2009

Corporate Author : TASMANIA UNIV HOBART (AUSTRALIA)

Personal Author(s) : Kang, Byeong Ho ; Compton, Paul ; Motoda, Hiroshi ; Salerno, John

PDF Url : ADA494589

Report Date : 27 FEB 2009

Pagination or Media Count : 49

Abstract : Web monitoring systems report any changes on the target web pages by revisiting them frequently. As they are operated under significant constrains such as network and computing, it is necessary to minimize revisits with minimal delay and maximum coverage. Various statistical scheduling methods were proposed to resolve this problem. However they are static and cannot easily cope with events in the real world. This paper proposes a new scheduling method that manages unpredictable events. MCRDR (Multiple Classification Ripple-Down Rules) document classification knowledge base was reused to detect events and to initiate a prompt web monitoring process regardless of static monitoring schedule. The experiment demonstrates that the approach proposed improves monitoring efficiency significantly.

Descriptors :   *INTERNET, *SCHEDULING, *MONITORING, ADAPTIVE SYSTEMS, STATISTICAL PROCESSES, CLASSIFICATION, DETECTION, ALGORITHMS, DOCUMENTS

Subject Categories : Information Science

Distribution Statement : APPROVED FOR PUBLIC RELEASE