World Scientific
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×
Spring Sale: Get 35% off with a min. purchase of 2 titles. Use code SPRING35. Valid till 31st Mar 2025.

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at customercare@wspc.com for any enquiries.

GENERATION OF CLASSIFIER FOR DOMAIN-SPECIFIC HIDDEN WEB SEARCH INTERFACE

    This work is supported by the Grant No. 60373099 of National Natural Science Foundation of China.

    https://doi.org/10.1142/9789812701534_0148Cited by:0 (Source: Crossref)
    Abstract:

    As the Web grows, more and more data have become “hidden” or “deepen”. Web crawlers should not only have the ability of fetching the Publicly Indexable Web (PIW), but also be able to plunge into the Hidden Web to search for more useful information. In this paper, an approach for exploring and identifying the domain-specific search interfaces by using SVM classification scheme is presented. The method of integrating domain-specific search interface (DSI) into the topical crawling system is also introduced. This work is based on our observation of the conciseness and representative characteristics of DSI. Though intuitive and apparent this approach is, it seems more preferable for identifying DSIs. The experimental results show that such a feasible and practical way can achieve good performance.