Learning To Crawl Web Forums


Details

We present Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain the information content that forum crawlers target. Although forums have different layouts or styles and are powered by different forum software packages, they always have similar implicit navigation paths, connected by specific URL types, that lead users from entry pages to thread pages. Based on this observation, we reduce the web forum crawling problem to a URL-type recognition problem. We show how to learn accurate and effective regular expression patterns of implicit navigation paths from automatically created training sets, using aggregated results from weak page type classifiers. Robust page type classifiers can be trained from as few as five annotated forums and applied to a large set of unseen forums.
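
To illustrate the idea of treating forum crawling as URL-type recognition, here is a minimal sketch in Python. It is not the FoCUS implementation: the URL-type names (`index`, `thread`, `page_flipping`), the path layout, and the helper functions are all hypothetical, and the patterns are written by hand, whereas the approach described above learns them from automatically created training sets.

```python
import re

# Hypothetical URL patterns for one imaginary forum. These are hand-written
# for illustration only; the approach in the description learns such
# patterns automatically instead.
URL_PATTERNS = {
    "index": re.compile(r"^/forum/\d+/?$"),
    "thread": re.compile(r"^/thread/\d+/?$"),
    "page_flipping": re.compile(r"^/(?:forum|thread)/\d+/page/\d+/?$"),
}


def classify_url(path: str) -> str:
    """Return the assumed URL type of a path, or "other" if nothing matches."""
    for url_type, pattern in URL_PATTERNS.items():
        if pattern.match(path):
            return url_type
    return "other"


def urls_to_follow(paths):
    """Keep only the URLs that lie on the entry -> index -> thread path."""
    return [p for p in paths if classify_url(p) != "other"]
```

A crawler built along these lines would follow only the URLs returned by `urls_to_follow` and skip everything else (user profiles, login pages, and so on), which is where the reduction in overhead would come from.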

About the author

Mr. Vipul D. Punjabi, M.Tech IT, BE Computer. Assistant Professor at R.C.Patel Institute of Technology, Shirpur. Life member of the Indian Society for Technical Education, LMISTE (LM98909).

Further information

  • General information
    • GTIN 09786135812343
    • Publisher LAP LAMBERT Academic Publishing
    • Number of pages 60
    • Genre IT Encyclopedias
    • Weight 107g
    • Size H220mm x W150mm x D4mm
    • Year 2018
    • EAN 9786135812343
    • Format Paperback
    • ISBN 6135812343
    • Publication date 10 January 2018
    • Title Learning To Crawl Web Forums
    • Author Vipul Punjabi
    • Language English
