WebSPHINX: A Personal, Customizable Web Crawler - The Diigo Meta page

www.cs.cmu.edu/websphinx

This link has been bookmarked by 22 people. It was first bookmarked on 02 Mar 2006, by Jorge Maestre.

  • 05 Nov 09
  • 18 Aug 09
  • 09 Jul 09
    • WebSPHINX is designed for advanced web users and Java programmers who want to
      crawl over a small part of the web (such as a single web site) automatically.
    • The WWW7 paper mentions a "CategoryClassifier", but I can't find it in
      the source code.  Where can I get it?

      The CategoryClassifier was part of an earlier web-crawling system, SPHINX,
      developed at Compaq SRC.  The original SPHINX code belongs to Compaq SRC
      and was never released.  WebSPHINX is an open-source reimplementation of
      the SPHINX interface.  CategoryClassifier was not part of this
      reimplementation because it depended on other software that
      belongs to SRC.
  • 05 Feb 07
  • 22 Aug 06
  • 11 Mar 06
  • 22 Feb 06
  • 23 Jan 06
  • 13 Oct 05