This link has been bookmarked by 66 people. It was first bookmarked on 31 Jul 2006, by Olivier Ziller.
- Patrick Sauts: Internet-scale web searches
- James Burke: open source search engine as used by DiscoverEd
- Rajkumar Singh: Welcome to Nutch!
- Simon Reavely: Web crawler based on Lucene and Hadoop
- Mamoud Kassem: Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
- rubaiyat shatner: search based on metadata and full text, where you can choose which takes priority and by how much
- Pasquale Basile: It is not clear whether or not it can be useful for crawling files!
- Bruno Martins: Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
- Markus Radspieler: Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
For more information about Nutch, please see the Nutch wiki.