This link has been bookmarked by 7 people. It was first bookmarked on 08 Oct 2006, by Raghu Rajagopalan.
-
15 Apr 09
-
This case study discusses how TheServerSide built an infrastructure that allows
us to index and search our different content using Lucene. -
You will also see a main(..) method. We created this to allow a command-line
program to kick off a build process. -
We created a simple Java representation of this data, SearchContentHolder,
-
create a fairly generic set of fields
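
A holder of this kind might look like the sketch below. Only the class name SearchContentHolder comes from the excerpt; every field name here is a hypothetical choice made for illustration, since the excerpt does not list the actual fields.

```java
import java.time.Instant;
import java.util.Objects;

// Hypothetical sketch: field names are illustrative assumptions, not the case study's actual fields.
final class SearchContentHolder {
    private final String id;        // unique key of the content item in its source
    private final String title;
    private final String owner;     // author or poster
    private final String body;
    private final String category;  // e.g. a content type used for filtering
    private final Instant modified; // last modification time

    SearchContentHolder(String id, String title, String owner,
                        String body, String category, Instant modified) {
        this.id = Objects.requireNonNull(id, "id");
        this.title = title;
        this.owner = owner;
        this.body = body;
        this.category = category;
        this.modified = modified;
    }

    String getId() { return id; }
    String getTitle() { return title; }
    String getOwner() { return owner; }
    String getBody() { return body; }
    String getCategory() { return category; }
    Instant getModified() { return modified; }
}
```

Keeping the holder free of any search-library types means each content source can fill it in without knowing how the index is built.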
-
have the concept of incremental
indexing, as well as a full indexing from scratch. To take care of this we have
an incrementalDays variable which is configured for the index process. If this
value is set to 0 or less, then do a full index. -
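
The incrementalDays rule quoted above can be sketched in a few lines. This is a minimal illustration, not the case study's code: the class name IndexWindow and both method names are hypothetical, and it assumes each content item carries a last-modified date.

```java
import java.time.LocalDate;
import java.util.Optional;

// Hypothetical helper illustrating the incrementalDays rule; names are assumptions.
final class IndexWindow {
    private IndexWindow() {}

    /** Returns the earliest modification date to index, or empty for a full rebuild. */
    static Optional<LocalDate> cutoff(int incrementalDays, LocalDate today) {
        if (incrementalDays <= 0) {
            return Optional.empty(); // 0 or less: index everything from scratch
        }
        return Optional.of(today.minusDays(incrementalDays));
    }

    /** True if an item modified on the given date falls inside the current indexing run. */
    static boolean shouldIndex(LocalDate modified, int incrementalDays, LocalDate today) {
        Optional<LocalDate> cutoff = cutoff(incrementalDays, today);
        return !cutoff.isPresent() || !modified.isBefore(cutoff.get());
    }
}
```

With incrementalDays set to 5, only items modified within the last five days would be re-indexed; with 0 or a negative value every item qualifies.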
possessing
content that lives in different sources. -
this field NOT only the body
of a thread message, but rather a concatenation of the title of the message,
the owner of the message, and then finally the message contents itself. You
could try to use Boolean queries to make sure that a search
finds a good match, but we found it a LOT simpler to put in a cheeky concatenation! -
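
The concatenation described above could be as simple as the following sketch. The class and method names are hypothetical, and the space separator and null handling are assumptions; the excerpt only states the order: title, owner, then message contents.

```java
// Hypothetical sketch of building one searchable text field from three parts.
final class FullTextField {
    private FullTextField() {}

    /** Joins title, owner and body (in that order) with single spaces, skipping blank parts. */
    static String combine(String title, String owner, String body) {
        StringBuilder sb = new StringBuilder();
        for (String part : new String[] {title, owner, body}) {
            if (part == null || part.trim().isEmpty()) {
                continue; // nothing to add for a missing part
            }
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(part.trim());
        }
        return sb.toString();
    }
}
```

The combined string would then be indexed as a single field, so a plain query matches on title, owner or body without a Boolean query across three fields.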
The date boost has been really important for us.
-
There have been settings in both the indexing process, and search process,
that were crying out for abstraction. Where should we put the index location,
the category lists, the boost values, and register the index sources? We didn’t
want to have this in code, and since the configuration was hierarchical we resorted
to using XML.
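
To make the idea concrete, here is a minimal sketch of reading such a hierarchical configuration with the JDK's DOM parser. The element and attribute names (search-config, index-location, category, boost) and the sample path are invented for illustration; the excerpt does not show the real file.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical configuration reader; the XML vocabulary below is an assumption.
final class SearchConfig {
    static final String SAMPLE_XML =
        "<search-config>"
        + "<index-location>/tmp/search-index</index-location>"
        + "<categories><category>news</category><category>discussions</category></categories>"
        + "<boost title=\"2.0\"/>"
        + "</search-config>";

    final String indexLocation;
    final List<String> categories;
    final float titleBoost;

    private SearchConfig(String indexLocation, List<String> categories, float titleBoost) {
        this.indexLocation = indexLocation;
        this.categories = categories;
        this.titleBoost = titleBoost;
    }

    /** Parses the configuration; wraps any parser failure in an unchecked exception. */
    static SearchConfig parse(String xml) {
        try {
            Element root = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
                .getDocumentElement();
            String location =
                root.getElementsByTagName("index-location").item(0).getTextContent().trim();
            List<String> cats = new ArrayList<>();
            NodeList nodes = root.getElementsByTagName("category");
            for (int i = 0; i < nodes.getLength(); i++) {
                cats.add(nodes.item(i).getTextContent().trim());
            }
            Element boost = (Element) root.getElementsByTagName("boost").item(0);
            return new SearchConfig(location, cats, Float.parseFloat(boost.getAttribute("title")));
        } catch (Exception e) {
            throw new IllegalArgumentException("Invalid search configuration", e);
        }
    }
}
```

Both the indexing process and the search process could load the same file, which keeps the index location, category lists and boost values out of the code.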
-
14 Feb 08
-
03 Jan 08
-
18 Jan 07
-
19 Dec 06
-
17 Jul 06