This link has been bookmarked by 44 people . It was first bookmarked on 11 Dec 2006, by Lee Odden.
-
27 Sep 11
-
14 Sep 11
-
Crawling
-
A lot of things have to happen before you see a web page containing your Google search results. Our first step is to crawl and index the billions of pages of the World Wide W -
e complete
-
So the next step is to build an index
-
-
Catie AndersonThis article tells us exactly how Google finds the results and puts them in a certain order based on the words that you search about. This website still kept me wondering how the speed is so quick.
-
"spider,"
-
An exercise for students
Once you see how to intersect two words in an index, it's not hard to do it for three or more words as well. Here's a fun exercise: try to find all the documents below that contain the words "civil" and "war" and "reconstruction."
civil: 1 9 15 19 22 35 38 48 53 55 65 68 73 78 82 88 91 99
war: 15 18 25 29 31 35 37 40 42 46 48 65 75 85 91 96
reconstruction: 35 42 48 64 73 91 95
The answer is at the end of the article.
-
-
07 Apr 11
-
01 Feb 11
-
07 Oct 10
-
10base TomOne of the most common questions we hear from librarians is "How does Google decide what result goes at the top of the list?" Here, from quality engineer Matt Cutts, is a quick primer on how we crawl and index the web and then rank search results. Matt also suggests exercises school librarians can do to help students.
-
14 May 10
-
05 May 10
-
08 Feb 10
-
04 Jan 10
-
We've developed an interesting trick that speeds up the first step: instead of storing the entire index on one very powerful computer, Google uses hundreds of computers to do the job. Because the task is divided among many machines, the answer can be found much faster. To illustrate, let's suppose an index for a book was 30 pages long. If one person had to search for several pieces of information in the index, it would take at least several seconds for each search. But what if you gave each page of the index to a different person? Thirty people could search their portions of the index much more quickly than one person could search the entire index alone. Similarly, Google splits its data between many machines to find matching documents faster.
-
pages that are both reputable and relevant
-
we'll often elevate a page with fewer links or lower PageRank if other signals suggest that the page is more relevant
-
For example, a web page dedicated entirely to the civil war is often more useful than an article that mentions the civil war in passing, even if the article is part of a reputable site such as Time.com.
-
-
16 Dec 09
-
19 Jan 09
-
23 Sep 08
-
18 May 08
-
19 Mar 08
Sheryl A. McCoyDIY on what you need to know about Google datamining rules
google seo copywriting pagerank algorithm definitions librarian reference searchengine DIY
-
17 Mar 08
-
16 Mar 08
-
15 Mar 08
-
05 Dec 07
-
14 Nov 07
-
06 Aug 07
-
18 Jul 07
-
27 Feb 07
-
24 Jan 07
-
11 Dec 06
-
26 Nov 06
-
16 May 06
Page Comments
Would you like to comment?
Join Diigo for a free account, or sign in if you are already a member.