Although it looks like Google is scanning each and every Web page the moment it displays your organic search results, it's not really that magical.
It turns out there was a lot of preparation leading up to that "magical" moment when the words are searched-search experts call that preparation indexing.
Indexing is the process that creates a search index, a special database that holds a list of all the words on all the pages on the Web.
Matching the Search Query
First, the search engine matches documents to the query
The search engine also eliminates or downgrades the important of commonly-used words (known as stop words), such as "of" and "the." During this step, the search engine processes search operators (such as "+" or "-") and notes the word order of the query.
Basically, the query analysis determined what words to look up-not just the words that were typed as the query, but any word variants (mouse and mice)-and which to ignore (stop words and antiphrases).
The engine goes to work looking up each word in the query to see which pages contain the word. (The search index can be thought of as an alphabetical list of every word that occurs on every page of the Web.) So, when you look up the word "glaucoma," you get a list of every page that contains that word.
That's the simplest case. It's more complicated when searchers enter more complex queries. If the searcher was looking for "glaucoma treatment," then the engine would look up every page that contains each word, giving it a list of pages that contain "glaucoma" and a list of pages that contain "treatment."
Most search engines, faced with this decision, decide to show just the pages that contain both words. So they look through the two lists and find which Web pages are listed in both.
Ranking Search Results
Next, the search engine decides the best order of the results
And with Web search, that's almost always the case. So, one of the most important parts of a search engine is the ranking algorithm-the part of the search engine that decides which pages show up at the top of the results list.
Ranking is just another word for sorting, the act of collating results into a certain order.
For some kinds of information, such as news stories, ranking results by the date of the in-formation (newest first) may make sense, but most organic search results are ranked by relevance, the degree to which the pages match the subject of the query.
Ranking algorithms usually take into account keyword density (the number of keyword occurrences compared to the total number of words on a page), keyword prominence (how close to the top of the page the kwywords occur), link popularity (how many other Web pages link to a page and how important they are), and many more.
Displaying Search Results
The final step is showing the list of matching pages
The search terms are usually displayed in bold, drawing the searcher's eye to them. You should understand that everything displayed in the search results is drawn from what the search engine previously stored in its index.
The search engine never examines the actual page while it is displaying search results, which is why the results page can sometimes contain outdated information, or even display pages that no longer exist (which are discovered when searchers click them).
The information displayed on the results page was correct when the spider last crawled the page, but the page may have changed (or even been removed) since.
Get the Book That Explains the Inner Workings of Search Engines
Learn how search engines work and how they can work for you
Search Engine Marketing, Inc. takes you step-by-step through every aspect of how search engines work and how you can work them. Your business can take advantage of the visitors that Google can send you every day.
by mikemoran
Co-author of Search Engine Marketing, Inc. and author of Do It Wrong Quickly, Mike Moran is a public speaker and consultant now serving as Chief Arch...
(more)
by 3 people |



