How Search Engines Work
One of the most frequently asked questions we get from website owners is,
“Why can't my site be found on Google?” They know it’s important to appear
at the top of search engine results but they just don’t know why it doesn’t
happen to them.
They may well be in awe of the ‘black arts’ of search engine optimization or
puzzled by the complexity of it all. If they’re unlucky they will have paid out
money to some snake oil salesman guaranteeing to get them to the top in
48 hours - and been sorely disappointed with the lack of traffic that results.
Most search engine optimizers are highly ethical, professional people but
unsurprisingly they tend to keep their cards close to their chest. So what is
the website owner to do to compete?
The big secret is there is no big secret
It’s true:
the ‘big secret’ of search engine optimization is that there is no big secret.
It is all about understanding what is going on, followed by the hard work and
attention to detail that are common to many business activities.
This lesson is about understanding the basics of what is going on inside a
search engine. It is just under 800 words in length but that is more than
enough to give you what you need to know, to give you what you need to
know to get started.
Inside the guts of a search engine
For simplicity’s sake, let’s say there are three pieces of software that together make up a search engine - the Spider software, the Index software and the Query software.
If you understand what these three do, then you have the foundation for
getting your website to the top of the search engines.
Here’s what the three types of software do:
The Spider software ‘crawls the web looking for new pages to collect and
add to the search engine indices’.
This is a metaphor. In reality, the spider doesn’t do any ‘crawling’ and
doesn’t ‘visit’ any web pages. It requests pages from a website in the same
way as Microsoft Explorer, or Firefox or whatever browser you use requests
pages to display on your screen.
The difference is that the spider doesn’t collect images or fancy designs - it
is only interested in text and links AND the URL, from which they come: it
doesn’t display anything and it gets as much information as it can in the
shortest time possible.
A spider loves links because they lead it to other web pages that have the
things that it loves, guess what? more text, links and URLs!
The Index software catches everything the Spider can throw at it (yes,
that’s another metaphor). The index makes sense of the mass of text, links
and URLs using what is called an algorithm - a complex mathematical
formula that indexes the words, the pairs of words and so on.
Essentially, an algorithm analyses the pages and links for word combinations
and assigns scores that allow the search engine to judge how important the
page (and URL) might be to the person that is searching. And of course it
stores all of this information and makes it available to people who are
searching.
The Query software is what you see when you use a search engine - it is
the front end that everybody thinks of as a search engine. It may look simple
but it presents the results of all the remarkable search engine software that
works away invisibly on our behalf.
The main feature of the query software is the box into which people type
their search terms.
Type in your words, hit search and the search engine will try to match your
words with the best web pages in can find through searching the web.
But this too is a metaphor and perhaps the most important one.
The query software doesn’t search the web - it checks the records that have
been created by its own index software. And those records have been made
possible by the raw material the spider software collects.
What you need to understand about search engines
What you need to understand is that the search engine has done all the hard
work of collecting and analyzing web pages, BUT it only makes that
information available when someone does a search by entering words in
the search box and hitting return.
The words people use when they search therefore determine the results
the search engine presents. We call them keywords - that might sound fancy
but keywords are only ‘the words people use when they search’.
Use keywords in your website and you will prosper:
ignore them and your online business will surely perish.