How Do Search Engines Work – Web Crawlers

How Search Engines Work

It is the search engines that finally bring your website to the notice of prospective customers. It therefore helps to know how these search engines actually work and how they present information to the person initiating a search.

There are basically two types of search engines. The first type, covered here, is powered by robots called crawlers or spiders.

Search engines use spiders to index websites. When you submit your website pages to a search engine by completing its submission page, the search engine's spider will index your site. A ‘spider’ is an automated program run by the search engine system. The spider visits a website, reads the content on the actual site and the site’s Meta tags, and also follows the links that the site connects to. The spider then returns all that information to a central depository, where the data is indexed. It will visit each link you have on your website and index those sites as well. Some spiders will only index a certain number of pages on your site, so don’t create a site with 500 pages!
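To make the spider's routine more concrete, here is a minimal sketch in Python. It is only an illustration, not how any real search engine is built: the names `PageParser`, `read_page` and `crawl`, the `fetch` callable (anything that returns a page's HTML for a URL, or `None`) and the `max_pages` limit are all assumptions made for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects the text, Meta tags and outgoing links of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.meta = {}
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("name"):
            # Store Meta tags such as "description" or "keywords".
            self.meta[attrs["name"].lower()] = attrs.get("content") or ""
        elif tag == "a" and attrs.get("href"):
            # Resolve relative links against the page's own address.
            self.links.append(urljoin(self.base_url, attrs["href"]))

    def handle_data(self, data):
        # Keeps every non-empty text node (a real spider would skip scripts).
        if data.strip():
            self.text_parts.append(data.strip())


def read_page(html, base_url):
    """Read one page: its content, its Meta tags and the links it contains."""
    parser = PageParser(base_url)
    parser.feed(html)
    parser.close()
    return {
        "text": " ".join(parser.text_parts),
        "meta": parser.meta,
        "links": parser.links,
    }


def crawl(start_url, fetch, max_pages=50):
    """Follow links from start_url, storing at most max_pages pages."""
    depository = {}            # hypothetical stand-in for the central depository
    seen = {start_url}
    queue = deque([start_url])
    while queue and len(depository) < max_pages:
        url = queue.popleft()
        html = fetch(url)      # fetch is supplied by the caller
        if html is None:
            continue
        page = read_page(html, url)
        depository[url] = page
        for link in page["links"]:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return depository
```

The `max_pages` limit mirrors the point above that some spiders only index a certain number of pages per site. Passing a dictionary's `get` method as `fetch` is enough to try the sketch without touching the network.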

The spider will periodically return to the sites to check for any information that has changed. The frequency with which this happens is determined by the moderators of the search engine.

The index a spider builds is almost like a book: it contains a table of contents, the actual content, and the links and references for all the websites the spider finds during its search. A spider may index up to a million pages a day.

Examples: Bing, Yahoo and Google.

When you ask a search engine to locate information, it is searching through the index it has created, not the live Web. Different search engines produce different rankings because not every search engine uses the same algorithm to search through the indices.
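The idea of searching an index rather than the Web itself can be sketched with a tiny "inverted index" that maps each word to the pages containing it. This is a simplified illustration under assumed names (`tokenize`, `build_index`, `search`); real engines store far more than this and rank the results as well.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map every word to the set of page URLs on which it appears."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in tokenize(text):
            index[word].add(url)
    return index


def search(index, query):
    """Return the pages containing every word of the query."""
    words = tokenize(query)
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results
```

Note that `search` never looks at the pages themselves: once the index is built, a query is answered purely from the stored word lists.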

One of the things that a search engine algorithm scans for is the frequency and location of keywords on a web page, but it can also detect artificial keyword stuffing or spamdexing.
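As a rough illustration of what checking the "frequency and location" of a keyword could look like, the sketch below counts a keyword in a page's body, notes whether it appears in the title, and flags a suspiciously high density. The function name `keyword_report` and the 15% `stuffing_threshold` are arbitrary assumptions for this example; actual search engines do not publish their rules.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def keyword_report(title, body, keyword, stuffing_threshold=0.15):
    """Summarize where and how often a single keyword appears on a page."""
    keyword = keyword.lower()
    body_words = tokenize(body)
    count = Counter(body_words)[keyword]
    density = count / len(body_words) if body_words else 0.0
    return {
        "in_title": keyword in tokenize(title),                   # location
        "first_position": body_words.index(keyword) if count else None,
        "count": count,                                           # frequency
        "density": density,
        # Assumed rule of thumb: a very high density suggests stuffing.
        "looks_stuffed": density > stuffing_threshold,
    }
```

A page that simply repeats "shoes" over and over would score a very high density here, which is the kind of pattern that keyword stuffing produces.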

Then the algorithms analyze the way that pages link to other pages on the Web. By checking how pages link to each other, an engine can determine what a page is about, provided the keywords of the linked pages are similar to the keywords on the original page.
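One simple way to picture this comparison is to measure how much the keywords of a page overlap with the keywords of the pages it links to. The sketch below uses the share of common keywords (the Jaccard measure) purely as an assumed stand-in; the names `keyword_overlap` and `link_relevance` and the page layout are made up for this example and do not describe any particular engine's algorithm.

```python
def keyword_overlap(a, b):
    """Share of keywords two pages have in common (0.0 to 1.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def link_relevance(pages, url):
    """Score each page linked from `url` by keyword similarity to it."""
    source = pages[url]
    scores = {}
    for target in source["links"]:
        if target in pages:  # skip links to pages we know nothing about
            scores[target] = keyword_overlap(
                source["keywords"], pages[target]["keywords"]
            )
    return scores
```

A high score for most of a page's links would support the guess that the page really is about its stated keywords, while links to unrelated pages score close to zero.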



Why a Second Back Up is Worth Considering

While it pains me to admit that this blog (http://macproductions.com) was lost, together with all its articles, when our hosting company’s servers malfunctioned, it is even more embarrassing that I had no up-to-date backup of the blog. At first I was not unduly worried, as my hosting company makes regular (two-hourly) backups of all our websites.

It was therefore very disconcerting when the hosting company informed me that they were unable to find the MySQL databases associated with our blogs. We had in effect just lost over 400 posts covering too many different topics to mention.

So it is that I write this as the first post of our new blog and take the opportunity to recommend making your own backup of any website you may own.

As the saying goes, “if you want it done right, then do it yourself.”

**** Update to this post ****

I reinstalled WordPress and set up both of my blogs again within a few days. One day later I noticed that the email on one blog was not working: I could neither send nor receive email, and the returned emails carried the dreaded “554 5.7.1” error message. After many hours of consulting the customer support geeks at the hosting company, I was able to get to the bottom of the email problem: my site was down again. We had all been looking too closely at the email symptoms to see the broader picture. No site, no email. The upside is that I had made a backup on completing the restoration of the site, and so was able to save a day's work on this second restoration. 🙂
