Google

How can I get my website to show up on Google and other search engines? What is a robots file?

If your website appears on Google, it will most likely appear in other search engines as well. Google also has a market share of about 70%, which means that roughly 70% of the searches on the web are made through Google. That’s why I describe only how to get your website indexed on Google here and don’t cover other search engines.

Before you start going through the steps described here, log in to your Administration Panel (Dashboard). Go to Settings, Privacy and under “Blog visibility” make sure
“I would like my blog to be visible to everyone, including search engines (like Google, Sphere, Technorati) and archivers”
is selected.

Chapter 1: Going through the process step by step
Chapter 2: What is a robots file? What is it good for? How is it made?

Chapter 1: Going through the process step by step

1) Submit your URL by going to addurl

2) Register here for the Google Webmaster Tools.

3) Now create a sitemap at http://www.xml-sitemaps.com/
- Put your base URL (www.yourdomain.com) where it says “Starting URL”
- Click on “Start”

4) Download the un-compressed XML Sitemap

5) Open it in Safari, any other browser, or a plain-text editor (TextWrangler, Notepad) and check that almost all of your pages are included. If they aren’t, something went wrong. If you’re on a Mac and it doesn’t open with Safari, right-click on it (ctrl+click), choose “Open with” and select “Safari”.

6) If it’s OK, upload that file to the root (main location) on your server.
Entering http://www.yourdomain.com/sitemap.xml in your browser should bring up this file. If it doesn’t come up in the browser, then something went wrong and you will have to find out what. You probably placed it in the wrong location.

7) Once you have uploaded that file, go to the Google Webmaster Tools you registered for in Step 2. If your base URL isn’t already on the list in the Dashboard, you need to add it. To do that, hit “Add site” (screenshot: Google - 1) and type your domain in the box that appears:
http://yourdomain.com

Google - 2

and hit “Continue”.

8) Google will ask you to verify that it really is your website.

Under “Choose verification method” select “Upload an HTML file”:

Google - 3

Download the HTML file by hitting “this HTML verification file” and you will get a file called google8824XXXXXX.html or similar.

Put it in the main location (root) on your server (the same location as the sitemap.xml file).
Entering www.yourdomain.com/google8824XXXXXX.html in your browser should bring up this file. If it doesn’t come up in the browser, then something went wrong and you will have to find out what.

Once you have uploaded the file, hit “Verify” and you will see a message appear at the top of the window

Google - 4

and get redirected to the Dashboard.

Now go to “Site configuration” and “Sitemaps”

Google - 5

and hit the button for adding a sitemap (screenshot: Google - 6).

9) If you uploaded your sitemap to the root of your server, you just have to type in "sitemap.xml" (or the name of the file if you renamed it)

Google - 7

and click on “Submit Sitemap”.

Again a message will appear at the top of the window

Google - 8

and your sitemap will be on the list:

Google - 9

10) That’s it. However, you will have to wait about two weeks before you see your site indexed by Google. To see if your website is indexed, go to Google and type site: followed by your URL into the search box, for example site:yourdomain.com (see example here). Note that the results change depending on whether or not you include the www. I find most of my pages indexed without it because I first registered the site that way.
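The check in Step 5 (are almost all of your pages in the sitemap?) can also be done with a few lines of Python instead of reading the XML by eye. This is only a sketch: it assumes the file follows the standard sitemaps.org format, and the function name and the sample content are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace used by sitemap files that follow the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def list_sitemap_urls(xml_bytes):
    """Return every <loc> URL found in a sitemap, in document order."""
    root = ET.fromstring(xml_bytes)
    locs = root.findall("sm:url/sm:loc", SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


# Hypothetical sample content. For a real check, read your own file instead:
#   xml_bytes = open("sitemap.xml", "rb").read()
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.yourdomain.com/</loc></url>
  <url><loc>http://www.yourdomain.com/about/</loc></url>
</urlset>"""

urls = list_sitemap_urls(SAMPLE)
print(len(urls), "pages listed")
for url in urls:
    print(url)
```

If the number of pages printed is much lower than the number of pages on your site, the sitemap is probably incomplete and worth regenerating.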

From time to time, replace the old sitemap with a new one:
- Make a new sitemap
- Upload it to your server
- Go to your Google Webmaster Tools account
- Click on the URL of your site
- Go to Sitemaps, check the box next to the sitemap file name and hit “Resubmit selected”, and you’re done.
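If you would rather not go back to the online generator every time, a new sitemap can also be written by a small script. The following is a minimal sketch, not what xml-sitemaps.com does internally: it assumes you already have a list of your page URLs, and the function name and example URLs are hypothetical. It needs Python 3.8 or later.

```python
import xml.etree.ElementTree as ET

# Namespace required by the sitemaps.org protocol.
SITEMAP_XMLNS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(page_urls):
    """Build a minimal sitemap.xml document (as bytes) from a list of page URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_XMLNS)
    for page_url in page_urls:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = page_url
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


# Hypothetical page list; replace it with the pages of your own site.
sitemap_bytes = build_sitemap([
    "http://www.yourdomain.com/",
    "http://www.yourdomain.com/about/",
])
print(sitemap_bytes.decode("utf-8"))

# To save it for uploading:
#   open("sitemap.xml", "wb").write(sitemap_bytes)
```

The resulting file is uploaded and resubmitted exactly as described in the list above.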

Chapter 2: What is a robots file? What is it good for? How is it made?

A robots.txt file is used to tell search engines (such as Google or Yahoo) which pages or folders you don’t want them to index. There are many reasons why you might want to keep some pages from being indexed: a page or an entire site may be just for testing and not meant to be seen by everybody, or it may contain private information about you and your family that you don’t want to be searchable, and so on.

I made a file filled with some sample information. You can download that file by clicking here.

When you open it in TextWrangler (download it here for free), Notepad or any other plain-text editor, you will see the following rules:

Disallow: /test/ means that search engines will not index the /test/ folder on my server or anything in it.
Disallow: /sample/Sample.html means that search engines will not index the Sample.html page, which is in the /sample/ folder. However, all other pages in the /sample/ folder will be indexed.
Disallow: /example.html means that they will not index the example.html page, which is in the root of the server. Note that every path has to start with a slash.
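To see how these rules behave before uploading the file, you can try them out with Python’s built-in robots.txt parser. This is a sketch under a few assumptions: the sample file is taken to start with a "User-agent: *" line (so the rules apply to every search engine), the third rule is written with a leading slash as /example.html because paths are matched from the root, and the helper name is_allowed is made up for this example.

```python
from urllib.robotparser import RobotFileParser

# Assumed content of the sample robots.txt, reconstructed from the three rules
# described above; the "User-agent: *" line is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /test/
Disallow: /sample/Sample.html
Disallow: /example.html
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, user_agent="*"):
    """Return True if the rules let a crawler fetch the given path."""
    return parser.can_fetch(user_agent, "http://yourdomain.com" + path)


for path in ["/test/page.html", "/sample/Sample.html",
             "/sample/other.html", "/example.html", "/index.html"]:
    print(path, "allowed" if is_allowed(path) else "blocked")
```

Only /sample/other.html and /index.html come back as allowed; the other three paths are blocked by the rules.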

You have to upload the robots.txt file to the root of your server so that it is reachable under
http://yourdomain.com/robots.txt

This FAQ is also discussed in my WPFAQ.org E-Book (sample)

© WPFAQ.org - Tutorials, Tips & Tricks is made with WordPress and for WordPress.
It's an EBookStoretoday.com company. It’s presented by Cédric Giger and hosted by HostGator.com
Thank you for visiting and supporting my website,
- Cédric -
