Wednesday, April 16, 2008

Google Sitemaps

·

Google Sitemaps


In general, there are two types of sitemaps. The first type of sitemap is a HTML page listing the pages of your site - often by section - and is meant to help users find the information they need.

XML Sitemaps - usually called Sitemaps, with a capital S - are a way for you to give Google information about your site. This is the type of Sitemap we'll be discussing in this article.

In its simplest terms, a Sitemap is a list of the pages on your website. Creating and submitting a Sitemap helps make sure that Google knows about all the pages on your site, including URLs that may not be discoverable by Google's normal crawling process.

Sitemaps are particularly helpful if:

  • Your site has dynamic content.
  • Your site has pages that aren't easily discovered by Googlebot during the crawl process - for example, pages featuring rich AJAX or Flash.
  • Your site is new and has few links to it. (Googlebot crawls the web by following links from one page to another, so if your site isn't well linked, it may be hard for us to discover it.)
  • Your site has a large archive of content pages that are not well linked to each other, or are not linked at all.

You can also use a Sitemap to provide Google with additional information about your pages, including:

  • How often the pages on your site change. For example, you might update your product page daily, but update your About Me page only once every few months.
  • The date each page was last modified.
  • The relative importance of pages on your site. For example, your home page might have a relative importance of 1.0, category pages have an importance of 0.8, and individual blog entries or product pages have an importance of 0.5. This priority only indicates the importance of a particular URL relative to other URLs on your site, and doesn't impact the ranking of your pages in search results.

Sitemaps provide additional information about your site to Google, complementing our normal methods of crawling the web. We expect they will help us crawl more of your site and in a more timely fashion, but we can't guarantee that URLs from your Sitemap will be added to the Google index. Sites are never penalized for submitting Sitemaps.

Google adheres to Sitemap Protocol 0.9 as defined by sitemaps.org. The Sitemap Protocol is a dialect of XML for summarizing Sitemap information that is relevant to web crawlers. Sitemaps created for Google using Sitemap Protocol 0.9 are therefore compatible with other search engines that adopt the standards of sitemaps.org.

While a standard Sitemap works for most sites, you can also create and submit specialized Sitemaps for certain types of content. These Sitemap formats are specific to Google and are not used by other search engines. They're a good way to give Google detailed information about specific content types. For example, publishers can use News Sitemaps to give Google information that can appear in Google News search results, such as publication date, keywords, and stock ticker symbol. Sitemap formats include:


Our suite of webmaster tools provides you with a free and easy way to make your site more Google-friendly. They can show you Google’s view of your site, help you diagnose problems, and let you share info with us to help improve your site’s visibility.

Getting Google’s view of your site, and diagnosing potential problems
The first step to increasing your site’s visibility on Google is learning how our robots crawl and index your site.

  • Crawl info: You can make sure we have access to your site, and see when Googlebot last visited. You can also view URLs that we’ve had trouble crawling and why we couldn't crawl them. This way, you can fix any problems preventing us from indexing all of your pages.
  • Robots.txt file validation: See if we’re having trouble with your file, and test out changes to that file before you change it on your server.
  • Website content: View top content from your site and see the words that other sites use to link to it.

Seeing how your site performs
A second step is learning what drives traffic to your site.

  • Top queries: Find the top queries that drive traffic to your site and where your site is included in the top search results. This will let you learn how users are finding your site.
  • Indexing information: See how your site is indexed and which of your pages are included in the index. If we find violations in your site, we’ll give you the opportunity to fix the problems and request reinclusion of your site.

Sharing info with Google about your site
Since no one knows more about your site than you do, you can also share this info with Google and improve your crawlability.

  • Submit a Sitemap file: Tell us all about your pages by submitting a Sitemap file; help us learn which pages are most important to you and how often those pages change.
  • Specify your preferred domain: Tell us which URL to use when indexing your site; we’ll do our best to index the version you prefer.
Thanks.

1 comments:

tawau said...
October 30, 2008 at 9:48 PM  

hello... Thankz 4 your guide.. very helpful..

can you x-plain about robots.txt. how to use in google sitemaps.. Thankz...