What If You Dont Want Your Pages To Be Crawled and Cached site site articles site information about site what is site Site Promotion Information Search Now: What If You Dont Want Your Pages To Be Crawled and Cached plus articles and information on site
Article: 4429

What If You Dont Want Your Pages To Be Crawled and Cached


This information brought to you by Todays Sponsor! (web site travel marketing promotion)
Blinkx Video Search
World's largest video search engine. Over 26 million hours of video. Watch it all!
blinkx.com
 web site travel marketing promotion Listings
Your Source for Travel. Find and Compare Travel Listings Here.
areaconnect.com
 

Jerry Yu

Some website owners have pages that they want to hide from general public. The pages meet the following criteria:

  • Only accessible by trusted users if they know page URLs.
  • No links on the website that point to these pages.
  • No username and password are required to gain access as long as you know page URLs.

Lets see this scenario:

One day you created a page and you didnt put a link to it on your site. Then you told your family members about the pages URL. You thought nobody else would find it. You just made a mistake. Google and Yahoo would find your page if you or any family member ever visited websites with either Google toolbar PageRank enabled or Yahoo Companion Toolbar.

PageRank function

When you use Google toolbar with PageRank enabled, the toolbar automatically sends and records the pages URL you visited in Googles database. If a page URL is not found in Googles database, Googlebot - the robot of Google, will visit this page later to index it.

Your surfing activities are tracked whether you use the toolbar to search the web or directly type a pages URL in Google search page. Google records your visits anyway.

One day when you check what pages on your site have been indexed by Google, your hidden page comes up and you are worried. Furthermore, this page is cached. Even though you remove that page from your site, it can still be found and viewed from the cached version.

How to check what pages have been indexed

Go to Google, type in "site:www.yoursite.com" without quotes. This query will list all the pages that have been indexed but it will only display up to 999 records as this is the limit set by Google for any queries.

How to prevent your hidden pages to be indexed and cached

One simple but not sound solution is to disable PageRank function on the toolbar. To stop Google automatically track your surfing information, you can uncheck the PageRank checkbox to disable it.

Steps to disable PageRank function:

  • Click Options button on the toolbar you can see the word "Options" without quotes
  • In the pop-up windows Option tab, uncheck the PageRank checkbox.

See Google Toolbar Privacy Policy at http://toolbar.google.com/privacy.html for what information Google is collecting.

Unfortunately, disable the PageRank function is not going to completely solve your problem because, in our example, your other family members could have PageRank enabled.

A sound solution

Your problem can be tackled by using meta robots html tag. The following two tags are what you need to use. Put the tag in the <head> section of your HTML documents.

<meta name="robots" content="noindex,nofollow">

Search engines will read this page but will not index it and no links on this page will be traversed through to other pages.

<meta name="robots" content="noarchive">

Search engines will not archive/cache the page content.

How to remove an indexed and cached page

If your page has already been indexed and cached, to remove from search engine databases, do this:

1. Add <meta name="robots" content="noindex,nofollow,noarchive"> to your page head tag section. Next time when Googlebot or other robots visit your page, your page will be removed from their index and cache.

2. Do what Google suggests. "If you believe your request is urgent and cannot wait until the next time Google crawls your site, use our automatic URL removal system. In order for this automated process to work, your webmaster must first insert the appropriate meta tags into the pages HTML code."

cited from Google web site Remove Content from Googles Index at http://www.google.com/remove.html

One last note. Is your page now 100% hidden Not really. If you have outbound links on the hidden page and you click the links and navigate to other websites, your hidden pages URL will appear in other sites web traffic log as HTTP referer.

You can remove outbound links from your hidden pages if thats suitable.

More resources

What If...

Now you know how to safeguard any page on your site. What if you want to keep robots out from visiting all files in a directory The answer is in my article Robots.txt And Search Engine Robots - http://www.WebActionGuide.com/kb/robots-txt.php




Recommended Reading:

Blinkx Video Search 
  • World's largest video search engine. Over 26 million hours of video. Watch it all!

  • >> View Site
     
    web site travel marketing promotion Listings 
  • Your Source for Travel. Find and Compare Travel Listings Here.

  • >> View Site
     
    web site travel marketing promotion 
  • Find Local Marketing Information. Search Local Listings.

  • >> View Site
     
    Local Motels Listings 
  • Cheap, Affordable Motels. Get Rates Today.

  • >> View Site
     
    Watch Funny Videos! 
  • Click here to see funny videos, pictures, jokes, commercials, and more funny stuff from Comedy.com.

  • >> View Site
     
    Luxury Reviews and Trends 
  • Discover incredible luxury travel, shopping, articles, videos and more...

  • >> View Site
     
    Watch Free Videos At Mevio! 
  • Tons of Free Videos, Only At Mevio.com

  • >> View Site
     
    Free Tech and Gadget Reviews! 
  • Watch GeekBrief With Cali Lewis on Mevio!

  • >> View Site
     
    Free Online Kids Games 
  • Hundreds of fun free online games for kids.

  • >> View Site
     
    Going.com - Your Resource For Local Entertainment 
  • Parties, nightlife, concerts, arts. Check Going.com to find out what's happening in your city, and who's going!

  • >> View Site
     

    RELATED ARTICLES >>
    Getting Visitors To Stay Through Web Based Marketing - Site
     
    21 Free or Low-cost ways to Promote your On-line business - Site
     
    Google Ban - How not to get banned by Google! - Site
     
    5 Effective Ways to Promote Your Website For Free Online - Site
     
    Top 5 Methods to Promote Your Website - Site
     
    See No Google, Hear No Google, Speak No Google - Site
     
    Seek Engines: What If Seek Had Bumped Out Search - Site
     
    SEO Your PDF’s - Does This Work - Site
     
    How Search Engines Work - Site
     
    Why Directories Might Save Your Websites Life - Site
     
    "The Numbers Dont Lie!" - Site
     
    Googles AdWords Selecttm Groundbreaking Program - Site
     
    Learn How To Use These Six Explosive Marketing Techniques To Explode Your Website With Traffic - Site
     
    Branding Versus SEO - Site
     
    Last Updated: 2008-11-19     Need More? Check out Article-Max Table of Contents :: docuMAX Network