<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>{Complexity} &#187; Internet</title>
	<atom:link href="http://technote.wsjoung.com/category/internet/feed/" rel="self" type="application/rss+xml" />
	<link>http://technote.wsjoung.com</link>
	<description>simple note</description>
	<lastBuildDate>Fri, 06 Aug 2010 19:03:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Make it hide or searchable</title>
		<link>http://technote.wsjoung.com/2006/11/17/make-it-hide-or-searchable/</link>
		<comments>http://technote.wsjoung.com/2006/11/17/make-it-hide-or-searchable/#comments</comments>
		<pubDate>Fri, 17 Nov 2006 00:26:36 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Internet]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://wsjoung.wordpress.com/2006/11/17/make-it-hide-or-searchable/</guid>
		<description><![CDATA[Some people are concerned about their personal or some information which they want to hide. Even if they do, sometimes they up-load those information on the public web space. Okay, there is a way to hide those information from the search engine or robot. Currently most search robots don&#8217;t support Meta tags but, there is [...]]]></description>
			<content:encoded><![CDATA[<p>Some people are concerned about their personal or some information which they want to hide. Even if they do, sometimes they up-load those information on the public web space. Okay, there is a way to hide those information from the search engine or robot.<br />
Currently most search robots don&#8217;t support Meta tags but, there is another thing for robots. &#8220;robots.txt&#8221; this file gives a direction to the robots which are trying to search your web site. Which directories or files are allowed or not allowed for searching.<br />
There are some examples.</p>
<p><strong>To exclude all robots from the entire server</strong><br />
User-agent: *<br />
Disallow: /</p>
<p><strong>To allow all robots complete access</strong><br />
User-agent: *<br />
Disallow:<br />
Or create an empty &#8220;/robots.txt&#8221; file.</p>
<p><strong>To exclude all robots from part of the server</strong><br />
User-agent: *<br />
Disallow: /cgi-bin/<br />
Disallow: /tmp/<br />
Disallow: /private/</p>
<p><strong>To exclude a single robot</strong><br />
User-agent: BadBot<br />
Disallow: /</p>
<p><strong>To allow a single robot</strong><br />
User-agent: WebCrawler<br />
Disallow:</p>
<p>User-agent: *<br />
Disallow: /</p>
<p><strong>To exclude all files except one</strong><br />
This is currently a bit awkward, as there is no &#8220;Allow&#8221; field. The easy way is to put all files to be disallowed into a separate directory, say &#8220;docs&#8221;, and leave the one file in the level above this directory:</p>
<p>User-agent: *<br />
Disallow: /~joe/docs/</p>
<p><strong>Alternatively you can explicitly disallow all disallowed pages:</strong><br />
User-agent: *<br />
Disallow: /~joe/private.html<br />
Disallow: /~joe/foo.html<br />
Disallow: /~joe/bar.html</p>
<p><a href="http://www.google.com/bot.html">Googlebot</a><br />
<a href="http://www.robotstxt.org/">robotstxt.org</a></p>
]]></content:encoded>
			<wfw:commentRss>http://technote.wsjoung.com/2006/11/17/make-it-hide-or-searchable/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
