<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tropical Blogging &#187; 301</title>
	<atom:link href="http://www.tropicalwebworks.org/tag/301/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.tropicalwebworks.org</link>
	<description>Warm breezes, sunshine, and random thoughts</description>
	<lastBuildDate>Sun, 04 Apr 2010 23:27:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Search vs. Direct Navigation</title>
		<link>http://www.tropicalwebworks.org/2008/07/29/search-vs-direct-navigation/</link>
		<comments>http://www.tropicalwebworks.org/2008/07/29/search-vs-direct-navigation/#comments</comments>
		<pubDate>Tue, 29 Jul 2008 18:35:02 +0000</pubDate>
		<dc:creator>Sonja</dc:creator>
				<category><![CDATA[Clients]]></category>
		<category><![CDATA[Search Engine Optimization]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[301]]></category>
		<category><![CDATA[client]]></category>
		<category><![CDATA[direct navigation]]></category>
		<category><![CDATA[dns]]></category>
		<category><![CDATA[navigation]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.tropicalwebworks.org/?p=146</guid>
		<description><![CDATA[So I have a client I&#8217;m developing a website for. The client has been great to work with, and we&#8217;ve made good progress on the new site. But there was a problem: The client kept complaining that he couldn&#8217;t reach his site &#8212; he was getting &#8220;not found&#8221; error messages. He told me that he [...]]]></description>
			<content:encoded><![CDATA[<p>So I have a client I&#8217;m developing a website for. The client has been great to work with, and we&#8217;ve made good progress on the new site. But there was a problem: The client kept complaining that he couldn&#8217;t reach his site &#8212; he was getting &#8220;not found&#8221; error messages. He told me that he could, at one point, get to the site without the &#8220;www&#8221; &#8212; which was particularly strange because one of the first things I do on a new site is implement a 301 permanent redirect from the non-www to the www version of the URL.<span id="more-146"></span></p>
<p>I checked the DNS settings on the server. I did traceroutes from my computer. I went to websites in other parts of the country and did traceroutes. I pinged. I double-checked and triple-checked the nameservers. Everything was exactly right, and I was tearing my hair out trying to figure out why the client couldn&#8217;t reach his own site.</p>
<p>Finally, it struck me: The client was <strong>searching</strong> for the site, using the search box on Yahoo and Comcast.</p>
<p>Once I told him that his site not indexed yet (because it&#8217;s brand new), and that he needed to enter the URL directly into the browser&#8217;s address bar &#8212; voila! Problem solved.</p>
<p>This is one to file away to deal with future &#8220;can&#8217;t reach my site&#8221; issues.</p>
<p>Note to website designers and developers: If you ever have clients who can&#8217;t reach their site, check to see if they&#8217;re searching for it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.tropicalwebworks.org/2008/07/29/search-vs-direct-navigation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Infamous Canonical URL Issue</title>
		<link>http://www.tropicalwebworks.org/2007/01/18/infamous-canonical-url/</link>
		<comments>http://www.tropicalwebworks.org/2007/01/18/infamous-canonical-url/#comments</comments>
		<pubDate>Fri, 19 Jan 2007 01:42:24 +0000</pubDate>
		<dc:creator>Sonja</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Search Engine Optimization]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[301]]></category>
		<category><![CDATA[canonical URL]]></category>
		<category><![CDATA[duplicate content]]></category>

		<guid isPermaLink="false">http://www.tropicalwebworks.org/2007/01/18/the-infamous-canonical-url-issue/</guid>
		<description><![CDATA[Difficult as it may be to believe, but by January of 2007, Google is still unable to recognize when URLs that obviously lead to the same page are in fact the same page. So what&#8217;s a URL, and what&#8217;s the problem here? URL (pronounced you-are-ell, or sometimes &#8220;earl&#8221; as in Duke of) stands for Uniform [...]]]></description>
			<content:encoded><![CDATA[<p>Difficult as it may be to believe, but by January of 2007, Google is <strong>still</strong> unable to recognize when URLs that obviously lead to the same page are in fact the same page. So what&#8217;s a URL, and what&#8217;s the problem here?</p>
<p><span id="more-30"></span>URL (pronounced you-are-ell, or sometimes &#8220;earl&#8221; as in <em>Duke of</em>) stands for Uniform Resource Locator. It&#8217;s the technical name for the <strong>address</strong> of a particular web page. For example, the URL of this site&#8217;s home page is <code>http://www.tropicalwebworks.org</code>, and the URL of this page is <code>http://www.tropicalwebworks.org/2007/01/18/infamous-canonical-url/</code>.</p>
<p>It&#8217;s common that any particular web page may be reached at multiple URLs. If this site were not configured optimally, the home page might be reachable at both <code>http://www.tropicalwebworks.org</code> and <code>http://tropicalwebworks.org</code> (notice the missing &#8220;www.&#8221;). Normal people would logically think that this would be desirable: After all, you don&#8217;t want people to get a &#8220;server not found&#8221; error if they try to get to your site without including the www part.</p>
<p>But Google sees these as two completely separate URLs that just happen to contain exactly the same content. There are two problems with such a situation:</p>
<ol>
<li>First, the &#8220;strength&#8221; of that page, and its ability to turn up in the search engine results, is diluted. Some of the page&#8217;s strength is allotted to one version, and some to the other, and neither &#8220;page&#8221; performs as well as it would if all the strength were concentrated in one page.</li>
<li>And second, Google attempts to filter out pages containing duplicate content, based on the reasonable logic that people don&#8217;t want to see multiple results in their searches for the exact same thing. Thus, since both of these &#8220;pages&#8221; contain the exact same content, one of them will suffer in searches due to the dupe content filter.</li>
</ol>
<p>It&#8217;s a double whammy. It&#8217;s not that your site actually <strong>has</strong> duplicate content. No, we could possibly call this situation &#8220;virtual duplicate content.&#8221; But it&#8217;s all the same to Google: It&#8217;s duplicate content, period.</p>
<p>And if that&#8217;s not bad enough, many people link to their home page like this: http://www.example.com/index.html. Now Google sees yet another instance of duplicate content: http://www.example.com and http://www.example.com/index.html. So ultimately what Google sees is <strong>four</strong> &#8220;duplicate content&#8221; pages:</p>
<ul>
<li>http://www.example.com</li>
<li>http://example.com</li>
<li>http://www.example.com/index.html</li>
<li>http://example.com/index.html</li>
</ul>
<p>And all this before we&#8217;ve even gotten past the home page of your site!</p>
<p>It&#8217;s easy-peasy to configure the server to do what&#8217;s called a &#8220;301 permanent redirect&#8221; from the non-www version to the www version of your site. This technique, which is recommended by Google, tells Google that the two are indeed the same and keeps the poor Googlebot from deciding that you have duplicate content and splitting your page&#8217;s strength among more than one version. &#8220;301&#8243; refers to the status code that&#8217;s returned by the web server to the browser (or the spider, in this case), and it says, in effect, &#8220;Hey, the correct, permanent URL for the page you&#8217;re requesting is actually over there. Don&#8217;t index it at this URL.&#8221;</p>
<p>It&#8217;s likewise easy-peasy to link to your home page without the &#8220;index.html&#8221; (or other directory index name, such as home.htm or default.asp). For index pages in subdirectories, you simply link to the directory: <code>http://www.example.com/subdirectory/</code>, again leaving out the actual filename index.html.</p>
<p>I apply an appropriate 301 permanent redirect to the www version of every web site I develop. It&#8217;s not something I charge extra for, or something that I tout to my clients as being anything special. It&#8217;s about a 20-second task to set up the 301 properly. And I never link to directory index pages by filename. I don&#8217;t know why some of the big companies aren&#8217;t aware of this issue, or, if they are aware, why they don&#8217;t care enough to do it properly. It raises the question, if they&#8217;re so ignorant, or uncaring, about a thing that is so simple to do right, in how many other areas are they incompetent?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.tropicalwebworks.org/2007/01/18/infamous-canonical-url/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
