<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Give or take a billion</title>
	<atom:link href="http://crookedtimber.org/2005/09/26/give-or-take-a-billion/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Mon, 15 Mar 2010 03:44:29 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Bro. Bartleby</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-104095</link>
		<dc:creator>Bro. Bartleby</dc:creator>
		<pubDate>Wed, 28 Sep 2005 21:41:55 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-104095</guid>
		<description>Bro. Anthony, it tells us that even though long ago we each were weaned from said glands, we still have the desire to return to the comfort that said glands provided upon our entry into the world. The birth trauma was quickly forgotten when warmth and tenderness and milk comforted us, and so too in this often traumatic world, we still find comfort with mere photographs of the said glands. So to find over 57 million websites devoted to tits, I take comfort in our collective honor of the remembrance of that very first suck.</description>
		<content:encoded><![CDATA[	<p>Bro. Anthony, it tells us that even though long ago we each were weaned from said glands, we still have the desire to return to the comfort that said glands provided upon our entry into the world. The birth trauma was quickly forgotten when warmth and tenderness and milk comforted us, and so too in this often traumatic world, we still find comfort with mere photographs of the said glands. So to find over 57 million websites devoted to tits, I take comfort in our collective honor of the remembrance of that very first suck.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Anthony</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103846</link>
		<dc:creator>Anthony</dc:creator>
		<pubDate>Tue, 27 Sep 2005 23:22:05 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103846</guid>
		<description>Bro. Bartleby, I&#039;m not sure what searching for tits tells you about men.</description>
		<content:encoded><![CDATA[	<p>Bro. Bartleby, I&#8217;m not sure what searching for tits tells you about men.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Bro. Bartleby</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103699</link>
		<dc:creator>Bro. Bartleby</dc:creator>
		<pubDate>Tue, 27 Sep 2005 19:07:51 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103699</guid>
		<description>&quot;tits&quot; scores 57,900,000. Now what does that tell us about humans? Or I should say, what does it tell us about males?</description>
		<content:encoded><![CDATA[	<p>&#8220;tits&#8221; scores 57,900,000. Now what does that tell us about humans? Or I should say, what does it tell us about males?</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: riting on the wall &#187; Blog Archive &#187; tech and usage</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103696</link>
		<dc:creator>riting on the wall &#187; Blog Archive &#187; tech and usage</dc:creator>
		<pubDate>Tue, 27 Sep 2005 18:21:49 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103696</guid>
		<description>[...] at ct, eszter notes that google&#8217;s numbers don&#8217;t always add up all that well. A search on &#8220;www&#8221; yields results &#8220;of about 9,160,000,000&#8243;. This is curious given that according to Google&#8217;s homepage, the engine is &#8220;Searching 8,168,684,336 web pages&#8221;. Perhaps they are extrapolating to sites that they are not searching. Or perhaps those &#8220;of about&#8221; figures are not very accurate. In general, those numbers are hard to verify since Google won&#8217;t display more than 1000 results to any query. The figures may be helpful in establishing relative popularity, although it&#8217;s unclear whether the system can be trusted to be reliable even to that extent. [...]</description>
		<content:encoded><![CDATA[	<p>[...] at ct, eszter notes that google&#8217;s numbers don&#8217;t always add up all that well. A search on &#8220;www&#8221; yields results &#8220;of about 9,160,000,000&#8243;. This is curious given that according to Google&#8217;s homepage, the engine is &#8220;Searching 8,168,684,336 web pages&#8221;. Perhaps they are extrapolating to sites that they are not searching. Or perhaps those &#8220;of about&#8221; figures are not very accurate. In general, those numbers are hard to verify since Google won&#8217;t display more than 1000 results to any query. The figures may be helpful in establishing relative popularity, although it&#8217;s unclear whether the system can be trusted to be reliable even to that extent. [...]</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Tombstone</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103675</link>
		<dc:creator>Tombstone</dc:creator>
		<pubDate>Tue, 27 Sep 2005 16:25:48 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103675</guid>
		<description>Although I&#039;ve been a long time Google fan, I&#039;ve recently begun using Yahoo search again - with great success.

I&#039;ve found that Yahoo is crawling sites extensively.  When looking for specific long phrases, coding examples, error discussions, and other detailed search strings, Yahoo is coming up as the winner.</description>
		<content:encoded><![CDATA[	<p>Although I&#8217;ve been a long time Google fan, I&#8217;ve recently begun using Yahoo search again &#8211; with great success.</p>

	<p>I&#8217;ve found that Yahoo is crawling sites extensively.  When looking for specific long phrases, coding examples, error discussions, and other detailed search strings, Yahoo is coming up as the winner.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff R.</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103674</link>
		<dc:creator>Jeff R.</dc:creator>
		<pubDate>Tue, 27 Sep 2005 16:18:16 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103674</guid>
		<description>I did a search on &#039;the&#039; and got the same number as the original post (9,160,000,000.)

Interestingly, the top three hits for &#039;the&#039;, in order, are The Onion, the White House, and The Weather Channel.</description>
		<content:encoded><![CDATA[	<p>I did a search on &#8216;the&#8217; and got the same number as the original post (9,160,000,000.)</p>

	<p>Interestingly, the top three hits for &#8216;the&#8217;, in order, are The Onion, the White House, and The Weather Channel.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: joeblow</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103673</link>
		<dc:creator>joeblow</dc:creator>
		<pubDate>Tue, 27 Sep 2005 16:15:36 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103673</guid>
		<description>Reportedly, if you search for &quot;http&quot; you get all the pages, sorted
into Page Rank order.</description>
		<content:encoded><![CDATA[	<p>Reportedly, if you search for &#8220;http&#8221; you get all the pages, sorted<br />
into Page Rank order.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Eszter</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103672</link>
		<dc:creator>Eszter</dc:creator>
		<pubDate>Tue, 27 Sep 2005 16:07:24 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103672</guid>
		<description>Since I posted this entry yesterday, Google has taken down the exact number of claimed coverage.  Conveniently, however, Google caches its own homepage so I was able to retrieve a screen shot that way.  &lt;a href=&quot;http://www.flickr.com/photos/eszter/47152900/&quot; rel=&quot;nofollow&quot;&gt;Here it is.&lt;/a&gt;

On a different note, check out there homepage today for a cute birthday logo.  (I almost never go to the homepage since I use it from the toolbar. Today the logo is not copied on search results pages so I would&#039;ve missed it had I not been checking for their coverage estimate.)</description>
		<content:encoded><![CDATA[	<p>Since I posted this entry yesterday, Google has taken down the exact number of claimed coverage.  Conveniently, however, Google caches its own homepage so I was able to retrieve a screen shot that way.  <a href="http://www.flickr.com/photos/eszter/47152900/" rel="nofollow">Here it is.</a></p>

	<p>On a different note, check out there homepage today for a cute birthday logo.  (I almost never go to the homepage since I use it from the toolbar. Today the logo is not copied on search results pages so I would&#8217;ve missed it had I not been checking for their coverage estimate.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Matt Weiner</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103666</link>
		<dc:creator>Matt Weiner</dc:creator>
		<pubDate>Tue, 27 Sep 2005 15:54:45 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103666</guid>
		<description>One way of controlling for the problem bza points out might be to search &quot;bush&quot; and &quot;clinton&quot; as denominators.  But that&#039;s still going to be too unreliable for serious work.</description>
		<content:encoded><![CDATA[	<p>One way of controlling for the problem bza points out might be to search &#8220;bush&#8221; and &#8220;clinton&#8221; as denominators.  But that&#8217;s still going to be too unreliable for serious work.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: bza</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103653</link>
		<dc:creator>bza</dc:creator>
		<pubDate>Tue, 27 Sep 2005 14:55:01 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103653</guid>
		<description>Also, given the explosion of blogging in the last couple of years as well as the general increase in computer use since the Clinton era, there are going to be many more mentions of Bush online, period, no matter what other search terms you use. E.g.,

bush gandhi: 1,570,000
clinton gandhi: 769,000

Running this kind of Googlefight is a valid method only when the two figures are roughly contemporaneous, and in Internet time Clinton and Bush are decidedly not so.</description>
		<content:encoded><![CDATA[	<p>Also, given the explosion of blogging in the last couple of years as well as the general increase in computer use since the Clinton era, there are going to be many more mentions of Bush online, period, no matter what other search terms you use. E.g.,</p>

	<p>bush gandhi: 1,570,000<br />
clinton gandhi: 769,000</p>

	<p>Running this kind of Googlefight is a valid method only when the two figures are roughly contemporaneous, and in Internet time Clinton and Bush are decidedly not so.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Seth Finkelstein</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103652</link>
		<dc:creator>Seth Finkelstein</dc:creator>
		<pubDate>Tue, 27 Sep 2005 14:45:31 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103652</guid>
		<description>andrew: To perhaps be tedious, mentions of Bush and Hitler don&#039;t tell you if they&#039;re left-wingers who mean it (&quot;Bush is like Hitler&quot;), or right-wingers who are crying persecution over a fiction (&quot;Left-wingers always say Bush is like Hitler&quot;). This is big problem with Godwin&#039;s Law - there&#039;s a substantial incentive to make a phony accusation as a debate tactic (highly ironic ...).

There&#039;s an amazing amount of what I call &quot;Googery&quot;,  abuse of meaningless numbers.

My favorite involved the size of &lt;a href=&quot;http://www.sethf.com/infothought/blog/archives/000553.html&quot; rel=&quot;nofollow&quot;&gt;&quot;free  porn&quot;&lt;/a&gt;</description>
		<content:encoded><![CDATA[	<p>andrew: To perhaps be tedious, mentions of Bush and Hitler don&#8217;t tell you if they&#8217;re left-wingers who mean it (&#8220;Bush is like Hitler&#8221;), or right-wingers who are crying persecution over a fiction (&#8220;Left-wingers always say Bush is like Hitler&#8221;). This is big problem with Godwin&#8217;s Law &#8211; there&#8217;s a substantial incentive to make a phony accusation as a debate tactic (highly ironic &#8230;).</p>

	<p>There&#8217;s an amazing amount of what I call &#8220;Googery&#8221;,  abuse of meaningless numbers.</p>

	<p>My favorite involved the size of <a href="http://www.sethf.com/infothought/blog/archives/000553.html" rel="nofollow">&#8220;free  porn&#8221;</a></p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Sean</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103650</link>
		<dc:creator>Sean</dc:creator>
		<pubDate>Tue, 27 Sep 2005 14:30:38 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103650</guid>
		<description>Each search queries the high-page-rank entries first, and estimates the total number of pages found from that number.  So it&#039;s not perfectly accurate, and it can even change as you click &quot;Next.&quot;

&lt;a href=&quot;http://www.google.com/search?q=the&amp;btnG=Search&quot; rel=&quot;nofollow&quot;&gt;The&lt;/a&gt; also gets about 9 billion -- and it&#039;s very funny to see what comes up first.  Here are &lt;a href=&quot;http://cosmicvariance.com/2005/08/10/relative-importance/&quot; rel=&quot;nofollow&quot;&gt;some other enlightening results&lt;/a&gt;.</description>
		<content:encoded><![CDATA[	<p>Each search queries the high-page-rank entries first, and estimates the total number of pages found from that number.  So it&#8217;s not perfectly accurate, and it can even change as you click &#8220;Next.&#8221;</p>

	<p><a href="http://www.google.com/search?q=the&#038;btnG=Search" rel="nofollow">The</a> also gets about 9 billion&#8212;and it&#8217;s very funny to see what comes up first.  Here are <a href="http://cosmicvariance.com/2005/08/10/relative-importance/" rel="nofollow">some other enlightening results</a>.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Gelman</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103648</link>
		<dc:creator>Andrew Gelman</dc:creator>
		<pubDate>Tue, 27 Sep 2005 14:23:18 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103648</guid>
		<description>We had some discussion &lt;a href=&quot;http://www.stat.columbia.edu/~cook/movabletype/archives/2005/03/research_google.html&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt; and &lt;a&gt;here&lt;/a&gt; about using googlefighting to estimate the frequency of Godwin&#039;s Law violations regarding Bush and Clinton.  The consensus was that the 1000-link limit would make it a difficult statistical problem--excellent for class discussion, maybe not such an easy research tool. 

Some example results:

bush hitler: 1.5 million
clinton hitler: 0.7 million

bush: 83 million
clinton: 25 million

bush mcdonalds: 440,000
clinton mcdonalds: 200,000

But I haven&#039;t read &lt;a href=&quot;http://www.arxiv.org/abs/%20cs.CL/0412098&quot; rel=&quot;nofollow&quot;&gt;this reference&lt;/a&gt; which maybe gets around the counting problem somehow.</description>
		<content:encoded><![CDATA[	<p>We had some discussion <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2005/03/research_google.html" rel="nofollow">here</a> and <a>here</a> about using googlefighting to estimate the frequency of Godwin&#8217;s Law violations regarding Bush and Clinton.  The consensus was that the 1000-link limit would make it a difficult statistical problem&#8212;excellent for class discussion, maybe not such an easy research tool.</p>

	<p>Some example results:</p>

	<p>bush hitler: 1.5 million<br />
clinton hitler: 0.7 million</p>

	<p>bush: 83 million<br />
clinton: 25 million</p>

	<p>bush mcdonalds: 440,000<br />
clinton mcdonalds: 200,000</p>

	<p>But I haven&#8217;t read <a href="http://www.arxiv.org/abs/%20cs.CL/0412098" rel="nofollow">this reference</a> which maybe gets around the counting problem somehow.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Eszter</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103633</link>
		<dc:creator>Eszter</dc:creator>
		<pubDate>Tue, 27 Sep 2005 13:44:03 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103633</guid>
		<description>Kieran, so did you try on your computer?:)  What did you get?  Yes, these numbers do tend to change (this computer is giving me 9,650,000,000), which adds to the concern that those numbers are unreliable.

It seems that being off by a billion is quite a bit if your total figure is just in the single billions. If you were off by a million and you&#039;re dealing with billions, that&#039;s another thing.  (Or if you&#039;re off by a billion, but really dealing with trillions that may be a different matter as well.)

I know there are various reasons for it and that&#039;s fine. Part of the point of mentioning all this is to note that drawing conclusions from the numbers that come up in Google searches (or searches on other engines) may be quite problematic.  People seem to use those numbers for various things yet it&#039;s important to remember how inexact those figures can be.</description>
		<content:encoded><![CDATA[	<p>Kieran, so did you try on your computer?:)  What did you get?  Yes, these numbers do tend to change (this computer is giving me 9,650,000,000), which adds to the concern that those numbers are unreliable.</p>

	<p>It seems that being off by a billion is quite a bit if your total figure is just in the single billions. If you were off by a million and you&#8217;re dealing with billions, that&#8217;s another thing.  (Or if you&#8217;re off by a billion, but really dealing with trillions that may be a different matter as well.)</p>

	<p>I know there are various reasons for it and that&#8217;s fine. Part of the point of mentioning all this is to note that drawing conclusions from the numbers that come up in Google searches (or searches on other engines) may be quite problematic.  People seem to use those numbers for various things yet it&#8217;s important to remember how inexact those figures can be.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://crookedtimber.org/2005/09/26/give-or-take-a-billion/comment-page-1/#comment-103628</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Tue, 27 Sep 2005 13:21:31 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=3856#comment-103628</guid>
		<description>Crikey, what a nerdy question - who cares?  
&quot;Search engine finds more pages than it claims&quot; - what kind of story is that??
Does the search get you what you want - generally either a specific site or the answer to a specific question?  Answer - mostly yes.  Does it cost me anything - no.  Do I have plenty of other search engines I can use - yes.
Job done.</description>
		<content:encoded><![CDATA[	<p>Crikey, what a nerdy question &#8211; who cares?<br />
&#8220;Search engine finds more pages than it claims&#8221; &#8211; what kind of story is that??<br />
Does the search get you what you want &#8211; generally either a specific site or the answer to a specific question?  Answer &#8211; mostly yes.  Does it cost me anything &#8211; no.  Do I have plenty of other search engines I can use &#8211; yes.<br />
Job done.</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>
