<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The AOL data mess</title>
	<atom:link href="http://crookedtimber.org/2006/08/07/the-aol-data-mess/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Mon, 13 Feb 2012 05:39:10 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: head</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-168443</link>
		<dc:creator>head</dc:creator>
		<pubDate>Sat, 12 Aug 2006 23:53:59 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-168443</guid>
		<description>Yes.. try out the AOL search database yourself.. It is just fun to look at some of the search data..

http://data.aolsearchlogs.com/log/random.cgi</description>
		<content:encoded><![CDATA[	<p>Yes.. try out the <span class="caps">AOL</span> search database yourself.. It is just fun to look at some of the search data..</p>

	<p><a href="http://data.aolsearchlogs.com/log/random.cgi" rel="nofollow">http://data.aolsearchlogs.com/log/random.cgi</a></p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Alex R</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167849</link>
		<dc:creator>Alex R</dc:creator>
		<pubDate>Wed, 09 Aug 2006 09:42:48 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167849</guid>
		<description>And here&#039;s the &lt;a href=&quot;http://www.nytimes.com/2006/08/09/technology/09aol.html?ei=5094&amp;en=9b5fd9ff341e3216&amp;hp=&amp;ex=1155182400&amp;adxnnl=1&amp;partner=homepage&amp;adxnnlx=1155115662-OeSr7K5NkuyIXksn6q9Fqw&amp;pagewanted=all&quot; rel=&quot;nofollow&quot;&gt;followup&lt;/a&gt;... The NY Times reporters track down one of the searchers based on her searches, and interview her.  Will the EU still consider this kind of data not to be &quot;personally identifiable&quot; given this kind of example?  It&#039;s one thing to look at the data and say &quot;Gee, you could probably identify some of these searchers&quot; and another thing to actually do it...</description>
		<content:encoded><![CDATA[	<p>And here&#8217;s the <a href="http://www.nytimes.com/2006/08/09/technology/09aol.html?ei=5094&#038;en=9b5fd9ff341e3216&#038;hp=&#038;ex=1155182400&#038;adxnnl=1&#038;partner=homepage&#038;adxnnlx=1155115662-OeSr7K5NkuyIXksn6q9Fqw&#038;pagewanted=all" rel="nofollow">followup</a>&#8230; The <span class="caps">NY </span>Times reporters track down one of the searchers based on her searches, and interview her.  Will the EU still consider this kind of data not to be &#8220;personally identifiable&#8221; given this kind of example?  It&#8217;s one thing to look at the data and say &#8220;Gee, you could probably identify some of these searchers&#8221; and another thing to actually do it&#8230;</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: a thaumaturgical compendium &#187; Blog Archive &#187; AOL and the aftermath</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167830</link>
		<dc:creator>a thaumaturgical compendium &#187; Blog Archive &#187; AOL and the aftermath</dc:creator>
		<pubDate>Wed, 09 Aug 2006 02:37:17 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167830</guid>
		<description>[...] The AOL thing turns out to be big news, complete with EFF recriminations and a mea culpa from AOL execs. Eszter blogs about the research dilemma over on Crooked Timber. But the secondary effect is the more difficult one. [...]</description>
		<content:encoded><![CDATA[	<p>[...] The <span class="caps">AOL</span> thing turns out to be big news, complete with <span class="caps">EFF</span> recriminations and a mea culpa from <span class="caps">AOL</span> execs. Eszter blogs about the research dilemma over on Crooked Timber. But the secondary effect is the more difficult one. [...]</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: agm</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167779</link>
		<dc:creator>agm</dc:creator>
		<pubDate>Tue, 08 Aug 2006 17:14:54 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167779</guid>
		<description>Funny, Seth, cause on my end of the Net the google subpeona brouhaha was deemed to be exactly that, a fishing expedition with no justifiable reason to be done. It just ain&#039;t all that hard to figure out who is who given a sufficiently connected set of searches and link trail and such.</description>
		<content:encoded><![CDATA[	<p>Funny, Seth, cause on my end of the Net the google subpeona brouhaha was deemed to be exactly that, a fishing expedition with no justifiable reason to be done. It just ain&#8217;t all that hard to figure out who is who given a sufficiently connected set of searches and link trail and such.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Matt Weiner</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167773</link>
		<dc:creator>Matt Weiner</dc:creator>
		<pubDate>Tue, 08 Aug 2006 16:57:15 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167773</guid>
		<description>Someone was recently convicted of murdering his ex-wife in part because &lt;a href=&quot;http://www2.ljworld.com/news/2005/mar/18/kstate_professor_convicted/&quot; rel=&quot;nofollow&quot;&gt;he searched&lt;/a&gt; for &quot;how to kill someone and not get caught.&quot; Though there was other evidence. 

via the &lt;a href=&quot;http://mattweiner.net/blog/archives/000742.html#comments&quot; rel=&quot;nofollow&quot;&gt;comment&lt;/a&gt; from &quot;Abbie Normal&quot;</description>
		<content:encoded><![CDATA[	<p>Someone was recently convicted of murdering his ex-wife in part because <a href="http://www2.ljworld.com/news/2005/mar/18/kstate_professor_convicted/" rel="nofollow">he searched</a> for &#8220;how to kill someone and not get caught.&#8221; Though there was other evidence.</p>

	<p>via the <a href="http://mattweiner.net/blog/archives/000742.html#comments" rel="nofollow">comment</a> from &#8220;Abbie Normal&#8221; </p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: abb1</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167766</link>
		<dc:creator>abb1</dc:creator>
		<pubDate>Tue, 08 Aug 2006 16:13:19 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167766</guid>
		<description>How is the search &quot;&lt;i&gt;how to kill my wife&lt;/i&gt;&quot; (&lt;i&gt;my&lt;/i&gt; wife?) indicative of an intent to commit a murder? That&#039;s just foolishness, absurdity, ain&#039;t it?</description>
		<content:encoded><![CDATA[	<p>How is the search &#8220;<i>how to kill my wife</i>&#8221; (<i>my</i> wife?) indicative of an intent to commit a murder? That&#8217;s just foolishness, absurdity, ain&#8217;t it?</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Eszter</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167756</link>
		<dc:creator>Eszter</dc:creator>
		<pubDate>Tue, 08 Aug 2006 15:16:54 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167756</guid>
		<description>Interesting and helpful, Maria, thanks!  

BTW, I agree with the earlier comments about how people shouldn&#039;t jump to conclusions based on some searches. In some cases we really don&#039;t have enough information to make much sense of just a few queries without much context.</description>
		<content:encoded><![CDATA[	<p>Interesting and helpful, Maria, thanks!</p>

	<p><span class="caps">BTW</span>, I agree with the earlier comments about how people shouldn&#8217;t jump to conclusions based on some searches. In some cases we really don&#8217;t have enough information to make much sense of just a few queries without much context.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Maria</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167752</link>
		<dc:creator>Maria</dc:creator>
		<pubDate>Tue, 08 Aug 2006 14:30:01 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167752</guid>
		<description>Yep, the data do not include name and physical address that are generally held to be personally identifiable data. (it&#039;s a legal term under the EU data protection directive, meaning information that pertains to an individual.)  

Search data is arguably &#039;traffic data&#039;, i.e. data about network activities that isn&#039;t itself directly tied to an identifed (n.b. not identifiable) individual, even though it may be possible to infer connections. E.g. it&#039;s the difference between an IP number that may or may not be shared you and other customers of your ISP and your name and postal address which are uniquely yours.

Even in the EU, traffic data are afforded a much lower level of protection (even when they are tied directly to personally identifiable data such as a phone number and billing data) than are personal data. 

Though there have been some recent developments on the treatment of traffic data in the EU which I really must get around to writing about...</description>
		<content:encoded><![CDATA[	<p>Yep, the data do not include name and physical address that are generally held to be personally identifiable data. (it&#8217;s a legal term under the EU data protection directive, meaning information that pertains to an individual.)</p>

	<p>Search data is arguably &#8216;traffic data&#8217;, i.e. data about network activities that isn&#8217;t itself directly tied to an identifed (n.b. not identifiable) individual, even though it may be possible to infer connections. E.g. it&#8217;s the difference between an IP number that may or may not be shared you and other customers of your <span class="caps">ISP</span> and your name and postal address which are uniquely yours.</p>

	<p>Even in the EU, traffic data are afforded a much lower level of protection (even when they are tied directly to personally identifiable data such as a phone number and billing data) than are personal data.</p>

	<p>Though there have been some recent developments on the treatment of traffic data in the EU which I really must get around to writing about&#8230;</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Sökmotorer och forskningsetik at Det perfekta tomrummet</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167751</link>
		<dc:creator>Sökmotorer och forskningsetik at Det perfekta tomrummet</dc:creator>
		<pubDate>Tue, 08 Aug 2006 14:13:42 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167751</guid>
		<description>[...] Forska, till exempel. Tydligen var AOLs anledning att släppa materialet just akademiskt. Som sociologen Eszter Hargittai vid Northwestern University och Stanford skriver, kan frågan komma upp om en forskare skulle få lov av ett universitets etiska kommitté att forska på datamängden. [...]</description>
		<content:encoded><![CDATA[	<p>[...] Forska, till exempel. Tydligen var AOLs anledning att sl&#228;ppa materialet just akademiskt. Som sociologen Eszter Hargittai vid Northwestern University och Stanford skriver, kan fr&#229;gan komma upp om en forskare skulle f&#229; lov av ett universitets etiska kommitt&#233; att forska p&#229; datam&#228;ngden. [...]</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: engels</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167746</link>
		<dc:creator>engels</dc:creator>
		<pubDate>Tue, 08 Aug 2006 13:39:20 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167746</guid>
		<description>&lt;i&gt;After all, someone could be running lots of searches on his/her own name, address, interests, etc, or someone else could be trying to find out information about said person.&lt;/i&gt;

Well yes but from a small number of searches you could triangulate. Only one person has connections with exactly that set of five people or institutions, etc...</description>
		<content:encoded><![CDATA[	<p><i>After all, someone could be running lots of searches on his/her own name, address, interests, etc, or someone else could be trying to find out information about said person.</i></p>

	<p>Well yes but from a small number of searches you could triangulate. Only one person has connections with exactly that set of five people or institutions, etc&#8230;</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Eszter</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167744</link>
		<dc:creator>Eszter</dc:creator>
		<pubDate>Tue, 08 Aug 2006 13:26:38 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167744</guid>
		<description>Maria, the data are not personally identifiable, because the user IDs have been removed?  Based on someone&#039;s searches, in some cases a good guess could be made about the user.  But it&#039;s true that there could be no guarantee it&#039;s one particular person.  After all, someone could be running lots of searches on his/her own name, address, interests, etc, or someone else could be trying to find out information about said person.

My recollection is that the other companies (Yahoo!, MSN, AOL) that had been approached by the US government also weren&#039;t going to hand over any data with identifying information.  (If I wasn&#039;t rushing out the door I&#039;d look up some references, sorry.)</description>
		<content:encoded><![CDATA[	<p>Maria, the data are not personally identifiable, because the user IDs have been removed?  Based on someone&#8217;s searches, in some cases a good guess could be made about the user.  But it&#8217;s true that there could be no guarantee it&#8217;s one particular person.  After all, someone could be running lots of searches on his/her own name, address, interests, etc, or someone else could be trying to find out information about said person.</p>

	<p>My recollection is that the other companies (Yahoo!, <span class="caps">MSN</span>, AOL) that had been approached by the US government also weren&#8217;t going to hand over any data with identifying information.  (If I wasn&#8217;t rushing out the door I&#8217;d look up some references, sorry.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Maria</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167740</link>
		<dc:creator>Maria</dc:creator>
		<pubDate>Tue, 08 Aug 2006 12:57:07 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167740</guid>
		<description>exactly, seth. this is what john battelle has in mind vis a vis the &#039;database of intentions&#039;, and is precisely the data that google tried to keep private earlier in the year. also agree with you on the expediency and pr efforts surrounding google&#039;s response to said request. 

brief point from the data protection point of view - this does not implicate European (and most other) DP laws as it is not personally identifiable data. It&#039;s interesting, nonetheless.</description>
		<content:encoded><![CDATA[	<p>exactly, seth. this is what john battelle has in mind vis a vis the &#8216;database of intentions&#8217;, and is precisely the data that google tried to keep private earlier in the year. also agree with you on the expediency and pr efforts surrounding google&#8217;s response to said request.</p>

	<p>brief point from the data protection point of view &#8211; this does not implicate European (and most other) DP laws as it is not personally identifiable data. It&#8217;s interesting, nonetheless.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Seth Finkelstein</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167734</link>
		<dc:creator>Seth Finkelstein</dc:creator>
		<pubDate>Tue, 08 Aug 2006 11:38:39 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167734</guid>
		<description>Yes indeed - this issue was something I wondered about, during the huge Google subpoena panic. I&#039;m bad at politics, but I thought there was a lot of fear-mongering and political expediency going on, sensationalism about the goals of the specific request. It was reported as if it was a scary Big Brother fishing expedition, but was in fact basically an extremely tame academic-type study. Though the general tracking issue is quite real.

I did the system administration for one of the early studies of user behaviour using log files, back in 1999. I don&#039;t know if such a study could be done done, from viewpoint of the attacks it would get for using the data.

I suppose the only advice I can give on that point is to say think carefully about the career implications, though I believe going ahead with such studies would help establish they can produce useful results (though this is not advice to anyone to hurt themselves, pioneers are the ones with arrows in their backs). Think: &quot;So that the data have not been released in vain ...&quot;</description>
		<content:encoded><![CDATA[	<p>Yes indeed &#8211; this issue was something I wondered about, during the huge Google subpoena panic. I&#8217;m bad at politics, but I thought there was a lot of fear-mongering and political expediency going on, sensationalism about the goals of the specific request. It was reported as if it was a scary Big Brother fishing expedition, but was in fact basically an extremely tame academic-type study. Though the general tracking issue is quite real.</p>

	<p>I did the system administration for one of the early studies of user behaviour using log files, back in 1999. I don&#8217;t know if such a study could be done done, from viewpoint of the attacks it would get for using the data.</p>

	<p>I suppose the only advice I can give on that point is to say think carefully about the career implications, though I believe going ahead with such studies would help establish they can produce useful results (though this is not advice to anyone to hurt themselves, pioneers are the ones with arrows in their backs). Think: &#8220;So that the data have not been released in vain &#8230;&#8221; </p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: joel turnipseed</title>
		<link>http://crookedtimber.org/2006/08/07/the-aol-data-mess/comment-page-1/#comment-167728</link>
		<dc:creator>joel turnipseed</dc:creator>
		<pubDate>Tue, 08 Aug 2006 06:40:09 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=4998#comment-167728</guid>
		<description>Fascinating... as a writer, I&#039;d love to know what 500,000 people were thinking over a three month period--a sort of &quot;mind of God&quot; snapshot. Still, at 500MB for the zipped file--that&#039;s a HUGE file. I don&#039;t think the vast majority of home PC users could even open the unzipped file, could they (it must be several gig)? 

Several people have noted the &quot;kill my wife&quot; search, but you wonder: could this have been a wannabe crime novelist? someone whose wife was already murdered by someone else? or an actual would-be murderer? I&#039;m reminded of the Pete Townsend case (interesting that he was acquitted, no? would a non-famous, say, elementary school teacher have been?). There are just so many things to consider w/r/t to these things even after they&#039;re out of the bag. 

Meantime, you do wonder, at some point, whether any notion of &quot;privacy&quot; should go out the door altogether? Or at least certain forms of it? Wouldn&#039;t airing out what we&#039;re all REALLY like, en masse, do some social good? I&#039;m thinking in particular of the bullshit campaign launched (and ongoing) by the Swift Boat Veterans for Truth, even in the face of obvious, and widespread, war crimes in Vietnam--and even where there were no war crimes, damned obvious horrors of war. Not only was Kerry defeated in no small part from this obvious lack of widespread insight into human nature, but a lot of the bullshit in Iraq would&#039;ve been obvious to anyone with the slightest self-awareness and a little honest human experience. That is, if you&#039;re really open to what humans are like (Goethe on the &quot;crime imagined&quot; is good here), you just shouldn&#039;t be surprised at rapes, mutilations, mass murders, beheadings, etcetera: they&#039;re part of who we are as a species &amp; as such, unavoidable--especially in times/places of war. Is it better that we know &lt;em&gt;that&lt;/em&gt; (if not, per AOL&#039;s scrambling of user names, which) people search/think certain things with whatever degree of frequency?


I know, I know... there&#039;s the Scylla of the Scarlet Letter that inheres in a &lt;em&gt;real&lt;/em&gt; global village... and the Charybdis of Big Brother which is actual the Big Evil Twins of States and Corporations. But don&#039;t we all already have a kind of social punchcard consisting of our education levels (and grades), credit reports, police records, property records, etcetera? If the job/house/school/life you can lead is already nearly inexorably determined for you in the private confines of admissions committees, HR departments, and banks... what&#039;s a dirty thought or two aired to the public?</description>
		<content:encoded><![CDATA[	<p>Fascinating&#8230; as a writer, I&#8217;d love to know what 500,000 people were thinking over a three month period&#8212;a sort of &#8220;mind of God&#8221; snapshot. Still, at 500MB for the zipped file&#8212;that&#8217;s a <span class="caps">HUGE</span> file. I don&#8217;t think the vast majority of home PC users could even open the unzipped file, could they (it must be several gig)?</p>

	<p>Several people have noted the &#8220;kill my wife&#8221; search, but you wonder: could this have been a wannabe crime novelist? someone whose wife was already murdered by someone else? or an actual would-be murderer? I&#8217;m reminded of the Pete Townsend case (interesting that he was acquitted, no? would a non-famous, say, elementary school teacher have been?). There are just so many things to consider w/r/t to these things even after they&#8217;re out of the bag.</p>

	<p>Meantime, you do wonder, at some point, whether any notion of &#8220;privacy&#8221; should go out the door altogether? Or at least certain forms of it? Wouldn&#8217;t airing out what we&#8217;re all <span class="caps">REALLY</span> like, en masse, do some social good? I&#8217;m thinking in particular of the bullshit campaign launched (and ongoing) by the Swift Boat Veterans for Truth, even in the face of obvious, and widespread, war crimes in Vietnam&#8212;and even where there were no war crimes, damned obvious horrors of war. Not only was Kerry defeated in no small part from this obvious lack of widespread insight into human nature, but a lot of the bullshit in Iraq would&#8217;ve been obvious to anyone with the slightest self-awareness and a little honest human experience. That is, if you&#8217;re really open to what humans are like (Goethe on the &#8220;crime imagined&#8221; is good here), you just shouldn&#8217;t be surprised at rapes, mutilations, mass murders, beheadings, etcetera: they&#8217;re part of who we are as a species &#038; as such, unavoidable&#8212;especially in times/places of war. Is it better that we know <em>that</em> (if not, per <span class="caps">AOL</span>&#8217;s scrambling of user names, which) people search/think certain things with whatever degree of frequency?</p>


	<p>I know, I know&#8230; there&#8217;s the Scylla of the Scarlet Letter that inheres in a <em>real</em> global village&#8230; and the Charybdis of Big Brother which is actual the Big Evil Twins of States and Corporations. But don&#8217;t we all already have a kind of social punchcard consisting of our education levels (and grades), credit reports, police records, property records, etcetera? If the job/house/school/life you can lead is already nearly inexorably determined for you in the private confines of admissions committees, HR departments, and banks&#8230; what&#8217;s a dirty thought or two aired to the public?</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced

Served from: crookedtimber.org @ 2012-02-13 06:08:40 -->
