<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Looking at Data</title>
	<atom:link href="http://crookedtimber.org/2009/10/13/looking-at-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2009/10/13/looking-at-data/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Sun, 27 May 2012 09:28:09 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Jeremy</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291780</link>
		<dc:creator>Jeremy</dc:creator>
		<pubDate>Fri, 16 Oct 2009 17:28:06 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291780</guid>
		<description>@13 - I wondered about the cusp issue.  My presumption is that, given that I don&#039;t think NORC really takes the ZODIAC variable that seriously, they just had some algorithm that always coded the same month/day of birth the same way, even if this might not be the fully astrologically correct way of doing it.</description>
		<content:encoded><![CDATA[	<p>@13 &#8211; I wondered about the cusp issue.  My presumption is that, given that I don&#8217;t think <span class="caps">NORC</span> really takes the <span class="caps">ZODIAC</span> variable that seriously, they just had some algorithm that always coded the same month/day of birth the same way, even if this might not be the fully astrologically correct way of doing it.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Alan Peakall</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291742</link>
		<dc:creator>Alan Peakall</dc:creator>
		<pubDate>Fri, 16 Oct 2009 12:00:09 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291742</guid>
		<description>Jeremy,

If, as you say, the zodiac sign is processed data rather than raw data, is it possible that many of zodiac sign mismatches are a result of the encoding being done for the reporting year, rather than the birth year of the respondent?  As the reporting years differ by an odd multiple of two years, the leap year cycle could skew half of those born on a cusp.</description>
		<content:encoded><![CDATA[	<p>Jeremy,</p>

	<p>If, as you say, the zodiac sign is processed data rather than raw data, is it possible that many of zodiac sign mismatches are a result of the encoding being done for the reporting year, rather than the birth year of the respondent?  As the reporting years differ by an odd multiple of two years, the leap year cycle could skew half of those born on a cusp.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremy</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291732</link>
		<dc:creator>Jeremy</dc:creator>
		<pubDate>Fri, 16 Oct 2009 06:36:48 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291732</guid>
		<description>@2 -  they don&#039;t actually ask respondents their zodiac sign.  They ask them their birthdate, which is then recoded to zodiac sign reflecting the interest some people have in debunking astrology.  There were more one-sign differences in zodiac for ostensibly matching respondents than I would have guessed; I suspect this is less about slight misreports of birthday by respondents than about interviewer typing errors, but that&#039;s a suspicion.

@6 - Given that the entire grant justification for doing the repeated interview was the possibility of doing panel analysis, it&#039;s not a real possibility that NORC will not release the panel identifier.  Their not having done so already is annoying, though.</description>
		<content:encoded><![CDATA[	<p>@2 &#8211;  they don&#8217;t actually ask respondents their zodiac sign.  They ask them their birthdate, which is then recoded to zodiac sign reflecting the interest some people have in debunking astrology.  There were more one-sign differences in zodiac for ostensibly matching respondents than I would have guessed; I suspect this is less about slight misreports of birthday by respondents than about interviewer typing errors, but that&#8217;s a suspicion.</p>

	<p>@6 &#8211; Given that the entire grant justification for doing the repeated interview was the possibility of doing panel analysis, it&#8217;s not a real possibility that <span class="caps">NORC</span> will not release the panel identifier.  Their not having done so already is annoying, though.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Phil</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291510</link>
		<dc:creator>Phil</dc:creator>
		<pubDate>Tue, 13 Oct 2009 21:35:44 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291510</guid>
		<description>I was subjected to a sort of Monty Python catechism the other day. &quot;All right, Sir, just one more thing, could you tell me: &lt;i&gt;Famous place...&lt;/i&gt;?&quot; &quot;Purley&quot;, I replied, and he was prepared to process my query (which as it happened was &quot;how can I give you some more money?&quot;, so the ID-verification really was overkill).

What I think must have happened is that he read out the hint by mistake for the prompt - although obviously I&#039;m not going to confide here what Purley is to me or I to Purley. (Not &lt;a href=&quot;http://www.everydayfiction.com/security-question-by-ramon-rozas-iii/&quot; rel=&quot;nofollow&quot;&gt;the obvious&lt;/a&gt;, I&#039;ll say that.)</description>
		<content:encoded><![CDATA[	<p>I was subjected to a sort of Monty Python catechism the other day. &#8220;All right, Sir, just one more thing, could you tell me: <i>Famous place&#8230;</i>?&#8221; &#8220;Purley&#8221;, I replied, and he was prepared to process my query (which as it happened was &#8220;how can I give you some more money?&#8221;, so the ID-verification really was overkill).</p>

	<p>What I think must have happened is that he read out the hint by mistake for the prompt &#8211; although obviously I&#8217;m not going to confide here what Purley is to me or I to Purley. (Not <a href="http://www.everydayfiction.com/security-question-by-ramon-rozas-iii/" rel="nofollow">the obvious</a>, I&#8217;ll say that.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Tim B</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291505</link>
		<dc:creator>Tim B</dc:creator>
		<pubDate>Tue, 13 Oct 2009 20:28:01 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291505</guid>
		<description>frances: &quot;&lt;i&gt;...I work in id management policy which is the odd thing.&lt;/i&gt;&quot;

Id management policy.  That&#039;s something the world needs a lot more of, in my opinion.</description>
		<content:encoded><![CDATA[	<p>frances: &#8220;<i>&#8230;I work in id management policy which is the odd thing.</i>&#8221;</p>

	<p>Id management policy.  That&#8217;s something the world needs a lot more of, in my opinion.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: frances</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291504</link>
		<dc:creator>frances</dc:creator>
		<pubDate>Tue, 13 Oct 2009 20:07:24 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291504</guid>
		<description>boyfried? where did that come from</description>
		<content:encoded><![CDATA[	<p>boyfried? where did that come from</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: frances</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291502</link>
		<dc:creator>frances</dc:creator>
		<pubDate>Tue, 13 Oct 2009 20:06:31 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291502</guid>
		<description>This is absolutely fascinating and proves to me [yet again] my ignorance - all I will add is I can never get through identity authentication routines by use of &quot;known facts&quot;.  First boyfried, pet, school - whatever it is - I don&#039;t think there is more then one instance in my memory bank but there must be as I always seem to fail the test whether internet or call centre.  And I work in id management policy which is the odd thing.  (Only item I saw in a suggested list of remembered facts I stood a chance with was mother&#039;s Co-op divi number - but you hve to be UK &amp; a certain age and class before that works.)

Human beings always complicate things</description>
		<content:encoded><![CDATA[	<p>This is absolutely fascinating and proves to me [yet again] my ignorance &#8211; all I will add is I can never get through identity authentication routines by use of &#8220;known facts&#8221;.  First boyfried, pet, school &#8211; whatever it is &#8211; I don&#8217;t think there is more then one instance in my memory bank but there must be as I always seem to fail the test whether internet or call centre.  And I work in id management policy which is the odd thing.  (Only item I saw in a suggested list of remembered facts I stood a chance with was mother&#8217;s Co-op divi number &#8211; but you hve to be <span class="caps">UK </span>&#038; a certain age and class before that works.)</p>

	<p>Human beings always complicate things</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Barry</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291492</link>
		<dc:creator>Barry</dc:creator>
		<pubDate>Tue, 13 Oct 2009 18:03:48 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291492</guid>
		<description>eudoxis 10.13.09 at 4:43 pm

&quot;Data that are more “real”, like medical histories, present with the same problem. Merging patient information on a grid is impossible without a unique identifier. Greater accuracy is required, of course, but still, one would think that collecting and merging all the obvious fields would work.&quot;

Depending on circumstances:

1)  Some fields might (will) have missing data for one or the other wave.
2) Enough fields in data set 1 would have to have to also be in data set 2.
3) As I pointed out above, if there are a set of overlapping fields in the two data
sets, but there are errors in recording (recalling) the values, then that set of
values will not be the same for a person between the two data sets.</description>
		<content:encoded><![CDATA[	<p>eudoxis 10.13.09 at 4:43 pm</p>

	<p>&#8220;Data that are more &#8220;real&#8221;, like medical histories, present with the same problem. Merging patient information on a grid is impossible without a unique identifier. Greater accuracy is required, of course, but still, one would think that collecting and merging all the obvious fields would work.&#8221;</p>

	<p>Depending on circumstances:</p>

	<p>1)  Some fields might (will) have missing data for one or the other wave.<br />
2) Enough fields in data set 1 would have to have to also be in data set 2.<br />
3) As I pointed out above, if there are a set of overlapping fields in the two data<br />
sets, but there are errors in recording (recalling) the values, then that set of<br />
values will not be the same for a person between the two data sets.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Ken Houghton</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291491</link>
		<dc:creator>Ken Houghton</dc:creator>
		<pubDate>Tue, 13 Oct 2009 17:57:42 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291491</guid>
		<description>I fear Ray&#039;s point--and Jeremy&#039;s desire to avoid downtime--will keep them from now cross-referencing the respondents so that datamining can be used to see who l/i/e/d/changed their 2004 voting preference  in 2008.</description>
		<content:encoded><![CDATA[	<p>I fear Ray&#8217;s point&#8212;and Jeremy&#8217;s desire to avoid downtime&#8212;will keep them from now cross-referencing the respondents so that datamining can be used to see who l/i/e/d/changed their 2004 voting preference  in 2008.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: eudoxis</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291487</link>
		<dc:creator>eudoxis</dc:creator>
		<pubDate>Tue, 13 Oct 2009 16:43:54 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291487</guid>
		<description>Data that are more &quot;real&quot;, like medical histories, present with the same problem.  Merging patient information on a grid is impossible without a unique identifier.  Greater accuracy is required, of course, but still, one would think that collecting and merging all the obvious fields would work.</description>
		<content:encoded><![CDATA[	<p>Data that are more &#8220;real&#8221;, like medical histories, present with the same problem.  Merging patient information on a grid is impossible without a unique identifier.  Greater accuracy is required, of course, but still, one would think that collecting and merging all the obvious fields would work.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Billikin</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291473</link>
		<dc:creator>Billikin</dc:creator>
		<pubDate>Tue, 13 Oct 2009 14:44:53 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291473</guid>
		<description>&quot;The real distinction between qualitative and quantitative is not widely appreciated. People think it has something to do with counting versus not counting, but this is a mistake. If the interpretive work necessary to make sense of things is immediately obvious to everyone, it’s qualitative data. If the interpretative work you need to do is immediately obvious only to experts, it’s quantitative data.&quot;

That is funny. ;)</description>
		<content:encoded><![CDATA[	<p>&#8220;The real distinction between qualitative and quantitative is not widely appreciated. People think it has something to do with counting versus not counting, but this is a mistake. If the interpretive work necessary to make sense of things is immediately obvious to everyone, it&#8217;s qualitative data. If the interpretative work you need to do is immediately obvious only to experts, it&#8217;s quantitative data.&#8221;</p>

	<p>That is funny. ;)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Barry</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291469</link>
		<dc:creator>Barry</dc:creator>
		<pubDate>Tue, 13 Oct 2009 14:33:59 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291469</guid>
		<description>Recall errors would also contaminate things.  If you&#039;re matching on 10 remembered data points per person, a recall error rate of less than excellent would really mess you up.</description>
		<content:encoded><![CDATA[	<p>Recall errors would also contaminate things.  If you&#8217;re matching on 10 remembered data points per person, a recall error rate of less than excellent would really mess you up.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Ray</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291465</link>
		<dc:creator>Ray</dc:creator>
		<pubDate>Tue, 13 Oct 2009 14:28:13 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291465</guid>
		<description>I&#039;m guessing there was a large difference between the answers to this question, &quot;who they said they voted for in 2004&quot; in 2006 and 2008. 

Some of the other questions  - marital status, employment, if they live in the same place  - may have different answers in a later survey, because those things have changed. Zodiac sign and number of years mother went to school may not be remembered well. &#039;Self-identified race&#039; is also obviously susceptible to change. 

Including all of those fields is probably making things harder, not easier.</description>
		<content:encoded><![CDATA[	<p>I&#8217;m guessing there was a large difference between the answers to this question, &#8220;who they said they voted for in 2004&#8221; in 2006 and 2008.</p>

	<p>Some of the other questions  &#8211; marital status, employment, if they live in the same place  &#8211; may have different answers in a later survey, because those things have changed. Zodiac sign and number of years mother went to school may not be remembered well. &#8216;Self-identified race&#8217; is also obviously susceptible to change.</p>

	<p>Including all of those fields is probably making things harder, not easier.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Ahistoricality</title>
		<link>http://crookedtimber.org/2009/10/13/looking-at-data/comment-page-1/#comment-291459</link>
		<dc:creator>Ahistoricality</dc:creator>
		<pubDate>Tue, 13 Oct 2009 13:36:10 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13327#comment-291459</guid>
		<description>&lt;i&gt;If the interpretive work necessary to make sense of things....&lt;/i&gt;

That&#039;s almost funny.</description>
		<content:encoded><![CDATA[	<p><i>If the interpretive work necessary to make sense of things&#8230;.</i></p>

	<p>That&#8217;s almost funny.</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>

